Filters








20,639 Hits in 3.6 sec

Recognizing Ontology-Applicable Multiple-Record Web Documents [chapter]

David W. Embley, Yiu-Kai Ng, Li Xu
2001 Lecture Notes in Computer Science  
As a step toward solving this problem, we propose a technique for recognizing which multiple-record Web documents apply to an ontologically specified application.  ...  Automatically recognizing which Web documents are "of interest" for some specified application is non-trivial.  ...  Concluding Remarks We presented an approach for recognizing which multiple-record Web documents apply to an ontology.  ... 
doi:10.1007/3-540-45581-7_41 fatcat:4fhwyo23gfds5hgg4owugqwpcu

Automatic Location and Separation of Records: A Case Study in the Genealogical Domain [chapter]

Troy Walker, David W. Embley
2004 Lecture Notes in Computer Science  
Locating specific chunks (records) of information within documents on the web is an interesting and nontrivial problem.  ...  Experiments we have conducted show this technique yields an average of 92% recall and 93% precision for locating and separating genealogical records in web documents.  ...  We divide web pages into three categories based on how they present information: singlerecord documents (Figure 1) , simple multiple-record documents (Figure 2) , and complex multiple-record documents  ... 
doi:10.1007/978-3-540-30466-1_28 fatcat:thif4iuxoffa5h2cwqdw6qsqhy

Conceptual-model-based data extraction from multiple-record Web pages

D.W. Embley, D.M. Campbell, Y.S. Jiang, S.W. Liddle, D.W. Lonsdale, Y.-K. Ng, R.D. Smith
1999 Data & Knowledge Engineering  
For these kinds of data-rich, multiple-record documents (e.g. advertisements, movie reviews, weather reports, travel information, sports summaries, financial statements, obituaries, and many others) we  ...  By parsing the ontology, we can automatically produce a database scheme and recognizers for constants and keywords, and then invoke routines to recognize and extract data from unstructured documents and  ...  To make our approach general, we fix the ontology parser, Web record extractor, keyword and constant recognizer, and database record generator; we change only the ontology as we move from one application  ... 
doi:10.1016/s0169-023x(99)00027-0 fatcat:v6imf5uvmvfrzlejks5wkstinq

Information Extraction from the Web by Matching Visual Presentation Patterns [chapter]

Radek Burget
2017 Lecture Notes in Computer Science  
This makes the information extraction from web documents a challenging problem.  ...  The documents available in the World Wide Web contain large amounts of information presented in tables, lists or other visually regular structures.  ...  Method Overview We assume processing of web documents containing multiple data records corresponding to the same concept.  ... 
doi:10.1007/978-3-319-68723-0_2 fatcat:kg6nnj4xs5dfdcyh7oyevk3cau

Reusing ontologies and language components for ontology generation

Deryle Lonsdale, David W. Embley, Yihong Ding, Li Xu, Martin Hepp
2010 Data & Knowledge Engineering  
Semantic Web ontologies can provide useful input for ontology reuse. However, the automated reuse of such ontologies remains underexplored.  ...  Realizing the Semantic Web involves creating ontologies, a tedious and costly challenge. Reuse can reduce the cost of ontology engineering.  ...  Most of these lexicons and data recognizers are collected from the Web.  ... 
doi:10.1016/j.datak.2009.08.003 fatcat:2xbzcng675cevcg466wurrmccu

A Conceptual-Modeling Approach to Extracting Data from the Web [chapter]

D. W. Embley, D. M. Campbell, Y. S. Jiang, S. W. Liddle, Y. -K. Ng, D. W. Quass, R. D. Smith
1998 Lecture Notes in Computer Science  
By parsing the ontology, w e can automatically produce a database scheme and recognizers for constants and keywords, and then invoke routines to recognize and extract data from unstructured documents and  ...  Experiments show that it is possible to achieve good recall and precision ratios for documents that are rich in recognizable constants and narrow in ontological breadth.  ...  To make our approach general, we x the ontology parser, Web record extractor, keyword and constant recognizer, and database record generator; we c hange only the ontology as we m o ve from one application  ... 
doi:10.1007/978-3-540-49524-6_7 fatcat:ne6lvzbcdrfctgrdv73vixjg7a

Record-boundary discovery in Web documents

D. W. Embley, Y. Jiang, Y.-K. Ng
1999 Proceedings of the 1999 ACM SIGMOD international conference on Management of data - SIGMOD '99  
Without rst chunking documents that contain multiple records according to record boundaries, extraction of record information will not likely succeed.  ...  In this paper we describe a heuristic approach t o discovering record boundaries in Web documents.  ...  Concluding Remarks We h a ve described a heuristic approach to discovering record boundaries in unstructured Web documents containing multiple records of interest separated by one or more tags.  ... 
doi:10.1145/304182.304223 dblp:conf/sigmod/EmbleyJN99 fatcat:ty4ygupexzc2xdidyjboyutjri

Record-boundary discovery in Web documents

D. W. Embley, Y. Jiang, Y.-K. Ng
1999 SIGMOD record  
Without rst chunking documents that contain multiple records according to record boundaries, extraction of record information will not likely succeed.  ...  In this paper we describe a heuristic approach t o discovering record boundaries in Web documents.  ...  Concluding Remarks We h a ve described a heuristic approach to discovering record boundaries in unstructured Web documents containing multiple records of interest separated by one or more tags.  ... 
doi:10.1145/304181.304223 fatcat:z5ygnb76qvhhffykevz77t2wrq

Applying Ontologies for Semantic Information Integration on Electronic Medical Records (EMRs)

Suarez Barón M. J., Ospina Becerra V.E., Salinas Valencia K.E.
2018 International Journal of Applied Engineering Research  
This work describes the development of ontology in the domain of Electronic Medical Records (EMRs).  ...  In addition, the ontology is complemented by the implementation of a semantic web service that facilitates the interrogation, maintenance and consultation of the ontology.  ...  This service would work automatically for multiple XML documents containing information from relational databases found in hospitals.  ... 
doi:10.37622/ijaer/13.16.2018.12443-12448 fatcat:beg6dhuqbjb3jp7nxwib4nyzom

Semantic web applications to e-science in silico experiments

Jun Zhao, Carole Goble, Robert Stevens
2004 Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04  
Compared to annotating general Web documents, annotations for scientific data require more sophisticated professional knowledge to recognize concepts from documents, and more complex text extraction and  ...  We used COHSE (Conceptual Open Hypermedia Services Environment) to annotate and browse provenance logs from my Grid 1 project, which are conceptually linked together as a hypertext Web of provenance logs  ...  In the process of automatic annotations, language terms are recognized by the control of DOM (Document Object Model) objects in documents and unsupervisedly mapped to lexicons in an ontology.  ... 
doi:10.1145/1010432.1010502 fatcat:kkwbup3adbemtjz5rppd7mclw4

Semantic web applications to e-science in silico experiments

Jun Zhao, Carole Goble, Robert Stevens
2004 Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters - WWW Alt. '04  
Compared to annotating general Web documents, annotations for scientific data require more sophisticated professional knowledge to recognize concepts from documents, and more complex text extraction and  ...  We used COHSE (Conceptual Open Hypermedia Services Environment) to annotate and browse provenance logs from my Grid 1 project, which are conceptually linked together as a hypertext Web of provenance logs  ...  In the process of automatic annotations, language terms are recognized by the control of DOM (Document Object Model) objects in documents and unsupervisedly mapped to lexicons in an ontology.  ... 
doi:10.1145/1013367.1013437 dblp:conf/www/ZhaoGS04 fatcat:x4et7bng65gopnprzjojywpv7m

Concept Based Dynamic Ontology Creation for Job Recommendation System

Uma Pavan Kumar Kethavarapu, S. Saraswathi
2016 Procedia Computer Science  
In the second stage the stored input files are used by the similarity measure and ontology creation module by generating the corresponding Web Ontology Language (.owl) file.  ...  It does not adequately fit new applications requirements, because they need a more dynamic ontology and the possibility to manage a considerable quantity of concepts that human cannot achieve alone.  ...  In the next stage these multiple .csv's are used to generate the .owl file. And in the generation of ontology we are giving some query so as to output the records.  ... 
doi:10.1016/j.procs.2016.05.282 fatcat:njg67zldufbqlcovau5tfj3ygq

Magpie

John Domingue, Martin Dzbor
2004 Proceedings of the 9th international conference on Intelligent user interface - IUI '04  
Semantic layers are annotations of a web page, with a set of applicable semantic services attached to the annotated items.  ...  Magpie is an extension to the Internet Explorer that automatically creates a semantic layer for web pages using a user-selected ontology.  ...  The web services terminology emphasizes multiple roles played by a web browser and an ontology-based server.  ... 
doi:10.1145/964478.964479 fatcat:mi3y5wipcrfqfho4r4xs3gjkni

A Novel Framework for Data Extraction from Multiple Repositories and Generation of Ontologies using Inverted Indexing Technique

Sudeepthi Govathoti, M. Surendra Prasad Babu
2017 International Journal of Database Theory and Application  
of Ontologies.  ...  It is a fact that information retrieval and data extraction are difficult tasks in handling the large collection of web documents.  ...  The structure of the web has taken many evolutions from web of documents to web of applications [1] . The Semantic framework provides strong architectural foundation for next generation of web.  ... 
doi:10.14257/ijdta.2017.10.7.07 fatcat:pv7dlqcpzvf3vhu33dopu7puu4

Ontology-based extraction and structuring of information from data-rich unstructured documents

David W. Embley, Douglas M. Campbell, Randy D. Smith, Stephen W. Liddle
1998 Proceedings of the seventh international conference on Information and knowledge management - CIKM '98  
When applied to a list of several similar unstructured documents, the result is a populated database structured according to and ltered with respect to the application ontology.  ...  In experiments we conducted on two di erent t ypes of unstructured documents taken from the Web, our approach attained recall ratios in the 80 and 90 range and precision ratios near 98.  ...  We believe that for applications which are data rich and narrow i n o n tological breadth, the approach presented here shows great promise.  ... 
doi:10.1145/288627.288641 dblp:conf/cikm/EmbleyCSL98 fatcat:g3wdosbfovd33kluj7vir6vose
« Previous Showing results 1 — 15 out of 20,639 results