A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2010; you can also visit the original URL.
The file type is application/pdf
.
Filters
Recognizing Ontology-Applicable Multiple-Record Web Documents
[chapter]
2001
Lecture Notes in Computer Science
As a step toward solving this problem, we propose a technique for recognizing which multiple-record Web documents apply to an ontologically specified application. ...
Automatically recognizing which Web documents are "of interest" for some specified application is non-trivial. ...
Concluding Remarks We presented an approach for recognizing which multiple-record Web documents apply to an ontology. ...
doi:10.1007/3-540-45581-7_41
fatcat:4fhwyo23gfds5hgg4owugqwpcu
Automatic Location and Separation of Records: A Case Study in the Genealogical Domain
[chapter]
2004
Lecture Notes in Computer Science
Locating specific chunks (records) of information within documents on the web is an interesting and nontrivial problem. ...
Experiments we have conducted show this technique yields an average of 92% recall and 93% precision for locating and separating genealogical records in web documents. ...
We divide web pages into three categories based on how they present information: singlerecord documents (Figure 1) , simple multiple-record documents (Figure 2) , and complex multiple-record documents ...
doi:10.1007/978-3-540-30466-1_28
fatcat:thif4iuxoffa5h2cwqdw6qsqhy
Conceptual-model-based data extraction from multiple-record Web pages
1999
Data & Knowledge Engineering
For these kinds of data-rich, multiple-record documents (e.g. advertisements, movie reviews, weather reports, travel information, sports summaries, financial statements, obituaries, and many others) we ...
By parsing the ontology, we can automatically produce a database scheme and recognizers for constants and keywords, and then invoke routines to recognize and extract data from unstructured documents and ...
To make our approach general, we fix the ontology parser, Web record extractor, keyword and constant recognizer, and database record generator; we change only the ontology as we move from one application ...
doi:10.1016/s0169-023x(99)00027-0
fatcat:v6imf5uvmvfrzlejks5wkstinq
Information Extraction from the Web by Matching Visual Presentation Patterns
[chapter]
2017
Lecture Notes in Computer Science
This makes the information extraction from web documents a challenging problem. ...
The documents available in the World Wide Web contain large amounts of information presented in tables, lists or other visually regular structures. ...
Method Overview We assume processing of web documents containing multiple data records corresponding to the same concept. ...
doi:10.1007/978-3-319-68723-0_2
fatcat:kg6nnj4xs5dfdcyh7oyevk3cau
Reusing ontologies and language components for ontology generation
2010
Data & Knowledge Engineering
Semantic Web ontologies can provide useful input for ontology reuse. However, the automated reuse of such ontologies remains underexplored. ...
Realizing the Semantic Web involves creating ontologies, a tedious and costly challenge. Reuse can reduce the cost of ontology engineering. ...
Most of these lexicons and data recognizers are collected from the Web. ...
doi:10.1016/j.datak.2009.08.003
fatcat:2xbzcng675cevcg466wurrmccu
A Conceptual-Modeling Approach to Extracting Data from the Web
[chapter]
1998
Lecture Notes in Computer Science
By parsing the ontology, w e can automatically produce a database scheme and recognizers for constants and keywords, and then invoke routines to recognize and extract data from unstructured documents and ...
Experiments show that it is possible to achieve good recall and precision ratios for documents that are rich in recognizable constants and narrow in ontological breadth. ...
To make our approach general, we x the ontology parser, Web record extractor, keyword and constant recognizer, and database record generator; we c hange only the ontology as we m o ve from one application ...
doi:10.1007/978-3-540-49524-6_7
fatcat:ne6lvzbcdrfctgrdv73vixjg7a
Record-boundary discovery in Web documents
1999
Proceedings of the 1999 ACM SIGMOD international conference on Management of data - SIGMOD '99
Without rst chunking documents that contain multiple records according to record boundaries, extraction of record information will not likely succeed. ...
In this paper we describe a heuristic approach t o discovering record boundaries in Web documents. ...
Concluding Remarks We h a ve described a heuristic approach to discovering record boundaries in unstructured Web documents containing multiple records of interest separated by one or more tags. ...
doi:10.1145/304182.304223
dblp:conf/sigmod/EmbleyJN99
fatcat:ty4ygupexzc2xdidyjboyutjri
Record-boundary discovery in Web documents
1999
SIGMOD record
Without rst chunking documents that contain multiple records according to record boundaries, extraction of record information will not likely succeed. ...
In this paper we describe a heuristic approach t o discovering record boundaries in Web documents. ...
Concluding Remarks We h a ve described a heuristic approach to discovering record boundaries in unstructured Web documents containing multiple records of interest separated by one or more tags. ...
doi:10.1145/304181.304223
fatcat:z5ygnb76qvhhffykevz77t2wrq
Applying Ontologies for Semantic Information Integration on Electronic Medical Records (EMRs)
2018
International Journal of Applied Engineering Research
This work describes the development of ontology in the domain of Electronic Medical Records (EMRs). ...
In addition, the ontology is complemented by the implementation of a semantic web service that facilitates the interrogation, maintenance and consultation of the ontology. ...
This service would work automatically for multiple XML documents containing information from relational databases found in hospitals. ...
doi:10.37622/ijaer/13.16.2018.12443-12448
fatcat:beg6dhuqbjb3jp7nxwib4nyzom
Semantic web applications to e-science in silico experiments
2004
Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04
Compared to annotating general Web documents, annotations for scientific data require more sophisticated professional knowledge to recognize concepts from documents, and more complex text extraction and ...
We used COHSE (Conceptual Open Hypermedia Services Environment) to annotate and browse provenance logs from my Grid 1 project, which are conceptually linked together as a hypertext Web of provenance logs ...
In the process of automatic annotations, language terms are recognized by the control of DOM (Document Object Model) objects in documents and unsupervisedly mapped to lexicons in an ontology. ...
doi:10.1145/1010432.1010502
fatcat:kkwbup3adbemtjz5rppd7mclw4
Semantic web applications to e-science in silico experiments
2004
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters - WWW Alt. '04
Compared to annotating general Web documents, annotations for scientific data require more sophisticated professional knowledge to recognize concepts from documents, and more complex text extraction and ...
We used COHSE (Conceptual Open Hypermedia Services Environment) to annotate and browse provenance logs from my Grid 1 project, which are conceptually linked together as a hypertext Web of provenance logs ...
In the process of automatic annotations, language terms are recognized by the control of DOM (Document Object Model) objects in documents and unsupervisedly mapped to lexicons in an ontology. ...
doi:10.1145/1013367.1013437
dblp:conf/www/ZhaoGS04
fatcat:x4et7bng65gopnprzjojywpv7m
Concept Based Dynamic Ontology Creation for Job Recommendation System
2016
Procedia Computer Science
In the second stage the stored input files are used by the similarity measure and ontology creation module by generating the corresponding Web Ontology Language (.owl) file. ...
It does not adequately fit new applications requirements, because they need a more dynamic ontology and the possibility to manage a considerable quantity of concepts that human cannot achieve alone. ...
In the next stage these multiple .csv's are used to generate the .owl file. And in the generation of ontology we are giving some query so as to output the records. ...
doi:10.1016/j.procs.2016.05.282
fatcat:njg67zldufbqlcovau5tfj3ygq
Magpie
2004
Proceedings of the 9th international conference on Intelligent user interface - IUI '04
Semantic layers are annotations of a web page, with a set of applicable semantic services attached to the annotated items. ...
Magpie is an extension to the Internet Explorer that automatically creates a semantic layer for web pages using a user-selected ontology. ...
The web services terminology emphasizes multiple roles played by a web browser and an ontology-based server. ...
doi:10.1145/964478.964479
fatcat:mi3y5wipcrfqfho4r4xs3gjkni
A Novel Framework for Data Extraction from Multiple Repositories and Generation of Ontologies using Inverted Indexing Technique
2017
International Journal of Database Theory and Application
of Ontologies. ...
It is a fact that information retrieval and data extraction are difficult tasks in handling the large collection of web documents. ...
The structure of the web has taken many evolutions from web of documents to web of applications [1] . The Semantic framework provides strong architectural foundation for next generation of web. ...
doi:10.14257/ijdta.2017.10.7.07
fatcat:pv7dlqcpzvf3vhu33dopu7puu4
Ontology-based extraction and structuring of information from data-rich unstructured documents
1998
Proceedings of the seventh international conference on Information and knowledge management - CIKM '98
When applied to a list of several similar unstructured documents, the result is a populated database structured according to and ltered with respect to the application ontology. ...
In experiments we conducted on two di erent t ypes of unstructured documents taken from the Web, our approach attained recall ratios in the 80 and 90 range and precision ratios near 98. ...
We believe that for applications which are data rich and narrow i n o n tological breadth, the approach presented here shows great promise. ...
doi:10.1145/288627.288641
dblp:conf/cikm/EmbleyCSL98
fatcat:g3wdosbfovd33kluj7vir6vose
« Previous
Showing results 1 — 15 out of 20,639 results