35,035 Hits in 4.0 sec

Web Table Extraction, Retrieval and Augmentation: A Survey [article]

Shuo Zhang, Krisztian Balog
2020 arXiv   pre-print
, and table augmentation.  ...  In particular, we organize existing literature into six main categories of information access tasks: table extraction, table interpretation, table search, question answering, knowledge base augmentation  ...  Web Table Search Table Extraction Table Interpretation Table Augmentation Question Answering Knowledge Base Augmentation High level applications Low-level tasks [:,j] ), and table entities  ... 
arXiv:2002.00207v2 fatcat:wss5iylwdbh5ziso4fjr4n6zfe

Discovering semantic biomedical relations utilizing the Web

Saurav Sahay, Sougata Mukherjea, Eugene Agichtein, Ernest V. Garcia, Shamkant B. Navathe, Ashwin Ram
2008 ACM Transactions on Knowledge Discovery from Data  
The extracted relations can be used to construct and augment ontologies and knowledge bases.  ...  For this purpose we retrieve relevant information from Web Search engines and Pubmed database using various lexico-syntactic patterns as queries over SOAP web services.  ...  Table V shows coverage and correctness values for the extracted relations.  ... 
doi:10.1145/1342320.1342323 fatcat:xehpbfmuwva6bikugrflhc4qq4

From web tables to a knowledge graph: prospects of an end-to-end solution [article]

Alexey Shigarov, Nikita Dorodnykh, Alexander Yurin, Andrey Mikhailov, Viacheslav Paramonov
Interrelated named entities can be extracted from web-tables and mapped to a knowledge graph.  ...  This paper discusses prospects of an end-to-end solution for the knowledge graph population by entities extracted from web-tables of predefined types.  ...  Web table extraction, retrieval, and augmentation: a survey.  ... 
doi:10.6084/m9.figshare.16621528.v1 fatcat:gwhxeemhqvds7d54dykreqez3a

Augmented EHR: Enrichment of EHR with Contents from Semantic Web Sources

Alejandro Mañas-García, José Alberto Maldonado, Mar Marcos, Diego Boscá, Montserrat Robles
2021 Applied Sciences  
This work presents methods to combine data from the Semantic Web into existing EHRs, leading to an augmented EHR.  ...  The results are converted into a standardized EHR extract according to an archetype. This work sets the foundations to transform Semantic Web contents into normalized EHR extracts.  ...  (C) Querying Semantic Web datasets. The next step is to build a SPARQL query to retrieve the augmentation content.  ... 
doi:10.3390/app11093978 doaj:8172b04fc6334b999d0b4dd399244190 fatcat:2ycx3aw6vrg7xfoxh4dxp5672m

Table Understanding: Rethinking of the Problem [article]

Alexey Shigarov
This report presents our rethinking of the Table Understanding problem  ...  Web table extraction, retrieval, and augmentation: a survey.  ...  Web table extraction, retrieval, and augmentation: a survey.  ... 
doi:10.6084/m9.figshare.14836122.v1 fatcat:ommchmvvx5gfbdnmo5qebx5o4e

On-demand new word learning using world wide web

Stanislas Oger, Georges Linares, Frederic Bechet, Pascal Nocera
2008 Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing  
We first demonstrate the relevance of the Web for the OOV word retrieval. Then, different methods are proposed to retrieve the hypothesis words.  ...  Most of the Web-based methods for lexicon augmenting consist in capturing global semantic features of the targeted domain in order to collect relevant documents from the Web.  ...  In order to evaluate this assumption, we measure the rate on the web of the n-grams containing OOV words extracted from the exact transcripts.  ... 
doi:10.1109/icassp.2008.4518607 dblp:conf/icassp/OgerLBN08 fatcat:xfkk4ap355hqbkdlt7ian4qzm4

Enriching Existing Test Collections with OXPath [chapter]

Philipp Schaer, Mandy Neumann
2017 Lecture Notes in Computer Science  
We present a light-weight alternative that employs the web data extraction language OXPath to harvest data to be added to an existing test collection from web resources.  ...  This allows the re-use of this collection for other evaluation purposes like bibliometrics-enhanced retrieval.  ...  We propose a light-weight method for extending and augmenting the documents sets in test collections by incorporating the web extraction language OXPath.  ... 
doi:10.1007/978-3-319-65813-1_16 fatcat:ureylxntjreblpb2ty52ifqzt4

A hybrid classifier approach for Web retrieved documents classification

R.S. Bot, Yi-fang Brook Wu, Xin Chen, Quanzhi Li
2004 International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004.  
The paper p resents a hybrid technique for the classification of web returned hits into concept hierarchies. The technique involves a combination of manual and automatic classifiers.  ...  At first, all web returned documents are assigned to human defined categories using m anual classifiers, and then automatic classifiers are used to generate a concept hierarchy for each of these categories  ...  Introduction The Internet information overflow phenomenon makes it very difficult for web users to search and retrieve the information they need in a reasonable amount of time.  ... 
doi:10.1109/itcc.2004.1286474 dblp:conf/itcc/BotWCL04 fatcat:y554wlnkxzfwroxtmoox53zwje

Share Me - A Digital Annotation Sharing Service for Paper Documents with Multiple Clients Support

Kazuma Tanaka, Motoi Iwata, Kai Kunze, Masakazu Iwamura, Koishi Kise
2013 2013 2nd IAPR Asian Conference on Pattern Recognition  
Our service uses a real-time document image retrieval method called Locally Likely Arrangement Hashing (LLAH) for providing information associated with the document.  ...  We present the prototype implementation, and provide a discussion covering the technical details of the system.  ...  This library provides core functions of LLAH: feature extraction, feature matching, optimized hash table, and database serialization.  ... 
doi:10.1109/acpr.2013.182 dblp:conf/acpr/TanakaIKIK13 fatcat:blqspf5i4vgvpmv5brrzdkuphu

Anchor text mining for translation of Web queries

Wen-Hsiang Lu, Lee-Feng Chien, Hsi-Jian Lee
2004 ACM Transactions on Information Systems  
A series of experiments has been conducted, including performance tests on term translation extraction, cross-language information retrieval, and translation suggestions for practical Web search services  ...  through the mining of Web anchor texts and link structures.  ...  Mark Sanderson and the anonymous reviewers for their valuable comments and suggestions. Many thanks are given to Mr.  ... 
doi:10.1145/984321.984324 fatcat:75mnaq3qmza6vdduluhn3yhm5m

Semantic Retrieval Approach for Web Documents

Hany M, Khaled M., Nagdy M.
2011 International Journal of Advanced Computer Science and Applications  
In this paper, we propose the semantic information retrieval approach to extract the information from the web documents in certain domain (jaundice diseases) by collecting the domain relevant documents  ...  Using Semantic Web is a way to increase the precision of information retrieval systems.  ...  Output (for the Web page). Save and its Content in DB table. EndDo TABLE III III .  ... 
doi:10.14569/ijacsa.2011.020912 fatcat:6ppyaixuufgztnauxw3ps6sl5y

Public Opinion Channel: A System for Augmenting Social Intelligence of a Community [chapter]

Tomohiro Fukuhara, Toyoaki Nishida, Shunsuke Uemura
2001 Lecture Notes in Computer Science  
We propose POC prototype system for augmenting social intelligence of a community by eliciting and circulating diverse opinions.  ...  To augment social intelligence of a community, (1) eliciting diverse opinions from community members, and (2) circulating opinions in the community are important.  ...  Table 5 : 5 Sentences found in reterival results from a Web search engine. Sentences are extracted based on feature phrases.  ... 
doi:10.1007/3-540-45548-5_7 fatcat:otan6h355fhbzkj3fpwfg5ii74

Self-training Improves Pre-training for Natural Language Understanding [article]

Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau
2020 arXiv   pre-print
To obtain additional data for a specific task, we introduce SentAugment, a data augmentation method which computes task-specific query embeddings from labeled data to retrieve sentences from a bank of  ...  billions of unlabeled sentences crawled from the web.  ...  We introduce SentAugment, a new data augmentation method for NLP that retrieves relevant sentences from a large web data corpus.  ... 
arXiv:2010.02194v1 fatcat:i4btr6525zb7pe3dd2bfjum3ra

Context Disambiguation Based Semantic Web Search for Effective Information Retrieval

2011 Journal of Computer Science  
in the web page.  ...  Results: The context of the user query is identified and formulated. The user query is enriched to get more relevant web pages that the user needs.  ...  This selected core is augmented with the user query and passed to the web searcher to retrieve the results of the enriched queries.  ... 
doi:10.3844/jcssp.2011.548.553 fatcat:angllqchabf5lbo7zugonsyo5y

KnowMore – knowledge base augmentation with structured web markup

Ran Yu, Ujwal Gadiraju, Besnik Fetahu, Oliver Lehmberg, Dominique Ritze, Stefan Dietze, Claudia d'Amato, Agnieszka Lawrynowicz, Jens Lehmann
2018 Semantic Web Journal  
Knowledge bases are in wide-spread use for aiding tasks such as information extraction and information retrieval, where Web search is a prominent example.  ...  We perform a thorough evaluation on a subset of the Web Data Commons dataset and show significant potential for augmenting existing KBs.  ...  While the extraction of structured data from Web documents is costly and error-prone, the recent emergence of embedded and structured Web page markup has provided an unprecedented source of explicit entity-centric  ... 
doi:10.3233/sw-180304 fatcat:7qd7ozt5fjfrzmf7gckw3dqxly
« Previous Showing results 1 — 15 out of 35,035 results