Filters








19,253 Hits in 3.1 sec

Exploring the Knowledge in Semi Structured Data Sets with Rich Queries

Jürgen Umbrich, Sebastian Blohm
2008 Extended Semantic Web Conference  
We describe a system that incorporates both, semantic annotations of Wikipedia articles into the search process and allows for rich annotation search, enabling users to formulate queries based on their  ...  The outcome of this work is an application consisting of semantic annotators, an extended search engine and an interactive user interface.  ...  Acknowledgements This work has been supported by MFG Stiftung, Baden-Würrtemberg and by the X-Media project (www.x-media-project.org) sponsored by the European Commission as part of the Information Society  ... 
dblp:conf/esws/UmbrichB08 fatcat:gf6p4ba3ubg2le3niuuuah3oky

Extracting Events from Wikipedia as RDF Triples Linked to Widespread Semantic Web Datasets [chapter]

Carlo Aliprandi, Francesco Ronzano, Andrea Marchetti, Maurizio Tesconi, Salvatore Minutoli
2011 Lecture Notes in Computer Science  
We extract events from the KAF semantic annotation and then we structure each event as a set of RDF triples linked to both DBpedia and WordNet.  ...  Starting from the deep parsing of a set of English Wikipedia articles, we produce a semantic annotation compliant with the Knowledge Annotation Format (KAF).  ...  Mining of Wikipedia has also been carried out by applying Natural Language Processing: a dump of the English Wikipedia has been shallow parsed and semantically annotated [18] .  ... 
doi:10.1007/978-3-642-21796-8_10 fatcat:u7hli4ngqjhrxksxq35jxpmihq

From Wikipedia to Semantic Relationships: a Semi-automated Annotation Approach

Maria Ruiz-Casado, Enrique Alfonseca, Pablo Castells
2006 Semantic Wiki Workshop  
In this paper, an experiment is presented for the automatic annotation of several semantic relationships in the Wikipedia, a collaborative on-line encyclopedia.  ...  This methodology requires as information source any written, general-domain corpora and applies natural language processing techniques to extract the relationships from the textual corpora.  ...  We depart from our previous work in the extraction of semantic relationships to annotate the Wikipedia [9, 10] , consisting in disambiguating Wikipedia encyclopedic entries with respect to Word-Net, and  ... 
dblp:conf/semwiki/Ruiz-CasadoAC06 fatcat:zf72em6bzbgexohajg4wkmjnxy

DBpedia NIF: Open, Large-Scale and Multilingual Knowledge Extraction Corpus [article]

Milan Dojchinovski and Julio Hernandez and Markus Ackermann and Amit Kirschenbaum and Sebastian Hellmann
2018 arXiv   pre-print
In the past decade, the DBpedia community has put significant amount of effort on developing technical infrastructure and methods for efficient extraction of structured information from Wikipedia.  ...  We describe the dataset creation process and the NLP Interchange Format (NIF) used to model the content, links and the structure the information of the Wikipedia articles.  ...  We thank all contributors to the dataset and especially Martin Brümmer for the initial implementation and Markus Freudenberg for integration of the extraction as part of the DBpedia Extraction framework  ... 
arXiv:1812.10315v1 fatcat:mebmc2tutvaklens4nj7nsyuka

Learning to Tag and Tagging to Learn: A Case Study on Wikipedia

P. Mika, M. Ciaramita, H. Zaragoza, J. Atserias
2008 IEEE Intelligent Systems  
In this paper, we consider the problem of semantically annotating Wikipedia.  ...  By creating a semantic mapping among vocabularies from two sources: Wikipedia and the original annotated corpus, we are able to improve our tagger on the Wikipedia.  ...  Mihalcea and Csomai [12] use effectively information extracted from Wikipedia for improving keyword extraction and word sense disambiguation, and also identify important concepts in Wikipedia articles  ... 
doi:10.1109/mis.2008.85 fatcat:kszuudiwsvcolc4pwdh6jvanmm

Frame-Semantic Web: a Case Study for Korean

Jungyeul Park, Sejin Nam, Youngsik Kim, YoungGyun Hahm, Dosam Hwang, Key-Sun Choi
2014 International Semantic Web Conference  
We also provide the Wikipedia coverage by Korean FrameNet lexicons in the context of constructing a knowledge base from sentences in Wikipedia to show the usefulness of our work on frame semantics in the  ...  This paper presents how frame semantics becomes a frame-semantic web.  ...  and complex range of semantic information as well.  ... 
dblp:conf/semweb/ParkNKHHC14 fatcat:ww2h7quswjdencm54tvafylhly

LensingWikipedia: Parsing text for the interactive visualization of human history

Ravikiran Vadlapudi, Maryam Siahbani, Anoop Sarkar, John Dill
2012 2012 IEEE Conference on Visual Analytics Science and Technology (VAST)  
Extracting information from text is challenging. Most current practices treat text as a bag of words or word clusters, ignoring valuable linguistic information.  ...  The novelty lies in using state-of-the-art Natural Language Processing (NLP) tools to automatically annotate text which provides a basis for new and powerful interactive visualizations.  ...  ACKNOWLEDGEMENTS This work was partially funded through a Collaborative Research and Development (CRD) grant from NSERC, Canada based on a generous contribution from The Boeing Company and AeroInfo Systems  ... 
doi:10.1109/vast.2012.6400530 dblp:conf/ieeevast/VadlapudiSSD12 fatcat:zp5iihixfnarteve2u2yylkz3a

Frame Semantics Annotation Made Easy with DBpedia

Marco Fossati, Sara Tonelli, Claudio Giuliano
2013 International Semantic Web Conference  
Results prove that such strategy improves on the standard annotation workflow, both in terms of accuracy and of time consumption.  ...  In this paper, we present a novel approach to accomplish this task by leveraging information automatically extracted from DBpedia.  ...  Semantic Role Annotation. Manual annotation of semantic roles has been recently addressed via crowdsourcing in [9] and [7] .  ... 
dblp:conf/semweb/Fossati13 fatcat:lptan63tnzc2xpjqm4t3t4qgyy

Continuous Semantics to Analyze Real-Time Data

Amit Sheth, Christopher Thomas, Pankaj Mehra
2010 IEEE Internet Computing  
Acknowledgments We acknowledge Meena Nagarajan's input and partial support from US National Science Foundation Award IIS-0842129 and a Hewlett-Packard Innovation Grant.  ...  Finally, we apply the ontology to extract semantic metadata or to semantically annotate data in unseen or new corpora.  ...  Twitris uses the domain model to semantically annotate and support semantic analysis of the original tweets (as in Figure 2a ) and subsequent tweets (see Figure 2d ).  ... 
doi:10.1109/mic.2010.137 fatcat:vftujtmq2fekfds6v6avrmwngm

Semantic Annotation, Analysis and Comparison: A Multilingual and Cross-lingual Text Analytics Toolkit

Lei Zhang, Achim Rettinger
2014 Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics  
Within the context of globalization, multilinguality and cross-linguality for information access have emerged as issues of major interest.  ...  In this paper, we demonstrate such a toolkit, which supports both service-oriented and user-oriented interfaces for semantically annotating, analyzing and comparing multilingual texts across the boundaries  ...  Table 1 shows the statistics of the Wikipedia articles in English, German, Spanish and French as well as the cross-language links between the them in these languages extracted from Wikipedia snapshots  ... 
doi:10.3115/v1/e14-2004 dblp:conf/eacl/ZhangR14 fatcat:wddn4zrzzrbellqqznoo74ixma

DAEDALUS at ImageCLEF Wikipedia Retrieval 2010: Expanding with Semantic Information from Context

Sara Lana-Serrano, Julio Villena-Román, José Carlos González Cristóbal
2010 Conference and Labs of the Evaluation Forum  
For the semantic annotation, DBpedia ontology and YAGO classification schema are used.  ...  Furthermore, the use of semantic information in the process of multimedia information extraction poses two hard challenges still to solve: how to automatically extract the high level features associated  ...  Text Extraction: Ad-hoc scripts are run on the files that contain image annotations, on the Wikipedia articles and on the topics.  ... 
dblp:conf/clef/Lana-SerranoVC10 fatcat:pbkw24li3fdmxc2zhwevppemki

Multilingual Named-Entity Recognition from Parallel Corpora

Andreea Bodnari, Aurélie Névéol, Özlem Uzuner, Pierre Zweigenbaum, Peter Szolovits
2013 Conference and Labs of the Evaluation Forum  
We use the sentence alignment of the parallel corpora, the word alignment generated by the GIZA++[8] tool, and Wikipedia-based word alignment in order to transfer system predictions made by individual  ...  Each language model benefits from the external knowledge extracted from biomedical and general domain resources.  ...  The Wikipedia knowledge-based features are dependent on the language and are extracted based on the respective language version of Wikipedia.  ... 
dblp:conf/clef/BodnariNUZS13 fatcat:6gg5sqncsjervc52ldtlumuz6u

Entity Extraction: From Unstructured Text to DBpedia RDF triples

Peter Exner, Pierre Nugues
2012 International Semantic Web Conference  
This system is based on a pipeline of text processing modules that includes a semantic parser and a coreference solver.  ...  We applied our system to over 114,000 Wikipedia articles and we could extract more than 1,000,000 triples.  ...  This research was supported by Vetenskapsrådet, the Swedish research council, under grant 621-2010-4800 and has received funding from the European Union's seventh framework program (FP7/2007-2013) under  ... 
dblp:conf/semweb/ExnerN12 fatcat:q63maq4zoner5jvlzih6rd3biq

OPIEC: An Open Information Extraction Corpus [article]

Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla
2019 arXiv   pre-print
, linguistic annotations, and semantic annotations including spatial and temporal information.  ...  In this paper, we release, describe, and analyze an OIE corpus called OPIEC, which was extracted from the text of English Wikipedia.  ...  Statistics Basic statistics such as corpus sizes, frequency of various semantic annotations, and information about the length of the extracted triples of OPIEC and its subcorpora are shown in Tab. 2.  ... 
arXiv:1904.12324v1 fatcat:h5uy6ldnc5hnxgsjwdcf6u4uea

Exploiting User Queries and Web Communities in Semantic Annotation

Norberto Fernández García, José M. Blázquez del Toro, Luis Sánchez Fernández, Vicente Luque Centeno
2005 International Semantic Web Conference  
We also describe how we can take benefit of the information generated and maintained by Web Communities as Wikipedia in order to achieve our goal.  ...  In this paper we describe the SQAPS system, which aims at providing a mean of exploiting for semantic annotation the effort of users who every day look for information on the Web.  ...  Acknowledgements This work has been partially funded by the Ministerio de Educación y Ciencia de España, as part of the Infoflex Project, TIC2003-07208.  ... 
dblp:conf/semweb/GarciaTFC05 fatcat:c6z4d5a7rzgk3nvqgkhb5oiwhi
« Previous Showing results 1 — 15 out of 19,253 results