Filters








354 Hits in 2.7 sec

WikiAnalytics: Ad-hoc querying of highly heterogeneous structured data

Andrey Balmin, Emiran Curtmola
2010 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010)  
We describe, WIKIANALYTICS, a system that facilitates data extraction from the Wikipedia infobox collection.  ...  Therefore, we cluster documents based on fields and values that contain the query keywords. We build a universal navigational lattice (UNL) over all such discovered clusters.  ...  A major challenge in querying infoboxes is the diversity of their structure. Every infobox instance has an equivalent of a type -wiki template that renders the infobox wikitext into HTML.  ... 
doi:10.1109/icde.2010.5447751 dblp:conf/icde/BalminC10 fatcat:oul3a74r4fgvdeomlgttpu2x5q

DBpedia - A crystallization point for the Web of Data

Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann
2009 Journal of Web Semantics  
The DBpedia project is a community effort to extract structured information from Wikipedia and to make this information accessible on the Web.  ...  Over the last year, an increasing number of data publishers have begun to set data-level links to DBpedia resources, making DBpedia a central interlinking hub for the emerging Web of Data.  ...  Generic versus mapping-based infobox extraction The type of wiki contents that is most valuable for the DBpedia extraction are Wikipedia infoboxes.  ... 
doi:10.1016/j.websem.2009.07.002 fatcat:eaus7na2vjf3nnzuygqyngbdta

DBpedia - A Crystallization Point for the Web of Data

Christian Bizer, Jens Lehmann, Georgi Kobilarov, SSren Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann
2009 Social Science Research Network  
The DBpedia project is a community effort to extract structured information from Wikipedia and to make this information accessible on the Web.  ...  Over the last year, an increasing number of data publishers have begun to set data-level links to DBpedia resources, making DBpedia a central interlinking hub for the emerging Web of Data.  ...  Generic versus mapping-based infobox extraction The type of wiki contents that is most valuable for the DBpedia extraction are Wikipedia infoboxes.  ... 
doi:10.2139/ssrn.3199424 fatcat:3dnuye4lrja37e5tzlcz2bgspm

The knowledge organization of DBpedia: a case study

Cristina Pattuelli, Sara Rubinow
2013 Journal of Documentation  
Infobox templates are created and reused by Wikipedia contributors who also supply the documentation and the rules that determine their format and use.  ...  Like Wikipedia, DBpedia grants editing rights to anyone motivated to create manual mappings of Wikipedia infobox templates.  ... 
doi:10.1108/jd-07-2012-0084 fatcat:rrzgauiwg5dz5a4s6ga6agvi6i

Information arbitrage across multi-lingual Wikipedia

Eytan Adar, Michael Skinner, Daniel S. Weld
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
Analyzing four large language domains (English, Spanish, French, and German), we present Ziggurat, an automated system for aligning Wikipedia infoboxes, creating new infoboxes as necessary, filling in  ...  The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale.  ...  Additional thanks to Ivan Beschastnikh, Travis Kriplean, Raphael Hoffman, Fei Wu and Mausam for their feedback, advice, and labeling.  ... 
doi:10.1145/1498759.1498813 dblp:conf/wsdm/AdarSW09 fatcat:qwzvovhp3bbhtacckdosa7txma

Autonomously semantifying wikipedia

Fei Wu, Daniel S. Weld
2007 Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM '07  
We identify several types of structures which can be automatically enhanced in Wikipedia (e.g., link structure, taxonomic data, infoboxes, etc.), and we describe a prototype implementation of a self-supervised  ...  This paper argues that autonomously "Semantifying Wikipedia" is the best way to solve the problem.  ...  ACKNOWLEDGMENTS We thank Oren Etzioni, Alex Yates, Matt Broadhead, and Michele Banko for providing the code of their software and useful discussions.  ... 
doi:10.1145/1321440.1321449 dblp:conf/cikm/WuW07 fatcat:sqw6noesufhgletzdzz76bak34

Profiling linked open data with ProLOD

Christoph Bohm, Felix Naumann, Ziawasch Abedjan, Dandy Fenz, Toni Grutze, Daniel Hefenbrock, Matthias Pohl, David Sonnabend
2010 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)  
With ProLOD, we propose a suite of methods ranging from the domain level (clustering, labeling), via the schema level (matching, disambiguation), to the data level (data type detection, pattern detection  ...  Such data emerge from different sources, such as open source communities (e.g., Wikipedia) or projects dedicated to a specific topic (e.g., DrugBank [1]).  ...  Additionally, we thank Christian Bizer and Georgi Kobilarov for providing DBpedia to the community and for their valuable feedback on ProLOD.  ... 
doi:10.1109/icdew.2010.5452762 dblp:conf/icde/BohmNAFGHPS10 fatcat:b5diltwdd5chji56nuvwpapncm

WHAD: Wikipedia historical attributes data

Enrique Alfonseca, Guillermo Garrido, Jean-Yves Delort, Anselmo Peñas
2013 Language Resources and Evaluation  
We present a study of vandalism identification in Wikipedia edits that uses only features from the infoboxes, and show that we can obtain, on this dataset, an accuracy comparable to a state-of-the-art  ...  This paper describes the generation of temporally anchored infobox attribute data from the Wikipedia history of revisions.  ...  Acknowledgments The research leading to these results has received funding from the European Unions Seventh Framework Programme (FP7/2007(FP7/ -2013 under grant agreement number 257790; the Spanish Ministry  ... 
doi:10.1007/s10579-013-9232-5 fatcat:ntbxefw45bb4hc6vzrg64jmhpu

"Tell me more" using Ladders in Wikipedia

Siarhei Bykau, Jihwan Lee, Divesh Srivastava, Yannis Velegrakis
2017 Proceedings of the 20th International Workshop on the Web and Databases - WebDB'17  
We empirically evaluate our technique for constructing ladders on multiple Wikipedia datasets against baseline techniques, including one based on a learning-based technique to populate infoboxes and another  ...  We provide novel algorithms to efficiently construct ladders, making use of supervised learning techniques to account for the different kinds of edits that happen in Wikipedia articles.  ...  We refer to this type of linking between some information mentioned in the Infobox, and the same information in the text as spatial linkage.  ... 
doi:10.1145/3068839.3068847 dblp:conf/webdb/BykauLSV17 fatcat:wdkxzcn2fjavdc4p6ifozdiljm

Multilingual Schema Matching for Wikipedia Infoboxes [article]

Thanh Nguyen, Viviane Moreira, Huong Nguyen, Hoa Nguyen, Juliana Freire
2011 arXiv   pre-print
The availability of documents in multiple languages also opens up new opportunities for querying structured Wikipedia content, and in particular, to enable answers that straddle different languages.  ...  We also present a case study which demonstrates that the multilingual mappings we derive lead to substantial improvements in answer quality and coverage for structured queries over Wikipedia content.  ...  We thank Gosse Bouma, Sabine Massmann and Erhard Rahm for sharing their software with us, and the reviewers for their constructive comments.  ... 
arXiv:1110.6651v1 fatcat:6fuogzmdvjdufhcfzzdlvixnme

Multilingual schema matching for Wikipedia infoboxes

Thanh Nguyen, Viviane Moreira, Huong Nguyen, Hoa Nguyen, Juliana Freire
2011 Proceedings of the VLDB Endowment  
The availability of documents in multiple languages also opens up new opportunities for querying structured Wikipedia content, and in particular, to enable answers that straddle different languages.  ...  We also present a case study which demonstrates that the multilingual mappings we derive lead to substantial improvements in answer quality and coverage for structured queries over Wikipedia content.  ...  We thank Gosse Bouma, Sabine Massmann and Erhard Rahm for sharing their software with us, and the reviewers for their constructive comments.  ... 
doi:10.14778/2078324.2078329 fatcat:eexeqy6rnbcx3lrczjewdzvaeq

Charting DBpedia: Towards a Cartography of a Major Linked Dataset [chapter]

M. Cristina Pattuelli, Sara Rubinow
2012 Categories, Contexts and Relations in Knowledge Organization  
This analysis opens up a new area of research to which the knowledge organization community can make a significant contribution.  ...  Like Wikipedia, DBpedia grants editing rights to anyone motivated to create manual mappings of Wikipedia infobox templates.  ...  Infobox templates are created and reused by Wikipedia contributors who also supply the documentation and the rules that determine their format and use.  ... 
doi:10.5771/9783956504402-75 fatcat:c7jbpk2yyngxva4szxnki2pfkq

DocGenealogy – Visualizing the doctoral advisors and mentors genealogic tree

David Paiva Fernandes, Elizabeth Simão Carvalho
2018 Abakós  
DocGenealogy uses the Wikipedia available data on doctoral advisement and mentoring to find out and track the existing relationships between advisors or mentors and their students.  ...  The main objective of the work is to evaluate the effectiveness of the graph-oriented techniques in the interpretation of data from Wilipedia infoboxes.  ...  Although not mandatory, the use of infoboxes is quite common and about 33% of Wikipedia articles contained an infobox (Figure 2 illustrates an infobox inside a Wikipedia page).  ... 
doi:10.5752/p.2316-9451.2018v6n2p3-20 fatcat:imph72wh6fcd3cy7hbjmdy6doq

Automatically refining the wikipedia infobox ontology

Fei Wu, Daniel S. Weld
2008 Proceeding of the 17th international conference on World Wide Web - WWW '08  
We demonstrate how the resulting ontology may be used to enhance Wikipedia with improved query processing and other features.  ...  The combined efforts of human volunteers have recently extracted numerous facts from Wikipedia, storing them as machine-harvestable object-attribute-value triples in Wikipedia infoboxes.  ...  Category Tags: Many infobox classes have their own Wikipedia pages, and sometimes a special type of category, "XXX infobox templates," is used to tag those pages.  ... 
doi:10.1145/1367497.1367583 dblp:conf/www/WuW08 fatcat:mvtatwwc2jg7bhszipk5em4nga

Semantic Data Extraction from Infobox Wikipedia Template

Amira AbdEl-atey, Sherif El-etriby, Arabi S. kishk
2012 International Journal of Computer Applications  
Keywords Wikipedia ; semantic web ; DBpedia; data extraction framework; structured knowledge; wikipedia templates; media wiki software; infobox template.  ...  We test the project to get structured data as triples from some Wikipedia resources. We clarify examples of car resource and Berlin resource.  ...  Infobox template A special type of templates is infobox, aiming at generating consistently-formatted boxes for certain content in articles describing instances of a specific type.  ... 
doi:10.5120/5072-7464 fatcat:oduwsuusune6fgvdi3xxwiqrdi
« Previous Showing results 1 — 15 out of 354 results