WikiAnalytics: Ad-hoc querying of highly heterogeneous structured data
2010
2010 IEEE 26th International Conference on Data Engineering (ICDE 2010)
We describe WIKIANALYTICS, a system that facilitates data extraction from the Wikipedia infobox collection. ...
Therefore, we cluster documents based on fields and values that contain the query keywords. We build a universal navigational lattice (UNL) over all such discovered clusters. ...
A major challenge in querying infoboxes is the diversity of their structure. Every infobox instance has an equivalent of a type: a wiki template that renders the infobox wikitext into HTML. ...
doi:10.1109/icde.2010.5447751
dblp:conf/icde/BalminC10
fatcat:oul3a74r4fgvdeomlgttpu2x5q
DBpedia - A crystallization point for the Web of Data
2009
Journal of Web Semantics
The DBpedia project is a community effort to extract structured information from Wikipedia and to make this information accessible on the Web. ...
Over the last year, an increasing number of data publishers have begun to set data-level links to DBpedia resources, making DBpedia a central interlinking hub for the emerging Web of Data. ...
Generic versus mapping-based infobox extraction: The type of wiki contents that is most valuable for the DBpedia extraction are Wikipedia infoboxes. ...
doi:10.1016/j.websem.2009.07.002
fatcat:eaus7na2vjf3nnzuygqyngbdta
DBpedia - A Crystallization Point for the Web of Data
2009
Social Science Research Network
The DBpedia project is a community effort to extract structured information from Wikipedia and to make this information accessible on the Web. ...
Over the last year, an increasing number of data publishers have begun to set data-level links to DBpedia resources, making DBpedia a central interlinking hub for the emerging Web of Data. ...
Generic versus mapping-based infobox extraction: The type of wiki contents that is most valuable for the DBpedia extraction are Wikipedia infoboxes. ...
doi:10.2139/ssrn.3199424
fatcat:3dnuye4lrja37e5tzlcz2bgspm
The knowledge organization of DBpedia: a case study
2013
Journal of Documentation
Infobox templates are created and reused by Wikipedia contributors who also supply the documentation and the rules that determine their format and use. ...
Like Wikipedia, DBpedia grants editing rights to anyone motivated to create manual mappings of Wikipedia infobox templates. ...
doi:10.1108/jd-07-2012-0084
fatcat:rrzgauiwg5dz5a4s6ga6agvi6i
Information arbitrage across multi-lingual Wikipedia
2009
Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09
Analyzing four large language domains (English, Spanish, French, and German), we present Ziggurat, an automated system for aligning Wikipedia infoboxes, creating new infoboxes as necessary, filling in ...
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. ...
Additional thanks to Ivan Beschastnikh, Travis Kriplean, Raphael Hoffman, Fei Wu and Mausam for their feedback, advice, and labeling. ...
doi:10.1145/1498759.1498813
dblp:conf/wsdm/AdarSW09
fatcat:qwzvovhp3bbhtacckdosa7txma
Autonomously semantifying wikipedia
2007
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM '07
We identify several types of structures which can be automatically enhanced in Wikipedia (e.g., link structure, taxonomic data, infoboxes, etc.), and we describe a prototype implementation of a self-supervised ...
This paper argues that autonomously "Semantifying Wikipedia" is the best way to solve the problem. ...
Acknowledgments: We thank Oren Etzioni, Alex Yates, Matt Broadhead, and Michele Banko for providing the code of their software and useful discussions. ...
doi:10.1145/1321440.1321449
dblp:conf/cikm/WuW07
fatcat:sqw6noesufhgletzdzz76bak34
Profiling linked open data with ProLOD
2010
2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)
With ProLOD, we propose a suite of methods ranging from the domain level (clustering, labeling), via the schema level (matching, disambiguation), to the data level (data type detection, pattern detection ...
Such data emerge from different sources, such as open source communities (e.g., Wikipedia) or projects dedicated to a specific topic (e.g., DrugBank [1]). ...
Additionally, we thank Christian Bizer and Georgi Kobilarov for providing DBpedia to the community and for their valuable feedback on ProLOD. ...
doi:10.1109/icdew.2010.5452762
dblp:conf/icde/BohmNAFGHPS10
fatcat:b5diltwdd5chji56nuvwpapncm
WHAD: Wikipedia historical attributes data
2013
Language Resources and Evaluation
We present a study of vandalism identification in Wikipedia edits that uses only features from the infoboxes, and show that we can obtain, on this dataset, an accuracy comparable to a state-of-the-art ...
This paper describes the generation of temporally anchored infobox attribute data from the Wikipedia history of revisions. ...
Acknowledgments: The research leading to these results has received funding from the European Union's Seventh Framework Programme (FP7/2007-2013) under grant agreement number 257790; the Spanish Ministry ...
doi:10.1007/s10579-013-9232-5
fatcat:ntbxefw45bb4hc6vzrg64jmhpu
"Tell me more" using Ladders in Wikipedia
2017
Proceedings of the 20th International Workshop on the Web and Databases - WebDB'17
We empirically evaluate our technique for constructing ladders on multiple Wikipedia datasets against baseline techniques, including one based on a learning-based technique to populate infoboxes and another ...
We provide novel algorithms to efficiently construct ladders, making use of supervised learning techniques to account for the different kinds of edits that happen in Wikipedia articles. ...
We refer to this type of linking between some information mentioned in the Infobox, and the same information in the text as spatial linkage. ...
doi:10.1145/3068839.3068847
dblp:conf/webdb/BykauLSV17
fatcat:wdkxzcn2fjavdc4p6ifozdiljm
Multilingual Schema Matching for Wikipedia Infoboxes
[article]
2011
arXiv
pre-print
The availability of documents in multiple languages also opens up new opportunities for querying structured Wikipedia content, and in particular, to enable answers that straddle different languages. ...
We also present a case study which demonstrates that the multilingual mappings we derive lead to substantial improvements in answer quality and coverage for structured queries over Wikipedia content. ...
We thank Gosse Bouma, Sabine Massmann and Erhard Rahm for sharing their software with us, and the reviewers for their constructive comments. ...
arXiv:1110.6651v1
fatcat:6fuogzmdvjdufhcfzzdlvixnme
Multilingual schema matching for Wikipedia infoboxes
2011
Proceedings of the VLDB Endowment
The availability of documents in multiple languages also opens up new opportunities for querying structured Wikipedia content, and in particular, to enable answers that straddle different languages. ...
We also present a case study which demonstrates that the multilingual mappings we derive lead to substantial improvements in answer quality and coverage for structured queries over Wikipedia content. ...
We thank Gosse Bouma, Sabine Massmann and Erhard Rahm for sharing their software with us, and the reviewers for their constructive comments. ...
doi:10.14778/2078324.2078329
fatcat:eexeqy6rnbcx3lrczjewdzvaeq
Charting DBpedia: Towards a Cartography of a Major Linked Dataset
[chapter]
2012
Categories, Contexts and Relations in Knowledge Organization
This analysis opens up a new area of research to which the knowledge organization community can make a significant contribution. ...
Like Wikipedia, DBpedia grants editing rights to anyone motivated to create manual mappings of Wikipedia infobox templates. ...
Infobox templates are created and reused by Wikipedia contributors who also supply the documentation and the rules that determine their format and use. ...
doi:10.5771/9783956504402-75
fatcat:c7jbpk2yyngxva4szxnki2pfkq
DocGenealogy – Visualizing the doctoral advisors and mentors genealogic tree
2018
Abakós
DocGenealogy uses the Wikipedia available data on doctoral advisement and mentoring to find out and track the existing relationships between advisors or mentors and their students. ...
The main objective of the work is to evaluate the effectiveness of graph-oriented techniques in the interpretation of data from Wikipedia infoboxes. ...
Although not mandatory, the use of infoboxes is quite common and about 33% of Wikipedia articles contained an infobox (Figure 2 illustrates an infobox inside a Wikipedia page). ...
doi:10.5752/p.2316-9451.2018v6n2p3-20
fatcat:imph72wh6fcd3cy7hbjmdy6doq
Automatically refining the wikipedia infobox ontology
2008
Proceeding of the 17th international conference on World Wide Web - WWW '08
We demonstrate how the resulting ontology may be used to enhance Wikipedia with improved query processing and other features. ...
The combined efforts of human volunteers have recently extracted numerous facts from Wikipedia, storing them as machine-harvestable object-attribute-value triples in Wikipedia infoboxes. ...
Category Tags: Many infobox classes have their own Wikipedia pages, and sometimes a special type of category, "XXX infobox templates," is used to tag those pages. ...
doi:10.1145/1367497.1367583
dblp:conf/www/WuW08
fatcat:mvtatwwc2jg7bhszipk5em4nga
Semantic Data Extraction from Infobox Wikipedia Template
2012
International Journal of Computer Applications
Keywords: Wikipedia; semantic web; DBpedia; data extraction framework; structured knowledge; Wikipedia templates; MediaWiki software; infobox template. ...
We test the project to get structured data as triples from some Wikipedia resources. We illustrate with examples of a car resource and a Berlin resource. ...
Infobox template: A special type of template is the infobox, which aims at generating consistently formatted boxes for certain content in articles describing instances of a specific type. ...
doi:10.5120/5072-7464
fatcat:oduwsuusune6fgvdi3xxwiqrdi