Aligning freebase with the YAGO ontology

Elena Demidova, Iryna Oelze, Wolfgang Nejdl
2013 Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM '13  
Linked Open Data (LOD) has emerged as the de-facto standard for publishing data on the Web. The cross-domain large scale Freebase and YAGO datasets represent central hubs and reference points for the LOD cloud. Freebase is an open-world dataset, which contains about 22 million entities and more than 350 million facts in more than 100 domains. The scale of Freebase makes it difficult for the users to get an overview of the data and efficiently retrieve the desired information. Integration of
more » ... Integration of Freebase with the YAGO ontology that contains more than 360,000 concepts enables us to provide more semantic information for Freebase and to facilitate novel applications, such as efficient query construction, over large scale data. In this paper we analyze the structure of YAGO in more depth and show how to match YAGO and Freebase categories. The new YAGO+F structure that results from our matching tightly connects both datasets and provides an important next step to systematically interconnect LOD subcollections. We make our YAGO+F structure available online in the hope that it can provide a good starting point for future applications, which can build upon a wide variety of Freebase data clearly arranged in the semantic categories of YAGO. 1 The LOD cloud diagram: 2
doi:10.1145/2505515.2505546 dblp:conf/cikm/DemidovaON13 fatcat:fmkpq5bin5aazeaxjrqr4ysmi4