Statistical Analyses of Named Entity Disambiguation Benchmarks

Nadine Steinmetz, Magnus Knuth, Harald Sack
2013 International Semantic Web Conference  
In recent years, various tools for the automatic semantic annotation of textual information have emerged. The main challenge for all approaches is to resolve the ambiguity of natural language and assign unique semantic entities according to the given context. To compare the different approaches, a ground truth, namely an annotated benchmark, is essential. However, besides the actual disambiguation approach, the achieved evaluation results also depend on the characteristics of the benchmark dataset and the expressiveness of the dictionary applied to determine entity candidates. This paper presents statistical analyses and mapping experiments on different benchmarks and dictionaries to identify characteristics and structure of the respective datasets.
dblp:conf/semweb/SteinmetzKS13 fatcat:4gad4bhlyvct3fdgyyobx2veva