A Study on Linking Wikipedia Categories to Wordnet Synsets using Text Similarity

Antonio Toral, Óscar Ferrández, Eneko Agirre, Rafael Muñoz
2009 Recent Advances in Natural Language Processing  
This paper studies the application of text similarity methods to disambiguate ambiguous links between WordNet nouns and Wikipedia categories. The methods range from word overlap between glosses, random projections, WordNetbased similarity, and a full-fledged textual entailment system. Both unsupervised and supervised combinations have been tried. The goldstandard with disambiguated links is publicly available. The results range from 64.7% for the first sense heuristic, 68% for an unsupervised
more » ... mbination, and up to 77.74% for a supervised combination.
dblp:conf/ranlp/ToralFAM09 fatcat:k3dg7pc52zcwnf7tna4ywkea2q