A Japanese-Chinese Cross-Language Entity Linking Method with Entity Disambiguation Based on Document Similarity

Xiang Song, Jialiang Zhou, Fuminori Kimura, Akira Maeda
2016 International Journal of Knowledge Engineering  
In this paper, we propose a method to automatically discover links between valuable keyphrases in a Japanese document and corresponding Chinese encyclopedia pages. The proposed method has three stages. First, we translate Japanese keyphrases into Chinese using a combination of three translation methods. Second, we extract all Chinese encyclopedia articles of the translated keyphrases. Third, we translate the original Japanese document into Chinese and make a vector of noun frequencies. We
more » ... ate the cosine similarities of original articles and all candidate Chinese encyclopedia ones. To find the appropriateness of term description pages for disambiguation, we make a rank with cosine similarity by comparing a Japanese document with Chinese encyclopedia articles. Finally, we add a link from a Japanese keyphrase to top-ranking Chinese encyclopedia article. In this paper, we use Wikipedia and Baidu Baike (an online encyclopedia published by Baidu, a Chinese search engine) articles to conduct our experiment. Although we achieved an accuracy rate of 81% by using Wikipedia, we achieved an accuracy rate of 97% by using Baidu Baike.
doi:10.18178/ijke.2016.2.3.065 fatcat:timle7kmf5akxlghxudugbxhzq