Classifying Articles in Chinese Wikipedia with Fine-Grained Named Entity Types

Jie Zhou, Bicheng Li, Yongwang Tang
2014 Journal of Computing Science and Engineering  
Named entity classification of Wikipedia articles is a fundamental research area that can be used to automatically build large-scale corpora for named entity recognition, or as an auxiliary task that supports other entity processing, such as entity linking. This paper describes a method of classifying named entities in Chinese Wikipedia with fine-grained types. We considered multi-faceted information in Chinese Wikipedia to construct four feature sets, designed different feature selection methods for each feature, and fused different features with a vector space using different strategies. Experimental results show that the explored feature sets and their combination can effectively improve the performance of named entity classification.
doi:10.5626/jcse.2014.8.3.137 fatcat:z6hdhxqxc5ee7pqn4gmaf5oroi