A Chinese Corpus for Fine-grained Entity Typing [article]

Chin Lee, Hongliang Dai, Yangqiu Song, Xin Li
2020 arXiv pre-print
Fine-grained entity typing is a challenging task with wide applications. However, most existing datasets for this task are in English. In this paper, we introduce a corpus for Chinese fine-grained entity typing that contains 4,800 mentions manually labeled through crowdsourcing. Each mention is annotated with free-form entity types. To make our dataset useful in more possible scenarios, we also categorize all the fine-grained types into 10 general types. Finally, we conduct experiments with neural models whose structures are typical in fine-grained entity typing and show how well they perform on our dataset. We also show the possibility of improving Chinese fine-grained entity typing through cross-lingual transfer learning.
arXiv:2004.08825v1 fatcat:6i4dtckyenflzntuqh5c2yoney