A Hybrid Chinese Language Model based on a Combination of Ontology with Statistical Method

Dequan Zheng, Tiejun Zhao, Sheng Li, Hao Yu
2005 International Joint Conference on Natural Language Processing  
In this paper, we present a hybrid Chinese language model based on a combination of ontology with statistical method. In this study, we determined the structure of such a Chinese language model. This structure is firstly comprised of an ontology description framework for Chinese words and a representation of Chinese lingual ontology knowledge. Subsequently, a Chinese lingual ontology knowledge bank is automatically acquired by determining, for each word, its cooccurrence with semantic,
more » ... s, and syntactic information from the training corpus and the usage of Chinese words will be gotten from lingual ontology knowledge bank for a actual document. To evaluate the performance of this language model, we completed two groups of experiments on texts reordering for Chinese information retrieval and texts similarity computing. Compared with previous works, the proposed method improved the precision of nature language processing.
dblp:conf/ijcnlp/ZhengZLY05 fatcat:fqnrrdqyejbptlrfwzdwkybyke