A Pragmatic Chinese Word Segmentation Approach Based on Mixing Models
2006
International Journal of Computational Linguistics and Chinese Language Processing
This paper presents a pragmatic approach to Chinese word segmentation based on mixing language models. Chinese word segmentation comprises several hard sub-tasks, each of which poses different difficulties. The authors apply a suitable language model to each sub-task so as to exploit the strengths of each model. First, a class-based trigram model is adopted for basic word segmentation, with Absolute Discount Smoothing applied to overcome data sparseness. The
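To illustrate the smoothing idea named in the abstract, the following is a minimal sketch of interpolated absolute discounting. It is not the authors' implementation: it uses a plain word bigram rather than the paper's class-based trigram, and the function name `train_bigram_absolute_discount`, the default discount of 0.75, and the toy corpus are assumptions made for illustration only.

```python
from collections import Counter, defaultdict


def train_bigram_absolute_discount(sentences, discount=0.75):
    """Return prob(cur, prev) for a bigram model with absolute discounting.

    Hypothetical helper: a fixed discount is subtracted from every seen
    bigram count, and the freed probability mass is redistributed
    according to the unigram distribution.
    """
    unigrams = Counter()
    bigrams = defaultdict(Counter)
    for tokens in sentences:
        unigrams.update(tokens)
        for prev, cur in zip(tokens, tokens[1:]):
            bigrams[prev][cur] += 1
    total = sum(unigrams.values())

    def prob(cur, prev):
        p_uni = unigrams[cur] / total
        followers = bigrams.get(prev)
        if not followers:
            # History never seen as a bigram context: use the unigram estimate.
            return p_uni
        hist_count = sum(followers.values())
        # Discounted relative frequency of the seen bigram (0 if unseen).
        discounted = max(followers[cur] - discount, 0.0) / hist_count
        # Mass removed from the seen bigrams, given to the lower-order model.
        backoff_mass = discount * len(followers) / hist_count
        return discounted + backoff_mass * p_uni

    return prob


# Toy usage on two pre-segmented sentences.
corpus = [["我", "爱", "北京"], ["我", "爱", "上海"]]
prob = train_bigram_absolute_discount(corpus)
print(prob("北京", "爱"))  # seen bigram
print(prob("我", "爱"))    # unseen bigram, still gets non-zero probability
```

Because the discounted mass is handed back through the unigram distribution, the probabilities for a given history still sum to one over the training vocabulary, while unseen word pairs no longer receive zero probability.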
dblp:journals/ijclclp/JiangGW06
fatcat:ndmgsgdxfjg7ficoz3y2gx2aua