A General and Multi-lingual Phrase Chunking Model Based on Masking Method [chapter]

Yu-Chieh Wu, Chia-Hui Chang, Yue-Shi Lee
2006 Lecture Notes in Computer Science  
Several phrase chunkers have been proposed over the past few years. Some state-of-the-art chunkers achieved better performance via integrating external resources, e.g., parsers and additional training data, or combining multiple learners. However, in many languages and domains, such external materials are not easily available and the combination of multiple learners will increase the cost of training and testing. In this paper, we propose a mask method to improve the chunking accuracy. The
more » ... imental results show that our chunker achieves better performance in comparison with other deep parsers and chunkers. For CoNLL-2000 data set, our system achieves 94.12 in F rate. For the base-chunking task, our system reaches 92.95 in F rate. When porting to Chinese, the performance of the base-chunking task is 92.36 in F rate. Also, our chunker is quite efficient. The complete chunking time of a 50K words document is about 50 seconds.
doi:10.1007/11671299_17 fatcat:kbg352emune3bigqj6ewl4ssd4