A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check
2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Chinese spelling check (CSC) is a challenging yet meaningful task, which not only serves as a preprocessing in many natural language processing (NLP) applications, but also facilitates reading and understanding of running texts in peoples' daily lives. However, to utilize datadriven approaches for CSC, there is one major limitation that annotated corpora are not enough in applying algorithms and building models. In this paper, we propose a novel approach of constructing CSC corpus with
doi:10.18653/v1/d18-1273
dblp:conf/emnlp/WangSLHZ18
fatcat:2oqyi2fleff5lftjig4xvpedgy