A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Automatic Chinese Confusion Words Extraction Using Conditional Random Fields and the Web
2013
Workshop on Chinese Language Processing
A ready set of commonly confused words plays an important role in spelling error detection and correction in texts. In this paper, we present a system named ACE (Automatic Confusion words Extraction), which takes a Chinese word as input (e.g., "不脛而走") and automatically outputs its easily confused words (e.g., "不徑 徑 徑 徑而走", "不逕 逕 逕 逕而走"). The purpose of ACE is similar to web-based set expansion -the problem of finding all instances (e.g. "Halloween", "Thanksgiving Day", "Independence Day", etc.)
dblp:conf/acl-sighan/WangCW13
fatcat:7b6gtlvesncatdsrhca6dmugpi