Application of the Tightness Continuum Measure to Chinese Information Retrieval
2010
Workshop on Multiword Expressions
Most word segmentation methods employed in Chinese Information Retrieval systems are based on a static dictionary or a model trained against a manually segmented corpus. These general segmentation approaches may not be optimal because they disregard information within semantic units. We propose a novel method for improving word-based Chinese IR, which performs segmentation according to the tightness of phrases. In order to evaluate the effectiveness of our method, we employ a new test
dblp:conf/mwe/XuGRK10
fatcat:alvvqoborfes3jzkhxsaarj7ai