Aligning a parallel English-Chinese corpus statistically with lexical criteria

Dekai Wu
1994 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
We describe our experience with automatic alignment of sentences in parallel English-Chinese texts. Our report concerns three related topics: (1) progress on the HKUST English-Chinese Parallel Bilingual Corpus; (2) experiments addressing the applicability of Gale & Church's (1991) length-based statistical method to the task of alignment involving a non-Indo-European language; and (3) an improved statistical method that also incorporates domain-specific lexical cues.
doi:10.3115/981732.981744 dblp:conf/acl/Wu94 fatcat:msh6lodu6fhlxl3gov5txm4o4m