Analysis and Prediction of Unalignable Words in Parallel Text

Frances Yung, Kevin Duh, Yuji Matsumoto
2014 Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers  
Professional human translators usually do not employ the concept of word alignments, producing translations 'sense-forsense' instead of 'word-for-word'. This suggests that unalignable words may be prevalent in the parallel text used for machine translation (MT). We analyze this phenomenon in-depth for Chinese-English translation. We further propose a simple and effective method to improve automatic word alignment by pre-removing unalignable words, and show improvements on hierarchical MT
more » ... rarchical MT systems in both translation directions.
doi:10.3115/v1/e14-4037 dblp:conf/eacl/YungDM14 fatcat:aazabds3fzbc5n3fynxxheroky