A Ranking-based Approach to Word Reordering for Statistical Machine Translation

Nan Yang, Mu Li, Dongdong Zhang, Nenghai Yu
2012 Annual Meeting of the Association for Computational Linguistics  
Long-distance word reordering is a major challenge in statistical machine translation research. Previous work has shown that using source syntactic trees is an effective way to tackle this problem between two languages with substantial word order differences. In this work, we further extend this line of exploration and propose a novel but simple approach, which utilizes a ranking model based on word order precedence in the target language to reposition nodes in the syntactic parse tree of a source sentence. The ranking model is automatically derived from word-aligned parallel data with a syntactic parser for the source language, based on both lexical and syntactic features. We evaluated our approach on large-scale Japanese-English and English-Japanese machine translation tasks, and show that it can significantly outperform the baseline phrase-based SMT system.
* This work has been done while the first author was visiting
dblp:conf/acl/YangLZY12 fatcat:jv2qqn7m6vb4fbz7fnpjisrgda