A hierarchical phrase-based model for English-Persian statistical machine translation

Mahsa Mohaghegh, Abdolhossein Sarrafzadeh
2012 International Conference on Innovations in Information Technology (IIT)
In this paper we show that a hierarchical phrase-based translation system outperforms a classical (non-hierarchical) phrase-based system in the English-to-Persian translation direction, whereas for the Persian-to-English direction the classical phrase-based system is preferable. We seek to explain why this is so, and describe a series of translation experiments with our SMT system on various bilingual corpora, each run with both the Moses (non-hierarchical) and Joshua (hierarchical) toolkits.
... Keywords: statistical machine translation, natural language processing, hierarchical phrase-based models
doi:10.1109/innovations.2012.6207733 fatcat:tdketlhnd5afhmnf6iez4fbmem