Incremental translation using hierarchical phrase-based translation system

Maryam Siahbani, Ramtin Mehdizadeh Seraj, Baskaran Sankaran, Anoop Sarkar
2014 IEEE Spoken Language Technology Workshop (SLT)
Hierarchical phrase-based machine translation [1] (Hiero) is a prominent approach for Statistical Machine Translation, usually comparable to or better than conventional phrase-based systems. However, Hiero typically uses the CKY decoding algorithm, which requires the entire input sentence before decoding begins because it produces the translation in a bottom-up fashion. Left-to-right (LR) decoding [2] is a promising decoding algorithm for Hiero that produces the output translation in left-to-right order.
In this paper we focus on simultaneous translation using the Hiero translation framework. In simultaneous translation, translations are generated incrementally as source language speech input is processed. We propose a novel approach for incremental translation by integrating segmentation and decoding in LR-Hiero. We compare two incremental decoding algorithms for LR-Hiero and present translation quality scores (BLEU) and the latency of generating translations for both decoders on audio lectures from the TED collection.
doi:10.1109/slt.2014.7078552 dblp:conf/slt/SiahbaniSSS14 fatcat:ppuccnbp5vddhcphju4y6lkysy