Learning Continuous Phrase Representations for Translation Modeling

Jianfeng Gao, Xiaodong He, Wen-tau Yih, Li Deng
2014 Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)  
This paper tackles the sparsity problem in estimating phrase translation probabilities by learning continuous phrase representations, whose distributed nature enables the sharing of related phrases in their representations. A pair of source and target phrases are projected into continuous-valued vector representations in a low-dimensional latent space, where their translation score is computed by the distance between the pair in this new space. The projection is performed by a neural network
more » ... se weights are learned on parallel training data. Experimental evaluation has been performed on two WMT translation tasks. Our best result improves the performance of a state-of-the-art phrase-based statistical machine translation system trained on WMT 2012 French-English data by up to 1.3 BLEU points.
doi:10.3115/v1/p14-1066 dblp:conf/acl/GaoHYD14 fatcat:rjacb4xpgva7hboww4znpomppm