A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data
2018
Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP
In this paper, we investigate the effectiveness of training a multimodal neural machine translation (MNMT) system with image features for a lowresource language pair, Hindi and English, using synthetic data. A threeway parallel corpus which contains bilingual texts and corresponding images is required to train a MNMT system with image features. However, such a corpus is not available for low resource language pairs. To address this, we developed both a synthetic training dataset and a manually
doi:10.18653/v1/w18-3405
dblp:conf/acl-deeplo/ChowdhuryHL18
fatcat:hmav4zatmzgazjx3wzq2nox74i