Exemplar-Controllable Paraphrasing and Translation using Bitext [article]

Mingda Chen, Sam Wiseman, Kevin Gimpel
2021 arXiv   pre-print
Most prior work on exemplar-based syntactically controlled paraphrase generation relies on automatically-constructed large-scale paraphrase datasets, which are costly to create. We sidestep this prerequisite by adapting models from prior work to be able to learn solely from bilingual text (bitext). Despite only using bitext for training, and in near zero-shot conditions, our single proposed model can perform four tasks: controlled paraphrase generation in both languages and controlled machine
more » ... anslation in both language directions. To evaluate these tasks quantitatively, we create three novel evaluation datasets. Our experimental results show that our models achieve competitive results on controlled paraphrase generation and strong performance on controlled machine translation. Analysis shows that our models learn to disentangle semantics and syntax in their latent representations, but still suffer from semantic drift.
arXiv:2010.05856v2 fatcat:ea63apjjqzhrbfgrq6uorq2c5e