Using TectoMT as a Preprocessing Tool for Phrase-Based Statistical Machine Translation
[chapter]
2010
Lecture Notes in Computer Science
We present a systematic comparison of preprocessing techniques for two language pairs: English-Czech and English-Hindi. The two target languages, although both belong to the Indo-European language family, differ significantly in morphology, syntax and word order. We describe how TectoMT, a successful framework for analysis and generation of language, can be used as a preprocessor for a phrase-based MT system. We compare the two language pairs and the optimal sets of source-language
doi:10.1007/978-3-642-15760-8_28
fatcat:ee27wbvfrvh25eaqlbiwiusngy