Bilingual corpus cleaning focusing on translation literality
2002
7th International Conference on Spoken Language Processing (ICSLP 2002)
unpublished
When we automatically acquire translation knowledge from a bilingual corpus, redundant rules are generated due to translation variety. To overcome this problem, we propose bilingual corpus cleaning based on translation literality. Word-level correspondence and phrase-level correspondence are applied as the criteria of literality. Using these criteria, a bilingual corpus was cleaned, and translation knowledge for a pattern-based MT system was acquired from the cleaned corpus. As a result, the
doi:10.21437/icslp.2002-505
fatcat:h26kjrwy7jhspjj7sg4jnuwbva