Models for Inuktitut-English word alignment

Charles Schafer, Elliott Franco Drábek
2005 Proceedings of the ACL Workshop on Building and Using Parallel Texts - ParaText '05   unpublished
This paper presents a set of techniques for bitext word alignment, optimized for a language pair with the characteristics of Inuktitut-English. The resulting systems exploit cross-lingual affinities at the sublexical level of syllables and substrings, as well as regular patterns of transliteration and the tendency towards monotonicity of alignment. Our most successful systems were based on classifier combination, and we found different combination methods performed best under the target evaluation metrics of F-measure and alignment error rate.
doi:10.3115/1654449.1654463 fatcat:w7qsek5vgvhf7jxxl4k3d75y5y