A Finite State and Data-Oriented Method for Grapheme to Phoneme Conversion
[article]
2000
arXiv
pre-print
A finite-state method, based on leftmost longest-match replacement, is presented for segmenting words into graphemes and for converting graphemes into phonemes. A small set of hand-crafted conversion rules for Dutch achieves a phoneme accuracy of over 93%. The accuracy of the system is further improved by using transformation-based learning. The best system, which uses a large set of rule templates and a 'lazy' variant of Brill's algorithm and is trained on only 40K words, reaches a phoneme accuracy of 99%.
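The leftmost longest-match idea behind the grapheme segmentation step can be illustrated with a minimal Python sketch. This is not the authors' finite-state implementation: the function name `segment`, the toy inventory `TOY_GRAPHEMES`, and the single-character fallback are assumptions made purely for illustration, and the inventory is far smaller than any realistic set of Dutch graphemes.

```python
def segment(word, graphemes):
    """Split `word` into graphemes, scanning left to right and taking
    the longest inventory entry that matches at each position."""
    max_len = max(len(g) for g in graphemes)
    result = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, then progressively shorter ones.
        for n in range(min(max_len, len(word) - i), 0, -1):
            candidate = word[i:i + n]
            if candidate in graphemes:
                result.append(candidate)
                i += n
                break
        else:
            # Assumed fallback: an unknown character becomes its own segment.
            result.append(word[i])
            i += 1
    return result


# Hypothetical toy inventory; multi-letter entries must win over their parts.
TOY_GRAPHEMES = {
    "sch", "ch", "oe", "ij", "ng",
    "s", "c", "h", "o", "e", "n", "l", "i", "j", "k",
}

print(segment("schoen", TOY_GRAPHEMES))  # ['sch', 'oe', 'n']
print(segment("lijk", TOY_GRAPHEMES))    # ['l', 'ij', 'k']
```

Because longer candidates are tried first at each position, "sch" is chosen over "s" followed by "ch", which is the behaviour that a leftmost longest-match replacement rule is meant to guarantee.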
arXiv:cs/0003074v1
fatcat:heyzhdvo3fcdfklm5xhgybzjp4