Morphological Analysis of the Dravidian Language Family

Arun Kumar, Ryan Cotterell, Lluís Padró, Antoni Oliver
2017 Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers  
The Dravidian family is one of the most widely spoken set of languages in the world, yet there are very few annotated resources available to NLP researchers. To remedy this, we create DravMorph, a corpus annotated for morphological segmentation and part-of-speech. Also, we exploit novel features and higher-order models to achieve promising results on these corpora on both tasks, beating techniques proposed in the literature by as much as 4 points in segmentation F 1 .
doi:10.18653/v1/e17-2035 dblp:conf/eacl/KumarPCO17 fatcat:jyogzw4h7zcy3h2glzlxg3qzum