Arabic morphological tagging, diacritization, and lemmatization using lexeme models and feature ranking

Ryan Roth, Owen Rambow, Nizar Habash, Mona Diab, Cynthia Rudin
2008 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies Short Papers - HLT '08   unpublished
We investigate the tasks of general morphological tagging, diacritization, and lemmatization for Arabic. We show that for all tasks we consider, both modeling the lexeme explicitly, and retuning the weights of individual classifiers for the specific task, improve the performance.
doi:10.3115/1557690.1557721 fatcat:4wozkxsnxrbd5axpe3vpfogi3e