Lexical Disambiguation of Arabic Language: An Experimental Study

Laroussi Merhben, Anis Zouaghi, Mounir Zrigui
2012 POLIBITS Research Journal on Computer Science and Computer Engineering With Applications  
In this paper we test some supervised algorithms that most of the existing related works of word sense disambiguation have cited. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the Naïve Bayes algorithm, the decision lists and the exemplar based algorithm. During the experimental study, we test the influence of the
more » ... indow size on the disambiguation quality, the derivation and the technique of smoothing for the (2n+1)-grams. For these tests the exemplar based algorithm achieves the best rate of precision. Index Terms-Supervised algorithms, training data, Naïve Bayes, decision list, exemplar based algorithm, window size.
doi:10.17562/pb-46-5 fatcat:2nmk3kaw6bfkfenefivr5au4me