Improved ICHI square feature selection method for Arabic classifiers

Hadeel N. Alshaer, Mohammed A. Otair, Laith Abualigah
2020 International Journal of Informatics and Communication Technology (IJ-ICT)  
<span>Feature selection problem is one of the main important problems in the text and data mining domain. </span><span>This paper presents a comparative study of feature selection methods for Arabic text classification. Five of the feature selection methods were selected: ICHI square, CHI square, Information Gain, Mutual Information and Wrapper. It was tested with five classification algorithms: Bayes Net, Naive Bayes, Random Forest, Decision Tree and Artificial Neural Networks. In addition,
more » ... a Collection was used in Arabic consisting of 9055 documents, which were compared by four criteria: Precision, Recall, F-measure and Time to build model. The results showed that the improved ICHI feature selection got almost all the best results in comparison with other methods.</span>
doi:10.11591/ijict.v9i3.pp157-170 fatcat:v5qqbpfdlvdmpjlutxourb6q4q