Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification

Khalil El Hindi, Hussien AlSalman, Safwan Qasem, Saad Al Ahmadi
2018 Entropy  
Text classification is one domain in which the naive Bayesian (NB) learning algorithm performs remarkably well. However, making further improvement in performance using ensemble-building techniques proved to be a challenge because NB is a stable algorithm. This work shows that, while an ensemble of NB classifiers achieves little or no improvement in terms of classification accuracy, an ensemble of fine-tuned NB classifiers can achieve a remarkable improvement in accuracy. We propose a
more » ... propose a fine-tuning algorithm for text classification that is both more accurate and less stable than the NB algorithm and the fine-tuning NB (FTNB) algorithm. This improvement makes it more suitable than the FTNB algorithm for building ensembles of classifiers using bagging. Our empirical experiments, using 16-benchmark text-classification data sets, show significant improvement for most data sets.
doi:10.3390/e20110857 pmid:33266581 fatcat:oldng2o32jhyvhif6of4abagv4