Ensemble of feature sets and classification algorithms for sentiment classification

Rui Xia, Chengqing Zong, Shoushan Li
2011 Information Sciences  
In this paper, we make a comparative study of the effectiveness of ensemble technique for sentiment classification. The ensemble framework is applied to sentiment classification tasks, with the aim of efficiently integrating different feature sets and classification algorithms to synthesize a more accurate classification procedure. First, two types of feature sets are designed for sentiment classification, namely the part-of-speech based feature sets and the word-relation based feature sets.
more » ... ond, three well-known text classification algorithms, namely naï ve Bayes, maximum entropy and support vector machines, are employed as base-classifiers for each of the feature sets. Third, three types of ensemble methods, namely the fixed combination, weighted combination and meta-classifier combination, are evaluated for three ensemble strategies. A wide range of comparative experiments are conducted on five widely-used datasets in sentiment classification. Finally, some in-depth discussion is presented and conclusions are drawn about the effectiveness of ensemble technique for sentiment classification.
doi:10.1016/j.ins.2010.11.023 fatcat:z4mxmo4hufggdhkzwgn7fu46ue