Hybrid Model using Stack-Based Ensemble Classifier and Dictionary Classifier to Improve Classification Accuracy of Twitter Sentiment Analysis

Sangeeta Rani
2020 International Journal of Emerging Trends in Engineering Research  
Ensemble classifiers are widely used for the enhancement of accuracy of twitter sentiment classification. In the present research, a hybrid model based on stack based ensemble classifiers and dictionary based classifier is used for tweet classification as positive and negative. To enhance accuracy of classification, sentiment score retrieved from dictionary based classifier is added to the feature vector to get enhanced feature set and the hybrid stack based ensemble model is implemented on
more » ... enhanced feature set. Three machine learning classifiers svmRadial, C5.0, NB are used to build stacked based ensemble classifier using GLM and RF as Meta learners. Three data sets viz. Kaggle -US Airline Twitter Sentiment Data Set, Sentiment 140 Twitter Data Set, and Real time manually labeled data set related to 'Clean India Mission' are used for the implementation of the proposed model. Caret library of R Studio is used for creating the stack based ensemble of classifiers. The results show that the proposed hybrid model that used sentiment score as one of the features in feature set performed better with an accuracy of 0.8742223 for Kaggle -US Airline Twitter Sentiment Data Set, 0.8881453 for data set related to 'Clean India Mission' and 0.9953593 for Sentiment 140 Twitter Data Set, as compared to machine learning classifiers and other ensemble classifiers.
doi:10.30534/ijeter/2020/02872020 fatcat:z6k4ia4bdre4pj7g6j4herm4rm