A Comparative study on Term Weighting Methods for Automated Telugu Text Categorization with Effective Classifiers

Vishnu Murthy G, Vishnu Vardhan B, Sarangam K, Vijay pal Reddy P
2013 International Journal of Data Mining & Knowledge Management Process  
Automatic Text categorization refers to the process of assigning a category or some categories automatically among predefined ones. Text categorization is challenging in Indian languages has rich in morphology, a large number of word forms and large feature spaces. This paper investigates the performance of different classification approaches using different term weighting approaches in order to decide the most applicable one to Telugu text classification problem. We have investigated on
more » ... nt term weighting methods for Telugu corpus in combination with Naive Bayes ( NB), Support Vector Machine (SVM) and k Nearest Neighbor (kNN) classifiers.
doi:10.5121/ijdkp.2013.3606 fatcat:6zi7iupmmzegzadym2imufrene