Survey Paper on Feature Extraction Methods in Text Categorization

Dixa Saxena, S. K., K. N.
2017 International Journal of Computer Applications  
As the world is moving towards globalization, digitization of text has been escalating a lot and the need to organize, categorize and classify text has become obligatory. Disorganization or little categorization and sorting of text may result in dawdling response time of information retrieval. There has been the 'curse of dimensionality' (as termed by Bellman)[1] problem, namely the inherent sparsity of high dimensional spaces. Thus, the search for a possible presence of some unspecified
more » ... re in such a high dimensional space can be difficult. This is the task of feature reduction methods. They obtain the most relevant information from the original data and represent the information in a lower dimensionality space. In this paper, all the applied methods on feature extraction on text categorization from the traditional bag-of-words model approach to the unconventional neural networks are discussed. General Terms Text mining, feature extraction, neural networks, deep learning Keywords
doi:10.5120/ijca2017914145 fatcat:u27ruhdwe5entafxdwadlhguzm