Effect of Feature Smoothing Methods in Text Classification Tasks
english
2004
Proceedings of the 4th International Workshop on Pattern Recognition in Information Systems
unpublished
The number of features to be considered in a text classification system is given by the size of the vocabulary, which is normally in the tens or hundreds of thousands even for small tasks. This leads to parameter estimation problems for statistically based methods, so countermeasures have to be found. One of the most widely used methods consists of reducing the size of the vocabulary according to a well-defined criterion in order to be able to reliably estimate the set of
doi:10.5220/0002682001080117
fatcat:vsj6wkmtg5cshpxg4t7gllcw3a