A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Information-theoretic feature selection algorithms for text classification
Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.
A major characteristic of text document classification problem is extremely high dimensionality of text data. In this paper we present four new algorithms for feature/word selection for the purpose of text classification. We use sequential forward selection methods based on improved mutual information criterion functions. The performance of the proposed evaluation functions compared to the information gain which evaluate features individually is discussed. We present experimental results using
doi:10.1109/ijcnn.2005.1556452
fatcat:qtynbdewbjebnj66mtvbuuivxy