Unsupervised Feature Selection Algorithm Based on Information Gain

Zhong Li, Yang Jing, Lijing Yao, Binbin Gan
2019 Proceedings of the 2019 International Conference on Big Data, Electronics and Communication Engineering (BDECE 2019)   unpublished
Feature selection aims to select a smaller feature subset from the rate data which maintains the characteristics of the original data and has similar or better performance in data mining. traditional information theory often divides the relevance and redundancy of the features into consideration in unsupervised feature selection. This article proposes a supervised feature selection algorithm based on information gain analysis . this algorithm is to analyze the correlation between feature and
more » ... ginal data and the redundancy between features and selected features based on the mutual information. The potential information gain of the feature is calculated for the feature sorting . At last, the feature is selected according to the gain penalty factor . The experimental results of multiple classifiers on multiple standard datasets show that the proposed algorithm achieves or better than the classification accuracy of the original data on the basis of effectively reducing the data dimension.
doi:10.2991/acsr.k.191223.015 fatcat:2r2reosm7bhmph54omxrki36si