COMPOUND DIVERSITY FUNCTIONS FOR ENSEMBLE SELECTION

ALBERT HUNG-REN KO, ROBERT SABOURIN, ALCEU DE SOUZA BRITTO
2009 International journal of pattern recognition and artificial intelligence  
An effective way to improve a classification method's performance is to create ensembles of classifiers. Two elements are believed to be important in constructing an ensemble: (a) the performance of each individual classifier; and (b) diversity among the classifiers. Nevertheless, most works based on diversity suggest that there exists only weak correlation between classifier performance and ensemble accuracy. We propose compound diversity functions which combine the diversities with the
more » ... ies with the performance of each individual classifier, and show that there is a strong correlation between the proposed functions and ensemble accuracy. Calculation of the correlations with different ensemble creation methods, different problems and different classification algorithms on 0.624 million ensembles suggests that most compound diversity functions are better than traditional diversity measures. The population-based Genetic Algorithm was used to search for the best ensembles on a handwritten numerals recognition problem and to evaluate 42.24 million ensembles. The statistical results indicate that compound diversity functions perform better than traditional diversity measures, and are helpful in selecting the best ensembles.
doi:10.1142/s021800140900734x fatcat:34dcfk2cfvai3mipndfbjuapbu