Resampling or Reweighting: A Comparison of Boosting Implementations

Chris Seiffert, Taghi M. Khoshgoftaar, Jason Van Hulse, Amri Napolitano
2008 2008 20th IEEE International Conference on Tools with Artificial Intelligence  
Boosting has been shown to improve the performance of classifiers in many situations, including when data is imbalanced. There are, however, two possible implementations of boosting, and it is unclear which should be used. Boosting by reweighting is typically used, but can only be applied to base learners which are designed to handle example weights. On the other hand, boosting by resampling can be applied to any base learner. In this work, we empirically evaluate the differences between these
more » ... nces between these two boosting implementations using imbalanced training data. Using 10 boosting algorithms, 4 learners and 15 datasets, we find that boosting by resampling performs as well as, or significantly better than, boosting by reweighting (which is often the default boosting implementation). We therefore conclude that in general, boosting by resampling is preferred over boosting by weighting.
doi:10.1109/ictai.2008.59 dblp:conf/ictai/SeiffertKHN08 fatcat:t3bf235vorbezabfvowyewbtx4