Mining Multiple Models

Graham J. Williams
2006 Contributions to Probability and Statistics: Applications and Challenges  
Data mining is much more than simply building statistical models from large collections of data. In particular, this paper records a core task of mining as exploring through the space of models that are built in a data mining project. The idea was first introduced through the concept of multiple inductive learning (MIL) (Williams, 1988 (Williams, , 1991 and further developed in practise as mining the data mine (Williams and Huang, 1997). Many data mining advances that have since emerged have
more » ... ther developed the idea: multiple modelling, ensemble learning, bagging, and boosting all help the data miner explore different ideas and look for different insights in modelling. In this paper we review these ideas and a number of data mining projects that highlight the significant role played by mining the data mine.
doi:10.1142/9789812772466_0022 fatcat:k57vtscysvbjdmage7l73yl57u