Explanation vs Performance in Data Mining: A Case Study with Predicting Runaway Projects

Tim MENZIES, Osamu MIZUNO, Yasunari TAKAGI, Tohru KIKUNO
2009 Journal of Software Engineering and Applications  
Often, the explanatory power of a learned model must be traded off against model performance. In the case of predicting runaway software projects, we show that the twin goals of high performance and good explanatory power are achievable after applying a variety of data mining techniques (discrimination, feature subset selection, rule covering algorithms). This result is a new high-water mark in predicting runaway projects. Measured in terms of precision, this new model is as good as can be expected for our data. Other methods might outperform our result (e.g. by generating a smaller, more explainable model) but no other method could outperform the precision of our learned model.
doi:10.4236/jsea.2009.24030 fatcat:qrafpsm7p5cldkwdgpp4pzwvsm