Finding Best Mining Scheme for Development of Multinomial Software Fault Prediction Model

Dipti Kumari, Kumar Rajnish
2016 International Journal of Multimedia and Ubiquitous Engineering  
This paper discusses different classification methods for improving the reliability and quality of software systems by predicting fault-prone modules before testing. The classification capability of data mining techniques and the object-oriented property knowledge captured in object-oriented (OO) metrics are used to classify a software module either as fault-prone, in one of several error categories, or as not fault-prone. Three versions of Eclipse, the Java-based open-source integrated development environment, as ... for training and testing all the classification-based data mining techniques are used. First, threshold-based feature ranking (i.e., area under the ROC curve) is used to select effective OO metrics for building the prediction model. Then, using those subsets of selected attributes, classification models are built with 41 different classifiers for multinomial classification in fault detection systems. Finally, the performance of each classifier is evaluated with respect to the PRC performance metric. Based on the performance results, the classifiers that show higher performance and accuracy than the others (Random Committee, Random Tree, Randomizable Filtered Classifier, and IBK) are selected. Our results indicate that Random Tree, Random Committee, and Randomizable Filtered Classifier have the same performance. The IBK classifier performs only slightly worse, and Kstar performs worse than the four selected classifiers.
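The feature-ranking step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a binary fault-prone / not fault-prone label rather than the paper's multinomial error categories, scores each metric by the area under the ROC curve obtained when that metric alone is used as the ranking score, and uses made-up metric names (`wmc`, `dit`) and toy values.

```python
from typing import Dict, List, Sequence, Tuple


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) pairs where the positive scores higher.
    Ties count as half a win (Mann-Whitney formulation)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rank_features(
    columns: Dict[str, Sequence[float]], labels: Sequence[int]
) -> List[Tuple[str, float]]:
    """Rank features by single-feature AUC, best first.
    max(a, 1 - a) makes the score independent of whether high or
    low metric values indicate fault-proneness (an assumption of
    this sketch, not something stated in the abstract)."""
    ranked = []
    for name, values in columns.items():
        a = auc(values, labels)
        ranked.append((name, max(a, 1.0 - a)))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


# Hypothetical toy data: 1 = fault-prone module, 0 = not fault-prone.
labels = [0, 0, 0, 1, 1, 1]
metrics = {
    "wmc": [1, 2, 3, 4, 5, 6],  # separates the classes perfectly
    "dit": [1, 2, 1, 2, 1, 2],  # only weakly related to the label
}
ranking = rank_features(metrics, labels)
```

Here `ranking` lists `wmc` ahead of `dit`; a subset of the top-ranked metrics would then be passed to the classifiers. Extending the sketch to several error categories would need a multi-class scheme such as one-vs-rest AUC, which is likewise an assumption.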
doi:10.14257/ijmue.2016.11.5.17 fatcat:n346pg7zkrakbhsjtq25vzreiu