Utilization of Data Mining Techniques for Analysis of Breast Cancer Dataset Using R

Keerti Yeulkar
2017 International Journal for Research in Applied Science and Engineering Technology  
for the diagnosis of cancer medical professionals need an accurate & reliable prediction techniques. There are various techniques that is being used for the diagnosis purpose. Classification is a data mining function that assigns items in a collection to target groups or classes. Two algorithms c4.5 and naive bayes has been applied to the breast cancer dataset to analyse the accuracy of algorithm. Pre-processing techniques have been applied to prepare the formatted dataset from the raw dataset
more » ... nd identify the relevant attribute for classification. Test samples has been randomly selected from the dataset. The results are presented and discussed. Keywords: c4.5 algorithm, naive bayes algorithm , chi.squared selection process, seer dataset, breast cancer diagnosis. I.
doi:10.22214/ijraset.2017.3074 fatcat:tatdrlfmefezxpvmjdvapwvfrq