Data Mining a Prostate Cancer Dataset Using Rough Sets

Kenneth Revett, Sergio Tenreiro de Magalhaes, Henrique M. D. Santos
2006 2006 3rd International IEEE Conference Intelligent Systems  
Prostate cancer remains one of the leading causes of cancer death worldwide, with a reported incidence rate of 650,000 cases per annum worldwide. The causal factors of prostate cancer still remain to be determined. In this paper, we investigate a medical dataset containing clinical information on 502 prostate cancer patients using the machine learning technique of rough sets. Our preliminary results yield a classification accuracy of 90%, with high sensitivity and specificity (both at
more » ... ely 91%). Our results yield a predictive positive value (PPN) of 81% and a predictive negative value (PNV) of 95%. In addition to the high classification accuracy of our system, the rough set approach also provides a rule-based inference mechanism for information extraction that is suitable for integration into a rule-based system. The generated rules relate directly to the attributes and their values and provide a direct mapping between them.
doi:10.1109/is.2006.348433 fatcat:6fa3oj7ml5arhpnzw7i5ukj5pi