A Hybrid multi-stage Learning technique based on Brain Storming Optimization algorithm for Breast Cancer Recurrence Prediction
Journal of King Saud University: Computer and Information Sciences
Please cite this article as: Alwohaibi, M., Alzaqebah, M., Alotaibi, N.M., Alzahrani, A.M., Zouch, M., A Hybrid multi-stage Learning technique based on Brain Storming Optimization algorithm for Breast Cancer Recurrence Abstract: Breast cancer disease is considered to be the second leading reason for death among women. Unfortunately, even if the treatment of cancer started soon after diagnosis, the cancer cells may remain in the body, and cancer may recur. Various Machine Learning (ML) methods
... ning (ML) methods to predict breast cancer recurrence were applied recently, and the ML methods' performance needs to be examined to determine the proper method for prediction. Usually, the datasets contain many features which may sometimes mislead the prediction process; as some features may lead to confusion or inaccurate prediction. Thus, in this study, two breast cancer recurrence datasets were statistically analyzed and further refined by Brain Storming Optimization algorithm (BSO). The proposed multi-stages technique consists of three main stages; first, the statistical feature selection methods (SFM) which statistically select the discriminative features based on importance ranking and features correlations and statistical hypothesis testing, to be passed to the second stage. The 2 features are ranked based on their correlation with the class variable. The second stage namely the multi classifier (MC), which evaluates each method based on three classifiers and produces a combination of features performed by two SFM and three classifiers. In the third stage, the best combination of the selected features was recognized by the BSO algorithm to search for an optimal solution that produces the highest accuracy. In addition, the BSO algorithm has been modified to deal with the feature selection problem. The performance of the proposed technique was evaluated by stratified 10-fold cross-validation. As a result, the multi-stage learning technique showed to be effective in ranking the features and improved the classification accuracy for breast cancer recurrence.