Prediction of Protein Subcellular Multi-locations with a Min-Max Modular Support Vector Machine [chapter]

Yang Yang, Bao-Liang Lu
2006 Lecture Notes in Computer Science  
How to predict subcellular multi-locations of proteins with machine learning techniques is a challenging problem in computational biology community. Regarding the protein multi-location problem as a multi-label pattern classification problem, we propose a new predicting method for dealing with the protein subcellular localization problem in this paper. Two key points of the proposed method are to divide a seriously unbalanced multi-location problem into a number of more balanced two-class
more » ... blems by using the part-versus-part task decomposition approach, and learn all of the subproblems by using the min-max modular support vector machine (M 3 -SVM). To evaluate the effectiveness of the proposed method, we perform experiments on yeast protein data set by using two kinds of task decomposition strategies and three kinds of feature extraction methods. The experimental results demonstrate that our method achieves the highest prediction accuracy, which is much better than that obtained by the existing approach based on the traditional support vector machine.
doi:10.1007/11760191_98 fatcat:4j35kvia6nhkxen3rerwz5kkl4