A new genetic algorithm in proteomics: Feature selection for SELDI-TOF data

Christelle Reynès, Robert Sabatier, Nicolas Molinari, Sylvain Lehmann
2008 Computational Statistics & Data Analysis  
Mass spectrometry from clinical specimens is used in order to identify biomarkers in a diagnosis. Thus, a reliable method for both feature selection and classification is required. A novel method is proposed to find biomarkers in SELDI-TOF in order to perform robust classification.The feature selection is based on a new genetic algorithm. Concerning the classification, a method which takes into account the great variability on intensity by using decision stumps has been developed. Moreover, as
more » ... he samples are often small, it is more appropriate to use the decision stumps simultaneously than building a complete tree. The thresholds of the decision stumps are determined in the same genetic algorithm. Finally, the method was generalized to more than two groups based on pairwise coupling. The obtained algorithm was applied on two data sets: a publicly available one containing two groups allowing a comparison with other methods from the literature and a new one containing three groups.
doi:10.1016/j.csda.2008.02.025 fatcat:vue2rhy7jvc3vjxapxmqwjtsa4