Multidimensional support vector machines for visualization of gene expression data

Daisuke Komura, Hiroshi Nakamura, Shuichi Tsutsumi, Hiroyuki Aburatani, Sigeo Ihara
2004 Proceedings of the 2004 ACM symposium on Applied computing - SAC '04  
Motivation: Since DNA microarray experiments provide us with huge amount of gene expression data, they should be analyzed with statistical methods to extract the meanings of experimental results. Some dimensionality reduction methods such as Principal Component Analysis (PCA) are used to roughly visualize the distribution of high dimensional gene expression data. However, in the case of binary classification of gene expression data, PCA does not utilize class information when choosing axes.
more » ... clearly separable data in the original space may not be so in * To whom correspondence should be addressed. the reduced space used in PCA. Results: For visualization and class prediction of gene expression data, we have developed a new SVMbased method called multidimensional SVMs, that generate multiple orthogonal axes. This method projects high dimensional data into lower dimensional space to exhibit properties of the data clearly and to visualize a distribution of the data roughly. Furthermore, the multiple axes can be used for class prediction. The basic properties of conventional SVMs are retained in our method: solutions of mathematical programming are sparse, and nonlinear classification is implemented implicitly through the use of kernel functions. The application of our method to
doi:10.1145/967900.967936 dblp:conf/sac/KomuraNTAI04 fatcat:uqe4futk5zcdbm6rmt7onhcv5e