Robust Sparse Principal Component Analysis

Christophe Croux, Peter Filzmoser, Heinrich Fritz
2011 Social Science Research Network  
A method for principal component analysis is proposed that is sparse and robust at the same time. The sparsity delivers principal components that have loadings on a small number of variables, making them easier to interpret. The robustness makes the analysis resistant to outlying observations. The principal components correspond to directions that maximize a robust measure of the variance, with an additional penalty term to take sparseness into account. We propose an algorithm to compute the
more » ... rse and robust principal components. The algorithm computes the components sequentially, and thus it can handle data sets with more variables than observations. The method is applied on several real data examples, and diagnostic plots for detecting outliers and for selecting the degree of sparsity are provided. A simulation experiment studies the effect on statistical efficiency by requiring both robustness and sparsity.
doi:10.2139/ssrn.1868107 fatcat:ffepyy3p6ndblavuhrwvzl2axi