Dual Principal Component Pursuit: Improved Analysis and Efficient Algorithms

Zhihui Zhu, Yifan Wang, Daniel P. Robinson, Daniel Q. Naiman, René Vidal, Manolis C. Tsakiris
2018 Neural Information Processing Systems  
Recent methods for learning a linear subspace from data corrupted by outliers are based on convex 1 and nuclear norm optimization and require the dimension of the subspace and the number of outliers to be sufficiently small [27] . In sharp contrast, the recently proposed Dual Principal Component Pursuit (DPCP) method [22] can provably handle subspaces of high dimension by solving a non-convex 1 optimization problem on the sphere. However, its geometric analysis is based on quantities that are
more » ... fficult to interpret and are not amenable to statistical analysis. In this paper we provide a refined geometric analysis and a new statistical analysis that show that DPCP can tolerate as many outliers as the square of the number of inliers, thus improving upon other provably correct robust PCA methods. We also propose a scalable Projected Sub-Gradient Method (DPCP-PSGM) for solving the DPCP problem and show that it achieves linear convergence even though the underlying optimization problem is non-convex and non-smooth. Experiments on road plane detection from 3D point cloud data demonstrate that DPCP-PSGM can be more efficient than the traditional RANSAC algorithm, which is one of the most popular methods for such computer vision applications.
dblp:conf/nips/ZhuWRNVT18 fatcat:ftputeghizcsjlc6uowf74nl6m