The counterintuitive mechanism of graph-based semi-supervised learning in the big data regime

Xiaoyi Mai, Romain Couillet
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ABSTRACT
In this article, a new approach is proposed to study the performance of graph-based semi-supervised learning methods, under the assumptions that the dimension of the data p and their number n grow large at the same rate and that the data arise from a Gaussian mixture model. Unlike in small-dimensional systems, the large dimensions allow for a Taylor expansion to linearize the weight (or kernel) matrix W, thereby providing in closed form the limiting performance of semi-supervised learning algorithms. This notably makes it possible to predict the classification error rate as a function of the normalization parameters and of the choice of the kernel function. Despite the Gaussian assumption for the data, the theoretical findings closely match the performance achieved with real datasets, particularly here on the popular MNIST database.
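To make the setting of the abstract concrete, the following is a minimal sketch, not the authors' implementation. It draws a two-class Gaussian mixture, builds a Gaussian kernel matrix W, and computes a generic harmonic-style label propagation solution. The function names, the 1/p scaling inside the kernel, and the degree-normalization exponent `alpha` are assumptions of this sketch; the abstract only states that normalization parameters and the kernel function affect performance. The helper `offdiag_std` illustrates why a Taylor expansion becomes plausible: as p grows, the off-diagonal entries of W cluster around a common value.

```python
import numpy as np


def gaussian_mixture(n_per_class, p, shift, rng):
    """Draw two Gaussian classes with means -mu and +mu and identity covariance."""
    mu = np.zeros(p)
    mu[0] = shift
    X = np.vstack([
        rng.standard_normal((n_per_class, p)) - mu,
        rng.standard_normal((n_per_class, p)) + mu,
    ])
    y = np.repeat([0, 1], n_per_class)
    return X, y


def kernel_matrix(X):
    """W_ij = exp(-||x_i - x_j||^2 / (2p)); the 1/p scaling is an assumption."""
    p = X.shape[1]
    sq = np.sum(X**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    np.fill_diagonal(d2, 0.0)
    return np.exp(-d2 / (2.0 * p))


def offdiag_std(W):
    """Spread of the off-diagonal kernel entries."""
    mask = ~np.eye(W.shape[0], dtype=bool)
    return W[mask].std()


def propagate(W, y, labeled, alpha=0.0, n_classes=2):
    """Minimize sum_ij W_ij ||d_i^alpha F_i - d_j^alpha F_j||^2 with labeled rows fixed.

    alpha is a hypothetical normalization exponent; alpha = 0 gives the
    classical harmonic solution on the graph Laplacian D - W.
    """
    n = W.shape[0]
    labeled = np.asarray(labeled)
    unlabeled = np.setdiff1d(np.arange(n), labeled)
    d = W.sum(axis=1)
    F_l = np.eye(n_classes)[y[labeled]]          # one-hot scores of labeled points
    G_l = (d[labeled] ** alpha)[:, None] * F_l   # change of variable G = D^alpha F
    L_uu = np.diag(d[unlabeled]) - W[np.ix_(unlabeled, unlabeled)]
    G_u = np.linalg.solve(L_uu, W[np.ix_(unlabeled, labeled)] @ G_l)
    F_u = (d[unlabeled] ** (-alpha))[:, None] * G_u
    return unlabeled, F_u
```

A possible use is to call `kernel_matrix` on samples drawn with a small and a large p and compare `offdiag_std`, then pass W with a handful of labeled indices to `propagate` and classify each unlabeled point by the largest entry of its row in `F_u`.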
doi:10.1109/icassp.2017.7952671 dblp:conf/icassp/MaiC17 fatcat:eugvari3j5apdi6d36maakjfdi