A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
K-means clustering via principal component analysis
2004
Twenty-first international conference on Machine learning - ICML '04
Principal component analysis (PCA) is a widely used statistical technique for unsupervised dimension reduction. K-means clustering is a commonly used data clustering for performing unsupervised learning tasks. Here we prove that principal components are the continuous solutions to the discrete cluster membership indicators for K-means clustering. New lower bounds for K-means objective function are derived, which is the total variance minus the eigenvalues of the data covariance matrix. These
doi:10.1145/1015330.1015408
dblp:conf/icml/DingH04a
fatcat:smptjr22srgfhlhfjijejknku4