Improved K-means Algorithm with the Pretreatment of PCA Dimension Reduction

Hongtao Liu, Fang Chen, Wu Yu, Xu Ke, Dai Tian
2015 International Journal of Hybrid Information Technology  
The improvements we have made are to get the optimal K value, to obtain the initial cluster centers and to calculate the distance by the feature weight. Meanwhile, to cater to the characters of dataset, the IWK-means algorithm uses the principal component analysis (PCA) method as a pretreatment to reduce the dimension of dataset. Finally, the proposed method is experimentally validated on the datasets from the UCI Machine Learning Repository and compared with the existing clustering algorithms
more » ... y the two evaluation criteria of Rand Index and Adjusted Rand Index
doi:10.14257/ijhit.2015.8.6.19 fatcat:6kgqswgrnzhv7beqmxfb7fpkuu