Parallel Differentially Private K-Means Implementation Using COMPSs Framework

Sukgamon Sukpisit, Srdjan Skrbic, Dusan Jakovetic
2020 Zenodo  
K-means is one of the most important clustering algorithms, but it introduces a risk of privacy disclosure during the clustering process. One approach to this problem is to apply differential privacy to the K-means clustering algorithm so as to effectively prevent privacy disclosure. The increasing amounts of information generated in big data processing scenarios make clustering a challenging task. To deal with this problem, various approaches to the parallelization of clustering ... have been attempted. This paper presents an implementation of a differentially private k-means clustering algorithm that uses ε-differential privacy, based on the COMPSs framework for parallel computing. The experimental results show that the proposed implementation scales well and can be used to efficiently process large datasets using high-performance computing equipment.
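To make the idea concrete, the following is a minimal sketch of one iteration of a Laplace-mechanism k-means update. It is not the authors' implementation. It assumes every data point lies in [0, 1]^d and that each iteration's privacy budget epsilon is split evenly between cluster counts and coordinate sums. The names `partial_stats`, `dp_kmeans_step`, and `laplace` are hypothetical. The per-chunk function stands in for the kind of work that a parallel framework such as COMPSs could run as independent tasks; here the chunks are simply processed in a loop.

```python
import random


def laplace(rng, scale):
    # The difference of two i.i.d. Exp(1) draws is Laplace(0, 1); scale it.
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def nearest(point, centroids):
    # Index of the centroid with the smallest squared Euclidean distance.
    return min(
        range(len(centroids)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centroids[j])),
    )


def partial_stats(chunk, centroids):
    # Per-chunk counts and coordinate sums. Assumption: in a parallel
    # setting, each call would be one independent task.
    k, d = len(centroids), len(centroids[0])
    counts = [0] * k
    sums = [[0.0] * d for _ in range(k)]
    for point in chunk:
        j = nearest(point, centroids)
        counts[j] += 1
        for t in range(d):
            sums[j][t] += point[t]
    return counts, sums


def dp_kmeans_step(chunks, centroids, epsilon, rng):
    # One noisy centroid update. Assumes points lie in [0, 1]^d.
    k, d = len(centroids), len(centroids[0])
    counts = [0] * k
    sums = [[0.0] * d for _ in range(k)]
    # Merge the exact partial results from all chunks.
    for chunk in chunks:
        c, s = partial_stats(chunk, centroids)
        for j in range(k):
            counts[j] += c[j]
            for t in range(d):
                sums[j][t] += s[j][t]
    # Assumed budget split: epsilon/2 for counts (L1 sensitivity 1),
    # epsilon/2 for sums (L1 sensitivity d for points in [0, 1]^d).
    count_scale = 2.0 / epsilon
    sum_scale = 2.0 * d / epsilon
    new_centroids = []
    for j in range(k):
        noisy_count = counts[j] + laplace(rng, count_scale)
        if noisy_count < 1.0:
            # Too few (noisy) points: keep the previous centroid.
            new_centroids.append(list(centroids[j]))
            continue
        centroid = []
        for t in range(d):
            noisy_sum = sums[j][t] + laplace(rng, sum_scale)
            # Clamp back into the assumed data domain.
            centroid.append(min(1.0, max(0.0, noisy_sum / noisy_count)))
        new_centroids.append(centroid)
    return new_centroids
```

A call such as `dp_kmeans_step(chunks, centroids, epsilon=1.0, rng=random.Random(0))` returns the noisy centroids for one iteration. Running T iterations this way would consume a total budget of T times epsilon under basic composition, so the per-iteration value has to be chosen accordingly.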
doi:10.5281/zenodo.4314275 fatcat:yewipue4ljeojhqim7bnbimc7u