A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Comparison of Distributed K-Means and Distributed Fuzzy C-Means Algorithms for Text Clustering
2017
Communications in Science and Technology
Text clustering has been developed in distributed system due to increasing data. The popular algorithms like K-Means (KM) and Fuzzy C-Means (FCM) are combined with Map Reduce algorithm in Hadoop Environment to be distributable and parallelizable. The problem is performance comparison between Distributed KM (DKM) and Distributed FCM (DFCM) that uses Tanimoto Distance Measure (TDM) has not been studied yet. It is important because TDM's characteristics are scale invariant while allowing
doi:10.21924/cst.2.1.2017.46
fatcat:eowu56427jhuhkpbpe4jecgpn4