Comparison Clustering using Cosine and Fuzzy set based Similarity Measures of Text Documents
[article]
2015
arXiv
pre-print
Given the high demand for clustering, this paper focuses on understanding and implementing K-means clustering using two different similarity measures. We cluster the documents using these two measures rather than Euclidean distance. We also compare the fuzzy and cosine similarity measures on clustering accuracy. The start and end times of cluster formation are used to decide which similarity measure is optimal.
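As a rough illustration of the approach the abstract describes, the sketch below runs K-means with a pluggable similarity function in place of Euclidean distance and records the time taken to form the clusters. It is a minimal sketch under stated assumptions, not the paper's implementation: the function names (`cosine_similarity`, `kmeans_by_similarity`, `timed_clustering`), the seeding of centroids from the first k documents, and the toy term-weight vectors are all hypothetical. The fuzzy set based measure is not defined in this abstract, so it is left as any callable with the same signature as `cosine_similarity`.

```python
import math
import time
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Similarity = Callable[[Vector, Vector], float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two term-weight vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def kmeans_by_similarity(
    docs: List[Vector], k: int, similarity: Similarity, max_iter: int = 100
) -> List[int]:
    """K-means that assigns each document to its MOST SIMILAR centroid."""
    # Assumption: the first k documents seed the centroids.
    centroids = [list(d) for d in docs[:k]]
    labels = [-1] * len(docs)
    for _ in range(max_iter):
        # Assignment step: pick the centroid with the highest similarity.
        new_labels = [
            max(range(k), key=lambda c: similarity(d, centroids[c])) for d in docs
        ]
        if new_labels == labels:
            break  # assignments are stable, so the clusters have converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its member vectors.
        for c in range(k):
            members = [d for d, lab in zip(docs, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def timed_clustering(
    docs: List[Vector], k: int, similarity: Similarity
) -> Tuple[List[int], float]:
    """Run the clustering and return (labels, elapsed seconds between start and end)."""
    start = time.perf_counter()
    labels = kmeans_by_similarity(docs, k, similarity)
    elapsed = time.perf_counter() - start
    return labels, elapsed


if __name__ == "__main__":
    # Toy term-weight vectors, invented for illustration only.
    toy_docs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [2.0, 0.1, 0.0], [0.1, 3.0, 0.0]]
    labels, elapsed = timed_clustering(toy_docs, 2, cosine_similarity)
    print(labels, f"{elapsed:.6f}s")
```

To compare two measures in the way the abstract outlines, one would call `timed_clustering` once per similarity function on the same documents and compare both the resulting labels and the elapsed times.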
arXiv:1505.00168v1
fatcat:ttwtgydysrcvfhxigujhbyf7uu