Text Clustering Algorithms: A Review

Himanshu Suyal, Amit Panwar, Ajit Singh Negi
2014 International Journal of Computer Applications  
With the growth of Internet, large amount of text data is increasing, which are created by different media like social networking sites, web, and other informatics sources, etc. This data is in unstructured format which makes it tedious to analyze it, so we need methods and algorithms which can be used with various types of text formats. Clustering is an important part of the data mining. Clustering is the process of dividing the large &similar type of text into the same class. Clustering is
more » ... ely used in many applications like medical, biology, signal processing, etc. This paper briefly covers the various kinds of text clustering algorithm, present scenario of the text clustering algorithm, analysis and comparison of various aspects which contain sensitivity, stability. Algorithm contains traditional clustering like hierarchal clustering, density based clustering and self-organized map clustering. General Terms Text clustering, supervised and unsupervised clustering
doi:10.5120/16946-7075 fatcat:op3xwjavtraehgknfkf3hidcqy