A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2013; you can also visit the original URL.
The file type is application/pdf
.
Study of Ontology or Thesaurus Based Document Clustering and Information Retrieval
2012
Journal of Engineering and Applied Sciences
Document clustering generates clusters from the whole document collection automatically and is used in many fields, including data mining and information retrieval. Clustering text data faces a number of new challenges. Among others, the volume of text data, dimensionality, sparsity and complex semantics are the most important ones. These characteristics of text data require clustering techniques to be scalable to large and high dimensional data, and able to handle sparsity and semantics. In
doi:10.3923/jeasci.2012.342.347
fatcat:qsdkroainjc4ljxmfunfiqwonm