Semantic Classification, Keyword Mining and Search Space Optimization for digital ecosystems

Nikunj Yadav, Yanu Gupta, Manish Kumar, Ratna Sanyal
2010 Journal of Multimedia Processing and Technologies  
The volume of documents in the digital repositories numbers in thousands and is increasing constantly, in such a scenario it becomes a very important issue to organize and retrieve these documents in a way that relates to the human mind. In this paper, we present a novel approach to classify the documents in a digital repository and find the semantically significant keywords related to those documents to make the organization and the retrieval of the documents faster and more efficient. We
more » ... ach this problem using Probabilistic Latent Semantic Analysis with incomplete training data to organize them and mark the relevant keywords. This approach makes the classification faster and instead of the unlabeled clustering gives classification with well defined topics relating to human logic.
dblp:journals/jmpt/YadavGKS10 fatcat:pvyl2cwk2vcmniylpdqctlgqbq