Integrating instance-level and attribute-level knowledge into document clustering

Jinlong Wang, Shunyao Wu, Gang Li, Zhe Wei
2011 Computer Science and Information Systems  
In this paper, we present a document clustering framework incorporating instance-level knowledge in the form of pairwise constraints and attribute-level knowledge in the form of keyphrases. Firstly, we initialize weights based on metric learning with pairwise constraints, then simultaneously learn two kinds of knowledge by combining the distance-based and the constraint-based approaches, finally evaluate and select clustering result based on the degree of users' satisfaction. The experimental
more » ... sults demonstrate the effectiveness and potential of the proposed method.
doi:10.2298/csis100906003w fatcat:cdknvwmarbbvpl5fch2yrj6y6q