Personalized Document Clustering: A Collaborative-Filtering-Based Approach

Chih-Ping Wei, Chin-Sheng Yang, Han-Wei Hsiao
2004 Pacific Asia Conference on Information Systems  
To manage the ever-increasing volume of documents, individuals and organizations frequently organize their documents into categories that facilitate document management and subsequent information access and browsing. However, document clustering is intentional acts that reflect individual preferences with regard to the semantic coherency and relevant categorization of documents. Hence, an effective document clustering must consider individual preferences and needs to support personalization in
more » ... ocument categorization. In this study, we design and implement a collaborative-filtering-based document-clustering (CFC) technique by incorporating an individual's and his/her neighbors' partial clusterings for supporting personalized document clustering. The empirical evaluation results suggest that the use of an individual's partial clustering can achieve a better personalized clustering result than does the content-based document clustering technique. Moreover, use of the collaborative-filtering approach for expanding an individual's partial clustering can further improve personalized clustering, measured by cluster recall and precision.
dblp:conf/pacis/WeiYH04 fatcat:ahtxbz2mcbd4vizd42am3k6jse