A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2005; you can also visit the original URL.
The file type is application/pdf
.
Automatic Textual Document Categorization Using Multiple Similarity-Based Models
[chapter]
2001
Proceedings of the 2001 SIAM International Conference on Data Mining
We develop a similarity-based textual document categorization method called the generalized instance set (GIS) algorithm. GIS integrates the advantages of linear classifiers and k-nearest neighbour algorithm by generalization of selected instances. To further enhance the performance, we propose a meta-model framework which combines the strength of different variants of GIS algorithm as well as state-of-the-art existing algorithms using multivariate regression analysis on document feature
doi:10.1137/1.9781611972719.20
dblp:conf/sdm/LaiL01
fatcat:rndqqveyuvesllffg3ojbmjody