A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
A Cluster-based Approach to Filtering Spam under Skewed Class Distributions
2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07)
The purpose of this research is to propose an appropriate classification approach to improving the effectiveness of spam filtering on the issue of skewed class distributions. A clustering-based classifier is proposed to first cluster documents into several groups, and then an equal number of keywords are extracted from each group to alleviate the problem caused by skewed class distributions. Experiments are conducted to validate the effectiveness of the proposed classifier. The results showdoi:10.1109/hicss.2007.7 dblp:conf/hicss/HsiaoCH07 fatcat:emgbfxoi5vfwrcxdr4qnlkzk2q