Content-based concept drift detection for Email spam filtering

Morteza Zi Hayat, Javad Basiri, Leila Seyedhossein, Azadeh Shakery
2010 2010 5th International Symposium on Telecommunications  
The continued growth of Email usage, which is naturally followed by an increase in unsolicited emails so called spams, motivates research in spam filtering area. In the context of spam filtering systems, addressing the evolving nature of spams, which leads to obsolete the related models, has been always a challenge. In this paper an adaptive spam filtering system based on language model is proposed which can detect concept drift based on computing the deviation in email contents distribution.
more » ... e proposed method can be used along with any existing classifier; particularly in this paper we use Naïve Bayes method as classifier. The proposed method has been evaluated with Enron data set. The results indicate the efficiency of the method in detecting concept drift and its superiority over Naïve Bayes classifier in terms of accuracy.
doi:10.1109/istel.2010.5734082 fatcat:eas6glnpnval5ji5qj4hftinni