Layout Based Spam Filtering

Claudiu N.Musat
2007 Zenodo  
Due to the constant increase in the volume of information available to applications in fields varying from medical diagnosis to web search engines, accurate support of similarity becomes an important task. This is also the case of spam filtering techniques where the similarities between the known and incoming messages are the fundaments of making the spam/not spam decision. We present a novel approach to filtering based solely on layout, whose goal is not only to correctly identify spam, but
more » ... o warn about major emerging threats. We propose a mathematical formulation of the email message layout and based on it we elaborate an algorithm to separate different types of emails and find the new, numerically relevant spam types.
doi:10.5281/zenodo.1085252 fatcat:vfc4au6t2nb4bapyqzxqvrsbvy