SMS Spam Filtering Using N - gram method, Information Gain Metric and an Improved Version of SVDD Classifier

Mohamed El Boujnouni, Mohammed V University, Faculty of Sciences Rabat, Laboratory of Conception and Systems (Microelectronic and Informatics) Avenue Ibn Battouta B.P 1014, Rabat, Morocco
2017 Journal of Engineering Science and Technology Review  
Text messages (SMS or Short Message Service) are widely used form of mobile communication; their popularity is attributed to many factors including low cost sending, simple delivery mode and convenient usage. However, an unsolicited kind of SMS (Spam) has been appeared and caused major problems for users and mobile service providers. In this paper we propose a new SMS Spam filter able to distinguish between legitimate messages and Spam. The proposed filter is based on three components: N-grams
more » ... omponents: N-grams method to extract features from short messages, information gain ratio to select the most relevant features, and an improved version of Support Vector Domain Description to detect SMS Spam. Experimental results on a large benchmark dataset of real-world benign and Spam SMS have shown the performance and effectiveness of the proposed filter.
doi:10.25103/jestr.101.18 fatcat:hfd4wuu3xneltaym56ub4e5czy