Performance Analysis of Email Classifiers for Detection of Spam

Zahra Masood

Abstract


Growing usage of email has also increased size of email data, this data involves important as well as undesirable emails. Amount of unwanted emails(spam) has increased enormously. Blocking spam sources doesn’t works well in this era. For saving resources its vital to separate spam and essential emails(ham). Email servers are prepared to tackle this situation. Problem is handled by different algorithms that automate the system instead of manually separating emails. Our work addresses the selection of algorithm, whose outcome will precisely allocate labels to emails and will be efficient enough to give results in adequate time. So, that emails can be classified correctly into inbox and spam folders in adequate time by email server. Three different machine learning classifiers are analyzed over a dataset, providing a criterion that will categorize them according to their time, precision, recall and accuracy.

Keywords


Email;classifier;time;spam;ham;precision;recall;machine learning

References


W. A. Awad and S. M. ELseuofi, "MACHINE LEARNING METHODS FOR SPAM E-MAIL," International Journal of Computer Science & Information Technology (IJCSIT), vol. 3, no. 1, 2011.

V. P. Karthika Renuka D, "Latent Semantics Indexing Based SVM Model for Email Spam Classification," Journal of Science and Industrial Research, vol. 73, pp. 437-442, 2014.

T. M. N. Mirza, "An Evaluation on the Efficiency of Hybrid Feature Selection in Spam Email Classification," International Journal of Advance Scientific Research and Engineering Trends, vol. 1, no. 4, 2016.

K.Renuka , Visalakshi, "Latent Semantics Indexing Based SVM Model for Email Spam Classification", NISCAIR-CSIR, JSIR vol.73, no 07, 2014.

A. M. Kibriya, E Frank, B. Pfahringer, and G. Holmes, "Multinomial Naive Bayes for Text Categorization Revisited", Australasian Joint Conference on Artificial Intelligence, 2005

S. Raschka, "Naive Bayes and Text Classification", Cornell University Libraray 2014

S. Wanger, M. Zimmermann, E. Ntoutsi, M. Spiliopoulou, "Ageing-Based Multinomial Naive Bayes Classifiers Over Opinionated Data Streams", 2016.

L. Jiang, C. Li, "A CFS-Based Feature Weighting Approach to Naive Bayes Text Classifiers",2014

Smera, Rockey, Rekha, T. Sunny, "A Hybrid Spam Filtering Technique Using Bayesian Spam Filters and Artificial Immunity Spam Filters", International Journal of Engineering Research & Technology (IJERT), Vol. 3 No. 5, 2014.

S. Maharana, M. Mohite and P. Wadekar, " Email Clustering Using Lingo Algorithm ", International Journal of Computer Science Trends and Technology, Vol. 2, No.6, 2014.

R. Mall, K. Arenberg, J. A.K. Suykens , "Kernel Spectral Document Clustering Using Unsupervised Precision-Recall Metrics.", International Joint Conference on Neural Networks (IJCNN), 2015

K.Renuka , Visalakshi, "Latent Semantics Indexing Based SVM Model for Email Spam Classification", NISCAIR-CSIR, 2014

I. Alsmadi, I. Alhami, "Clustering and classification of email ", Journal of King Saud University, 2015.

V. Metsis, I. Androutsopoulos, G. Paliouras "Spam Filtering With Naïve Bayes" K.P.Murphy, "Naïve Bayes Classifiers" , 2006.

A. Bhowmick, M. Hazarika, "Machine Learning for E-mail Spam Filtering: Review, Techniques and Trends", 2016.

S.Pundalik Teli, S. Biradar, "Effective Email Classification for Spam and Non-spam", International Journal of Advanced Research in Computer Science and Software Engineering, vol 4, Issue 6, 2014

S. Wanger, M. Zimmermann, E. Ntoutsi, M. Spiliopoulou, "Ageing-Based Multinomial Naive Bayes Classifiers Over Opinionated Data Streams",2016.

L. Jiang, C. Li, "A CFS-Based Feature Weighting Approach to Naive Bayes Text Classifiers", 2014




DOI: http://dx.doi.org/10.24949%2Fnjes.v12i2.262

Refbacks

  • There are currently no refbacks.


ISSN (Print): 2070-9900   ISSN (Online): 2411-6319