Performance Analysis of Email Classifiers for Detection of Spam
Keywords:Email, classifier, time, spam, ham, precision, recall, machine learning
AbstractGrowing usage of email has also increased size of email data, this data involves important as well as undesirable emails. Amount of unwanted emails(spam) has increased enormously. Blocking spam sources doesn’t works well in this era. For saving resources its vital to separate spam and essential emails(ham). Email servers are prepared to tackle this situation. Problem is handled by different algorithms that automate the system instead of manually separating emails. Our work addresses the selection of algorithm, whose outcome will precisely allocate labels to emails and will be efficient enough to give results in adequate time. So, that emails can be classified correctly into inbox and spam folders in adequate time by email server. Three different machine learning classifiers are analyzed over a dataset, providing a criterion that will categorize them according to their time, precision, recall and accuracy.
W. A. Awad and S. M. ELseuofi, "MACHINE LEARNING METHODS FOR SPAM E-MAIL," International Journal of Computer Science & Information Technology (IJCSIT), vol. 3, no. 1, 2011.
V. P. Karthika Renuka D, "Latent Semantics Indexing Based SVM Model for Email Spam Classification," Journal of Science and Industrial Research, vol. 73, pp. 437-442, 2014.
T. M. N. Mirza, "An Evaluation on the Efficiency of Hybrid Feature Selection in Spam Email Classification," International Journal of Advance Scientific Research and Engineering Trends, vol. 1, no. 4, 2016.
K.Renuka , Visalakshi, "Latent Semantics Indexing Based SVM Model for Email Spam Classification", NISCAIR-CSIR, JSIR vol.73, no 07, 2014.
A. M. Kibriya, E Frank, B. Pfahringer, and G. Holmes, "Multinomial Naive Bayes for Text Categorization Revisited", Australasian Joint Conference on Artificial Intelligence, 2005
S. Raschka, "Naive Bayes and Text Classification", Cornell University Libraray 2014
S. Wanger, M. Zimmermann, E. Ntoutsi, M. Spiliopoulou, "Ageing-Based Multinomial Naive Bayes Classifiers Over Opinionated Data Streams", 2016.
L. Jiang, C. Li, "A CFS-Based Feature Weighting Approach to Naive Bayes Text Classifiers",2014
Smera, Rockey, Rekha, T. Sunny, "A Hybrid Spam Filtering Technique Using Bayesian Spam Filters and Artificial Immunity Spam Filters", International Journal of Engineering Research & Technology (IJERT), Vol. 3 No. 5, 2014.
S. Maharana, M. Mohite and P. Wadekar, " Email Clustering Using Lingo Algorithm ", International Journal of Computer Science Trends and Technology, Vol. 2, No.6, 2014.
R. Mall, K. Arenberg, J. A.K. Suykens , "Kernel Spectral Document Clustering Using Unsupervised Precision-Recall Metrics.", International Joint Conference on Neural Networks (IJCNN), 2015
K.Renuka , Visalakshi, "Latent Semantics Indexing Based SVM Model for Email Spam Classification", NISCAIR-CSIR, 2014
I. Alsmadi, I. Alhami, "Clustering and classification of email ", Journal of King Saud University, 2015.
V. Metsis, I. Androutsopoulos, G. Paliouras "Spam Filtering With Naïve Bayes" K.P.Murphy, "Naïve Bayes Classifiers" , 2006.
A. Bhowmick, M. Hazarika, "Machine Learning for E-mail Spam Filtering: Review, Techniques and Trends", 2016.
S.Pundalik Teli, S. Biradar, "Effective Email Classification for Spam and Non-spam", International Journal of Advanced Research in Computer Science and Software Engineering, vol 4, Issue 6, 2014
S. Wanger, M. Zimmermann, E. Ntoutsi, M. Spiliopoulou, "Ageing-Based Multinomial Naive Bayes Classifiers Over Opinionated Data Streams",2016.
L. Jiang, C. Li, "A CFS-Based Feature Weighting Approach to Naive Bayes Text Classifiers", 2014
Copyright (c) 2019 NUST Journal of Engineering Sciences
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.