Spam Classification Using Deep Learning Technique
DOI:
https://doi.org/10.26438/ijcse/v6i5.383386Keywords:
Spam, Deep Learning, Machine Learning, Classify, WEKAAbstract
Deep Learning technique which is a new area of Machine Learning is showing huge promise in achieving the original goals of Machine learning: Artificial Intelligence. Deep Learning is being applied in every machine learning problem and has shown great results. In this paper, we evaluate the problem of spam classification using Deep Learning Technique and compare the result with other state-of-art machine learning techniques. The machine learning techniques used in the comparison are: Random Forest, Multinomial Naïve Bayesian and Support Vector Machine. The dataset used in the experiment is the CSDMC_2010 and Enron dataset and the platform used is the WEKA interface. Common features are extracted from the body of the spam and feature vector table is constructed, which is used on all the model. Our experiment shows that Deep Learning model outperform all the other machine learning techniques in terms of true positive & true negative and even in the overall accuracy.
References
Thiago S. Guzella and Walmir M. Caminhas, “A review of machine learning approaches to spam filtering”, Expert System with Applications”, Elsevier, Vol-36, pp 10206-10222, 2009.
G. Cormack, “Email spam filtering: A systematic Review", Foundations and Trends in Information Retrieval , Vol-1, no. 4, pp. 335–455, 2008.
M. Sahami, S. Dumais, D. Heckerman and E. Horvitz, “A Bayesian Approach to Filtering Junk Email,” AAAI Technical Report WS-98-05, AAAI Workshop on Learning for Text Categorization, 1998.
Drucker H, Wu D, Vapnik VN. “Support Vector Machines for Spam Categorization”, IEEE Transactions on Neural Networks Vol-10, Issue-5, pp 1048-1054, 1999.
Yudong Zhang, Shuihua Wang, Preetha Phillips, Genlin Ji, “Binary PSO with mutation operator for feature selection using decision tree applied to spam detection”, knowledge-Based Systems, Elsevier, Vol-64, pp 22-31, 2014.
Zhang L, Zhu J, Yao T, “An Evaluation of Statistical Spam Filtering Techniques Spam Filtering as Text Categorization”, ACM Transactions on Asian Language Information Processing (TALIP), Vol-3, Issue 4, pp 243-269, 2004.
Almeida TA, Yamakami A, “Content-Based Spam Filtering”, The 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, pp 1-7, 2010.
Lin Li and Chi Li, ”Research and Improvement of a Spam Filter based on Naïve Bayes”, Proceedings of the 2015 Seventh International Conference on Intelligent Human-Machine Systems and Cybernetics, 2015
Amayri O, Bouguila N, “A study of Spam Filtering using Support Vector Machines”, Artificial Intelligence Review, Vol-34, Isuue 1, pp 73-108, 2010.
Koprinska I, Poon J, Clark J, Chan J, “Learning to Classify e-mail”, Information Sciences, Vol-177, issue 10, pp 2167-2187, 2007.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
