Spam Classification Using Deep Learning Technique

Authors

  • Singh AB Dept. of Computer Science and Engineering, National Institute of Technology, Manipur, India
  • Singh SB Dept. of Computer Science and Engineering, National Institute of Technology, Manipur, India
  • Singh KM Dept. of Computer Science and Engineering, National Institute of Technology, Manipur, India

DOI:

https://doi.org/10.26438/ijcse/v6i5.383386

Keywords:

Spam, Deep Learning, Machine Learning, Classify, WEKA

Abstract

Deep Learning technique which is a new area of Machine Learning is showing huge promise in achieving the original goals of Machine learning: Artificial Intelligence. Deep Learning is being applied in every machine learning problem and has shown great results. In this paper, we evaluate the problem of spam classification using Deep Learning Technique and compare the result with other state-of-art machine learning techniques. The machine learning techniques used in the comparison are: Random Forest, Multinomial Naïve Bayesian and Support Vector Machine. The dataset used in the experiment is the CSDMC_2010 and Enron dataset and the platform used is the WEKA interface. Common features are extracted from the body of the spam and feature vector table is constructed, which is used on all the model. Our experiment shows that Deep Learning model outperform all the other machine learning techniques in terms of true positive & true negative and even in the overall accuracy.

References

Thiago S. Guzella and Walmir M. Caminhas, “A review of machine learning approaches to spam filtering”, Expert System with Applications”, Elsevier, Vol-36, pp 10206-10222, 2009.

G. Cormack, “Email spam filtering: A systematic Review", Foundations and Trends in Information Retrieval , Vol-1, no. 4, pp. 335–455, 2008.

M. Sahami, S. Dumais, D. Heckerman and E. Horvitz, “A Bayesian Approach to Filtering Junk Email,” AAAI Technical Report WS-98-05, AAAI Workshop on Learning for Text Categorization, 1998.

Drucker H, Wu D, Vapnik VN. “Support Vector Machines for Spam Categorization”, IEEE Transactions on Neural Networks Vol-10, Issue-5, pp 1048-1054, 1999.

Yudong Zhang, Shuihua Wang, Preetha Phillips, Genlin Ji, “Binary PSO with mutation operator for feature selection using decision tree applied to spam detection”, knowledge-Based Systems, Elsevier, Vol-64, pp 22-31, 2014.

Zhang L, Zhu J, Yao T, “An Evaluation of Statistical Spam Filtering Techniques Spam Filtering as Text Categorization”, ACM Transactions on Asian Language Information Processing (TALIP), Vol-3, Issue 4, pp 243-269, 2004.

Almeida TA, Yamakami A, “Content-Based Spam Filtering”, The 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, pp 1-7, 2010.

Lin Li and Chi Li, ”Research and Improvement of a Spam Filter based on Naïve Bayes”, Proceedings of the 2015 Seventh International Conference on Intelligent Human-Machine Systems and Cybernetics, 2015

Amayri O, Bouguila N, “A study of Spam Filtering using Support Vector Machines”, Artificial Intelligence Review, Vol-34, Isuue 1, pp 73-108, 2010.

Koprinska I, Poon J, Clark J, Chan J, “Learning to Classify e-mail”, Information Sciences, Vol-177, issue 10, pp 2167-2187, 2007.

Downloads

Published

2025-11-13
CITATION
DOI: 10.26438/ijcse/v6i5.383386
Published: 2025-11-13

How to Cite

[1]
A. Singh, S. Singh, and K. M. Singh, “Spam Classification Using Deep Learning Technique”, Int. J. Comp. Sci. Eng., vol. 6, no. 5, pp. 383–386, Nov. 2025.

Issue

Section

Research Article