Stochastic Methods to Find Maximum Likelihood for Spam E-mail Classification

  • Seyed M.-H. MansourbeigiEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 927)


The increasing volume of unsolicited bulk e-mails leads to the need for reliable stochastic spam detection methods for the classification of the received sequence of e-mails. When a sequence of emails is received by a recipient during a time period, the spam filters have already classified them as spam or not spam. Due to the dynamic nature of the spam, there might be emails marked as not spam but are actually real spams and vice versa. For the sake of security, it is important to be able to detect real spam emails. This paper utilizes stochastic methods to refine the preliminary spam detection and to find maximum likelihood for spam e-mail classification. The method is based on the Bayesian theorem, hidden Markov model (HMM), and the Viterbi algorithm.


  1. 1.
    Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: AAAI 1998 Workshop on Learning for Text Categorization (1998)Google Scholar
  2. 2.
    Al-Jarrah, O., Khaterz, I., Al-Duwairi, B.: Identifying potentially useful email header features for email spam filtering. In: ICDS The Sixth International Conference on Digital Society (2012)Google Scholar
  3. 3.
    Awad, W.A., ELseuofi, S.M.: Machine learning methods for spam e-mail classification. Int. J. Comput. Sci. Inform. Technol. 3(1), 173–184 (2011)CrossRefGoogle Scholar
  4. 4.
    Eberhardt, J.J.: Bayesian spam detection. Scholarly Horizons: University of Minnesota, Morris Journal, vol. 2, no. 1 (2015)Google Scholar
  5. 5.
    Freeman, D.M.: Using naive Bayes to detect spammy names in social networks. In: Proceedings of the 2013 ACM Workshop on Artificial Intelligence and Security, AISec 2013, pp. 3–12. New York (2013)Google Scholar
  6. 6.
    Lee, S., Jeong, I., Choi, S.: Dynamically weighted hidden Markov model for spam deobfuscation. In: IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp. 2523–2529 (2007)Google Scholar
  7. 7.
    Lee, H., Ng, A.Y.: Spam deobfuscation using a hidden Markov model. In: Proceedings of the 2nd Conference on Email and Anti-Spam, Stanford University (2005)Google Scholar
  8. 8.
    Jaswal, V., Sood, N.: Spam detection system using hidden Markov model. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 3(7), 304–308 (2013)Google Scholar
  9. 9.
    Roy, S., Patra, A., Sau, S., Mandal, K., Kunar, S.: An efficient spam filtering techniques for email account. Am. J. Eng. Res. 2(10), 63–73 (2013)Google Scholar
  10. 10.
    Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans. Inf. Theor. 13(2), 260–267 (1967)CrossRefGoogle Scholar
  11. 11.
    Papoulis, A.: Probability, Random Variables, and Stochastic Processes. McGraw-Hill Series in Electrical Engineering, 3rd edn. (1991)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Computer Science, College of EngineeringUtah State UniversityLoganUSA

Personalised recommendations