Skip to main content

Email Classification Using Supervised Learning Algorithms

  • Conference paper
  • First Online:
Progress in Advanced Computing and Intelligent Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 564))

Abstract

In the world of Internet today, huge amount of data is transferred between computers in the form of emails. Consequently, it is getting difficult to sort the important emails manually from the unimportant ones. Email classification has been extensively studied and researched in the past but most of the research has been in the field of spam detection and filtering. This paper focuses on classifying emails into custom folders that are relevant to the user. We have used two different approaches here—Naïve Bayes classifier and k-nearest neighbors algorithm. The Naïve Bayes classifier is based on a probabilistic model, while the k-nearest neighbors algorithm is based on a similarity measure with the training emails. We propose the method of using these two approaches in email classification, analyze the performance of these algorithms, and compare their results. Then, we propose some future work for further optimization and better efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Email Statistics Report, 2015–2019 conducted by, The Radicati Group, Inc. A Technology Market Research Firm, Palo Alto, CA, USA

    Google Scholar 

  2. Zhou, L., Wang, L., Ge, X., Shi, Q.: A clustering-Based KNN improved algorithm CLKNN for text classification. In: 2010 2nd International Asia Conference on Informatics in Control, Automation and Robotics (CAR), vol.3, no., pp. 212–215, 6–7 Mar 2010

    Google Scholar 

  3. Wajeed, M.A., Adilakshmi, T.: Semi-supervised text classification using enhanced KNN algorithm. In: 2011 World Congress on Information and Communication Technologies (WICT), vol., no., pp. 138–142, 11–14 Dec 2011

    Google Scholar 

  4. Harisinghaney, A.; Dixit, A.; Gupta, S.; Arora, A., “Text and image based spam email classification using KNN, Naïve Bayes and Reverse DBSCAN algorithm,” in Optimization, Reliabilty, and Information Technology (ICROIT), 2014 International Conference on, vol., no., pp. 153–155, 6–8 Feb. 2014

    Google Scholar 

  5. Chakrabarty, A., Roy, S.: An optimized k-NN classifier based on minimum spanning tree for email filtering. In: 2014 2nd International Conference on Business and Information Management (ICBIM), vol., no., pp. 47–52, 9–11 Jan 2014

    Google Scholar 

  6. Zhang, Y., Lijun, Z., Jianfeng, Y., Zhanhuai, L.: Using association features to enhance the performance of Naive Bayes text classifier. In: 2003. ICCIMA 2003. Proceedings of the Fifth International Conference on Computational Intelligence and Multimedia Applications, vol., no., pp. 336–341, 27–30 Sept 2003

    Google Scholar 

Download references

Acknowledgements

We would like to thank Prof. V.K. Sambhe for providing guidance to us in this project, giving important suggestions and helping in carefully reviewing this paper. We would also like to thank the other faculty members of the Computer Engineering and Information Technology Department of V.J.T.I. for their valuable inputs and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Akshay Bhadra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bhadra, A., Hitawala, S., Modi, R., Salunkhe, S. (2018). Email Classification Using Supervised Learning Algorithms. In: Saeed, K., Chaki, N., Pati, B., Bakshi, S., Mohapatra, D. (eds) Progress in Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 564. Springer, Singapore. https://doi.org/10.1007/978-981-10-6875-1_9

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6875-1_9

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6874-4

  • Online ISBN: 978-981-10-6875-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics