Abstract
Today, a huge amount of data is transferred between computers in the form of emails, which makes it increasingly difficult to manually separate important emails from unimportant ones. Email classification has been studied extensively, but most of that research has focused on spam detection and filtering. This paper focuses on classifying emails into custom folders that are relevant to the user. We use two approaches: the Naïve Bayes classifier and the k-nearest neighbors algorithm. The Naïve Bayes classifier is based on a probabilistic model, while the k-nearest neighbors algorithm is based on a similarity measure computed against the training emails. We describe how to apply both approaches to email classification, analyze their performance, and compare their results. Finally, we propose future work for further optimization and better efficiency.
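To make the contrast between the two approaches concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not specify features, smoothing, or the similarity measure, so everything here is an assumption for illustration: whitespace tokenization, a multinomial Naïve Bayes model with add-one (Laplace) smoothing, cosine similarity over word counts for k-nearest neighbors, and all function names and the toy `TRAINING` data are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def train_naive_bayes(emails):
    """Count emails per folder and words per folder from (text, folder) pairs."""
    folder_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, folder in emails:
        folder_counts[folder] += 1
        for word in tokenize(text):
            word_counts[folder][word] += 1
            vocab.add(word)
    return folder_counts, word_counts, vocab


def classify_naive_bayes(model, text):
    """Return the folder with the highest log prior plus log likelihood."""
    folder_counts, word_counts, vocab = model
    total_emails = sum(folder_counts.values())
    best_folder, best_score = None, -math.inf
    for folder, count in folder_counts.items():
        score = math.log(count / total_emails)  # log prior
        # Assumed add-one smoothing: denominator is folder word total + |vocab|.
        denom = sum(word_counts[folder].values()) + len(vocab)
        for word in tokenize(text):
            if word in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[folder][word] + 1) / denom)
        if score > best_score:
            best_folder, best_score = folder, score
    return best_folder


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify_knn(emails, text, k=3):
    """Majority vote among the k training emails most similar to the query."""
    query = Counter(tokenize(text))
    ranked = sorted(
        emails,
        key=lambda pair: cosine(query, Counter(tokenize(pair[0]))),
        reverse=True,
    )
    votes = Counter(folder for _, folder in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two custom folders with three emails each.
TRAINING = [
    ("project deadline meeting agenda", "work"),
    ("quarterly report meeting schedule", "work"),
    ("client project report review", "work"),
    ("family dinner this weekend", "personal"),
    ("birthday party weekend plans", "personal"),
    ("holiday trip family photos", "personal"),
]
```

With this toy data, calling `classify_naive_bayes(train_naive_bayes(TRAINING), text)` and `classify_knn(TRAINING, text)` on the same message lets the two decisions be compared side by side. The Naïve Bayes model summarizes the training set into counts once, whereas the k-nearest neighbors function compares the query against every training email at prediction time.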
Acknowledgements
We would like to thank Prof. V.K. Sambhe for guiding us in this project, offering important suggestions, and carefully reviewing this paper. We would also like to thank the other faculty members of the Computer Engineering and Information Technology Department of V.J.T.I. for their valuable inputs and suggestions.
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bhadra, A., Hitawala, S., Modi, R., Salunkhe, S. (2018). Email Classification Using Supervised Learning Algorithms. In: Saeed, K., Chaki, N., Pati, B., Bakshi, S., Mohapatra, D. (eds) Progress in Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 564. Springer, Singapore. https://doi.org/10.1007/978-981-10-6875-1_9
DOI: https://doi.org/10.1007/978-981-10-6875-1_9
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6874-4
Online ISBN: 978-981-10-6875-1
eBook Packages: Engineering, Engineering (R0)