Skip to main content

Multistage Email Spam Filtering Based on Three-Way Decisions

  • Conference paper
Rough Sets and Knowledge Technology (RSKT 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8171))

Included in the following conference series:

Abstract

A ternary, three-way decision strategy to email spam filtering divides incoming emails into three folders, namely, a mail folder consisting of emails that we accept as being legitimate, a spam folder consisting of emails that we reject as being legitimate, and a third folder consisting of emails that we cannot accept nor reject based on available information. The introduction of the third folder enables us to reduce both acceptance and rejection errors. Many existing ternary approaches are essentially a single-stage process. In this paper, we propose a model of multistage three-way email spam filtering based on principles of granular computing and rough sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Androutsopoulos, I., Paliouras, G., Karkaletsis, V., Sakkis, G., Spyropoulos, C.D., Stamatopoulos, P.: Learning to filter spam e-mail: A comparison of a naive Bayesian and a memory-based approach. In: Proceedings of PKDD 2000, pp. 1–13 (2000)

    Google Scholar 

  2. Azam, N., Yao, J.T.: Multiple criteria decision analysis with game-theoretic rough sets. In: Li, T., Nguyen, H.S., Wang, G., Grzymala-Busse, J.W., Janicki, R., Hassanien, A.E., Yu, H. (eds.) RSKT 2012. LNCS (LNAI), vol. 7414, pp. 399–408. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  3. Cristianini, N., Shawe-Taylor, I.: An Introduction to Support Vector Machines and Other Kernel-base Learning Methods. Cambridge University Press, Cambridge (2000)

    Book  Google Scholar 

  4. Deng, X.F., Yao, Y.Y.: A multifaceted analysis of probabilistic three-way decisions (manuscript, 2013)

    Google Scholar 

  5. Grzymala-Busse, J.W.: LERS - A system for learning from examples based on rough sets. In: Słowiński, R. (ed.) Intelligent Decision Support, pp. 3–18. Kluwer Academic Publishers, Boston (1992)

    Chapter  Google Scholar 

  6. Grzymala-Busse, J.W.: A local version of the MLEM2 algorithm for rule induction. Fundamenta Informaticae 100, 99–116 (2010)

    MathSciNet  MATH  Google Scholar 

  7. Jia, X.Y., Li, W.W., Shang, L., Chen, J.J.: An optimization viewpoint of decision-theoretic rough set model. In: Yao, J.T., Ramanna, S., Wang, G., Suraj, Z. (eds.) RSKT 2011. LNCS (LNAI), vol. 6954, pp. 457–465. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Jia, X.Y., Zheng, K., Li, W.W., Liu, T.T., Shang, L.: Three-way decisions solution to filter spam email: An empirical study. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS (LNAI), vol. 7413, pp. 287–296. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  9. Li, H.X., Zhou, X.Z.: Risk decision making based on decision-theoretic rough set: A three-way view decision model. International Journal of Computational Intelligence Systems 4, 1–11 (2011)

    Google Scholar 

  10. Liu, D., Li, T.R., Li, H.X.: A multiple-category classification approach with decision-theoretic rough sets. Fundamenta Informaticae 115, 173–188 (2012)

    MathSciNet  MATH  Google Scholar 

  11. Liu, D., Li, T.R., Liang, D.C.: A three-way government decision analysis with decision-theoretic rough sets. International Journal of Uncertainty and Knowledge-based Systems 20, 119–132 (2012)

    Article  Google Scholar 

  12. Pantel, P., Lin, D.K.: SpamCop: A spam classification & organization program. In: AAAI Workshop on Learning for Text Categorization. AAAI Technical Report WS-98-05, pp. 95–98 (1998)

    Google Scholar 

  13. Pedrycz, W.: Granular Computing: Analysis and Design of Intelligent Systems. CRC Press/Francis Taylor, Boca Raton (2013)

    Book  Google Scholar 

  14. Robinson, G.: A statistical approach to the spam problem, spam detection. Linux Journal (107) (2003), http://www.linuxjournal.com/article/6467 (retrieved on April 25, 2013)

  15. Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: AAAI workshop on learning for text categorization. AAAI Technical Report WS-98-05, Madison, Wisconsin (1998)

    Google Scholar 

  16. Schapire, E., Singer, Y.: BoosTexter: A boosting-based system for text categorization. Machine Learning 39, 135–168 (2000)

    Article  MATH  Google Scholar 

  17. Yao, Y.Y.: Probabilistic rough set approximations. International Journal of Approximate Reasoning 49, 255–271 (2008)

    Article  MATH  Google Scholar 

  18. Yao, Y.Y.: Information granulation and rough set approximation. International Journal of Intelligent Systems 16, 87–104 (2001)

    Article  MATH  Google Scholar 

  19. Yao, Y.Y.: An outline of a theory of three-way decisions. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H.X., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS, vol. 7413, pp. 1–17. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  20. Yao, Y.Y.: Granular computing and sequential three-way decisions. In: Lingras, P., Wolski, M., Cornelis, C., Mitra, S., Wasilewski, P. (eds.) RSKT 2013. LNCS (LNAI), vol. 8171, pp. 16–27. Springer, Heidelberg (2013)

    Google Scholar 

  21. Yao, Y.Y., Deng, X.F.: Sequential three-way decisions with probabilistic rough sets. In: Proceedings of the 10th IEEE International Conference on Cognitive Informatics and Cognitive Computing, pp. 120–125 (2011)

    Google Scholar 

  22. Yih, W.T., McCann, R., Kolcz, A.: Improving spam filtering by detecting gray mail. In: Proceedings of the 4th Conference on Email and Anti-Spam, CEAS 2007 (2007)

    Google Scholar 

  23. Yu, H., Chu, S.S., Yang, D.C.: Autonomous knowledge-oriented clustering using decision-theoretic rough set theory. Fundamenta Informaticae 115, 141–156 (2012)

    MathSciNet  MATH  Google Scholar 

  24. Zhao, W., Zhang, Z.: An email classification model based on rough set theory. In: Proceedings of the International Conference on Active Media Technology, pp. 403–408 (2005)

    Google Scholar 

  25. Zhou, B., Yao, Y.Y., Luo, J.G.: Cost-sensitive three-way email spam filtering. Journal of Intelligent Information Systems (2013), doi:10.1007/s10844-013-0254-7

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, J., Deng, X., Yao, Y. (2013). Multistage Email Spam Filtering Based on Three-Way Decisions. In: Lingras, P., Wolski, M., Cornelis, C., Mitra, S., Wasilewski, P. (eds) Rough Sets and Knowledge Technology. RSKT 2013. Lecture Notes in Computer Science(), vol 8171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41299-8_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41299-8_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41298-1

  • Online ISBN: 978-3-642-41299-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics