Abstract
A ternary, three-way decision strategy to email spam filtering divides incoming emails into three folders, namely, a mail folder consisting of emails that we accept as being legitimate, a spam folder consisting of emails that we reject as being legitimate, and a third folder consisting of emails that we cannot accept nor reject based on available information. The introduction of the third folder enables us to reduce both acceptance and rejection errors. Many existing ternary approaches are essentially a single-stage process. In this paper, we propose a model of multistage three-way email spam filtering based on principles of granular computing and rough sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Androutsopoulos, I., Paliouras, G., Karkaletsis, V., Sakkis, G., Spyropoulos, C.D., Stamatopoulos, P.: Learning to filter spam e-mail: A comparison of a naive Bayesian and a memory-based approach. In: Proceedings of PKDD 2000, pp. 1–13 (2000)
Azam, N., Yao, J.T.: Multiple criteria decision analysis with game-theoretic rough sets. In: Li, T., Nguyen, H.S., Wang, G., Grzymala-Busse, J.W., Janicki, R., Hassanien, A.E., Yu, H. (eds.) RSKT 2012. LNCS (LNAI), vol. 7414, pp. 399–408. Springer, Heidelberg (2012)
Cristianini, N., Shawe-Taylor, I.: An Introduction to Support Vector Machines and Other Kernel-base Learning Methods. Cambridge University Press, Cambridge (2000)
Deng, X.F., Yao, Y.Y.: A multifaceted analysis of probabilistic three-way decisions (manuscript, 2013)
Grzymala-Busse, J.W.: LERS - A system for learning from examples based on rough sets. In: Słowiński, R. (ed.) Intelligent Decision Support, pp. 3–18. Kluwer Academic Publishers, Boston (1992)
Grzymala-Busse, J.W.: A local version of the MLEM2 algorithm for rule induction. Fundamenta Informaticae 100, 99–116 (2010)
Jia, X.Y., Li, W.W., Shang, L., Chen, J.J.: An optimization viewpoint of decision-theoretic rough set model. In: Yao, J.T., Ramanna, S., Wang, G., Suraj, Z. (eds.) RSKT 2011. LNCS (LNAI), vol. 6954, pp. 457–465. Springer, Heidelberg (2011)
Jia, X.Y., Zheng, K., Li, W.W., Liu, T.T., Shang, L.: Three-way decisions solution to filter spam email: An empirical study. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS (LNAI), vol. 7413, pp. 287–296. Springer, Heidelberg (2012)
Li, H.X., Zhou, X.Z.: Risk decision making based on decision-theoretic rough set: A three-way view decision model. International Journal of Computational Intelligence Systems 4, 1–11 (2011)
Liu, D., Li, T.R., Li, H.X.: A multiple-category classification approach with decision-theoretic rough sets. Fundamenta Informaticae 115, 173–188 (2012)
Liu, D., Li, T.R., Liang, D.C.: A three-way government decision analysis with decision-theoretic rough sets. International Journal of Uncertainty and Knowledge-based Systems 20, 119–132 (2012)
Pantel, P., Lin, D.K.: SpamCop: A spam classification & organization program. In: AAAI Workshop on Learning for Text Categorization. AAAI Technical Report WS-98-05, pp. 95–98 (1998)
Pedrycz, W.: Granular Computing: Analysis and Design of Intelligent Systems. CRC Press/Francis Taylor, Boca Raton (2013)
Robinson, G.: A statistical approach to the spam problem, spam detection. Linux Journal (107) (2003), http://www.linuxjournal.com/article/6467 (retrieved on April 25, 2013)
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: AAAI workshop on learning for text categorization. AAAI Technical Report WS-98-05, Madison, Wisconsin (1998)
Schapire, E., Singer, Y.: BoosTexter: A boosting-based system for text categorization. Machine Learning 39, 135–168 (2000)
Yao, Y.Y.: Probabilistic rough set approximations. International Journal of Approximate Reasoning 49, 255–271 (2008)
Yao, Y.Y.: Information granulation and rough set approximation. International Journal of Intelligent Systems 16, 87–104 (2001)
Yao, Y.Y.: An outline of a theory of three-way decisions. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H.X., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS, vol. 7413, pp. 1–17. Springer, Heidelberg (2012)
Yao, Y.Y.: Granular computing and sequential three-way decisions. In: Lingras, P., Wolski, M., Cornelis, C., Mitra, S., Wasilewski, P. (eds.) RSKT 2013. LNCS (LNAI), vol. 8171, pp. 16–27. Springer, Heidelberg (2013)
Yao, Y.Y., Deng, X.F.: Sequential three-way decisions with probabilistic rough sets. In: Proceedings of the 10th IEEE International Conference on Cognitive Informatics and Cognitive Computing, pp. 120–125 (2011)
Yih, W.T., McCann, R., Kolcz, A.: Improving spam filtering by detecting gray mail. In: Proceedings of the 4th Conference on Email and Anti-Spam, CEAS 2007 (2007)
Yu, H., Chu, S.S., Yang, D.C.: Autonomous knowledge-oriented clustering using decision-theoretic rough set theory. Fundamenta Informaticae 115, 141–156 (2012)
Zhao, W., Zhang, Z.: An email classification model based on rough set theory. In: Proceedings of the International Conference on Active Media Technology, pp. 403–408 (2005)
Zhou, B., Yao, Y.Y., Luo, J.G.: Cost-sensitive three-way email spam filtering. Journal of Intelligent Information Systems (2013), doi:10.1007/s10844-013-0254-7
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, J., Deng, X., Yao, Y. (2013). Multistage Email Spam Filtering Based on Three-Way Decisions. In: Lingras, P., Wolski, M., Cornelis, C., Mitra, S., Wasilewski, P. (eds) Rough Sets and Knowledge Technology. RSKT 2013. Lecture Notes in Computer Science(), vol 8171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41299-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-41299-8_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41298-1
Online ISBN: 978-3-642-41299-8
eBook Packages: Computer ScienceComputer Science (R0)