Abstract
In this article an approach to the automatic classification of email messages in mailboxes has been proposed. The aim of this paper is to devise methods to build decision tables from the collection of email messages on which it is possible to build Ant Colony Optimization-based ensemble classifiers, whose application allows to use the collection of emails without cleaning, at the same time improving the accuracy of the email folders classification. The proposed method has been tested by the selected algorithms on the Enron Email Dataset. The results confirm that the proposed solutions allows to improve the accuracy of classification of new emails to folders.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bekkerman, R., McCallum, A., Huang, G.: Automatic categorization of email into folders: Benchmark experiments on enron and sri corpora. Center for Intelligent Information Retrieval, Technical report IR (2004)
Boryczka, U., Kozak, J.: Ant colony decision trees – a new method for constructing decision trees based on ant colony optimization. In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNCS, vol. 6421, pp. 373–382. Springer, Heidelberg (2010)
Boryczka, U., Probierz, B., Kozak, J.: An ant colony optimization algorithm for an automatic categorization of emails. In: Hwang, D., Jung, J.J., Nguyen, N.-T. (eds.) ICCCI 2014. LNCS, vol. 8733, pp. 583–592. Springer, Heidelberg (2014)
Boryczka, U., Kozak, J.: Ant colony decision forest meta-ensemble. In: Nguyen, N.-T., Hoang, K., Jȩdrzejowicz, P. (eds.) ICCCI 2012, Part II. LNCS, vol. 7654, pp. 473–482. Springer, Heidelberg (2012)
Boryczka, U., Kozak, J.: On-the-Go adaptability in the new ant colony decision forest approach. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds.) ACIIDS 2014, Part II. LNCS, vol. 8398, pp. 157–166. Springer, Heidelberg (2014)
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman and Hall, New York (1984)
Bühlmann, P., Hothorn, T.: Boosting algorithms: Regularization, prediction and model fitting. Statistical Science 22(4), 477–505 (2007)
Doerner, K.F., Merkle, D., Stützle, T.: Special issue on ant colony optimization. Swarm Intelligence 3(1), 1–2 (2009)
Dorigo, M., Caro, G.D., Gambardella, L.: Ant algorithms for distributed discrete optimization. Artif. Life 5(2), 137–172 (1999)
Dorigo, M., Stützle, T.: Ant Colony Optimization. MIT Press, Cambridge (2004)
Efron, B.: Bootstrap methods: Another look at the jackknife. The Annals of Statistics 7(1), 1–26 (1979)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: International Conference on Machine Learning, pp. 148–156 (1996)
Grasse, P.P.: Termitologia, vol. II. Masson, Paris (1984)
Kiritchenko, S., Matwin, S.: Email classification with co-training. University of Ottawa, Technical report (2002)
Lewis, D.D.: Representation and Learning in Information Retrieval. Ph.D. thesis, Department of Computer Science, University of Massachusetts (1992)
Rudin, C., Schapire, R.E.: Margin-based ranking and an equivalence between AdaBoost and RankBoost. J. Mach. Learn. Res. 10, 2193–2232 (2009)
Schapire, R.E.: The strength of weak learnability. Machine Learning 5, 197–227 (1990)
Wang, M., He, Y., Jiang, M.: Text categorization of enron email corpus based on information bottleneck and maximal entropy (2010)
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann Publishers Inc. (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Boryczka, U., Probierz, B., Kozak, J. (2015). Adaptive Ant Colony Decision Forest in Automatic Categorization of Emails. In: Nguyen, N., Trawiński, B., Kosala, R. (eds) Intelligent Information and Database Systems. ACIIDS 2015. Lecture Notes in Computer Science(), vol 9011. Springer, Cham. https://doi.org/10.1007/978-3-319-15702-3_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-15702-3_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15701-6
Online ISBN: 978-3-319-15702-3
eBook Packages: Computer ScienceComputer Science (R0)