Computational Collective Intelligence pp 89-98 | Cite as
A New Algorithm to Categorize E-mail Messages to Folders with Social Networks Analysis
Abstract
This paper presents a new approach to an automatic categorization of email messages into folders. The aim of this paper is to create a new algorithm that will allow one to improve the accuracy with which emails are assigned to folders (Email Foldering Problem) by using solutions that have been applied in Ant Colony Optimization algorithms (ACO) and Social Networks Analysis (SNA). The new algorithm that is proposed here has been tested on the publicly available Enron email data set. The obtained results confirm that this approach allows one to better organize new emails into folders based on an analysis of previous correspondence.
Keywords
E-mail Foldering Problem Ant Colony Optimization Enron E-mail Social Network AnalysisPreview
Unable to display preview. Download preview PDF.
References
- 1.Aral, S., Van Alstyne, M.: Network structure & information advantage (2007)Google Scholar
- 2.Beckers, R., Goss, S., Deneubourg, J., Pasteels, J.M.: Colony size, communication and ant foraging strategy. Psyche 96, 239–256 (1989)CrossRefGoogle Scholar
- 3.Bekkerman, R., McCallum, A., Huang, G.: Automatic categorization of email into folders: Benchmark experiments on enron and sri corpora. Center for Intelligent Information Retrieval, Technical Report IR (2004)Google Scholar
- 4.Boryczka, U., Probierz, B., Kozak, J.: An ant colony optimization algorithm for an automatic categorization of emails. In: Hwang, D., Jung, J.J., Nguyen, N.-T. (eds.) ICCCI 2014. LNCS, vol. 8733, pp. 583–592. Springer, Heidelberg (2014) Google Scholar
- 5.Boryczka, U., Kozak, J.: Enhancing the effectiveness of ant colony decision tree algorithms by co-learning. Applied Soft Computing 30, 166–178 (2015). http://www.sciencedirect.com/science/article/pii/S1568494615000575 CrossRefGoogle Scholar
- 6.Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman & Hall, New York (1984)MATHGoogle Scholar
- 7.Chapanond, A., Krishnamoorthy, M., Yener, B.: Graph theoretic and spectral analysis of enron email data. Computational and Mathematical Organization Theory 11(3), 265–281 (2005)CrossRefMATHGoogle Scholar
- 8.Cummings, J.N., Cross, R.: Structural properties of work groups and their consequences for performance. Social Networks 25, 197–210 (2003)CrossRefGoogle Scholar
- 9.Do, H., Choi, P., Lee, H.: Dynamic surveillance: a case study with enron email data set. In: Kim, Y., Lee, H., Perrig, A. (eds.) WISA 2013. LNCS, vol. 8267, pp. 81–99. Springer, Heidelberg (2014) CrossRefGoogle Scholar
- 10.Dorigo, M., Caro, G.D., Gambardella, L.: Ant algorithms for distributed discrete optimization. Artif. Life 5(2), 137–172 (1999)CrossRefGoogle Scholar
- 11.Gloor, P., Grippa, F., Putzke, J., Lassenius, C., Fuehres, H., Fischbach, K., Schoder, D.: Measuring social capital in creative teams through sociometric sensors. International Journal of Organisational Design and Engineering (2012)Google Scholar
- 12.Gloor, P.A.: Swarm Creativity: Competitive Advantage through Collaborative Innovation Networks. Oxford University Press, USA (2006)CrossRefGoogle Scholar
- 13.Keila, P., Skillicorn, D.: Structure in the enron email dataset. Computational and Mathematical Organization Theory 11(3), 183–199 (2005)CrossRefMATHGoogle Scholar
- 14.Moreno, J.L.: Who Shall Survive? Foundations of Sociometry, Group Psychotherapy and Sociodrama. Beacon House, Beacon (1953, 1978)Google Scholar
- 15.Shetty, J., Adibi, J.: The enron email dataset database schema and brief statistical report. Tech. rep. (2004)Google Scholar
- 16.Shetty, J., Adibi, J.: Discovering important nodes through graph entropy the case of enron email database. In: Proceedings of the 3rd International Workshop on Link Discovery, LinkKDD 2005, pp. 74–81. ACM, New York (2005)Google Scholar
- 17.Theraulaz, G., Goss, S., Gervet, J., Deneubourg, J.: Swarm intelligence in wasps colonies: an example of task assignment in multiagents systems. In: Proceedings of the 1990 IEEE International Symposium on Intelligent Control, pp. 135–143 (1990)Google Scholar
- 18.Verhaeghe, J., Deneubourg, J.: Experimental study and modelling of food recruitment in the ant tetramorium impurum. Insectes Sociaux 30, 347–360 (1983)CrossRefGoogle Scholar
- 19.Wilson, G.C., Banzhaf, W.: Discovery of email communication networks from the enron corpus with a genetic algorithm using social network analysis (2009)Google Scholar