A New Algorithm to Categorize E-mail Messages to Folders with Social Networks Analysis

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9330)

Abstract

This paper presents a new approach to an automatic categorization of email messages into folders. The aim of this paper is to create a new algorithm that will allow one to improve the accuracy with which emails are assigned to folders (Email Foldering Problem) by using solutions that have been applied in Ant Colony Optimization algorithms (ACO) and Social Networks Analysis (SNA). The new algorithm that is proposed here has been tested on the publicly available Enron email data set. The obtained results confirm that this approach allows one to better organize new emails into folders based on an analysis of previous correspondence.

Keywords

E-mail Foldering Problem Ant Colony Optimization Enron E-mail Social Network Analysis 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Aral, S., Van Alstyne, M.: Network structure & information advantage (2007)Google Scholar
  2. 2.
    Beckers, R., Goss, S., Deneubourg, J., Pasteels, J.M.: Colony size, communication and ant foraging strategy. Psyche 96, 239–256 (1989)CrossRefGoogle Scholar
  3. 3.
    Bekkerman, R., McCallum, A., Huang, G.: Automatic categorization of email into folders: Benchmark experiments on enron and sri corpora. Center for Intelligent Information Retrieval, Technical Report IR (2004)Google Scholar
  4. 4.
    Boryczka, U., Probierz, B., Kozak, J.: An ant colony optimization algorithm for an automatic categorization of emails. In: Hwang, D., Jung, J.J., Nguyen, N.-T. (eds.) ICCCI 2014. LNCS, vol. 8733, pp. 583–592. Springer, Heidelberg (2014) Google Scholar
  5. 5.
    Boryczka, U., Kozak, J.: Enhancing the effectiveness of ant colony decision tree algorithms by co-learning. Applied Soft Computing 30, 166–178 (2015). http://www.sciencedirect.com/science/article/pii/S1568494615000575 CrossRefGoogle Scholar
  6. 6.
    Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman & Hall, New York (1984)MATHGoogle Scholar
  7. 7.
    Chapanond, A., Krishnamoorthy, M., Yener, B.: Graph theoretic and spectral analysis of enron email data. Computational and Mathematical Organization Theory 11(3), 265–281 (2005)CrossRefMATHGoogle Scholar
  8. 8.
    Cummings, J.N., Cross, R.: Structural properties of work groups and their consequences for performance. Social Networks 25, 197–210 (2003)CrossRefGoogle Scholar
  9. 9.
    Do, H., Choi, P., Lee, H.: Dynamic surveillance: a case study with enron email data set. In: Kim, Y., Lee, H., Perrig, A. (eds.) WISA 2013. LNCS, vol. 8267, pp. 81–99. Springer, Heidelberg (2014) CrossRefGoogle Scholar
  10. 10.
    Dorigo, M., Caro, G.D., Gambardella, L.: Ant algorithms for distributed discrete optimization. Artif. Life 5(2), 137–172 (1999)CrossRefGoogle Scholar
  11. 11.
    Gloor, P., Grippa, F., Putzke, J., Lassenius, C., Fuehres, H., Fischbach, K., Schoder, D.: Measuring social capital in creative teams through sociometric sensors. International Journal of Organisational Design and Engineering (2012)Google Scholar
  12. 12.
    Gloor, P.A.: Swarm Creativity: Competitive Advantage through Collaborative Innovation Networks. Oxford University Press, USA (2006)CrossRefGoogle Scholar
  13. 13.
    Keila, P., Skillicorn, D.: Structure in the enron email dataset. Computational and Mathematical Organization Theory 11(3), 183–199 (2005)CrossRefMATHGoogle Scholar
  14. 14.
    Moreno, J.L.: Who Shall Survive? Foundations of Sociometry, Group Psychotherapy and Sociodrama. Beacon House, Beacon (1953, 1978)Google Scholar
  15. 15.
    Shetty, J., Adibi, J.: The enron email dataset database schema and brief statistical report. Tech. rep. (2004)Google Scholar
  16. 16.
    Shetty, J., Adibi, J.: Discovering important nodes through graph entropy the case of enron email database. In: Proceedings of the 3rd International Workshop on Link Discovery, LinkKDD 2005, pp. 74–81. ACM, New York (2005)Google Scholar
  17. 17.
    Theraulaz, G., Goss, S., Gervet, J., Deneubourg, J.: Swarm intelligence in wasps colonies: an example of task assignment in multiagents systems. In: Proceedings of the 1990 IEEE International Symposium on Intelligent Control, pp. 135–143 (1990)Google Scholar
  18. 18.
    Verhaeghe, J., Deneubourg, J.: Experimental study and modelling of food recruitment in the ant tetramorium impurum. Insectes Sociaux 30, 347–360 (1983)CrossRefGoogle Scholar
  19. 19.
    Wilson, G.C., Banzhaf, W.: Discovery of email communication networks from the enron corpus with a genetic algorithm using social network analysis (2009)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Institute of Computer ScienceUniversity of SilesiaSosnowiecPoland

Personalised recommendations