Conversation Detection in Email Systems

  • Shai Erera
  • David Carmel
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4956)


This work explores a novel approach to conversation detection in email mailboxes. The approach clusters messages into coherent conversations using a similarity function between messages that takes into account all relevant email attributes, such as message subject, participants, date of submission, and message content. The detection algorithm is evaluated against a manual partition of two email mailboxes into conversations. Experimental results demonstrate the superiority of our detection algorithm over several alternative approaches.





Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Shai Erera (1, 2)
  • David Carmel (2)
  1. University of Haifa, Haifa, Israel
  2. IBM Research Lab in Haifa, Israel
