Automatic Advisor for Detecting Summarizable Chat Conversations in Online Instant Messages

  • Fajri KotoEmail author
  • Omar AbdillahEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 463)


In this paper, we report the first work ever of detecting the summarizable chat conversation in order to improve the quality of summarization and system performance, especially in real time server-based system like online instant messaging. Summarizable chat conversation means that the document assessed could produce a meaningful summary for human. Our study intends to answer the question: what are the characteristics of a summarizable chat and how to distinguish it with non-summarizable chat conversation. To conduct the experiment, corpora of 536 chat conversations was constructed manually. Technically, we used 19 attributes and grouped them by feature sets of (1) chat attribute, (2) lexical, and (3) Rapid Automatic Keyword Extraction (RAKE). As result, our work reveals that the features can classify summarizable chat by 78.36 % as our highest accuracy, performed by feature selection with SVM.


Chat Summarizable Non-summarizable Feature selection 


  1. 1.
    Sood, A., Mohamed, T.P., Varma, V.: Topic-focused summarization of chat conversations. In: Advances in Information Retrieval, pp. 800–803 (2013)Google Scholar
  2. 2.
    Uthus, D.C., Aha, D.W.: Plans toward automated chat summarization. In: Proceedings of the Workshop on Automatic Summarization for Different Genres, Media, and Languages. ACL (2011)Google Scholar
  3. 3.
    Koto, F., Sakriani S., Neubig, G., Toda, T., Adriani, M., Nakamura, S.: The use of semantic and acoustic features for open-domain TED talk summarization. In: Proceedings of The 6th Asia Pacific Signal and Information Processing Association (APSIPA). Siem Reap, Cambodia (2014)Google Scholar
  4. 4.
    Werry, C.C.: In: Linguistic and Interactional Features of Internet Relay Chat, pp. 47–64 (1996)Google Scholar
  5. 5.
    Hering, S.C.: Interactional coherence in CMC. In: Proceedings of the Thirty-Second Annual Hawaii International Conference on System Sciences (1999)Google Scholar
  6. 6.
    Hering, S.C.: Computer-mediated conversation: introduction and overview. In: Language@ Internet (2010)Google Scholar
  7. 7.
    Zhou, L., Hovy, E.: Digesting virtual “geek” culture: the summarization of technical Internet Relay Chats. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 298–305. ACL (2005)Google Scholar
  8. 8.
    Zechner, K.: Automatic summarization of open-domain multiparty dialogues in diverse genres. Comput. Linguist. 28(4), 447–485 (2002)CrossRefGoogle Scholar
  9. 9.
    Murray, G., Renals, S., Carletta, J., Moore, J.: Evaluating automatic summaries of meeting recordings. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 33–40. ACL (2005)Google Scholar
  10. 10.
    Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)CrossRefGoogle Scholar
  11. 11.
    Altman, D.G.: Practical Statistics for Medical research, vol. 20(1). Chapman Hall/CRC Press, London (1990)Google Scholar
  12. 12.
    Berry, M.W., Kogan, J.: “Text Mining”: Applications and Theory. Wiley, West Sussex, PO19 8SQ, UK (2010)Google Scholar
  13. 13.
    Akthar, F., Hahne, C.: RapidMiner 5 Operator Reference. In: Rapid-I GmbH (2012)Google Scholar
  14. 14.
    Montgomery, D.C., Peck, E.A., Vining, G.G.: Introduction to Linear Regression Analysis, vol. 821. Wiley, West Sussex, PO19 8SQ, UK (2012)Google Scholar
  15. 15.
    Lewis, D.D.: Naive (Bayes) at forty: the independence assumption in information retrieval. In: Educational and Psychological Machine learning: ECML-98, pp. 4–15. Springer, Berlin, Heidelberg (1998)Google Scholar
  16. 16.
    Fu, L.M.: Neural Network in Computer Intelligence. MIT-Press, McGraw-Hill International Edition (1994)Google Scholar
  17. 17.
    Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge university press (2000)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.Advanced Research LabSamsung R&D Institute IndonesiaJakartaIndonesia

Personalised recommendations