Skip to main content
Log in

A just-in-time keyword extraction from meeting transcripts using temporal and participant information

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In a meeting, it is often desirable to extract the keywords from each utterance as soon as it is spoken. Therefore, this paper proposes a just-in-time keyword extraction from meeting transcripts. The proposed method considers three major factors that make it different from keyword extraction from normal texts. The first factor is the temporal history of the preceding utterances that grants higher importance to recent utterances than older ones, and the second is topic relevance, which focuses only on the preceding utterances relevant to the current utterance. The final factor is the participants. The utterances spoken by the current speaker should be considered more important than those spoken by other participants. The proposed method considers these factors simultaneously under a graph-based keyword extraction with some graph operations. Experiments on two data sets in English and Korean show that consideration of these factors results in improved performance in keyword extraction from meeting transcripts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Notes

  1. This data set is available at http://ml.knu.ac.kr/dataset/keywordextraction.html.

  2. A guideline was given to the annotators that keywords must be a single word and the maximum number of keywords per utterance is five.

  3. The instructions how to segment a meeting into several topic segments are given at http://groups.inf.ed.ac.uk/ami/corpus/Guidelines/TopicSegmentationGuidelinesNonScenario.pdf

References

  • Abilhoa, W.D., & de Castro L.N. (2014). A keyword extraction method from twitter messages represented as graphs. Applied Mathematics and Computation, 240 (0), 308–325.

    Article  Google Scholar 

  • Carletta, J. (1996). Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics, 22(2), 249–254.

    Google Scholar 

  • Chen, Y.N., Huang, Y., Kong, S.Y., & Lee, L.S. (2010). Automatic key term extraction from spoken course lectures using branching entropy and prosodic/semantic features. In Proceedings of IEEE Workshop on Spoken Language Technology, pp 265–270.

  • Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C., & Nevill-Manning, C.G. (1999). Domain-specific keyphrase extraction. In Proceedings of the 18th International Joint Conference on Artificial intelligence, pp. 668–671.

  • Hulth, A. (2003). Improved automatic keyword extraction given more linguistic knowledge. In Proceedings of International Conference on Empirical Methods in Natural Language Processing, pp. 216–223.

  • Janin, A., Baron, D., Edwards, J., Ellis, D., Gelbart, D., Morgan, N., Peskin, B., Pfau, T., Shriberg, E., Stolcke, A., & Wooters, C. (2003). The icsi meeting corpus. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 364–367.

  • Liu, F., Liu, F., & Liu, Y. (2008). Automatic keyword extraction for the meeting corpus using supervised approach and bigram expansion. In Proceedings of IEEE Spoken Language Technology, pp. 181–184.

  • Liu, F., Pennell, D., Liu, F., & Liu, Y. (2009). Unsupervised approaches for automatic keyword extraction using meeting transcripts. In Proceedings of Annual Conference of the North American Chapter of the ACL, pp. 620–628.

  • Liu, F., Liu, F., & Liu, Y. (2011). A supervised framework for keyword extraction from meeting transcripts. IEEE Transactions on Audio, Speech, and Language Processing, 19(3), 538–548.

    Article  Google Scholar 

  • Liu, Z., Huang, W., Zheng, Y., & Sun, M. (2010). Automatic keyphrase extraction via topic decomposition. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 366–376.

  • Medelyan, O., Frank, E., & Witten, I.H. (2009). Human-competitive tagging using automatic keyphrase extraction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 1318–1327.

  • Mihalcea, R., & Tarau, P. (2004). Textrank: Bringing order into texts. In Proceedings of International Conference on Empirical Methods in Natural Language Processing, pp. 404–411.

  • Murray, G., Renals, S., Carletta, J., & Moore, J. (2005). Evaluating automatic summaries of meeting recordings. In Proceedings of the Workshop on Machine Translation and Summarization Evaluation, pp. 39–52.

  • Nenkova, A., & Passonneau, R. (2004). Evaluating content selection in summarization: The pyramid method. In Proceedings of Annual Conference of the North American Chapter of the ACL, pp. 145–152.

  • Saga, R., & Tsuji, H. (2013). Improved keyword extraction by separation into multiple document sets according to time series. In Proceedings of the 15th International Conference on Human-Computer Interaction, pp. 450–453.

  • Song, H. J., Go, J., Park, S. B., & Park, S. Y. (2013). A just-in-time keyword extraction from meeting transcripts. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 888–896).

  • Tomokiyo, T., & Hurst, M. (2003). A language model approach to keyphrase extraction. In Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, pp. 33–40.

  • Toutanova, K., Klein, D., Manning, C.D., & Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, pp. 173–180.

  • Turney, P. D. (2000). Learning algorithms for keyphrase extraction. Information Retrieval, 2, 303–336.

    Article  Google Scholar 

  • Turney, P.D. (2003). Coherent keyphrase extraction via web mining. In Proceedings of the 18th International Joint Conference on Artificial intelligence, pp. 434–439.

  • Wan, X., & Xiao, J. (2008). Collabrank: Towards a collaborative approach to single-document keyphrase extraction. In Proceedings of International Conference on Computational Linguistics, pp. 969–976.

  • Wan, X., Yang, J., & Xiao, J. (2007). Towards an iterative reinforcement approach for simultaneous document summarization and keyword extraction. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 552–559.

  • Wozniak, R.H. (1999). Classics in psychology, 1855–1914: Historical Essays. Thoemmes Press.

  • Wu, Z., & Giles, C. L. (2013). Measuring term informativeness in context. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 259–269).

  • Xu, S., Yang, S., & Lau, F.C.M. (2010). Keyword extraction and headline generation using novel word features. In Proceedings of the Twenty-Fourth AAAI Conference on Artifical Intelligence, pp. 1461–1466.

  • Zhao, W.X., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., & Li, X. (2011). Topical keyphrase extraction from twitter. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 379–388.

Download references

Acknowledgments

This study was supported by the BK21 Plus project (SW Human Resource Development Program for Supporting Smart Life) funded by the Ministry of Education, School of Computer Science and Engineering, Kyungpook National University, Korea (21A20131600005).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seong-Bae Park.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Song, HJ., Go, J., Park, SB. et al. A just-in-time keyword extraction from meeting transcripts using temporal and participant information. J Intell Inf Syst 48, 117–140 (2017). https://doi.org/10.1007/s10844-015-0391-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-015-0391-2

Keywords

Navigation