Skip to main content

Discussion Data Analytics

  • Chapter
  • First Online:
  • 1256 Accesses

Abstract

Evidence-based research, such as research on big data applications, has been receiving much attention and has led to the proposal of techniques for improving the quality of life by storing and analyzing data on daily activities in large quantities. These types of techniques have been applied in the education sector, but a crucial problem remains to be overcome: it is generally difficult to record intellectual activities and accumulate and analyze such data on a large scale. Since this kind of data is not possible to compress in a manner, such as taking the average, it is necessary to maintain the original data as the instances of cases. Such human intellectual-activity data should be treated as big data in the near future. We have been developing a discussion mining system that records face-to-face meetings in detail, analyzes their content, and conducts knowledge discovery. Looking back on past discussion content by browsing documents, such as minutes, is an effective means for conducting future activities. In meetings at which some research topics are regularly discussed, such as seminars in laboratories, the presenters are required to discuss future issues by checking urgent matters from the discussion records. We call statements including advice or requests proposed at previous meetings “task statements” and propose a method for automatically extracting them. With this method, based on certain semantic attributes and linguistic characteristics of statements, a statistical machine learning model is created using logistic regression analysis. A statement is judged whether it is a task statement according to its probability. We also developed a method that maintains the extraction accuracy by using the discussion mining system and its extension on the basis of task statement extraction over a long period. Specifically, we constructed an initial discriminant model of task statements and then applied active learning to new meeting minutes to improve the extraction accuracy. Active learning also has the advantage of reducing labeling costs in supervised machine learning. We explain the improvement in extraction accuracy and reduction in labeling costs with our method and confirm its effectiveness through simulations we conducted.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • R. Barzilay, M. Lapata, Modeling local coherence: an entity-based approach. Comput. Linguist. 34(1), 1–34 (2008)

    Article  Google Scholar 

  • K. Georgala, A. Kosmopoulos, and G. Paliouras, Spam Filtering: An Active Learning Approach using Incremental Clustering, in Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics, No. 23 (2014)

    Google Scholar 

  • D. D. Lewis, W. A. Gale, A Sequential Algorithm for Training Text Classifiers, in Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12 (ACM/Springer, 1994)

    Google Scholar 

  • A. Liu, L. Reyzin, B. D. Ziebart, Shift-Pessimistic Active Learning using Robust Bias-Aware Prediction, in Proceedings of the AAAI Conference on Artificial Intelligence (2015)

    Google Scholar 

  • T. Mikolov, I. Sutskever, K. Chen, G. Corrado, J. Dean, Distributed Representations of Words and Phrases and their Compositionality, in Advances in Neural Information Processing Systems, ed by Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q., pp. 3111–3119 (2013)

    Google Scholar 

  • T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient Estimation of Word Representations in Vector Space. arXiv preprint, arXiv:1301.3781 (2013)

  • K. Nagao, K. Hasida, Automatic Text Summarization Based on the Global Document Annotation, in Proceedings of the Seventeenth International Conference on Computational Linguistics (COLING-98), pp. 917–921 (1998)

    Google Scholar 

  • K. Nagao, K. Inoue, N. Morita, S. Matsubara, Automatic Extraction of Task Statements from Structured Meeting Content, in Proceedings of the 7th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2015) (2015)

    Google Scholar 

  • K. Nagao, K. Kaji, D. Yamamoto, H. Tomobe, Discussion Mining: Annotation-Based Knowledge Discovery from Real World Activities, in Advances in Multimedia Information Processing—PCM 2004, LNCS, Vol. 3331, pp. 522–531 (Springer, 2005)

    Google Scholar 

  • W. Nakano, T. Kobayashi, Y. Katsuyama, S. Naoi, H. Yokota, Treatment of Laser Pointer and Speech Information in Lecture Scene Retrieval, in Proceedings of the 8th IEEE International Symposium on Multimedia 2006, pp. 927–932 (2006)

    Google Scholar 

  • T. Nishida, Conversation quantization for conversational knowledge process. Int. J. Comput. Sci. Eng. 3(2), 134–144 (2007)

    Google Scholar 

  • N. Roy, A. Mccallum, Toward Optimal Active Learning through Monte Carlo Estimation of Error Reduction, in Proceedings of the 18th International Conference on Machine Learning, pp. 441–448 (ICML 2001)

    Google Scholar 

  • B. Settles, M. Craven, An Analysis of Active Learning Strategies for Sequence Labeling Tasks, in Proceedings of the Conference on Empirical Methods in Natural Language Processing (Association for Computational Linguistics, 2008)

    Google Scholar 

  • B. Settles, Active Learning Literature Survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison (2010)

    Google Scholar 

  • H. Shimodaira, Improving Predictive Inference under Covariate Shift by Weighting the Log-Likelihood Function. J. Stat. Plann. Infer. 90, 227–244 (2000)

    Article  MathSciNet  Google Scholar 

  • M. Sugiyama, M. Kawanabe, Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation (The MIT Press, 2012)

    Google Scholar 

  • T. Schultz, A. Waibel, M. Bett, F. Metze, Y. Pan, K. Ries, T. Schaaf, H. Soltau, W. Martin, H. Yu, K. Zechner, The ISL Meeting Room System, in Proceedings of the Workshop on Hands-Free Speech Communication (HSC-2001) (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katashi Nagao .

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Nagao, K. (2019). Discussion Data Analytics. In: Artificial Intelligence Accelerates Human Learning. Springer, Singapore. https://doi.org/10.1007/978-981-13-6175-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-6175-3_2

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-6174-6

  • Online ISBN: 978-981-13-6175-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics