Abstract
Evidence-based research, such as research on big data applications, has been receiving much attention and has led to the proposal of techniques for improving the quality of life by storing and analyzing data on daily activities in large quantities. These types of techniques have been applied in the education sector, but a crucial problem remains to be overcome: it is generally difficult to record intellectual activities and accumulate and analyze such data on a large scale. Since this kind of data is not possible to compress in a manner, such as taking the average, it is necessary to maintain the original data as the instances of cases. Such human intellectual-activity data should be treated as big data in the near future. We have been developing a discussion mining system that records face-to-face meetings in detail, analyzes their content, and conducts knowledge discovery. Looking back on past discussion content by browsing documents, such as minutes, is an effective means for conducting future activities. In meetings at which some research topics are regularly discussed, such as seminars in laboratories, the presenters are required to discuss future issues by checking urgent matters from the discussion records. We call statements including advice or requests proposed at previous meetings “task statements” and propose a method for automatically extracting them. With this method, based on certain semantic attributes and linguistic characteristics of statements, a statistical machine learning model is created using logistic regression analysis. A statement is judged whether it is a task statement according to its probability. We also developed a method that maintains the extraction accuracy by using the discussion mining system and its extension on the basis of task statement extraction over a long period. Specifically, we constructed an initial discriminant model of task statements and then applied active learning to new meeting minutes to improve the extraction accuracy. Active learning also has the advantage of reducing labeling costs in supervised machine learning. We explain the improvement in extraction accuracy and reduction in labeling costs with our method and confirm its effectiveness through simulations we conducted.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
R. Barzilay, M. Lapata, Modeling local coherence: an entity-based approach. Comput. Linguist. 34(1), 1–34 (2008)
K. Georgala, A. Kosmopoulos, and G. Paliouras, Spam Filtering: An Active Learning Approach using Incremental Clustering, in Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics, No. 23 (2014)
D. D. Lewis, W. A. Gale, A Sequential Algorithm for Training Text Classifiers, in Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12 (ACM/Springer, 1994)
A. Liu, L. Reyzin, B. D. Ziebart, Shift-Pessimistic Active Learning using Robust Bias-Aware Prediction, in Proceedings of the AAAI Conference on Artificial Intelligence (2015)
T. Mikolov, I. Sutskever, K. Chen, G. Corrado, J. Dean, Distributed Representations of Words and Phrases and their Compositionality, in Advances in Neural Information Processing Systems, ed by Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q., pp. 3111–3119 (2013)
T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient Estimation of Word Representations in Vector Space. arXiv preprint, arXiv:1301.3781 (2013)
K. Nagao, K. Hasida, Automatic Text Summarization Based on the Global Document Annotation, in Proceedings of the Seventeenth International Conference on Computational Linguistics (COLING-98), pp. 917–921 (1998)
K. Nagao, K. Inoue, N. Morita, S. Matsubara, Automatic Extraction of Task Statements from Structured Meeting Content, in Proceedings of the 7th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2015) (2015)
K. Nagao, K. Kaji, D. Yamamoto, H. Tomobe, Discussion Mining: Annotation-Based Knowledge Discovery from Real World Activities, in Advances in Multimedia Information Processing—PCM 2004, LNCS, Vol. 3331, pp. 522–531 (Springer, 2005)
W. Nakano, T. Kobayashi, Y. Katsuyama, S. Naoi, H. Yokota, Treatment of Laser Pointer and Speech Information in Lecture Scene Retrieval, in Proceedings of the 8th IEEE International Symposium on Multimedia 2006, pp. 927–932 (2006)
T. Nishida, Conversation quantization for conversational knowledge process. Int. J. Comput. Sci. Eng. 3(2), 134–144 (2007)
N. Roy, A. Mccallum, Toward Optimal Active Learning through Monte Carlo Estimation of Error Reduction, in Proceedings of the 18th International Conference on Machine Learning, pp. 441–448 (ICML 2001)
B. Settles, M. Craven, An Analysis of Active Learning Strategies for Sequence Labeling Tasks, in Proceedings of the Conference on Empirical Methods in Natural Language Processing (Association for Computational Linguistics, 2008)
B. Settles, Active Learning Literature Survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison (2010)
H. Shimodaira, Improving Predictive Inference under Covariate Shift by Weighting the Log-Likelihood Function. J. Stat. Plann. Infer. 90, 227–244 (2000)
M. Sugiyama, M. Kawanabe, Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation (The MIT Press, 2012)
T. Schultz, A. Waibel, M. Bett, F. Metze, Y. Pan, K. Ries, T. Schaaf, H. Soltau, W. Martin, H. Yu, K. Zechner, The ISL Meeting Room System, in Proceedings of the Workshop on Hands-Free Speech Communication (HSC-2001) (2001)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Nagao, K. (2019). Discussion Data Analytics. In: Artificial Intelligence Accelerates Human Learning. Springer, Singapore. https://doi.org/10.1007/978-981-13-6175-3_2
Download citation
DOI: https://doi.org/10.1007/978-981-13-6175-3_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6174-6
Online ISBN: 978-981-13-6175-3
eBook Packages: Computer ScienceComputer Science (R0)