Supporting Creation of FAQ Dataset for E-Learning Chatbot

  • Conference paper
Intelligent Decision Technologies 2019

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 142)

Abstract

Many universities now provide e-learning systems to support classes. Although such a system is an effective and efficient learning environment, it usually lacks a dynamic user support system. A chatbot is a good choice for supporting dynamic Q&A; however, it is difficult to collect the large volume of Q&A data, or the high-quality dataset, required to train a chatbot model to high accuracy. In this paper, we propose a novel framework for supporting dataset creation. The framework provides two recommendation algorithms: one for creating new questions and one for aggregating semantically similar answers. We evaluated the framework and confirmed that it can improve the quality of an FAQ dataset.
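The abstract does not say how semantically similar answers are detected, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes a simple bag-of-words cosine similarity and a greedy grouping rule; the function names, the sample answers, and the threshold of 0.6 are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def group_similar_answers(answers, threshold=0.6):
    """Greedily group answers by similarity (hypothetical rule).

    Each answer joins the first group whose representative (its first
    member) is at least `threshold` similar; otherwise it starts a new
    group. Returns a list of groups, each a list of answer indices.
    """
    vectors = [Counter(tokenize(a)) for a in answers]
    groups = []
    for i, vec in enumerate(vectors):
        for group in groups:
            if cosine_similarity(vectors[group[0]], vec) >= threshold:
                group.append(i)
                break
        else:
            # No existing group was similar enough.
            groups.append([i])
    return groups


# Made-up sample answers, not taken from the published dataset.
answers = [
    "Reset your password from the login page.",
    "You can reset your password on the login page.",
    "Submit the report as a PDF file.",
]
groups = group_similar_answers(answers)
```

Groups with more than one member could then be shown to a dataset curator as candidates for merging into a single answer. A word-overlap measure like this misses paraphrases that share few words, so a real system would likely need a stronger semantic representation.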

Notes

  1. For example, the Facebook bot on Messenger: https://developers.facebook.com/videos/f8-2016/introducing-bots-on-messenger/.

  2. The proposed dataset is available on a public repository server: https://doi.org/10.5281/zenodo.2557319.

Acknowledgements

We thank Mr. Okamura (Tokyo Metropolitan University) and Mr. Kouda, Mr. Suzuki, and Mr. Toya (Alpha Computer Ltd) for their support in collecting and initially organizing the Q&A data. This work was supported by JSPS KAKENHI Grant Number 18H01057.

Author information

Corresponding author

Correspondence to Yasunobu Sumikawa.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sumikawa, Y., Fujiyoshi, M., Hatakeyama, H., Nagai, M. (2020). Supporting Creation of FAQ Dataset for E-Learning Chatbot. In: Czarnowski, I., Howlett, R., Jain, L. (eds) Intelligent Decision Technologies 2019. Smart Innovation, Systems and Technologies, vol 142. Springer, Singapore. https://doi.org/10.1007/978-981-13-8311-3_1
