Skip to main content
Log in

A survey on construction and enhancement methods in service chatbots design

  • Regular Paper
  • Published:
CCF Transactions on Pervasive Computing and Interaction Aims and scope Submit manuscript

Abstract

Chatbots are being widely applied in many service industries to help schedule meetings, online shopping, restaurant reservations, customer care and so on. The key to the success of the service chatbots design is to provide satisfying responses to the given user’s requests. This survey aims to provide a comprehensive review of chatbots construction and enhancement methods. We first introduce major techniques for the three core design philosophies, which are rule-based, retrieval-based and generation-based methods, followed by a brief summary of the evaluation metrics. Then we present methods to enhance service chatbot’s capabilities with either an ensemble of multiple chatbots, collaborating with human workers or learning from users. Finally, in future directions we discuss the promising response generation models for chatbots using the recent progress in the transformer and contextual embeddings, as well as potential ways to construct a chatbot with personality to achieve a better user experience.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Azaria, A., Krishnamurthy, J., Mitchell, T.M.: Instructable intelligent personal agent. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI’16, pp. 2681–2689. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3016100.3016277

  • Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR arXiv:abs/1409.0473 (2014)

  • Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009). https://doi.org/10.1561/2200000006

    Article  MathSciNet  MATH  Google Scholar 

  • Bengio, Y., Ducharme, R., Vincent, P., Janvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)

    MATH  Google Scholar 

  • Chai, J., Lin, J., Zadrozny, W., Ye, Y., Stys-Budzikowska, M., Horvath, V., Kambhatla, N., Wolf, C.: The role of a natural language conversational interface in online sales: a case study. Int. J. Speech Technol. 4(3), 285–295 (2001). https://doi.org/10.1023/A:1011316909641

    Article  MATH  Google Scholar 

  • Chen, H., Liu, X., Yin, D., Tang, J.: A survey on dialogue systems: recent advances and new frontiers. SIGKDD Explor. Newsl. 19(2), 25–35 (2017). https://doi.org/10.1145/3166054.3166058

    Article  Google Scholar 

  • Chung, J., Çaglar Gülçehre, Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR arXiv:abs/1412.3555 (2014)

  • Colby, K.M.: Artificial Paranoia: A Computer Simulation of Paranoid Processes. Elsevier Science Inc., New York (1975)

    Google Scholar 

  • Cranshaw, J., Elwany, E., Newman, T., Kocielnik, R., Yu, B., Soni, S., Teevan, J., Monroy-Hernández, A.: Calendar.help: designing a workflow-based scheduling agent with humans in the loop. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 2382–2393. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025780

  • Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. CoRR arXiv:abs/1810.04805 (2018)

  • Dialog flow: Retrieved: 2018-12-10. (2018). https://dialogflow.com/. Accessed 10 Dec 2018

  • Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)

    MATH  Google Scholar 

  • Ferrucci, D.A., Brown, E.W., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A., Lally, A., Murdock, J.W., Nyberg, E., Prager, J.M., Schlaefer, N., Welty, C.A.: Building watson: an overview of the deepqa project. AI Mag. 31(3), 59–79 (2010)

    Article  Google Scholar 

  • Ghazvininejad, M., Brockett, C., Chang, M.W., Dolan, W.B., Gao, J., tau Yih, W., Galley, M.: A knowledge-grounded neural conversation model. In: AAAI (2018)

  • Grudin, J., Jacques, R.: Chatbots, humbots, and the quest for artificial general intelligence. In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, CHI ’19, pp. 209:1–209:11. ACM, New York, NY, USA (2019). https://doi.org/10.1145/3290605.3300439

  • Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735

    Article  Google Scholar 

  • Hu, B., Lu, Z., Li, H., Chen, Q.: Convolutional neural network architectures for matching natural language sentences. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 2042–2050. Curran Associates Inc, New York (2014)

    Google Scholar 

  • Hu, T., Xu, A., Liu, Z., You, Q., Guo, Y., Sinha, V., Luo, J., Akkiraju, R.: Touch your heart: a tone-aware chatbot for customer care on social media. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 415:1–415:12. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173989

  • Huang, T.H.K., Chang, J.C., Bigham, J.P.: Evorus: A crowd-powered conversational assistant built to automate itself over time. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 295:1–295:13. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173869

  • Ibm watson assistant: Retrieved: 2018-12-10. (2018). https://assistant-us-south.watsonplatform.net/us-south/b3a5bd9b-9ea9-4be8-9ec7-145f04f69453/home. Accessed 10 Dec 2018

  • Isbell Jr, C.L., Kearns, M., Kormann, D., Singh, S., Stone, P.: Cobot in lambdamoo: A social statistics agent. pp. 36–41 (2000)

  • Jain, M., Kota, R., Kumar, P., Patel, S.N.: Convey: Exploring the use of a context view for chatbots. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 468:1–468:6. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3174042

  • Ji, Z., Lu, Z., Li, H.: An information retrieval approach to short text conversation. CoRR arXiv: abs/1408.6988 (2014)

  • Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: a survey. J. Artif. Int. Res. 4(1), 237–285 (1996)

    Google Scholar 

  • Li, J., Galley, M., Brockett, C., Spithourakis, G., Gao, J., Dolan, B.: A persona-based neural conversation model. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 994–1003. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/P16-1094

  • Li, T.J.J., Azaria, A., Myers, B.A.: Sugilite: creating multimodal smartphone automation by demonstration. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 6038–6049. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025483

  • Li, Y., Luo, X., Zheng, Y., Xu, P., Fu, H.: Sweepcanvas: sketch-based 3d prototyping on an rgb-d image. In: Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, pp. 387–399. ACM (2017)

  • Liao, Q.V., Mas-ud Hussain, M., Chandar, P., Davis, M., Khazaeni, Y., Crasso, M.P., Wang, D., Muller, M., Shami, N.S., Geyer, W.: All work and no play? In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 3:1–3:13. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173577

  • Liu, C.W., Lowe, R., Serban, I., Noseworthy, M., Charlin, L., Pineau, J.: How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2122–2132. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/D16-1230

  • Lowe, R., Pow, N., Serban, I., Pineau, J.: The ubuntu dialogue corpus: a large dataset for research in unstructured multi-turn dialogue systems. In: SIGDIAL Conference (2015)

  • Lu, Z., Li, H.: A deep architecture for matching short texts. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 26, pp. 1367–1375. Curran Associates Inc, New York (2013)

    Google Scholar 

  • Luo, X., Lin, Z., Wang, Y., Nie, Z.: Cochat: Enabling bot and human collaboration for task completion. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (2018)

  • Luong, M.T., V. Le, Q., Sutskever, I., Vinyals, O., Kaiser, L.: Multi-task sequence to sequence learning. In: Proceedings of ICLR, San Juan, Puerto Rico (2015)

  • Magazine, C.: Chatbot report 2018: Global trends and analysis (2018). Retrieved: 2018-12-10. https://chatbotsmagazine.com/chatbot-report-2018-global-trends-and-analysis-4d8bbe4d924b

  • Mauldin, M.L.: Chatterbots, tinymuds, and the turing test: entering the loebner prize competition. In: Proceedings of the Twelfth National Conference on Artificial Intelligence (Vol. 1), AAAI ’94, pp. 16–21. American Association for Artificial Intelligence, Menlo Park, CA, USA (1994). http://dl.acm.org/citation.cfm?id=199288.199285

  • Mesnil, G., Dauphin, Y., Yao, K., Bengio, Y., Deng, L., Hakkani-Tur, D., He, X., Heck, L., Tur, G., Yu, D., Zweig, G.: Using recurrent neural networks for slot filling in spoken language understanding. Trans. Audio Speech Lang. Proc. 23(3), 530–539 (2015)

    Article  Google Scholar 

  • Microsoft: Cognitive services: language understanding (luis). Retrieved: 2018-12-10. https://www.luis.ai/home (2018)

  • Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, Vol. 2, NIPS’13, pp. 3111–3119. Curran Associates Inc., USA (2013). http://dl.acm.org/citation.cfm?id=2999792.2999959

  • Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (2002). http://aclweb.org/anthology/P02-1040

  • Pennington, J., Socher, R., Manning, C.D.: Glove: Global vectors for word representation. In: EMNLP, vol. 14, pp. 1532–1543 (2014)

  • Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. In: Proc. of NAACL (2018)

  • Qiu, M., Li, F.L., Wang, S., Gao, X., Chen, Y., Zhao, W., Chen, H., Huang, J., Chu, W.: Alime chat: A sequence to sequence and rerank based chatbot engine. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 498–503. Association for Computational Linguistics (2017). https://doi.org/10.18653/v1/P17-2079

  • Qiu, M., Li, F.L., Wang, S., Gao, X., Chen, Y., Zhao, W., Chen, H., Huang, J., Chu, W.: Alime chat: a sequence to sequence and rerank based chatbot engine. In: ACL (2017)

  • Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language under- standing with unsupervised learning. In: Technical report, OpenAI (2018)

  • Ramesh, K., Ravishankaran, S., Joshi, A., Chandrasekaran, K.: A survey of design techniques for conversational agents. In: Kaushik, S., Gupta, D., Kharb, L., Chahal, D. (eds.) Information, Communication and Computing Technology, pp. 336–350. Springer Singapore, Singapore (2017)

    Chapter  Google Scholar 

  • Ritter, A., Cherry, C., Dolan, W.B.: Data-driven response generation in social media. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 583–593. Association for Computational Linguistics, Stroudsburg, PA, USA (2011). http://dl.acm.org/citation.cfm?id=2145432.2145500

  • Schlesinger, A., O’Hara, K.P., Taylor, A.S.: Let’s talk about race: Identity, chatbots, and ai. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 315:1–315:14. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173889

  • Schuster, M., Paliwal, K.: Bidirectional recurrent neural networks. Trans. Signal Process. 45(11), 2673–2681 (1997). https://doi.org/10.1109/78.650093

    Article  Google Scholar 

  • Serban, I., Sankar, C., Germain, M., Zhang, S., Lin, Z., Subramanian, S., Kim, T., Pieper, M., Chandar, A.P.S., Ke, N.R., Mudumba, S., de Brébisson, A., Sotelo, J., Suhubdy, D., Michalski, V., Nguyen, A., Pineau, J., Bengio, Y.: A deep reinforcement learning chatbot. CoRR arXiv:abs/1709.02349 (2017)

  • Serban, I.V., Sordoni, A., Bengio, Y., Courville, A., Pineau, J.: Building end-to-end dialogue systems using generative hierarchical neural network models. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI’16, pp. 3776–3783. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3016387.3016435

  • Shang, L., Lu, Z., Li, H.: Neural responding machine for short-text conversation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1577–1586. Association for Computational Linguistics (2015). https://doi.org/10.3115/v1/P15-1152

  • Shawar, B.A., Atwell, E.: Chatbots: are they really useful? LDV Forum 22, 29–49 (2007)

    Google Scholar 

  • Shum, Hy, He, Xd, Li, D.: From eliza to xiaoice: challenges and opportunities with social chatbots. Front. Inf. Technol. Electron. Eng. 19(1), 10–26 (2018). https://doi.org/10.1631/FITEE.1700826

    Article  Google Scholar 

  • Song, Y., Li, C.T., Nie, J.Y., Zhang, M., Zhao, D., Yan, R.: An ensemble of retrieval-based and generation-based human–computer conversation systems. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18, pp. 4382–4388. International Joint Conferences on Artificial Intelligence Organization (2018). https://doi.org/10.24963/ijcai.2018/609

  • Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 28, pp. 2440–2448. Curran Associates Inc, New York (2015)

    Google Scholar 

  • Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2, NIPS’14, pp. 3104–3112. MIT Press, Cambridge, MA, USA (2014). http://dl.acm.org/citation.cfm?id=2969033.2969173

  • Sutton, R.S., Barto, A.G.: Introduction to Reinforcement Learning, 1st edn. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  • Turing, A.M.: Computing machinery and intelligence. Mind LIX(236), 433–460 (1950). https://doi.org/10.1093/mind/LIX.236.433

    Article  MathSciNet  Google Scholar 

  • Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L.u., Polosukhin, I.: Attention is all you need. In: I. Guyon, U.V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, R. Garnett (eds.) Advances in Neural Information Processing Systems 30, pp. 5998–6008. Curran Associates, Inc., New York (2017)

  • Vinyals, O., Le, Q.V.: A neural conversational model. CoRR arXiv:abs/1506.05869 (2015)

  • WIKIPEDIA: Xiaoice. Retrieved: 2018-12-10. https://en.wikipedia.org/wiki/Xiaoice (2018)

  • Walker, M., Aberdeen, J., Boland, J., Bratt, E., Garofolo, J., Hirschman, L., Le, A., Lee, S., Narayanan, S., Papineni, K., Pellom, B., Polifroni, J., Potamianos, A., Prabhu, P., Rudnicky, A., Sanders, G., Seneff, S., Stallard, D., Whittaker, S.: Darpa communicator dialog travel planning systems: the June 2000 data collection. pp. 1371–1374 (2001)

  • Wallace, R.S.: The Anatomy of A.L.I.C.E, pp. 181–210. Springer Netherlands, Dordrecht (2009). https://doi.org/10.1007/978-1-4020-6710-5_13

    Chapter  Google Scholar 

  • Weizenbaum, J.: Eliza—a computer program for the study of natural language communication between man and machine. Commun. ACM 9(1), 36–45 (1966). https://doi.org/10.1145/365153.365168

    Article  Google Scholar 

  • Xu, A., Liu, Z., Guo, Y., Sinha, V., Akkiraju, R.: A new chatbot for customer service on social media. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 3506–3510. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025496

  • Yan, R., Song, Y., Wu, H.: Learning to respond with deep neural networks for retrieval-based human–computer conversation system. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’16, pp. 55–64. ACM, New York, NY, USA (2016). https://doi.org/10.1145/2911451.2911542

  • Ye, J., Chow, J.H., Chen, J., Zheng, Z.: Stochastic gradient boosted distributed decision trees. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM ’09, pp. 2061–2064. ACM, New York, NY, USA (2009). https://doi.org/10.1145/1645953.1646301

  • Young, S., Gašić, M., Thomson, B., Williams, J.D.: Pomdp-based statistical spoken dialog systems: a review. Proc. IEEE 101(5), 1160–1179 (2013). https://doi.org/10.1109/JPROC.2012.2225812

    Article  Google Scholar 

  • Zhou, X., Dong, D., Wu, H., Zhao, S., Yu, D., Tian, H., Liu, X., Yan, R.: Multi-view response selection for human–computer conversation. In: EMNLP (2016)

  • Zhou, X., Li, L., Dong, D., Liu, Y., Chen, Y., Zhao, W.X., Yu, D., Wu, H.: Multi-turn response selection for chatbots with deep attention matching network. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1118–1127. Association for Computational Linguistics (2018). http://aclweb.org/anthology/P18-1103

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenhui Peng.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Peng, Z., Ma, X. A survey on construction and enhancement methods in service chatbots design. CCF Trans. Pervasive Comp. Interact. 1, 204–223 (2019). https://doi.org/10.1007/s42486-019-00012-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42486-019-00012-3

Keywords

Navigation