A survey on construction and enhancement methods in service chatbots design

Peng, Zhenhui; Ma, Xiaojuan

doi:10.1007/s42486-019-00012-3

A survey on construction and enhancement methods in service chatbots design

Regular Paper
Published: 24 September 2019

Volume 1, pages 204–223, (2019)
Cite this article

CCF Transactions on Pervasive Computing and Interaction Aims and scope Submit manuscript

2583 Accesses
14 Citations
Explore all metrics

Abstract

Chatbots are being widely applied in many service industries to help schedule meetings, online shopping, restaurant reservations, customer care and so on. The key to the success of the service chatbots design is to provide satisfying responses to the given user’s requests. This survey aims to provide a comprehensive review of chatbots construction and enhancement methods. We first introduce major techniques for the three core design philosophies, which are rule-based, retrieval-based and generation-based methods, followed by a brief summary of the evaluation metrics. Then we present methods to enhance service chatbot’s capabilities with either an ensemble of multiple chatbots, collaborating with human workers or learning from users. Finally, in future directions we discuss the promising response generation models for chatbots using the recent progress in the transformer and contextual embeddings, as well as potential ways to construct a chatbot with personality to achieve a better user experience.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Azaria, A., Krishnamurthy, J., Mitchell, T.M.: Instructable intelligent personal agent. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI’16, pp. 2681–2689. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3016100.3016277
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR arXiv:abs/1409.0473 (2014)
Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009). https://doi.org/10.1561/2200000006
Article MathSciNet MATH Google Scholar
Bengio, Y., Ducharme, R., Vincent, P., Janvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)
MATH Google Scholar
Chai, J., Lin, J., Zadrozny, W., Ye, Y., Stys-Budzikowska, M., Horvath, V., Kambhatla, N., Wolf, C.: The role of a natural language conversational interface in online sales: a case study. Int. J. Speech Technol. 4(3), 285–295 (2001). https://doi.org/10.1023/A:1011316909641
Article MATH Google Scholar
Chen, H., Liu, X., Yin, D., Tang, J.: A survey on dialogue systems: recent advances and new frontiers. SIGKDD Explor. Newsl. 19(2), 25–35 (2017). https://doi.org/10.1145/3166054.3166058
Article Google Scholar
Chung, J., Çaglar Gülçehre, Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR arXiv:abs/1412.3555 (2014)
Colby, K.M.: Artificial Paranoia: A Computer Simulation of Paranoid Processes. Elsevier Science Inc., New York (1975)
Google Scholar
Cranshaw, J., Elwany, E., Newman, T., Kocielnik, R., Yu, B., Soni, S., Teevan, J., Monroy-Hernández, A.: Calendar.help: designing a workflow-based scheduling agent with humans in the loop. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 2382–2393. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025780
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. CoRR arXiv:abs/1810.04805 (2018)
Dialog flow: Retrieved: 2018-12-10. (2018). https://dialogflow.com/. Accessed 10 Dec 2018
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
MATH Google Scholar
Ferrucci, D.A., Brown, E.W., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A., Lally, A., Murdock, J.W., Nyberg, E., Prager, J.M., Schlaefer, N., Welty, C.A.: Building watson: an overview of the deepqa project. AI Mag. 31(3), 59–79 (2010)
Article Google Scholar
Ghazvininejad, M., Brockett, C., Chang, M.W., Dolan, W.B., Gao, J., tau Yih, W., Galley, M.: A knowledge-grounded neural conversation model. In: AAAI (2018)
Grudin, J., Jacques, R.: Chatbots, humbots, and the quest for artificial general intelligence. In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, CHI ’19, pp. 209:1–209:11. ACM, New York, NY, USA (2019). https://doi.org/10.1145/3290605.3300439
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
Article Google Scholar
Hu, B., Lu, Z., Li, H., Chen, Q.: Convolutional neural network architectures for matching natural language sentences. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 2042–2050. Curran Associates Inc, New York (2014)
Google Scholar
Hu, T., Xu, A., Liu, Z., You, Q., Guo, Y., Sinha, V., Luo, J., Akkiraju, R.: Touch your heart: a tone-aware chatbot for customer care on social media. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 415:1–415:12. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173989
Huang, T.H.K., Chang, J.C., Bigham, J.P.: Evorus: A crowd-powered conversational assistant built to automate itself over time. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 295:1–295:13. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173869
Ibm watson assistant: Retrieved: 2018-12-10. (2018). https://assistant-us-south.watsonplatform.net/us-south/b3a5bd9b-9ea9-4be8-9ec7-145f04f69453/home. Accessed 10 Dec 2018
Isbell Jr, C.L., Kearns, M., Kormann, D., Singh, S., Stone, P.: Cobot in lambdamoo: A social statistics agent. pp. 36–41 (2000)
Jain, M., Kota, R., Kumar, P., Patel, S.N.: Convey: Exploring the use of a context view for chatbots. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 468:1–468:6. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3174042
Ji, Z., Lu, Z., Li, H.: An information retrieval approach to short text conversation. CoRR arXiv: abs/1408.6988 (2014)
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: a survey. J. Artif. Int. Res. 4(1), 237–285 (1996)
Google Scholar
Li, J., Galley, M., Brockett, C., Spithourakis, G., Gao, J., Dolan, B.: A persona-based neural conversation model. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 994–1003. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/P16-1094
Li, T.J.J., Azaria, A., Myers, B.A.: Sugilite: creating multimodal smartphone automation by demonstration. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 6038–6049. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025483
Li, Y., Luo, X., Zheng, Y., Xu, P., Fu, H.: Sweepcanvas: sketch-based 3d prototyping on an rgb-d image. In: Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, pp. 387–399. ACM (2017)
Liao, Q.V., Mas-ud Hussain, M., Chandar, P., Davis, M., Khazaeni, Y., Crasso, M.P., Wang, D., Muller, M., Shami, N.S., Geyer, W.: All work and no play? In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 3:1–3:13. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173577
Liu, C.W., Lowe, R., Serban, I., Noseworthy, M., Charlin, L., Pineau, J.: How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2122–2132. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/D16-1230
Lowe, R., Pow, N., Serban, I., Pineau, J.: The ubuntu dialogue corpus: a large dataset for research in unstructured multi-turn dialogue systems. In: SIGDIAL Conference (2015)
Lu, Z., Li, H.: A deep architecture for matching short texts. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 26, pp. 1367–1375. Curran Associates Inc, New York (2013)
Google Scholar
Luo, X., Lin, Z., Wang, Y., Nie, Z.: Cochat: Enabling bot and human collaboration for task completion. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (2018)
Luong, M.T., V. Le, Q., Sutskever, I., Vinyals, O., Kaiser, L.: Multi-task sequence to sequence learning. In: Proceedings of ICLR, San Juan, Puerto Rico (2015)
Magazine, C.: Chatbot report 2018: Global trends and analysis (2018). Retrieved: 2018-12-10. https://chatbotsmagazine.com/chatbot-report-2018-global-trends-and-analysis-4d8bbe4d924b
Mauldin, M.L.: Chatterbots, tinymuds, and the turing test: entering the loebner prize competition. In: Proceedings of the Twelfth National Conference on Artificial Intelligence (Vol. 1), AAAI ’94, pp. 16–21. American Association for Artificial Intelligence, Menlo Park, CA, USA (1994). http://dl.acm.org/citation.cfm?id=199288.199285
Mesnil, G., Dauphin, Y., Yao, K., Bengio, Y., Deng, L., Hakkani-Tur, D., He, X., Heck, L., Tur, G., Yu, D., Zweig, G.: Using recurrent neural networks for slot filling in spoken language understanding. Trans. Audio Speech Lang. Proc. 23(3), 530–539 (2015)
Article Google Scholar
Microsoft: Cognitive services: language understanding (luis). Retrieved: 2018-12-10. https://www.luis.ai/home (2018)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, Vol. 2, NIPS’13, pp. 3111–3119. Curran Associates Inc., USA (2013). http://dl.acm.org/citation.cfm?id=2999792.2999959
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (2002). http://aclweb.org/anthology/P02-1040
Pennington, J., Socher, R., Manning, C.D.: Glove: Global vectors for word representation. In: EMNLP, vol. 14, pp. 1532–1543 (2014)
Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. In: Proc. of NAACL (2018)
Qiu, M., Li, F.L., Wang, S., Gao, X., Chen, Y., Zhao, W., Chen, H., Huang, J., Chu, W.: Alime chat: A sequence to sequence and rerank based chatbot engine. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 498–503. Association for Computational Linguistics (2017). https://doi.org/10.18653/v1/P17-2079
Qiu, M., Li, F.L., Wang, S., Gao, X., Chen, Y., Zhao, W., Chen, H., Huang, J., Chu, W.: Alime chat: a sequence to sequence and rerank based chatbot engine. In: ACL (2017)
Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language under- standing with unsupervised learning. In: Technical report, OpenAI (2018)
Ramesh, K., Ravishankaran, S., Joshi, A., Chandrasekaran, K.: A survey of design techniques for conversational agents. In: Kaushik, S., Gupta, D., Kharb, L., Chahal, D. (eds.) Information, Communication and Computing Technology, pp. 336–350. Springer Singapore, Singapore (2017)
Chapter Google Scholar
Ritter, A., Cherry, C., Dolan, W.B.: Data-driven response generation in social media. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 583–593. Association for Computational Linguistics, Stroudsburg, PA, USA (2011). http://dl.acm.org/citation.cfm?id=2145432.2145500
Schlesinger, A., O’Hara, K.P., Taylor, A.S.: Let’s talk about race: Identity, chatbots, and ai. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18, pp. 315:1–315:14. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3173574.3173889
Schuster, M., Paliwal, K.: Bidirectional recurrent neural networks. Trans. Signal Process. 45(11), 2673–2681 (1997). https://doi.org/10.1109/78.650093
Article Google Scholar
Serban, I., Sankar, C., Germain, M., Zhang, S., Lin, Z., Subramanian, S., Kim, T., Pieper, M., Chandar, A.P.S., Ke, N.R., Mudumba, S., de Brébisson, A., Sotelo, J., Suhubdy, D., Michalski, V., Nguyen, A., Pineau, J., Bengio, Y.: A deep reinforcement learning chatbot. CoRR arXiv:abs/1709.02349 (2017)
Serban, I.V., Sordoni, A., Bengio, Y., Courville, A., Pineau, J.: Building end-to-end dialogue systems using generative hierarchical neural network models. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI’16, pp. 3776–3783. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3016387.3016435
Shang, L., Lu, Z., Li, H.: Neural responding machine for short-text conversation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1577–1586. Association for Computational Linguistics (2015). https://doi.org/10.3115/v1/P15-1152
Shawar, B.A., Atwell, E.: Chatbots: are they really useful? LDV Forum 22, 29–49 (2007)
Google Scholar
Shum, Hy, He, Xd, Li, D.: From eliza to xiaoice: challenges and opportunities with social chatbots. Front. Inf. Technol. Electron. Eng. 19(1), 10–26 (2018). https://doi.org/10.1631/FITEE.1700826
Article Google Scholar
Song, Y., Li, C.T., Nie, J.Y., Zhang, M., Zhao, D., Yan, R.: An ensemble of retrieval-based and generation-based human–computer conversation systems. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18, pp. 4382–4388. International Joint Conferences on Artificial Intelligence Organization (2018). https://doi.org/10.24963/ijcai.2018/609
Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 28, pp. 2440–2448. Curran Associates Inc, New York (2015)
Google Scholar
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2, NIPS’14, pp. 3104–3112. MIT Press, Cambridge, MA, USA (2014). http://dl.acm.org/citation.cfm?id=2969033.2969173
Sutton, R.S., Barto, A.G.: Introduction to Reinforcement Learning, 1st edn. MIT Press, Cambridge (1998)
MATH Google Scholar
Turing, A.M.: Computing machinery and intelligence. Mind LIX(236), 433–460 (1950). https://doi.org/10.1093/mind/LIX.236.433
Article MathSciNet Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L.u., Polosukhin, I.: Attention is all you need. In: I. Guyon, U.V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, R. Garnett (eds.) Advances in Neural Information Processing Systems 30, pp. 5998–6008. Curran Associates, Inc., New York (2017)
Vinyals, O., Le, Q.V.: A neural conversational model. CoRR arXiv:abs/1506.05869 (2015)
WIKIPEDIA: Xiaoice. Retrieved: 2018-12-10. https://en.wikipedia.org/wiki/Xiaoice (2018)
Walker, M., Aberdeen, J., Boland, J., Bratt, E., Garofolo, J., Hirschman, L., Le, A., Lee, S., Narayanan, S., Papineni, K., Pellom, B., Polifroni, J., Potamianos, A., Prabhu, P., Rudnicky, A., Sanders, G., Seneff, S., Stallard, D., Whittaker, S.: Darpa communicator dialog travel planning systems: the June 2000 data collection. pp. 1371–1374 (2001)
Wallace, R.S.: The Anatomy of A.L.I.C.E, pp. 181–210. Springer Netherlands, Dordrecht (2009). https://doi.org/10.1007/978-1-4020-6710-5_13
Chapter Google Scholar
Weizenbaum, J.: Eliza—a computer program for the study of natural language communication between man and machine. Commun. ACM 9(1), 36–45 (1966). https://doi.org/10.1145/365153.365168
Article Google Scholar
Xu, A., Liu, Z., Guo, Y., Sinha, V., Akkiraju, R.: A new chatbot for customer service on social media. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, CHI ’17, pp. 3506–3510. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3025453.3025496
Yan, R., Song, Y., Wu, H.: Learning to respond with deep neural networks for retrieval-based human–computer conversation system. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’16, pp. 55–64. ACM, New York, NY, USA (2016). https://doi.org/10.1145/2911451.2911542
Ye, J., Chow, J.H., Chen, J., Zheng, Z.: Stochastic gradient boosted distributed decision trees. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM ’09, pp. 2061–2064. ACM, New York, NY, USA (2009). https://doi.org/10.1145/1645953.1646301
Young, S., Gašić, M., Thomson, B., Williams, J.D.: Pomdp-based statistical spoken dialog systems: a review. Proc. IEEE 101(5), 1160–1179 (2013). https://doi.org/10.1109/JPROC.2012.2225812
Article Google Scholar
Zhou, X., Dong, D., Wu, H., Zhao, S., Yu, D., Tian, H., Liu, X., Yan, R.: Multi-view response selection for human–computer conversation. In: EMNLP (2016)
Zhou, X., Li, L., Dong, D., Liu, Y., Chen, Y., Zhao, W.X., Yu, D., Wu, H.: Multi-turn response selection for chatbots with deep attention matching network. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1118–1127. Association for Computational Linguistics (2018). http://aclweb.org/anthology/P18-1103

Download references

Author information

Authors and Affiliations

Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
Zhenhui Peng & Xiaojuan Ma

Authors

Zhenhui Peng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaojuan Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhenhui Peng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Peng, Z., Ma, X. A survey on construction and enhancement methods in service chatbots design. CCF Trans. Pervasive Comp. Interact. 1, 204–223 (2019). https://doi.org/10.1007/s42486-019-00012-3

Download citation

Received: 27 May 2019
Accepted: 29 August 2019
Published: 24 September 2019
Issue Date: November 2019
DOI: https://doi.org/10.1007/s42486-019-00012-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey on construction and enhancement methods in service chatbots design

Abstract

Access this article

Similar content being viewed by others

Chatbots: A Survey of the Technology

Chatbot Dimensions that Matter: Lessons from the Trenches

An Overview of Chatbot Technology

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A survey on construction and enhancement methods in service chatbots design

Abstract

Access this article

Similar content being viewed by others

Chatbots: A Survey of the Technology

Chatbot Dimensions that Matter: Lessons from the Trenches

An Overview of Chatbot Technology

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation