Big Data for Conversational Interfaces: Current Opportunities and Prospects

Griol, David; Molina, Jose M.; Callejas, Zoraida

doi:10.1007/978-3-319-45498-6_6

David Griol³,
Jose M. Molina³ &
Zoraida Callejas⁴

6118 Accesses

Abstract

As conversational technologies develop, we demand more from them. For instance, we want our conversational assistants to be able to solve our queries in multiple domains, to manage information from different usually unstructured sources, to be able to perform a variety of tasks, and understand open conversational language. However, developing the resources necessary to develop systems with such capabilities demands much time and effort, as for each domain, task or language, data must be collected, annotated following an schema that is usually not portable, the models must be trained over the annotated data, and their accuracy must be evaluated. In recent years, there has been a growing interest in investigating alternatives to manual effort that allow exploiting automatically the huge amount of resources available in the web. In this chapter we describe the main initiatives to extract, process and contextualize information from these rich and heterogeneous sources for the various tasks involved in dialog systems, including speech processing, natural language understanding and dialog management.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Abdennadher S, Aly M, Bhler D, Minker W, Pittermann J (2007) Becam tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management. In: Proceedings of the international conference on spoken language processing (Interspeech’2007), pp 946–949
Google Scholar
Agerri R, Artola X, Beloki Z, Rigau G, Soroa A (2015) Big data for natural language processing: a streaming approach. Knowl-Based Syst 79:36–42
Article Google Scholar
Bahl L, Jelinek F, Mercer R (1990) A maximum likelihood approach to continuous speech recognition. Readings in Speech recognition, pp 308–319
Google Scholar
Baimbetov Y, Khalil I, Steinbauer M, Anderst-Kotsis G (2015) Using Big Data for emotionally intelligent mobile services through multi-modal emotion recognition. Springer, pp 127–138
Google Scholar
Batliner A, Burkhardt F, van Ballegooy M, Noth E (2006) A taxonomy of applications that utilize emotional awareness. In: Proceedings of 1st international language technologies conference (IS-LTC 06), pp 246–250
Google Scholar
Bickmore T, Giorgino T (2004) Some novel aspects of health communication from a dialogue systems perspective. In: Proceedings of AAAI fall symposium on dialogue systems for health communication, pp 275–291
Google Scholar
Bos J, Klein E, Lemon O, Oka T (2003) DIPPER: description and formalisation of an information-state update dialogue system architecture. In: Proceedings of the SIGdial, pp 115–124
Google Scholar
Chung G (2004) Developing a flexible spoken dialog system using simulation. In: Proceedings of ACL, pp 63–70
Google Scholar
Cohn DA, Atlas L, Ladner R (1994) Improving generalization with active learning. Mach Learn 15(2):201–221
Google Scholar
Cuayhuitl H, Renals S, Lemon O, Shimodaira H (2005) Human-computer dialogue simulation using hidden Markov models. In: Proceedings of ASRU, pp 290–295
Google Scholar
Dutoit T (1996) An introduction to text-to-speech synthesis. Kluwer Academic Publishers
Google Scholar
Eckert W, Levin E, Pieraccini R (1997) User modeling for spoken dialogue system evaluation. In: Proceedings of ASRU, pp 80–87
Google Scholar
Eckert W, Levin E, Pieraccini R (1998) Automatic evaluation of spoken dialogue systems. Technical report, TR98.9.1, ATT Labs Research
Google Scholar
Esteve Y, Raymond C, Bechet F, Mori RD (2003) Conceptual decoding for spoken dialog systems. In: Proceedings of European conference on speech communications and technology (Eurospeech’03). vol 1, pp 617–620
Google Scholar
Fabbrizio GD, Tur G, Hakkani-Tr D, Gilbert M, Renger B, Gibbon D, Liu Z, Shahraray B (2008) Bootstrapping spoken dialogue systems by exploiting reusable libraries. Nat Lang Eng 14(3):313–335
Article Google Scholar
Fraser M, Gilbert G (1991) Simulating speech systems. Comput Speech Lang 5:81–99
Article Google Scholar
Georgila K, Henderson J, Lemon O (2005) Learning user simulations for information state update dialogue systems. In: Proceedings of Eurospeech’05, pp. 893–896
Google Scholar
Gibbon D, Mertins I (Eds.), R.M.: Handbook of multimodal and spoken dialogue systems: resources, terminology and product evaluation. Kluwer Academic Publishers (2000)
Google Scholar
Gudivada VN, Rao D, Raghavan VV (2015) Big data driven natural language processing research and applications, vol 33. Elsevier, pp 203–238
Google Scholar
He Y, Young S (2003) A data-driven spoken language understanding system. In: Proceedings of IEEE Automatic speech recognition and understanding workshop (ASRU’03), pp 583–588
Google Scholar
Heeman P (2007) Combining reinforcement learning with information-state update rules. In: Proceedings of the 8th Annual conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07), pp 268–275
Google Scholar
Heinroth T, Minker W (2012) Introducing spoken dialogue systems into intelligent environments. Kluwer Academic Publishers, Springer
Google Scholar
Hempel T (2008) Usability of speech dialog systems: listening to the target audience. Springer
Google Scholar
Hinton G, Deng L, Yu D, Dahl G, Mohamed A, Jaitly N, Senior A, Vanhoucke V, Nguyen P, Sainath T, Kingsbury B (2012) Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Process Mag 29(6):82–97
Article Google Scholar
Hori C, Ohtake K, Misu T, Kashioka H, Nakamura S (2009) Recent advances in WFST-based dialog system. In: Proceedings of the international conference on spoken language processing (Interspeech’2009), pp 268–271
Google Scholar
Hoxha J, Weng C (2016) Leveraging dialog systems research to assist biomedical researchers interrogation of big clinical data. J Biomed Inf 61:176–184
Article Google Scholar
Hurtado L, Planells J, Segarra E, Sanchis E, Griol D (2010) A stochastic finite-state transducer approach to spoken dialog management. In: Proceedings of the international conference on spoken language processing (Interspeech’2010), pp 3002–3005
Google Scholar
Jelinek F (1990) Self-organized language modeling for speech recognition. Readings in Speech recognition, pp 450–506
Google Scholar
Jelinek F, Lafferty MR (1992) Basic methods of probabilistic context free grammars. Springer, pp 345–360
Google Scholar
Jung S, Lee C, Kim K, Lee D, Lee G (2011) Hybrid user intention modeling to diversify dialog simulations. Comput Speech Lang 25(2):307–326
Article Google Scholar
Lane I, Ueno S, Kawahara T (2004) Cooperative dialogue planning with user and situation models via example-based training. In: Proceedings of workshop on man-machine symbiotic systems, pp 2837–2840, Kyoto, Japan
Google Scholar
Laroche R, Putois G, Bretier P, Young S, Lemon O (2008) Requirements analysis and theory for statistical learning approaches in automaton-based dialogue management. Technical report, School of Informatics, Edinburgh University, Edinburgh, UK
Google Scholar
Lee C, Jung S, Kim K, Lee GG (2010) Hybrid approach to robust dialog management using agenda and dialog examples. Comput Speech Lang 24(4):609–631
Article Google Scholar
Lemon O (2011) Learning what to say and how to say it: joint optimisation of spoken dialogue management and natural language generation. Comput Speech Lang 25(2):210–221
Article Google Scholar
Lemon O, Pietquin O (2012) Data-Driven methods for adaptive spoken dialogue systems. Computational learning for conversational interfaces. Springer, Berlin
Book Google Scholar
Lemon O, Georgila K, Henderson J (2006) Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: the TALK TownInfo evaluation. In: Proceedings of IEEE-ACL workshop on spoken language technology (SLT’06), pp 178–181
Google Scholar
Levin E, Pieraccini R (1995) Concept-based spontaneous speech understanding system. In: Proceedings of European conference on speech communications and technology (Eurospeech’95). pp. 555–558 (1995)
Google Scholar
Levin E, Pieraccini R, Eckert W (2000) A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans Speech Audio Process 8(1):11–23
Article Google Scholar
Lin B, Lee L (2001) Computer aided analysis and design for spoken dialogue systems based on quantitative simulations. IEEE Trans Speech Audio Process 9(5):534–548
Article Google Scholar
Litman D, Forbes-Riley K (2006) Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors. Speech Commun 48(5):559–590
Article Google Scholar
Liu Y, Shriberg E (2005) Does active learning help automatic dialog act tagging in meeting data. In: Proceedings of the international conference on spoken language processing (Interspeech’2005), pp 2777–2780, Lisbon, Portugal
Google Scholar
López V, Eisman E, Castro J, Zurita J (2011) A case based reasoning model for multilingual language generation in dialogues. Expert Syst Appl 39(8):7330–7337
Article Google Scholar
López-Cózar R, Callejas Z, McTear M (2006) Testing the performance of spoken dialogue systems by means of an artificially simulated user. Artif Intell Rev 26:291–323
Article Google Scholar
López-Cózar R, la Torre AD, Segura J, Rubio A, Sánchez V (2003) Assessment of dialogue systems by means of a new simulation technique. Speech Commun 40(3):387–407
Article Google Scholar
López-Cózar R, Callejas Z (2008) ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information. Comput Speech Lang 50(8–9):745–766
Google Scholar
López-Cózar R, Callejas Z, Griol D (2010) ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information. Knowl-Based Syst 23(5):471–485
Article Google Scholar
Mayer-Schonberger V (2003) Big data: a revolution that will transform how we live, work, and think. Eamon Dolan-Houghton Mifflin Harcourt
Google Scholar
McTear MF, Callejas Z, Griol D (2016) The conversational interface. Springer
Google Scholar
Meng HH, Wai C, Pieraccini R (2003) The use of belief networks for mixed-initiative dialog modeling. IEEE Trans Speech Audio Process 11(6):757–773
Article Google Scholar
Minker W (1999) Design considerations for knowledge source representations of a stochastically-based natural language understanding component. Speech Commun 28(2):141–154
Article Google Scholar
Minker W, Waibel A, Mariani J (1999) Stochastically-based semantic analysis. Kluwer Academic Publishers, Dordrecht (Holland)
Book Google Scholar
Mller S, Englert R, Engelbrecht K, Hafner V, Jameson A, Oulasvirta A, Raake A, Reithinger N (2006) MeMo: towards automatic usability evaluation of spoken dialogue services by user error simulations. In: Proceedings of the Interspeech, pp 1786–1789
Google Scholar
Najafabadi M, Villanustre F, Khoshgoftaar T, Seliya N, WaldEmail R, Muharemagic E (2015) Deep learning applications and challenges in big data analytics. J Big Data 2(1)
Google Scholar
Oh AH, Rudnicky AI (2000) Stochastic language generation for spoken dialogue systems. In: Proceedings of ANLP/NAACL workshop on conversational systems, pp 27–32
Google Scholar
O’Shaughnessy D (2008) Automatic speech recognition: history, methods and challenges. Pattern Recogn 41(10):2965–2979
Article Google Scholar
O’Shea J, Bandar Z, Crockett K (2012) A multi-classifier approach to dialogue act classification using function words. Lecture notes in computer science, vol 7270, pp 119–143
Google Scholar
Paek T, Horvitz E (2000) Conversation as action under uncertainty. In: Proceedings of the 16th conference on uncertainty in artificial intelligence, pp 455–464
Google Scholar
Paek T, Pieraccini R (2008) Automating spoken dialogue management design using machine learning: an industry perspective. Speech Commun 50(8–9):716–729
Article Google Scholar
Pieraccini R (2012) The voice in the machine: building computers that understand speech. MIT Press
Google Scholar
Planells J, Hurtado L, Sanchis E, Segarra E (2012) An online generated transducer to increase dialog manager coverage. In: Proceedings of the international conference on spoken language processing (Interspeech’2012)
Google Scholar
Rabiner L, Juang B, Lee C (1996) An overview of automatic speech recognition. Kluwer Academic Publishers, pp 1–30
Google Scholar
Rojas-Barahona L, Giorgino T (2009) Adaptable dialog architecture and runtime engine (adarte): a framework for rapid prototyping of health dialog systems. Int J Med Inf 78:56–68
Article Google Scholar
Roy N, Pineau J, Thrun S (2000) Spoken dialogue management using probabilistic reasoning. In: Proceedings of the 38th Annual meeting of the association for computational linguistics (ACL’00), pp 93–100
Google Scholar
Schatzmann J, Thomson B, Weilhammer K, Ye H, Young S (2007) Agenda-based user simulation for bootstrapping a POMDP dialogue system. In: Proceedings of HLT/NAACL, pp 149–152
Google Scholar
Schatzmann J, Thomson B, Young S (2007) Statistical user simulation with a hidden agenda. In: Proceedings of SIGdial, pp 273–282
Google Scholar
Schatzmann J, Weilhammer K, Stuttle M, Young S (2006) A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowl Eng Rev 21(2):97–126
Article Google Scholar
Segarra E et al (2002) Extracting semantic information through automatic learning techniques. Int J Pattern Recogn Artif Intell 16(3):301–307
Article Google Scholar
Seide F, Li G, Yu D (2011) Conversational speech transcription using context-dependent deep neural networks. In: Proceedings of the 12th annual conference of the international speech communication association (InterSpeech 2011), pp 437–440. Florence, Italy
Google Scholar
Shamim-Hossain M, Muhammad G, Alhamid MF, Song B, Al-Mutib K (2016) Audio-visual emotion recognition using big data towards 5G. Mobile Netw Appl 1:1–11
Article Google Scholar
Singh S, Kearns M, Litman D, Walker M (1999) Reinforcement learning for spoken dialogue systems. In: Proceedings of neural information processing systems (NIPS’99), pp 956–962
Google Scholar
Singh S, Litman D, Kearns M, Walker M (2002) Optimizing dialogue management with reinforcement leaning: experiments with the NJFun system. J Artif Intell 16:105–133
Google Scholar
Suendermann D, Pieraccini R (2012) One year of contender: what have we learned about assessing and tuning industrial spoken dialog systems? In: Proceedings of NAACL-HLT workshop on future directions and needs in the spoken dialog community: tools and data (SDCTD’12), pp 45–48
Google Scholar
Thomson B, Schatzmann J, Weilhammer K, Ye H, Young S (2007) Training a real-world POMDP-based Dialog System. In: Proceedings of NAACL-HLT-Dialog’07 workshop on bridging the gap: academic and industrial research in dialog technologies, pp 9–16
Google Scholar
Torres F, Sanchis E, Segarra E (2003) Development of a stochastic dialog manager driven by semantics. In: Proceedings of European conference on speech communications and technology (Eurospeech’03), pp 605–608
Google Scholar
Torres F, Sanchis E, Segarra E (2008) User simulation in a stochastic dialog system. Comput Speech Lang 22:230–255
Article Google Scholar
Torres F, Sanchis E, Segarra E (2008) User simulation in a stochastic dialog system. Comput Speech Lang 22(3):230–255
Article Google Scholar
Traum D, Larsson S (2003) The information state approach to dialogue management. Kluwer, pp 325–353
Google Scholar
Tsilfidis A, Mporas I, Mourjopoulos J, Fakotakis N (2013) Automatic speech recognition performance in different room acoustic environments with and without dereverberation preprocessing. Comput Speech Lang 27(1):380–395
Article Google Scholar
Venkataraman A, Stolcke A, Shriberg E (2002) Automatic dialog act labeling with minimal supervision. In: Proceedings of the 9th Australian international conference on speech science & technology
Google Scholar
Vipperla R, Wolters M, Renals S (2012) Spoken dialogue interfaces for older people. IOS Press, pp 118–137
Google Scholar
Wilks Y, Catizone R, Worgan S, Turunen M (2011) Some background on dialogue management and conversational speech for dialogue systems. Comput Speech Lang 25:128–139
Article Google Scholar
Williams J, Poupart P, Young S (2006) Partially Observable Markov decision processes with continuous observations for dialogue management. Springer, pp 191–217
Google Scholar
Williams J, Young S (2007) Partially observable Markov decision processes for spoken dialog systems. Comput Speech Lang 21(2):393–422
Article Google Scholar
Williams J (2009) The best of both worlds: unifying conventional dialog systems and pomdps. In: Proceedings of Interspeech, pp 1173–1176
Google Scholar
Wu WL, Lu RZ, Duan JY, Liu H, Gao F, Chen YQ (2010) Spoken language understanding using weakly supervised learning. Comput Speech Lang 24(2):358–382
Article Google Scholar
Young S (2002) The statistical approach to the design of spoken dialogue systems. Technical report, CUED/F-INFENG/TR.433, Cambridge University Engineering Department, Cambridge, UK
Google Scholar
Young S, Gasic M, Thomson B, Williams J (2013) Pomdp-based statistical spoken dialogue systems: a review. In: Proceedings of the IEEE, pp 1–18, Montreal, Canada
Google Scholar
Young S, Williams J, Schatzmann J, Stuttle M, Weilhammer K (2005) The hidden information state approach to dialogue management. Technical report, Department of Engineering, University of Cambridge, Cambridge, UK
Google Scholar
Young S, Schatzmann J, Weilhammer K, Ye H (2007) The hidden information state approach to dialogue management. In: Proceedings of the 32nd IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 149–152
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Carlos III University of Madrid, Avda, de la Universidad, 30, 28911, Leganés, Spain
David Griol & Jose M. Molina
Department of Languages and Computer Systems, University of Granada, CITIC-UGR, C/ Pdta. Daniel Saucedo Aranda S/n, 18071, Granada, Spain
Zoraida Callejas

Authors

David Griol
View author publications
You can also search for this author in PubMed Google Scholar
Jose M. Molina
View author publications
You can also search for this author in PubMed Google Scholar
Zoraida Callejas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Griol .

Editor information

Editors and Affiliations

ETSI Industriales de Ciudad Real, University of Castilla-La Mancha, Ciudad Real, Spain
Fausto Pedro García Márquez
Drexel University, Philadelphia, Pennsylvania, USA
Benjamin Lev

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Griol, D., Molina, J.M., Callejas, Z. (2017). Big Data for Conversational Interfaces: Current Opportunities and Prospects. In: García Márquez, F., Lev, B. (eds) Big Data Management . Springer, Cham. https://doi.org/10.1007/978-3-319-45498-6_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-45498-6_6
Published: 17 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45497-9
Online ISBN: 978-3-319-45498-6
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics