Skip to main content

Big Data for Conversational Interfaces: Current Opportunities and Prospects

  • Chapter
  • First Online:
Big Data Management

Abstract

As conversational technologies develop, we demand more from them. For instance, we want our conversational assistants to be able to solve our queries in multiple domains, to manage information from different usually unstructured sources, to be able to perform a variety of tasks, and understand open conversational language. However, developing the resources necessary to develop systems with such capabilities demands much time and effort, as for each domain, task or language, data must be collected, annotated following an schema that is usually not portable, the models must be trained over the annotated data, and their accuracy must be evaluated. In recent years, there has been a growing interest in investigating alternatives to manual effort that allow exploiting automatically the huge amount of resources available in the web. In this chapter we describe the main initiatives to extract, process and contextualize information from these rich and heterogeneous sources for the various tasks involved in dialog systems, including speech processing, natural language understanding and dialog management.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.w3.org/TR/voicexml20/.

  2. 2.

    http://htk.eng.cam.ac.uk/.

  3. 3.

    http://cmusphinx.sourceforge.net/.

  4. 4.

    https://web.stanford.edu/class/cs124/lec/languagemodeling.pdf.

References

  1. Abdennadher S, Aly M, Bhler D, Minker W, Pittermann J (2007) Becam tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management. In: Proceedings of the international conference on spoken language processing (Interspeech’2007), pp 946–949

    Google Scholar 

  2. Agerri R, Artola X, Beloki Z, Rigau G, Soroa A (2015) Big data for natural language processing: a streaming approach. Knowl-Based Syst 79:36–42

    Article  Google Scholar 

  3. Bahl L, Jelinek F, Mercer R (1990) A maximum likelihood approach to continuous speech recognition. Readings in Speech recognition, pp 308–319

    Google Scholar 

  4. Baimbetov Y, Khalil I, Steinbauer M, Anderst-Kotsis G (2015) Using Big Data for emotionally intelligent mobile services through multi-modal emotion recognition. Springer, pp 127–138

    Google Scholar 

  5. Batliner A, Burkhardt F, van Ballegooy M, Noth E (2006) A taxonomy of applications that utilize emotional awareness. In: Proceedings of 1st international language technologies conference (IS-LTC 06), pp 246–250

    Google Scholar 

  6. Bickmore T, Giorgino T (2004) Some novel aspects of health communication from a dialogue systems perspective. In: Proceedings of AAAI fall symposium on dialogue systems for health communication, pp 275–291

    Google Scholar 

  7. Bos J, Klein E, Lemon O, Oka T (2003) DIPPER: description and formalisation of an information-state update dialogue system architecture. In: Proceedings of the SIGdial, pp 115–124

    Google Scholar 

  8. Chung G (2004) Developing a flexible spoken dialog system using simulation. In: Proceedings of ACL, pp 63–70

    Google Scholar 

  9. Cohn DA, Atlas L, Ladner R (1994) Improving generalization with active learning. Mach Learn 15(2):201–221

    Google Scholar 

  10. Cuayhuitl H, Renals S, Lemon O, Shimodaira H (2005) Human-computer dialogue simulation using hidden Markov models. In: Proceedings of ASRU, pp 290–295

    Google Scholar 

  11. Dutoit T (1996) An introduction to text-to-speech synthesis. Kluwer Academic Publishers

    Google Scholar 

  12. Eckert W, Levin E, Pieraccini R (1997) User modeling for spoken dialogue system evaluation. In: Proceedings of ASRU, pp 80–87

    Google Scholar 

  13. Eckert W, Levin E, Pieraccini R (1998) Automatic evaluation of spoken dialogue systems. Technical report, TR98.9.1, ATT Labs Research

    Google Scholar 

  14. Esteve Y, Raymond C, Bechet F, Mori RD (2003) Conceptual decoding for spoken dialog systems. In: Proceedings of European conference on speech communications and technology (Eurospeech’03). vol 1, pp 617–620

    Google Scholar 

  15. Fabbrizio GD, Tur G, Hakkani-Tr D, Gilbert M, Renger B, Gibbon D, Liu Z, Shahraray B (2008) Bootstrapping spoken dialogue systems by exploiting reusable libraries. Nat Lang Eng 14(3):313–335

    Article  Google Scholar 

  16. Fraser M, Gilbert G (1991) Simulating speech systems. Comput Speech Lang 5:81–99

    Article  Google Scholar 

  17. Georgila K, Henderson J, Lemon O (2005) Learning user simulations for information state update dialogue systems. In: Proceedings of Eurospeech’05, pp. 893–896

    Google Scholar 

  18. Gibbon D, Mertins I (Eds.), R.M.: Handbook of multimodal and spoken dialogue systems: resources, terminology and product evaluation. Kluwer Academic Publishers (2000)

    Google Scholar 

  19. Gudivada VN, Rao D, Raghavan VV (2015) Big data driven natural language processing research and applications, vol 33. Elsevier, pp 203–238

    Google Scholar 

  20. He Y, Young S (2003) A data-driven spoken language understanding system. In: Proceedings of IEEE Automatic speech recognition and understanding workshop (ASRU’03), pp 583–588

    Google Scholar 

  21. Heeman P (2007) Combining reinforcement learning with information-state update rules. In: Proceedings of the 8th Annual conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07), pp 268–275

    Google Scholar 

  22. Heinroth T, Minker W (2012) Introducing spoken dialogue systems into intelligent environments. Kluwer Academic Publishers, Springer

    Google Scholar 

  23. Hempel T (2008) Usability of speech dialog systems: listening to the target audience. Springer

    Google Scholar 

  24. Hinton G, Deng L, Yu D, Dahl G, Mohamed A, Jaitly N, Senior A, Vanhoucke V, Nguyen P, Sainath T, Kingsbury B (2012) Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Process Mag 29(6):82–97

    Article  Google Scholar 

  25. Hori C, Ohtake K, Misu T, Kashioka H, Nakamura S (2009) Recent advances in WFST-based dialog system. In: Proceedings of the international conference on spoken language processing (Interspeech’2009), pp 268–271

    Google Scholar 

  26. Hoxha J, Weng C (2016) Leveraging dialog systems research to assist biomedical researchers interrogation of big clinical data. J Biomed Inf 61:176–184

    Article  Google Scholar 

  27. Hurtado L, Planells J, Segarra E, Sanchis E, Griol D (2010) A stochastic finite-state transducer approach to spoken dialog management. In: Proceedings of the international conference on spoken language processing (Interspeech’2010), pp 3002–3005

    Google Scholar 

  28. Jelinek F (1990) Self-organized language modeling for speech recognition. Readings in Speech recognition, pp 450–506

    Google Scholar 

  29. Jelinek F, Lafferty MR (1992) Basic methods of probabilistic context free grammars. Springer, pp 345–360

    Google Scholar 

  30. Jung S, Lee C, Kim K, Lee D, Lee G (2011) Hybrid user intention modeling to diversify dialog simulations. Comput Speech Lang 25(2):307–326

    Article  Google Scholar 

  31. Lane I, Ueno S, Kawahara T (2004) Cooperative dialogue planning with user and situation models via example-based training. In: Proceedings of workshop on man-machine symbiotic systems, pp 2837–2840, Kyoto, Japan

    Google Scholar 

  32. Laroche R, Putois G, Bretier P, Young S, Lemon O (2008) Requirements analysis and theory for statistical learning approaches in automaton-based dialogue management. Technical report, School of Informatics, Edinburgh University, Edinburgh, UK

    Google Scholar 

  33. Lee C, Jung S, Kim K, Lee GG (2010) Hybrid approach to robust dialog management using agenda and dialog examples. Comput Speech Lang 24(4):609–631

    Article  Google Scholar 

  34. Lemon O (2011) Learning what to say and how to say it: joint optimisation of spoken dialogue management and natural language generation. Comput Speech Lang 25(2):210–221

    Article  Google Scholar 

  35. Lemon O, Pietquin O (2012) Data-Driven methods for adaptive spoken dialogue systems. Computational learning for conversational interfaces. Springer, Berlin

    Book  Google Scholar 

  36. Lemon O, Georgila K, Henderson J (2006) Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: the TALK TownInfo evaluation. In: Proceedings of IEEE-ACL workshop on spoken language technology (SLT’06), pp 178–181

    Google Scholar 

  37. Levin E, Pieraccini R (1995) Concept-based spontaneous speech understanding system. In: Proceedings of European conference on speech communications and technology (Eurospeech’95). pp. 555–558 (1995)

    Google Scholar 

  38. Levin E, Pieraccini R, Eckert W (2000) A stochastic model of human-machine interaction for learning dialog strategies. IEEE Trans Speech Audio Process 8(1):11–23

    Article  Google Scholar 

  39. Lin B, Lee L (2001) Computer aided analysis and design for spoken dialogue systems based on quantitative simulations. IEEE Trans Speech Audio Process 9(5):534–548

    Article  Google Scholar 

  40. Litman D, Forbes-Riley K (2006) Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors. Speech Commun 48(5):559–590

    Article  Google Scholar 

  41. Liu Y, Shriberg E (2005) Does active learning help automatic dialog act tagging in meeting data. In: Proceedings of the international conference on spoken language processing (Interspeech’2005), pp 2777–2780, Lisbon, Portugal

    Google Scholar 

  42. López V, Eisman E, Castro J, Zurita J (2011) A case based reasoning model for multilingual language generation in dialogues. Expert Syst Appl 39(8):7330–7337

    Article  Google Scholar 

  43. López-Cózar R, Callejas Z, McTear M (2006) Testing the performance of spoken dialogue systems by means of an artificially simulated user. Artif Intell Rev 26:291–323

    Article  Google Scholar 

  44. López-Cózar R, la Torre AD, Segura J, Rubio A, Sánchez V (2003) Assessment of dialogue systems by means of a new simulation technique. Speech Commun 40(3):387–407

    Article  Google Scholar 

  45. López-Cózar R, Callejas Z (2008) ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information. Comput Speech Lang 50(8–9):745–766

    Google Scholar 

  46. López-Cózar R, Callejas Z, Griol D (2010) ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information. Knowl-Based Syst 23(5):471–485

    Article  Google Scholar 

  47. Mayer-Schonberger V (2003) Big data: a revolution that will transform how we live, work, and think. Eamon Dolan-Houghton Mifflin Harcourt

    Google Scholar 

  48. McTear MF, Callejas Z, Griol D (2016) The conversational interface. Springer

    Google Scholar 

  49. Meng HH, Wai C, Pieraccini R (2003) The use of belief networks for mixed-initiative dialog modeling. IEEE Trans Speech Audio Process 11(6):757–773

    Article  Google Scholar 

  50. Minker W (1999) Design considerations for knowledge source representations of a stochastically-based natural language understanding component. Speech Commun 28(2):141–154

    Article  Google Scholar 

  51. Minker W, Waibel A, Mariani J (1999) Stochastically-based semantic analysis. Kluwer Academic Publishers, Dordrecht (Holland)

    Book  Google Scholar 

  52. Mller S, Englert R, Engelbrecht K, Hafner V, Jameson A, Oulasvirta A, Raake A, Reithinger N (2006) MeMo: towards automatic usability evaluation of spoken dialogue services by user error simulations. In: Proceedings of the Interspeech, pp 1786–1789

    Google Scholar 

  53. Najafabadi M, Villanustre F, Khoshgoftaar T, Seliya N, WaldEmail R, Muharemagic E (2015) Deep learning applications and challenges in big data analytics. J Big Data 2(1)

    Google Scholar 

  54. Oh AH, Rudnicky AI (2000) Stochastic language generation for spoken dialogue systems. In: Proceedings of ANLP/NAACL workshop on conversational systems, pp 27–32

    Google Scholar 

  55. O’Shaughnessy D (2008) Automatic speech recognition: history, methods and challenges. Pattern Recogn 41(10):2965–2979

    Article  Google Scholar 

  56. O’Shea J, Bandar Z, Crockett K (2012) A multi-classifier approach to dialogue act classification using function words. Lecture notes in computer science, vol 7270, pp 119–143

    Google Scholar 

  57. Paek T, Horvitz E (2000) Conversation as action under uncertainty. In: Proceedings of the 16th conference on uncertainty in artificial intelligence, pp 455–464

    Google Scholar 

  58. Paek T, Pieraccini R (2008) Automating spoken dialogue management design using machine learning: an industry perspective. Speech Commun 50(8–9):716–729

    Article  Google Scholar 

  59. Pieraccini R (2012) The voice in the machine: building computers that understand speech. MIT Press

    Google Scholar 

  60. Planells J, Hurtado L, Sanchis E, Segarra E (2012) An online generated transducer to increase dialog manager coverage. In: Proceedings of the international conference on spoken language processing (Interspeech’2012)

    Google Scholar 

  61. Rabiner L, Juang B, Lee C (1996) An overview of automatic speech recognition. Kluwer Academic Publishers, pp 1–30

    Google Scholar 

  62. Rojas-Barahona L, Giorgino T (2009) Adaptable dialog architecture and runtime engine (adarte): a framework for rapid prototyping of health dialog systems. Int J Med Inf 78:56–68

    Article  Google Scholar 

  63. Roy N, Pineau J, Thrun S (2000) Spoken dialogue management using probabilistic reasoning. In: Proceedings of the 38th Annual meeting of the association for computational linguistics (ACL’00), pp 93–100

    Google Scholar 

  64. Schatzmann J, Thomson B, Weilhammer K, Ye H, Young S (2007) Agenda-based user simulation for bootstrapping a POMDP dialogue system. In: Proceedings of HLT/NAACL, pp 149–152

    Google Scholar 

  65. Schatzmann J, Thomson B, Young S (2007) Statistical user simulation with a hidden agenda. In: Proceedings of SIGdial, pp 273–282

    Google Scholar 

  66. Schatzmann J, Weilhammer K, Stuttle M, Young S (2006) A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. Knowl Eng Rev 21(2):97–126

    Article  Google Scholar 

  67. Segarra E et al (2002) Extracting semantic information through automatic learning techniques. Int J Pattern Recogn Artif Intell 16(3):301–307

    Article  Google Scholar 

  68. Seide F, Li G, Yu D (2011) Conversational speech transcription using context-dependent deep neural networks. In: Proceedings of the 12th annual conference of the international speech communication association (InterSpeech 2011), pp 437–440. Florence, Italy

    Google Scholar 

  69. Shamim-Hossain M, Muhammad G, Alhamid MF, Song B, Al-Mutib K (2016) Audio-visual emotion recognition using big data towards 5G. Mobile Netw Appl 1:1–11

    Article  Google Scholar 

  70. Singh S, Kearns M, Litman D, Walker M (1999) Reinforcement learning for spoken dialogue systems. In: Proceedings of neural information processing systems (NIPS’99), pp 956–962

    Google Scholar 

  71. Singh S, Litman D, Kearns M, Walker M (2002) Optimizing dialogue management with reinforcement leaning: experiments with the NJFun system. J Artif Intell 16:105–133

    Google Scholar 

  72. Suendermann D, Pieraccini R (2012) One year of contender: what have we learned about assessing and tuning industrial spoken dialog systems? In: Proceedings of NAACL-HLT workshop on future directions and needs in the spoken dialog community: tools and data (SDCTD’12), pp 45–48

    Google Scholar 

  73. Thomson B, Schatzmann J, Weilhammer K, Ye H, Young S (2007) Training a real-world POMDP-based Dialog System. In: Proceedings of NAACL-HLT-Dialog’07 workshop on bridging the gap: academic and industrial research in dialog technologies, pp 9–16

    Google Scholar 

  74. Torres F, Sanchis E, Segarra E (2003) Development of a stochastic dialog manager driven by semantics. In: Proceedings of European conference on speech communications and technology (Eurospeech’03), pp 605–608

    Google Scholar 

  75. Torres F, Sanchis E, Segarra E (2008) User simulation in a stochastic dialog system. Comput Speech Lang 22:230–255

    Article  Google Scholar 

  76. Torres F, Sanchis E, Segarra E (2008) User simulation in a stochastic dialog system. Comput Speech Lang 22(3):230–255

    Article  Google Scholar 

  77. Traum D, Larsson S (2003) The information state approach to dialogue management. Kluwer, pp 325–353

    Google Scholar 

  78. Tsilfidis A, Mporas I, Mourjopoulos J, Fakotakis N (2013) Automatic speech recognition performance in different room acoustic environments with and without dereverberation preprocessing. Comput Speech Lang 27(1):380–395

    Article  Google Scholar 

  79. Venkataraman A, Stolcke A, Shriberg E (2002) Automatic dialog act labeling with minimal supervision. In: Proceedings of the 9th Australian international conference on speech science & technology

    Google Scholar 

  80. Vipperla R, Wolters M, Renals S (2012) Spoken dialogue interfaces for older people. IOS Press, pp 118–137

    Google Scholar 

  81. Wilks Y, Catizone R, Worgan S, Turunen M (2011) Some background on dialogue management and conversational speech for dialogue systems. Comput Speech Lang 25:128–139

    Article  Google Scholar 

  82. Williams J, Poupart P, Young S (2006) Partially Observable Markov decision processes with continuous observations for dialogue management. Springer, pp 191–217

    Google Scholar 

  83. Williams J, Young S (2007) Partially observable Markov decision processes for spoken dialog systems. Comput Speech Lang 21(2):393–422

    Article  Google Scholar 

  84. Williams J (2009) The best of both worlds: unifying conventional dialog systems and pomdps. In: Proceedings of Interspeech, pp 1173–1176

    Google Scholar 

  85. Wu WL, Lu RZ, Duan JY, Liu H, Gao F, Chen YQ (2010) Spoken language understanding using weakly supervised learning. Comput Speech Lang 24(2):358–382

    Article  Google Scholar 

  86. Young S (2002) The statistical approach to the design of spoken dialogue systems. Technical report, CUED/F-INFENG/TR.433, Cambridge University Engineering Department, Cambridge, UK

    Google Scholar 

  87. Young S, Gasic M, Thomson B, Williams J (2013) Pomdp-based statistical spoken dialogue systems: a review. In: Proceedings of the IEEE, pp 1–18, Montreal, Canada

    Google Scholar 

  88. Young S, Williams J, Schatzmann J, Stuttle M, Weilhammer K (2005) The hidden information state approach to dialogue management. Technical report, Department of Engineering, University of Cambridge, Cambridge, UK

    Google Scholar 

  89. Young S, Schatzmann J, Weilhammer K, Ye H (2007) The hidden information state approach to dialogue management. In: Proceedings of the 32nd IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 149–152

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Griol .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Griol, D., Molina, J.M., Callejas, Z. (2017). Big Data for Conversational Interfaces: Current Opportunities and Prospects. In: García Márquez, F., Lev, B. (eds) Big Data Management . Springer, Cham. https://doi.org/10.1007/978-3-319-45498-6_6

Download citation

Publish with us

Policies and ethics