Skip to main content

Detecting Multiple Domains from User’s Utterance in Spoken Dialog System

  • Chapter
Natural Language Dialog Systems and Intelligent Assistants

Abstract

Multi-domain spoken dialog system should be able to detect more than one domain from a user’s utterance. However, it is difficult to train an accurate binary classifier of a domain based on only positive and unlabeled examples. This paper improves hierarchical clustering algorithm to automatically identify reliable negative examples among unlabeled examples. This paper also verifies three linkage criteria that measure the distance between two clusters. In experiments, the proposed method resulted in the highest gain of F 1 score compared to the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  • Chang C, Lin C (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27:1–27:27

    Google Scholar 

  • Dempster AP, Laird NM, Rubin DB (1997) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B Methodol 39(1):1–38

    Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, New York, pp 520–528

    Book  MATH  Google Scholar 

  • Lane I, Kawahara T, Matsui T, Nakamura S (2007) Out-of-domain utterance detection using classification confidences of multiple topics. IEEE Trans Audio Speech Lang Process 15(1):150–161

    Article  Google Scholar 

  • Li X, Liu B (2003) Learning to classify texts using positive and unlabeled data. In: Proceedings of the 18th international joint conference on artificial intelligence, Acapulco, Mexico, August 2003

    Google Scholar 

  • Li X, Roth D (2002) Learning question classifiers. In: Proceedings of the 19th international conference on computational linguistics, Taipei, Taiwan, September 2002

    Google Scholar 

  • Liu B, Lee WS, Yu PS, Li X (2002) Partially supervised classification of text documents. In: Proceedings of the 19th international conference on machine learning, New South Wales, Sydney, July 2002

    Google Scholar 

  • Liu B, Dai Y, Li X, Lee WS, Yu PS (2003) Building text classifiers using positive and unlabeled examples. In: Proceedings of the 3rd IEEE international conference on data mining, Melbourne, Florida, USA, November 2003

    Google Scholar 

  • McCallum A, Nigam K (1998) A comparison of event models for Naive Bayes text classification. In: Proceedings of the 15th natural conference on artificial intelligence: workshop on learning from text categorization, Madison, Wisconsin, USA, July 1998

    Google Scholar 

  • Rocchio J (1971) Relevance feedback in information retrieval. In: The smart retrieval system: experiments in automatic document processing, Englewood Cliffs, New Jersey, USA, 1971

    Google Scholar 

  • Ryu S, Lee D, Lee I, Han S, Lee GG, Kim M, Kim K (2012) A hierarchical domain model-based multi-domain selection framework for multi-domain dialog systems. In: Proceedings of the 24th international conference on computational linguistics, Mumbai, India, December 2012

    Google Scholar 

  • Schölkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443–1471

    Article  MATH  Google Scholar 

  • Yu H, Han J, Chang KC (2002) PEBL: positive example based learning for web page classification using SVM. In: Proceedings of the 8th ACM SIGKDD international conference of knowledge discovery and data mining, Edmonton, Alberta, Canada, July 2002

    Google Scholar 

Download references

Acknowledgments

This work was supported by ICT R&D program of MSIP/IITP [14-824-09-014, Basic Software Research in Human-level Lifelong Machine Learning (Machine Learning Center)]. This work was supported by National Research Foundation of Korean (NRF) [NRF-2014R1A2A1A01003041, Development of Multi-party Anticipatory Knowledge-Intensive Natural Language Dialog System].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seonghan Ryu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Ryu, S., Song, J., Koo, S., Kwon, S., Lee, G.G. (2015). Detecting Multiple Domains from User’s Utterance in Spoken Dialog System. In: Lee, G., Kim, H., Jeong, M., Kim, JH. (eds) Natural Language Dialog Systems and Intelligent Assistants. Springer, Cham. https://doi.org/10.1007/978-3-319-19291-8_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19291-8_10

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19290-1

  • Online ISBN: 978-3-319-19291-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics