Skip to main content

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

  • 616 Accesses

Abstract

This chapter introduces the basic goal of language identification (LID) and its impacts on real-life applications. A brief overview of the basic features used for developing LID systems has been given and different categories of LID systems are also discussed here. Eventually, the primary issues in developing LID systems and the major contributions of this book towards solving those issues have been highlighted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. V.M. Vanishree, Provision for Linguistic Diversity and Linguistic Minorities in India. Master’s thesis. Applied Linguistics, St. Mary’s University College, Strawberry Hill, London, February 2011

    Google Scholar 

  2. F. Runstein, F. Violaro, An isolated-word speech recognition system using neural networks. Circuits Syst. 1, 550–553 (1995)

    Google Scholar 

  3. A. Kocsor, L. Toth, Application of Kernel-based feature space transformations and learning methods to phoneme classification. Appl. Intell. 21, 129–142 (2004)

    Article  MATH  Google Scholar 

  4. R. Halavati, S.B. Shouraki, S.H. Zadeh, Recognition of human speech phonemes using a novel fuzzy approach. Appl. Soft Comput. 7, 828–839 (2007)

    Article  Google Scholar 

  5. T. Hao, M. Chao-Hong, L. Lin-Shan, An initial attempt for phoneme recognition using Structured Support Vector Machine (SVM), in IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 4926–4929 (2010)

    Google Scholar 

  6. S. Furui, Cepstral analysis techniques for automatic speaker verification. IEEE Trans. Audio Speech Lang. Process. 29(2), 254–272 (1981)

    Article  Google Scholar 

  7. D.A. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Audio, Speech Lang. Process. 3(1), 4–17 (1995)

    Google Scholar 

  8. D.A. Reynolds, Speaker identification and verification using gaussian mixture speaker models. Speech Commun. 17, 91–108 (1995)

    Article  Google Scholar 

  9. M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991

    Google Scholar 

  10. K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)

    Article  Google Scholar 

  11. J. Balleda, H.A. Murthy, T. Nagarajan, Language identification from short segments of speech. in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000

    Google Scholar 

  12. V.R. Reddy, S. Maity, K.S. Rao, Recognition of Indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–510 (2013)

    Article  Google Scholar 

  13. S.G. Koolagudi, K. Sreenivasa Rao, Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features. Int. J. Speech Technol. (Springer) 15(3), 495–511 (2012)

    Article  Google Scholar 

  14. K. Sreenivasa Rao, S.G. Koolagudi, Emotion Recognition using Speech Features. (Springer, 2012). ISBN 978-1-4614-5142-6

    Google Scholar 

  15. K. Sreenivasa Rao, S.G. Koolagudi, Robust Emotion Recognition Using Spectral And Prosodic Features. (Springer, 2012). ISBN 978-1-4614-6359-7

    Google Scholar 

  16. S.G. Koolagudi, D. Rastogi, K. Sreenivasa Rao, Spoken language identification using spectral features. Communications in Computer and Information Science (CCIS): Contemporary Computing, vol. 306, (Springer, 2012), pp. 496–497

    Google Scholar 

  17. D. Neiberg, K. Elenius, K. Laskowski, Emotion recognition in spontaneous speech using GMMs, in Internation Speech Communication and Association (INTERSPEECH), September 2006

    Google Scholar 

  18. D. Bitouk, R. Verma, A. Nenkova, Class-level spectral features for emotion recognition. Speech Commun. 52(7), 613–625 (2009)

    Google Scholar 

  19. K.S. Rao, B. Yegnanarayana, Modeling durations of syllables using neural networks. Comput. Speech Lang. 21, 282–295 (2007)

    Article  Google Scholar 

  20. A.G. Adami, R. Mihaescu, D.A. Reynolds, J.J. Godfrey, Modeling prosodic dynamics for speaker recognition, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 4, April 2003

    Google Scholar 

  21. L. Mary, B. Yegnanarayana, Extraction and representation of prosodic features for language and speaker recognition. Speech Commun. 50(10), 782–796 (2008)

    Article  Google Scholar 

  22. K.S. Rao, S.G. Koolagudi, R.R. Vempada, Emotion recognition from speech using global and local prosodic features. Int. J. Speech Technol. 16(2), 143–160 (2013)

    Article  Google Scholar 

  23. K. Sreenivasa Rao, S.G. Koolagudi, Identification of hindi dialects and emotions using spectral and prosodic features of speech. J. Syst. Cybern. Inform. 9(4), 24–33 (2011)

    Google Scholar 

  24. J. Yadav, K. Sreenivasa Rao, Emotional-speech synthesis from neutral-speech using prosody imposition, in International Conference on Recent Trends in Computer Science and Engineering (ICRTCSE-2014), Central University of Bihar, Patna, India, 8–9, February 2014

    Google Scholar 

  25. D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012

    Google Scholar 

  26. J. Makhoul, Linear prediction: a tutorial review. Proc. IEEE 63(4), 561–580 (1975)

    Article  Google Scholar 

  27. B. Yegnanarayana, T.K. Raja, Performance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977)

    Google Scholar 

  28. C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks May 2002

    Google Scholar 

  29. D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)

    Article  Google Scholar 

  30. D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, India, November 2013

    Google Scholar 

  31. D. Pati, S.R.M. Prasanna, A comparative study of explicit and implicit modelling of subsegmental speaker-specific excitation source information. Sadhana (Springer) 38(4), 591–620 (2013)

    Google Scholar 

  32. A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004

    Google Scholar 

  33. K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)

    Article  Google Scholar 

  34. A.V. Singh, J. Mukhopadhyay, K. Sreenivasa Rao, K. Viswanath, Classification of infant cries using dynamics of epoch features. J. Intell. Syst. 22(3), 253–267 (2013)

    Google Scholar 

  35. A.V. Singh, J. Mukhopadyay, S.B.S. Kumar, K. Sreenivasa Rao, Infant cry recognition using excitation source features, in IEEE INDICON, Mumbai, India, December 2013

    Google Scholar 

  36. S.R.M. Prasanna, C.S. Gupta, B. Yegnanarayana, Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Commun. 48, 1243–1261 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

© 2015 The Author(s)

About this chapter

Cite this chapter

Rao, K.S., Nandi, D. (2015). Introduction. In: Language Identification Using Excitation Source Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-17725-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-17725-0_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-17724-3

  • Online ISBN: 978-3-319-17725-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics