Introduction

Rao, K. Sreenivasa; Nandi, Dipanjan

doi:10.1007/978-3-319-17725-0_1

K. Sreenivasa Rao⁴ &
Dipanjan Nandi⁴

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

616 Accesses

Abstract

This chapter introduces the basic goal of language identification (LID) and its impacts on real-life applications. A brief overview of the basic features used for developing LID systems has been given and different categories of LID systems are also discussed here. Eventually, the primary issues in developing LID systems and the major contributions of this book towards solving those issues have been highlighted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

V.M. Vanishree, Provision for Linguistic Diversity and Linguistic Minorities in India. Master’s thesis. Applied Linguistics, St. Mary’s University College, Strawberry Hill, London, February 2011
Google Scholar
F. Runstein, F. Violaro, An isolated-word speech recognition system using neural networks. Circuits Syst. 1, 550–553 (1995)
Google Scholar
A. Kocsor, L. Toth, Application of Kernel-based feature space transformations and learning methods to phoneme classification. Appl. Intell. 21, 129–142 (2004)
Article MATH Google Scholar
R. Halavati, S.B. Shouraki, S.H. Zadeh, Recognition of human speech phonemes using a novel fuzzy approach. Appl. Soft Comput. 7, 828–839 (2007)
Article Google Scholar
T. Hao, M. Chao-Hong, L. Lin-Shan, An initial attempt for phoneme recognition using Structured Support Vector Machine (SVM), in IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 4926–4929 (2010)
Google Scholar
S. Furui, Cepstral analysis techniques for automatic speaker verification. IEEE Trans. Audio Speech Lang. Process. 29(2), 254–272 (1981)
Article Google Scholar
D.A. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Audio, Speech Lang. Process. 3(1), 4–17 (1995)
Google Scholar
D.A. Reynolds, Speaker identification and verification using gaussian mixture speaker models. Speech Commun. 17, 91–108 (1995)
Article Google Scholar
M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991
Google Scholar
K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)
Article Google Scholar
J. Balleda, H.A. Murthy, T. Nagarajan, Language identification from short segments of speech. in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000
Google Scholar
V.R. Reddy, S. Maity, K.S. Rao, Recognition of Indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–510 (2013)
Article Google Scholar
S.G. Koolagudi, K. Sreenivasa Rao, Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features. Int. J. Speech Technol. (Springer) 15(3), 495–511 (2012)
Article Google Scholar
K. Sreenivasa Rao, S.G. Koolagudi, Emotion Recognition using Speech Features. (Springer, 2012). ISBN 978-1-4614-5142-6
Google Scholar
K. Sreenivasa Rao, S.G. Koolagudi, Robust Emotion Recognition Using Spectral And Prosodic Features. (Springer, 2012). ISBN 978-1-4614-6359-7
Google Scholar
S.G. Koolagudi, D. Rastogi, K. Sreenivasa Rao, Spoken language identification using spectral features. Communications in Computer and Information Science (CCIS): Contemporary Computing, vol. 306, (Springer, 2012), pp. 496–497
Google Scholar
D. Neiberg, K. Elenius, K. Laskowski, Emotion recognition in spontaneous speech using GMMs, in Internation Speech Communication and Association (INTERSPEECH), September 2006
Google Scholar
D. Bitouk, R. Verma, A. Nenkova, Class-level spectral features for emotion recognition. Speech Commun. 52(7), 613–625 (2009)
Google Scholar
K.S. Rao, B. Yegnanarayana, Modeling durations of syllables using neural networks. Comput. Speech Lang. 21, 282–295 (2007)
Article Google Scholar
A.G. Adami, R. Mihaescu, D.A. Reynolds, J.J. Godfrey, Modeling prosodic dynamics for speaker recognition, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 4, April 2003
Google Scholar
L. Mary, B. Yegnanarayana, Extraction and representation of prosodic features for language and speaker recognition. Speech Commun. 50(10), 782–796 (2008)
Article Google Scholar
K.S. Rao, S.G. Koolagudi, R.R. Vempada, Emotion recognition from speech using global and local prosodic features. Int. J. Speech Technol. 16(2), 143–160 (2013)
Article Google Scholar
K. Sreenivasa Rao, S.G. Koolagudi, Identification of hindi dialects and emotions using spectral and prosodic features of speech. J. Syst. Cybern. Inform. 9(4), 24–33 (2011)
Google Scholar
J. Yadav, K. Sreenivasa Rao, Emotional-speech synthesis from neutral-speech using prosody imposition, in International Conference on Recent Trends in Computer Science and Engineering (ICRTCSE-2014), Central University of Bihar, Patna, India, 8–9, February 2014
Google Scholar
D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012
Google Scholar
J. Makhoul, Linear prediction: a tutorial review. Proc. IEEE 63(4), 561–580 (1975)
Article Google Scholar
B. Yegnanarayana, T.K. Raja, Performance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977)
Google Scholar
C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks May 2002
Google Scholar
D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)
Article Google Scholar
D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, India, November 2013
Google Scholar
D. Pati, S.R.M. Prasanna, A comparative study of explicit and implicit modelling of subsegmental speaker-specific excitation source information. Sadhana (Springer) 38(4), 591–620 (2013)
Google Scholar
A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004
Google Scholar
K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)
Article Google Scholar
A.V. Singh, J. Mukhopadhyay, K. Sreenivasa Rao, K. Viswanath, Classification of infant cries using dynamics of epoch features. J. Intell. Syst. 22(3), 253–267 (2013)
Google Scholar
A.V. Singh, J. Mukhopadyay, S.B.S. Kumar, K. Sreenivasa Rao, Infant cry recognition using excitation source features, in IEEE INDICON, Mumbai, India, December 2013
Google Scholar
S.R.M. Prasanna, C.S. Gupta, B. Yegnanarayana, Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Commun. 48, 1243–1261 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
K. Sreenivasa Rao & Dipanjan Nandi

Authors

K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar
Dipanjan Nandi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rao, K.S., Nandi, D. (2015). Introduction. In: Language Identification Using Excitation Source Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-17725-0_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-17725-0_1
Published: 16 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17724-3
Online ISBN: 978-3-319-17725-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics