Skip to main content

Language Identification—A Brief Review

  • Chapter
  • First Online:
Language Identification Using Excitation Source Features

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

Abstract

This chapter provides compendious reviews about both the explicit and implicit LID systems present in the literature. Existing works related to language identification in Indian context are briefly discussed. The related works about the excitation source features are also presented here. Various speech features and models proposed in the context of language identification are briefly reviewed in this chapter. The motivation for the present work from the existing literature is briefly discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. R. Leonard, G. Doddington, Automatic language identification. Technical Report RADC-TR-74-200 (Air Force Rome Air Development Center, Technical Report) August 1974

    Google Scholar 

  2. R. Leonard, Language Recognition Test and Evaluation. Technical Report RADCTR-80-83 (Air Force Rome Air Development Center, Technical Report). March 1980

    Google Scholar 

  3. A.S. House, E.P. Neuberg, Toward automatic identification of the languages of an utterance. J. Acoust. Soc. Am. 62(3), 708–713 (1977)

    Article  Google Scholar 

  4. K.P. Li, T.J. Edwards, Statistical models for automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 884–887, April 1980

    Google Scholar 

  5. L.F. Lamel, J.L. Gauvain, Cross lingual experiments with phone recognition. in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 507–510, April 1993

    Google Scholar 

  6. L.F. Lamel, J.L. Gauvain, Language identification using phonebased acoustic likelihoods, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, , pp. I/293–I/296, April 1994

    Google Scholar 

  7. Y. Muthusamy, R. Cole, M. Gopalakrishnan, A segment-based approach to automatic language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 353–356, April 1991

    Google Scholar 

  8. K.M. Berkling, T. Arai, E. Bernard, Analysis of phoneme based features for language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. I/289–I/292, April 1994

    Google Scholar 

  9. R.C.F. Tucker, M. Carey, E. Parris, Automatic language identification using sub-word models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/301–I/30, April 1994

    Google Scholar 

  10. M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 pp. I/305–I/308, (1994)

    Google Scholar 

  11. S. Kadambe, J. Hieronymus, Language identification with phonological and lexical models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3507–351, May 1995

    Google Scholar 

  12. Y. Yan, E. Barnard, An approach to automatic language identification based on language-dependent phone recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. 3511–3514, May 1995

    Google Scholar 

  13. J. Navratil, W. Zuhlke, Phonetic-context mapping in language identification. Eur. Speech Commun. Assoc. (EUROSPEECH) 1, 71–74 (1997)

    Google Scholar 

  14. T.J. Hazen, V.W. Zue, Segment-based automatic language identification. J. Acoust. Soc. Am. 101, 2323–2331 (1997)

    Article  Google Scholar 

  15. K. Kirchhoff, S. Parandekar, Multi-stream statistical n-gram modeling with application to automatic language identification, in European Speech Communication Association (EUROSPEECH), pp. 803–806, (2001)

    Google Scholar 

  16. T. Gleason, M. Zissman, Composite background models and score standardization for language identification systems, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2001)

    Google Scholar 

  17. V. Ramasubramanian, A.K.V.S. Jayram, T.V. Sreenivas, Language identification using parallel sub-word recognition - an ergodic HMM equivalence, European Speech Communication Association (EUROSPEECH) (Geneva, Switzerland), September 2003

    Google Scholar 

  18. J. Gauvain, A. Messaoudi, H. Schwenk, Language recognition using phone latices, in International Speech Communication Association (INTERSPEECH), pp. 25–28 (2004)

    Google Scholar 

  19. W. Shen, W. Campbell, T. Gleason, D. Reynolds, E. Singer, Experiments with lattice-based PPRLM language identification, in Speaker and Language Recognition Workshop, pp. 1–6 (2006)

    Google Scholar 

  20. H. Li, B. Ma, C.H. Lee, A vector space modeling approach to spoken language identification. IEEE Trans. Audio Speech Lang. Process. 15(1), 271–284 (2007)

    Article  Google Scholar 

  21. K.C. Sim, H. Li, On acoustic diversification front-end for spoken language identification. IEEE Trans. Audio Speech Lang. Process. 16(5), 1029–1037 (2008)

    Article  Google Scholar 

  22. R. Tong, B. Ma, H. Li, E.S. Chng, A target-oriented phonotactic front-end for spoken language recognition. IEEE Trans. Audio Speech Lang. Process. 17(7), 1335–1347 (2009)

    Article  Google Scholar 

  23. G.R. Botha, E. Barnard, Factors that affect the accuracy of text-based language identification. Comput. Speech Lang. 26(5), 307–320 (2012)

    Article  Google Scholar 

  24. N. Barroso, K. Lopez de Ipina, C. Hernandez, A. Ezeiza, M. Grana, Semantic speech recognition in the Basque context Part II: language identification for under-resourced languages. Int. J. Speech Technol. 15(1), 41–47 (2012)

    Article  Google Scholar 

  25. S.M. Siniscalchi, J. Reed, T. Svendsen, C.-H. Lee, Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1), 209–227 (2013)

    Article  Google Scholar 

  26. J.T. Foil, Language identification using noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 861–864, (1986)

    Google Scholar 

  27. F. Goodman, A. Martin, R. Wohlford, Improved automatic language identification in noisy speech, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 528–531, May 1989

    Google Scholar 

  28. M. Sugiyama, Automatic language recognition using acoustic features, in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 813–816, May 1991

    Google Scholar 

  29. D. Morgan, L. Riek, W. Mistretta, C. Scofield, P. Grouin, F. Hull, Experiments in language identification with neural networks. Int. Joint Conf. Neural Netw. 2, 320–325 (1992)

    Google Scholar 

  30. M. Zissman, Automatic language identification using gaussian mixture and hidden markov models, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 399–402, April 1993

    Google Scholar 

  31. D.A. Reynolds, R.C. Rose, Robust text -independent speaker identification using gaussian mixture speaker models. IEEE Trans. Audio Speech Lang. Process. 3(1), 72–83 (1995)

    Article  Google Scholar 

  32. S. Itahashi, J. Zhou, K. Tanaka, Spoken language discrimination using speech fundamental frequency, in International Conference on Spoken Language Processing (ICSLP), pp. 1899–1902, (1994)

    Google Scholar 

  33. I. Shuichi, D. Liang, Language identification based on speech fundamental frequency, in European Speech Communication Association (EUROSPEECH), pp. 1359–1362 (1995)

    Google Scholar 

  34. K.P. Li, Automatic language identification using syllabic spectral features, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. I/297–I/300, April 1994

    Google Scholar 

  35. F. Pellegrino, R. Andre-Obrecht, An unsupervised approach to language identification, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 833–836, Mar 1999

    Google Scholar 

  36. J.L. Rouas, J. Farinas, F. Pellegrino, R. Andr-Obrecht, Rhythmic unit extraction and modelling for automatic language identification. Speech Commun. 47, 436–456 (2005)

    Article  Google Scholar 

  37. J.L. Rouas, Automatic prosodic variations modeling for language and dialect discrimination. IEEE Trans. Audio Speech Lang. Process. 15(6), 1904–1911 (2007)

    Article  Google Scholar 

  38. A. Sangwan, M. Mehrabani, J. Hansen, Automatic language analysis and identification based on speech production knowledge, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5006–5009, March 2010

    Google Scholar 

  39. D. Martinez, L. Burget, L. Ferrer, N. Scheffer, i-vector based prosodic system for language identification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4861–4864, March 2012

    Google Scholar 

  40. J. Balleda, H.A. Murthy, T. Nagarajan, Language Identification from Short Segments of Speech, in International Conference on Spoken Language Processing (ICSLP), pp. 1033–1036, October 2000

    Google Scholar 

  41. T. Nagarajan, Implicit system for spoken language identification, Ph.D. dissertation, Indian Institute of Technology Madras, India (2004)

    Google Scholar 

  42. A.K.V.S. Jayaram, V. Ramasubramanian, T.V. Sreenivas, Language identification using parallel sub-word recognition, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 32–35, April 2003

    Google Scholar 

  43. L. Mary, B. Yegnanarayana, Autoassociative neural network models for language identification. in Internatioanl Conference on Intelligent Sensing and Information Processing, pp. 317–320 (2004)

    Google Scholar 

  44. L. Mary, Multilevel implicit features for language and speaker recognition, Ph.D. dissertation, Indian Institute of Technology Madras, India (2006)

    Google Scholar 

  45. K.S. Rao, S. Maity, V.R. Reddy, Pitch synchronous and glottal closure based speech analysis for language recognition. Int. J. Speech Technol. (Springer) 16(4), 413–430 (2013)

    Article  Google Scholar 

  46. V.R. Reddy, S. Maity, K.S. Rao, Identification of indian languages using multi-level spectral and prosodic features. Int. J. Speech Technol. (Springer) 16(4), 489–511 (2013)

    Article  Google Scholar 

  47. S. Jothilakshmi, V. Ramalingam, S. Palanivel, A hierarchical language identification system for Indian languages. Digital Signal Process. (Elsevier) 22(3), 544–553 (2012)

    Article  MathSciNet  Google Scholar 

  48. B. Bhaskar, D. Nandi, K.S. Rao, Analysis of language identification performance based on gender and hierarchial grouping approaches, in International Conference on Natural Language Processing, December 2013

    Google Scholar 

  49. B. Yegnanarayana, T.K. Raja, Perfoemance of linear prediction analysis on speech with additive noise, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (1977)

    Google Scholar 

  50. B. Yegnanarayana, S.R.M. Prasanna, J. Zachariah, C. Gupta, Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Trans. Audio Speech Lang. Process. 13(4), 575–582 (2005)

    Article  Google Scholar 

  51. C.S. Gupta, S.R.M. Prasanna, B. Yegnanarayana, Autoassociative neural network models for online speaker verification using source features from vowels, in IEEE International Joint Conference Neural Networks, May 2002

    Google Scholar 

  52. D. Pati, S.R.M. Prasanna, Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information. Int. J. Speech Technol. (Springer) 14(1), 49–63 (2011)

    Article  Google Scholar 

  53. D. Pati, D. Nandi, K. Sreenivasa Rao, Robustness of excitation source information for language independent speaker recognition, in 16th International Oriental COCOSDA Conference, Gurgoan, November 2013

    Google Scholar 

  54. A. Bajpai, B. Yegnanarayana, Exploring features for audio clip classification using LP residual and AANN models, in International Conference on Intelligent Sensing and Information Processing, pp. 305–310, January 2004

    Google Scholar 

  55. K.S. Rao, S.G. Koolagudi, Characterization and recognition of emotions from speech using excitation source information. Int. J. Speech Technol. (Springer) 16, 181–201 (2013)

    Article  Google Scholar 

  56. K.S. Rao, B. Yegnanarayana, Duration modification using glottal closure instants and vowel onset points. Speech Commun. 51(12), 1263–1269 (2009)

    Article  Google Scholar 

  57. K.S. Rao, B. Yegnanarayana, Prosody modification using instants of significant excitation. IEEE Trans. Audio Speech Lang. Process. 14(3), 972–980 (2006)

    Article  Google Scholar 

  58. K.S. Rao, S.R.M. Prasanna, B. Yegnanarayana, Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Process. Lett. 14(10), 762–765 (2007)

    Article  Google Scholar 

  59. K.S. Rao, Unconstrained pitch contour modification using instants of significant excitation. Circuits Syst. Signal Process. (Springer) 31(6), 2133–2152 (2012)

    Article  Google Scholar 

  60. K.S. Rao, Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput. Speech Lang. 24(3), 474–494 (2010)

    Article  Google Scholar 

  61. R. Hussain Laskar, K. Banerjee, F. Ahmed Talukdar, K. Sreenivasa Rao, A pitch synchronous approach to design voice conversion system using source-filter correlation. Int. J. Speech Technol. (Springer) 15(3), 419–431 (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

© 2015 The Author(s)

About this chapter

Cite this chapter

Rao, K.S., Nandi, D. (2015). Language Identification—A Brief Review. In: Language Identification Using Excitation Source Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-17725-0_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-17725-0_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-17724-3

  • Online ISBN: 978-3-319-17725-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics