Skip to main content

Automatic Multilingual System from Speech

  • Conference paper
  • First Online:
Computational Intelligence in Pattern Recognition

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 999))

Abstract

Language recognition is the way by which the language of a digital speech utterance is recognized automatically by a computer. Commenced Language Identification Systems sequentially transform the speech signal into discrete units, and then apply statistical methods on the resultant units to extract their language information. Today, a large number of audio retrieval features exists for automatic speech and language recognition. The proposed method has nominated an automatic system for well-known multi-languages. The identification has been done using a new set of audio features. The suitable feature has been adopted. This includes Zero-Crossing Rate, Spectral Flux, Pitch, Mel-frequency Cepstral Coefficients, Tempo, and Short-Time Energy. These features have been used exclusively for identifying the language along with the help of classifiers and feature selection algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Garg, A., Gupta, V., Jindal, M.: A survey of language identification techniques and applications. J. Emerg. Technol. Web Intell. 6(4), 388–400 (2014)

    Google Scholar 

  2. Karpagavalli, S., Chandra, E.: A review on automatic speech recognition architecture and approaches. Int. J. Signal Process. Image Process. Pattern Recogn. 9(4), 393–404 (2016)

    Google Scholar 

  3. Grothe, L., De Luca, E.W., Nürnberger, A.: A Comparative Study on Language Identification Methods. LREC (2008)

    Google Scholar 

  4. Lakhani, V.A., Mahadev, R.: Multi-Language Identification Using Convolutional Recurrent Neural Network. arXiv preprint arXiv: 1611.04010 (2016)

    Google Scholar 

  5. Chandrasekhar, V., Sargin, M.E., Ross, D.A.: Automatic language identification in music videos with low level audio and visual features. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE (2011)

    Google Scholar 

  6. Itrat, M. et al.: Automatic Language Identification for Languages of Pakistan. Int. J. Comput. Sci. Netw. Secur. (IJCSNS) 17.2

    Google Scholar 

  7. Adami, A.G., Hermansky, H.: Segmentation of speech for speaker and language recognition. In: Eighth European Conference on Speech Communication and Technology (2003)

    Google Scholar 

  8. Dehak, N., Torres-Carrasquillo, P.A., Reynolds, D., Dehak, R.: Language recognition via i-vectors and dimensionality reduction.” In Twelfth annual conference of the international speech communication association. 2011

    Google Scholar 

  9. Yu, Chengzhu et al. “UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE (2016)

    Google Scholar 

  10. Gwon, Y.L., et al.: Language recognition via sparse coding. INTERSPEECH (2016)

    Google Scholar 

  11. Behravan, H., Kinnunen, T., Hautamäki, V.: Out-of-set i-Vector selection for open-set language identification. Odyssey 2016, 303–310 (2016)

    Article  Google Scholar 

  12. Vatanen, T., Väyrynen, J.J., Virpioja, S.: Language identification of short text segments with N-gram models. LREC (2010)

    Google Scholar 

  13. Torres-Carrasquillo, P.A., Reynolds, D.A, Deller, J.R.: Language identification using Gaussian mixture model tokenization. In: 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1. IEEE (2002)

    Google Scholar 

  14. Karpagavalli, S., Chandra, E.: A review on automatic speech recognition architecture and approaches. Int. J. Signal Process. Image Process. Pattern Recogn. 9(4), 393–404 (2016)

    Google Scholar 

  15. Lartillot, O., Toiviainen, P.: A Matlab toolbox for musical feature extraction from audio. In: International Conference on Digital Audio Effects (2007)

    Google Scholar 

  16. Hall, M., et al.: The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11.1(2009), 10–18

    Article  Google Scholar 

Download references

Acknowledgements

This chapter does not contain any studies with human participants or animals performed by any of the authors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Akshay Chatterjee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Agarwal, S., Chatterjee, A., Yasmin, G. (2020). Automatic Multilingual System from Speech. In: Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore. https://doi.org/10.1007/978-981-13-9042-5_13

Download citation

Publish with us

Policies and ethics