Skip to main content

Some Issues Related to Phone Recognition and Language Identification Using Phonetic Engine

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 24))

Abstract

The introduction of a phonetic engine in the literature is that it is a system which transforms a speech signal into some symbolic form. Phone recognition is a primary task of a PE. Among the various other applications of PE, one very useful application is the Language Identification (LID). This chapter discusses some issues pertaining to the use of PE in phone recognition as well as in language identification. Here, we have used PEs for three Indian Languages: Manipuri, Assamese and Bengali, in building the LID system. These languages are widely spoken in the Northeastern region of India. In the development of PEs, the International Phonetic Alphabet (IPA) symbols are used in the data transcription process. In modeling the phonetic units Hidden Markov Models (HMM) have been used in the training phase. These trained HMMs are then used in phone recognitions leading to the identification of language(s) of unknown test utterances. The overall phone recognition accuracies reported by the existing PEs for the above selected languages are \(62.11\%\) for standard Manipuri language, \(59.0\%\) for Kakching Dialect of Manipuri, \(43.28\%\) for standard Assamese and \(48.58\%\) for Bengali language. Automatic LID is possible using a set of PEs in testing unknown utterances due to the language bias of these systems. Various levels of identification rates reported in some LID tasks carried out with PEs are discussed here to look into the issues belonging to it.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Devi, Y.C.: Acoustic-Phonetic Feature based Manipuri Dialect identification: standard and Kakching Dialect. Master’s thesis, North eastern Hill University (2016)

    Google Scholar 

  2. Dutta, S.K., Nandakishore, S., Singh, L.J.: Development of language identification system using phonetic engine. In: Proceedings of I3CS’15 (2015)

    Google Scholar 

  3. Dutta, S.K., Nandakishore, S., Singh, L.J.: Development of manipuri phonetic engine and its application in language identification. IJETR 3(issue 8), 200–203 (August 2015)

    Google Scholar 

  4. Eswar, P.: A Rule-based Approach for Spotting Characters from Continuous Speech in Indian Languages. Ph.D. thesis, Department of Computer Science and Engineering, IIT Madras (1990)

    Google Scholar 

  5. Gangashetty, S.V.: Neural Network model for Recognition of Consonent-Vowel units in multiple languages. Ph.D. thesis, Department of Computer Science and Engineering, IIT Madras (2004)

    Google Scholar 

  6. Hazen, T.J., Zue, V.W.: Automatic language identification using a segment based approach. In: Eurospeech (1993)

    Google Scholar 

  7. F IIIT(Hayderabad), IIT(Kanpur), Thapar University, IIT(Guwahati), Tezpur University, North Eastern Hill University, RIT(Kottayam), DA-IICT, IIT(Hyderabad), IIT(Kharagpur): Development of prosodically guide phonetic engine for searching speech database in indian languages. Tech. rep., DEITY, Government of INDIA (2015)

    Google Scholar 

  8. International Phonetic Association (ed.): Handbook of International Phonetic Association. Cambridge University Press (1999)

    Google Scholar 

  9. Li, K.P.: Automatic language identification using syllabic spectral features. In: International conference on Acoustic, Speech and Signal processing (1994)

    Google Scholar 

  10. Lopes, C., Perdigao, F.: Phone Recogniton in TIMIT database. Prof. Ivo ipsid (2011)

    Google Scholar 

  11. Nagesh, A., Sadanadam, M.: Language identification using ergodic hidden markov model. International Journal of Advanced research in Computer Science and software Engineering 2(11), 297–301 (2012)

    Google Scholar 

  12. Rabiner, L., Juang, B.: Fundamentals of Speech Processing. Prantice Hall (1993)

    Google Scholar 

  13. Rong, T.: Automatic Speaker and Language Identification. Ph.D. thesis, Nayang Technical University (2006)

    Google Scholar 

  14. Sarma, B.D., Sarma, M., Prasanna, S.R.M.: Development of assamese phonetic engine: Some issues. In: Annual IEEE India Cnference (INDICON) (2013)

    Google Scholar 

  15. Schultz, T., Kirchoffe, K.: Multilingual Speech processing. Elsevier (2006)

    Google Scholar 

  16. Wang, L.: Automatic Spoken Language Identification. Ph.D. thesis, Faculty of Engineering, The University of New South Wales (2008)

    Google Scholar 

  17. Wong, K.Y.E.: Automatic Spoken Language Identification using Acoustic phonetic Speech information. Ph.D. thesis, School of Electrical and Electronics system Engineering, Queensland University of Technology (2001)

    Google Scholar 

  18. Yegnanarayana, B., Gangashetty, S.V.: Machine learning for speech recognition- an illustartion of phonetic engine using hidden markov models. In: Frontiers of Interface Between Statistics and Sciences (2009)

    Google Scholar 

  19. Yegnanarayana, B., Gangashetty, S.V., Rajendran, S., Murty, K.S.R., Dhananjaya, N., Guruprasad, S.: A phonetic engine for indian languages. In: ICON (2009)

    Google Scholar 

  20. Zissman, M.: Comparision of four appraches to automatic language identification of telephone speech. IEEE trans. Speech and Audio Processing 4(1), 33–44 (1996)

    Google Scholar 

Download references

Acknowledgements

The authors like to acknowledge the assistance received from the students Salam Nandakishore and Y Chandrika Devi in carrying out the experimental works.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sushanta Kabir Dutta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dutta, S.K., Singh, L.J. (2018). Some Issues Related to Phone Recognition and Language Identification Using Phonetic Engine. In: Mandal, J., Saha, G., Kandar, D., Maji, A. (eds) Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 24. Springer, Singapore. https://doi.org/10.1007/978-981-10-6890-4_28

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6890-4_28

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6889-8

  • Online ISBN: 978-981-10-6890-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics