Skip to main content

Large Vocabulary Speech Recognition: Speaker Dependent and Speaker Independent

  • Conference paper
  • First Online:
Information Systems Design and Intelligent Applications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 339))

  • 1693 Accesses

Abstract

This paper addresses the problem of large vocabulary isolated word and continuous Kannada speech recognition using the syllables and combination of Hidden Markov Model (HMM) and Normal fit method. The models designed for speaker dependent and speaker independent mode of working. This experiment has covered 6 million words among the 10 million words from Hampi text corpus. Here 3-state Baum–Welch algorithm is used for training. For the 2 successor outputted λ(A, B, pi) is combined and passed into normal fit, the outputted normal fit parameter is labeled has syllable or sub-word. In terms of memory requirement and recognition rate the proposed model is compared with Gaussian Mixture Model and HMM (3-state Baum–Welch algorithm). This paper clearly shows that combination of HMM and normal fit technique will reduce the memory size while building and storing the speech models and works with excellent recognition rate. The average WRR is 91.22 % and average WER is 8.78 %. All computations are done using mat lab.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Swamy, P.P., Guru, D.S. (eds.): Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol. 213, pp. 333–345. Springer, India. doi:10.1007/978-81-322-1143-3_27 (2013)

    Google Scholar 

  2. Hemakumar, G., Punitha, P.: Speech recognition technology: a survey on Indian languages. Int. J. Inf. Sci. Intell. Syst. 2(4), 1–38 (2013)

    Google Scholar 

  3. Gaikwad, S.K., et al.: A review on speech recognition technique. Int. J. Comput. Appl. (0975–8887) 10(3), 16–24 (2010)

    Google Scholar 

  4. Rabiner, L., Jung, B.-H.: Fundamentals of Speech Recognition. Pearson Education, Singapore (1993)

    Google Scholar 

  5. Hemakumar, G., Punitha, P.: Automatic segmentation of Kannada speech signal into syllables and sub-words: noised and noiseless signals. Int. J. Sci. Eng. Res. 5(1), 1707–1711 (2013)

    Google Scholar 

  6. Mat lab R2009a help menu on statistics toolbox online: www.mathworks.com/help/

  7. http://en.wikipedia.org/wiki/normal_distribution#cite_note-kri127-33

  8. http://www.mathworks.in/help/stats/statset.html

  9. David, D.: Expectation-Maximization: Application to Gaussian Mixture Model Parameter Estimation. Lecture Notes Published on April 23 (2009)

    Google Scholar 

  10. Rabiner, L.R., et al.: Speaker-independent recognition of isolated words using clustering techniques. IEEE Trans. Acoust. Speech Sig. Process. 27(4), 336–349 (1979)

    Article  MATH  Google Scholar 

  11. Carlo, T.: Estimating Gaussian Mixture Densities with EM—A Tutorial. Duke University

    Google Scholar 

  12. Dimov, D., Azmanov, I.: Experimental specifics of using HMM in isolated word speech recognition. In: International Conference on Computer Systems and Technologies—CompSysTech (2005)

    Google Scholar 

  13. Grewal, S.S., Kumar, D.: Isolated word recognition system for English language. Int. J. Inf. Technol. Knowl. Manag. 2(2), 447–450 (2010)

    Google Scholar 

  14. Nandyala, S.P., Kishore Kumar, T.: Real time isolated word recognition using adaptive algorithm. In: International Conference on Industrial and Intelligent Information (ICIII 2012), IPCSIT, vol. 31 © (2012). IACSIT Press, Singapore (2012)

    Google Scholar 

  15. Revathi, A., et al.: Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach. Int. J. Comput. Sci. Inf. Technol. (IJCSIT) 1(2), 30–40 (2009)

    Google Scholar 

  16. Das, B.P., Parekh, R.: Recognition of isolated words using features based on LPC, MFCC, ZCR and STE, with neural network classifiers. Int. J. Mod. Eng. Res. (IJMER) 2(3), 854–858 (2012)

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank for Bharathiar University for giving an opportunity to pursuing part-time Ph.D. degree. Authors would like to thanks for all our friends, reviewers and Editorial staff for their help during preparation of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to G. Hemakumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer India

About this paper

Cite this paper

Hemakumar, G., Punitha, P. (2015). Large Vocabulary Speech Recognition: Speaker Dependent and Speaker Independent. In: Mandal, J., Satapathy, S., Kumar Sanyal, M., Sarkar, P., Mukhopadhyay, A. (eds) Information Systems Design and Intelligent Applications. Advances in Intelligent Systems and Computing, vol 339. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2250-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-2250-7_8

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-2249-1

  • Online ISBN: 978-81-322-2250-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics