Advertisement

Improved Phone Recognition Using Excitation Source Features

  • P. M. Hisham
  • D. Pravena
  • Y. Pardhu
  • V. Gokul
  • B. Abhitej
  • D. Govind
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 384)

Abstract

Phone recognizers serve as the preprocessing unit for speech recognition systems and phonetic engines. Even though, most of the state of the art speech recognition achieve relatively better accuracy at the sentence level, the phone level recognition performance falls way below the sentence level performance. The increased recognition rates at the sentence levels are achieved with help of refined language models used for the language under consideration. Therefore, the objective of the present work is to improve the phoneme level accuracy of the hidden markov model(HMM) based acoustic phone models by combining excitation source features with the conventional mel frequency cepstral coefficients (MFCC) for American English. TIMIT and CMU Arctic database, is used for the experiments in the present work. The average spectral energy around the zero-frequency region of each frame is used as the excitation source feature to combine with the 13 MFCC features. The effectiveness of the phoneme recognition is confirmed by a 0.5% increase in the phone recognition accuracy against the state of the art HMM-GMM acoustic models with MFCC features.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Sreejith, A., Mary, L., Riyas, K.S., Joseph, A., Augustine, A.: Automatic prosodic labeling and broad class phonetic engine for malayalam. In: Proc. Int. Conf. Control, Communication and Computing (ICCC) (2013)Google Scholar
  2. 2.
    Ghahremani, P., BabaAli, B., Povey, D., Reidhammer, K., Trmal, J., Khudanpur, S.: A pitch extraction algorithm tuned for automatic speech recognition. In: Proc. ICASSP 2014 (2014)Google Scholar
  3. 3.
    Hidden Markov Model Toolkit (HTK) Book, University of Cambridge (2003)Google Scholar
  4. 4.
    Kruger, S.E., Schaffoner, M., Katz, M., Andelic, E., Wendemuth, A.: Using support vector machines in a hmm based speech recognition system. In: Proc. SPECOM (2005)Google Scholar
  5. 5.
    Stadermann, J., Rigoll, G.: A hybrid svm/hmm acoustic modeling approach to automatic speech recognition. In: INTERSPEECH (2004)Google Scholar
  6. 6.
    Dahl, G.E., Yu, D., Dend, L., Acero, A.: Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Trans. Audio, Speech and Lang. Process. 20(1), 31–41 (2012)CrossRefGoogle Scholar
  7. 7.
    Deekshitha, G., Mary, L.: Prosodically guided phonetic engine. In: Proc. IEEE International Conference on Signal Process., Informatics Commun. and Energy Sys. (2015)Google Scholar
  8. 8.
    Murty, K.S.R., Yegnanarayana, B.: Epoch extraction from speech signals. IEEE Trans. Audio, Speech and Language Process. 16(8), 1602–1614 (2008)CrossRefGoogle Scholar
  9. 9.
    Govind, D., Prasana, S.R.M., Yegnanarayana, B.: Significance of glottal activity detection for duration modification. In: Proc. Speech Prosody (2012)Google Scholar
  10. 10.
    Murty, K.S.R., Yegnanarayana, B.: Characterization of glottal activity from speech signals. IEEE Signal Processing Letters 16(6), 469–472 (2009)CrossRefGoogle Scholar
  11. 11.
    Garafolo, J., et al.: TIMIT: Acoustic-Phonetic Continuous Speech Corpus LDC93S1. Linguistic Data Consortium (1993)Google Scholar
  12. 12.
    Kominek, J., Black, A.: CMU-Arctic speech databases. In: 5th ISCA Speech Synthesis Workshop, Pittsburgh, PA, pp. 223–224 (2004)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • P. M. Hisham
    • 1
  • D. Pravena
    • 1
  • Y. Pardhu
    • 2
  • V. Gokul
    • 2
  • B. Abhitej
    • 2
  • D. Govind
    • 1
  1. 1.Center for Excellence in Computational Engineering and NetworkingAmrita Vishwa Vidyapeetham(University)CoimbatoreIndia
  2. 2.Department of Computer Science and EngineeringAmrita Vishwa VidyapeethamKollamIndia

Personalised recommendations