Speaker Recognition Using MFCC and Hybrid Model of VQ and GMM

  • Dhruv Desai
  • Maulin Joshi
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 235)


Speaker recognition is widely used for automatic authentication of speaker’s identity based on human biological features. Speaker recognition extracts, characterizes and recognizes the information about speaker identity. For feature extraction and speaker modeling many algorithms are being used. In this paper, we have proposed speaker recognition system based on hybrid approach using Mel Frequency Cepstrum Coefficient (MFCC) as feature extraction and combination of vector quantization (VQ) and Gaussian Mixture Modeling (GMM) for speaker modeling. Our approach is able to recognize speaker for both text dependent and text independent speech and uses relative index as confidence measures in case of contradiction in recognition process by GMM and VQ. Simulation results highlight the efficacy of proposed method compared to earlier work.


Feature Extraction Feature Matching Mel Frequency Cepstral Coefficient (MFCC) Gaussian mixture modeling 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Hai, J., Joo, E.M.: Improved linear predictive coding method for speech recognition. In: Information, Communications and Signal Processing and Fourth Pacific Rim Conference on Multimedia. Proceedings of the Joint Conference of the Fourth International Conference, vol. 3, pp. 1614–1618 (2003)Google Scholar
  2. 2.
    Muda, L., Begam, M., Elamvazuthi, I.: Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing 2(3), 138–141 (2010)Google Scholar
  3. 3.
    Hasan, R., Jamil, M., Rahman, G.R.S.: Speaker Identification Using Mel Frequency Cepstral Coefficients. In: 3rd International Conference on Electrical & Computer Engineering ICECE, pp. 565–568 (2004)Google Scholar
  4. 4.
    Tiwari, V.: MFCC and its applications in speaker recognition. International Journal on Emerging Technologies 1, 19–22 (2010)Google Scholar
  5. 5.
    Shende, A., Mishra, S., Kumar, S.: Comparison of Different Parameters Used In GMM Based Automatic Speaker Recognition. International Journal of Soft Computing and Engineering (IJSCE) 1(3), 14–18 (2011) ISSN: 2231-2307Google Scholar
  6. 6.
    Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing 3, 72–83 (1995)CrossRefGoogle Scholar
  7. 7.
    Bagul, S.G., Shastri, R.K.: Text Independent Speaker Recognition System using GMM. International Journal of Scientific and Research Publications 2(10), 1–5 (2012)Google Scholar
  8. 8.
    Jayana, H.S., Mahadeva Prasana, S.R.: Analysis, Feature Extraction, Modeling and Testing Techniques for Speaker Recognition. International Journal of Institution of Electronics and Telecommunication Engineers (IETE ) 26(3), 181–190 (2009)Google Scholar
  9. 9.
    Kumar, P., Jakhanwal, N., Chandra, M.: Text Dependent Speaker Identification in Noisy Environment. In: International Conference on Device and Communication (ICDeCom), pp. 1–4 (2011)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. 1.Department of Electronics enginneringSarvajanik College of Engineering and TechnologySuratIndia
  2. 2.Department of Electronics & CommunicationSarvajanik College of Engineering and TechnologySuratIndia

Personalised recommendations