Combination of pitch synchronous analysis and fisher criterion for speaker identification

Zeng, Yumin; Wu, Zhenyang

doi:10.1007/s11767-007-0034-z

Combination of pitch synchronous analysis and fisher criterion for speaker identification

Published: November 2007

Volume 24, pages 828–834, (2007)
Cite this article

Journal of Electronics (China)

Zeng Yumin^1,2 &
Wu Zhenyang¹

32 Accesses
Explore all metrics

Abstract

A novel text independent speaker identification system is proposed. In the proposed system, the 12-order perceptual linear predictive cepstrum and their delta coefficients in the span of five frames are extracted from the segmented speech based on the method of pitch synchronous analysis. The Fisher ratios of the original coefficients then be calculated, and the coefficients whose Fisher ratios are bigger are selected to form the 13-dimensional feature vectors of speaker. The Gaussian mixture model is used to model the speakers. The experimental results show that the identification accuracy of the proposed system is obviously better than that of the systems based on other conventional coefficients like the linear predictive cepstral coefficients and the Mel-frequency cepstral coefficients.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

M. Faundez-Zanuy and E. Monte-Moreno. State-of-the-art in speaker recognition. IEEE A&E Systems Magazine, 5(2005), 7–12.
Article Google Scholar
J. Campbell. Speaker recognition: a tutorial. Proceedings of the IEEE, 85(1997)9, 1437–1462.
Article Google Scholar
F. Bimbot, J. Bonastre, C. Fredouille, et al. A tutorial on text-independent speaker verification. EURASIP Journal on Applied Signal Processing, 4(2004), 430–451.
Article Google Scholar
H. Hermansky. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustic Society of America, 87(1994)4, 1738–1752.
Article Google Scholar
T. Quatieri, R. Dunn, and D. Reynolds. On the influence of rate, pitch, and spectrum on automatic speaker recognition performance. Proceedings of International Conference on Spoken Language Processing (ICSLP’2000), Beijing, China, Oct. 16–20, 2000, vol.2, 491–494.
G. Doddington, M. Przybocki, and A. Martin, et al. The NIST speaker recognition evaluation-Overview, methodology, system, results, perspective. Speech Communication, 31(2002)2/3, 225–254.
Google Scholar
Y. J. Kim and J. H. Chung. Pitch synchronous cepstrum for robust speaker recognition over telephone channels. IEE Electronics Letters, 40(2004)3, 207–209.
Article Google Scholar
S. Chen and H. Wang. Improvement of speaker recognition by combining residual and prosodic features with acoustic features. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’2004), Montreal, Canada, May 17–21, 2004, vol.1, 93–96.
K. Rama and B. Yegnanarayana. Combining evidence from residual phase and MFCC features for speaker recognition. IEEE Signal Processing Letters, 13(2006)1, 52–55.
Article Google Scholar
L. Rabiner, M. Cheng, and A. Rosenberg, et al. A comparative study of several pitch detection algorithms. IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-24(1976)5, 399–417.
Article Google Scholar
J. Wolf. Efficent acoustic parameters for speaker recognition. The Journal of the Acoustic Society of America, 51(1971)6, 2044–2056.
Article Google Scholar
N. Kanedera, T. Arai, and H. Hermansky, et al. On the importance of various modulation frequencies for speech recognition. Proceedings of 5th European Conference on Speech Communication and Technology (EUROSPEECH’97), Rhodes, Greece, Sept. 22–25, 1997, 1079–1082.
D. Reynolds and R. Rose. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. on Speech and Audio Processing, 3(1995)1, 72–83.
Article Google Scholar
NoiseX92 noise database. Http://spib.rice.edu/spib/select_noise.html, Nov. 15, 2002.

Download references

Author information

Authors and Affiliations

School of Information Science and Engineering, Southeast University, Nanjing, 210096, China
Zeng Yumin & Wu Zhenyang
School of Physics and Technology, Nanjing Normal University, Nanjing, 210097, China
Zeng Yumin

Authors

Zeng Yumin
View author publications
You can also search for this author in PubMed Google Scholar
Wu Zhenyang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zeng Yumin.

About this article

Cite this article

Zeng, Y., Wu, Z. Combination of pitch synchronous analysis and fisher criterion for speaker identification. J. Electron.(China) 24, 828–834 (2007). https://doi.org/10.1007/s11767-007-0034-z

Download citation

Received: 10 February 2007
Revised: 15 March 2007
Issue Date: November 2007
DOI: https://doi.org/10.1007/s11767-007-0034-z

Key words

CLC index

TN912.3

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Combination of pitch synchronous analysis and fisher criterion for speaker identification

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey on automatic speech recognition using neural networks

Chinese dialect speech recognition: a comprehensive survey

Milestones in speaker recognition

References

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Key words

CLC index

Navigation

Combination of pitch synchronous analysis and fisher criterion for speaker identification

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey on automatic speech recognition using neural networks

Chinese dialect speech recognition: a comprehensive survey

Milestones in speaker recognition

References

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Share this article

Key words

CLC index

Search

Navigation