Advertisement

A Novel Method for Rapid Speaker Adaptation Using Reference Support Speaker Selection

  • Jian Wang
  • Zhen Yang
  • Jianjun Lei
  • Jun Guo
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4285)

Abstract

In this paper, we propose a novel method for rapid speaker adaptation based on speaker selection, called reference support speaker selection (RSSS). The speakers, who are acoustically close to the test speaker, are selected from reference speakers using our proposed algorithm. Furthermore, a single-pass re-estimation procedure, conditioned on the selected speakers is shown. The proposed method can quickly obtain a more optimal reference speaker subset because the selection is dynamically determined according to reference support vectors. This adaptation strategy was evaluated in a large vocabulary speech recognition task. From the experiments, we confirm the effectiveness of proposed method.

Keywords

Support Vector Machine Support Vector Target Speaker Optimal Hyperplane Maximum Likelihood Linear Regression 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gauvain, J.L., Lee, C.H.: Maximum a posterior estimation for multivariate Gaussian observations of Markov chains. IEEE Trans. Speech Audio Processing 2, 291–298 (1994)CrossRefGoogle Scholar
  2. 2.
    Legetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density HMM’s in Compute. Speech Lang. 9, 171–186 (1996)CrossRefGoogle Scholar
  3. 3.
    Sankar, A., Beaufays, F., Digalakis, V.: Training data clustering for improved speech recognition. In: Proc. Eurospeech, pp. 502–505.4 (1995)Google Scholar
  4. 4.
    Padmanabhan, M., Bahl, L.R., Nahamoo, D., Picheny, M.A.: Speaker clustering and transformation for speaker adaptation in speech recognition systems. IEEE Trans. Speech Audio Processing, January 1998, 71–77 (1998)Google Scholar
  5. 5.
    Scholkopf, B., Burges, C., Vapnik, V.: Incorporating invariances in support vector learing machines. In: Vorbrüggen, J.C., von Seelen, W., Sendhoff, B. (eds.) ICANN 1996. LNCS, vol. 1112, pp. 47–52. Springer, Heidelberg (1996)Google Scholar
  6. 6.
    Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1998)MATHGoogle Scholar
  7. 7.
    Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Knowledge Discovery Data Mining 2(2), 121–167 (1998)CrossRefGoogle Scholar
  8. 8.
    Gunn, S.R.: Support vector machines for classification and regression. Technical Report Image Speech and Intelligent Systems Research Group, University of Southampton (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jian Wang
    • 1
  • Zhen Yang
    • 1
  • Jianjun Lei
    • 1
  • Jun Guo
    • 1
  1. 1.School of Information EngineeringBeijing University of Posts and TelecommunicationsBeijingChina

Personalised recommendations