International Conference on Text, Speech and Dialogue

TSD 2005: Text, Speech and Dialogue pp 342-347

Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition

  • Roman Jarina
  • Michal Kuba
  • Martin Paralic
Conference paper

DOI: 10.1007/11551874_44

Volume 3658 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Jarina R., Kuba M., Paralic M. (2005) Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition. In: Matoušek V., Mautner P., Pavelka T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science, vol 3658. Springer, Berlin, Heidelberg

Abstract

HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of the proposed model is a massive reduction of speech features used for recognition what lessens computational and memory requirements, so it may be favourable for limited-power ASR applications. Experiments on isolated Slovak digits recognition task show that the method gives comparable results as the conventional MFCC approach. For speech degraded by additive white noise, it reaches better performance than the MFCC method.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Roman Jarina
    • 1
  • Michal Kuba
    • 1
  • Martin Paralic
    • 1
  1. 1.Department of TelecommunicationsUniversity of ZilinaZilinaSlovakia