Skip to main content

Data-Driven Analysis of Speech

  • Conference paper
  • First Online:
Text, Speech and Dialogue (TSD 1999)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1692))

Included in the following conference series:

Abstract

We show on results taken from recent studies from our laboratory that conventional speech analysis techniques for ASR (such as Mel cepstrum or PLP) in combination with dynamic features (such as estimates of derivatives of cepstral feature trajectories) are sub-optimal and could be improved. The improvements can be derived by employing large labeled databases which allow for studying how is the linguistic information distributed in time and in frequency as well as for a design of discrimitative spectral basis and temporal RASTA filters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. C. Avendano, S. van Vuuren and H. Hermansky. Data-based RASTA-like filter design for channel normalization in ASR. In ICSLP’96, volume 4, pages 2087–2090, Philadelphia, PA, USA, October 1996.

    Google Scholar 

  2. H. Hermansky. The modulation spectrum in automatic recognition of speech. In S. Furui, B.-H. Juang and W, Chou, editor, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, 1997.

    Google Scholar 

  3. H. Hermansky. Should recognizers have ears? In Tutorial and Research Workshop on Robust speech recognition for unknown communication channels, pages 1–10, Pont-a-Mousson, France, April 1997. ESCA-NATO.

    Google Scholar 

  4. H. Hermansky. Should recognizers have ears? Speech Communication, 25(1–3):3–27, 1998.

    Article  Google Scholar 

  5. H. Hermansky, N. Malayath. Spectral basis functions from discriminant analysis. In ICSLP’98, Sydney, Australia, 1998.

    Google Scholar 

  6. H. Yang. Personal communications.

    Google Scholar 

  7. H. Yang, S. van Vuuren and H. Hermansky. Relevancy of time-frequency features for phonetic classi_cation of phonemes. In ICASSP’99, pages 225–229, Phoenix, Arizona, 1999.

    Google Scholar 

  8. J.B. Allen. How do humans process and recognize speech? IEEE Trans. on Speech and Audio Processing, 2:567–577, 1994.

    Article  Google Scholar 

  9. S. van Vuuren and H. Hermansky. Data-driven design of RASTA-like filters. In Eurospeech’97, Rhodes, Greece, 1997. ESCA.

    Google Scholar 

  10. S. van Vuuren, T. Kamm, J. Luettin and H. Hermansky. Presentation of the 1997 summer workshop on innovative techniques for continuous speech asr. In available on the http://www.clsp.jhu.edu. Johns Hopkins University, August 1997.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hermansky, H. (1999). Data-Driven Analysis of Speech. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-48239-3_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66494-9

  • Online ISBN: 978-3-540-48239-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics