Skip to main content

Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNAI,volume 8655)


This paper describes the integration of an on-line Kaldi speech recogniser into the Alex Dialogue Systems Framework (ADSF). As the Kaldi OnlineLatgenRecogniser is written in C++, we first developed a Python wrapper for the recogniser so that the ADSF, written in Python, could interface with it. Training scripts for acoustic and language modelling were developed and integrated into ADSF, and acoustic and language models were build. Finally, optimal recogniser parameters were determined and evaluated. The dialogue system Alex with the new speech recogniser is evaluated on Public Transport Information (PTI) domain.


  • automatic speech recognition
  • Kaldi
  • Alex
  • dialogue systems

This is a preview of subscription content, access via your institution.

Buying options

USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Skantze, G., Schlangen, D.: Incremental dialogue processing in a micro-domain. In: Proc. ECACL, pp. 745–753 (2009)

    Google Scholar 

  2. Akinobu, L.: Open-Source Large Vocabulary CSR Engine Julius (2014),

  3. Allauzen, C., Riley, M., Schalkwyk, J., Skut, W., Mohri, M.: OpenFst: A general and efficient weighted finite-state transducer library. In: Holub, J., Žďárek, J. (eds.) CIAA 2007. LNCS, vol. 4783, pp. 11–23. Springer, Heidelberg (2007)

    CrossRef  Google Scholar 

  4. Huggins-Daines, D., Kumar, M., Chan, A., Black, A., Ravishankar, M., Rudnicky, A.: Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices. In: Proc. ICASSP, pp. I–I (December 2006)

    Google Scholar 

  5. D. Povey, M. Hannemann, G. Boulianne, L. Burget, A. Ghoshal, M. Janda, M. Karafiát, S. Kombrink, P. Motlicek, Y. Qian at al.: Generating exact lattices in the WFST framework. In Proc. ICASSP, pp. 4213–4216 (2012)

    Google Scholar 

  6. Rybach, D., Hahn, S., Lehnen, P., Nolden, D., Sundermeyer, M., Tüske, Z., Wiesler, S., Schlüter, R., Ney, H.: The RASR-The RWTH Aachen University open source speech recognition toolkit. In: Proc. IEEE Automatic Speech Recognition and Understanding Workshop (2011)

    Google Scholar 

  7. Povey, D., et al.: The Kaldi speech recognition toolkit. In: Proc. ASRU, Hawaii, US, pp. 1–4 (December 2011)

    Google Scholar 

  8. Public Transport Information System for Czech Republic,

  9. Korvas, M., Plátek, O., Dušek, O., Žilka, L., Jurčćček, F.: Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license. In: Proceedings of International Conference on Language Resources and Evaluation (to be published, 2014)

    Google Scholar 

  10. The Kaldi ASR toolkit (2014),

  11. The Alex Dialogue Systems Framework (2014),

  12. The OnlineLatgenRecogniser (2014),

  13. The pyfst library: OpenFst in Python (2014),

Download references

Author information

Authors and Affiliations


Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Plátek, O., Jurčíček, F. (2014). Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham.

Download citation

  • DOI:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10815-5

  • Online ISBN: 978-3-319-10816-2

  • eBook Packages: Computer ScienceComputer Science (R0)