Efficiency of Speech Alignment for Semi-automated Subtitling in Dutch

  • Conference paper
Text, Speech and Dialogue (TSD 2011)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6836)

Abstract

This paper describes the use of speech alignment to aid the subtitling of Dutch TV programs. The recognizer aligns the audio stream with an existing transcript, so the goal is not to transcribe the speech but to generate the correct timing of every word. The system performs subtasks such as audio segmentation, transcript preprocessing, alignment and subtitle compression. The result is not perfect, but it is good enough to gain efficiency when a professional subtitler uses it as a starting point to refine and finalize the subtitles. In our tests, considerable time savings of 47 to 53% on average are obtained, such that the time needed to generate subtitles for a 1-hour program is lowered from between 4 and 7 hours to between 2.5 and 4 hours. This is all the more important given the increasing pressure from user groups on governments and broadcasters to subtitle 100% of TV programs.
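As a rough illustration of how per-word timings can serve as a starting point for subtitles, the sketch below groups already-aligned words into subtitle cues. It is not the system described in the paper: the `Word` and `Cue` types, the `group_words` and `srt_time` helpers, and the `max_chars` and `max_gap` thresholds are all hypothetical, and the word timings are assumed to come from some external aligner.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    text: str
    start: float  # start time in seconds (assumed to come from an aligner)
    end: float    # end time in seconds


@dataclass
class Cue:
    start: float
    end: float
    text: str


def group_words(words: List[Word], max_chars: int = 37,
                max_gap: float = 1.0) -> List[Cue]:
    """Group aligned words into cues (hypothetical thresholds).

    A new cue starts when adding the next word would exceed max_chars,
    or when the pause before it is longer than max_gap seconds.
    """
    cues: List[Cue] = []
    current: List[Word] = []
    for w in words:
        if current:
            candidate = " ".join(x.text for x in current) + " " + w.text
            gap = w.start - current[-1].end
            if len(candidate) > max_chars or gap > max_gap:
                cues.append(Cue(current[0].start, current[-1].end,
                                " ".join(x.text for x in current)))
                current = []
        current.append(w)
    if current:
        cues.append(Cue(current[0].start, current[-1].end,
                        " ".join(x.text for x in current)))
    return cues


def srt_time(t: float) -> str:
    """Format seconds as an SRT-style timestamp HH:MM:SS,mmm."""
    ms = int(round(t * 1000))
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


if __name__ == "__main__":
    # Made-up timings for illustration only.
    words = [Word("goede", 0.0, 0.4), Word("avond", 0.45, 0.9),
             Word("welkom", 2.5, 3.0)]
    for i, cue in enumerate(group_words(words), start=1):
        print(i)
        print(f"{srt_time(cue.start)} --> {srt_time(cue.end)}")
        print(cue.text)
        print()
```

In a workflow like the one the abstract describes, cues produced this way would only be a draft; the subtitler would still refine the text and timing by hand.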


Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wambacq, P., Demuynck, K. (2011). Efficiency of Speech Alignment for Semi-automated Subtitling in Dutch. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science, vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_16

  • DOI: https://doi.org/10.1007/978-3-642-23538-2_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23537-5

  • Online ISBN: 978-3-642-23538-2

  • eBook Packages: Computer Science, Computer Science (R0)
