Deploying Prerecorded Audio Description for Musical Theater Using Live Performance Tracking

Vander Wilt, Dirk; Farbood, Morwaread Mary

doi:10.1007/978-3-030-70210-6_13

Dirk Vander Wilt & Morwaread Mary Farbood

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12631)

Included in the following conference series:

International Symposium on Computer Music Multidisciplinary Research

Abstract

Audio description, an accessibility service used by blind or visually impaired individuals, provides spoken descriptions of visual content. This alternative format gives those with low or no vision access to information that sighted people obtain visually. This paper presents a method for deploying prerecorded audio description in a live musical theater environment. The method uses a reference audio recording and an online time warping algorithm to align tracks of audio description with live performances. A software implementation that integrates into an existing theatrical workflow is also described. Two evaluation experiments show that the system successfully aligns multiple recordings of works of musical theater and automatically triggers prerecorded descriptive audio in real time.
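The core idea in the abstract, following a live performance against a reference recording and firing description clips at marked positions, can be illustrated with a small sketch. This is not the authors' implementation: it is a simplified, banded online dynamic time warping tracker written for illustration only. The names `OnlineAligner`, `run_cues`, the `band` parameter, and the assumption that both recordings have already been converted to per-frame feature vectors are all hypothetical.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class OnlineAligner:
    """Simplified banded online DTW (illustrative, not the paper's algorithm).

    Keeps one accumulated-cost column per incoming live frame, restricted to
    a window of reference frames around the current position estimate.
    """

    def __init__(self, reference, band=10):
        self.reference = reference  # list of reference feature vectors
        self.band = band            # half-width of the search window (assumed)
        self.position = 0           # current estimate of the reference frame
        self.prev_cost = None       # cost column for the previous live frame

    def step(self, live_frame):
        """Consume one live frame and return the estimated reference index."""
        lo = max(0, self.position - self.band)
        hi = min(len(self.reference), self.position + self.band + 1)
        cost = {}
        for j in range(lo, hi):
            d = frame_distance(live_frame, self.reference[j])
            if self.prev_cost is None:
                # First live frame: the path has to start at reference frame 0.
                best_prev = 0.0 if j == 0 else cost.get(j - 1, math.inf)
            else:
                best_prev = min(
                    self.prev_cost.get(j, math.inf),      # live advances only
                    self.prev_cost.get(j - 1, math.inf),  # both advance
                    cost.get(j - 1, math.inf),            # reference advances only
                )
            cost[j] = d + best_prev
        self.prev_cost = cost
        # The cheapest cell in the newest column is the position estimate.
        self.position = min(cost, key=cost.get)
        return self.position


def run_cues(reference, live_stream, cues, band=10):
    """Fire each (reference_frame, clip_name) cue once the tracker reaches it.

    Returns a list of (live_frame_index, clip_name) in firing order.
    """
    aligner = OnlineAligner(reference, band)
    pending = sorted(cues)
    fired = []
    for t, frame in enumerate(live_stream):
        pos = aligner.step(frame)
        while pending and pending[0][0] <= pos:
            _, clip = pending.pop(0)
            fired.append((t, clip))
    return fired
```

In this sketch a cue placed at reference frame 5 fires later in wall-clock terms when the live performance runs slower than the reference, which is the behavior a fixed timer could not provide. A practical system would also need real feature extraction, handling of tracking errors, and audio playback, none of which is shown here.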

Author information

Authors and Affiliations

New York University, New York, NY, USA
Dirk Vander Wilt & Morwaread Mary Farbood

Corresponding author

Correspondence to Dirk Vander Wilt.

Editor information

Editors and Affiliations

Laboratoire PRISM, CNRS-AMU, Marseille, France
Richard Kronland-Martinet
Laboratoire PRISM, CNRS-AMU, Marseille, France
Sølvi Ystad
Laboratoire PRISM, CNRS-AMU, Marseille, France
Mitsuko Aramaki

About this paper

Cite this paper

Vander Wilt, D., Farbood, M.M. (2021). Deploying Prerecorded Audio Description for Musical Theater Using Live Performance Tracking. In: Kronland-Martinet, R., Ystad, S., Aramaki, M. (eds) Perception, Representations, Image, Sound, Music. CMMR 2019. Lecture Notes in Computer Science, vol 12631. Springer, Cham. https://doi.org/10.1007/978-3-030-70210-6_13

DOI: https://doi.org/10.1007/978-3-030-70210-6_13
Published: 10 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-70209-0
Online ISBN: 978-3-030-70210-6
eBook Packages: Computer Science, Computer Science (R0)
