Abstract
Text-to-speech alignment is the alignment of a textual transcript to an audio stream. The computed synchronization data is a mapping from words in the text transcript to temporal intervals in the audio. This alignment provides a basic tool that facilitates many applications and has wide-spread general applicability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer Science+Business Media New York
About this chapter
Cite this chapter
Owen, C.B., Makedon, F. (1999). Text-to-Speech Alignment. In: Computed Synchronization for Multimedia Applications. The Springer International Series in Engineering and Computer Science, vol 513. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-4830-7_6
Download citation
DOI: https://doi.org/10.1007/978-1-4757-4830-7_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5093-2
Online ISBN: 978-1-4757-4830-7
eBook Packages: Springer Book Archive