Abstract
Language can be viewed as the result of a complex encoding process which maps a message into a stream of symbols: phonemes, graphemes, morphemes, words ...depending on the level of representation. At each level of representation, specific constraints like phonotactical, morphological or grammatical constraints apply, greatly reducing the possible combinations of symbols and introducing statistical dependencies between them. Numerous probabilistic models have been developed in the area of speech and language processing to capture these dependencies. In this chapter, we explore the potentiality of the multigram model to learn variable-length dependencies in strings of phonemes and in strings of graphemes. In the multigram approach described here, a string of symbols is viewed as a concatenation of independent variable-length subsequences of symbols. The ability of the multigram model to learn relevant subsequences of phonemes is illustrated by the selection of multiphone units for speech synthesis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Deligne, S., Yvon, F., Bimbot, F. (2001). Selection of Multiphone Synthesis Units and Grapheme-to-Phoneme Transcription Using Variable-Length Modeling of Strings. In: Damper, R.I. (eds) Data-Driven Techniques in Speech Synthesis. Telecommunications Technology & Applications Series. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3413-3_6
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3413-3_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4733-8
Online ISBN: 978-1-4757-3413-3
eBook Packages: Springer Book Archive