Selection of Multiphone Synthesis Units and Grapheme-to-Phoneme Transcription Using Variable-Length Modeling of Strings

Deligne, Sabine; Yvon, François; Bimbot, Frédéric

doi:10.1007/978-1-4757-3413-3_6

Sabine Deligne²,
François Yvon³ &
Frédéric Bimbot⁴

Part of the book series: Telecommunications Technology & Applications Series ((TTAP))

122 Accesses

Abstract

Language can be viewed as the result of a complex encoding process which maps a message into a stream of symbols: phonemes, graphemes, morphemes, words ...depending on the level of representation. At each level of representation, specific constraints like phonotactical, morphological or grammatical constraints apply, greatly reducing the possible combinations of symbols and introducing statistical dependencies between them. Numerous probabilistic models have been developed in the area of speech and language processing to capture these dependencies. In this chapter, we explore the potentiality of the multigram model to learn variable-length dependencies in strings of phonemes and in strings of graphemes. In the multigram approach described here, a string of symbols is viewed as a concatenation of independent variable-length subsequences of symbols. The ability of the multigram model to learn relevant subsequences of phonemes is illustrated by the selection of multiphone units for speech synthesis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

IBM T. J. Watson Research Center, Yorktown Heights, USA
Sabine Deligne
Départment Informatique, ENST, Paris, France
François Yvon
IRISA, Rennes, France
Frédéric Bimbot

Authors

Sabine Deligne
View author publications
You can also search for this author in PubMed Google Scholar
François Yvon
View author publications
You can also search for this author in PubMed Google Scholar
Frédéric Bimbot
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Image, Speech and Intelligent Systems (ISIS) Research Group, Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, Southampton, UK
Robert I. Damper

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Deligne, S., Yvon, F., Bimbot, F. (2001). Selection of Multiphone Synthesis Units and Grapheme-to-Phoneme Transcription Using Variable-Length Modeling of Strings. In: Damper, R.I. (eds) Data-Driven Techniques in Speech Synthesis. Telecommunications Technology & Applications Series. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3413-3_6

Download citation

DOI: https://doi.org/10.1007/978-1-4757-3413-3_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4733-8
Online ISBN: 978-1-4757-3413-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics