A Model of Timing for Nonsegmental Phonological Structure

Local, John; Ogden, Richard

doi:10.1007/978-1-4612-1894-4_9

John Local &
Richard Ogden

287 Accesses
39 Citations

Abstract

Usually the problem of timing in speech synthesis is construed as the search for appropriate algorithms for altering durations of speech units under various conditions (e.g., stressed versus unstressed syllables, final versus non-final position, nature of surrounding segments). This chapter proposes a model of phonological representation and phonetic interpretation based on Firthian prosodic analysis [Fir57], which is instantiated in the YorkTalk speech generation system. In this model timing is treated as part of phonetic interpretation and not as an integral part of phonological representation. This leads us to explore the possibility that speech rhythm is the product of relationships between abstract constituents of linguistic structure of which there is no single optimal distinguished unit.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

D. Abercrombie. Syllable quantity and enclitics in English. In Honour of Daniel Jones, D. Abercrombie, D. B. Fry, P. A. D. MacCarthy, N. C. Scott and J. L. Trim, eds. Longman Green, London, 216–222, 1964.
Google Scholar
C. P. Browman and L. M. Goldstein. Towards an articulatory phonology. Phonololgy Yearbook 3:219–252, 1989.
Google Scholar
W. N. Campbell and S. D. Isard. Segment durations in a syllable frame. J. Phonetics 19:37–47, 1991.
Google Scholar
J. C. Carnochan. Gemination in Hausa. In Studies in Linguistic Analysis, Special Volume of the Philological Society, 2nd edition, 49–81, 1957.
Google Scholar
N. Chomsky and M. Halle. The Sound Pattern of English. Harper & Row: New York, 1968.
Google Scholar
C. H. Coker, N. Umeda, and C. P. Browman. Automatic synthesis from ordinary English text. IEEE Transactions on Audio and Electroacoustics, AU-21, 3:293–298, 1973.
Article Google Scholar
J. C. Coleman. The phonetic interpretation of headed phonological structures containing overlapping constituents. Phonology Yearbook 9(1):1–44, 1992.
Article Google Scholar
J. C. Coleman. Polysyllabic words in the YorkTalk synthesis system. In Papers in Laboratory Phonology HI, P. Keating, ed. Cambridge University Press, 293–324, 1993.
Google Scholar
J. R. Firth. A synopsis of Linguistic Theory. In Studies in Linguistic Analysis, Special Volume of the Philological Society, 2nd edition, 1–32, 1957.
Google Scholar
C. A. Fowler. Coarticulation and theories of extrinsic timing. Journal of Phonetics 8:113–133, 1980.
Google Scholar
C. A. Fowler. A relationship between coarticulation and compensatory shortening. Phonetica 38:35–50, 1981.
Article Google Scholar
C. A. Fowler. Converging sources of evidence for spoken and pereived rhythms of speech: Cyclic production of vowels in sequences of monosyllabic stress feet. Journal of Experimental Psychology: General 112:386–412, 1983.
Article Google Scholar
E. J. A. Henderson. Prosodies in Siamese. Asia Major 1:198–215, 1949.
Google Scholar
E. J. A. Henderson. The phonology of loanwords in some South-East Asian languages. Transactions of the Philological Society 131–158, 1952.
Google Scholar
J. Kelly. Swahili phonologcal structure: A prosodic view. In Le Swahili et ses Limites, M. F. Rombi, ed. Editions Recherche sur les Civilisations, Paris, 25–31, 1989.
Google Scholar
J. Kelly. Systems for open syllabics in North Welsh. In Studies in Systemic Phonology, P. Tench, ed. Pinter Publishers, London and New York, 87–97, 1992.
Google Scholar
M. Kenstowicz. Phonology in Generative Grammar. Basil Blackwell, Oxford, 1994.
Google Scholar
D. H. Klatt. Klattalk: The conversion of English text to speech. Unpublished manuscript, Massachusetts Institute of Technology, Cambridge, MA.
Google Scholar
D. H. Klatt. Review of text-to-speech conversion for English. Journal of the Acoustical Society of America 82(3):737–793, 1987.
Article Google Scholar
B. Lindblom and K. Rapp. Some temporal regularities of spoken Swedish. Papers in Linguistics from the University of Stockholm 21:1–59, 1973.
Google Scholar
J. K. Local. Some rhythm, resonance and quality variations in urban Tyneside speech. In Studies in the Pronunciation of English: A Commemorative Volume in Honour of A C Gimson, S. Ramsaren, ed. Routledge, London, 286–292, 1990.
Google Scholar
J. K. Local. Modelling assimilation in a non-segmental rule-free phonology. In Papers in Laboratory Phonology II, G. J. Docherty and D. R. Ladd, eds. CUP, Cambridge, 190–223, 1992.
Google Scholar
J. K. Local and R. A. Ogden. Temporal exponents of word-structure in English. York Research Papers in Linguistics. YLLS/RP 1994.
Google Scholar
S. Y. Manuel, S. Shattuck-Hufnagel, M. Huffman, K. N. Stevens, R. Carlson, and S. Hunnicutt. Studies of vowel and consonant reduction. In Proceedings of ICSLP 2:943–946, 1992.
Google Scholar
R. A. Ogden. Parametric interpretation in YorkTalk. York Papers in Linguistics 16:81–99, 1992.
Google Scholar
R. A. Ogden. European Patent Application 93307872.7 — YorkTalk. 1993.
Google Scholar
B. H. Partee. Compositionality. In Varieties of Formal Semantics, F. Landman and F. Veltman, eds. Foris, Dordrecht, 281–312, 1984.
Google Scholar
M. D. Riley. Tree-based modeling for speech synthesis. In Talking Machines: Theories, Models, and Designs, G. Bailly and C. Benoit, eds. Elsevier, North-Holland, Amsterdam, 265–273, 1992.
Google Scholar
A. Simpson. The phonologies of the English auxiliary system. In Who Climbs the Grammar Tree? R. Tracy, ed. Niemeyer, Tuebingen, 209–219, 1992.
Google Scholar
C. L. Smith. Prosodic patterns in the coordination of vowel and consonant gestures. Paper given at the Fourth Laboratory Phonology Meeting, Oxford, August, 1993.
Google Scholar
R. K. Sprigg. Vowel harmony in Lhasa Tibetan: Prosodic analysis applied to interrelated vocalic features of successive syllables. Bulletin of the School of Oriental and African Studies 24:116–138, 1966.
Article Google Scholar
J. P. H. van Santen. Deriving text-to-speech durations from natural speech. In Talking Machines: Theories, Models, and Designs, G. Bailly and C. Benoit, eds. Elsevier, North-Holland, Amsterdam, 275–285, 1992.
Google Scholar
J. P. H. van Santen. Assignment of segmental duration in text-to-speech synthesis. Computer Speech & Language 8:95–128, 1994.
Article Google Scholar
J. P. H. van Santen, J. Coleman, and M. Randolph. Effects of postvocalic voicing on the time course of vowels and diphthongs. Journal of the Acoustical Society of America 92:2444, 1992.
Google Scholar
D. Wheeler. Aspects of a Categorial Theory of Phonology. Graduate Linguistics Student Association, University of Massachusetts at Amherst, 1981.
Google Scholar
K. Wiik. On a third type of speech rhythm: Foot timing. In Proceedings of the Twelfth International Congress of Phonetic Sciences, Aix-en-Provence, 3:298–301, 1991.
Google Scholar

Download references

Authors

John Local
View author publications
You can also search for this author in PubMed Google Scholar
Richard Ogden
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bell Laboratories Room 2D-452, 600 Mountain Avenue, Murray Hill, NJ, 07974-0636, USA
Jan P. H. van Santen
Bell Laboratories Room 2D-447, 600 Mountain Avenue, Murray Hill, NJ, 07974-0636, USA
Joseph P. Olive
Bell Laboratories Room 2D-451, 600 Mountain Avenue, Murray Hill, NJ, 07974-0636, USA
Richard W. Sproat
AT&T Research Room 2C-409, 600 Mountain Avenue, Murray Hill, NJ, 07974-0636, USA
Julia Hirschberg

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Local, J., Ogden, R. (1997). A Model of Timing for Nonsegmental Phonological Structure. In: van Santen, J.P.H., Olive, J.P., Sproat, R.W., Hirschberg, J. (eds) Progress in Speech Synthesis. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-1894-4_9

Download citation

DOI: https://doi.org/10.1007/978-1-4612-1894-4_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4612-7328-8
Online ISBN: 978-1-4612-1894-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics