A Linear-Scaling Approach to Speaker Variability in Poly-segmental Formant Ensembles

Clermont, Frantz

doi:10.1007/978-3-540-74122-0_11

Frantz Clermont¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4441))

1210 Accesses

Abstract

A linear-scaling approach is introduced for handling acoustic-phonetic manifestations of inter-speaker differences. The approach is motivated (i) by the similarity commonly observed amongst formant-frequency patterns resulting from different speakers’ productions of the same utterance, and (ii) by the fact that there are linear-scaling properties associated with similarity. In methodological terms, formant patterns are obtained for a set of segments selected from a fixed utterance, which we call poly-segmental formant ensembles. Linear transformations of these ensembles amongst different speakers are then sought and interpreted as a set of scaling relations. Using multi-speaker data based on Australian English “hello”, it is shown that the transformations afford a significant reduction of inter-speaker dissimilarity by inverse similarity. The proposed approach is thus able to unlock regularity in formant-pattern variability from speaker to speaker, without prior knowledge of the exact causes of the speaker differences manifested in the data at hand.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Broad, D.J., Clermont, F.: Linear Scaling of Vowel Formant Ensembles (VFEs) in Consonantal Contexts. Speech Communication 37, 175–195 (2002)
Article MATH Google Scholar
Chiba, T., Kajiyama, M.: The Vowel − Its Nature and Structure. Phonetic Society of Japan, Tokyo (1958)
Google Scholar
Clermont, F., Zetterholm, E.: F-Pattern Analysis of Professional Imitations of “hallå” in three Swedish Dialects, Working Papers, Department of Linguistics and Phonetics, University of Lund 52, 25–28 (2006)
Google Scholar
Laver, J.: The Phonetic Description of Vowel Quality. Cambridge University Press, Cambridge (1980)
Google Scholar
Nolan, F.: The Phonetic Bases of Speaker Recognition. Cambridge University Press, Cambridge (1983)
Google Scholar
Ohta, K., Fuchi, H.: Vowel Constancy on Antimetrical Vocal Tract Shapes between Males and Females. Progress Report on Speech Research, Bulletin of the Electrotechnical Laboratory (Ibaraki, Japan) 48, 17–21 (1984)
Google Scholar
Paige, A., Zue, V.W.: Calculation of Vocal Tract Length. IEEE Transactions on Audio. ElectroAcoustics 18, 268–270 (1970)
Article Google Scholar
Rose, P.: Differences and Distinguishability in the Acoustic Characteristics of ”hello” in Voices of Similar-Sounding Speakers: A Forensic Phonetic Investigation. Australian Review of Applied Linguistics 22, 1–42 (1999)
Google Scholar
Wakita, H.: Normalization of Vowels by Vocal-Tract Length and its Application to Vowel Identification. IEEE Transactions on Acoustics, Speech and Signal Processing 25, 183–192 (1977)
Article Google Scholar

Download references

Author information

Authors and Affiliations

JP French Associates, Forensic Speech and Acoustics Laboratory, York, United Kingdom
Frantz Clermont

Authors

Frantz Clermont
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Christian Müller

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Clermont, F. (2007). A Linear-Scaling Approach to Speaker Variability in Poly-segmental Formant Ensembles. In: Müller, C. (eds) Speaker Classification II. Lecture Notes in Computer Science(), vol 4441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74122-0_11

Download citation

DOI: https://doi.org/10.1007/978-3-540-74122-0_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74121-3
Online ISBN: 978-3-540-74122-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics