Abstract
A linear-scaling approach is introduced for handling acoustic-phonetic manifestations of inter-speaker differences. The approach is motivated (i) by the similarity commonly observed amongst formant-frequency patterns resulting from different speakers’ productions of the same utterance, and (ii) by the fact that there are linear-scaling properties associated with similarity. In methodological terms, formant patterns are obtained for a set of segments selected from a fixed utterance, which we call poly-segmental formant ensembles. Linear transformations of these ensembles amongst different speakers are then sought and interpreted as a set of scaling relations. Using multi-speaker data based on Australian English “hello”, it is shown that the transformations afford a significant reduction of inter-speaker dissimilarity by inverse similarity. The proposed approach is thus able to unlock regularity in formant-pattern variability from speaker to speaker, without prior knowledge of the exact causes of the speaker differences manifested in the data at hand.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Broad, D.J., Clermont, F.: Linear Scaling of Vowel Formant Ensembles (VFEs) in Consonantal Contexts. Speech Communication 37, 175–195 (2002)
Chiba, T., Kajiyama, M.: The Vowel − Its Nature and Structure. Phonetic Society of Japan, Tokyo (1958)
Clermont, F., Zetterholm, E.: F-Pattern Analysis of Professional Imitations of “hallå” in three Swedish Dialects, Working Papers, Department of Linguistics and Phonetics, University of Lund 52, 25–28 (2006)
Laver, J.: The Phonetic Description of Vowel Quality. Cambridge University Press, Cambridge (1980)
Nolan, F.: The Phonetic Bases of Speaker Recognition. Cambridge University Press, Cambridge (1983)
Ohta, K., Fuchi, H.: Vowel Constancy on Antimetrical Vocal Tract Shapes between Males and Females. Progress Report on Speech Research, Bulletin of the Electrotechnical Laboratory (Ibaraki, Japan) 48, 17–21 (1984)
Paige, A., Zue, V.W.: Calculation of Vocal Tract Length. IEEE Transactions on Audio. ElectroAcoustics 18, 268–270 (1970)
Rose, P.: Differences and Distinguishability in the Acoustic Characteristics of ”hello” in Voices of Similar-Sounding Speakers: A Forensic Phonetic Investigation. Australian Review of Applied Linguistics 22, 1–42 (1999)
Wakita, H.: Normalization of Vowels by Vocal-Tract Length and its Application to Vowel Identification. IEEE Transactions on Acoustics, Speech and Signal Processing 25, 183–192 (1977)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Clermont, F. (2007). A Linear-Scaling Approach to Speaker Variability in Poly-segmental Formant Ensembles. In: Müller, C. (eds) Speaker Classification II. Lecture Notes in Computer Science(), vol 4441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74122-0_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-74122-0_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74121-3
Online ISBN: 978-3-540-74122-0
eBook Packages: Computer ScienceComputer Science (R0)