Abstract
In this study we investigate whether a classification algorithm originally designed for authorship verification can be used to classify speakers according to their gender, age, regional background and level of education by investigating the lexical content and the pronunciation of their speech. Contrary to other speaker classification techniques, our algorithm does not base its decisions on direct measurements of the speech signal; rather it learns characteristic speech features of speaker classes by analysing the orthographic and broad phonetic transcription of speech from members of these classes. The resulting class profiles are subsequently used to verify whether unknown speakers belong to these classes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
van Halteren, H.: Author Verification by Linguistic Profiling: An exploration of the parameter space. ACM Transactions on Speech and Language Processing 4(1) (2007)
Oostdijk, N.: The Design of the Spoken Dutch Corpus. In: Peters, P., Collins, P., Smith, A. (eds.) New Frontiers of Corpus Research, pp. 105–112. Rodopi, Amsterdam (2002)
Laver, J.: Principles of phonetics. Cambridge University Press, Cambridge (1995)
Van Bael, C., Boves, L., Strik, H., van den Heuvel, H.: Automatic Phonetic Transcription of Large Speech Corpora: a Comparative Study. In: Proceedings of ICSLP-Interspeech 2006, Pittsburgh PA, pp. 1085–1088 (2006)
Elffers, B., Van Bael, C., Strik, H.: ADAPT: Algorithm for Dynamic Alignment of Phonetic Transcriptions. Internal report, Department of Language & Speech, Radboud University Nijmegen, the Netherlands. Electronically (2005), available from http://lands.let.ru.nl/literature/elffers.2005.1.pdf
Binnenpoorte, D.: Phonetic Transcriptions of Large Speech Corpora. Ph.D. Dissertation. Radboud University Nijmegen, the Netherlands (2006)
Cucchiarini, C.: Phonetic Transcription: a Methodological and Empirical Study. Ph.D. Dissertation. University of Nijmegen, the Netherlands (1993)
Verhoeven, J., De Pauw, G., Kloots, H.: Speech rate in a pluricentric language: A comparison between Dutch in Belgium and the Netherlands. Language and Speech 47(3), 297–308 (2004)
Byrd, D.: Relations of Sex and Dialect to Reduction. Speech Communiciation 15, 39–54 (1994)
Henton, C.: Acoustic variability in the vowels of female and male speakers. The Journal of the Acoustical Society of America (JASA)Â 94(4), 2387 (1994)
Binnenpoorte, C., Van Bael, C., den Os, E., Boves, L.: Gender in everyday speech and language: A corpus-based study. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 2213–2216 (2005)
Verstraeten, B., van de Velde, H.: Socio-geographical variation of /r/ in standard Dutch. In: van de Velde, H., van Hout, R. (eds.) r-atics, sociolinguistic, phonetic and phonological characteristics of /r/. Etudes & Travaux - ILVP/ULB. No 4. Brussels, pp.45–61 (2001)
Hol, A.R.: Dialectgrenzen in Gelderland. In: Wingens, M.F.M., Demoed, H.B., Scholten, F.W.J. (eds.) Gelders Erfgoed, Gelders cultuurhistorisch kwartaalblad, 2006-2, pp. 11–13 (2006)
Keune, K., Ernestus, M., van Hout, R., Baayen, R.H.: Variation in Dutch: From Written MOGELIJK to Spoken MOK. Corpus Linguistics and Linguistic Theory 1(2), 183–223 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Van Bael, C., van Halteren, H. (2007). Speaker Classification by Means of Orthographic and Broad Phonetic Transcriptions of Speech. In: Müller, C. (eds) Speaker Classification II. Lecture Notes in Computer Science(), vol 4441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74122-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-74122-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74121-3
Online ISBN: 978-3-540-74122-0
eBook Packages: Computer ScienceComputer Science (R0)