Speaker Classification by Means of Orthographic and Broad Phonetic Transcriptions of Speech

  • Chapter
Speaker Classification II

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4441)

Abstract

In this study we investigate whether a classification algorithm originally designed for authorship verification can be used to classify speakers according to their gender, age, regional background and level of education by analysing the lexical content and the pronunciation of their speech. Unlike other speaker classification techniques, our algorithm does not base its decisions on direct measurements of the speech signal; rather, it learns characteristic speech features of speaker classes by analysing orthographic and broad phonetic transcriptions of speech from members of these classes. The resulting class profiles are subsequently used to verify whether unknown speakers belong to these classes.
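
To make the idea of class profiles concrete, the following is a minimal sketch of profile-based verification on transcribed tokens. It is not the authors' algorithm: the relative-frequency features, the cosine similarity, the 0.5 threshold, and all function names (`build_profile`, `verify`, and so on) are assumptions chosen purely for illustration. The same code would apply to orthographic words or to broad phonetic symbols, since both are just token sequences.

```python
import math
from collections import Counter


def relative_frequencies(tokens):
    """Map each token to its share of the transcript (empty input gives {})."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


def build_profile(transcripts):
    """Average the relative token frequencies over all transcripts of a class."""
    profile = Counter()
    for tokens in transcripts:
        for tok, freq in relative_frequencies(tokens).items():
            profile[tok] += freq / len(transcripts)
    return dict(profile)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse frequency vectors."""
    dot = sum(value * b.get(tok, 0.0) for tok, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def verify(tokens, profile, threshold=0.5):
    """Accept the speaker as a class member if similarity reaches the threshold.

    The threshold is an arbitrary illustrative value, not a tuned setting.
    """
    return cosine_similarity(relative_frequencies(tokens), profile) >= threshold


# Toy transcripts for one hypothetical speaker class.
class_transcripts = [
    ["uh", "ja", "nou", "ja"],
    ["ja", "nou", "uh"],
]
class_profile = build_profile(class_transcripts)

print(verify(["ja", "nou", "ja"], class_profile))
print(verify(["inderdaad", "precies"], class_profile))
```

In this sketch a speaker whose transcript shares no tokens with the profile is always rejected, because the similarity is then zero. A realistic system would need far richer features and a threshold estimated on held-out data.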

Editor information

Christian Müller

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Van Bael, C., van Halteren, H. (2007). Speaker Classification by Means of Orthographic and Broad Phonetic Transcriptions of Speech. In: Müller, C. (ed.) Speaker Classification II. Lecture Notes in Computer Science (LNAI), vol 4441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74122-0_22

  • DOI: https://doi.org/10.1007/978-3-540-74122-0_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74121-3

  • Online ISBN: 978-3-540-74122-0

  • eBook Packages: Computer Science, Computer Science (R0)
