A Review of Personality in Voice-Based Man Machine Interaction

  • Florian Metze
  • Alan Black
  • Tim Polzehl
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6762)


In this paper, we discuss state-of-the-art techniques for personality-aware user interfaces and summarize recent work on automatically recognizing and synthesizing speech with “personality”. We present an overview of personality “metrics” and show how they can be applied to the perception of voices, not only to the description of personally known individuals. We present use cases for personality-aware speech input and/or output, and discuss approaches to defining “personality” in this context. We take a middle-of-the-road approach: we do not try to uncover all fundamental aspects of personality in speech, but we also do not aim for ad-hoc solutions that serve a single purpose (for example, creating a positive attitude in a user) without generating knowledge that transfers to other interfaces.


voice user interface, paralinguistic information, speech processing





Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Florian Metze (1)
  • Alan Black (1)
  • Tim Polzehl (2)
  1. LTI, Carnegie Mellon University, Pittsburgh, USA
  2. Q&U Lab, Technische Universität Berlin, Berlin, Germany
