A Review of Personality in Voice-Based Man Machine Interaction
Abstract
In this paper, we will discuss state-of-the-art techniques for personality-aware user interfaces, and summarize recent work in automatically recognizing and synthesizing speech with “personality”. We present an overview of personality “metrics”, and show how they can be applied to the perception of voices, not only the description of personally known individuals. We present use cases for personality-aware speech input and/ or output, and discuss approaches at defining “personality” in this context. We take a middle-of-the-road approach, i.e. we will not try to uncover all fundamental aspects of personality in speech, but we’ll also not aim for ad-hoc solutions that serve a single purpose, for example to create a positive attitude in a user, but do not generate transferable knowledge for other interfaces.
Keywords
voice user interface paralinguistic information speech processingPreview
Unable to display preview. Download preview PDF.
References
- 1.Apple, W., Streeter, L.A., Krauss, R.M.: Effects of pitch and speech rate on personal attributions. Journal of Personality and Social Psychology 37(5), 715–727 (1979)CrossRefGoogle Scholar
- 2.Bickmore, T., Cassell, J.: Social Dialogue with Embodied Conversational Agents. In: Natural, Intelligent and Effective Interaction with Multimodal Dialogue Systems. Kluwer Academic, New York (2004)Google Scholar
- 3.Bulut, M., Lee, S., Narayanan, S.: A statistical approach for modeling prosody features using postags for emotional speech synthesis. In: Proc. ICASSP, Honolulu, HI (2007)Google Scholar
- 4.Cassell, J., Sullivan, J., Prevost, S., Churchill, E.F. (eds.): Embodied Conversational Agents. MIT Press, Cambridge (2000)Google Scholar
- 5.Catrambone, R., Stasko, J., Xiao, J.: Anthropomorphic agents as a user interface paradigm: Experimental findings and a framework for research. In: Proc. 24th Annual Conference of the Cognitive Science Society, Fairfax, USA (August 2002)Google Scholar
- 6.Chen, Y., Naveed, A., Porzel, R.: Behavior and preference in minimal personality: A study on embodied conversational agents. In: Proc. ICMI-MLMI. ACM Press, New York (2010)Google Scholar
- 7.Costa, P.T., McCrae, R.R.: Revised NEO Personality Inventory (NEO-PI-R) and NEO Five-Factor Inventory (NEO-FFI) manual. Psychological Assessment Resources (1992)Google Scholar
- 8.Costello, A.B., Osborne, J.W.: Best practices in exploratory factor analysis. Practical Assessment, Research & Evaluation 10(7) (July 2005)Google Scholar
- 9.Drapela, V.J.: A Review of Personality Theories, 2nd edn. Charles C. Thomas Publ. (1995)Google Scholar
- 10.Eide, E., Bakis, R., Hamza, W., Pitrelli, J.: Multilayered extensions to the speech synthesis markup language for describing expressiveness. In: Proc. Eurospeech, Geneva, Switzerland (2003)Google Scholar
- 11.Gill, A.J., French, R.M.: Level of Representation and Semantic Distance: Rating Author Personality from Texts. In: Proc. Euro Cogsci, Delphi, Greece (2007)Google Scholar
- 12.Goldberg, L.R.: The structure of phenotypic personality traits. American Psychologist 48, 26–34 (1993)CrossRefGoogle Scholar
- 13.Hunt, A., Black, A.: Unit selection in a concatenative speech synthesis system using a large speech database. In: Proc. ICASSP, Atlanta, Georgia, vol. 1 (1996)Google Scholar
- 14.Jin, Q., Toth, A., Black, A., Schultz, T.: Is voice transformation a threat to speaker identification? In: Proc. ICASSP, Las Vegas, USA, NV (2008)Google Scholar
- 15.Mairesse, F., Walker, M.A., Mehl, M.R., Moore, R.K.: Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text. Journal of Artificial Intelligence Research (JAIR) 30, 457–500 (2007)MATHGoogle Scholar
- 16.Nass, C., Brave, S.: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship. MIT Press, Cambridge (2005)Google Scholar
- 17.Nass, C., Lee, K.M.: Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction. Journal of Experimental Psychology: Applied 7, 171–181 (2001)Google Scholar
- 18.Nass, C., Moon, Y., Fogg, B., Reeves, B., Dryer, D.C.: Can computer personalities be human personalities? International J. of Human-Computer Studies 43(2), 223–239 (1995)CrossRefGoogle Scholar
- 19.Oberlander, J., Gill, A.J.: Individual Differences and Implicit Language: Personality, Parts-of-Speech and Pervasiveness. In: Proc. Cogsci, Chicago, IL, USA (2004)Google Scholar
- 20.Pentland, A.: Social signal processing. IEEE Signal Proc. Magazine 24(4), 108–111 (2007)CrossRefGoogle Scholar
- 21.Picard, R.W.: Affective Computing (1995)Google Scholar
- 22.Polzehl, T., Möller, S., Metze, F.: Automatically assessing acoustic manifestations of personality in speech. In: Proc. SLT Workshop. IEEE, Berkeley (2010)Google Scholar
- 23.Polzehl, T., Schmitt, A., Metze, F., Wagner, M.: Anger recognition in speech using acoustic and linguistic cues. Speech Communication, Special Issue on Sensing Emotion and Affect - Facing Realism in Speech Processing (2011)Google Scholar
- 24.Reeves, B., Nass, C.: The Media Equation: How People Treat Computers, Television, and New Media like Real People and Places. Cambridge University Press, Cambridge (1996)Google Scholar
- 25.Ryckman, R.M.: Theories of Personality. Thomson/Wadsworth, Belmont CA (2004)Google Scholar
- 26.Scherer, K.R., Scherer, U.: Speech Behavior and Personality. Speech Evaluation in Psychiatry, 115–135 (1981)Google Scholar
- 27.Schuller, B., Steidl, S., Batliner, A.: The INTERSPEECH 2009 emotion challenge. In: Proc. INTERSPEECH, ISCA, Brighton, UK (September 2009)Google Scholar
- 28.Syrdal, A., Conkie, A., Kim, Y., Beutnagel, M.: Speech acts and dialog TTS. In: Proc. SSW 7, Keihanna, Japan (2010)Google Scholar
- 29.Türk, O., Schröder, M.: Evaluation of expressive speech synthesis with voice conversion and copy re-synthesis techniques. IEEE Trans. on ASLP 18(5), 965–973 (2010)Google Scholar
- 30.Witten, I.H., Frank, E., Trigg, L., Hall, M., Holmes, G., Cunningham, S.J.: Weka: Practical machine learning tools and techniques with java implementations (1999)Google Scholar
- 31.Zen, H., Tokuda, K., Black, A.: Statistical parametric speech synthesis. Speech Communication 51(11), 1059–1064 (2009)CrossRefGoogle Scholar