Abstract
This paper reviews what is currently known about voice identification by human listeners. Our own experimental data from a four-year research program into this topic is used to elucidate, support, and in some cases to contradict published work into the effects on voice identification of such factors as speech sample size and quality, voice disguise, delay in holding voice identification sessions, incidental as opposed to intentional memory for voices, the effects of the age of the witness, training in specific modes of encoding voices, and the relationship between objective accuracy and subjective feelings of certainty of correctness. It is concluded that the caution and suspicion currently accorded to visual identification must be extended also, and perhaps more so, to voice identification.
Similar content being viewed by others
Reference Notes
Williams, C.E.The effects of selected factors on the aural identification of speakers. (Report ESD-TDR-65-153), Air Force Systems Command, Hanscom Field, 1964 (unpublished).
Clarke, F.R., Becker, R.W., & Nixon, J.C.Characteristics that determine speaker recognition. (Report ESD-TR-66-636), Electronics Systems Division, Air Force Systems Command, Hanscom Field, December, 1966.
Haggard, M., & Summerfield, Q.Sample size and perceptual parameters in speaker verification by human listeners. Manuscript submitted for publication, 1980.
Hollien, H., & McGlone, R.E.An evaluation of the “voice print” technique of speaker identification. Proceedings: Canadian Conference on Crime Counter-Measures, 1976, pp 39–45.
Clifford, B.R., & Denot, H.Visual and verbal testimony and identification under conditions of stress. Manuscript submitted for publication, 1980.
Clifford, B.R., & McCardle, G.Memory for voices. Manuscript submitted for publication, 1980.
Truby, H.M.Voice recognition by man, animal and machine. Seventh International Congress of the Phonetic Sciences. Montreal Proceedings, 1972.
Nerbonne, G.P.The identification of speaker characteristics on the basis of aural cues. Unpublished Ph.D. dissertation, Michigan State University, Ann Arbor, Michigan, 1967.
Abrams, A.S.Auditory cues and racial identification. Paper presented at the Annual Convention of the American Speech and Hearing Association, Washington, D.C., November 21–24, 1975.
Bryden, J.D.An acoustic and social dialect analysis of perceptual variables in listener identification and rating of negro speakers. Unpublished Doctoral dissertation, University of Virginia, 1968.
Ryan, W.J., & Burk, K.W.Predictors of age in the male voice. Paper presented at the 84th Meeting of the Acoustical Society of America. Miami Beach, Florida, November 28–December 1, 1972.
Burk, K.W., Hoyer, E.A., Fey, M. & Charlip, W.S.Perceptual and acoustical correlates of aging in the female voice. Paper presented at the Annual Convention of the American Speech and Hearing Association, Washington, D.C., November 21–24, 1975.
Hartman, D.E., & Danhauer, J.L.Perceptual features of aging in male speech. Paper presented at the 90th Meeting of the Acoustical Society of America, San Francisco, California, November 3–7, 1975.
Stroud, R.V.A study of the relationship between social distance and speech differences of white and negro high school students. Unpublished Masters thesis, Bowling Green State University, 1956.
References
Atal, B.S. Automatic speaker recognition based on pitch contours.Journal of the Acoustical Society of America, 1972,52, 1687–1697.
Bartholomeus, B. Voice identification by nursery school children.Canadian Journal of Psychology, 1973,27, 464–472.
Bolt, R.H., Cooper, F.S., David, E.E., Denes, P.B., Pickett, J.M., & Stevens, K.N. Speaker identification by speech spectrograms: A scientist's view of its reliability for legal purposes.Journal of the Acoustical Society of America, 1970,47, 597–612.
Bower, G., & Karlin, M. Depth of processing pictures of faces and recognition memory.Journal of Experimental Psychology, 1974,103, 751–757.
Bricker, P.D., & Pruzansky, S. Effects of stimulus content and duration on talker identification.Journal of the Acoustical Society of America, 1966,40, 1441–1449.
Buckout, R. Eyewitness testimony.Scientific American, 1974,231, 23–31.
Buckout, R., Alper, A., Chern, S., Siverberg, G., & Slomovits, M. Determinants of Eyewitness performance on a lineup.Bulletin of the Psychonomic Society, 1974,4, 191–192.
Bunge, E. Speaker recognition by computer.Phillips Technical Review, 1977,37, 207–219.
Bull, R., & Clifford, B.R. Identification: The Devlin Report.New Scientist, 1976,70, 307–308.
Cantril, H., & Allport, G.W.The Psychology of Radio. New York: Harper, 1935.
Carterette, E.C., & Barnebey, A. Recognition memory for voices. In A. Cohen & G. Nooteboom (Eds.)Structures and Processes in Speech Perception. Heidelberg and New York: Springer, 1975.
Clifford, B.R. A critique of eyewitness research. In M.M. Gruneberg, P.E. Morris, and R.N. Sykes (Eds.)Practical Aspects of Memory, London, New York, San Francisco: Academic Press, 1978.
Clifford, B.R., & Bull, R.The Psychology of Person Identification. London: Routledge & Kegan Paul, 1978.
Clifford, B.R., & Prior, D. Levels of processing and capacity allocation.Perceptual and Motor Skills, 1980,50, 829–830.
Clifford, B.R., & Scott, J. Individual and situational factors in eyewitness testimony.Journal of Applied Psychology, 1978,63, 352–359.
Cole, R.A. Different memory functions for consonants and vowels.Cognitive Psychology, 1972,4, 39–54.
Cole, R.A., Coltheart, M., & Allard, F. Memory of a speaker's voice: Reaction time to same- or differentvoice letters.Quarterly Journal of Experimental Psychology, 1974,26, 1–7.
Coleman, R.O. Male and female voice quality and its relationship to vowel format frequencies.Journal of Speech and Hearing Research, 1971,14, 565–577.
Coleman, R.O. Speaker identification in the absence of inter-subject differences in glottal source characteristics.Journal of the Acoustical Society of America, 1973,53, 1741–1743.
Compton, A.J. Effects of filtering and vocal duration upon the identification of speakers, aurally.Journal of the Acoustical Society of America, 1963,35, 1748–1752.
Craik, F.I.M., & Kirsner, K. The effect of speaker's voice on word recognition.Quarterly Journal of Experimental Psychology, 197426, 274–284.
Deffenbacher, K.A., Brown, E.L., & Sturgill, W. Some predictors of eyewitness memory accuracy. In Gruneberg, M.M., Morris, P.E., and Sykes, R.N. (Eds.)Practical Aspects of Memory, New York: Academic Press, 1975.
Deutsch, D. Experiments in short term memory for tonal pitch and their implications for theories of non verbal memory. In D. Deutsch & J.A. Deutsch (Eds.)Short Term Memory, New York: Academic Press, 1975.
Devlin, Lord P.Report to the Secretary of State for the Home Department of the Departmental Committee on the Evidence of Identification in Criminal Cases. H.M.S.O., 1976.
Dickens, M., & Sawyer, G.M. An experimental comparison of vocal qualities among mixed groups of whites and negroes.Southern Speech Journal, 1962,18, 178–185.
Doehring, D.G., & Ross, R.W. Voice recognition by matching to sample.Journal of Psycholinguistic Research, 1972,1, 233–242.
Doob, A., & Kirschenbaum, H. Bias in police lineups—partial remembering.Journal of Police Science and Administration, 1973,1, 287–293.
Friedlander, B.Z. Receptive language development: Issues and problems.Merrill-Palmer Quarterly of Behaviour and Development, 1970,16, 7–15.
Geiselman, R.E., & Bellezza, F.S. Long term memory for speaker's voice and source location.Memory and Cognition, 1976,4, 483–489.
Geiselman, R.E., & Bellezza, F.S. Incidental retention of speaker's voice.Memory and Cognition, 1977,5, 658–665.
Haggard, M.P. Selectivity versus summation in multiple observation tasks: Evidence with spectrum parameter noise in speech.Acta Psychologica, 1973,37, 285–299.
Harms, L.S. Listener judgment of status cues in speech.Quarterly Journal of Speech, 1961,47, 164–168.
Harms, L.S. Listener comprehension of speakers of three status groups.Language and Speech, 1963,4, 109–112.
Hecker, M.P. Speaker recognition—an interpretive survey of the literature.A.S.H.A., Monograph 16. American Speech and Hearing Association, 1971.
Hintzman, D.L., Block, R.A., & Inskeep, N.R. Memory for mode of input.Journal of Verbal Learning and Verbal Behaviour, 1972,11, 741–749.
Ingemann, F. Identification of speaker's voice from voiceless fricatives.Journal of the Acoustical Society of America, 1968,44, 1142–1144.
Kramer, E. The judgement of personality characteristics and emotions from non-verbal properties of speech.Psychological Bulletin, 1963,60, 408–420.
Larson, V.S., & Larson, C.H. Reactions to pronunciation. In R.E. McDavid & W.M. Austin (Eds.)Communication Barriers to the Culturally Deprived, Washington, D.C.: U.S. Office of Education, 1966.
Lass, N.J., & Harvey, L.A. An investigation of speaker photograph identification.Journal of the Acoustical Society of America, 1976,59, 1232–1236.
Lass, N.J., Hughes, K.R., Bowyer, M.D., Waters, L.T., & Bourne, V.T. Speaker sex identification from voiced, whispered and filtered isolated vowels.Journal of the Acoustical Society of America, 1976,59, 675–678.
Lass, N.J., Beverly, A.S., Nicosia, D.K., & Simpson, L.A. An investigation by means of direct estimation of speaker height and weight identification.Journal of Phonetics, 1978,6, 69–76.
Light, L.L., Stanbury, C., Rubins, C., & Linde, S. Memory for modality of presentation: Within-modality discrimination.Memory and Cognition, 1973,1, 395–400.
Lindsay, R.C.L., Wells, G.L., & Rumpel, C. Can people detect eyewitness identification accuracy within and across situations?Journal of Applied Psychology, 1981,66, 79–89.
Lipton, J.P. On the psychology of eyewitness testimony.Journal of Applied Psychology, 1977,62, 90–95.
Loftus, E.F.Eyewitness Testimony. Cambridge: Harvard University Press, 1979.
Loftus, E.F., Miller, D.G., & Burns, H.J. Semantic integration of verbal information into a visual memory.Journal of Experimental Psychology: Human Learning and Memory, 1978,4, 19–31.
Mann, V.A., Diamond, R., & Carey, S. Development of voice recognition: Parallels with face recognition.Journal of Experimental Child Psychology, 1979,27, 153–165.
McGehee, F. The reliability of the identification of the human voice.Journal of General Psychology, 1937,17, 249–271.
McGehee, F. An experimental investigation of voice recognition.Journal of General Psychology, 1944,31, 53–65.
McGlone, R.E., Hollien, P., & Hollien, H. Acoustic analysis of voice disguise related to voice identification.Journal of the Acoustical Society of America, 1977,62, 31–35.
Miller, J.E. Decapitation and recapitation, a study in voice qualities.Journal of the Acoustical Society of America, 1964,42, 2002.
Morton, J. A functional model of memory. In D.A. Norman (Ed.)Models for Human Memory, New York: Academic Press, 1970.
Murray, T., & Cort, S. Aural identification of children's voices.Journal of Auditory Research, 1971,11, 260–262.
Murdock, B.B.Human Memory: Theory and data. Hillsdale, New York: Erlbaum Press, 1974.
Murdock, B.B., & Walker, D.K. Modality effects in free recall.Journal of Verbal Learning and Verbal Behaviour, 1969,8, 665–676.
Patterson, K.E., & Baddeley, A.D. When face recognition fails.Journal of Experimental Psychology: Human Learning and Memory, 1977,3, 406–417.
Pear, T.H.Voice and Personality. New York: Wiley, 1931.
Pollack, I., Pickett, J.M., & Sumby, W.H. On the identification of speakers by voice.Journal of the Acoustical Society of America, 1954,26, 403–406.
Ptacek, P.H., & Sanders, E.K. Age recognition from voice.Journal of Speech and Hearing Research, 1966,9, 273–277.
Reich, A., Moll, K., & Curtis, J. Effects of selected vocal disguise upon spectrographic speaker identification.Journal of the Acoustical Society of America, 1976,60, 919–925.
Saslove, H., & Yarmey, A.D. Long term auditory memory: Speaker identification.Journal of Applied Psychology, 1980,65, 111–116.
Scherer, K.R. Personality inference from voice quality: the loud voice of extraversion.European Journal of Social Psychology, 1978,8, 467–487.
Schwartz, M.F. Identification of speaker's sex from isolated voiceless fricatives.Journal of the Acoustical Society of America, 1968,43, 1178–1179.
Schwartz, M.F., & Rhine, H.E. Identification of speaker's sex from isolated whispered vowels.Journal of the Acoustical Society of America, 1968,44, 1736–1737.
Shipp, F.T., & Hollien, H. Perception of the aging male voice.Journal of Speech and Hearing Research, 1969,12, 703–710.
Stevens, K.N., Williams, C.E., Carbonell, J.R., & Wood, B. Speaker authentication and verification: A comparison of spectrographic and auditory presentations of speech materials.Journal of the Acoustical Society of America, 1968,44, 1596–1607.
Wolf, J.J. Efficient acoustic parameters for speaker recognition.Journal of the Acoustical Society of America, 1972,51, 2044–2056.
Wells, G.L., Lindsay, R.C.L., & Ferguson, T.J. Accuracy, confidence, and juror perceptions in eyewitness identification.Journal of Applied Psychology, 1979,64, 440–448.
Yarmey, A.D.The Psychology of Eyewitness Testimony. London and New York: The Free Press, 1979.
Author information
Authors and Affiliations
Additional information
Part of the research discussed in this paper was conducted under the auspices of a grant from the British Home Office to the author and Ray Bull. The author would like to thank the issue editor for his very useful comments on an earlier draft of this paper, and Harriet Rathborn for running many of the experiments.
About this article
Cite this article
Clifford, B.R. Voice identification by human listeners: On earwitness reliability. Law Hum Behav 4, 373–394 (1980). https://doi.org/10.1007/BF01040628
Issue Date:
DOI: https://doi.org/10.1007/BF01040628