Human Physiology

, Volume 45, Issue 6, pp 577–586 | Cite as

Speech and Non-Speech Sound Categorization in Auditory Cortex: fMRI Correlates

  • V. M. Shklovsky
  • S. A. VarlamovEmail author
  • A. G. Petrushevsky
  • L. A. MayorovaEmail author


We studied the functional structure of the auditory cortex by identifying and comparing the spatial localization of activation areas in response to speech and non-speech stimuli using functional magnetic resonance imaging (fMRI). We also performed a similar comparison of activation zones in response to male and female voices. We found that there are specific areas for speech and non-speech auditory stimuli and overlapping areas; the speech area is significantly larger as compared with others. The activation areas responding to male and female voices overlap, though not significantly; the influence of female voice was stronger. These results suggest that there are special areas in the auditory cortex for auditory signal processing.


speech perception superior temporal cortex fMRI planum temporale 



We are grateful to the staff of the Center for Speech Pathology and Neurorehabilitation for their help in collecting the stimulus material.


Conflict of interests. The authors declare no explicit and potential conflicts of interest associated with the publication of this article.

Statement of compliance with standards of research involving humans as subjects. All studies were conducted in accordance with the principles of biomedical ethics set out in the Declaration of Helsinki in 1964 and its subsequent updates, and approved by the local bioethical committees of the Center for Speech Pathology and Neurorehabilitation and the Institute of Higher Nervous Activity and Neurophysiology of the Russian Academy of Sciences (Moscow). Each study participant provided voluntary written informed consent signed by them after his explanations potential risks and benefits, as well as the nature of the forthcoming investigations.


  1. 1.
    Luriya, A.R., Vysshie korkovye funktsii cheloveka i ikh narusheniya pri lokal’nykh porazheniyakh mozga (Higher Human Cortical Functions and Their Disorders in Local Brain Lesions), Moscow: Mosk. Gos. Univ., 1962.Google Scholar
  2. 2.
    Diehl, R.L., Lotto, A.J., and Holt, L.L., Speech perception, Annu. Rev. Psychol., 2004, vol. 55, p. 149.CrossRefGoogle Scholar
  3. 3.
    Zatorre, R.J., Belin, P., and Penhune, V.B., Structure and function of auditory cortex: music and speech, Trends Cognit. Sci., 2002, vol. 6, p. 37.CrossRefGoogle Scholar
  4. 4.
    Joanisse, M.F. and Gati, J.S., Overlapping neural regions for processing rapid temporal cues in speech and nonspeech signals, NeuroImage, 2003, vol. 19, p. 64.CrossRefGoogle Scholar
  5. 5.
    Tremblay, P., Baroni, M., and Hasson, U., Processing of speech and non-speech sounds in the supratemporal plane: auditory input preference does not predict sensitivity to statistical structure, NeuroImage, 2013, vol. 66, p. 318.CrossRefGoogle Scholar
  6. 6.
    Marie, D., Roth, M., Lacoste, R., et al., Left brain asymmetry of the planum temporale in a nonhominid primate: Redefining the origin of brain specialization for language, Cereb. Cortex, 2018, vol. 28, no. 5, p. 1808.CrossRefGoogle Scholar
  7. 7.
    Zheng, Z.Z., Munhall, K.G., and Johnsrude, I.S., Functional overlap between regions involved in speech perception and in monitoring one’s own voice during speech production, J. Cognit. Neurosci., 2010, vol. 22, no. 8, p. 1770.CrossRefGoogle Scholar
  8. 8.
    Christoffels, I.K., Formisano, E., and Schiller, N.O., Neural correlates of verbal feedback processing: an fMRI study employing overt speech, Hum. Brain Mapp., 2007, vol. 28, no. 9, p. 868.CrossRefGoogle Scholar
  9. 9.
    Hickok, G., Okada, K., and Serences, J.T., Area Spt in the human planum temporale supports sensory-motor integration for speech processing, J. Neurophysiol., 2008, vol. 101, no. 5, p. 2725.CrossRefGoogle Scholar
  10. 10.
    Zheng, Z.Z., The functional specialization of the planum temporale, J. Neurophysiol., 2009, vol. 102, no. 6, p. 3079.CrossRefGoogle Scholar
  11. 11.
    Griffiths, T.D. and Warren, J.D., The planum temporale as a computational hub, Trends Neurosci., 2002, vol. 25, no. 7, p. 348.CrossRefGoogle Scholar
  12. 12.
    Hawkins, S., Roles and representations of systematic fine phonetic detail in speech understanding, J. Phonetics, 2003, vol. 31, p. 373.CrossRefGoogle Scholar
  13. 13.
    McMurray, B., Tanenhaus, M.K., and Aslin, R.N., Gradient effects of within-category phonetic variation on lexical access, Cognition, 2002, vol. 86, p. B33.CrossRefGoogle Scholar
  14. 14.
    Mottonen, R., Calvert, G., Jaaskelainen, I., et al., Perceiving identical sounds as speech or non-speech modulates activity in the left posterior superior temporal sulcus, NeuroImage, 2006, no. 30, p. 563.CrossRefGoogle Scholar
  15. 15.
    Petrides, M. and Pandya, D.N., Comparative cytoarchitectonic analysis of the human and the macaque ventrolateral prefrontal cortex and corticocortical connection patterns in the monkey, Eur. J. Neurosci., 2002, vol. 16, no. 2, p. 291.CrossRefGoogle Scholar
  16. 16.
    Romanski, L.M. and Averbeck, B.B., Neural representation of vocalizations in the primate ventrolateral prefrontal cortex, J. Neurophysiol., 2005, vol. 93, p. 734.CrossRefGoogle Scholar
  17. 17.
    Romanski, L.M. and Goldman-Rakic, P.S., An auditory domain in primate prefrontal cortex, Nat. Neurosci., 2002, vol. 5, no. 1, p. 15.CrossRefGoogle Scholar
  18. 18.
    Fecteau, S., Sensitivity to voice in human prefrontal cortex, J. Neurophysiol., 2005, vol. 94, no. 3, p. 2251.CrossRefGoogle Scholar
  19. 19.
    Joassin, F., Maurage, P., and Campanella, S., The neural network sustaining the crossmodal processing of human gender from faces and voices: an fMRI study, NeuroImage, 2011, vol. 54, no. 2, p. 1654.CrossRefGoogle Scholar
  20. 20.
    Welcome Trust Centre for Neuroimaging: Scholar
  21. 21.
    Friston, K.J., Holmes, A.P., Worsley, K.J., et al., Statistical parametric maps in functional imaging: a general linear approach, Hum. Brain Mapp., 1994, vol. 2, no. 4, p. 189.CrossRefGoogle Scholar
  22. 22.
    Wilke, M. and Schmithorst, V.J., A combined bootstrap/histogram analysis approach for computing a lateralization index from neuroimaging data, NeuroImage, 2006, vol. 33, no. 2, p. 522.CrossRefGoogle Scholar
  23. 23.
    Wilke, M. and Lidzba, K., LI-tool: a new toolbox to assess lateralization in functional MR-data, J. Neurosci. Methods, 2007, vol. 163, no. 1, p. 128.CrossRefGoogle Scholar
  24. 24.
    Chan, A., Dykstra, A., Jayaram, V., et al., Speech-specific tuning of neurons in human superior temporal gyrus, Cereb. Cortex, 2014, vol. 24, no. 10, p. 2679.CrossRefGoogle Scholar
  25. 25.
    Fan, C.S.D., Zhu, X., Dosch, H.G., et al., Language related differences of the sustained response evoked by natural speech sounds, PLoS One, 2017, vol. 12, no. 7, p. e0180441.CrossRefGoogle Scholar
  26. 26.
    Wernicke, C., The symptom complex of aphasia, Proc. Boston Colloquium for the Philosophy of Science 1966/1968, New York: Springer-Verlag, 1969, vol. 4, p. 34.Google Scholar
  27. 27.
    Luria, A.R., Traumatic Aphasia, Hague: Mouton, 1970.Google Scholar
  28. 28.
    Andermann, M., Patterson, R.D., Vogt, C., et al., Neuromagnetic correlates of voice pitch, vowel type, and speaker size in auditory cortex, NeuroImage, 2017, vol. 158, p. 79.CrossRefGoogle Scholar
  29. 29.
    Bonte, M., Hausfeld, L., Scharke, W., et al., Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns, J. Neurosci., 2014, vol. 34, no. 13, p. 4548.CrossRefGoogle Scholar
  30. 30.
    Formisano, E., De Martino, F., Bonte, M., and Goebel, R., “Who” is saying “what”? Brain-based decoding of human voice and speech, Science, 2008, vol. 322, no. 5903, p. 970.CrossRefGoogle Scholar
  31. 31.
    Liu, P., Cole, P., Gilmore, R., et al., Young children’s neural processing of their mother’s voice: an fMRI study, Neuropsychologia, 2019, vol. 122, p. 11.CrossRefGoogle Scholar
  32. 32.
    Gardumi, A., Ivanov, D., Havlicek, M., et al., Tonotopic maps in human auditory cortex using arterial spin labeling, Hum. Brain Mapp., 2017, vol. 38, no. 3, p. 1140.CrossRefGoogle Scholar
  33. 33.
    Bonte, M., Ley, A., Scharke, W., and Formisano, E., Developmental refinement of cortical systems for speech and voice processing, NeuroImage, 2016, vol. 128, p. 373.CrossRefGoogle Scholar
  34. 34.
    Simon, J.Z., The encoding of auditory objects in auditory cortex: Insights from magnetoencephalography, Int. J. Psychophysiol., 2015, vol. 95, no. 2, p. 184.CrossRefGoogle Scholar
  35. 35.
    Markiewicz, C.J. and Bohland, J.W., Mapping the cortical representation of speech sounds in a syllable repetition task, NeuroImage, 2016, vol. 141, p. 174.CrossRefGoogle Scholar
  36. 36.
    Bethmann, A. and Brechmann, A., On the definition and interpretation of voice selective activation in the temporal cortex, Front. Hum. Neurosci., 2014, vol. 8, p. 499.CrossRefGoogle Scholar

Copyright information

© Pleiades Publishing, Inc. 2019

Authors and Affiliations

  1. 1.Center for Speech Pathology and Neurorehabilitation of the Moscow Department of Health, clinical base of the Serbsky Federal Medical Research Center for Psychiatry and Addictology, Ministry of Health of RussiaMoscowRussia
  2. 2.Moscow State UniversityMoscowRussia
  3. 3.Institute of Higher Nervous Activity and Neurophysiology, Russian Academy of SciencesMoscowRussia
  4. 4.Federal Research and Clinical Center of Intensive Care Medicine and RehabilitologyMoscowRussia

Personalised recommendations