Abstract
Studies of auditory object perception claim that semantic properties dominate acoustic properties in determining identification accuracy. Yet the direction of the semantic effect is mixed, with some studies showing an advantage for detecting incongruent sounds and others reporting a congruent sound advantage. Here we examine the role of the participant’s attentional set when identifying auditory objects in naturalistic soundscapes. We varied the acoustic and semantic properties of the sounds orthogonally in two experiments. In Experiment 1 participants tuned their attention broadly to detect any change between two successive soundscapes (e.g., two restaurant soundscapes, with and without a child coughing). In Experiment 2 they tuned attention more narrowly to a probe presented after a soundscape (e.g., a restaurant soundscape with a child coughing, followed by the coughing sound alone). In both experiments, semantic relations between the objects and backgrounds helped to disambiguate objects that blended acoustically with the background. When attending globally (Experiment 1), objects that were acoustically similar yet semantically incongruent tended to be missed (e.g., bouncing basketball on a construction site), as though camouflaged by the gist of the soundscape. When attending locally (Experiment 2), semantically congruent foil objects led to false positive reports under acoustically similar conditions (hammering sounds on a construction site), as though the gist of the soundscape contributed to their plausible inclusion. In summary, although attentional set had a strong influence on the specific kinds of errors made, both results pointed to participants using a semantically congruent high-level schema to report the sounds they heard.
Similar content being viewed by others
Data availability
Neither of the two experiments were preregistered. The data and stimulus material for all experiments are available via the Open Science Framework at: https://osf.io/4va8j/, https://doi.org/10.17605/OSF.IO/4VA8J.
Notes
The discussion in this paper does not include the identification of human speech and music, since both of these types of sounds are semantically and acoustically special and highly predictive (Agres, 2019; Large et al., 1995; Warren, 1970). Non-speech and non-musical sounds do not have dedicated processes for their acoustic features and are much less predictable than speech or music (Gygi & Shafiro, 2011.
References
Agres, K. R. (2019). Change detection and schematic processing in music. Psychology of Music, 47(2), 173–193.
Attneave, F. (1955). Symmetry, information, and memory for patterns. The American Journal of Psychology, 68(2), 209–222.
Austen, E., & Enns, J. T. (2000). Change detection: Paying attention to detail. Psyche, 6(11), 808–817.
Austen, E. L., & Enns, J. T. (2003). Change detection in an attended face depends on the expectation of the observer. Journal of Vision, 3(1), 7–7.
Ballas, J. A. (1993). Common factors in the identification of an assortment of brief everyday sounds. Journal of Experimental Psychology: Human Perception and Performance, 19(2), 250.
Ballas, J. A., & Mullins, T. (1991). Effects of context on the identification of everyday sounds. Human Performance, 4(3), 199–219.
Bartlett, F. C. (1932). Remembering: A study in experimental and social psychology. Cambridge University Press.
Bosch, Vanden, der Nederlanden, C. M., Snyder, J. S., & Hannon, E. E. (2016). Children use object-level category knowledge to detect changes in complex auditory scenes. Developmental Psychology, 52(11), 1867.
Botvinick, M., & Bylsma, L. M. (2005). Regularization in short-term memory for serial order. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(2), 351.
Bourrier, S. C., Berman, M. G., & Enns, J. T. (2018). Cognitive strategies and natural environments interact in influencing executive function. Frontiers in Psychology, 9, 1248.
Brennan, A. A., Watson, M. R., Kingstone, A., & Enns, J. T. (2011). Person perception informs understanding of cognition during visual search. Attention, Perception, & Psychophysics, 73, 1672–1693.
Carello, C., Anderson, K. L., & Kunkler-Peck, A. J. (1998). Perception of object length by sound. Psychological science, 9(3), 211–214.
Clark, C. W., Marler, P., & Beeman, K. (1987). Quantitative analysis of animal vocal phonology: An application to swamp sparrow song. Ethology, 76, 101–115.
Cortopassi, K. A., & Bradbury, J. W. (2000). The comparison of harmonically rich sounds using spectrographic cross-correlation and principal coordinates analysis. Bioacoustics, 11(2), 89–127.
Cramer, E. R. A. (2013). Measuring consistency: Spectrogram cross- correlation versus targeted acoustic parameters. Bioacoustics, 22(3), 247–257.
Deese, J. (1959). On the prediction of occurrence of particular verbal intrusions in immediate recall. Journal of Experimental Psychology, 58(1), 17.
Ernst, M. O., & Banks, M. S. (2002). Humans integrate visual and haptic information in a statistically optimal fashion. Nature, 415(6870), 429–433.
Fredrickson, B. L. (2004). The broaden–and–build theory of positive emotions. Philosophical Transactions of the Royal Society of London Series B: Biological Sciences, 359(1449), 1367–1377.
Freed, D. J. (1990). Auditory correlates of perceived mallet hardness for a set of recorded percussive sound events. The Journal of the Acoustical Society of America, 87(1), 311–322.
Gaston, J., Dickerson, K., Hipp, D., & Gerhardstein, P. (2017). Change deafness for real spatialized environmental scenes. Cognitive Research: Principles and Implications, 2(1), 1–13.
Gregg, M. K., & Samuel, A. G. (2008). Change deafness and the organizational properties of sounds. Journal of Experimental Psychology: Human Perception and Performance, 34(4), 974.
Gregg, M. K., & Samuel, A. G. (2009). The importance of semantics in auditory representations. Attention, Perception, & Psychophysics, 71, 607–619.
Gregg, M. K., Irsik, V. C., & Snyder, J. S. (2014). Change deafness and object encoding with recognizable and unrecognizable sounds. Neuropsychologia, 61, 19–30.
Gygi, B., & Shafiro, V. (2011). The incongruency advantage for environmental sounds presented in natural auditory scenes. Journal of Experimental Psychology: Human Perception and Performance, 37(2), 551.
Gygi, B., Kidd, G. R., & Watson, C. S. (2004). Spectral-temporal factors in the identification of environmental sounds. The Journal of the Acoustical Society of America, 115(3), 1252–1265.
Hudson Kam, C. L., & Chang, A. (2009). Investigating the cause of language regularization in adults: Memory constraints or learning effects? Journal of Experimental Psychology: Learning, Memory, and Cognition, 35(3), 815.
Irsik, V. C., Bosch, Vanden, der Nederlanden, C. M., & Snyder, J. S. (2016). Broad attention to multiple individual objects may facilitate change detection with complex auditory scenes. Journal of Experimental Psychology: Human Perception and Performance, 42(11), 1806–1817. https://doi.org/10.1037/xhp0000266
Jefferies, L. N., Ghorashi, S., Kawahara, J. I., & tDiLollo, V. (2007). Ignorance is bliss: The role of observer expectation in dynamic spatial tuning of the attentional focus. Perception & Psychophysics, 69(7), 1162–1174.
Jefferies, L. N., Enns, J. T., & Di Lollo, V. (2014). The flexible focus: Whether spatial attention is unitary or divided depends on observer goals. Journal of Experimental Psychology: Human Perception and Performance, 40(2), 465.
Josephs, E., Drew, T., & Wolfe, J. (2016). Shuffling your way out of change blindness. Psychonomic Bulletin & Review, 23, 193–200.
Khanna, H., Gaunt, S. L. L., & McCallum, D. A. (1997). Digital spectrographic cross- correlation: Tests of sensitivity. Bioacoustics, 7, 209–234.
Knapen, T., Brascamp, J., Adams, W. J., & Graf, E. W. (2009). The spatial scale of perceptual memory in ambiguous figure perception. Journal of Vision, 9(13), 16–16.
Kubovy, M., & Van Valkenburg, D. (2001). Auditory and visual objects. Cognition, 80(1–2), 97–126.
Kunkler-Peck, A. J., & Turvey, M. T. (2000). Hearing shape. Journal of Experimental Psychology: Human Perception and Performance, 26(1), 279.
Lanning, J. M., & Stilp, C. (2020). Natural music context biases musical instrument categorization. Attention, Perception, & Psychophysics, 82, 2209–2214.
Large, E. W., Palmėr, C., & Pollack, J. B. (1995). Reduced memory representations for music. Cognitive Science, 19(1), 53–96.
Leech, R., Gygi, B., Aydelott, J., & Dick, F. (2009). Informational factors in identifying environmental sounds in natural auditory scenes. The Journal of the Acoustical Society of America, 126(6), 3147–3155.
Li, X., Logan, R. J., & Pastore, R. E. (1991). Perception of acoustic source characteristics: Walking sounds. The Journal of the Acoustical Society of America, 90(6), 3036–3049.
Murray, M. M., Camen, C., Andino, S. L. G., Bovet, P., & Clarke, S. (2006). Rapid brain discrimination of sounds of objects. Journal of Neuroscience, 26(4), 1293–1302.
Navon, D. (1977). Forest before trees: The precedence of global features in visual perception. Cognitive Psychology, 9(3), 353–383.
Pardilla-Delgado, E., Payne, J. D. (2017). The deese-roediger-McDermott (DRM) task: a simple cognitive paradigm to investigate false memories in the laboratory. Journal of visualized experiments: JoVE, 119, e54793. https://doi.org/10.3791/54793.
Perfors, A. (2012). When do memory limitations lead to regularization? An experimental and computational investigation. Journal of Memory and Language, 67(4), 486–506.
Reason, J. (1990). Human error. Cambridge University Press.
Repp, B. H. (1987). The sound of two hands clapping: An exploratory study. The Journal of the Acoustical Society of America, 81(4), 1100–1109.
Rhodes, S., Cowan, N., Hardman, K. O., & Logie, R. H. (2018). Informed guessing in change detection. Journal of Experimental Psychology: Learning, Memory, and Cognition, 44(7), 1023.
Roediger, H. L., & McDermott, K. B. (1995). Creating false memories: Remembering words not presented in lists. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21(4), 803.
Sablé-Meyer, M., Fagot, J., Caparos, S., van Kerkoerle, T., Amalric, M., & Dehaene, S. (2021). Sensitivity to geometric shape regularity in humans and baboons: A putative signature of human singularity. Proceedings of the National Academy of Sciences, 118(16), e2023123118.
Shafiro, V. (2008). Identification of environmental sounds with varying spectral resolution. Ear and Hearing, 29(3), 401–420.
Sierro, J., De Kort, S. R., & Hartley, I. R. (2023). Sound properties affect measurement of vocal consistency in birdsong: Validation of the spectrogram cross correlation method (SPCC). The Journal of the Acoustical Society of America, 154(2), 699–708.
Stanislaw, H., & Todorov, N. (1999). Calculation of signal detection theory measures. Behavior research methods, instruments, & computers, 31(1), 137–149.
Smilek, D., Enns, J. T., Eastwood, J. D., & Merikle, P. M. (2006). Relax! Cognitive strategy influences visual search. Visual Cognition, 14(4–8), 543–564.
Warren, R. M. (1970). Perceptual restoration of missing speech sounds. Science, 167(3917), 392–393.
Warren, W. H., & Verbrugge, R. R. (1984). Auditory perception of breaking and bouncing events: A case study in ecological acoustics. Journal of Experimental Psychology: Human perception and performance, 10(5), 704.
Watson, M. R., Brennan, A. A., Kingstone, A., & Enns, J. T. (2010). Looking versus seeing: Strategies alter eye movements during visual search. Psychonomic Bulletin & Review, 17(4), 543–549.
Acknowledgements
We are deeply grateful to Brian Gygi for generously providing the auditory stimulus materials. This research was funded by a Discovery Grant to JTE from the Natural Sciences and Engineering Council of Canada. NB was supported by a Summer Undergraduate Student Research Assistantship from the Natural Sciences and Engineering Council of Canada.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Dudarev, V., Kai, J., Brar, N. et al. Listening to trees in the forest: Attentional set influences how semantic and acoustic factors interact in auditory perception. Atten Percept Psychophys 86, 381–391 (2024). https://doi.org/10.3758/s13414-023-02835-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.3758/s13414-023-02835-w