All reported analyses and associated data files can be accessed on the Open Science Framework (https://osf.io/a862f/). The analyses are based on summary statistics; however, the raw (trial-by-trial) data for each task are also available on the Open Science Framework (https://osf.io/43dhv/).
Affective measures
Aesthetic ratings
Natural soundscapes were aesthetically preferred over urban soundscapes (Full: t (61) = 5.61, p < .001, d = 1.41, BF = 2.41e4; Naïve: t (44) = 4.65, p < .001, d = 1.38, BF = 610.2). For all 63 participants, the mean rating of natural soundscapes was 2.31 (SD = 0.37) and the mean rating of urban soundscapes was 1.84 (SD = 0.28). For the subset of 46 naïve participants, the mean rating of natural soundscapes was 2.23 (SD = 0.41) and the mean rating of urban soundscapes was 1.82 (SD = 0.29).
PANAS
We analyzed changes in the PANAS using a 2 (time: pre-, post-) × 2 (soundscape: nature, urban) mixed factorial ANOVA and Bayesian equivalent. For positive affect (PA), there was a significant main effect of time (Full: F (1, 61) = 10.62, p = .002, d = 0.41, BF = 17.12; Naïve: F (1, 44) = 9.26, p = .004, d = 0.45, BF = 9.327), characterized by overall lower scores post-intervention compared to pre-intervention. There was a marginal main effect of soundscape type when considering all participants – with natural soundscape participants exhibiting lower scores than urban soundscape participants – though this effect was not significant when limited to the naïve participants (Full: F (1, 61) = 3.28, p = .075, d = 0.46, BF = 1.22; Naïve: F (1, 44) = 2.32, p = .135, d = 0.45, BF = 0.93). The interaction between time and soundscape type was not significant (Full: F (1, 61) = 0.22, p = .644, d = 0.12, BF = 0.27; Naïve: F (1, 44) = 0.13, p = .718, d = 0.11, BF = 0.29).
For negative affect (NA), there was also a main effect of time (Full: F (1, 61) = 9.73, p = .003, d = 0.39, BF = 9.87; Naïve: F (1, 44) = 9.74, p = .003, d = 0.44, BF = 8.36), characterized by overall lower scores post-intervention compared to pre-intervention. However, the main effect of soundscape type (Full: F (1, 61) = 0.39, p = .536, d = 0.16, BF = 0.52; Naïve: F (1, 44) = 0.77, p = .386, d = 0.26, BF = 0.61), in addition to the interaction between time and soundscape type (Full: F (1, 61) = 2.50, p = .119, d = 0.40, BF = 0.71; Naïve: F (1, 44) = 1.45, p = .235, d = 0.36, BF = 0.50), was not significant for either the full or the naïve group of participants.
Cognitive measure
We analyzed changes in the composite cognitive measure using a 2 (time: pre-, post-) × 2 (soundscape: nature, urban) mixed factorial ANOVA and Bayesian equivalent. In this analysis, the main effect of time was significant (Full: F (1, 61) = 12.91, p < .001, d = 0.42, BF = 20.857; Naïve: F (1, 44) = 8.42, p = .006, d = 0.37, BF = 3.13), with participants displaying higher post-intervention compared to pre-intervention scores, i.e., a learning effect. The main effect of soundscape type was not significant, and the Bayes factor was inconclusive (Full: F (1, 61) = 3.08, p = .084, d = 0.44, BF = 1.127; Naïve: F (1, 44) = 1.94, p = .171, d = 0.41, BF = 0.89). Critically, there was a significant interaction between time and soundscape type (Full: F (1, 61) = 7.92, p = .007, d = 0.71, BF = 6.651; Naïve: F (1, 44) = 6.58, p = .014, d = 0.76, BF = 3.08), plotted in Fig. 1. This interaction was characterized by a greater improvement in the natural sound condition compared to the urban sound condition, and the effect size of the interaction can be interpreted as medium to large (Cohen, 1988).
Follow-up analyses demonstrated that the interaction between time and soundscape was best conceptualized as a nature-related benefit to performance. Performance on the cognitive measure did not significantly differ between participants in the natural and urban soundscape conditions during pre-intervention (Full: t (61) = 0.74, p = .462, d = 0.19, BF = 0.324; Naïve: t (44) = .61, p = .545, d = 0.18, BF = 0.341); by post-intervention, however, participants assigned to natural soundscapes were significantly outperforming participants assigned to urban soundscapes (Full: t (61) = 2.59, p = .006, d = 0.65, BF = 8.054; Naïve: t (44) = 2.13, p = .020, d = 0.63, BF = 3.395). Within participants, the cognitive improvement from pre- to post-intervention was significant and positive for natural sounds (Full: t (30) = 4.56, p < .001, d = 0.82, BF = 311.1; Naïve: t (20) = 4.64, p < .001, d = 1.01, BF = 182.3) and not significant for urban sounds (Full: t (31) = 0.55, p = .587, d = 0.10, BF = 0.217; Naïve: t (24) = 0.26, p = .796, d = 0.05, BF = 0.22).
Relationship between affective and cognitive measures
For our affective measures, we did not find evidence that listening to natural versus urban soundscapes differentially influenced participants’ positive or negative affect, measured by the PANAS, even though natural sounds were more preferred aesthetically to urban sounds. However, these analyses by themselves do not answer whether the affective measures relate to the observed improvement in cognitive performance. To assess this question, we correlated cognitive improvement (post- minus pre-intervention score) with changes in the PANAS (both positive and negative affect), as well as with aesthetic ratings.
The relationship between changes in positive affect and cognitive improvement was not significant (Full: r (61) = .14, p = .276, d = 0.28, BF = 0.28; Naïve: r (44) = .06, p = .703, d = 0.12, BF = 0.20). The relationship between changes in negative affect and cognitive improvement was also not significant (Full: r (61) = .15, p = .231, d = 0.30, BF = 0.32; Naïve: r (44) = .11, p = .455, d = 0.22, BF = 0.24). Finally, the relationship between aesthetic ratings and cognitive improvement was also not significant (Full: r (61) = .18, p = .168, d = 0.37, BF = 0.40; Naïve: r (44) = .22, p = .147, d = 0.45 BF = 0.51).
For completeness, the means, standard deviations, and effect sizes of changes from pre- to post-intervention for each of the measures (separated by full and naïve participant groups) are provided in Table 1.
Table 1 Pre- and post-intervention scores for the computerized tasks