Bubbles results
We considered two performance metrics as dependent measures: the amount of information (i.e., number of bubbles) required to achieve the target performance of 75% correct for each emotion and the actual percentage correct achieved (NB: with a small number of trials it is not possible to perfectly stabilize performance at the target 75% correct); see Fig. 1A. Alongside this, we examined the quality of the bubbles solution, i.e. the visual information that is significantly associated with categorization of each emotional expression. A one-way repeated measures ANOVA with task version (1, 2, 3) as the within-subjects factor indicated a clear main effect on the amount of information required to achieve good performance levels (F(2,58) = 3.8, p = 0.029, η
2=0.12). Planned comparisons revealed that participants required significantly less information for task version 3 (M = 85 bubbles) compared to task version 2 (M = 97bubbles, F(1,29) = 5.6, p = 0.025, η
2 = 0.16), but there was no such drop in number of bubbles for task version 2 compared to task version 1 (M = 93 bubbles, F(1,29) = 0.9, p = 0.35, η
2 = 0.03). An equivalent ANOVA on percentage correct scores indicated a trend for a main effect of condition here too (F(1.3, 37.6) = 3.4, p = 0.06, η
2 = 0.11), with planned comparisons again showing that participants are performing slightly better in task version 3 (74.4%) compared to task version 2 (72%, F(1,29) = 4.2, p = 0.049, η
2
= 0.13), but with no improvement for task version 2 compared to task version 1 (73.4%, F(1,29) = 2.3, p = 0.14, η
2 = 0.07).
To evaluate the effectiveness of the task version manipulations on the quality of the bubbles solution we considered the information processing results for the two most well-researched emotional expression categorizations: fear and happiness.Footnote 1 The critical visual information for both fear and happiness categorizations has been confirmed across a number of studies in typical adult participants. For fearful categorizations the crucial visual information has been repeatedly shown to comprise the wide-open eyes across scales in higher spatial frequencies (scales 1–3), alongside the open mouth (scales 3 and 4, e.g., Adolphs et al., 2005; Smith & Merlusca, 2015; Smith et al., 2005, F. Smith & Schyns, 2009;). For happiness categorizations it is the wide-open mouth, from fine detail in the higher spatial frequencies through to the broad low spatial frequency mouth shape information (Adolphs et al., 2005; Smith & Merlusca, 2015; Smith et al., 2005, F. Smith & Schyns, 2009).
For both fear and happy, and all three task version scenarios, the bubbles solution replicates most
Footnote 2 of the key features of these established processing profiles. Figure 1B shows only those regions that pass the corrected statistical tests (p<0.05, Chauvin et al., 2005) highlighted on a sample face. Significant regions observed under task version 1 are coded in red, those from task version 2 in green, and finally those of task version 3 in blue. Note that where the same regions were significant in multiple task versions it is color-coded in the RGB color space combined color (e.g., the same region significant for task version 1(red) and 3(blue) would be coded in purple, the same region significant for all task versions would be coded in white). Figure 1C presents the information association maps (z-scores) for all positive associations between information sampling and performance for each condition in turn across the five spatial frequency bands sampled prior to applying the statistical threshold.
Importantly, not all task versions produced equally clear profiles of information use. Close inspection of the results reveals that for fear categorizations, it is only in task version 3 – where social interaction and participant engagement are maximized – that both eyes reach significance in the highest spatial frequency band. Similarly, for happiness categorizations it is only in task version 3 that the entire higher spatial frequency mouth reaches significance. Furthermore, when considering the absolute strength of the association between the important pixel locations and performance (the un-thresholded z-scores, presented in Fig. 1C) the largest values are generally observed for task version 3, see Table 1.Footnote 3
Table 1 Maximal strength of the association between information location and performance (measured in z-scores) indicating a stronger association for task version 3 for scales 1–3 for both fear and happy, and again for scale 5 for fear categorizations
Motivation Questionnaire results
One participant failed to understand the instructions with regard to the questionnaire (choosing to answer only one of the eight questions at each administration), and the data for one participant, completing one condition, was lost due to experimenter error leaving 28 participants. A one-way repeated measures ANOVA, with task version as the factor with three levels, was conducted for each question in turn (GG correction reported for violations of sphericity). Significant effects were further explored with post-hoc follow-up t-tests (Bonferroni corrected for multiple comparisons); see Fig. 2 for average responses per condition. We observed a significant effect of condition for participant’s self-reported enjoyment (F(2,54) = 4.7, p = 0.013, η
2 = 0.15), interest (F(2,54) = 3.86, p = 0.027, η
2 = 0.13), desire to do well (F(2,54) = 4.4, p = 0.016, η
2 = 0.14), pressure felt (F(1.57,42.27) = 6.6, p = 0.006, η
2 = 0.2), and connectedness to the experimenter (F(1.26, 34.06) = 13.2, p<0.001, η
2 = 0.33), with a clear trend for an effect also on the effort they expended (F(2,54) = 3.04, p = 0.056, η
2 = 0.1). There was no effect of experimental condition on their desire to be good at the task (F(2,54) = 1.8, p = 0.17, η
2=0.06) or how important they felt the task was (F(2,54) = 0.74, p = 0.48, η
2 = 0.03).
Planned comparisons confirmed that participants enjoyed participating in condition 3 more than condition 2 (F(1,27) = 6.4, p = 0.018, η
2 = 0.19), but with no such benefit for condition 2 over condition 1 (F(1,27) = 0.6, p = 0.45, η
2 = 0.02). Similarly, participants expended more effect in condition 3 compared to condition 2 (F(1,27) = 3.98, p = 0.056, η
2 = 0.13), with no difference between conditions 1 and 2 (F(1,27) = 0.3, p = 0.59. η
2 = 0.11). They also tried to do well more for condition 3 than condition 2 (F(1,27) = 4.7, p = 0.039, η
2 = 0.15), with no difference between conditions 1 and 2 (F(1,27) = 1.35, p = 0.26, η
2 = 0.05). As expected, participants felt more connected to the experimenter in condition 3 than condition 2 (F(1,27) = 15.9, p<0.001, η
2 = 0.37), but this came at the cost of feeling more pressure (F(1,27) = 6.7, p = 0.013, η
2 = 0.2). Again there was no difference for either connectedness or pressure felt between conditions 1 and 2 (F(1,27) = 1.3, 0.36, p = 0.26, 0.55 , η
2 = 0.05, 0.013, respectively). Finally, participants’ interest in the experiment did not increase significantly between conditions 2 and 3 (F(1,27) = 0.58, p = 0.45, η
2 = 0.02), but rather there was a trend for interest to be significantly greater for condition 2 than for condition 1 (F(1,27) = 4.0, p = 0.056, η
2 = 0.13).
In an exploratory analysis we then asked whether subjective feelings representing engagement with the task might be directly correlated with markers of task performance (percentage correct, mean number of bubbles) within each task version. We considered self-report measures of effort expended as the best proxy for task engagement and found clear relationships between increased engagement and improvements in the behavioral performance metrics for all task versions, but most so for task version 3 (V1: Accuracy, r2(28) = 0.40, p = 0.03, Information required, r2(28) = −0.33, p = 0.09; V2: Accuracy, r2(28) = 0.52, p = 0.005*, Information required: r2(28) = −0.46, p = 0.013; V3: Accuracy, r2(28) = 0.53, p = 0.004*, Information required: r2(28) = −0.49, p = 0.009*; *denotes Bonferoni corrected significant effects). Note that engagement with the task as approximated by effort expended was not directly correlated with “pressure felt” under any task version (r2(28) = 0.19, 0.03, 0.22, p = 0.35,0.86,0.26, respectively) and in particular the increased pressure felt under task version 3 did not seem to be a significant driving force of improved performance (Accuracy, r2(28) = 0.089, p = 0.65 Information required, r2(28) = −0.16, p = 0.43). Similarly, increased feelings of connectedness to the researcher did not correlate significantly with performance under any task version (V3: r2(28) < 0.21, p > 0.28; V1, V2: r2(28) < 0.23, p > 0.23).