Metacognition tracks sensitivity following involuntary shifts of visual attention

Recht, Samuel; Mamassian, Pascal; de Gardelle, Vincent

doi:10.3758/s13423-022-02212-y

Metacognition tracks sensitivity following involuntary shifts of visual attention

Brief Report
Open access
Published: 16 November 2022

Volume 30, pages 1136–1147, (2023)
Cite this article

Download PDF

You have full access to this open access article

Psychonomic Bulletin & Review Aims and scope Submit manuscript

Metacognition tracks sensitivity following involuntary shifts of visual attention

Download PDF

2605 Accesses
5 Altmetric
Explore all metrics

Abstract

Salient, exogenous cues have been shown to induce a temporary boost of perceptual sensitivity in their immediate vicinity. In two experiments involving uninformative exogenous cues presented at various times before a target stimulus, we investigated whether human observers (N = 100) were able to monitor the involuntary increase in performance induced by such transients. We found that an increase of perceptual sensitivity (in a choice task) and encoding precision (in a free-estimation task) occurred approximately 100 ms after cue onset, and was accompanied by an increase in confidence about the perceptual response. These simultaneous changes in sensitivity and confidence resulted in stable metacognition across conditions. These results suggest that metacognition efficiently tracks the effects of a reflexive attentional mechanism known to evade voluntary control, and illustrate a striking ability of high-level cognition to capture fleeting, low-level sensory modulations.

Think twice: Re-assessing confidence improves visual metacognition

Article Open access 22 December 2023

Resilience of perceptual metacognition in a dual-task paradigm

Article 23 July 2020

Heuristic use of perceptual evidence leads to dissociation between performance and metacognitive sensitivity

Article 20 January 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Sometimes, the saliency of an event (e.g., a bee hovering too close) or the meaning of a signal (e.g., a finger pointing at a snake) is such that we quickly disengage from the ongoing task to reallocate our attention elsewhere. Selective spatial attention has been defined as the prioritization and enhancement of a stimulus at a particular location (Carrasco, 2011; Nobre & Kastner, 2014; Posner, 1980). Two forms of attention have classically been distinguished. Endogenous attention is goal-directed and prioritizes information that is deemed relevant for the observer. It has a slow deployment rate (~300 ms) but can be sustained in time. By contrast, exogenous attention enables an organism to react quickly and automatically to a potential threat: it is a fast (~100 ms) and involuntary, albeit short-lasting process (Carrasco, 2011; Nobre & Kastner, 2014). In psychophysical experiments, exogenous orienting is triggered by sharp contrast transients in the vicinity of a target (Carrasco, 2011; Solomon & Morgan, 2018), the latter then being reported more quickly (Jonides & Irwin, 1981; Posner, 1980) and more accurately (Carrasco, 2011), even when the cue is uninformative (Ling & Carrasco, 2006; Nakayama & Mackeben, 1989; Remington, Johnston, & Yantis, 1992).

Since spatial attention affects perceptual performance, knowing whether attention was deployed is a good indication of the quality of one’s own perception. This metacognitive knowledge is useful to regulate behavior: a driver, for example, may decide to slow down if unsure about the color of a traffic light. The subjective estimation of a decision’s accuracy about a visual stimulus can be probed experimentally using confidence judgments. How well confidence judgments track performance is also known as metacognitive ability, or simply metacognition (Mamassian, 2016). Whether metacognition can monitor the fluctuation of performance linked to attention is, however, still unclear.

While the effect of attention on metacognition has been considered in the literature, the findings are mixed: some studies showed dissociations between accuracy and confidence during manipulation of spatial attention (Kurtz et al., 2017; Wilimzig et al., 2008) or temporal attention (Recht, Mamassian, & de Gardelle, 2019). Other studies suggested that spatial attention increases both sensitivity and confidence (Denison et al., 2018; Zizlsperger et al., 2012). Most of these studies, however, considered endogenous orienting of attention.

Metacognition is usually depicted as a high-level process: merely under voluntary control, it could potentially share some of the neuronal bases involved in the orienting of endogenous attention (Gilbert & Li, 2013). Its relation to exogenous attention, however, is much less clear: it could even be argued that exogenous attention should evade metacognitive monitoring as much as voluntary control. On one hand, because of its unrepressed nature, exogenous attention could impede metacognition by disrupting high-level cognitive monitoring. On the other hand, the change in signal induced by exogenous attention being largely bottom-up and transient, its effects may simply remain unnoticed by metacognition. These predictions are consistent with the results of one study that found no effect of involuntary attention on confidence (Kurtz et al., 2017).

Another contrasting view depicts metacognition as strongly yoked to the sensory evidence used in perception (e.g., Kiani & Shadlen, 2009), implying that the effect of exogenous attention should be reflected in metacognition. A recent study investigating the combined effects of exogenous and endogenous attention found that metacognition adequately reflected changes induced by attention (Landry et al., 2021). However, no study to date has considered the effect of exogenous attention on confidence by contrasting valid/invalid non-predictive cues exclusively. Assessing whether confidence tracks such a fleeting attentional mechanism should provide important insights on the versatility and limits of metacognitive monitoring.

To arbitrate between these two views, we used an exogenous pre-cueing approach combined with confidence judgments. Participants categorized the orientation of a low contrast Gabor patch (Experiment 1; e.g., Pestilli & Carrasco, 2005) or estimated the orientation of a “clock” (Experiment 2) briefly presented at one of two locations, and then indicated their confidence. The target stimulus was preceded by a peripheral pre-cue, unpredictive of the target’s location, and the onset asynchrony between the cue and the target (hereafter CTOA) was varied. We hypothesized that if confidence could track sensitivity, we should observe a positive effect of valid exogenous pre-cues on confidence mostly at short CTOAs. We also investigated how accurately confidence judgments reflect task performance (i.e., metacognition).

For both experiments, we found evidence that confidence efficiently tracks the involuntary, short-lasting gain in sensitivity induced by exogenous attention. Notably, the deployment of exogenous attention did not disrupt metacognition, which remained stable across all experimental conditions. These results suggest that metacognitive monitoring is able to process the effects of certain low-level sensory modulations occurring beyond the realm of voluntary control.

Experiment 1

Material and methods

Participants

Ten right-handed participants were recruited from the French RISC pool of participants. The sample size was estimated from the validity effect size of two previous studies involving a similar exogenous paradigm. White, Lunau, and Carrasco (2014) found an effect size of d = 1.13, therefore requiring N = 10 to achieve 85% power in a two-tailed t-test. Liu, Pestilli, and Carrasco (2005) found η2 = 0.65, requiring N = 8 to achieve 85% power. We therefore chose N = 10 as a target. This choice is also consistent with a more recent study reporting d = 1.16 (requiring N = 10) with the same paradigm (Fernández, Li, & Carrasco 2019). Estimates were conducted using G*Power. Participants provided informed written consent prior to the experiment and received 30 euros for their time. The experiment was divided into three sessions of 1 h each, over 3 different days. The experimental procedure received approval from the Paris School of Economics (PSE) ethics review board.

Stimuli

Target and distractor consisted in two 2° Gabor patches (spatial frequency: 5 c/°; fixed 12% contrast) with Gaussian envelope. The target was oriented either clockwise or counter-clockwise relative to vertical; its orientation was calibrated beforehand for each participant to reach a 75% average accuracy in the main task (see Calibration section below). The distractor was always horizontal. The target and the distractor were displayed at 5° eccentricity from the center of the screen, on the horizontal midline, one on each side of the screen. A 0.4° fixation dot was presented at the center of the screen. The pre-cue consisted of a 2° black line displayed 1.5° above the target/distractor center. Stimuli were presented on a gray background. The experiment was programmed using Python and the PsychoPy toolbox (Peirce, 2007), and ran on a computer running Linux Ubuntu.

Procedure

Participants sat in a dark room during the experiment, 57 cm from the screen (CRT monitor, 1,920 × 1,080 pixels, 100-Hz refresh rate), with their head maintained using a chinrest. After a 200-ms inter-stimulus interval (ITI), each trial started with the fixation dot displayed for a duration sampled from an exponential decay (scale: 500 ms, bounded within the [300, 1,000] ms interval). This was done to maximize temporal uncertainty about stimuli onset. At the end of this delay, the pre-cue was presented for 60 ms. After a variable cue-to-target onset asynchrony (five different CTOA conditions: 100, 150, 250, 450, and 850 ms, equally spaced in logarithmic scale), both target and distractor were displayed on either side of the fixation dot for 30 ms. Participants were informed that the target was always the non-horizontal Gabor. Participants were asked to categorize the target as clockwise versus counter-clockwise (Type 1 decision) and press the corresponding key on the keyboard (left arrow for counterclockwise, right arrow for clockwise). In 50% of the trials, the target appeared at the same location as the cue (“valid” condition), and for the remaining trials at the opposite location (“invalid” condition). After their response, participants were prompted to report their confidence in their response using the up/down arrow keys (Type 2 decision): is your confidence for this trial higher or lower than average? We reasoned that this form of confidence judgment would encourage participants to report high and low confidence in a balanced manner over the whole experiment, which would be beneficial for our statistical analyses. Participants started with ten practice trials with feedback prior to the calibration (see below), which was then followed by the main experiment. Participants were provided with a 10-s break every 60 trials. The design was fully factorial with 5 CTOAs conditions × 2 pre-cue conditions (valid/invalid), with pseudo randomization per virtual blocks of 20 trials.

Participants were instructed to fixate the center of the screen during the whole trial period, given that target location was unpredictable. The cue was fully unpredictive, and participants had no further incentives to orient their attention voluntarily towards the cued location. As such, no eye-tracking monitoring was used in the present study, but it is reasonable to assume that participants maintained their gaze at the center to maximize their chance to properly discriminate the target. We cannot exclude that a small proportion of the trials might have been affected by incorrect fixation. Although the pattern of results of Experiment 1 suggests that participants did not move their eyes towards the cued location even at longer CTOAs, it might account for some of the negligible evidence observed in the long CTOA condition. Participants completed three sessions of 1 h each, with 560 trials per session (1,680 trials in total).

Calibration

The psychometric function relating orientation discrimination (the proportion of “counterclockwise” responses) to target orientation was estimated prior to the beginning of the experiment for each participant in order to aim for a 75% average perceptual accuracy in the main task. From the participant's perspective, the task during this calibration part looked similar to the one in the main experiment, but the orientation of the target was varied from trial to trial using an Accelerated Stochastic Approximation (ASA) staircase procedure (Kesten, 1958). In the calibration part, the cue was systematically displayed on both the target and the distractor side, CTOA was fixed at 100 ms, and confidence judgments were not requested. At the end of the calibration, the psychometric curve was estimated using Maximum Likelihood Estimation (MLE), to extract angle values (separately for clockwise and counterclockwise targets) leading to 75% accuracy. These values were then kept constant for the main task, to reduce the risk of inflating metacognitive ability estimates (Rahnev & Fleming, 2019).

Figure 1 shows the experimental protocol for Experiment 1.

Measures

We were interested in estimating both perceptual (Type 1) and meta-perceptual (Type 2) sensitivities. We thus used Signal Detection Theory (SDT) to estimate Type 1 sensitivity (d'), which provided us with a bias-free measure of accuracy (Green & Swets, 1966; Macmillan & Creelman, 2005). Trials were grouped using the clockwise-oriented category as signal, leading to four categories of trials: (a) hits, where a CW target was correctly reported as CW; (b) misses, in which a CW target was reported as CCW; (c) false alarms, where a CCW target was reported as CW; (d) correct rejections, where CCW was reported as CCW. This grouping was conducted for each participant and each condition separately, and sensitivity (d') was calculated as the difference in z-scores between the hit rate and the false alarm rate.

As a proxy for Type 2 sensitivity (that is, how well confidence ratings relate to objective accuracy), we used the Meta-d’/d’ ratio, as it is less prone than other measures to shifts in Type 1 sensitivity or response bias. Meta-d’ corresponds to the Type 1 sensitivity that would produce the collected Type 2 (or confidence) responses, if the observers were optimal at the metacognitive level (Maniscalco & Lau, 2012). This value, the meta-d’, can then be compared to the actual sensitivity (d') objectively measured for each participant. In particular, the meta-d' is equal to the d' when the participant has optimal metacognitive access to Type 1 decision information. The ratio meta-d'/d', or “m-ratio” is referred to as metacognitive efficiency. To investigate the effect of cueing on metacognitive efficiency, we thus considered the m-ratio, after estimating d’ and meta-d’ using Maximum Likelihood methods. This procedure was applied for each participant, CTOA and pre-cue validity separately.

Analyses

For clarity, and because we were interested in within- not between-participant variability, the error bars in the following figures are based on the 95% confidence interval (CIs) of the within-participant variability. These CIs were calculated using the Cousineau-Morey intervals (Baguley, 2012; Cousineau, 2005; Morey, 2008). All the analyses were carried out using the R programming language (version 4.0.4, R Core Team, 2013). When necessary, ANOVAs were corrected using the Greenhouse-Geisser adjustment and t-tests were corrected using the Welch-Satterthwaite adjustment. We report the V-statistic from Wilcoxon’s signed rank test using uppercase an T when the Shapiro-Wilk normality test failed, and Student’s t-statistic using lowercase a t otherwise. Bayes factors were calculated using the “ttestBF” functions for t-tests, the “correlationBF” for correlation test, and the “anovaBF” function for ANOVAs, from the BayesFactor R package (version 0.9.12-4.3, Morey & Rouder, 2018). The Bayes factor for interactions in ANOVAs was estimated by comparing a model with the main effects to a model with both the main effects and the interaction. For the Wilcoxon signed rank tests, the Bayes factors were estimated using JASP (version 0.16.1.0). For all analyses, we used the default prior distribution provided with the package. We always report the Bayes factor in favor of the alternative hypothesis (BF₁₀), with values above 3 providing evidence in favor of the alternative hypothesis and values below 0.33 evidence in favor of the null. Meta-d’ values were estimated using the “fit_meta_d_MLE” function from the “metaSDT” R package (version 0.6.0).

Results

Exogenous pre-cues affect performance and confidence at short CTOA

We first evaluated how performance and confidence were affected by exogenous pre-cues, with separate ANOVAs for sensitivity, response times (RTs), and average confidence as successive dependent variables, and with pre-cue validity and cue-to-target onset asynchrony (CTOA) as independent variables.

For sensitivity, we found a significant interaction between CTOA and validity (F(3.2,28.8) = 4.25, p = 0.012, g = 0.08, BF₁₀ = 0.89), with no significant main effect of CTOA (F(2.9,26.3) = 1.18, g = 0.04, p = 0.334, BF₁₀ = 0.12) or validity (F(1,9) = 3.7, g = 0.03, p = 0.088, BF₁₀ = 0.76). Paired Wilcoxon tests at each CTOA confirmed a significant gain in sensitivity for the valid compared to the invalid condition at short CTOAs (100 ms: T(9) = 52, p = 0.0098, r = 0.79, BF₁₀ = 21.48; 150 ms: T(9) = 50, p = 0.020, r = 0.72, BF₁₀ = 7.52) and some evidence for an absence of effect at longer CTOAs (all p > 0.30, BF₁₀ < 0.36). These results confirmed that our cueing procedure successfully triggered exogenous attention (Fig. 2A).

To discard a potential speed-accuracy trade-off, we examined median response times (Fig. 2B), which exhibited the same pattern as sensitivity did. The repeated-measures ANOVA showed an effect of CTOA (F(2.0, 18.2) = 5.41, g = 0.01, p = 0.01, BF₁₀ = 0.06), no effect of validity (F(1,9) = 3.4, p = 0.1, g = 0.003, BF₁₀ = 0.23), but an interaction (F(2.54, 22.87) = 5.10, p = 0.01, g = 0.01, BF₁₀ = 0.09). However, Bayes factors were strongly favoring the null for both CTOA and the interaction. These results demonstrate that the effect of exogenous attention on sensitivity was not the result of a speed-accuracy tradeoff.

Confidence was affected similarly (Fig. 2C). The ANOVA showed a main effect of CTOA (F(2.1,18.6) = 10.11, g = 0.09, p = 0.001, BF₁₀ = 0.87), no effect of validity (F(1,9) = 3.9, g = 0.006, p = 0.079, , BF₁₀ = 0.27), but an interaction between CTOA and validity, despite the Bayes factor providing moderate evidence in favor of the null (F(2,18.1) = 4.07, g = 0.01, p = 0.034, BF₁₀ = 0.12). Paired Wilcoxon tests at each CTOA confirmed a higher confidence for the valid than for the invalid condition at 100 ms CTOA (T(9) = 48, d = 0.66, r = 0.037, BF₁₀ = 3.45, Wilcoxon test), but not for other CTOAs (p > 0.08). In other words, confidence and performance both increased for valid trials at short CTOAs. In addition, we note confidence decreases with CTOAs, which might reflect the increase in response times at longer CTOAs (this effect is unlikely due to temporal expectations, given the higher proportion of short CTOAs). Of note, while sensitivity was boosted for both 100 ms and 150 ms CTOAs, confidence was only significantly boosted at 100 ms. This discrepancy might potentially be due to a lack of power, given that second-order ratings like confidence are usually noisier than first-order responses (Mamassian, 2016).

To confirm the similarity between sensitivity and confidence, we calculated the cueing effect (valid minus invalid) for confidence and sensitivity at each CTOA, and evaluated Pearson’s correlation across CTOAs for each participant. As expected, these correlations were globally positive across participants (mean correlation: r = 0.62; Wilcoxon test: T(9) = 47, p = 0.048, BF₁₀ = 3.00).

Metacognitive sensitivity

To check the presence of overall metacognitive insight, we compared participants’ perceptual sensitivity between high and low confidence trials. A repeated-measures ANOVA with sensitivity as the dependent variable and CTOA, validity, and confidence as independent variables indicated only a significant effect of confidence on sensitivity (F(1,9) = 76.59, g = 0.67, p < 0.001, BF₁₀ = 1.67 × 10⁴²), with no other main effects or interactions (all p > 0.09, BF₁₀ < 0.02). In other words, when participants expressed higher confidence, their sensitivity was indeed higher, which indicates some metacognitive sensitivity (Fig. 3A and B).

Metacognition is stable across conditions

We quantified metacognitive efficiency (Fig. 3C) as the ratio of meta-d’ over d’ for each CTOA, cue validity condition and participant. Metacognitive efficiency appeared stable across conditions (Fig. 3C). An ANOVA on the m-ratio showed no significant effect of CTOA (F(1.7, 15.4) = 2.27, g = 0.06, p = 0.14, BF₁₀ = 0.31) or validity (F(1,9) = 0.002, g < 0.001, p = 0.97, BF₁₀ = 0.22), and no interaction (F(3.5, 31.3) = 0.9, g = 0.02, p = 0.6, BF₁₀ = 0.14). Notably, Bayes factors provided evidence for an absence of effects, which is consistent with the interpretation that validity (or CTOA) affected both meta-d’ and d’ in a similar way, leading to a stable metacognitive efficiency. In other words, participants evaluated their performance adequately despite its fluctuation with cue validity and CTOA.