Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans

Katayama, Risa; Yoshida, Wako; Ishii, Shin

doi:10.1038/s42003-022-03314-y

Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans

Article
Open access
Published: 19 April 2022

Volume 5, article number 367, (2022)
Cite this article

Download PDF

You have full access to this open access article

Communications Biology

Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans

Download PDF

1649 Accesses
1 Citation
41 Altmetric
5 Mentions
Explore all metrics

Abstract

Prediction ability often involves some degree of uncertainty—a key determinant of confidence. Here, we sought to assess whether predictions are decodable in partially-observable environments where one’s state is uncertain, and whether this information is sensitive to confidence produced by such uncertainty. We used functional magnetic resonance imaging-based, partially-observable maze navigation tasks in which subjects predicted upcoming scenes and reported their confidence regarding these predictions. Using a multi-voxel pattern analysis, we successfully decoded both scene predictions and subjective confidence from activities in the localized parietal and prefrontal regions. We also assessed confidence in their beliefs about where they were in the maze. Importantly, prediction decodability varied according to subjective scene confidence in the superior parietal lobule and state confidence estimated by the behavioral model in the inferior parietal lobule. These results demonstrate that prediction in uncertain environments depends on the prefrontal-parietal network within which prediction and confidence interact.

Belief inference for hierarchical hidden states in spatial navigation

Article Open access 21 May 2024

Learning of distant state predictions by the orbitofrontal cortex in humans

Article Open access 11 June 2019

Decoding the view expectation during learned maze navigation from human fronto-parietal network

Article Open access 03 December 2015

Introduction

Animals, including humans, have difficulty in decision-making due to past, present, and future uncertainties related to their geographical surroundings. For instance, when navigating a complex environment, knowing which way to go is more challenging if you are uncertain about your initial location, especially in a setting in which many places look similar. Working out both where you are and where you are going involves accepting assumptions and using them to make predictions. These beliefs can then be updated based on whether your predictions are correct or not. Over time, this leads to varying levels of confidence about your whereabouts and ability to accurately predict future conditions.

Predictions are considered high-order functions since they incorporate varying degrees of uncertainty to facilitate effective decision-making. Additionally, these predictions are considered subjective because they may not be entirely rooted in objective reality. In fully observable decision-making contexts in which one’s current state is certain, it has been shown that explicit predictions can be decoded from brain activity. This has been demonstrated in the context of spatial navigation^1,2 and perceptual decision-making tasks^3,4,5 but has not yet been determined in partially observable environments. Although previous studies of Bayesian modeling reproduced subjective beliefs in predictions during a human navigation task in a partially observable maze⁶, no studies have specifically investigated the decodability of explicit predictions.

Assuming that confidence is a measure of introspection to neural representations, it would increase as the environment becomes more predictable, to the point that the uncertainty of the belief would be resolved. Likewise, the decodability of predictions would also increase as the predictions become more confidently refined⁷. Previous studies have consistently shown that anterior prefrontal cortical activity correlates with subjective confidence estimates^8,9,10,11. It was also reported that information necessary for decision-making, such as value difference⁹ and liking rating¹⁰, is represented by brain activity in the same prefrontal region. In the context of multi-voxel pattern analysis (MVPA), confidence has been decoded from regions in the prefrontal and parietal cortices^12,13, and one parietal region was identified to encode decisions¹². Although these studies have suggested that internal representations and their introspections are tightly coupled, uncertainty stems directly from the experimental stimuli. In reality, environmental uncertainty or unpredictability must be dynamically resolved by continuously incorporating new information into the decision-making process¹⁴. Therefore, confidence changes are associated with uncertainty resolution. However, the relationship between confidence and decodability for predictions has never been assessed under dynamic, lifelike conditions because of difficulties in monitoring and reproducing dynamically changing internal representations. Thus, we used Bayesian modeling, which was also used to assess neurophysiological activities during a rodent goal-reaching task¹⁵, to study and reproduce subjects’ predictions and confidence based on their behaviors.

In this study, we sought to examine the following: (i) whether explicit predictions of the upcoming scene can be decoded from brain activity, and, if so, where this information is localized, and (ii) whether subjective confidence in these predictions can be decoded, and how confidence interacts with the predictions themselves. Accordingly, we conducted functional magnetic resonance imaging (fMRI) scanning during a virtual maze navigation task in which subjects explored a previously learned maze consisting of four-walled rooms with either an open or a closed door on each wall (Fig. 1, Supplementary Fig. 1). Subjects were initially placed in an unknown room with an unknown orientation (state) and provided only 3D scenes of the rooms. When subjects explored the maze by selecting one of the open doors leading to the next room, they were occasionally asked to predict the 3D scene that would be seen in the next room and to report their confidence for that prediction. This task involved decision-making combined with uncertainty resolution stemming from the partial observability of the environment, that is, the uncertainty in our paradigm is a composite of the uncertainty about the observed information in the context of standard perceptual tasks and the uncertainty arising from memory limitations in pure memory tasks. We hypothesized that scene prediction in a partially observable environment would be encoded in the parietal and prefrontal cortices, where activity is frequently observed in spatial navigation and planning^16,17,18,19, while its confidence would be encoded in the anterior part of the prefrontal cortex^8,9. However, it remains unclear whether prediction and confidence interact at the level of neural activity. We supposed that if the confidence modulates the neural representation encoding scene prediction, the decodability of the prediction would vary with the confidence level.

Results

Thirty-three healthy subjects (aged 20–32 years; four females) performed the maze navigation task (Fig. 1). Six subjects were excluded from the analyses due to low scene prediction accuracies (27.1–31.5%, see also Supplementary Fig. 2). Missed trials (mean frequency ± standard deviation [SD]: 0.7 ± 1.1%), defined as trials in which subjects did not complete the upcoming scene prediction and confidence reporting within the allotted time (4.5 s), were also excluded from the analyses of individuals.

Behavioral results

The average prediction accuracy was 54.6 ± 14.4% (mean ± SD) for 27 subjects who were included in the behavioral analysis. Figure 2a represents an example of a subject’s behavioral profile in the prediction trials of four consecutive games. The prediction accuracy was significantly higher when the confidence was high (confidence levels 3 and 4) compared to when it was low (confidence levels 1 and 2) (Fig. 2b, one-sided Wilcoxon signed-rank test, p = 3.3 × 10⁻⁶). Subjects were able to accurately assess their confidence even though they were required to do so before they chose a predicted scene.

The initial states varied for each game and remained unknown to the subjects. Therefore, to predict an upcoming scene, the subjects were required to infer the hidden current state from the history of actions and observed scenes (i.e., the task was a partially observable Markov decision process [MDP]²⁰). As such, the uncertainty about the current state could be resolved as subjects continued looking for the goal, and accordingly, the scene prediction accuracy and confidence would have likely increased. The scene prediction accuracy increased with every successive prediction trial within a single game (Fig. 2c, r = 0.27, p = 1.3 × 10⁻⁵), as did the confidence level (Fig. 2d, r = 0.29, p = 2.6 × 10⁻⁶). Furthermore, the prediction accuracy was significantly correlated with the number of recent consecutive correct prediction trials (Fig. 2e, r = 0.34, p = 6.6 × 10⁻⁵); however, prediction accuracy was not correlated with the number of consecutive incorrect prediction trials (r = −0.08, p = 0.35). These results lend support to our assumption that the subjects resolved state uncertainty and were able to infer their state more accurately with continued maze exploration.

We predicted that the subjects would make quicker choices when their confidence levels regarding the upcoming scene prediction were higher than when they were lower. Aligned rank transformation analyses of variance revealed that scene choice reaction times (RTs) were significantly shorter in the high-confidence trials than in the low-confidence trials (Fig. 2f, F(1,5435) = 77.83, p = 1.5 × 10⁻¹⁸) across all subjects. Similarly, scene choice RTs were shorter in the correct trials than in the incorrect trials (F(1,5435) = 33.89, p = 6.2 × 10⁻⁹). There was no interaction effect between the prediction correctness and confidence level (F(1,5435) = 1.8 × 10⁻², p = 0.89).

Neural correlates of scene prediction

First, we carried out a univariate general linear model analysis during predicting the upcoming scene (first 4 s of the delay period) and found significantly higher brain (BOLD) responses in the bilateral superior parietal lobules (SPL; Brodmann area [BA] 7), bilateral inferior parietal lobule (IPL; BA40), left dorsal premotor cortex (PMd; BA6), and left anterior prefrontal cortex (aPFC; BA10) (Fig. 3a; the time-series of the brain activity in each ROI were shown in Supplementary Fig. 3). The statistics are summarized in Supplementary Table 1. One subject was excluded from the imaging and decoding analyses due to his/her larger head motion; accordingly, the following analyses included 26 subjects. We used voxel clusters in the cortical regions above as regions of interest (ROIs) for subsequent decoding analyses.

**Fig. 3: Imaging analysis results and decoding accuracies.**

Decoding analysis of scene prediction and confidence

We performed a multi-voxel pattern analysis using the voxel-wise activity patterns of the SPL, IPL, PMd, and aPFC. For the SPL and IPL, the left and right ROIs were combined, as in the recent decoding studies on navigation^1,2. As the inputs for both the scene prediction and the confidence decoders, we used brain images of the delay period during which the subjects predicted an upcoming scene in their mind without any visual information. To probe the sensitivity of the decoders to pattern activation time course, we constructed the decoders of the scene prediction and the subject’s reported confidence level at nine different time points, the 0th to the 8th decoding periods: the decoders at the t-th period used four consecutive scans starting from t s after the onset of the delay period (Supplementary Fig. 4a).

For each decoding period, we constructed six binary classifiers, each of which corresponds to the probability associated with one type of scene, and the scene with the maximum probability was defined as the output of the decoder. In the following decoding analyses, we used the scenes chosen by the subjects for the target labels of the scene decoder regardless of whether they were correct or not. Scene #3 was excluded due to its rarity (Fig. 1c, 13.9 ± 7.0 trials, 6.8 ± 3.2% of the whole). For the confidence decoder, we used a binary classifier with high (confidence level 3 or 4) and low (confidence level 1 or 2) confidence categories. One subject was excluded from the confidence decoding analysis because he/she reported high confidence in three out of 233 prediction trials. Mean decoding accuracies in terms of leave-one-session-out (LOSO) cross-validation (CV) were averaged across all subjects, representing a total of 26 for scene prediction decoding and 25 for confidence decoding.

The MVPA for scene prediction generated classification accuracies that were significantly higher than chance in SPL, IPL, and PMd using a one-sided Wilcoxon signed-rank test (Fig. 3b). However, results from the aPFC were not significant (Fig. 3b, SPL, 18.6 ± 2.3%, p = 3.3 × 10⁻⁴; IPL, 18.5 ± 2.0%, p = 1.9 × 10⁻⁴; PMd, 17.9 ± 1.8%, p = 1.1 × 10⁻³; aPFC, 17.2 ± 1.8%, p = 9.8 × 10⁻²). In contrast, the confidence level could be decoded from activity within all four ROIs (Fig. 3c, SPL, 63.4 ± 9.2%, p = 1.2 × 10⁻⁵; IPL, 63.9 ± 9.0%, p = 1.1 × 10⁻⁵; PMd, 60.8 ± 8.1%, p = 2.2 × 10⁻⁵; aPFC, 59.4 ± 8.4%, p = 3.4 × 10⁻⁵). The results of our time-series decoding analysis are shown in Supplementary Fig. 4b–e. There was no significant positive correlation between the number of samples (frequency) and the decoding accuracy of each scene in all four ROIs at nine decoding periods (i.e., 36 decoders in total; Supplementary Table 2). Additionally, while the size of ROIs varied from 497 to 2686 voxels, the number of selected features was almost constant between the different ROIs (Supplementary Fig. S4f, g). Therefore, the unbalanced number of samples and dimensionalities of ROIs were found not to distort the decoding results. We also confirmed that there was no positive bias in the decoding accuracy even though the data for decoder evaluation was also used for the ROI selection²¹ (Supplementary Fig. S4h, i).

Based on the analysis of scene choice RTs, we expected that the distinctiveness of scene prediction may be different when the subjects had high confidence in the prediction compared to low confidence, and when the subjects successfully predicted the true upcoming scene compared to when they did not. We assessed the accuracy of the scene prediction decoders in the high-confidence versus low-confidence trials and in the correct versus incorrect trials. Here, we constructed four independent decoders trained and tested using different subsets of data: correct or incorrect prediction trials (regardless of the confidence level), and high- or low-confidence trials (regardless of the correctness) using a leave-one-game-out (LOGO) CV procedure. For each decoder, subjects who had fewer than three samples in the training subset for any individual scene label were excluded from the analysis. Consequently, we used data from 17 subjects for the decoders trained by the high-confidence trials (high-scene-confidence-only decoder), 21 subjects for the low-scene-confidence-only decoder, 24 subjects for the correct-only decoder, and 18 subjects for the incorrect-only decoder. We examined the accuracies in the SPL, IPL, and PMd, which corresponded to the ROIs with acceptable scene prediction decodability.

When comparing the decoding accuracies between the two categories of prediction correctness (correct versus incorrect trials), the three ROIs showed similar patterns of scene prediction decoding accuracies; the decoders tended to exhibit significantly higher accuracies than chance in the correct trials, but not in the incorrect trials (Fig. 3d). However, the voxel activity patterns of the SPL allowed us to decode the subjects’ predicted scenes, even in the incorrect trials, especially in the relatively early stages of the delay period. The differences between the accuracies of the correct-only and the incorrect-only decoders were highly significant for the 5th to the 8th decoding period with the SPL responses, for the 4th and 8th period with the IPL responses, and only for the 4th period with the PMd responses. We also confirmed that the beta estimates and the percent signal changes (PSCs) in the ROIs (SPL, IPL, and PMd) were not significantly different between the compared conditions (Supplementary Fig. 5a, c). Therefore, the differences in the scene prediction decoding accuracy were not due to the activity differences that were examined in the univariate analyses.

When separately decoding scene prediction between the high and low confidence levels, the decodability in the time-series analysis behaved differently between the ROIs (Fig. 3e). When trained with the high-confidence trials, the decoders with the SPL responses exhibited significantly higher accuracy than chance for the 0th and the 3rd to 8th periods, while the accuracy of the low-scene-confidence-only decoder did not differ from chance (except for the 8th period). In addition, for the 6th to 8th periods, the decoding accuracy in the high-scene-confidence trials was significantly higher than that in the low-confidence trials (one-sided Wilcoxon rank-sum test, for the 6th period, p = 6.8 × 10⁻⁵; the 7th, p = 1.5 × 10⁻⁴; the 8th, p = 6.9 × 10⁻⁴). This difference was more pronounced when comparing the trials with the highest confidence level (confidence level 4) with those with the lowest level (confidence level 1) (Supplementary Fig. 5e, for the 3rd period, p = 2.7 × 10⁻²; for the 4th period, p = 1.6 × 10⁻²). To assess the relationship between the confidence level and scene prediction decodability, we also compared the decoding accuracy for the 6th period on three scales: the highest (confidence level 4), moderate (2 and 3), and lowest (1). We found that the decoding accuracy significantly increased as the confidence level increased (Supplementary Fig. 5f, r = 0.52, p = 3.8 × 10⁻⁴). Additionally, we confirmed that there was no significant difference in the beta estimates between the high-scene-confidence trials and the low-scene-confidence trials (Supplementary Fig. 5d). When comparing the PSCs between the high-confidence trials and the low-confidence trials, there was a weakly significant difference from t = 8 to 10 s since the onset of the delay period (Supplementary Fig. 5b). These results indicate that the difference in decodability depending on the confidence level in SPL, especially up to the 7th period, is not due to the difference at the univariate level. On assessing the voxel activity patterns in the IPL and PMd, the scene prediction decoder outperformed chance in the later decoding periods (for the 5th to the 8th period in the IPL; for the 4th to the 8th period in the PMd). However, the difference between the accuracies of the high-scene-confidence-only and the low-scene-confidence-only decoders was weakly significant only in the 6th and 8th periods for IPL (the 6th period, p = 4.2 × 10⁻²; the 8th, p = 2.6 × 10⁻²), and only in the 6th period for PMd (p = 2.8 × 10⁻²). In summary, the correctness of the scene prediction affects the scene prediction decodability with the voxel activity patterns of SPL, IPL, and PMd, while the confidence level of the scene prediction influenced the decodability with the SPL responses only.

Computational model of maze navigation behavior

To predict an upcoming scene, subjects must infer their hidden current state and then mentally simulate the next state based on their chosen action and the environmental model (maze structure). State inference was inherently uncertain at the beginning of each game, but prediction accuracy was improved as subjects explored the maze more and completed more prediction trials (Fig. 2c). To reproduce the subjects’ internal decision-making process, we implemented a hidden Markov model (HMM) of the subjects’ maze exploration behaviors based on previous modeling studies^6,21 (Supplementary Fig. 6). We integrated the following cognitive state variables into our HMM: i) a state inference, which is the belief about one’s location and orientation in the maze, and ii) the confidence level for the state inference (i.e., high or low state confidence). We assumed that the subjects used a simple switching mechanism between two strategies depending on their state confidence;^21,22,23 when they were uncertain about their state inference (low state-confidence level), they moved forward if possible to maximize information to identify the current state (forward-dominant strategy), while when they were certain about their state (high state-confidence level), they tended to move to grid spaces that they had not previously visited (efficient-exploration strategy). In terms of negative log evidence and AIC, this model performed better than the other models with a single strategy (Supplementary Tables 3 and 4).

Figure 4a represents two examples of subjects’ behaviors in the maze (left panels) and the most probable paths produced by our model (right panels). The upper and lower panels correspond to subjects with representatively good and poor scene prediction performances, respectively. Our model was good at predicting subjects’ actions in the action trials (95.8 ± 3.0% overall, and 94.2 ± 6.6% or 84.7 ± 12.9% when a scene had two or three open doors, respectively), as well as subjects’ scene choices in the prediction trials (63.3 ± 15.0%). The HMM reproduced the subjects’ scene choices in all correct trials (number of correct trials per subject, 112.6 ± 30.9), while its reproducibility in incorrect trials was significantly lower than that in correct trials (21.0 ± 10.8%; number of incorrect trials per subject, 92.7 ± 32.7).

**Fig. 4: Results of behavioral and decoding analyses based on the computational model of human navigation.**

The model allowed us to estimate the progression of state-confidence levels for each subject. If the model is reasonable, the state confidence should increase as exploration progresses; if the state-confidence level is high, subjects would be able to make correct scene choices in the prediction trials. The proportion of high-state confidence levels increased as the number of prediction trials increased within a given game (Fig. 4b, r = 0.63, p = 2.6 × 10⁻²⁸), and the prediction accuracy was significantly higher in the high-state-confidence trials than in the low-state-confidence trials (Fig. 4c, one-sided Wilcoxon signed-rank test, p = 5.8 × 10⁻⁶).

The state-confidence estimated by our model matched with the scene-prediction-confidence levels reported by the subjects: 63.2 ± 10.5% for 27 subjects (see also “Discussion”). The state-confidence was also decodable from the voxel activity patterns of four ROIs in which the scene-confidence was found to be decodable (Fig. 4d, one-sided Wilcoxon signed-rank test, SPL, 59.5 ± 6.0%, p = 7.8 × 10⁻⁶; IPL, 58.5 ± 5.9%, p = 5.6 × 10⁻⁶; PMd, 56.0 ± 5.3%, p = 2.4 × 10⁻⁵; aPFC, 55.2 ± 6.1%, p = 3.3 × 10⁻⁴). The time-series decoding analyses yielded the same results, regardless of the decoding time points (Supplementary Fig. 7a).

We compared the scene prediction decoding accuracies between the high- and low-state-confidence trials. Two decoders were individually trained using high- and low-state-confidence trials and evaluated using the LOGO CV procedure. Some subjects were excluded from each of the two decoders given the small number of samples (≤2) for at least one type of scene, resulting in a sample size of 21 subjects for the high-state-confidence-only decoders and 20 subjects for the low-state-confidence-only decoders.

Figure 4e represents the results of the SPL, IPL, and PMd. As an overall trend, the time-series decoding analysis revealed that the scene prediction decoders’ accuracy was significantly higher than chance in the high-state-confidence trials within all three ROIs, but not in the low-state-confidence trials, except for the latter decoding period (the 4th to the 8th). The difference in the decoding accuracies between the high- and low-state-confidence trials was highly significant when using the voxel activity patterns of IPL for the 5th period (one-sided Wilcoxon rank-sum test, p = 4.7 × 10⁻³). We also confirmed that the beta estimate in IPL was not different between the high- and the low-state-confidence trials and that the PSCs did not consistently show a significant difference at the time points corresponding to the 5th decoding period (Supplementary Fig. 7a, b). When the decoders were trained with the SPL or PMd responses, there were decoding periods for which the scene prediction decoders exhibited a weaker significant difference in accuracy between the two state-confidence levels.

Discussion

This study demonstrated that future scenes predicted by human subjects during maze navigation, as well as their corresponding prediction confidence, can be decoded from fMRI activity patterns in localized regions of the prefrontal and parietal cortices. To successfully predict an upcoming scene in the task, subjects needed to infer their current state about which uncertainty was intrinsic due to the partial-observability of the maze. Our decoding target was subjective prediction, which had to be mentally simulated by incorporating subjective inference, given that neither the visual information given by the environment nor the memorized map information was sufficient to independently guide maze exploration behaviors. This highlights the major contribution of our study: decoding subjective prediction adds to the scope of most previous studies that have focused on decoding aspects of the information provided to the subjects directly, including visual^24,25, tactile²⁶, and pain²⁷, or distinctly maintained in the brain, such as memory^{28,29,30,31,32,33}, imagination^34,35,36,37 and dreaming³⁸. Although some recent decoding studies have revealed that predicted visual events are also decodable using fMRI^1,2,5 and electroencephalography³, they did not take into consideration the prediction uncertainty or confidence. The current study uses ongoing mental simulation in the context of uncertainty and allows us to specifically investigate the neural basis of prediction uncertainty.

Using a decoding analysis for four ROIs that were identified by univariate analysis, we showed that both scene prediction and confidence level were decodable from the voxel activity patterns in the SPL, IPL, and PMd. The time-series analysis also showed that the scene prediction decoding accuracy gradually increased from the 5th to 7th periods, which included the BOLD signals after 4–6 s from the delay onset in the SPL and IPL. Although there was no clear peak or a sharp drop in accuracy following the delay period, this could be due to the wide range of time during which the upcoming scene was predicted, or to the need to maintain the predicted scene throughout the prediction trial to report it in the subsequent scene choice period. Meanwhile, our decoding analysis demonstrated that only confidence could be decoded from the aPFC activity. These results suggest that the aPFC may be involved in encoding subjective confidence, but not the associated prediction. Recent studies have shown that the aPFC is a key brain area involved in the metacognitive assessment. In addition, human fMRI experiments have revealed that this region shows significant activation during self-confidence reporting in perceptual⁹ and value-based decision-making tasks⁸. It was also demonstrated that metacognitive accuracy in perceptual decision-making is significantly correlated with the strength of functional connectivity arising from the lateral aPFC³⁹. Neuroanatomically, the gray matter volume⁴⁰, white matter microstructure⁴⁰ and gray matter microstructure⁴¹ in the aPFC are correlated with individual introspective ability. Furthermore, some studies have also reported that perceptual confidence can be decoded from the lateral PFC subregions using MVPA^12,13. Our results are therefore consistent with those of previous studies, supporting the theory that the aPFC may be an important center for metacognitive processing.

Behaviorally, when a subject’s confidence in the scene prediction was high, they tended to make a correct scene choice more quickly than when their confidence was low, which is consistent with previous findings in perceptual judgment tasks, such as motion detection^42,43 and two-choice discrimination tasks⁴⁴. Neurophysiologically, short RTs have been associated with decreased variability of neural activities, such as reproducible activity patterns in rodents⁴⁵ and firing rates in nonhuman primates⁴⁶. Furthermore, a human transcranial magnetic stimulation study showed analogous results in which the variability in corticospinal excitability was suppressed in fast-response trials in a bandit task⁴⁷. Based on these findings, we expected that the variability of fMRI voxel activity patterns representing a predicted scene would decrease when the subjects’ confidence level was high compared to low, enabling the decoder to distinguish the voxel patterns more accurately.

When we assessed the effects of confidence on the decodability of scene prediction, we found that the scene prediction decoders trained with the SPL activities in the high-scene-confidence trials performed significantly better than those with the low-scene-confidence trials, especially in the 6th to the 8th decoding period. An additional decoding analysis was performed by excluding one multiplicative condition ([correct, incorrect]×[high, low]) out of 2 × 2, and it was suggested that the neural representation of scene prediction in SPL was affected by both the confidence level (Supplementary Fig. 5g, ART-ANOVA, 6th period, F(1,96) = 9.37, p = 2.9 × 10⁻³; 7th period, F(1,96) = 4.25, p = 4.2 × 10⁻²) and the prediction correctness in the 6th and 7th decoding periods (6th period, F(1,96) = 4.10, p = 4.6 × 10⁻²; 7th period, F(1,96) = 4.42, p = 3.8 × 10⁻²). Previous studies have demonstrated that MVPA decoding accuracy is influenced by some behavioral and perceptual performance measures, such as tactile discrimination ability²⁶, spatial memory accuracy⁴⁸, reaction time⁴⁹, and familiarity with auditory stimuli, both intra-⁵⁰ and inter-individually^26,48,49. Moreover, MVPA has shown that the neural distance, which is the distance from a classifier hyperplane in the activation space, is robustly correlated with RT in the context of categorization tasks^51,52. Thus, this study adds to the existing literature by uncovering a clear relationship between decoding accuracy and subjective reporting of metacognition.

The prediction confidence level affected the scene prediction decoding accuracy in the SPL, whereas there was no effect on the decodability in the IPL. These results suggest that SPL represents the predicted scene itself and that IPL could be involved in other processes related to scene prediction. Based on the state-confidence level estimated by our HMM, the IPL scene prediction decoder was found to perform better in high-state-confidence trials, while the SPL decoder did not differ between high and low state-confidence levels (Fig. 4e). We also determined that only the IPL scene prediction decoder, excluding the correct and high-state-confidence trials, failed to show higher accuracy than chance (Supplementary Fig. 7d). These results support that state confidence partly affects the decoding accuracy of scene prediction with IPL responses. In addition, when we looked at the time-series of decodability, the IPL scene prediction decodability in the high-state-confidence trials reached a peak slightly earlier (the 6th period) than the SPL scene prediction decodability in the high-scene-confidence trials (the 7th period). These results suggest that IPL may be related to the neural representation of state inference, which functions as upstream information processing when predicting the upcoming scene.

The decoding accuracies of scene prediction in the IPL and SPL were thus influenced by the confidence level for state inference and scene prediction, respectively. The parietal cortex is known to be involved in spatial information processing, including navigation^16,17,18,19 and there is abundant evidence from human fMRI^{53,54,55,56,57,58}, nonhuman primate^59,60,61,62, and rodent physiological studies⁶³ that the posterior parietal cortex is crucial in egocentric forms of spatial information processing. In contrast, IPL has been shown to be associated with allocentric spatial information processing, such as viewer-independent spatial memory⁶⁴, object-based spatial judgement^65,66, and object-based attention⁶⁷. Positron emission tomography studies have also shown that the IPL may be involved in the conversion of allocentric-to-egocentric spatial representation in navigation tasks⁶⁸. Our decoding results could be interpreted to suggest that IPL represents the allocentric state inference based on a memorized map, possibly encoded in the hippocampus, while the predicted scene is encoded in SPL as egocentric spatial information for navigation. Although this is speculative and our current data are not suitable for further detailed analysis, the decoding of subjects’ state inference, that is, subjects’ belief of the position in the maze, may possibly support our interpretation.

In contrast to the SPL and IPL, the PMd decoding accuracy of scene prediction was affected by neither scene-prediction-confidence nor state confidence, although it was significantly higher than chance in the correct trials. Previous studies have reported that the PMd showed significant activation related to action selection⁶⁹ and significant decodability of movement-related information such as hand⁷⁰ and target positions⁷¹. Based on these studies, we speculated that PMd might specifically be involved in action planning in our maze exploration task because a scene is defined by a set of open (passable) or closed (impassable) doors in our experimental design. Unfortunately, we found no further support for this speculation in this study.

Our HMM-based behavioral model allowed us to successfully estimate subjects’ state confidence, although it was not explicitly reported by the subjects in the experiment. According to our view of an incremental Bayesian filtering process, scene prediction should incorporate a mental simulation process utilizing the memory of the maze structure after inferring the current state in the maze. Since the scene prediction follows the state inference, it includes additional uncertainties that vary from subject to subject, such as the degree of the imperfectness of the maze structure memory. The subject-reported scene-prediction-confidence may not necessarily match the estimated (but objectively reconstructed by our model) state confidence, as represented in the subjects’ brain. In fact, the agreement between the scene-prediction confidence (subject-reported, high or low) and the state confidence (model-estimated, high or low) was 63.2 ± 10.5% for all prediction trials, suggesting that there may be a difference between these two types of confidence. The difference in the time-course decodability between the SPL and IPL corresponded to this difference between the confidence types.

This study demonstrated that the localized neural representations of predictions during maze exploration differed depending on the confidence level of prediction. In addition, we noted that SPL and IPL appear to have different involvements in scene prediction. However, there are some limitations that should be addressed. Considering the assumed decision-making process underlying maze exploration, it is plausible that confidence or uncertainty about scene prediction could be affected by prior knowledge (i.e., the memory of the maze structure), which may not be perfect. Accordingly, future studies should probe and verify the effects of imperfect prior knowledge on confidence. Moreover, the detailed process of generating confidence from state inference and scene prediction has yet to be explored. It would be an interesting avenue for future studies to consider and verify how the parietal-prefrontal downstream circuit, which includes the IPL, SPL, and aPFC that would hierarchically encode prediction during maze exploration, decodes uncertainty and subjective confidence, to better enhance our understanding of the neural substrates involved in the decision-making process in uncertain environments.

Methods

Subjects

Thirty-three healthy subjects were recruited to participate in the experiment and provided written informed consent. This study was approved by the ethical committees of the Advanced Telecommunications Research Institute International, Japan, and the Graduate School of Informatics, Kyoto University, Japan. Six subjects whose scene prediction accuracy in the experimental task was not significantly higher than chance (one-sided z-test, p ≥ 0.01, see also Supplementary Fig. 2) were excluded from analyses. Another subject was also excluded from imaging and decoding analyses due to his/her large head motion (more than 5% of TRs at Framewise Displacement threshold 0.5 mm). No statistical method was used to predetermine the sample size, but the sample size for our analyses was comparable to those generally employed in the field.

Maze exploratory navigation task

The experiment consisted of two tasks: a training task outside the fMRI scanner to learn the structure of the maze (see “Training task”), followed by a maze scene prediction task inside the fMRI scanner. We used a single 5 × 5 grid maze for all subjects, in which each grid had either an open (passable) or closed (impassable) door on all four sides (Supplementary Fig. 1)⁷². We used the same maze for all subjects so that there would be no difference in the level of task difficulty between individuals. The maze was partially observable such that at each state (position and orientation), subjects could only observe the current scene (i.e., the status of the doors to the left, right, and forward)^2,6,21. We created a maze to satisfy partial observability, that is, at least two consecutive observations from the initial state are required to identify the current state. The experimental and training tasks were programmed using Psychopy3⁷³.

In the scene prediction task, subjects freely explored the maze and were intermittently asked to predict the upcoming scene and to rate their level of confidence about the prediction (Fig. 1). Each game started in an unknown initial state (i.e., a combination of position and orientation), with the current 3D scene (i.e., the status of the doors to the left, forward, and right) displayed on a screen. Subjects were requested to choose an action to move to the left, forward, or right grid space by pressing a button within 2.5 s, and subsequently, the 3D scene at the next state was presented in the next trial (Fig. 1a). If subjects chose an impassable door, they remained in the same state and the same scene was presented again in the subsequent trial (2.7 ± 3.1 trials for 27 subjects, 0.6 ± 0.7%). If an action was not taken within the allotted time, one of the passable doors was chosen by the computer (12.7 ± 9.8 trials, 2.8 ± 2.0%). After repeating an action trial 1–5 times, a prediction trial was performed (Fig. 1b). In a prediction trial, a fixation cross was displayed for 4–6 s (delay period) instead of the next scene presentation, and subjects were requested to predict the upcoming scene. In principle, although the upcoming scene was determined by the state and action in the previous trial, the intrinsic uncertainty of each state in the partial-observable maze required subjects to make inferences about their current state based on the history of actions and observed scenes. After the delay period, subjects were first asked to evaluate their level of confidence about their upcoming prediction on a four-point scale (1: lowest confidence; 4: highest confidence). The display positions of the four options were randomized for each prediction trial, and a white frame appeared around the selected option as feedback (1.5 s). After the confidence report, four out of seven possible scenes were displayed, and the subjects were requested to select the scene corresponding to their prediction of the upcoming scene. The scene options always included the correct upcoming scene and three other scenes, and their display positions on the screen were randomized. Distracters were selected from the set of possible scenes in which the local structure was consistent with the true state (i.e., states that could be reached from scenes similar to the scene presented in the previous trial using the selected action). Here, the design included four out of seven existing scenes as the predicted scene options, rather than the complete set of existing scenes, to make it easier for subjects to report their prediction by choosing one option within the time limit. Importantly, subjects were asked to report their confidence level prior to the presentation of the scene options, because there is a possibility that the limited options induce a large change in their prediction confidence. After selecting a scene option, a green or red frame appeared around the chosen scene, indicating that the choice was correct or incorrect, respectively. In the next action trial, the scene of the true subsequent state was presented, irrespective of the correctness of the subject in scene prediction. The allotted time for both reporting confidence level and selecting an upcoming scene option was limited to 4.5 s in order for subjects to make each choice as soon as possible.

Once a subject reached the termination condition, a yellow star was displayed on the door, leading to the final state. A termination condition was reached if the subjects were experiencing the state for the first time and had performed at least five prediction trials. Subjects were not provided details regarding the termination condition, but they were informed that both the initial and final states varied between games. Each game consisted of 5–20 blocks (5.5 ± 1.2), with each block consisting of 1–5 action trials and one prediction trial. The number of action trials was randomized for each block. Each subject performed up to 40 total games (38.2 ± 4.2 games, 208.5 ± 19.5 prediction trials), which were divided into three or four sessions.

In our experimental design, the subjects judged their decision confidence before indicating their predicted scene. One may note the effect of this prospective confidence evaluation, since, in many previous studies, confidence was reported retrospectively. A previous study suggested that prospective confidence is more discrepant from objective performance than retrospective confidence⁷⁴, but another study reported that there was no significant difference in the confidence rating when decision making was not communicated but performed before the confidence judgement⁷⁵. In our current study, subjects performed the confidence report followed by the predicted scene choice to prevent their scene prediction and confidence judgment from being modified after the presented scene options.

Training task

Subjects performed multiple training sessions to sufficiently learn the structure of the maze outside of the fMRI scanner on the day before or the same day as the scanning experiment. If the subjects performed the training sessions on the previous day of the scanning experiment, they received re-training on a short version of the training task (one or two training sessions) to avoid the confounding effects of forgetting. One training session consisted of five games in two parts: the first three games were in the practice part in which subjects explored the maze while referring to a printed 2D map, and the latter two were in the test part where they explored the maze without the map and were occasionally asked to predict the upcoming scene.

In the practice games, a printed 2D map of the maze was given to each subject, and they were free to refer to it at any time. At the beginning of each game, the 2D maze map showing the initial state was displayed on a computer screen for 5 s; the initial state, comprised of both position and orientation, was depicted as a red arrow on one of the grid squares. The initial state varied between games. Subjects were then given unlimited time to select a movement action from the initial state by pressing a button, after which the 3D scene corresponding to the next state was presented. When the subjects reached a termination condition, a yellow star appeared on a door leading to the final grid. A game was terminated if subjects visited a grid for the first time in the game after they performed twenty action selections. Subjects were not instructed about the termination condition but were informed that the final state varied between games. In the practice part, subjects thus performed only the action trials in the maze navigation task.

The test games were similar to the maze navigation task but without confidence ratings. Here, subjects performed the scene prediction task from an unknown initial state and were not able to refer to the printed map. Each trial began with the 3D scene corresponding to the current state displayed on the screen and subjects were given 2.5 s to select an action. After several action trials, subjects were required to perform a prediction: a fixation point was displayed for 4 s (delay period) and then were given 4.5 s to choose the option corresponding to their prediction of the upcoming scene from four options. Like the experimental task, the options consisted of the true upcoming scene and three distractor scenes randomly selected from the set of scenes with a similar local structure to the correct option, and then a green (correct) or red (incorrect) frame was presented around the selected scene as the choice feedback. The subsequent action trial displayed the true upcoming scene, irrespective of the subject’s scene prediction correctness. We refer to a set of 1–5 action trials followed by a prediction trial as a block, and subjects repeated these blocks until they reached a termination condition. The first block included ten action trials, allowing the subjects to explore the maze to gather more information about their position in the maze before the first prediction trial. The termination conditions were identical to those used in the scanning experiment. The subjects performed two games in the test part, each of which consisted of 5–48 blocks (7.0 ± 4.2).

All subjects performed at least seven training task sessions, up to a maximum of 1.5 h in total. If a subject’s scene prediction accuracy averaged across two test games exceeded 80%, he/she was allowed to end the training task. The best prediction accuracy was 79.2 ± 19.9% for all 33 subjects, 85.3 ± 16.2% for 27 subjects who were included in the behavioral analysis, and 85.0 ± 16.5% for 26 subjects who were included in the imaging and decoding analyses. Here, the best prediction accuracy means the best accuracy for each subject across all sessions, while session accuracy was averaged over two test games.

Image acquisition and analysis

A 3.0-Tesla Siemens MAGNETOM Prisma fit scanner (Siemens Healthineers, Erlangen, Germany) with a standard 64 channel phased array head coil was used for image acquisition. We acquired interleaved T2*-weighted echo-planar images (EPIs) (TR, 1000 ms; TE, 30 ms; flip angle, 50°; matrix size, 100 × 100; field of view, 200 × 200; voxel size, 2 × 2 × 2.5 mm; number of slices, 66). Volume acquisition was synchronized with the onset of the fixation cross-presentation during each prediction trial. We also acquired whole-brain high-resolution T1-weighted structural images using a standard MPRAGE sequence (TR, 2250 ms; TE, 3.06 ms; flip angle, 9°; field of view, 256 × 256; voxel size, 1 × 1 × 1 mm).

Imaging data were analyzed using SPM12 (Wellcome Department of Cognitive Neurology, London, UK). For each subject, all functional images were aligned to the first image as a reference, coregistered to the individual high-resolution anatomical image, normalized into an MNI template, and spatially smoothed with a Gaussian kernel filter (FWHM, 8 mm).

Our univariate analysis was based on the generalized linear model (GLM) approach. Our GLM included seven regressors coding for onsets and durations of events in each session: action selection and moving scenery in the action trials, delay period, confidence evaluation, feedback for confidence evaluation, predicted scene choice, and scene choice feedback in the prediction trials. The durations of choice-related events (action selection, confidence evaluation, and predicted scene choice) were defined as the time between the option presentation and the subject’s response. The durations of moving and choice feedback were fixed to 1.5 s. For the delay period (regressor-of-interest), although the time length varied trial-by-trial, we modeled it as a boxcar function for 4 s (the minimum duration of the delay period). These regressors were convolved with a hemodynamic response function (HRF). Additionally, motion correction parameters produced during realignment were included as nuisance variables for the GLM. The first-level GLM analysis was performed using the contrast vector whose element was 1 for the regressor-of-interests, and 0 otherwise. We then performed a group random effect analysis using anatomically localized cerebral cortex to find cortical voxels that were significantly and commonly activated during prediction across all subjects. We established statistical thresholds at the voxel level of p < 0.001 (uncorrected) and at the cluster level of p < 0.05 (FWE-corrected). We extracted regions of interest (ROIs) from the identified voxels, and the BOLD signal patterns in each ROI were used for the decoding analysis.

To complement the univariate ROI analysis, we also performed whole-brain searchlight analyses (see “Searchlight analysis for the scene prediction and confidence”). For scene prediction, we confirmed that there were no clusters other than the four ROIs extracted by the univariate analysis that showed significantly higher decoding accuracy than chance (voxel level, p < 0.001; cluster level, FWE-corrected p < 0.05). For confidence level, we found that the regions where the confidence can be decoded were widely distributed over the cerebrum, which may be attributed to the task design in which subjects were instructed to predict the upcoming scene during the delay period.

Decoding analysis

Voxel activity patterns during the delay period were used to decode both scene prediction and confidence. All fMRI data were spatially realigned, normalized, and smoothed with a Gaussian kernel (8 mm FWHM), and preprocessed with linear trend removal and z-score normalization for each voxel in every run over the time series but not convolved with HRF.

In the time-series decoding analysis, the decoder at each time t in the time course (t-th decoding period) used as its input the voxel-wise BOLD signal intensities averaged over four volumes corresponding to t s to t + 3 s (i.e., (t + 1)-th to (t + 4)-th scan volumes) after the onset of the delay period (Supplementary Fig. 4a). We limited the time-series decoding analysis up to the 8th period in order to cover 4–6 s after the delay onset, which exhibited the peak brain activity evoked by the delay cue, and to ensure that the scene prediction decoder was as unaffected as possible by information provided in the scene choice period (average 7.5 s after the delay onset), during which the subjects’ predicted (chosen) scene was displayed.

For the scene prediction decoder, there were seven possible scenes. Each scene was labeled with a 3-bit binary number in which each bit corresponded to the status (open: 1; closed: 0) of the left, forward, and right door, respectively. We used the scenes chosen by the subjects for the target labels of the scene decoder regardless of whether they were correct or not. Label 3 (011 in binary code) was excluded from the analysis because of its rare occurrence (Fig. 1c); thus, we used six labels for the decoding analysis. The label for the confidence decoder was either low (confidence level 1 or 2) or high (confidence level 3 or 4). One subject was excluded from the confidence decoding analysis because he/she reported high confidence in only three trials (1.3%).

We used a sparse logistic regression (SLR)⁷⁶ as a supervised learning algorithm because it incorporates Bayesian automatic selection of relevant features (voxels), which prevents overfitting problems in high-dimensional neuroimaging data. This method has been used for MVPA in the previous studies^2,12,24. Scene prediction decoders included six scene labels, and we used six one-versus-the-rest classifiers with SLR as the decoder, where a classifier for scene k outputs the probability that the input brain activity pattern x represents scene k, P(scene = k; x), and scene k’ with the maximum probability among six classifiers, k’=argmax P(scene = k; x), is defined as the integrated decoder output.

To deal with unbalanced training data sets (Fig. 1c), we used an undersampling method to assign an equal number of samples to each label. Although the trial numbers were actually unbalanced between the compared conditions when training the classifiers in the conditional decoding analyses, there were no significant differences in the number of samples: between correct (51.3 ± 18.1 trials) and incorrect trials (47.7 ± 17.9 trials, one-sided Wilcoxon rank-sum test, p = 0.29), between high-confidence (45.9 ± 17.9 trials) and low-confidence trials (61.0 ± 32.6 trials, p = 6.7 × 10⁻²), between high-state-confidence (55.9 ± 25.8 trials) and low-state-confidence trials (53.4 ± 15.8 trials, p = 0.51).

To assess decoder accuracies, we used LOSO CV in which each decoder was trained using a training data set from three out of four sessions, and the remaining session was used as test data for validation. When evaluating the decoders with the trials divided into two categories according to confidence or correctness, we used LOGO CV. In each fold of the LOSO and LOGO validations, we repeated the following procedure 100 times to account for fluctuations in accuracy due to selected samples in the undersampling phase: random under-sampling from the training data set, training the decoder, and evaluating the decoder’s accuracy. The decoding analysis method was implemented using Brain Decoder Toolbox⁷⁷.

Searchlight analysis for the scene prediction and confidence

To complement the univariate ROI analysis, we conducted whole-brain searchlight analyses with 10 mm radius spheres centered around a given voxel for the session-wise unsmoothed beta estimates. In the scene prediction searchlight analysis, the seven different scenes predicted by the subjects were modeled in the GLM as seven individual regressors during the delay period. Other than the delay period, we used the same regressors as in the original GLM. For the confidence level, we used the GLM with two regressors according to the subject’s confidence level (high or low) in the delay period. To create a subject-level whole-brain accuracy map, we used a linear support vector machine⁷⁸ and the accuracy of each voxel was evaluated using leave-one-session-out (LOSO) cross-validation (CV). The individual accuracy maps were normalized and smoothed using a Gaussian kernel (8 mm FWHM)^11,79 and then applied to the group random effect analysis using anatomically localized cerebral cortex.

Permutation test for the scene prediction and confidence decoding analyses

We also performed a two-step permutation test to test the null hypothesis that the decoding accuracy of scene prediction and confidence was not different from the chance level, and confirmed that the decoding accuracies in our analyses do not exceed the theoretical chance level merely by chance⁸⁰. Within each ROI for each decoding period, first (i) we performed intra-subject permutation, i.e., repeated each decoding analysis N_sbj times in each of which labels to be the decoding targets were randomly permuted within sessions for each subject. We used N_sbj = 150. Then (ii) we drew for each subject one result randomly from the pool of these permutation results including the original decoding accuracy, and averaged among the subjects to calculate the group-level permutation result. (iii) Step (ii) was repeated N_group times to acquire the null distribution for group statistical analysis. We used N_group = 1000. The significance of the difference between the group-level null distribution and the original decoding accuracies was tested using a one-sided Wilcoxon rank-sum test. We did not perform the permutation tests to inspect the significance of the conditional decoding analysis results because they were found to require too long a computation time. Note, however, that our under-sampling technique has made the prior of labels for each of the conditional decoders uniform. We also assessed the significance of the decoding accuracies of scene prediction and confidence through group-level permutation testing and confirmed that our decoding results were not disturbingly high (Supplementary Fig. 4d, e).

Behavioral model based on a HMM

We constructed a behavioral model for each subject based on an HMM with a latent variable denoting confidence level. The model simulates a generative process of a subject’s actions (a*) based on the sequence of observed 3D scenes (o*) and internal cognitive states. A subject’s cognitive state was modeled as the state inference (location and orientation) within the maze (h), the operant state (m), and the confidence level about the state inference (or state confidence, c_h). In action trial t during the maze navigation task, subjects were assumed to probabilistically alter their state confidence (c_h,t), subjectively infer the hidden true state (${h}_{t}^{* }$) as its estimate from the history of observations (h_t), and then select an action (${a}_{t}^{* }$) based on a decision strategy determined by the state confidence. To efficiently reach an unspecified goal (i.e., with as few actions as possible), it is essential to estimate the hidden true state and visit previously unexplored areas of the maze. We hypothesized that subjects switched between two action selection strategies depending on the cognitive state, based on a previous study:^21,22,23 one is a forward-dominant strategy and the other is an efficient-exploration strategy. Throughout the behavioral model, a variable with an asterisk (*) is real (physical), observable, and objective, and a variable without an asterisk is internal (cognitive), unobservable, and subjective. A variable with a hat (^) explicitly denotes a prediction.

We developed the subject behavioral model as follows (Supplementary Fig. 6): at t = 1, all possible states that are consistent with the first scene (${o}_{1}^{* }$) were extracted as candidates of the hidden true state, where H₁ is the set of those states. One of the sets was chosen as an initial state estimate (h₁), for which the posterior probability was expressed as $P\left({h}_{1}\right)=1/\left|{H}_{1}\right|$. Here, $\left|{H}_{1}\right|$ is the number of elements in H₁. At the start of a game, subjects were assumed to have low confidence level about their state inference, so the state confidence was set at 0 with a probability one: $P({c}_{h,1}=0)=1$ (Step 1). At t ≥ 1, the action selection strategy (π_t) was determined using the current state confidence (c_h,t). If the state confidence was low (c_h,t = 0), π_t was set as the forward-dominant strategy; if the state confidence was high (c_h,t = 1), π_t was set as the efficient-exploration strategy (Step 2). According to π_t, an action (${a}_{t}^{* }$) was probabilistically selected based on h_t. The action strategy is described in further detail below (Step 3). A new state estimate was calculated as ${\hat{h}}_{t+1}$, based on h_t and ${a}_{t}^{* }$, accounting for the maze structure. Then, the upcoming scene was predicted as ${\hat{o}}_{t+1}$, based on ${\hat{h}}_{t+1}$ (Step 4). After the subjects moved to the true next state (${h}_{t+1}^{* }$)—based on the previous state (${h}_{t}^{* }$) and action (${a}_{t}^{* }$)—the true next scene (${o}_{t+1}^{* }$) was observed. Note that the real state (${h}_{t+1}^{* }$) may differ from its estimate (${\hat{h}}_{t+1}$) (Step 5). If ${\hat{o}}_{t+1}$ matched ${o}_{t+1}^{* }$, the subjects were assumed to consider their previous inference to be confirmed. This was called an update mode, represented by ${m}_{t+1}=0.$ In this case, ${\hat{h}}_{t+1}$ was subjectively confirmed as the new state inference, ${h}_{t+1}={\hat{h}}_{t+1}$. Concurrently, the state confidence c_h,t was updated to ${c}_{h,t+1}$ stochastically with the transition probability (P_UD). The transition of the state confidence is described in further detail below (Step 5a). If ${\hat{o}}_{t+1}$ did not match ${o}_{t+1}^{* }$, the subjects were assumed to dispose their previous inference ${\hat{h}}_{t+1}.$ This was called a backtrack mode, represented by ${m}_{t+1}=1$. In this case, a new set of states (${H}_{t+1}$) was constructed to account for the current observation (${o}_{t+1}^{* }$) and the history of past observations. A new state estimate was randomly chosen from ${H}_{t+1}$ as ${h}_{t+1}$, for which the posterior probability was expressed as $P\left({h}_{t+1}\right)=1/\left|{H}_{t+1}\right|$. The new state confidence (${c}_{h,t+1}$) was stochastically determined, depending on c_h,t and according to the transition probability (P_BT) (Step 5b). The procedure then backs to step 2 with t ← t+1.

Subjects were considered to have low state confidence (${c}_{h,t}=0$) when they were uncertain about their state in the maze (Supplementary Fig. 6, Step 3). In this case, the subjects were assumed to take “info-max” behaviors to efficiently identify the state; in other words, they moved forward or chose an action at random if a forward move was not possible. This forward-dominant strategy was defined as follows: if the door in front of the subject was open, the forward movement was considered the optimal action. If the door in front of the subject was closed and both the left and right doors were open, the right move was considered the optimal action (based on retrospective reports from all subjects). If there was only one open door, the action in the passable direction was considered optimal.

We assumed that the subjects’ action selection was probabilistic and that they chose an optimal action with probability α as follows:

$$P\left({a}_{t}|{h}_{t},{c}_{h,t}=0\right)=\left\{\begin{array}{cc}\alpha \hfill& {{{{{{\rm{if}}}}}}}\;{a}_{t}\;{{{{{\rm{is}}}}}}\; {{{{{\rm{optimal}}}}}}\\ (1-\alpha )/{N}_{{{{{{{\rm{nopt}}}}}}}} & {{{{{\rm{otherwise}}}}}}\hfill\end{array}\right.$$

(1)

N_nopt denotes the number of allowable (i.e., passable doors) but non-optimal actions.

On the other hand, when the state confidence was high (${c}_{h,t}=1$), subjects were considered to be certain about their state in the maze (Supplementary Fig. 6, Step 3). In this case, subjects preferentially moved to grid spaces that they had not yet explored. This efficient-exploration strategy was defined as follows: if there was only one open door, the optimal action was in the single passable direction. If there were two or three open doors, the subjects preferentially chose the door leading to an unexplored grid space: if there were one or more accessible adjacent grid spaces that the subjects had yet to visit, all actions leading to the unexplored grids were considered optimal. If all of the accessible grid spaces had been visited, the optimal action was considered to be choosing the shortest path to the nearest unexplored grid space.

The action selection probability in the efficient-exploration strategy is defined as follows:

$$P\left({a}_{t}|{h}_{t},{c}_{h,t}=1\right)=\left\{\begin{array}{cc}\beta /{N}_{{{{{{{\rm{opt}}}}}}}} \hfill& {{{{{{\rm{if}}}}}}}\;{a}_{t}\;{{{{{\rm{is}}}}}}\; {{{{{\rm{optimal}}}}}}\\ (1-\beta )/{N}_{{{{{{{\rm{nopt}}}}}}}} & {{{{{\rm{otherwise}}}}}}\hfill\end{array}\right.$$

(2)

where β is the probability of optimal action selection, and N_opt and N_nopt denote the numbers of optimal actions and allowable but non-optimal actions, respectively.

We assumed that the subjects stochastically changed their state confidence levels (high or low) based on whether or not their scene prediction matched the observed scene. If the observation (${o}_{t+1}^{* }$) agreed with the scene prediction (${\hat{o}}_{t+1}$), the subjects were assumed to become more confident about their state estimate, whereas if ${o}_{t+1}^{* }$ disagreed with ${\hat{o}}_{t+1}$, they were assumed to become less confident.

More concretely, when the subjects had low confidence levels about their state estimate (c_h,t = 0) but there was no discrepancy between the predicted scene and the observed scene (update mode; Supplementary Fig. 6, Step 5a), their confidence level was switched to high with a probability of p_L→H. If the state confidence was already high (${c}_{h,t}=1$), it stayed high. In contrast, when the subjects were confident about their state estimate (${c}_{h,t}=1$) but the predicted scene differed from the observed scene (backtrack mode; Supplementary Fig. 6, Step 5b), the confidence level was switched to low with a probability of p_H→L. If the state confidence was already low (${c}_{h,t}=0$), it stayed low.

In summary, the dynamics of the confidence level were defined by a Markov process depending on the operant state:

In the update mode (${m}_{t+1}=0$),

$$\left(P({c}_{h,t+1}=0) \quad P({c}_{h,t+1}=1)\right)=\left(P({c}_{h,t}=0) \quad P({c}_{h,t}=1)\right){P}_{{{{{{{\rm{UD}}}}}}}} \\ {{{{{\rm{where}}}}}} \quad {P}_{{{{{{{\rm{UD}}}}}}}}=\left(\begin{array}{cc}1-{p}_{L\to H} & {p}_{L\to H}\\ 0 & 1\end{array}\right)$$

(3)

In the backtrack mode (${m}_{t+1}=1$),

$$\left(P({c}_{h,t+1}=0) \quad P({c}_{h,t+1}=1)\right)=\left(P({c}_{h,t}=0) \quad P({c}_{h,t}=1)\right){P}_{{{{{{{\rm{BT}}}}}}}} \\ {{{{{\rm{where}}}}}} \quad {P}_{{{{{{{\rm{BT}}}}}}}}=\left(\begin{array}{cc}1 & 0\\ {p}_{H\to L} & 1-{p}_{H\to L}\end{array}\right)$$

(4)

The two parameters in this Markov process (${p}_{L\to H}$ and ${p}_{H\to L}$) were determined using type-II maximum likelihood estimation (MLE).

According to the Bayesian filtering method, the sequence of the subject’s cognitive states was estimated based on the sequence of actions (${a}_{1:T-1}^{* }$) and the observed scenes (${o}_{1:T}^{* }$). The posterior probability of the cognitive state at time t + 1 was obtained from the previous one at time t using the following incremental Bayesian equation:

$$ P\left({c}_{h,1:t+1},{h}_{1:t+1},{m}_{1:t+1}|{a}_{1:t}^{* },{o}_{1:t+1}^{* }\right)\\ \quad=\frac{P\left({c}_{h,t+1},{h}_{t+1},{m}_{t+1}|{{{c}_{h,1:t},h}_{1:t},{m}_{1:t},a}_{1:t}^{* },{o}_{1:t+1}^{* }\right)P({a}_{t}^{* }{{{{{\rm{|}}}}}}{c}_{h,1:t},{h}_{1:t},{m}_{1:t},{a}_{1:t-1}^{* })P\left({{c}_{h,1:t},h}_{1:t},{m}_{1:t}|{a}_{1:t-1}^{* },{o}_{1:t}^{* }\right)}{P({a}_{t}^{* }{{{{{\rm{|}}}}}}{a}_{1:t-1}^{* })}$$

(5)

where we used simplified time-series representations like ${a}_{1:t}^{* }=\left\{{a}_{1}^{* },{a}_{2}^{* },\ldots ,{a}_{t}^{* }\right\}$. By repeating this calculation from first time step, 0, to the terminal time step, T, we obtained the posterior probability of the sequence of cognitive states. One likelihood term (the second term in the numerator of Eq. (5)) corresponded to steps 2 and 3 above. Another likelihood term (the first term in the numerator of Eq. (5)) was calculated as follows, based on the subject’s behavioral model and corresponding to steps 4 and 5:

$$ P\left({c}_{h,t+1},{h}_{t+1},{m}_{t+1}|{c}_{h,1:t},{{h}_{1:t},{m}_{1:t},a}_{1:t,}^{* }{o}_{1:t+1}^{* }\right)\\ \quad=P({c}_{h,t+1}{{{{{\rm{|}}}}}}{c}_{h,t},{m}_{t+1})P\left({h}_{t+1}|{h}_{1:t},{m}_{1:t+1},{a}_{1:t}^{* },{o}_{1:t+1}^{* }\right)P({m}_{t+1}{{{{{\rm{|}}}}}}{h}_{t},{a}_{t}^{* },{o}_{t+1}^{* })$$

(6)

When the operant state was update mode (${m}_{t+1}=0$), the second term in Eq. (6) was equal to the previous term:

$$P\left({h}_{t+1}|{{h}_{1:t},m}_{t+1}=0,{{m}_{1:t},a}_{1:t}^{* },{o}_{1:t+1}^{* }\right)=P({h}_{t}{{{{{\rm{|}}}}}}{h}_{1:t-1},{m}_{1:t},{a}_{1:t-1}^{* },{o}_{1:t}^{* })$$

(7)

because ${h}_{t+1}$ was determined by ${h}_{t}$ and ${a}_{t}^{* }$, without any ambiguity, given ${m}_{t+1}=0.$ When the operant state was backtrack mode (${m}_{t+1}=1$), a new state inference (${h}_{t+1}$) was selected from the re-estimated set of states (${H}_{t+1}$) with equal probability (see step 5(b)). ${H}_{t+1}$ was constructed so that each element was consistent with the history of past n-step observed scenes (${o}_{t-n+1:t}^{* }$) and actions (${a}_{t-n:t}^{* }$), and n was subject-wisely estimated by MLE.

There were some exceptional cases that applied to step 5. When the subjects moved to a grid space that had been visited before, but their predicted scene matched the observed scene, $\hat{o}={o}^{* }$, the operant state was set as backtrack mode (action-backtrack mode) because they were considered to have performed inefficient or erroneous exploration. If all the passable doors led to grids that had been explored in such an action-backtrack mode, it resulted in another exceptional case that was regarded as update mode. Note that these exceptional cases were addressed for logical consistency but rarely occurred.

When validating our HMM-based behavioral model, we used the agreement between the model’s predicted action (${\hat{a}}_{t}$) and the actual action (${a}_{t}^{* }$) taken by the subjects. The action was predicted by

$${\hat{a}}_{t} =\mathop{{{{{{\rm{argmax}}}}}}}\limits_{a}P\left(a|{a}_{1:t-1}^{* },{o}_{1:t}^{* }\right)= \mathop{{{{{{\rm{argmax}}}}}}}\limits_{a}\mathop{\sum }\limits_{{c}_{h,t}}\mathop{\sum }\limits_{{h}_{t}}\mathop{\sum }\limits_{{m}_{1:t}}P(a{{|}}{c}_{h,t},{h}_{t},{m}_{1:t},{a}_{1:t-1}^{* })\\ P\left({{c}_{h,t},h}_{t},{m}_{1:t}|{a}_{1:t-1}^{* },{o}_{1:t}^{* }\right)$$

(8)

which can be calculated as a by-product of Eq. (5). When Eq. (8) provided multiple equally probable actions, we regarded the set of those actions as predicted actions.

When performing model-based analysis, we also used the state confidence, objectively estimated as

$${\hat{c}}_{h,t}=\left\{\begin{array}{cc}0 & {{{{{\rm{if}}}}}}\;\mathop{{{{{{\rm{max }}}}}}}\limits_{{h}_{t}}\;P\left({c}_{h,t}=1,{h}_{t}\right)\le \mathop{\sum }\limits_{{h}_{t}}P\left({c}_{h,t}=0,{h}_{t}\right)\;{{{{{\rm{for}}}}}}\;{h}_{t}\in H\\ 1 & {{{{{\rm{otherwise}}}}}}\hfill\end{array}\right.$$

(9)

where $P\left({c}_{h,t},{h}_{t}\right)$ can be obtained by marginalizing Eq. (6) at time step t with respect to m_t, where H is the subset of H_t consisting of h_t for which the $P\left({h}_{t}\right)$ is maximal in ${h}_{t}\in {H}_{t}$. The number of elements in H was sometimes greater than one.

We estimated the model parameters for each subject by minimizing the negative log evidence (Supplementary Table 3):

$${{{{{\rm{Negative}}}}}}\;{{{{{\rm{log }}}}}}\;{{{{{\rm{evidence}}}}}}=-{{{{{{\rm{log }}}}}}}\mathop{\prod }_{g=1}^{G}p({{{{{{\boldsymbol{A}}}}}}}_{g}^{* }{{{{{\rm{|}}}}}}{{{{{\boldsymbol{\theta }}}}}})$$

(10)

$$p\left({{{{{{\boldsymbol{A}}}}}}}_{g}^{* }|{{{{{\boldsymbol{\theta }}}}}}\right)=p({a}_{1}^{* }{{{{{\rm{|}}}}}}{{{{{\boldsymbol{\theta }}}}}})\mathop{\prod }_{t=1}^{T-1}p({a}_{t+1}^{* }{{{{{\rm{|}}}}}}{a}_{t}^{* },{{{{{\boldsymbol{\theta }}}}}})$$

(11)

Here, G is the number of games, ${{{{{{\boldsymbol{A}}}}}}}_{g}^{* }$ is the sequence of actions ${a}_{1:T-1}^{* }$, where T − 1 is the number of action trials in the game g and T is the number of observations. The set of model parameters is denoted by θ. Note that Eq. (11) is the product of the denominator of Eq. (5) and was obtained by repeating the incremental Bayesian estimation (Eq. (5)). The minimized negative log evidence was also used for the Bayesian model selection (see Supplementary Table 3).

Statistics and reproducibility

Imaging data were analyzed using SPM12 (Wellcome Department of Cognitive Neurology, London, UK). The decoding analysis method was implemented using Brain Decoder Toolbox⁷⁷, and the searchlight analysis method was implemented using the Decoding Toolbox⁷⁸. Statistical analyses were performed using MATLAB R2017a (Mathworks, Natick, Massachusetts, US). We analyzed the effects of the prediction correctness and the confidence level on the scene choice reaction time with the R package “ARTool”^81,82. For the behavioral analyses, we used the data of 27 subjects whose scene prediction accuracy in the experimental task was significantly higher than chance (one-sided z-test, p < 0.01, see also Supplementary Fig. 2). The imaging and decoding analyses included 26 subjects because one subject was excluded due to his/her larger head motion. The decoding accuracies were validated using LOSO or LOGO CV procedures and compared to the theoretical chance level using Wilcoxon signed-rank test. The permutation tests were also applied for the scene prediction and confidence decoding analyses (Supplementary Fig. 4d, e). In the conditional decoding analyses, we applied Wilcoxon rank-sum test to evaluate the difference in the scene prediction decodability between the subsets of data (correct vs incorrect, high vs low scene-prediction-confidence, or high vs low state confidence). Note that, as we simply compared two groups of data, no multiple comparison correction was necessary. Values are expressed as the mean ± SD.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The source data underlying the main figures are provided as Supplementary Data 1. All data supporting the main findings are also available via the open-source repository Zenodo⁸³.

Code availability

Codes for the computational model of subjects’ exploration behavior are available via the open-source repository Zenodo⁸³.

References

Elliott Wimmer, G. & Büchel, C. Learning of distant state predictions by the orbitofrontal cortex in humans. Nat. Commun. 10, 2554 (2019).
Shikauchi, Y. & Ishii, S. Decoding the view expectation during learned maze navigation from human fronto-parietal network. Sci. Rep. 5, 1–13 (2015).
Google Scholar
Blom, T., Feuerriegel, D., Johnson, P., Bode, S. & Hogendoorn, H. Predictions drive neural representations of visual events ahead of incoming sensory information. Proc. Natl Acad. Sci. USA 117, 7510–7515 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hindy, N. C., Ng, F. Y. & Turk-Browne, N. B. Linking pattern completion in the hippocampus to predictive coding in visual cortex. Nat. Neurosci. 19, 665–667 (2016).
Article CAS PubMed PubMed Central Google Scholar
Luft, C. D. B., Meeson, A., Welchman, A. E. & Kourtzi, Z. Decoding the future from past experience: Learning shapes predictions in early visual cortex. J. Neurophysiol. 113, 3159–3171 (2015).
Article PubMed PubMed Central Google Scholar
Yoshida, W. & Ishii, S. Resolution of uncertainty in prefrontal cortex. Neuron 50, 781–789 (2006).
Article CAS PubMed Google Scholar
Dunning, D., Griffin, D. W., Milojkovic, J. D. & Ross, L. The overconfidence effect in social prediction. J. Pers. Soc. Psychol. 58, 568–581 (1990).
Article CAS PubMed Google Scholar
Fleming, S. M., Huijgen, J. & Dolan, R. J. Prefrontal contributions to metacognition in perceptual decision making. J. Neurosci. 32, 6117–6125 (2012).
Article CAS PubMed PubMed Central Google Scholar
De Martino, B., Fleming, S. M., Garrett, N. & Dolan, R. J. Confidence in value-based choice. Nat. Neurosci. 16, 105–110 (2013).
Article PubMed CAS Google Scholar
De Martino, B., Bobadilla-Suarez, S., Nouguchi, T., Sharot, T. & Love, B. C. Social information is integrated into value and confidence judgments according to its reliability. J. Neurosci. 37, 6066–6074 (2017).
Article PubMed PubMed Central Google Scholar
Morales, J., Lau, H. & Fleming, S. M. Domain-general and domain-specific patterns of activity supporting metacognition in human prefrontal cortex. J. Neurosci. 38, 3534–3546 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cortese, A., Amano, K., Koizumi, A., Kawato, M. & Lau, H. Multivoxel neurofeedback selectively modulates confidence without changing perceptual performance. Nat. Commun. 7, 13669 (2016).
Cortese, A., Amano, K., Koizumi, A., Lau, H. & Kawato, M. Decoded fMRI neurofeedback can induce bidirectional confidence changes within single participants. Neuroimage 149, 323–337 (2017).
Article PubMed Google Scholar
Löffler, A., Haggard, P. & Bode, S. Decoding changes of mind in voluntary action - dynamics of intentional choice representations. Cereb. Cortex 30, 1199–1212 (2020).
Article PubMed Google Scholar
Funamizu, A., Kuhn, B. & Doya, K. Neural substrate of dynamic Bayesian inference in the cerebral cortex. Nat. Neurosci. 19, 1682–1689 (2016).
Article CAS PubMed Google Scholar
Rodriguez, P. F. Neural decoding of goal locations in spatial navigation in humans with fMRI. Hum. Brain Mapp. 31, 391–397 (2010).
PubMed Google Scholar
Sherrill, K. R. et al. Functional connections between optic flow areas and navigationally responsive brain regions during goal-directed navigation. Neuroimage 118, 386–396 (2015).
Article PubMed Google Scholar
Spiers, H. J. & Maguire, E. A. A navigational guidance system in the human brain. Hippocampus 17, 618–626 (2007).
Article PubMed PubMed Central Google Scholar
Viard, A., Doeller, C. F., Hartley, T., Bird, C. M. & Burgess, N. Anterior hippocampus and goal-directed spatial decision making. J. Neurosci. 31, 4613–4621 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kaelbling, L. P., Littman, M. L. & Cassandra, A. R. Planning and acting in partially observable stochastic domains. Artif. Intell. 101, 99–134 (1998).
Article Google Scholar
Adomi, M., Shikauchi, Y. & Ishii, S. Hidden markov model for human decision process in a partially observable environment. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 6353 LNCS 94–103 (ICANN, 2010).
Purcell, B. A. & Kiani, R. Hierarchical decision processes that operate over distinct timescales underlie choice and changes in strategy. Proc. Natl Acad. Sci. USA 113, E4531–E4540 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sarafyazd, M. & Jazayeri, M. Hierarchical reasoning by neural circuits in the frontal cortex. Science 364, eaav8911 (2019).
Horikawa, T. & Kamitani, Y. Generic decoding of seen and imagined objects using hierarchical visual features. Nat. Commun. 8, 15037 (2017).
Nishimoto, S. et al. Reconstructing visual experiences from brain activity evoked by natural movies. Curr. Biol. 21, 1641–1646 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kim, J. et al. Decoding accuracy in supplementary motor cortex correlates with perceptual sensitivity to tactile roughness. PLoS ONE 10, e0129777 (2015).
Brodersen, K. H. et al. Decoding the perception of pain from fMRI using multivariate pattern analysis. Neuroimage 63, 1162–1170 (2012).
Article PubMed Google Scholar
Albers, A. M., Kok, P., Toni, I., Dijkerman, H. C. & De Lange, F. P. Shared representations for working memory and mental imagery in early visual cortex. Curr. Biol. 23, 1427–1431 (2013).
Article CAS PubMed Google Scholar
Chadwick, M. J., Hassabis, D., Weiskopf, N. & Maguire, E. A. Decoding individual episodic memory traces in the human hippocampus. Curr. Biol. 20, 544–547 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chadwick, M. J., Hassabis, D. & Maguire, E. A. Decoding overlapping memories in the medial temporal lobes using high-resolution fMRI. Learn. Mem. 18, 742–746 (2011).
Article PubMed PubMed Central Google Scholar
Harrison, S. A. & Tong, F. Decoding reveals the contents of visual working memory in early visual areas. Nature 458, 632–635 (2009).
Article CAS PubMed PubMed Central Google Scholar
Naselaris, T., Olman, C. A., Stansbury, D. E., Ugurbil, K. & Gallant, J. L. A voxel-wise encoding model for early visual areas decodes mental images of remembered scenes. Neuroimage 105, 215–228 (2015).
Article PubMed Google Scholar
Xing, Y., Ledgeway, T., McGraw, P. V. & Schluppeck, D. Decoding working memory of stimulus contrast in early visual cortex. J. Neurosci. 33, 10301–10311 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cichy, R., Heinzle, J. & Haynes, J. Imagery and perception share cortical representations of content and location. Cereb. Cortex 22, 372–380 (2012).
Article PubMed Google Scholar
Johnson, M. R. & Johnson, M. K. Decoding individual natural scene representations during perception and imagery. Front. Hum. Neurosci. 8, 59 (2014).
Ragni, F., Tucciarelli, R., Andersson, P. & Lingnau, A. Decoding stimulus identity in occipital, parietal and inferotemporal cortices during visual mental imagery. Cortex 127, 371–387 (2020).
Article PubMed Google Scholar
Stokes, M., Thompson, R., Cusack, R. & Duncan, J. Top-down activation of shape-specific population codes in visual cortex during mental imagery. J. Neurosci. 29, 1565–1572 (2009).
Article CAS PubMed PubMed Central Google Scholar
Horikawa, T., Tamaki, M., Miyawaki, Y. & Kamitani, Y. Neural decoding of visual imagery during sleep. Science 340, 639–642 (2013).
Article CAS PubMed Google Scholar
Baird, B., Smallwood, J., Gorgolewski, K. J. & Margulies, D. S. Medial and lateral networks in anterior prefrontal cortex support metacognitive ability for memory and perception. J. Neurosci. 33, 16657–16665 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fleming, S. M., Weil, R. S., Nagy, Z., Dolan, R. J. & Rees, G. Relating introspective accuracy to individual differences in brain structure. Science 329, 1541–1543 (2010).
Article CAS PubMed PubMed Central Google Scholar
Allen, M. et al. Metacognitive ability correlates with hippocampal and prefrontal microstructure. Neuroimage 149, 415–423 (2017).
Article PubMed Google Scholar
Desender, K., Boldt, A., Verguts, T. & Donner, T. H. Confidence predicts speed-accuracy tradeoff for subsequent decisions. Elife 8, e43499 (2019).
Kiani, R., Corthell, L. & Shadlen, M. N. Choice certainty is informed by both evidence and decision time. Neuron 84, 1329–1342 (2014).
Article CAS PubMed PubMed Central Google Scholar
Peters, M. A. K. et al. Perceptual confidence neglects decision-incongruent evidence in the brain. Nat. Hum. Behav. 1, 0139 (2017).
Hasegawa, M. et al. Selective suppression of local circuits during movement preparation in the mouse motor cortex. Cell Rep. 18, 2676–2686 (2017).
Article CAS PubMed Google Scholar
Churchland, M. M., Yu, B. M., Ryu, S. I., Santhanam, G. & Shenoy, K. V. Neural variability in premotor cortex provides a signature of motor preparation. J. Neurosci. 26, 3697–3712 (2006).
Article CAS PubMed PubMed Central Google Scholar
Klein-Flügge, M. C., Nobbs, D., Pitcher, J. B. & Bestmann, S. Variability of human corticospinal excitability tracks the state of action preparation. J. Neurosci. 33, 5564–5572 (2013).
Article PubMed PubMed Central CAS Google Scholar
Jeye, B. M., MacEvoy, S. P., Karanian, J. M. & Slotnick, S. D. Distinct regions of the hippocampus are associated with memory for different spatial locations. Brain Res. 1687, 41–49 (2018).
Article CAS PubMed Google Scholar
Hirose, S., Nambu, I. & Naito, E. Cortical activation associated with motor preparation can be used to predict the freely chosen effector of an upcoming movement and reflects response time: An fMRI decoding study. Neuroimage 183, 584–596 (2018).
Article PubMed Google Scholar
Gu, J., Liu, B., Li, X., Wang, P. & Wang, B. Cross-modal representations in early visual and auditory cortices revealed by multi-voxel pattern analysis. Brain Imaging Behav. 14, 1908–1920 (2020).
Article PubMed Google Scholar
Grootswagers, T., Cichy, R. M. & Carlson, T. A. Finding decodable information that can be read out in behaviour. Neuroimage 179, 252–262 (2018).
Article PubMed Google Scholar
Ritchie, J. B. & de Beeck, H. O. Using neural distance to predict reaction time for categorizing the animacy, shape, and abstract properties of objects. Sci. Rep. 9, 13201 (2019).
Chen, Y. et al. Allocentric versus egocentric representation of remembered reach targets in human cortex. J. Neurosci. 34, 12515–12526 (2014).
Article CAS PubMed PubMed Central Google Scholar
Committeri, G. et al. Reference frames for spatial cognition: different brain areas are involved in viewer-, object-, and landmark-centered judgments about object location. J. Cogn. Neurosci. 16, 1517–1535 (2004).
Article PubMed Google Scholar
Lester, B. D. & Dassonville, P. The role of the right superior parietal lobule in processing visual context for the establishment of the egocentric reference frame. J. Cogn. Neurosci. 26, 2201–2209 (2014).
Article PubMed Google Scholar
Neggers, S. F. W., Van der Lubbe, R. H. J., Ramsey, N. F. & Postma, A. Interactions between ego- and allocentric neuronal representations of space. Neuroimage 31, 320–331 (2006).
Article CAS PubMed Google Scholar
Wolbers, T., Hegarty, M., Büchel, C. & Loomis, J. M. Spatial updating: How the brain keeps track of changing object locations during observer motion. Nat. Neurosci. 11, 1223–1230 (2008).
Article CAS PubMed Google Scholar
Zaehle, T. et al. The neural basis of the egocentric and allocentric spatial frame of reference. Brain Res. 1137, 92–103 (2007).
Article CAS PubMed Google Scholar
Andersen, R. A., Martyn Bracewell, R., Barash, S., Gnadt, J. W. & Fogassi, L. Eye position effects on visual, memory, and saccade-related activity in areas LIP and 7a of macaque. J. Neurosci. 10, 1176–1196 (1990).
Article CAS PubMed PubMed Central Google Scholar
Andersen, R. A., Essick, G. K. & Siegel, R. M. Encoding of spatial location by posterior parietal neurons. Science 230, 456–458 (1985).
Article CAS PubMed Google Scholar
Duhamel, J. R., Bremmer, F., BenHamed, S. & Graf, W. Spatial invariance of visual receptive fields in parietal cortex neurons. Nature 389, 845–848 (1997).
Article CAS PubMed Google Scholar
Chen, X., DeAngelis, G. C. & Angelaki, D. E. Flexible egocentric and allocentric representations of heading signals in parietal cortex. Proc. Natl Acad. Sci. USA 115, E3305–E3312 (2018).
Rogers, J. L. & Kesner, R. P. Lesions of the dorsal hippocampus or parietal cortex differentially affect spatial information processing. Behav. Neurosci. 120, 852–860 (2006).
Article PubMed Google Scholar
Parslow, D. M. et al. Allocentric spatial memory activation of the hippocampal formation measured with fMRI. Neuropsychology 18, 450–461 (2004).
Article PubMed Google Scholar
Gomez, A., Cerles, M., Rousset, S., Rémy, C. & Baciu, M. Differential hippocampal and retrosplenial involvement in egocentric-updating, rotation, and allocentric processing during online spatial encoding: an fMRI study. Front. Hum. Neurosci. 8, 150 (2014).
Ruotolo, F. et al. Neural correlates of egocentric and allocentric frames of reference combined with metric and non-metric spatial relations. Neuroscience 409, 235–252 (2019).
Article CAS PubMed Google Scholar
Fink, G. R., Dolan, R. J., Halligan, P. W., Marshall, J. C. & Frith, C. D. Space-based and object-based visual attention: shared and specific neural domains. Brain 120, 2013–2028 (1997).
Article PubMed Google Scholar
Maguire, E. A. et al. Knowing where and getting there: a human navigation network. Science 280, 921–924 (1998).
Article CAS PubMed Google Scholar
Kurata, K., Tsuji, T., Naraki, S., Seino, M. & Abe, Y. Activation of the dorsal premotor cortex and pre-supplementary motor area of humans during an auditory conditional motor task. J. Neurophysiol. 84, 1667–1672 (2000).
Article CAS PubMed Google Scholar
Pearce, T. M. & Moran, D. W. Strategy-dependent encoding of planned arm movements in the dorsal premotor cortex. Science 337, 984–988 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dekleva, B. M., Kording, K. P. & Miller, L. E. Single reach plans in dorsal premotor cortex during a two-target task. Nat. Commun. 9, 3556 (2018).
Simon, D. A. & Daw, N. D. Neural correlates of forward planning in a spatial decision task in humans. J. Neurosci. 31, 5526–5539 (2011).
Article CAS PubMed PubMed Central Google Scholar
Peirce, J. et al. PsychoPy2: experiments in behavior made easy. Behav. Res. Methods 51, 195–203 (2019).
Article PubMed PubMed Central Google Scholar
Fleming, S. M., Massoni, S., Gajdos, T. & Vergnaud, J.-C. Metacognition about the past and future: quantifying common and distinct influences on prospective and retrospective judgments of self-performance. Neurosci. Conscious. 2016, niw018 (2016).
Article PubMed PubMed Central Google Scholar
Siedlecka, M., Paulewicz, B. & Wierzchoń, M. But i was so sure! Metacognitive judgments are less accurate given prospectively than retrospectively. Front. Psychol. 7, 218 (2016).
Article PubMed PubMed Central Google Scholar
Yamashita, O., Sato, M. A., Yoshioka, T., Tong, F. & Kamitani, Y. Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns. Neuroimage 42, 1414–1429 (2008).
Article PubMed Google Scholar
Kamitani, Y. & Tong, F. Decoding the visual and subjective contents of the human brain. Nat. Neurosci. 8, 679–685 (2005).
Article CAS PubMed PubMed Central Google Scholar
Hebart, M. N., Görgen, K. & Haynes, J. D. The decoding toolbox (TDT): A versatile software package for multivariate analyses of functional imaging data. Front. Neuroinform. 8, 88 (2015).
Article PubMed PubMed Central Google Scholar
Momennejad, I. & Haynes, J. D. Encoding of prospective tasks in the human prefrontal cortex under varying task loads. J. Neurosci. 33, 17342–17349 (2013).
Article CAS PubMed PubMed Central Google Scholar
Combrisson, E. & Jerbi, K. Exceeding chance level by chance: The caveat of theoretical chance levels in brain signal classification and statistical assessment of decoding accuracy. J. Neurosci. Methods 250, 126–136 (2015).
Article PubMed Google Scholar
Kay, M., Elkin, L. A., Higgins, J. J. & Wobbrock, J. O. ARTool: aligned rank transform for nonparametric factorial ANOVAs. https://doi.org/10.5281/zenodo.594511 (2021).
Wobbrock, J. O., Findlater, L., Gergle, D. & Higgins, J. J. The aligned rank transform for nonparametric factorial analyses using only ANOVA procedures. In Proc. ACM Conference on Human Factors in Computing Systems (CHI ’11) 143–146 (ACM Press, 2011). https://doi.org/10.1145/1978942.1978963.
Katayama, R., Yoshida, W. & Ishii, S. Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans. zenodo. https://doi.org/10.5281/ZENODO.6364553 (2022).

Download references

Acknowledgements

This study was supported by KAKENHI, No. 17H06310, and JP17H06314 from MEXT, Japan and 19H04180 from JSPS, Japan. The authors thank B. Seymour for invaluable comments to improve this study. The authors would also like to thank Editage (www.editage.com) for English language editing.

Author information

These authors contributed equally: Wako Yoshida, Shin Ishii.

Authors and Affiliations

Graduate School of Informatics, Kyoto University, Kyoto, Kyoto, 606-8501, Japan
Risa Katayama & Shin Ishii
Nuffield Department of Clinical Neuroscience, University of Oxford, Oxford, OX3 9DU, UK
Wako Yoshida
Department of Neural Computation for Decision-making, Advanced Telecommunications Research Institute International, Soraku-gun, Kyoto, 619-0288, Japan
Wako Yoshida
Neural Information Analysis Laboratories, Advanced Telecommunications Research Institute International, Soraku-gun, Kyoto, 619-0288, Japan
Shin Ishii
International Research Center for Neurointelligence, The University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
Shin Ishii

Authors

Risa Katayama
View author publications
You can also search for this author in PubMed Google Scholar
Wako Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Shin Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.Y. and S.I. conceived the project; R.K., W.Y., and S.I. designed the research; R.K. and W.Y. performed the research; R.K. analyzed the data; R.K. wrote the draft; and R.K., W.Y., and S.I. prepared the final manuscript.

Corresponding author

Correspondence to Risa Katayama.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary handling editor: George Inglis. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Supplementary Data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Katayama, R., Yoshida, W. & Ishii, S. Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans. Commun Biol 5, 367 (2022). https://doi.org/10.1038/s42003-022-03314-y

Download citation

Received: 14 July 2021
Accepted: 23 March 2022
Published: 19 April 2022
DOI: https://doi.org/10.1038/s42003-022-03314-y
Springer Nature Limited

Confidence modulates the decodability of scene prediction during partially-observable maze exploration in humans

Abstract

Similar content being viewed by others

Belief inference for hierarchical hidden states in spatial navigation

Learning of distant state predictions by the orbitofrontal cortex in humans

Decoding the view expectation during learned maze navigation from human fronto-parietal network

Introduction

Results

Behavioral results

Neural correlates of scene prediction

Decoding analysis of scene prediction and confidence

Computational model of maze navigation behavior

Discussion

Methods

Subjects

Maze exploratory navigation task

Training task

Image acquisition and analysis

Decoding analysis

Searchlight analysis for the scene prediction and confidence

Permutation test for the scene prediction and confidence decoding analyses

Behavioral model based on a HMM

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Peer Review File

Supplementary Information

Supplementary Data 1

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation