Inferring occluded projectile motion changes connectivity within a visuo-fronto-parietal network

Zbären, Gabrielle Aude; Kapur, Manu; Meissner, Sarah Nadine; Wenderoth, Nicole

doi:10.1007/s00429-024-02815-2

Inferring occluded projectile motion changes connectivity within a visuo-fronto-parietal network

Original Article
Open access
Published: 25 June 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Brain Structure and Function Aims and scope Submit manuscript

Inferring occluded projectile motion changes connectivity within a visuo-fronto-parietal network

Download PDF

194 Accesses
1 Altmetric
Explore all metrics

Abstract

Anticipating the behaviour of moving objects in the physical environment is essential for a wide range of daily actions. This ability is thought to rely on mental simulations and has been shown to involve frontoparietal and early visual areas. Yet, the connectivity patterns between these regions during intuitive physical inference remain largely unknown. In this study, participants underwent fMRI while performing a task requiring them to infer the parabolic trajectory of an occluded ball falling under Newtonian physics, and a control task. Building on our previous research showing that when solving the physical inference task, early visual areas encode task-specific and perception-like information about the inferred trajectory, the present study aimed to (i) identify regions that are functionally coupled with early visual areas during the physical inference task, and (ii) investigate changes in effective connectivity within this network of regions. We found that early visual areas are functionally connected to a set of parietal and premotor regions when inferring occluded trajectories. Using dynamic causal modelling, we show that predicting occluded trajectories is associated with changes in effective connectivity within a parieto-premotor network, which may drive internally generated early visual activity in a top-down fashion. These findings offer new insights into the interaction between early visual and frontoparietal regions during physical inference, contributing to our understanding of the neural mechanisms underlying the ability to predict physical outcomes.

Decreasing predictability of visual motion enhances feed-forward processing in visual cortex when stimuli are behaviorally relevant

Article Open access 22 June 2016

Predictive coding of action intentions in dorsal and ventral visual stream is based on visual anticipations, memory-based information and motor preparation

Article 31 October 2019

Top-down cortical interactions in visuospatial attention

Article 20 March 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The ability to anticipate the behaviour of objects in the physical environment is crucial for many of our daily actions, such as crossing a busy street or catching a ball. Converging computational and behavioural evidence suggests that this ability relies on an ‘intuitive physics engine’ that predicts future events by running mental simulations (Bates et al. 2015; Battaglia et al. 2013; Gerstenberg et al. 2017, 2021; Hamrick et al. 2016). Intuitive physical inference has consistently been linked to the activation of a frontoparietal network comprising the supramarginal gyrus (SMG), superior parietal lobule (SPL), and dorsal premotor cortex/supplementary motor area (PMd/SMA) (Fischer et al. 2016). These regions have been shown to contain invariant representations of physical properties, providing evidence for a generalised neural intuitive physics engine (Pramod et al. 2022; Schwettmann et al. 2019). Another line of neuroimaging research into intuitive physics provided evidence that early visual areas are also involved in the process. These regions have been shown to contain representations of inferred physical scenarios that resemble those evoked by perception, suggesting that intuitive physical inference may be accompanied by visual simulation of the inferred scenario (Ahuja et al. 2022; Zbären et al. 2023). Interestingly, during explicitly instructed visual imagery, internally-generated visual cortex activity is modulated by frontoparietal areas similar to those involved in physical inference (Dentico et al. 2014; Dijkstra et al. 2017; Ishai et al. 2000; Mechelli 2004), suggesting a potential role of these regions in generating perception-like images when inferring occluded physical scenes. While there is a growing body of evidence supporting the involvement of specific brain areas in intuitive physics, the underlying connectivity patterns between these regions remain largely unknown.

In a recent study, we investigated the neural representations associated with predicting the parabolic trajectory of an occluded ball falling under Newtonian physics (Zbären et al. 2023). We designed an intuitive physical inference task requiring participants to infer the trajectory of an occluded ball falling parabolically. We first established that participants could learn to accurately predict the ball’s trajectory despite the absence of visual input, suggesting that they have successfully built and relied on a mental model of the physical scene to infer the outcomes. We then showed that solving this task activates early visual regions together with a frontoparietal network, and that early visual regions represent task-specific and perception-like information about the inferred trajectory. These results suggest that the outcomes of physical inferences may be represented in form of the perceivable sensory consequences in early visual areas, despite the absence of visual stimulation. Building upon these findings, the aim of the present study was to examine connectivity patterns among brain regions involved in physical inference and whose activity is linked to early visual processing when predicting the trajectory of objects falling parabolically and under occlusion.

First, we aimed to identify regions that may drive or be influenced by early visual activity during physical inference, by conducting a psychophysiological interaction (PPI) analysis. Our findings revealed that all regions consistently involved in intuitive physical inference exhibited increased functional coupling with early visual areas when predicting the parabolic trajectory of occluded objects. Given the absence of task-relevant visual inputs during our physical inference task, the measured neural activity within this network reflects internally generated representations of the physical scene and thus, the information flow between sensory and higher-order areas is unclear. To examine this, we investigated directed connectivity changes associated with physical inference of occluded trajectories in the network of regions revealed by the PPI analysis, using dynamic causal modelling (DCM; Friston et al. 2003). More specifically, we compared various anatomically plausible dynamic causal models to test whether physical inference primarily modulates visual-to-parietal connections, parietal-to-visual connections, or both, and whether it primarily modulates parietal-to-premotor connections, premotor-to-parietal connections, or both.

Materials and methods

The data used in this manuscript have previously been published in Zbären et al. (2023), which primarily focused on univariate and multivariate pattern analysis of the BOLD signal. Here, we have re-analysed the same data with an emphasis on functional connectivity analysis and network modelling using DCM. The experimental paradigm and pre-processing pipeline are identical to Zbären et al. (2023). We restate the relevant details here for the readers’ convenience.

Participants

Twenty healthy volunteers participated in the study and four were excluded from the analyses (for detailed exclusion criteria, see Zbären et al. 2023). The final sample consisted of sixteen participants (10 females; 6 males; mean age: 28.31 ± 9.26) with normal or corrected-to-normal vision. The study was approved by the Ethics Committee of the Swiss Federal Institute of Technology (EK 2020-N-31; Zurich, Switzerland) and conducted in accordance with the declaration of Helsinki. All participants provided written informed consent before participation and received monetary compensation upon completion.

fMRI task

To investigate intuitive physical inference, we designed a task that required participants to predict the fall time and landing location of an occluded ball falling parabolically. Participants were exposed to a dynamic 3D physics environment generated using the Unity3D physics engine (version 2019.2.3; http://unity3d.com). The study consisted of a behavioural training session, followed by an fMRI session happening no more than 7 days later. All participants were naïve to the purpose of the experiment throughout both sessions.

The behavioural session included instruction and training on the physical inference task, with participants receiving written feedback on their performance after each trial. During the fMRI session, participants performed a physical inference task that was nearly identical to the one they had been trained on, but without receiving any feedback on their performance. Throughout the fMRI session, the physical inference task was alternated with a visually matched control task. A cross was displayed at the centre of the screen during both sessions, and participants were instructed to fixate it while performing both the physical inference and control tasks.

In each trial of the physical inference task, participants were presented with an object moving horizontally either from right to left or from left to right, whose height and velocity varied across trials (Fig. 1). The object carried a ball that was dropped suddenly, at which point the screen was occluded such that neither the object nor the falling ball could be seen. The scene followed Newtonian physics, with the ball entering projectile motion as soon as it started to fall. Subsequently, participants were required to estimate: (i) when the ball would reach the ground (i.e., ‘fall time estimation’), indicated by a button press, and (ii) where the ball would land, indicated by moving a basket on the bottom of the screen to the estimated location. During the fMRI session and in contrast to the behavioural session, participants were not prompted to indicate their location estimation in every trial but only in one catch trial for every six trials. The trials of the control task featured the same visual stimuli but instead of pressing a button to indicate fall time estimation, participants had to press a button as soon as the colour of the fixation cross changed. The timings of the colour changes were randomly drawn from a distribution ranging from the minimum to the maximum true fall times ± 500 ms. Every trial was followed by a 3 s rest period.

The fMRI experiment consisted of 6 runs, each containing the same 18 physical inference and 18 control trials but differently pseudo-randomised. The 18 trials were generated by combining 3 heights and 3 velocities (i.e., [44, 61, 78 m] x [1.3, 1.7, 2.1 m/s]), resulting in trials featuring varying true fall times and locations. Within each run, trials were presented in 3 blocks of 6 physical inference trials, and 3 blocks of 6 control trials. Each block started with a word cue indicating the task to be performed: ‘ball’ (i.e., physical inference task) or ‘cross’ (i.e., control task). The blocks were alternated within each run, with half of the runs starting with the physical inference and the other half with the control condition. One run lasted 10.42 minutes.

Behavioural and self-rating data

The behavioural data were processed in Matlab (version 9.9; The Mathworks Inc, Natick, MA). To quantify performance, we calculated fall time errors by subtracting the estimated fall times from the true fall times. For each participant, the absolute time error of each trial was computed and then averaged across all trials of the physical inference condition. Additionally, we assessed self-rated vividness. In a post-fMRI debriefing questionnaire, participants were asked whether they ‘imagined the falling ball (i.e., saw it in their mind’s eye) during the experiment’ and if so, to rate the vividness of the image on a visual-analogue scale. The scale ranged from 0, corresponding to ‘No image at all, I only “know” I am thinking of the object’ to 10, corresponding to ‘Perfectly realistic, as vivid as real seeing’.

fMRI data acquisition and pre-processing

MRI data were acquired on a 3 tesla Philips Ingenia system using a 32-channel head coil. Anatomical images were acquired using a T1-weighted sequence (160 sagittal slices, voxel size = 1 mm3, TR = 8.3, TE = 3.9 ms, flip angle = 8°, matrix size = 240 × 240, FOV = 240 mm (AP) x 240 mm (RL) x 160 mm (FH)). Functional images were acquired using a whole-brain echo-planar imaging (EPI) sequence (40 interleaved transversal slices, TR = 2500, voxel size = 2.75 × 2.75 × 3.3 mm, TE = 35 ms, flip angle = 82°, matrix size = 80 × 78, FOV = 220 mm (AP) x 220 mm (RL) x 132 mm (FH), 250 volumes per run).

fMRI data were pre-processed using FSL version 6.0 (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki). After discarding the first 4 volumes to account for T1 saturation effects, the following pre-processing steps were applied to each run: motion correction using the Motion Correction Linear Image Registration Tool (Jenkinson et al. 2002), brain extraction using the automated Brain Extraction Tool (BET; Smith 2002), spatial smoothing using a Gaussian kernel of 5 mm full-width-at-half-maximum (FWHM), and high-pass filtering using a 100s cut-off as implemented in FSL’s Expert Analysis Tool (FEAT). Each run was additionally inspected for excessive motion and excluded from further analyses if the absolute mean displacement was greater than ~ half the voxel size (i.e., 1.4 mm); two runs (from two different participants) were excluded. Normalisation was performed by aligning functional images to structural ones using boundary-based registration (Greve and Fischl 2009), aligning structural images to the 2 mm Montreal Neurological Institute (MNI-152) standard space using nonlinear registration (FNIRT), and applying the resulting warp fields to the functional images.

fMRI data analysis

Psychophysiological interaction analysis

The PPI analysis was performed using FSL version 6.0. To define the seed region, we used a standard contrast analysis with a general linear model (GLM) based on a double gamma hemodynamic response function (HRF) and its first temporal derivative. The design matrix contained the following two regressors of interest: a ‘physical inference’ regressor modelling the physical inference task, i.e., the period between the start of the occlusion and the button press (indicating the estimated landing time of the ball) minus 500 ms to account for motor preparation, and a ‘control’ regressor modelling the control task, i.e., the period between the start of the occlusion and the colour change. Additionally, there were five regressors of no interest, modelling the periods of the (i) instructions (ball or cross), (ii) horizontally moving object, (iii) button presses (including 500 ms of motor preparation in the physical inference condition, and the time between the colour change and button press in the control condition), (iv) the occlusion period of missed trials in which there were no button presses, and (v) location estimation in the catch trials. Six motion parameters (i.e., rotations and translations along the x, y, and z-axes), as well as white matter (WM) and cerebrospinal fluid (CSF) time-series, were added as nuisance regressors in the GLM. To further reduce motion artifacts, volumes with an absolute mean displacement greater than half the voxel size were scrubbed.

The seed region was created by intersecting an anatomical mask covering V1, V2, and V3 from the Jülich Histological Atlas (Eickhoff et al. 2007), with the group random-effects activation map revealed by the physical inference > control contrast, thresholded at Z > 3.1 and FWE-corrected using a cluster significance level of p_FWE < 0.05. The ROI was then transformed to each participant’s native functional space, and its time-course extracted. The first-level design matrix of the PPI analysis comprised the following regressors: (i) the contrast between the occluded phase of the ‘physical inference’ versus ‘control’ condition, convolved with a double gamma HRF (i.e., task regressor), (ii) the time-course of the seed-region (i.e., physiological regressor), (iii) the product of the zero-centred task and de-meaned physiological regressors (i.e., interaction term), (iv) an ‘occlusion’ regressor combining the occluded periods of the ‘physical inference’ and ‘control’ conditions, and (v) the same regressors of no interest and nuisance regressors as described above. The interaction term allows the identification of regions exhibiting task-related covariance with the seed region. Accordingly, an ‘interaction term > rest’ contrast was defined for each participant, and the resulting image entered into a mixed effects higher-level analysis. The group z-statistic images were thresholded at Z > 3.1 and corrected for family-wise-error (FWE) using a cluster significance level of p_FWE < 0.05.

To test whether functional connectivity strength is associated with the behavioural and self-rating data, two stepwise multiple linear regression analyses (p < .05) were performed: one with the time estimation performance and one with the self-rated vividness used as a dependent variable (see Sect. 2.3). In both regression analyses, the predictors consisted of the mean parameter estimate of each significant cluster revealed by the PPI analysis.

Dynamic causal modelling

To investigate causal interactions between the brain regions identified through the PPI analysis, we used dynamic causal modelling (DCM, Friston et al. 2003) implemented in Statistical Parametric Mapping (SPM12, http://www.fil.ion.ucl.ac.uk/spm/). DCM for fMRI is a neurophysiologically plausible modelling scheme that estimates task-related changes in effective connectivity from measured BOLD signals, within a network of preselected brain regions. In DCM, neural activity changes are characterised by the following state-space equation (Eq. 1):

$$\dot{z}=\left(A+\sum _{j=1}^{m}{u}_{j}{B}^{j}\right)z+Cu$$

The state vector $\dot{z}$ represents changes in neural activity over time as a function of the current level of neural activity $z$, the experimental stimuli $u$, and the connectivity parameters $A$, $B$, and $C$. The matrix $A$ specifies the intrinsic or endogenous effective connectivity between and within regions, while the matrix $B$ specifies the changes in effective connectivity due to task-related modulatory inputs ${u}_{j}$. The matrix $C$ represents the direct effects of driving inputs $u$ on a given region. The values of extrinsic connections have units in Hertz (Hz) and represent synaptic rate constants (i.e., connection strengths) while intrinsic (within-region) connections are log-scaling parameters.

General linear models

To perform the DCM analysis, the fMRI data pre-processed in FSL was first transformed from native to MNI space, after which two separate first-level GLMs were implemented in SPM: one for time-series extraction and the other for specifying the DCM inputs. The reason for using two separate GLMs was to avoid a rank deficient design matrix that would have resulted from combining the three necessary regressors (i.e., ‘physical inference’, ‘control’, and a combination of both) into a single GLM. Both GLMs included the same five regressors of no interest modelling the periods of the instructions, moving objects, button presses, missed trials, and location estimations (see Sect. 2.5.1). In addition, both GLMs contained the same nuisance regressors consisting of six motion parameters (i.e., rotations and translations along the x, y, and z-axes) and the scrubbing regressors. The GLM used for time-series extraction (GLM1) contained a ‘physical inference’ and a ‘control’ regressor of interest, modelling the occlusion period of the corresponding condition. The GLM used for DCM specification (GLM2) contained the following two regressors of interest: an ‘occlusion’ regressor combining the occlusion periods of the ‘physical inference’ and ‘control’ conditions, and the same ‘physical inference’ regressor as in GLM1. The ‘occlusion’ regressor was used as a driving input, and the ‘physical inference’ regressor as modulatory inputs. As such, the inputs of matrices A and B were not redundant to each other (see Eq. 1). All task regressors were convolved with a canonical hemodynamic response function (HRF) and its first temporal derivative.

Selection of regions of interest and time-series extraction

ROIs were selected based on the results of our PPI analysis and previous research characterising the regions systematically involved in intuitive physics (Fischer et al. 2016; Pramod et al. 2022; Schwettmann et al. 2019; Zbären et al. 2023). The following regions were included in our DCMs: right early visual areas (visual), right supramarginal gyrus (SMG), right superior parietal lobule (SPL), right dorsal premotor cortex (PMd) overlapping with frontal eye fields (FEF), and right supplementary motor area (SMA). We restricted our ROIs to the right hemisphere to limit model complexity and due to stronger activations of the right hemisphere in the PPI analysis. For each ROI, we first defined a fixed outer sphere with a radius of 16 mm, centred on the MNI coordinates of the group-level right hemisphere peak activations in the PPI analysis (i.e., visual [9–88 -14], SMG [56 − 36 52], SPL [20–70 50], PMd [30 0 52], and SMA [6 − 2 64]). To account for individual differences in functional anatomy, we defined a mobile inner sphere with a radius of 6 mm, centred on subject-specific peak activations from a ‘physical inference > control’ contrast from GLM1.

The peak activations were located within both the outer sphere and a mask of the right hemisphere obtained from the Harvard-Oxford subcortical structural atlas (Desikan et al. 2006). Time-series were extracted from the voxels within the inner sphere that exceeded an uncorrected threshold of p < .05. In three participants, one of the five ROIs did not contain any surviving voxels so we lowered their threshold until a peak voxel could be identified (i.e., to p < .1 for two participants and p < .2 for the third participant), as recommended in Zeidman et al. (2019). We did not exclude these participants as someone with a weak or absent response in one brain region may still provide valuable information about the other regions in the network. Also note that the use of a threshold for time-series extraction is only to remove the noisiest voxels.

First-level DCM specification and inversion

In the A matrix, we specified reciprocal intrinsic connections between the following pairs of regions: visual and SMG, visual and SPL, SMG and SPL, SMG and PMd, SMG and SMA, SPL and PMd, SPL and SMA, and PMd and SMA (Fig. 2A), in accordance with the anatomical literature (Bakola et al. 2013; Boussaoud et al. 2005; Felleman and Van Essen 1991; Luppino et al. 1993). We did not specify any connection between premotor and early visual regions, as there is no compelling evidence of direct anatomical connections (Felleman and Van Essen 1991). The driving input (i.e., the ‘occlusion’ regressor from GLM2) was specified as entering the network via visual regions as the onset of this phase was marked by a change in the screen colour. The inputs were mean-centred, such that the parameter estimates in matrix B represent changes in effective connectivity relative to the average connectivity across conditions (i.e., physical inference and control). In the B matrix, we specified a ‘full’ DCM in which all the between-region connections present in the A matrix could be modulated by physical inference. The DCM for each subject was then inverted, thereby providing estimates of the connectivity parameters that best explain the data.

Second-level analysis using parametric empirical Bayes

The subject-specific connectivity parameter estimates were then taken to the group level, where we used parametric empirical Bayes (PEB; Friston et al. 2015) together with Bayesian Model Reduction (BMR; Friston et al. 2016) to test hypotheses on the group-level connectivity parameters. We collated the previously estimated fully connected model of each subject to estimate a second-level PEB model on the $B$ matrix parameters.

To investigate which connections are most likely to be modulated by physical inference, we defined a set of hypotheses expressed as pre-defined reduced PEB models in which certain connections have been switched off. We specified models in which the following sets of connections could be modulated by physical inference: from visual to parietal (i.e., SMG and SPL) regions, from parietal to visual regions, or both, and from parietal (i.e., SMG and SPL) to premotor (i.e., PMd and SMA) regions, from premotor to parietal regions, or both (Fig. 2B). Each type of modulation between visual and parietal regions (i.e., visual-to-parietal, parietal-to-visual, bidirectional) could be combined with each type of modulation between parietal and premotor regions (i.e., parietal-to-premotor, premotor-to-parietal, bidirectional), resulting in a total of nine models. We allowed reciprocal connections between the premotor (PMd and SMA) and between the parietal (SMG and SPL) regions to always be modulated by physical inference, as there was no compelling reason to assume they would not be. This was done to limit the number of models and because these connections were not of particular interest for the current analysis. Together with the full model, our model space consisted of nine models.

We then tested which of our pre-defined models best explains the commonalities across subjects, by comparing the log-evidence of the full PEB model against reduced ones. Since none of the models could be categorised as a winning one (i.e., probability > 95%, see S1 of the Supplementary Material), we averaged the parameters across models using Bayesian model averaging (BMA; Hoeting et al. 1999). BMA yields weighted averages of parameter estimates, where each parameter estimate is weighted by the posterior probability of the associated model, thereby characterizing the direction and size of task-related changes in connectivity strength (i.e., expressed in the matrix $B$). To determine the statistical significance of the parameter estimates, we set a threshold based on free energy and retained the parameters with a posterior probability of being present versus absent ≥ 0.95. Additionally, to test whether and which modulations of effective connectivity were associated with time estimation performance and/or self-rated vividness (see Sect. 2.3), we performed two stepwise multiple linear regression analyses with the DCM parameters informed by the group used as predictors.

Results

Behavioural data

In the physical inference task, the mean absolute fall time errors were of 0.5441 s ± 0.1466, and the self-reported vividness scores had a mean of 5.525 ± 2.2549.

Psychophysiological interaction analysis

We performed a PPI analysis to identify which regions exhibit condition-dependent increases in functional connectivity with early visual areas. Our results revealed increased functional connectivity during physical inference, as compared with the control condition, between early visual areas and a network of frontoparietal regions. This network comprises bilateral dorsal premotor cortex (PMd) with the right hemispheric activation overlapping with frontal eye field (FEF), bilateral supplementary motor area (SMA), bilateral superior parietal lobule (SPL), and right supramarginal gyrus (SMG) with activations extending into the intraparietal sulcus (IPS) (see Fig. 3A and Table S1 of the Supplementary material).

To assess whether functional connectivity changes are relevant for behavioural performance and/or the self-rated vividness of the imagined physical scene, we performed two stepwise multiple linear regressions with the mean parameter estimates of each cluster used as predictors. We found that functional coupling between early visual areas and the SMG significantly predicted self-rated vividness (β = 0.528, p = .035) (Fig. 3B), whereas there were no significant linear associations between functional connectivity and time estimation performance.

Dynamic causal modelling

Bayesian model selection (BMS) revealed that the model that best explains the data, with a probability of 89%, is the one that allows physical inference to modulate top-down parietal-visual and bidirectional parietal-premotor connections, followed by the full model with a probability of 11% (Figure S1). However, since no model’s probability reached 95%, a clear winning model could not be determined. Consequently, Bayesian model averaging (BMA) was used to estimate model parameters.

The results of our PEB/BMA analysis showed that physical inference was associated with strong decreases in effective connectivity from SMG to SPL and PMd while the strongest increases of effective connectivity originated from SPL and projected to SMG, SMA, and early visual areas (see Fig. 4B and Table S2 of the Supplementary Material). Our results further show bidirectional increases of connectivity between premotor areas (i.e., SMA and PMd). Interestingly, the results did not reveal any significant modulation of connectivity between SMG and early visual areas.

We found an association between increased effective connectivity from PMd to SMA and heightened self-rated vividness (β = 0.537, p < .05) (Fig. 4C). There were no significant linear associations between effective connectivity changes and time estimation performance.

Discussion

The aim of the present study was to examine connectivity patterns among brain regions involved in physical inference when predicting the trajectory of objects falling parabolically and under occlusion. Participants underwent fMRI while performing a task in which they had to infer the trajectory of an occluded ball falling from various heights and with various horizontal velocities, and a visually matched control task. Building on our previous research demonstrating that when solving this task, early visual areas contain representations of the occluded trajectory similar to those activated by observing a ball fall (Zbären et al. 2023), the present study aimed to (i) identify regions that interact with early visual areas during the physical inference task, and (ii) investigate changes in effective connectivity within this network of regions. We found that during physical inference, early visual areas are functionally connected to a set of parietal and premotor regions. Our DCM results further show that physical inference is associated with bidirectional changes in effective connectivity within a parieto-premotor network and increased top-down coupling from the SPL to early visual areas.

Early visual areas are functionally connected to frontoparietal regions when predicting the trajectory of occluded objects

Our psychophysiological interaction (PPI) analysis revealed that during physical inference, the functional coupling between early visual areas and several frontoparietal regions increases in a condition-dependent manner. Specifically, the supramarginal gyrus (SMG) and intraparietal sulcus (IPS), superior parietal lobule (SPL), dorsal premotor cortex (PMd) including frontal eye field (FEF), and supplementary motor area (SMA) show increased connectivity with early visual areas, compared to a visually matched control task. Note that these results are unlikely to be driven by involuntary eye movements, as demonstrated by our prior control experiment in which participants performed the same task while their eye movements were monitored using eye tracking, revealing no difference in eye positions and saccades between the physical inference and control conditions (for further details, see Zbären et al. 2023). All these frontoparietal regions have been shown to be consistently involved in intuitive physical inference (Fischer et al. 2016; Pramod et al. 2022; Schwettmann et al. 2019), and to contain trajectory-specific information about the occluded ball when solving this task (Zbären et al. 2023). This network of regions overlaps with areas typically involved in visuospatial attention (Corbetta and Shulman 2002; Nobre 2001) and visual imagery (Winlove et al. 2018). Our PPI analysis suggests that this frontoparietal network may contribute to evoking activity in early visual areas even in the absence of external visual input, consistent with previous research demonstrating the dorsal frontoparietal network’s role in the top-down allocation of attention not only towards external sensory stimuli but also towards internal representations (Cona and Scarpazza 2019). During visual imagery, dorsal premotor and parietal areas have been shown to modulate early visual activity (Dentico et al. 2014; Dijkstra et al. 2017; Ishai et al. 2000; Mechelli 2004). Additionally, we found that the strength of the functional coupling between the SMG/IPS cluster and early visual areas is a significant predictor of the subjective vividness of the imagined scene. This finding aligns with previous research showing a positive relation between the subjective vividness of a mental image and the strength of the connection from the intraparietal sulcus to early visual area (Dijkstra et al. 2017).

Intuitive physical inference modulates effective connectivity within a visuo-fronto-parietal network

Given that functional connectivity cannot offer insights into the direction of information flow between connected areas, we further investigated the network of regions revealed by the PPI analysis in terms of effective connectivity. We built anatomically plausible dynamic causal models (DCM) of these regions and how their interactions may be modulated by physical inference. Bayesian model selection did not reveal a clear winning model as none of the models exhibited a probability greater than 95%. Nevertheless, the DCM with the highest model evidence tentatively suggests that physical inference modulates visual-parietal connections in a top-down fashion and parietal-premotor connections bidirectionally.

Inference on model parameters using Bayesian model averaging revealed that physical inference is associated with bidirectional increases in connectivity between SMA and SPL, and between SMA and PMd. The SMA has been closely linked to imagery, particularly motor imagery (Hétu et al. 2013), but also other modalities such as visual imagery (Palmiero et al. 2009). Interestingly, we found the increase in connectivity from the PMd to the SMA to be predictive of the self-rated vividness of the inferred physical scene. However, it is worth noting that the size of our sample may constrain the generalisability of such brain-behaviour relationships. Additionally, the SMA is involved in temporal processing (Hinton et al. 2004; Macar et al. 1999; Nani et al. 2019). Research has shown that when estimating time-to-contact, humans rely not only on kinematic cues derived from visual inputs (e.g., velocity), but also on temporal information, particularly during occlusion (Battaglini and Ghiani 2021; Chang and Jazayeri 2018). Considering this, it is plausible that the SMA might have been implicated in tracking time during the occluded fall in our physical inference task, thereby contributing to predicting the ball’s landing time.

Our results further suggest that solving the physical inference task is associated with pronounced connectivity changes of the SPL, with an increase in effective connectivity from SPL to SMG/IPS and visual areas. It is likely that our SPL ROI, which was centred around MNI_xyz= 20, -70, 50, contains the human homologue of non-human primates’ area V6Ad, which has been shown to respond to coherent visual motion stimuli and be involved in spatial trajectory planning, particularly in the context of pointing movements (Sulpizio et al. 2023). While this area has been implicated in various tasks requiring the processing of visual information (including continuously moving stimuli) and trajectory planning (Sulpizio et al. 2020, 2023), our analysis suggests a top-down influence of this area on early visual processing during physical inference. This top-down influence may facilitate the activation of visual representations of the physical scenario, resulting in the formation of perception-like images in the absence of bottom-up visual inputs. Interestingly, V6Ad has been shown to be specifically activated by imagery tasks (Sulpizio et al. 2020). Additionally, the SPL increases connectivity bidirectionally with the SMA and unidirectionally to PMd, indicating that key areas of the dorso-medial parieto-premotor processing stream become increasingly connected. In contrast, the SMG/IPS, which is typically considered to be part of the dorso-lateral processing stream, decreases its connectivity with both SPL and PMd. In summary, our findings put forward the hypothesis that performing the physical inference task is associated with stronger bidirectional effective connectivity within a dorso-medial parieto-premotor processing stream, which seems to drive activity in early visual areas and SMG/IPS in a top-down fashion.

Previous work has found that visual imagery is associated with increased top-down coupling from the intraparietal sulcus to early visual regions (Dijkstra et al. 2017), however, in our task, SPL seems to mediate this potential interaction via increased connectivity to both SMG/IPS and early visual areas. Note that the dorso-medial parieto-premotor pathway is considered to constitute the ‘vision-for-action’ pathway, which has traditionally been identified using sensorimotor tasks that involve trajectory planning, for example during reaching (Greulich et al. 2020). Our task did not require participants to directly translate visuo-spatial information into the spatial control of movement, indicating that even though these pathways are essential for motor actions, they seem to underpin more general aspects of behaviour that consider information about the physical world.

Our results did not reveal changes in effective connectivity between early visual areas and the SMG/IPS. This may be explained by the SMG/IPS cluster containing the anterior portion of the IPS (AIP), which does not receive direct input from early visual areas, and recent work showing that also effective connectivity from AIP to early visual areas is limited, as estimated based on resting-state fMRI (Rolls et al. 2023a, b). We did not find any significant associations between changes in functional or effective connectivity and time-estimation performance, suggesting that subject-specific temporal errors may not primarily emerge from suboptimal brain connectivity between two specific areas. This is not entirely unexpected since time estimation performance is likely to reflect the integration of neural processing within a distributed network across the whole duration of the physical inference task.

Conclusion

Our study shows that when solving a physical inference task requiring participants to infer projectile motion under occlusion, early visual areas are functionally connected to a set of parietal and premotor regions. Our dynamic causal modelling results suggest that predicting occluded trajectories is associated with changes in bidirectional effective connectivity within a dorso-medial parieto-premotor network, which may drive activity in early visual areas in a top-down fashion. These findings offer new insights into the interaction between early visual and physics-responsive frontoparietal regions during physical inference, shedding new light on the neural mechanisms underlying the ability to make predictions about the physical environment.

Data availability

Data are openly available on the ETH Library Research Collection with the https://doi.org/10.3929/ethz-b-000578094.

References

Ahuja A, Desrochers TM, Sheinberg DL (2022) A role for visual areas in physics simulations. Cognit Neuropsychol. https://doi.org/10.1080/02643294.2022.2034609
Article Google Scholar
Bakola S, Passarelli L, Gamberini M, Fattori P, Galletti C (2013) Cortical Connectivity suggests a role in Limb Coordination for Macaque Area PE of the Superior Parietal Cortex. J Neurosci 33(15):6648–6658. https://doi.org/10.1523/JNEUROSCI.4685-12.2013
Article CAS PubMed PubMed Central Google Scholar
Bates CJ, Yildirim I, Tenenbaum JB, Battaglia PW (2015) Humans predict liquid dynamics using probabilistic simulation. CogSci 2015
Battaglia PW, Hamrick JB, Tenenbaum JB (2013) Simulation as an engine of physical scene understanding. Proc Natl Acad Sci USA 110(45):18327–18332. https://doi.org/10.1073/pnas.1306572110
Article PubMed PubMed Central Google Scholar
Battaglini L, Ghiani A (2021) Motion behind occluder: amodal perception and visual motion extrapolation. Visual Cognition 29(8):475–499. https://doi.org/10.1080/13506285.2021.1943094
Article Google Scholar
Boussaoud D, Tanné-Gariépy J, Wannier T, Rouiller EM (2005) Callosal connections of dorsal versus ventral premotor areas in the macaque monkey: a multiple retrograde tracing study. BMC Neurosci 6(1):67. https://doi.org/10.1186/1471-2202-6-67
Article PubMed PubMed Central Google Scholar
Chang C-J, Jazayeri M (2018) Integration of speed and time for estimating time to contact. Proc Natl Acad Sci 115(12):E2879–E2887. https://doi.org/10.1073/pnas.1713316115
Article CAS PubMed PubMed Central Google Scholar
Cona G, Scarpazza C (2019) Where is the where in the brain? A meta-analysis of neuroimaging studies on spatial cognition. Hum Brain Mapp 40(6):1867–1886. https://doi.org/10.1002/hbm.24496
Article PubMed PubMed Central Google Scholar
Corbetta M, Shulman GL (2002) Control of goal-directed and stimulus-driven attention in the brain. Nat Rev Neurosci 3(3). https://doi.org/10.1038/nrn755
Dentico D, Cheung BL, Chang J-Y, Guokas J, Boly M, Tononi G, Van Veen B (2014) Reversal of cortical information flow during visual imagery as compared to visual perception. NeuroImage 100:237–243. https://doi.org/10.1016/j.neuroimage.2014.05.081
Article PubMed Google Scholar
Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, Buckner RL, Dale AM, Maguire RP, Hyman BT, Albert MS, Killiany RJ (2006) An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage 31(3):968–980. https://doi.org/10.1016/j.neuroimage.2006.01.021
Article PubMed Google Scholar
Dijkstra N, Zeidman P, Ondobaka S, van Gerven MAJ, Friston K (2017) Distinct top-down and bottom-up Brain Connectivity during Visual Perception and Imagery. Sci Rep 7(1):5677. https://doi.org/10.1038/s41598-017-05888-8
Article CAS PubMed PubMed Central Google Scholar
Eickhoff SB, Paus T, Caspers S, Grosbras MH, Evans AC, Zilles K, Amunts K (2007) Assignment of functional activations to probabilistic cytoarchitectonic areas revisited. NeuroImage 36(3):511–521. https://doi.org/10.1016/j.neuroimage.2007.03.060
Article PubMed Google Scholar
Felleman DJ, Van Essen DC (1991) Distributed hierarchical Processing in the Primate Cerebral cortex. Cereb Cortex 1(1):1–47. https://doi.org/10.1093/cercor/1.1.1
Article CAS PubMed Google Scholar
Fischer J, Mikhael JG, Tenenbaum JB, Kanwisher N (2016) Functional neuroanatomy of intuitive physical inference. Proc Natl Acad Sci 113(34):E5072–E5081. https://doi.org/10.1073/pnas.1610344113
Article CAS PubMed PubMed Central Google Scholar
Friston KJ, Harrison L, Penny W (2003) Dynamic causal modelling. NeuroImage 19(4):1273–1302. https://doi.org/10.1016/S1053-8119(03)00202-7
Article CAS PubMed Google Scholar
Friston K, Zeidman P, Litvak V (2015) Empirical Bayes for DCM: A Group Inversion Scheme. Frontiers in Systems Neuroscience, 9. https://www.frontiersin.org/articles/https://doi.org/10.3389/fnsys.2015.00164
Friston KJ, Litvak V, Oswal A, Razi A, Stephan KE, van Wijk BCM, Ziegler G, Zeidman P (2016) Bayesian model reduction and empirical Bayes for group (DCM) studies. NeuroImage 128:413–431. https://doi.org/10.1016/j.neuroimage.2015.11.015
Article PubMed Google Scholar
Gerstenberg T, Zhou L, Smith KA, Tenenbaum JB (2017) Faulty Towers: A hypothetical simulation model of physical support. Proceedings of the 39th Annual Meeting of the Cognitive Science Society
Gerstenberg T, Goodman ND, Lagnado DA, Tenenbaum JB (2021) A counterfactual simulation model of causal judgments for physical events. Psychol Rev. https://doi.org/10.1037/rev0000281
Article PubMed Google Scholar
Greulich RS, Adam R, Everling S, Scherberger H (2020) Shared functional connectivity between the dorso-medial and dorso-ventral streams in macaques. Sci Rep 10(1):18610. https://doi.org/10.1038/s41598-020-75219-x
Article CAS PubMed PubMed Central Google Scholar
Greve DN, Fischl B (2009) Accurate and robust brain image alignment using boundary-based registration. NeuroImage 48(1):63–72. https://doi.org/10.1016/j.neuroimage.2009.06.060
Article PubMed Google Scholar
Hamrick JB, Battaglia PW, Griffiths TL, Tenenbaum JB (2016) Inferring mass in complex scenes by mental simulation. Cognition 157:61–76. https://doi.org/10.1016/j.cognition.2016.08.012
Article PubMed Google Scholar
Hétu S, Grégoire M, Saimpont A, Coll M-P, Eugène F, Michon P-E, Jackson PL (2013) The neural network of motor imagery: an ALE meta-analysis. Neurosci Biobehavioral Reviews 37(5):930–949. https://doi.org/10.1016/j.neubiorev.2013.03.017
Article Google Scholar
Hinton SC, Harrington DL, Binder JR, Durgerian S, Rao SM (2004) Neural systems supporting timing and chronometric counting: an FMRI study. Cogn Brain Res 21(2):183–192. https://doi.org/10.1016/j.cogbrainres.2004.04.009
Article Google Scholar
Ishai A, Ungerleider LG, Haxby JV (2000) Distributed neural systems for the generation of visual images. Neuron 28(3):979–990. https://doi.org/10.1016/S0896-6273(00)00168-9
Article CAS PubMed Google Scholar
Jenkinson M, Bannister P, Brady M, Smith S (2002) Improved optimization for the Robust and Accurate Linear Registration and Motion correction of brain images. NeuroImage 17(2):825–841. https://doi.org/10.1006/nimg.2002.1132
Article PubMed Google Scholar
Luppino G, Matelli M, Camarda R, Rizzolatti G (1993) Corticocortical connections of area F3 (SMA-proper) and area F6 (pre-SMA) in the macaque monkey. J Comp Neurol 338(1):114–140. https://doi.org/10.1002/cne.903380109
Article CAS PubMed Google Scholar
Macar F, Vidal F, Casini L (1999) The supplementary motor area in motor and sensory timing: evidence from slow brain potential changes. Exp Brain Res 125(3):271–280. https://doi.org/10.1007/s002210050683
Article CAS PubMed Google Scholar
Mechelli A (2004) Where bottom-up meets Top-down: neuronal interactions during perception and imagery. Cereb Cortex 14(11):1256–1265. https://doi.org/10.1093/cercor/bhh087
Article PubMed Google Scholar
Nani A, Manuello J, Liloia D, Duca S, Costa T, Cauda F (2019) The neural correlates of Time: a Meta-analysis of Neuroimaging studies. J Cogn Neurosci 31(12):1796–1826. https://doi.org/10.1162/jocn_a_01459
Article PubMed Google Scholar
Nobre AC (2001) The attentive homunculus: now you see it, now you don’t. Neurosci Biobehavioral Reviews 25(6):477–496. https://doi.org/10.1016/S0149-7634(01)00028-8
Article CAS Google Scholar
Palmiero M, Olivetti Belardinelli M, Nardo D, Sestieri C, Di Matteo R, D’Ausilio A, Romani GL (2009) Mental imagery generation in different modalities activates sensory-motor areas. Cogn Process 10(S2):268–271. https://doi.org/10.1007/s10339-009-0324-5
Article Google Scholar
Pramod R, Cohen MA, Tenenbaum JB, Kanwisher N (2022) Invariant representation of physical stability in the human brain. eLife 11:e71736. https://doi.org/10.7554/eLife.71736
Article CAS PubMed PubMed Central Google Scholar
Rolls ET, Deco G, Huang C-C, Feng J (2023a) Multiple cortical visual streams in humans. Cereb Cortex 33(7):3319–3349. https://doi.org/10.1093/cercor/bhac276
Article PubMed Google Scholar
Rolls ET, Deco G, Huang C-C, Feng J (2023b) The human posterior parietal cortex: effective connectome, and its relation to function. Cereb Cortex 33(6):3142–3170. https://doi.org/10.1093/cercor/bhac266
Article PubMed Google Scholar
Schwettmann SE, Tenenbaum JB, Kanwisher N (2019) Invariant representations of mass in the human brain. eLife 8:1–14. https://doi.org/10.7554/eLife.46619
Article Google Scholar
Smith SM (2002) Fast robust automated brain extraction. Hum Brain Mapp 17(3):143–155. https://doi.org/10.1002/hbm.10062
Article PubMed PubMed Central Google Scholar
Sulpizio V, Neri A, Fattori P, Galletti C, Pitzalis S, Galati G (2020) Real and imagined grasping movements differently activate the human Dorsomedial Parietal Cortex. Neuroscience 434:22–34. https://doi.org/10.1016/j.neuroscience.2020.03.019
Article CAS PubMed Google Scholar
Sulpizio V, Fattori P, Pitzalis S, Galletti C (2023) Functional organization of the caudal part of the human superior parietal lobule. Neurosci Biobehavioral Reviews 153:105357. https://doi.org/10.1016/j.neubiorev.2023.105357
Article Google Scholar
Winlove CIP, Milton F, Ranson J, Fulford J, MacKisack M, Macpherson F, Zeman A (2018) The neural correlates of visual imagery: a co-ordinate-based meta-analysis. Cortex 105:4–25. https://doi.org/10.1016/j.cortex.2017.12.014
Article PubMed Google Scholar
Hoeting JA, Madigan D, Raftery AE, Volinsky CT (1999) Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors). Stat Sci , 14(4):382–417. https://doi.org/10.1214/ss/1009212519
Zbären GA, Meissner SN, Kapur M, Wenderoth N (2023) Physical inference of falling objects involves simulation of occluded trajectories in early visual areas. Hum Brain Mapp 44(10):4183–4196. https://doi.org/10.1002/hbm.26338
Article PubMed PubMed Central Google Scholar
Zeidman P, Jafarian A, Corbin N, Seghier ML, Razi A, Price CJ, Friston KJ (2019) A guide to group effective connectivity analysis, part 1: first level analysis with DCM for fMRI. NeuroImage 200:174–190. https://doi.org/10.1016/j.neuroimage.2019.06.031
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank members of the TNU and Marc Bächinger for their help and feedback on the DCM analysis, and all the participants for their time and effort.

Funding

This work was supported by The Future Learning Initiative, ETH Zurich.

Open access funding provided by Swiss Federal Institute of Technology Zurich

Author information

Sarah Nadine Meissner and Nicole Wenderoth contributed equally to this work.

Authors and Affiliations

Neural Control of Movement Lab, Department of Health Science and technology, ETH Zurich, Zurich, Switzerland
Gabrielle Aude Zbären, Sarah Nadine Meissner & Nicole Wenderoth
Professorship for Learning Sciences and Higher Education, ETH Zurich, Zurich, Switzerland
Manu Kapur
Future Health Technologies, Singapore-ETH Centre, Campus for Research Excellence And Technological Enterprise (CREATE), Singapore, Singapore
Nicole Wenderoth

Authors

Gabrielle Aude Zbären
View author publications
You can also search for this author in PubMed Google Scholar
Manu Kapur
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Nadine Meissner
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Wenderoth
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.A.Z., M.K., S.N.M., and N.W conceived and designed research; G.A.Z. performed experiments; G.A.Z analysed data; G.A.Z., S.N.M., and N.W interpreted results; G.A.Z prepared figures; G.A.Z drafted manuscript; G.A.Z., M.K., S.N.M., and N.W edited and revised manuscript. All authors approved final version of manuscript.

Corresponding author

Correspondence to Gabrielle Aude Zbären.

Ethics declarations

Ethics approval

This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Ethics Committee of the Swiss Federal Institute of Technology (EK 2020-N-31; Zurich, Switzerland).

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Competing interests

The authors declare no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zbären, G.A., Kapur, M., Meissner, S.N. et al. Inferring occluded projectile motion changes connectivity within a visuo-fronto-parietal network. Brain Struct Funct (2024). https://doi.org/10.1007/s00429-024-02815-2

Download citation

Received: 16 November 2023
Accepted: 03 June 2024
Published: 25 June 2024
DOI: https://doi.org/10.1007/s00429-024-02815-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Inferring occluded projectile motion changes connectivity within a visuo-fronto-parietal network

Abstract

Similar content being viewed by others

Decreasing predictability of visual motion enhances feed-forward processing in visual cortex when stimuli are behaviorally relevant

Predictive coding of action intentions in dorsal and ventral visual stream is based on visual anticipations, memory-based information and motor preparation

Top-down cortical interactions in visuospatial attention

Introduction

Materials and methods

Participants

fMRI task

Behavioural and self-rating data

fMRI data acquisition and pre-processing

fMRI data analysis

Psychophysiological interaction analysis

Dynamic causal modelling

General linear models

Selection of regions of interest and time-series extraction

First-level DCM specification and inversion

Second-level analysis using parametric empirical Bayes

Results

Behavioural data

Psychophysiological interaction analysis

Dynamic causal modelling

Discussion

Early visual areas are functionally connected to frontoparietal regions when predicting the trajectory of occluded objects

Intuitive physical inference modulates effective connectivity within a visuo-fronto-parietal network

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval

Consent to participate

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation