Bayesian MEG time courses with fMRI priors

Wang, Yingying; Holland, Scott K.

doi:10.1007/s11682-021-00550-4

Original Research
Open access
Published: 25 September 2021

Volume 16, pages 781–791, (2022)

Brain Imaging and Behavior


Abstract

Magnetoencephalography (MEG) records brain activity with excellent temporal and good spatial resolution, while functional magnetic resonance imaging (fMRI) offers good temporal and excellent spatial resolution. The aim of this study is to implement a Bayesian framework that uses fMRI data as spatial priors for MEG inverse solutions. We used simulated MEG data with both evoked and induced activity and experimental MEG data from sixteen participants to examine the effectiveness of using fMRI spatial priors in MEG source reconstruction. For simulated MEG data, incorporating the prior information from fMRI increased the spatial resolution of MEG source reconstruction by 3 mm on average. For experimental MEG data, fMRI spatial information reduced the spurious clusters for evoked activity and yielded a more left-lateralized activation pattern for induced activity. The use of fMRI spatial priors greatly reduced the location error for the induced source in MEG data. Our results provide empirical evidence that the use of fMRI spatial priors improves the accuracy of MEG source reconstruction. The combined MEG and fMRI approach can provide neuroimaging data with better spatial and temporal resolution and add another perspective to our understanding of the neurobiology of language. Potential clinical applications include pre-surgical evaluation of language function in epilepsy patients and evaluation of the language network in children with language disorders.

Introduction

The last decade has witnessed great advances in multi-modal data fusion techniques (Auranen et al., 2009; Baillet & Garnero, 1997; Baillet et al., 1999; Debener et al., 2006; Friston et al., 2008; Henson et al., 2010; Nummenmaa et al., 2007; Sato et al., 2004; Wipf & Nagarajan, 2009) and increasing interest in studying high-order cognition in the human brain using multi-modal techniques, especially data fusion of functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) (Liljestrom et al., 2009; Pang et al., 2010; Vartiainen et al., 2011; Wang et al., 2012). Our previous study demonstrated spatial concordance in the left inferior frontal gyrus (IFG) for covert or overt generation versus overt repetition, and in the bilateral motor cortices for overt generation versus covert generation (Wang et al., 2012), in line with other studies (Pang et al., 2010; Vartiainen et al., 2011). In sum, these studies provide evidence that the two modalities assess the same language network during language production and comprehension tasks. The high spatial concordance between fMRI and MEG data in assessing language function provides confidence that fMRI can be used as a spatial constraint on MEG source localization. Consequently, the high spatiotemporal information obtained from this promising multi-modal data integration approach can potentially yield new insights into the complex brain networks supporting high-order cognition. Several integration schemes have been introduced to combine fMRI and MEG data, such as fMRI-guided equivalent current dipole (ECD) fitting (Ahlfors & Simpson, 2004), fMRI-constrained cortical current density imaging (Dale et al., 2000; Liu et al., 2008; Ou et al., 2010), and the recently popular Bayesian schemes applied to MEG source inversion (Auranen et al., 2009; Friston et al., 2008; Henson et al., 2010; Wipf & Nagarajan, 2009).

Currently there is no single best solution for combining fMRI and MEG data, only the most suitable approach for integrating the two modalities given the neurobiological characteristics of the brain functions under study. Based on the current literature, the parametric empirical Bayesian (PEB) approach (Henson et al., 2010) with distributed cortical current density imaging is well suited to high-order cognitive data on several counts. First, language tasks usually require communication and coordination among many regions distributed across the brain. It is hard to account for the highly distributed regions involved in language tasks using ECD modeling, which represents a relatively small number of focal sources. Second, researchers (Liljestrom et al., 2009; Vartiainen et al., 2011; Wang et al., 2012) have shown that there is no simple one-to-one spatial correspondence between fMRI and MEG results. For example, high concordance between MEG and fMRI was observed in the left inferior frontal gyrus (IFG), the bilateral motor cortices, and the right insula, but not in the left medial frontal gyrus or the left cingulate gyrus during an overt verb generation task (Wang et al., 2012). The PEB approach allows the use of fMRI spatial information as “soft” constraints rather than “hard” ones, and is thus well suited to high-order cognitive paradigms. Finally, currently available integration schemes have only been tested in experimental paradigms with short-onset, time-locked stimuli such as visual, auditory, motor, and somatosensory stimuli. Note that it is easy to design a short paradigm with hundreds of trials of such stimuli, so that a high SNR can be achieved for ECD fitting by averaging hundreds of trials of MEG signals.
Thus, for these types of paradigm design, incorporating fMRI spatial information into MEG source inversion offers very limited advantage and is not a cost-effective approach since MEG alone can already achieve sufficient localization accuracy with hundreds or even thousands of trials of neuromagnetic data. However, for high-order cognitive tasks, especially natural language processes like story listening (narrative comprehension), it is extremely hard to acquire hundreds of trials within a short time period. Further, it is difficult to separate evoked and induced responses to the stimulus, complicating the source localization process from MEG data alone. Long time intervals can potentially induce fatigue and more head motion which deteriorate the quality of MEG data. Therefore, for these types of cognitive tasks, the extra information from fMRI can play a crucial role in improving the solutions of ill-posed MEG source inversion and add another perspective to our understanding of the neurobiology of language.

In this study, we elaborate on the theory of the PEB approach and describe how fMRI spatial constraints are integrated into MEG source localization. We present empirical evidence from simulations and from the analysis of experimental data from sixteen participants who performed a narrative comprehension task during MEG recordings. Our results provide evidence that the combined MEG and fMRI approach offers time series with better spatial and temporal resolution, which might help us better understand the neural networks supporting specific cognitive processes (e.g., language, reading). Potential clinical applications include pre-surgical evaluation of language function in epilepsy patients and evaluation of the language network in children with language disorders.

Methods

The MEG Inverse Problem

$${\varvec{B}}={\varvec{L}}{\varvec{X}}+{{\varvec{E}}}_{1}$$

(1)

B is the vector of magnetic signals at a given time sample, L is the lead-field matrix, X is the unknown brain activity, and E₁ is assumed to be Gaussian noise with zero mean and known variance. The linear system in (1) is underdetermined since the number of sensors (hundreds) is far smaller than the number of possible sources (thousands). Two general methods are available for solving (1): the widely used ordinary least squares estimation (OLSE) and maximum likelihood estimation (MLE). The former requires no assumptions about the distributions but does not allow model testing or selection. In contrast, MLE provides complete and consistent solutions and can serve as a stepping stone to other inference methods, such as Bayesian methods and inference with missing data. We formulated the inversion of (1) within a probabilistic Bayesian framework, which allows us to choose hyperparameters and optimize MEG source inversion by incorporating fMRI spatial information.
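To make the underdetermination concrete, the following minimal sketch uses a hypothetical toy lead field with 2 sensors and 3 candidate sources (the matrix and source values are illustrative assumptions, not taken from the study). It shows that two different source configurations can produce exactly the same noise-free sensor data, which is why additional prior information is needed.

```python
def forward(lead_field, sources):
    """Noise-free forward model B = L X for a lead field stored as a list of rows."""
    return [sum(l_ij * x_j for l_ij, x_j in zip(row, sources)) for row in lead_field]

# Hypothetical toy lead field: 2 sensors (rows) and 3 candidate sources (columns).
L = [[1.0, 1.0, 0.0],
     [0.0, 1.0, 1.0]]

X_a = [2.0, 0.0, 1.0]

# The vector (1, -1, 1) lies in the null space of L, so adding it to X_a
# changes the source configuration without changing the sensor data.
null_vec = [1.0, -1.0, 1.0]
X_b = [a + n for a, n in zip(X_a, null_vec)]

B_a = forward(L, X_a)
B_b = forward(L, X_b)
print(B_a, B_b)
```

Because both source vectors explain the data equally well, the data alone cannot distinguish them; a prior over X is what selects among such solutions.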

Bayesian Framework

The probabilistic model for sources X can be expressed under the assumption of Gaussian noise at the sensor level. Then, the probability distribution is given by (2).

$$P\left({\varvec{B}}|{\varvec{X}}\right)\propto \mathrm{exp}\left(-\frac{1}{2}\beta {\Vert {\varvec{B}}-{\varvec{L}}\cdot {\varvec{X}}\Vert }^{2}\right)$$

(2)

$\beta$ denotes the inverse noise variance (precision). That is, the probability that the magnetic field B is observed for a given source current X follows a Gaussian probability density. According to Bayes’ theorem, (2) can be used to form the posterior probability over the unknown X by (3).

$$P\left({\varvec{X}}|{\varvec{B}},{\varvec{\lambda}}\right)=\frac{P({\varvec{B}}|{\varvec{X}},{\varvec{\lambda}})P({\varvec{X}}|{\varvec{\lambda}})}{P({\varvec{B}}|{\varvec{\lambda}})}$$

(3)

${\varvec{\lambda}}$ represents all the other assumptions and beliefs about the model. The denominator in (3) is the evidence for ${\varvec{\lambda}}$ and ensures that the posterior probability is normalized.

$$P\left({\varvec{B}}|{\varvec{\lambda}}\right)=\int P({\varvec{B}}|{\varvec{X}},{\varvec{\lambda}})P({\varvec{X}}|{\varvec{\lambda}})d{\varvec{X}}$$

(4)

The prior probability density $P({\varvec{X}}|{\varvec{\lambda}})$ represents all the information and constraints known about the unknown X before the data are seen, and can be regarded as a regularization that limits model overfitting. Since $P\left({\varvec{B}}|{\varvec{\lambda}}\right)$ does not depend on X, it can be omitted to give the unnormalized posterior density in (5).

$$P\left({\varvec{X}}|{\varvec{B}},{\varvec{\lambda}}\right)\propto P({\varvec{B}}|{\varvec{X}},{\varvec{\lambda}})P({\varvec{X}}|{\varvec{\lambda}})$$

(5)

The posterior probability is proportional to the likelihood times the prior probability. A plausible solution X must simultaneously give a high data likelihood $P({\varvec{B}}|{\varvec{X}},{\varvec{\lambda}})$ and be probable under the prior $P({\varvec{X}}|{\varvec{\lambda}})$ to yield an appreciable posterior probability $P\left({\varvec{X}}|{\varvec{B}},{\varvec{\lambda}}\right)$. The posterior probability in (3) is a measure of what is known after the data are seen and quantifies any new knowledge gained. The data likelihood $P\left({\varvec{B}}|{\varvec{X}},{\varvec{\lambda}}\right)$ is a measure of how well the model predicts the data and essentially determines whether the data under investigation contain any new information. Bayes’ theorem enables us to update the distribution over the latent variables from the prior to the posterior in light of the observed data. As a simple illustration, assume that non-informative priors are used: for each voxel in the brain, we believe there is a 50 percent chance that it is active. Then, via Bayesian statistics, we update these beliefs iteratively, so that some voxels might have a 90 percent chance of being active during the task given the data and priors.
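The voxel illustration above can be sketched numerically. The likelihood values below are hypothetical and chosen only for illustration; the function applies Bayes' theorem to a binary active/inactive state, starting from the non-informative 50 percent prior.

```python
def posterior_active(prior_active, lik_active, lik_inactive):
    """Posterior P(active | data) for a binary state, by Bayes' theorem."""
    evidence = lik_active * prior_active + lik_inactive * (1.0 - prior_active)
    return lik_active * prior_active / evidence

# Non-informative prior: the voxel is equally likely to be active or inactive.
p = 0.5

# Hypothetical likelihoods: each observation is three times more probable
# if the voxel is active (0.6) than if it is inactive (0.2).
history = [p]
for _ in range(2):
    p = posterior_active(p, lik_active=0.6, lik_inactive=0.2)
    history.append(p)

print([round(v, 3) for v in history])
```

With these illustrative numbers the belief moves from 0.5 to 0.75 after the first observation and to 0.9 after the second, because each posterior serves as the prior for the next update.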

Using the expectation maximization (EM) algorithm, the model is learned by alternating between estimating the posterior distribution over latent variables for a particular setting of model parameters and then re-estimating the best-fit parameters given that distribution over the latent variables (Beal, 2003; Friston et al., 2007). In Bayesian statistics, the maximum a posteriori (MAP) estimate can be used as a point estimate to summarize the posterior distribution. There are four major ways of computing MAP estimates, depending on the specifics of the problem. First, when conjugate priors are used, MAP estimates can be obtained analytically since the mode of the posterior distribution can be given in closed form. Second, MAP estimates can be computed through numerical optimization such as the conjugate gradient method or Newton’s method, which require the first or second derivatives. Third, MAP estimates can be obtained via a modification of the EM algorithm, which does not require derivatives of the posterior density. Finally, MAP estimates can also be calculated through a Monte Carlo method using simulated annealing.
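As a worked illustration of the first (closed-form) case, suppose, purely as an assumption for this sketch, that the prior on the sources is zero-mean Gaussian with covariance ${\varvec{C}}$ (a symbol introduced here for illustration) and that $\beta$ is the weight multiplying the squared residual in (2). The log posterior is then quadratic in X, and setting its gradient with respect to X to zero gives the mode:

$$\begin{array}{c}\ln P\left({\varvec{X}}|{\varvec{B}}\right)=-\frac{\beta }{2}{\Vert {\varvec{B}}-{\varvec{L}}{\varvec{X}}\Vert }^{2}-\frac{1}{2}{{\varvec{X}}}^{T}{{\varvec{C}}}^{-1}{\varvec{X}}+\mathrm{const}\\ {\widehat{{\varvec{X}}}}_{MAP}={\left(\beta {{\varvec{L}}}^{T}{\varvec{L}}+{{\varvec{C}}}^{-1}\right)}^{-1}\beta {{\varvec{L}}}^{T}{\varvec{B}}={\varvec{C}}{{\varvec{L}}}^{T}{\left({\varvec{L}}{\varvec{C}}{{\varvec{L}}}^{T}+{\beta }^{-1}{\varvec{I}}\right)}^{-1}{\varvec{B}}\end{array}$$

The second form only requires inverting a matrix whose size equals the number of sensors, which is convenient when sources greatly outnumber sensors. Under this assumed prior, spatial information enters entirely through ${\varvec{C}}$.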

PEB Framework

In the PEB framework (Henson et al., 2010), the third type of method is used to obtain MAP estimates via variational free energy under the Laplace approximation (Friston et al., 2007). Variational Bayes (VB) based on variational free energy is a generic approach to constructing an analytical approximation of the posterior probability distribution (Bishop, 1998; Friston et al., 2007). The foundation of VB is that the log-evidence can be expressed in terms of the free energy F and a Kullback–Leibler divergence term.

$$\begin{array}{c}L=\ln P({\varvec{B}},{\varvec{X}})\\ \ln P\left({\varvec{B}}|{\varvec{\lambda}}\right)=F+{D}_{KL}(Q\left({\varvec{X}}\right)\parallel P\left({\varvec{X}}|{\varvec{B}},{\varvec{\lambda}}\right))\\ F={\langle L({\varvec{X}})\rangle }_{q}-{\langle \ln Q({\varvec{X}})\rangle }_{q}\end{array}$$

(6)

${\langle L({\varvec{X}})\rangle }_{q}$ is the expected energy. $-{\langle \ln Q({\varvec{X}})\rangle }_{q}$ is the entropy, which in information theory measures the uncertainty in a random variable. From (6), the free energy is a lower bound on the log-evidence because the divergence term is never negative. The goal is to compute $Q({\varvec{X}})$ for each model by maximizing the free energy F, and then to use the resulting F for Bayesian inference and model comparison.
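The decomposition of the log-evidence into free energy plus divergence can be checked numerically on a small discrete example. The sketch below is a toy under stated assumptions: the latent variable takes only three values, and the joint probabilities are hypothetical numbers chosen for illustration.

```python
import math

# Hypothetical joint probabilities P(B, X = x) for the observed data B
# and a latent variable X with three possible values.
joint = [0.10, 0.25, 0.05]
evidence = sum(joint)                       # P(B)
posterior = [j / evidence for j in joint]   # P(X | B)

def free_energy(q):
    """F = <ln P(B, X)>_q - <ln Q(X)>_q for a discrete distribution q."""
    return sum(qi * (math.log(ji) - math.log(qi)) for qi, ji in zip(q, joint))

def kl_divergence(q, p):
    """Kullback-Leibler divergence D_KL(q || p) for discrete distributions."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p))

q = [0.3, 0.4, 0.3]                         # an arbitrary approximating distribution
F = free_energy(q)
gap = kl_divergence(q, posterior)
log_evidence = math.log(evidence)
print(F, gap, log_evidence)
```

For any choice of `q`, `F + gap` equals `log_evidence`, and `F` reaches the log-evidence only when `q` matches the true posterior, which is why maximizing F both tightens the bound and improves the approximation.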

fMRI Priors

FMRI results are usually presented as noise-normalized statistical parametric maps (SPMs) rather than as maps of raw signal strength, since noise variance may vary greatly between voxels. The topological features of these fMRI SPMs can be assigned probabilities that quantify the chance that a cluster of voxels appears active under the null hypothesis. A voxel whose mean fMRI signal increases in one experimental condition relative to another has a high probability of being an “active” source in MEG source space. Thus, fMRI results can be projected onto the cortical mesh and then converted into covariance matrices (Henson et al., 2010). Since the underlying physiological connections between fMRI and MEG signals are still unclear, fMRI data are treated as probabilistic information about the spatial location of active regions in the brain rather than as quantitative information about the amplitude of the neural activity. First, we define a number of discrete clusters by thresholding the SPMs; using each suprathreshold “cluster” from the fMRI SPMs to form a separate prior, rather than a single prior, allows the contributions of multiple constraints to be adjusted independently. Second, the fMRI clusters need to be projected onto the cortical mesh through (7), since the fMRI clusters are 3D volumes while the MEG mesh is defined on the cortical surface. ${X}_{i}^{fv}$ is the i-th fMRI cluster. ${S}_{i}^{CS}$ is its counterpart on the cortical surface. ${\varepsilon }_{i}$ is the error term, and f is the linkage function, here a Heaviside function that binarizes each fMRI spatial prior. H is the Voronoï-based interpolation function (Kiebel et al., 2000), which has been suggested to be superior to other interpolation methods (Henson et al., 2010).

$${S}_{i}^{CS}=H\left[f({X}_{i}^{fv})\right]+{\varepsilon }_{i}$$

(7)

Finally, these cortical patches need to be converted into covariance components. Before conversion, an extra spatial smoothing step is applied via the spatial coherence function in (8) to reduce misregistration errors.

$${\varvec{G}}=\exp \left(\sigma {\varvec{A}}\right)\approx \sum_{i=0}^{8}\frac{{\sigma }^{i}}{i!}{{\varvec{A}}}^{i}$$

(8)

The element ${a}_{ij}$ of matrix A equals 1 if vertices $i$ and $j$ are neighbors and 0 otherwise. $\sigma$ is the smoothing parameter. After smoothing, $\left[{q}_{1}\dots {q}_{N}\right]=G\left[{S}_{1}^{CS}\dots {S}_{N}^{CS}\right]$, the covariance components are generated by the outer product ${Q}_{i}={q}_{i}{q}_{i}^{T}$.
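The smoothing and outer-product steps can be sketched on a toy mesh. The example below assumes a hypothetical four-vertex chain in which each vertex neighbors the next, a single binarized patch, and an illustrative smoothing parameter of 0.6; none of these values come from the study. The smoothing operator is built as the truncated series of the matrix exponential up to order 8.

```python
def matmul(P, R):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * R[k][j] for k in range(len(R)))
             for j in range(len(R[0]))] for i in range(len(P))]

def smoothing_operator(A, sigma, order=8):
    """Truncated series for exp(sigma * A): sum over i of sigma^i / i! * A^i."""
    n = len(A)
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # i = 0 term
    G = [row[:] for row in term]
    for k in range(1, order + 1):
        # Next term: previous term times A, scaled by sigma / k.
        term = [[v * sigma / k for v in row] for row in matmul(term, A)]
        G = [[g + t for g, t in zip(g_row, t_row)] for g_row, t_row in zip(G, term)]
    return G

# Hypothetical mesh: 4 vertices in a chain, a_ij = 1 for neighboring vertices.
A = [[0, 1, 0, 0],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [0, 0, 1, 0]]

s = [0.0, 1.0, 0.0, 0.0]           # binarized patch: only vertex 1 is in the cluster
G = smoothing_operator(A, sigma=0.6)
q = [sum(G[i][j] * s[j] for j in range(4)) for i in range(4)]   # smoothed patch
Q = [[qi * qj for qj in q] for qi in q]                         # covariance component
print([round(v, 3) for v in q])
```

The smoothed patch spreads the binary cluster onto neighboring vertices with decreasing weight, so a small misregistration between the fMRI cluster and the true source still leaves prior variance at the correct location.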

Data Simulation

In (1), L was computed using a single sphere head model (Sarvas, 1987). Two dipoles at ${X}_{1}$(-38, 43, 5) and ${X}_{2}$(-54, -13, 5) in Montreal Neurological Institute (MNI) space were used to generate the simulated MEG data B. In order to check how priors affect both evoked and induced brain signals, ${X}_{1}$ and ${X}_{2}$ had different time courses in (9).

$$\begin{array}{c}{f}_{{X}_{2}}=\mathrm{sin}[\left(t-{t}_{0}\right)/(i\bullet \delta )\bullet 24\bullet 2\pi +\frac{\pi }{2}]\\ {f}_{{X}_{1}}=\mathrm{sin}[\left(t-{t}_{0}\right)\bullet 10\bullet 2\pi +\frac{\pi }{2}]\end{array}$$

(9)

In (9), $t\in \left[0.55, 0.85\right]$ denotes time in seconds, $i\in \left[1, 100\right]$ denotes the trial number, and $\delta$ is a positive random number that differs for each trial, so that ${X}_{2}$(-54, -13, 5) contains induced activity. We also added Gaussian noise to the simulated MEG data to generate data at three noise levels: no noise, signal-to-noise ratio = 10 dB, and signal-to-noise ratio = -10 dB. The MATLAB code for simulating MEG data is available on request, and a diagram illustrating the MEG data simulation process is provided in the supplementary materials.
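The two source time courses in (9) can be sketched as follows. This is not the authors' MATLAB code: the sampling rate of 600 Hz, the choice of ${t}_{0}$ as the start of the window, and the uniform range for $\delta$ are assumptions made only for this illustration, since the text does not specify them.

```python
import math
import random

def evoked(t, t0):
    """Time course of X1 in (9): a 10 Hz sinusoid that is identical on every trial."""
    return math.sin((t - t0) * 10 * 2 * math.pi + math.pi / 2)

def induced(t, t0, trial, delta):
    """Time course of X2 in (9): the phase term depends on the trial number and delta."""
    return math.sin((t - t0) / (trial * delta) * 24 * 2 * math.pi + math.pi / 2)

def simulate(n_trials=100, fs=600.0, t_start=0.55, t_end=0.85, seed=0):
    rng = random.Random(seed)
    n_samples = int(round((t_end - t_start) * fs)) + 1
    times = [t_start + k / fs for k in range(n_samples)]
    t0 = t_start                          # assumption: t0 is the start of the window
    x1, x2 = [], []
    for trial in range(1, n_trials + 1):
        delta = rng.uniform(0.5, 1.5)     # assumption: positive random number per trial
        x1.append([evoked(t, t0) for t in times])
        x2.append([induced(t, t0, trial, delta) for t in times])
    return times, x1, x2

times, x1, x2 = simulate()
print(len(times), len(x1), len(x2))
```

In this sketch every trial of the first source is the same waveform, whereas each trial of the second source differs, which is the property that makes trial averaging behave differently for the two sources.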

The fMRI SPM{T} map was generated using MarsBaR (Brett et al., 2002) and then projected to the cortical mesh using SPM8 (www.fil.ion.ucl.ac.uk/spm/). Both valid and invalid fMRI spatial priors were considered so that we can evaluate the impact of invalid priors that mismatch MEG dipole locations (see Fig. 1).

To evaluate the source reconstruction results, we used an absolute measure of localization error, defined as the Euclidean distance between the estimated peak location and the actual location of the source. We also used the area under the curve (AUC) of the receiver operating characteristic (ROC) curve to quantify the detection accuracy of the various inverse solutions (Barnes et al., 2013; Grova et al., 2006). The simulated MEG data provide the ground truth. The various inverse solutions (no fMRI priors, all fMRI priors, only valid fMRI priors, only invalid fMRI priors) need a decision threshold $\beta$ to build ROC curves. By comparing the inverse solutions with the gold standard, we were able to quantify the number of voxels detected as true positive (TP), true negative (TN), false positive (FP) and false negative (FN) for each threshold $\beta \in \left[\mathrm{0,1}\right]$. Sensitivity and specificity were then estimated using (10).

$$\begin{array}{c}specificity\left(\beta \right)=\frac{TN\left(\beta \right)}{TN\left(\beta \right)+FP\left(\beta \right)}\\ sensitivity\left(\beta \right)=\frac{TP\left(\beta \right)}{TP\left(\beta \right)+FN\left(\beta \right)}\end{array}$$

(10)

ROC curves were plotted using sensitivity($\beta$) as the y-axis and (1-specificity($\beta$)) as the x-axis for each threshold $\beta \in \left[\mathrm{0,1}\right]$. Then, the AUC was computed to evaluate detection accuracy. In general, an AUC value greater than 0.8 is considered sufficiently accurate, corresponding to a good detection rate of 80%. A higher AUC indicates better performance. To evaluate the time-course results, we used scatter plots to visualize the correlation between the “true” time courses and the estimated time courses in both the time and frequency domains. Each volume of interest had a radius of 10 mm. Time courses were extracted as the first eigenvariate of all voxels in the volume of interest, rather than the mean value, since the eigenvariate is more robust to heterogeneity of response within a cluster. The root mean square error (RMSE) was computed between the “true” time courses and the estimated time courses in both the time and frequency domains.
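The ROC construction described above can be sketched in a few lines. The source amplitudes and ground-truth labels below are hypothetical values for eight voxels, used only to show how (10) and the trapezoidal AUC fit together.

```python
def roc_points(scores, truth, thresholds):
    """Return sorted (1 - specificity, sensitivity) pairs, one per threshold."""
    points = []
    for beta in thresholds:
        tp = sum(1 for s, y in zip(scores, truth) if s >= beta and y)
        fn = sum(1 for s, y in zip(scores, truth) if s < beta and y)
        fp = sum(1 for s, y in zip(scores, truth) if s >= beta and not y)
        tn = sum(1 for s, y in zip(scores, truth) if s < beta and not y)
        sensitivity = tp / (tp + fn)
        specificity = tn / (tn + fp)
        points.append((1.0 - specificity, sensitivity))
    return sorted(points)

def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Hypothetical normalized source amplitudes and ground truth for 8 voxels.
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1, 0.05, 0.3]
truth = [True, True, True, False, False, False, False, False]
thresholds = [k / 100 for k in range(101)]   # beta in [0, 1]

points = roc_points(scores, truth, thresholds)
area = auc(points)
print(round(area, 3))
```

With few truly active voxels the curve consists of a handful of abrupt steps, which is one reason ROC curves from sparse simulated sources look non-smooth.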

Experimental MEG Data

The experimental MEG and fMRI data from sixteen participants (average age 15.8 years) were described in our previous study, and a detailed description of the data and paradigms can be found in Wang et al. (2012). MEG data were acquired using a 275-channel whole-head MEG system (VSM Med-Tech Ltd., Port Coquitlam, BC, Canada) sampled at 6 kHz, and fMRI data were acquired on a Philips Achieva 3-Tesla MRI scanner with Dual Quasar gradients (Philips Medical Systems, Best, The Netherlands). During the scan, the participants performed a narrative comprehension task comprising three conditions (story listening, question answering, and pure tone listening). In the current study, we focused only on the contrast of story listening versus tone listening. The group fMRI results were used as spatial priors for MEG source reconstruction (see Fig. 2). Three clusters that survived thresholding (height threshold T = 5.78 and extent threshold k = 50 voxels, p < 0.005 family-wise error rate corrected) were projected onto the template cortical mesh in Fig. 2-1 and correspond approximately to the left inferior frontal gyrus (IFG) and the bilateral superior temporal gyrus (STG).

Results

Simulation Study

Table 1 summarized the location error for both peak and center of mass from different inversion methods including multiple sparse priors (MSP) without spatial priors, MSP with all priors, MSP with only valid priors, and MSP with only invalid priors (see supplementary Fig. S2), as well as the AUC values. The ROC curves were plotted in Fig. 3.

Table 1 Summary of different inversion methods

Note that the ROC curves had a non-smooth appearance because the simulated data set contained few true positive voxels relative to true negative voxels. True positive voxels were detected as positive at an abrupt threshold because the distribution of values was narrow. Still, the AUC varied with the prior information incorporated in the model, and from this parameter we could see that performance was superior when valid fMRI priors, or a mixture of valid and invalid fMRI priors, were used in the estimation process.

For evoked source ${X}_{1}$(-38, 43, 5), the location error of peak activity was not affected by the fMRI priors, but the location error of the center of cluster decreased when the accurate fMRI priors were incorporated. In addition, the mixture of spatial priors with valid and invalid locations decreased the location error of the center of cluster. The AUC increased from good to excellent when the accurate fMRI priors were used.

For induced source ${X}_{2}$(-54, -13, 5), the location errors of the peak activity and the center of the cluster were greatly reduced by the fMRI priors. This matters because higher-order cognitive activity in the brain has been shown to produce induced responses. Even when only the inaccurate spatial prior was used, the location error and AUC were not greatly affected. The added white noise slightly decreased the AUC and increased the location error for both sources.

The extracted time courses were plotted against the actual time courses (see supplementary Fig. S6-7). Table 2 shows the RMSE values for the different inversion methods. The introduction of spatial priors reduced the discrepancy between the reconstructed time courses and the ground truth, which was more evident for evoked source activity X₁. For evoked source activity X₁, averaging over 100 trials did not change the RMSE compared with the unaveraged 100-trial time courses. For induced source activity X₂, however, averaging over 100 trials reduced the RMSE dramatically compared with the unaveraged 100-trial time courses. Inversion with all fMRI spatial priors, including both invalid and valid ones, gave the lowest RMSE for evoked source activity X₁, while inversion with only valid fMRI spatial priors gave the lowest RMSE for induced source activity X₂ without averaging. The virtual sensor approach gave the lowest RMSE for induced source activity X₂ after averaging all trials. The inversion approach with all fMRI spatial priors provided the lowest overall RMSE across evoked and induced source activity (X₁ and X₂). Time-course extraction was problematic for the induced response X₂ because of its trial-to-trial variability. The unaveraged time courses had a higher RMSE, indicating a large discrepancy between the original and extracted signals.

Table 2 RMSE as a quantitative measure of time-course extraction quality for each inversion method

Single-sided amplitude spectrum plots were generated for all time courses using the Fourier transform (see supplementary Fig. S8). All inversion methods yielded low RMSE. Table 3 shows the RMSE for the spectrum plots.

Table 3 RMSE as a quantitative measure of time-course extraction quality for each inversion method

Experimental MEG Data

The introduction of group fMRI results as spatial priors for MEG inverse problem significantly increased free-energy F (T(15) = 2.51, p < 0.05, one-tailed). For evoked activity, the effect of fMRI priors on the source reconstruction was mainly to divide activity slightly more bilaterally in the STG and reduce the spurious clusters (see Fig. 4). For induced activity, the effect of fMRI priors on the source reconstruction was to pull activity slightly more left in STG (see Fig. 4) that was consistent with known patterns of leftward lateralization of this task (Wang et al., 2012).

Discussion

We implemented the hierarchical Bayesian framework on MEG source reconstruction with fMRI spatial information incorporated as spatial priors and applied this approach to both simulated MEG data and experimental MEG data from sixteen adolescents during a narrative comprehension task.

For simulated MEG data, the spatial resolution of MEG source reconstruction increased (by 3 mm on average) when the prior information from fMRI was incorporated. The use of fMRI spatial priors greatly reduced the location error for the induced source in MEG data. This is important since induced neuronal responses often correspond to high-order cognitive tasks. Therefore, the additional spatial priors from fMRI could benefit the accuracy of source reconstruction, especially for this type of high-order cognitive process. Wang et al. (2012) reported similarities and differences between fMRI and MEG data from the same participants, so fMRI spatial priors would include both valid and invalid spatial locations. Our results suggest that the combination of accurate and inaccurate spatial priors still increased the accuracy of MEG source reconstruction, since the inaccurate priors are effectively discarded in the restricted maximum likelihood procedure. The AUC increased from good to excellent when the fMRI priors were incorporated into the MEG inverse problem. Thus, MSP with fMRI priors is a robust approach to the MEG inverse problem.

This is the first study to apply the hierarchical Bayesian framework to simulated MEG data containing both evoked and induced source activity. Evoked responses are phase-locked to trial onset, whereas induced responses have a random phase relationship over trials. Simple paradigms with short visual, auditory, somatosensory, or motor stimuli usually produce evoked responses. High-order cognitive paradigms have longer stimulus durations and engage more complex processes in the brain that vary over trials. This type of paradigm usually generates both evoked and induced activity. From our simulated data, we found that accurate fMRI spatial priors have evident effects on the induced activity but not on the evoked activity. The induced responses benefit more from the fMRI prior information since they show more trial-to-trial variability. In our experimental data, the activation patterns from evoked and induced responses separate different stages of the language processes involved in the narrative comprehension task. This finding demonstrates that the wide range of neuronal activity recorded by MEG could improve our understanding of language processes.

The introduction of accurate spatial priors reduced discrepancy between the reconstructed time courses and the actual time courses. The effect was more evident in the estimated time courses of the evoked source. This finding was encouraging in that spatial priors are beneficial not only in spatial accuracy but also in temporal accuracy. The trial-to-trial variability for induced source affected the accuracy of estimated time courses at each individual trial due to random nature of the phase relationship with the stimuli. When the time courses from induced source were transferred into frequency domain, the power spectrum in the frequency domain for each trial showed much less discrepancy between the ground truth and estimation. This finding confirmed other studies in the literature (David, Kilner, & Friston, 2006; Friston, 2006; Michalopoulos et al., 2011), suggesting using the average energy over trials to represent the induced responses.
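The difference between averaging time courses and averaging energy can be illustrated with a toy simulation. The sketch below assumes 200 hypothetical trials of a 10 Hz sinusoid sampled at an assumed 600 Hz, with the phase either fixed across trials (evoked) or drawn at random on each trial (induced); these settings are illustrative and are not the simulation parameters of this study.

```python
import math
import random

def make_trial(freq_hz, phase, times):
    """One trial of a unit-amplitude sinusoid with the given phase."""
    return [math.sin(2 * math.pi * freq_hz * t + phase) for t in times]

def mean_power(x):
    """Mean squared amplitude of a time course."""
    return sum(v * v for v in x) / len(x)

def average(trials):
    """Sample-by-sample average over trials."""
    return [sum(column) / len(trials) for column in zip(*trials)]

rng = random.Random(0)
times = [k / 600.0 for k in range(600)]     # 1 s at an assumed 600 Hz
n_trials = 200

evoked = [make_trial(10, 0.0, times) for _ in range(n_trials)]
induced = [make_trial(10, rng.uniform(0, 2 * math.pi), times) for _ in range(n_trials)]

evoked_avg_power = mean_power(average(evoked))      # power of the trial average
induced_avg_power = mean_power(average(induced))    # nearly cancels out
induced_power_avg = sum(mean_power(x) for x in induced) / n_trials  # average of per-trial power
print(evoked_avg_power, induced_avg_power, induced_power_avg)
```

Averaging the random-phase trials nearly cancels the signal, whereas averaging the per-trial power preserves it, consistent with representing induced responses by their average energy over trials.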

For our experimental data, the introduction of fMRI spatial priors significantly increased the free-energy bound, which indicates an improvement in model evidence from adding fMRI spatial information to MEG source reconstruction. Our experimental results are in line with Henson et al. (2011). The group composite activation maps of the story > tone contrast from MEG with fMRI spatial priors showed activation patterns very similar to those in our previous cross-sectional fMRI studies (Karunanayaka et al., 2007; Szaflarski et al., 2012; Vannest et al., 2009). FMRI spatial information reduced the spurious clusters for evoked activity and yielded a more left-lateralized activation pattern for induced activity. The activations in the STG were bilateral for evoked activity, whereas they were left-lateralized for induced activity. The bilateral activations in the STG, revealed by evoked responses, could be due to residual prelinguistic auditory processing, since the use of tone listening as the control condition presumably subtracts out earlier stages of auditory processing. The strong left-lateralized activations in the STG, revealed by induced responses, agree with other fMRI studies (Benson et al., 2006; Binder et al., 2000; Mummery et al., 1999; Scott et al., 2000), which suggested that the high-order cognitive processes for understanding speech sounds take place in the left temporal lobe. Therefore, MEG with fMRI spatial priors might be helpful in determining the lateralization of language functions in presurgical mapping.

This study has limitations. First, the fMRI spatial priors in the present study were binarized for simplicity. The fMRI spatial priors could also be a continuous function of the fMRI signal, although binary priors can produce slightly higher statistical values than continuous priors (Henson et al., 2010). Second, the forward model used in this study was the single sphere head model (Sarvas, 1987), which is a simplified model. Future studies could test different forward models (e.g., boundary-element models) and their effects on MEG inverse solutions with fMRI priors.

Conclusion

In summary, the hierarchical Bayesian framework allows us to incorporate fMRI spatial priors and improve MEG source estimates resulting in a distribution of likely solutions instead of a single solution. Combining MEG and fMRI data within the Bayesian framework is a promising approach to overcome the limitations of each modality and offers brain signal with fine spatiotemporal resolution which is crucial for effective connectivity analysis.

Data availability

All data are available by request.

Code availability

All code is available upon request.

References

Ahlfors, S. P., & Simpson, G. V. (2004). Geometrical interpretation of fMRI-guided MEG/EEG inverse estimates. NeuroImage, 22(1), 323–332. https://doi.org/10.1016/j.neuroimage.2003.12.044
Auranen, T., Nummenmaa, A., Vanni, S., Vehtari, A., Hamalainen, M. S., Lampinen, J., & Jaaskelainen, I. P. (2009). Automatic fMRI-guided MEG multidipole localization for visual responses. Human Brain Mapping, 30(4), 1087–1099. https://doi.org/10.1002/hbm.20570
Baillet, S., & Garnero, L. (1997). A Bayesian approach to introducing anatomo-functional priors in the EEG/MEG inverse problem. IEEE Transactions on Biomedical Engineering, 44(5), 374–385. https://doi.org/10.1109/10.568913
Baillet, S., Garnero, L., Marin, G., & Hugonin, J. P. (1999). Combined MEG and EEG source imaging by minimization of mutual information. IEEE Transactions on Biomedical Engineering, 46(5), 522–534. https://doi.org/10.1109/10.759053
Barnes, G. R., Chowdhury, R. A., Lina, J. M., Kobayashi, E., & Grova, C. (2013). MEG Source Localization of Spatially Extended Generators of Epileptic Activity: Comparing Entropic and Hierarchical Bayesian Approaches. PLoS ONE, 8(2), e55969. https://doi.org/10.1371/journal.pone.0055969
Beal, M. J. (2003). Variational algorithms for approximate Bayesian inference (Doctoral dissertation). University College London, University of London, United Kingdom.
Benson, R. R., Richardson, M., Whalen, D., & Lai, S. (2006). Phonetic processing areas revealed by sinewave speech and acoustically similar non-speech. NeuroImage, 31(1), 342–353.
Binder, J. R., Frost, J. A., Hammeke, T. A., Bellgowan, P. S., Springer, J. A., Kaufman, J. N., & Possing, E. T. (2000). Human temporal lobe activation by speech and nonspeech sounds. Cerebral Cortex, 10(5), 512–528. https://doi.org/10.1093/cercor/10.5.512
Bishop, C. M. (1998). Latent variable models. In Learning in graphical models (pp. 371–403). Springer, Dordrecht.
Brett, M., Anton, J.-L., Valabregue, R., & Poline, J.-B. (2002). Region of interest analysis using the MarsBar toolbox for SPM 99. NeuroImage, 16, S497.
Dale, A. M., Liu, A. K., Fischl, B. R., Buckner, R. L., Belliveau, J. W., Lewine, J. D., & Halgren, E. (2000). Dynamic statistical parametric mapping: combining fMRI and MEG for high-resolution imaging of cortical activity. Neuron, 26(1), 55–67.
David, O., Kilner, J. M., & Friston, K. J. (2006). Mechanisms of evoked and induced responses in MEG/EEG. NeuroImage, 31(4), 1580–1591. https://doi.org/10.1016/j.neuroimage.2006.02.034
Debener, S., Ullsperger, M., Siegel, M., & Engel, A. K. (2006). Single-trial EEG–fMRI reveals the dynamics of cognitive function. Trends in Cognitive Sciences, 10(12), 558–563. https://doi.org/10.1016/j.tics.2006.09.010
Friston, K. (2006). Dynamic causal modelling of brain responses. Journal of Psychophysiology, 20, 322.
Friston, K., Chu, C., Mourão-Miranda, J., Hulme, O., Rees, G., Penny, W., & Ashburner, J. (2008). Bayesian decoding of brain images. NeuroImage, 39(1), 181–205. https://doi.org/10.1016/j.neuroimage.2007.08.013
Friston, K., Mattout, J., Trujillo-Barreto, N., Ashburner, J., & Penny, W. (2007). Variational free energy and the Laplace approximation. NeuroImage, 34(1), 220–234. https://doi.org/10.1016/j.neuroimage.2006.08.035
Grova, C., Makni, S., Flandin, G., Ciuciu, P., Gotman, J., & Poline, J. B. (2006). Anatomically informed interpolation of fMRI data on the cortical surface. NeuroImage, 31(4), 1475–1486. https://doi.org/10.1016/j.neuroimage.2006.02.049
Henson, R. N., Flandin, G., Friston, K. J., & Mattout, J. (2010). A Parametric Empirical Bayesian framework for fMRI-constrained MEG/EEG source reconstruction. Human Brain Mapping, 31(10), 1512–1531. https://doi.org/10.1002/hbm.20956
Henson, R. N., Wakeman, D. G., Litvak, V., & Friston, K. J. (2011). A Parametric Empirical Bayesian Framework for the EEG/MEG Inverse Problem: Generative Models for Multi-Subject and Multi-Modal Integration. Frontiers in Human Neuroscience, 5, 76. https://doi.org/10.3389/fnhum.2011.00076
Karunanayaka, P. R., Holland, S. K., Schmithorst, V. J., Solodkin, A., Chen, E. E., Szaflarski, J. P., & Plante, E. (2007). Age-related connectivity changes in fMRI data from children listening to stories. NeuroImage, 34(1), 349–360. https://doi.org/10.1016/j.neuroimage.2006.08.028
Kiebel, S. J., Goebel, R., & Friston, K. J. (2000). Anatomically informed basis functions. NeuroImage, 11(6 Pt 1), 656–667. https://doi.org/10.1006/nimg.1999.0542
Liljestrom, M., Hulten, A., Parkkonen, L., & Salmelin, R. (2009). Comparing MEG and fMRI views to naming actions and objects. Human Brain Mapping, 30(6), 1845–1856. https://doi.org/10.1002/hbm.20785
Liu, Y., Xiang, J., Wang, Y., Vannest, J. J., Byars, A. W., & Rose, D. F. (2008). Spatial and frequency differences of neuromagnetic activities in processing concrete and abstract words. Brain Topography, 20(3), 123–129. https://doi.org/10.1007/s10548-007-0038-x
Michalopoulos, K., Iordanidou, V., Giannakakis, G. A., Nikita, K. S., & Zervakis, M. (2011, 5–7 Oct. 2011). Characterization of evoked and induced activity in EEG and assessment of intertrial variability. Paper presented at the Biomedical Engineering, 2011 10th International Workshop on.
Mummery, C. J., Ashburner, J., Scott, S. K., & Wise, R. J. S. (1999). Functional neuroimaging of speech perception in six normal and two aphasic subjects. Journal of the Acoustical Society of America, 106(1), 449.
Nummenmaa, A., Auranen, T., Hamalainen, M. S., Jaaskelainen, I. P., Lampinen, J., Sams, M., & Vehtari, A. (2007). Hierarchical Bayesian estimates of distributed MEG sources: theoretical aspects and comparison of variational and MCMC methods. NeuroImage, 35(2), 669–685. https://doi.org/10.1016/j.neuroimage.2006.05.001
Ou, W., Nummenmaa, A., Ahveninen, J., Belliveau, J. W., Hamalainen, M. S., & Golland, P. (2010). Multimodal functional imaging using fMRI-informed regional EEG/MEG source estimation. Neuroimage, 52(1), https://doi.org/10.1016/j.neuroimage.2010.03.001
Pang, E. W., Wang, F., Malone, M., Kadis, D. S., & Donner, E. J. (2010). Localization of Broca’s area using verb generation tasks in the MEG: Validation against fMRI. Neuroscience Letters, 490(3), 215–219. https://doi.org/10.1016/j.neulet.2010.12.055
Sarvas, J. (1987). Basic mathematical and electromagnetic concepts of the biomagnetic inverse problem. Physics in Medicine and Biology, 32(1), 11–22. https://doi.org/10.1088/0031-9155/32/1/004
Sato, M.-A., Yoshioka, T., Kajihara, S., Toyama, K., Goda, N., Doya, K., & Kawato, M. (2004). Hierarchical Bayesian estimation for MEG inverse problem. NeuroImage, 23(3), 806–826. https://doi.org/10.1016/j.neuroimage.2004.06.037
Scott, S. K., Blank, C. C., Rosen, S., & Wise, R. J. (2000). Identification of a pathway for intelligible speech in the left temporal lobe. Brain, 123(12), 2400–2406. https://doi.org/10.1093/brain/123.12.2400
Szaflarski, J. P., Altaye, M., Rajagopal, A., Eaton, K., Meng, X. X., Plante, E., & Holland, S. K. (2012). A 10-year longitudinal fMRI study of narrative comprehension in children and adolescents. NeuroImage, 63(3), 1188–1195. https://doi.org/10.1016/j.neuroimage
Vannest, J. J., Karunanayaka, P. R., Altaye, M., Schmithorst, V. J., Plante, E. M., Eaton, K. J., ... Holland, S. K. (2009). Comparison of fMRI data from passive listening and active-response story processing tasks in children. Journal of Magnetic Resonance Imaging, 29(4), 971–976. https://doi.org/10.1002/jmri.21694
Vartiainen, J., Liljestrom, M., Koskinen, M., Renvall, H., & Salmelin, R. (2011). Functional magnetic resonance imaging blood oxygenation level-dependent signal and magnetoencephalography evoked responses yield different neural functionality in reading. Journal of Neuroscience, 31(3), 1048–1058. https://doi.org/10.1523/JNEUROSCI.3113-10.2011
Wang, Y., Holland, S. K., & Vannest, J. (2012). Concordance of MEG and fMRI patterns in adolescents during verb generation. Brain Research, 1447, 79–90. https://doi.org/10.1016/j.brainres.2012.02.001
Wipf, D., & Nagarajan, S. (2009). A unified Bayesian framework for MEG/EEG source imaging. NeuroImage, 44(3), 947–966.

Acknowledgements

The authors would like to thank Ms. Kate Hibbard, Ms. Julie Franks, Ms. Sara Robertson, and Ms. Amanda Huber for their assistance in helping with recruitment and data collection, as well as Mr. Kendall O’Brien and Ms. Amanda Woods for their assistance in performing all the MRI scans.

Funding

This work was supported by the U.S. National Institute of Child Health and Human Development under Grant R01-HD38578 awarded to S.K. Holland.

Author information

Authors and Affiliations

Neuroimaging for Language, Literacy and Learning, Department of Special Education and Communication Disorders, University of Nebraska-Lincoln, Lincoln, NE, 68583, USA
Yingying Wang
Center for Brain, Biology and Behavior, University of Nebraska-Lincoln, Lincoln, NE, 68588, USA
Yingying Wang
Pediatric Neuroimaging Research Consortium, Cincinnati Children’s Hospital, Cincinnati, OH, 45229, USA
Scott K. Holland

Authors

Yingying Wang
Scott K. Holland

Contributions

Author contributions included conception and study design (YW and SKH), data collection and analysis (YW), interpretation of results (YW and SKH), drafting the manuscript work or revising it critically for important intellectual content (YW and SKH) and approval of final version to be published and agreement to be accountable for the integrity and accuracy of all aspects of the work (Both authors).

Corresponding author

Correspondence to Yingying Wang.

Ethics declarations

Conflicts of interest

Both authors declare no competing interests.

Ethics approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the Institutional Review Board of Cincinnati Children’s Hospital Medical Center.

Informed consent

Informed consent was obtained from all participants.

Consent for publication

Both authors consent to publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 24092 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, Y., Holland, S.K. Bayesian MEG time courses with fMRI priors. Brain Imaging and Behavior 16, 781–791 (2022). https://doi.org/10.1007/s11682-021-00550-4

Accepted: 28 August 2021
Published: 25 September 2021
Issue Date: April 2022
DOI: https://doi.org/10.1007/s11682-021-00550-4
