Skip to main content
Log in

A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

We propose a hierarchical Bayesian model for analyzing multi-site experimental fMRI studies. Our method takes the hierarchical structure of the data (subjects are nested within sites, and there are multiple observations per subject) into account and allows for modeling between-site variation. Using posterior predictive model checking and model selection based on the deviance information criterion (DIC), we show that our model provides a good fit to the observed data by sharing information across the sites. We also propose a simple approach for evaluating the efficacy of the multi-site experiment by comparing the results to those that would be expected in hypothetical single-site experiments with the same sample size.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.

Similar content being viewed by others

References

  • American Psychiatric Association (2000). Diagnostic and statistical manual of mental disorders, Washington, DC, USA.

  • DeYoe, E.A., Bandettini, P., Neitz, J., Miller, D., & Winans, P. (1994). Functional magnetic resonance imaging (fMRI) of the human brain. Journal of Neuroscience Methods, 54, 171–187.

    Article  PubMed  Google Scholar 

  • Friedman, L., Glover, G.H., Krenz, D., & Magnotta, V. (2006). Reducing inter-scanner variability of activation in a multicenter fMRI study: role of smoothness equalization. NeuroImage, 32, 1656–1668.

    Article  PubMed  Google Scholar 

  • Friedman, L., Stern, H., Brown, G.G., Mathalon, D.H., Turner, J., Glover, G.H., Gollub, R.L., Lauriello, J., Lim, K.O., Cannon, T., Greve, D.N., Bockholt, H.J., Belger, A., Mueller, B., Doty, M.J., He, J., Wells, W., Smyth, P., Pieper, S., Kim, S., Kubicki, M., Vangel, M., & Potkin, S.G. (2008). Test-retest and between-site reliability in a multicenter fMRI study. Human Brain Mapping, 29, 958–972.

    Article  PubMed  Google Scholar 

  • Gelman, A., Carlin, J., Stern, H., & Rubin, D. (2003). Bayesian data analysis (2nd ed.). London: Chapman & Hall/CRC.

    Google Scholar 

  • Glahn, D.C., Laird, A.R., Ellison-Wright, I., Thelen, S.M., Robinson, J.L., Lancaster, J.L., Bullmore, E., & Fox, P.T. (2008). Meta-analysis of gray matter anomalies in schizophrenia: application of anatomic likelihood estimation and network analysis. Biological Psychiatry, 64, 774–781.

    Article  PubMed  Google Scholar 

  • Goghari, V.M., Sponheim, S.R., & MacDonald, A.W. III (2010). The functional neuroanatomy of symptom dimensions in schizophrenia: a qualitative and quantitative review of a persistent question. Neuroscience and Biobehavioral Reviews, 34, 468–486.

    Article  PubMed  Google Scholar 

  • Lazar, N. (2008). The statistical analysis of functional MRI data. New York: Springer.

    Google Scholar 

  • Lindquist, M.A. (2008). The statistical analysis of fMRI data. Statistical Science, 23(4), 439–464.

    Article  Google Scholar 

  • Potkin, S.G., Turner, J.A., Brown, G.G., McCarthy, G., Greve, D.N., Glover, G.H., Manoach, D.S., Belger, A., Diaz, M., Wible, C.G., Ford, J.M., Mathalon, D.H., Gollub, R., Lauriello, J., O’Leary, D., van Erp, T.G.M., Toga, A.W., Preda, A., & Lim, K.O. (2009). Working memory and DLPFC inefficiency in schizophrenia: the FBIRN study. Schizophrenia Bulletin, 35, 19–31.

    Article  PubMed  Google Scholar 

  • Robins, J.M., van der Vaart, A., & Ventura, V. (2000). Asymptotic distribution of p values in composite null models. Journal of the American Statistical Association, 95, 1143–1156.

    Google Scholar 

  • Smith, S.M., Jenkinson, M., Woolrich, M.W., Beckmann, C.F., Behrens, T.E.J., Johansen-Berg, H., Bannister, P.R., De Luca, M., Drobnjak, I., Flitney, D.E., Niazy, R.K., Saunders, J., Vickers, J., Zhang, Y., De Stefano, N., Brady, J.M., & Matthews, P.M. (2004). Advances in functional and structural MR image analysis and implementation as FSL. NeuroImage, 23(Suppl. 1), S208–S219.

    Article  PubMed  Google Scholar 

  • Spiegelhalter, D.J., Best, N.G., Carlin, B.P., & van der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society. Series B, 64, 583–639.

    Article  Google Scholar 

  • Sternberg, S. (1966). High-speed scanning in human memory. Science, 153, 652–654.

    Article  PubMed  Google Scholar 

  • Van Snellenberg, J.X. (2009). Working memory and long-term memory deficits in schizophrenia: is there a common substrate? Psychiatry Research, 174, 89–96.

    Article  PubMed  Google Scholar 

  • Woolrich, M.W. (2012). Bayesian inference in FMRI. NeuroImage, 62(2), 801–810.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The project described was supported by the National Center for Research Resources and the National Center for Advancing Translational Sciences, National Institutes of Health, through Grant UL1 TR000153. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The authors also acknowledge NIH funding from P41RR013218.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Babak Shahbaba.

Additional information

Bo Zhou and Kinh H. Tieu are co-first authors.

Appendices

Appendix A

1.1 A.1 Gibbs Sampler for the Normal Model

For our hierarchical Bayesian model with normal distribution, we use Markov Chain Monte Carlo (MCMC) to sample from the posterior distribution of model parameters. Given a weakly informative prior \(f(\sigma^{2}_{\mathit{run}})\propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)\) for \(\sigma^{2}_{\mathit{run}}\), we sample visit mean and run variance as follows:

Class mean is determined by group effect, hemisphere effect, and load effects. Let β=(β 1,β 2,β 3,β 4) be coefficients for group effect, hemisphere effect, and load effect; also, let X be the design matrix. Given a weakly informative prior \(f(\sigma^{2}_{\mathit{sub}}) \propto\mathrm{Inv}\mbox{-}\chi^{2}(1,10)\) for \(\sigma^{2}_{\mathit{sub}}\) and flat prior f(β)∝1 for β, we have

Given a weakly informative prior \(f(\sigma^{2}_{\mathit{site}W_{s}}) \propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)\) for \(\sigma^{2}_{\mathit{site}W_{s}}\), we sample subject-specific site effects b si and site variation within subjects \(\sigma^{2}_{\mathit{site}W_{s}}\) as follows:

Given a weakly informative prior \(f(\sigma^{2}_{\mathit{site}A_{s}}) \propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)\) for \(\sigma^{2}_{\mathit{site}A_{s}}\) and a flat prior f(b s )∝1 for site mean b s , we sample site-specific site effects b s and site variation across subjects \(\sigma^{2}_{\mathit{site}A_{s}}\) as follows:

1.2 A.2 Gibbs Sampler for the t Model

For the hierarchical model with the t-distribution, the Gibbs sampler is similar to what we discussed above, except for the first part of the hierarchy where we introduce latent variable \(V_{sijk}^{hlg}\). Given a weakly informative prior \(f(\sigma^{2}_{\mathit{run}})\propto\frac{1}{\sigma^{2}_{\mathit{run}}}\) for \(\sigma^{2}_{\mathit{run}}\), we sample \(\mu^{hlg}_{sij}\) and \(\sigma^{2}_{\mathit{var}}\) as follows:

Appendix B

Here, we present the steps involved in the derivation of e s (Equation (9)). To find \(\hat{\varSigma}_{m}\), we start by rewriting our hierarchical Bayesian model as a multivariate normal model as follows:

$$ Y | \beta, \hat{\varSigma}_{m} \sim N(X\beta, \hat{ \varSigma}_{m}), $$

where X is the design matrix, and β=(β 1,β 2,β 3,β 4,b 1,…,b 8). The diagonal elements of the matrix are equal to the variance of \(Y_{sijk}^{hlg}\),

$$ \mathrm{Var}\bigl(Y_{sijk}^{hlg}\bigr)= \hat{ \sigma}^{2}_{\mathit{run}} +\hat{\sigma}^{2}_{\mathit{sub}}+ \hat{\sigma}^{2}_{\mathit{site}W_s}+\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

The off-diagonal elements are equal to the covariance of observations. If two observations come from different subjects, the covariance of \(Y_{sijk}^{hlg}\) with \(Y_{s'i'j'k'}^{h'l'g'}\) is 0,

$$ \textrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{s'i'j'k'}^{h'l'g'} \bigr)= 0. $$

If two observations belong to the same subject but different hemispheres or loads, then the covariance of \(Y_{sijk}^{hlg}\) with \(Y_{sij'k'}^{h'l'g}\) is

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{h'l'g} \bigr)=\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

If two observations belong to the same subject and the same hemisphere and load, but different visits, then

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Finally, if the two observations belong to the same subject, hemisphere, load, and visit, but different runs, we have

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sijk'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{sub}}+\hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Assuming a flat prior on β and conditional on \(\hat{\varSigma}_{m}\), we can derive the posterior distribution of β, which includes the group effect β 1, as follows:

$$ \beta|Y, \hat{\varSigma}_{m} \sim N \bigl(\bigl( X^T\hat{ \varSigma}_{m}^{-1}X\bigr)^{-1}X^T\hat{ \varSigma}_{m}^{-1}Y,\bigl(X^T\hat{ \varSigma}_{m}^{-1}X\bigr)^{-1} \bigr). $$

The conditional variance of β is therefore

The first element of the above matrix is the conditional variance of the group effect for the multi-center study, \(\mathrm{Var}(\beta_{1}|Y,\hat{\varSigma}_{m})\).

We can use a similar approach to obtain \(\mathrm{Var}(\beta_{1}|Y,\hat{\varSigma}_{s})\) for a single-center study at site s, assuming all subjects are sent to this site only. To this end, we assume

$$ Y | \beta', \hat{\varSigma}_{s} \sim N \bigl(X'\beta', \hat{\varSigma}_{s}\bigr), $$

where X′ is the design matrix for the hypothetical single-center study, β′=(β 1,β 2,β 3,β 4,b s ). Note that in this case, s is fixed and refers to the site to which all subjects are sent.

As before, the diagonal elements of \(\hat{\varSigma}_{s}\) are equal to the variance of \(Y_{sijk}^{hlg}\), similar to the multi-center study, and the off-diagonal elements are equal to the covariance of observations. Note that for a single-center study, we still use a hierarchical model in which there are two visits per subject and three runs per visit given hemisphere and load. Similar to our model for multi-center study, the hierarchical model for a single-center study accounts for within-subject site effects and across-subject site effects. (See Equations (3) and (4).) Therefore, finding the covariance of observations for a single-center study is similar to what we discussed for the multi-center study. The difference is that for a single-center study, s remains fixed for all subjects. Therefore, while the steps to calculate the covariance matrix (discussed below) for a single-center study are similar to those of the multi-center study, the resulting covariance matrices are not the same.

If two observations are obtained from different subjects, their covariance is zero,

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{si'j'k'}^{h'l'g'} \bigr)= 0. $$

If two observations belong to the same subject but different hemispheres or loads, then the covariance of \(Y_{sijk}^{hlg}\) with \(Y_{sij'k'}^{h'l'g}\) is

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{h'l'g} \bigr)=\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Note that unlike the multi-center study, for a single-center study s is fixed. If two observations belong to the same subject and the same hemisphere and load, but different visits, then

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Finally, if the two observations belong to the same subject, hemisphere, load, and visit, but different runs, we have

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sijk'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{sub}}+\hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

As mentioned above, both multi-center study and single-center study have similar covariance matrices because they have similar hierarchical structures. For example, we use Equations (3) and (4) for both of them. However, for a single-center study, all subjects have the same within-subject and across-subject site effects since there is only one site; whereas, for the multi-center study, the within-subject and across-subject site effects vary depending on the site from which the observations for the same subject are obtained.

Following the same procedure we discussed for the multi-center study, we can estimate the conditional variance of the group effect \(\mathrm{Var}(\beta_{1}|Y, \hat{\varSigma}_{s})\) for the hypothetical scenario where all subjects are sent to site s.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, B., Konstorum, A., Duong, T. et al. A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study. Psychometrika 78, 260–278 (2013). https://doi.org/10.1007/s11336-012-9298-9

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11336-012-9298-9

Key words

Navigation