A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study

Zhou, Bo; Konstorum, Anna; Duong, Thao; Tieu, Kinh H.; Wells, William M.; Brown, Gregory G.; Stern, Hal S.; Shahbaba, Babak

doi:10.1007/s11336-012-9298-9

A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study

Published: 28 November 2012

Volume 78, pages 260–278, (2013)
Cite this article

Psychometrika Aims and scope Submit manuscript

Bo Zhou¹,
Anna Konstorum¹,
Thao Duong¹,
Kinh H. Tieu²,
William M. Wells²,
Gregory G. Brown³,
Hal S. Stern¹ &
…
Babak Shahbaba¹

1194 Accesses
3 Citations
Explore all metrics

Abstract

We propose a hierarchical Bayesian model for analyzing multi-site experimental fMRI studies. Our method takes the hierarchical structure of the data (subjects are nested within sites, and there are multiple observations per subject) into account and allows for modeling between-site variation. Using posterior predictive model checking and model selection based on the deviance information criterion (DIC), we show that our model provides a good fit to the observed data by sharing information across the sites. We also propose a simple approach for evaluating the efficacy of the multi-site experiment by comparing the results to those that would be expected in hypothetical single-site experiments with the same sample size.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

Article 04 June 2018

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Article Open access 22 August 2014

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

References

American Psychiatric Association (2000). Diagnostic and statistical manual of mental disorders, Washington, DC, USA.
DeYoe, E.A., Bandettini, P., Neitz, J., Miller, D., & Winans, P. (1994). Functional magnetic resonance imaging (fMRI) of the human brain. Journal of Neuroscience Methods, 54, 171–187.
Article PubMed Google Scholar
Friedman, L., Glover, G.H., Krenz, D., & Magnotta, V. (2006). Reducing inter-scanner variability of activation in a multicenter fMRI study: role of smoothness equalization. NeuroImage, 32, 1656–1668.
Article PubMed Google Scholar
Friedman, L., Stern, H., Brown, G.G., Mathalon, D.H., Turner, J., Glover, G.H., Gollub, R.L., Lauriello, J., Lim, K.O., Cannon, T., Greve, D.N., Bockholt, H.J., Belger, A., Mueller, B., Doty, M.J., He, J., Wells, W., Smyth, P., Pieper, S., Kim, S., Kubicki, M., Vangel, M., & Potkin, S.G. (2008). Test-retest and between-site reliability in a multicenter fMRI study. Human Brain Mapping, 29, 958–972.
Article PubMed Google Scholar
Gelman, A., Carlin, J., Stern, H., & Rubin, D. (2003). Bayesian data analysis (2nd ed.). London: Chapman & Hall/CRC.
Google Scholar
Glahn, D.C., Laird, A.R., Ellison-Wright, I., Thelen, S.M., Robinson, J.L., Lancaster, J.L., Bullmore, E., & Fox, P.T. (2008). Meta-analysis of gray matter anomalies in schizophrenia: application of anatomic likelihood estimation and network analysis. Biological Psychiatry, 64, 774–781.
Article PubMed Google Scholar
Goghari, V.M., Sponheim, S.R., & MacDonald, A.W. III (2010). The functional neuroanatomy of symptom dimensions in schizophrenia: a qualitative and quantitative review of a persistent question. Neuroscience and Biobehavioral Reviews, 34, 468–486.
Article PubMed Google Scholar
Lazar, N. (2008). The statistical analysis of functional MRI data. New York: Springer.
Google Scholar
Lindquist, M.A. (2008). The statistical analysis of fMRI data. Statistical Science, 23(4), 439–464.
Article Google Scholar
Potkin, S.G., Turner, J.A., Brown, G.G., McCarthy, G., Greve, D.N., Glover, G.H., Manoach, D.S., Belger, A., Diaz, M., Wible, C.G., Ford, J.M., Mathalon, D.H., Gollub, R., Lauriello, J., O’Leary, D., van Erp, T.G.M., Toga, A.W., Preda, A., & Lim, K.O. (2009). Working memory and DLPFC inefficiency in schizophrenia: the FBIRN study. Schizophrenia Bulletin, 35, 19–31.
Article PubMed Google Scholar
Robins, J.M., van der Vaart, A., & Ventura, V. (2000). Asymptotic distribution of p values in composite null models. Journal of the American Statistical Association, 95, 1143–1156.
Google Scholar
Smith, S.M., Jenkinson, M., Woolrich, M.W., Beckmann, C.F., Behrens, T.E.J., Johansen-Berg, H., Bannister, P.R., De Luca, M., Drobnjak, I., Flitney, D.E., Niazy, R.K., Saunders, J., Vickers, J., Zhang, Y., De Stefano, N., Brady, J.M., & Matthews, P.M. (2004). Advances in functional and structural MR image analysis and implementation as FSL. NeuroImage, 23(Suppl. 1), S208–S219.
Article PubMed Google Scholar
Spiegelhalter, D.J., Best, N.G., Carlin, B.P., & van der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society. Series B, 64, 583–639.
Article Google Scholar
Sternberg, S. (1966). High-speed scanning in human memory. Science, 153, 652–654.
Article PubMed Google Scholar
Van Snellenberg, J.X. (2009). Working memory and long-term memory deficits in schizophrenia: is there a common substrate? Psychiatry Research, 174, 89–96.
Article PubMed Google Scholar
Woolrich, M.W. (2012). Bayesian inference in FMRI. NeuroImage, 62(2), 801–810.
Article PubMed Google Scholar

Download references

Acknowledgements

The project described was supported by the National Center for Research Resources and the National Center for Advancing Translational Sciences, National Institutes of Health, through Grant UL1 TR000153. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The authors also acknowledge NIH funding from P41RR013218.

Author information

Authors and Affiliations

University of California, Irvine, Irvine, USA
Bo Zhou, Anna Konstorum, Thao Duong, Hal S. Stern & Babak Shahbaba
Harvard Medical School, Boston, USA
Kinh H. Tieu & William M. Wells
University of California, San Diego, San Diego, USA
Gregory G. Brown

Authors

Bo Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Anna Konstorum
View author publications
You can also search for this author in PubMed Google Scholar
Thao Duong
View author publications
You can also search for this author in PubMed Google Scholar
Kinh H. Tieu
View author publications
You can also search for this author in PubMed Google Scholar
William M. Wells
View author publications
You can also search for this author in PubMed Google Scholar
Gregory G. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Hal S. Stern
View author publications
You can also search for this author in PubMed Google Scholar
Babak Shahbaba
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Babak Shahbaba.

Additional information

Bo Zhou and Kinh H. Tieu are co-first authors.

Appendices

Appendix A

1.1 A.1 Gibbs Sampler for the Normal Model

For our hierarchical Bayesian model with normal distribution, we use Markov Chain Monte Carlo (MCMC) to sample from the posterior distribution of model parameters. Given a weakly informative prior $f(\sigma^{2}_{\mathit{run}})\propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)$ for $\sigma^{2}_{\mathit{run}}$, we sample visit mean and run variance as follows:

Class mean is determined by group effect, hemisphere effect, and load effects. Let β=(β ₁,β ₂,β ₃,β ₄) be coefficients for group effect, hemisphere effect, and load effect; also, let X be the design matrix. Given a weakly informative prior $f(\sigma^{2}_{\mathit{sub}}) \propto\mathrm{Inv}\mbox{-}\chi^{2}(1,10)$ for $\sigma^{2}_{\mathit{sub}}$ and flat prior f(β)∝1 for β, we have

Given a weakly informative prior $f(\sigma^{2}_{\mathit{site}W_{s}}) \propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)$ for $\sigma^{2}_{\mathit{site}W_{s}}$, we sample subject-specific site effects b _si and site variation within subjects $\sigma^{2}_{\mathit{site}W_{s}}$ as follows:

Given a weakly informative prior $f(\sigma^{2}_{\mathit{site}A_{s}}) \propto \mathrm{Inv}\mbox{-}\chi^{2}(1,10)$ for $\sigma^{2}_{\mathit{site}A_{s}}$ and a flat prior f(b _s)∝1 for site mean b _s, we sample site-specific site effects b _s and site variation across subjects $\sigma^{2}_{\mathit{site}A_{s}}$ as follows:

1.2 A.2 Gibbs Sampler for the t Model

For the hierarchical model with the t-distribution, the Gibbs sampler is similar to what we discussed above, except for the first part of the hierarchy where we introduce latent variable $V_{sijk}^{hlg}$. Given a weakly informative prior $f(\sigma^{2}_{\mathit{run}})\propto\frac{1}{\sigma^{2}_{\mathit{run}}}$ for $\sigma^{2}_{\mathit{run}}$, we sample $\mu^{hlg}_{sij}$ and $\sigma^{2}_{\mathit{var}}$ as follows:

Appendix B

Here, we present the steps involved in the derivation of e _s (Equation (9)). To find $\hat{\varSigma}_{m}$, we start by rewriting our hierarchical Bayesian model as a multivariate normal model as follows:

$$ Y | \beta, \hat{\varSigma}_{m} \sim N(X\beta, \hat{ \varSigma}_{m}), $$

where X is the design matrix, and β=(β ₁,β ₂,β ₃,β ₄,b ₁,…,b ₈). The diagonal elements of the matrix are equal to the variance of $Y_{sijk}^{hlg}$,

$$ \mathrm{Var}\bigl(Y_{sijk}^{hlg}\bigr)= \hat{ \sigma}^{2}_{\mathit{run}} +\hat{\sigma}^{2}_{\mathit{sub}}+ \hat{\sigma}^{2}_{\mathit{site}W_s}+\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

The off-diagonal elements are equal to the covariance of observations. If two observations come from different subjects, the covariance of $Y_{sijk}^{hlg}$ with $Y_{s'i'j'k'}^{h'l'g'}$ is 0,

$$ \textrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{s'i'j'k'}^{h'l'g'} \bigr)= 0. $$

If two observations belong to the same subject but different hemispheres or loads, then the covariance of $Y_{sijk}^{hlg}$ with $Y_{sij'k'}^{h'l'g}$ is

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{h'l'g} \bigr)=\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

If two observations belong to the same subject and the same hemisphere and load, but different visits, then

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Finally, if the two observations belong to the same subject, hemisphere, load, and visit, but different runs, we have

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sijk'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{sub}}+\hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Assuming a flat prior on β and conditional on $\hat{\varSigma}_{m}$, we can derive the posterior distribution of β, which includes the group effect β ₁, as follows:

$$ \beta|Y, \hat{\varSigma}_{m} \sim N \bigl(\bigl( X^T\hat{ \varSigma}_{m}^{-1}X\bigr)^{-1}X^T\hat{ \varSigma}_{m}^{-1}Y,\bigl(X^T\hat{ \varSigma}_{m}^{-1}X\bigr)^{-1} \bigr). $$

The conditional variance of β is therefore

The first element of the above matrix is the conditional variance of the group effect for the multi-center study, $\mathrm{Var}(\beta_{1}|Y,\hat{\varSigma}_{m})$.

We can use a similar approach to obtain $\mathrm{Var}(\beta_{1}|Y,\hat{\varSigma}_{s})$ for a single-center study at site s, assuming all subjects are sent to this site only. To this end, we assume

$$ Y | \beta', \hat{\varSigma}_{s} \sim N \bigl(X'\beta', \hat{\varSigma}_{s}\bigr), $$

where X′ is the design matrix for the hypothetical single-center study, β′=(β ₁,β ₂,β ₃,β ₄,b _s). Note that in this case, s is fixed and refers to the site to which all subjects are sent.

As before, the diagonal elements of $\hat{\varSigma}_{s}$ are equal to the variance of $Y_{sijk}^{hlg}$, similar to the multi-center study, and the off-diagonal elements are equal to the covariance of observations. Note that for a single-center study, we still use a hierarchical model in which there are two visits per subject and three runs per visit given hemisphere and load. Similar to our model for multi-center study, the hierarchical model for a single-center study accounts for within-subject site effects and across-subject site effects. (See Equations (3) and (4).) Therefore, finding the covariance of observations for a single-center study is similar to what we discussed for the multi-center study. The difference is that for a single-center study, s remains fixed for all subjects. Therefore, while the steps to calculate the covariance matrix (discussed below) for a single-center study are similar to those of the multi-center study, the resulting covariance matrices are not the same.

If two observations are obtained from different subjects, their covariance is zero,

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{si'j'k'}^{h'l'g'} \bigr)= 0. $$

If two observations belong to the same subject but different hemispheres or loads, then the covariance of $Y_{sijk}^{hlg}$ with $Y_{sij'k'}^{h'l'g}$ is

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{h'l'g} \bigr)=\hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Note that unlike the multi-center study, for a single-center study s is fixed. If two observations belong to the same subject and the same hemisphere and load, but different visits, then

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sij'k'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

Finally, if the two observations belong to the same subject, hemisphere, load, and visit, but different runs, we have

$$ \mathrm{Cov}\bigl(Y_{sijk}^{hlg},Y_{sijk'}^{hlg} \bigr)= \hat{\sigma}^{2}_{\mathit{sub}}+\hat{\sigma}^{2}_{\mathit{site}W_s}+ \hat{\sigma}^{2}_{\mathit{site}A_s}. $$

As mentioned above, both multi-center study and single-center study have similar covariance matrices because they have similar hierarchical structures. For example, we use Equations (3) and (4) for both of them. However, for a single-center study, all subjects have the same within-subject and across-subject site effects since there is only one site; whereas, for the multi-center study, the within-subject and across-subject site effects vary depending on the site from which the observations for the same subject are obtained.

Following the same procedure we discussed for the multi-center study, we can estimate the conditional variance of the group effect $\mathrm{Var}(\beta_{1}|Y, \hat{\varSigma}_{s})$ for the hypothetical scenario where all subjects are sent to site s.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, B., Konstorum, A., Duong, T. et al. A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study. Psychometrika 78, 260–278 (2013). https://doi.org/10.1007/s11336-012-9298-9

Download citation

Received: 02 August 2011
Revised: 02 April 2012
Published: 28 November 2012
Issue Date: April 2013
DOI: https://doi.org/10.1007/s11336-012-9298-9

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Hierarchical Modeling Approach to Data Analysis and Study Design in a Multi-site Experimental fMRI Study

Abstract

Access this article

Similar content being viewed by others

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

References

Acknowledgements