Skip to main content
Log in

Bayesian mediation analysis methods to explore racial/ethnic disparities in anxiety among cancer survivors

  • Original Paper
  • Published:
Behaviormetrika Aims and scope Submit manuscript

Abstract

Third-variables refer to the middle variables that are positioned in the pathway between an exposure and an outcome variable. Mediation analysis is a statistical approach to identify third variables, and to estimate and test third-variable effects that explain the exposure – outcome association. In this paper, we propose three methods for mediation analysis in Bayesian settings: (1) the function of coefficients method, (2) the product of partial differences method, and (3) the resampling method. The explicit benefit of the Bayesian mediation analysis is that the hierarchical relationships between the exposure variable and third variables, and between third variables and the outcome are naturally built into the Bayesian models. We performed sensitivity analysis to assess the impact of the choice of prior distributions in the three Bayesian inference methods. We found that the proposed methods are robust across a range of priors. Finally, we illustrate the proposed methods using real data from the MY-Health Study to explore racial/ethnic disparities in anxiety among cancer survivors. The results are comparable to those from the Frequentist’s general mediation analysis but request shorter computing time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability Statement

Due to confidentiality agreements, supporting data can only be made available to researchers subject to a non-disclosure agreement. Details of the data and how to request access are available from https://healthcaredelivery.cancer.gov/data/ at NIH/NCI.

References

  • Aung MT, Song Y, Ferguson K K, Cantonwine DE, Zeng L, McElrath TF, . . . Mukherjee, B. (2020, jun). Application of a novel analytical pipeline for high-dimensional multivariate mediation analysis of environmental data. Nature Communication. Retrieved from https://doi.org/10.1101%2F2020.05.30.20117655 https://doi.org/10.1101/2020.05.30.20117655

  • Cao W, Li Y, Yu Q (2021) Sensitivity analysis for assumptions of general mediation analysis. Communications in Statistics - Simulation and Computation. 1908556. https://doi.org/10.1080/03610918.2021

  • Congdon PD (2019, sep). Bayesian analysis options in r, and coding for BUGS, JAGS, and stan. In Bayesian hierarchical models (pp. 45-57). Chapman and Hall/CRC. Retrieved from https://doi.org/10.1201%2F9780429113352-2 https://doi.org/10.1201/9780429113352-2

  • Gao T, Albert JM (2018, oct). Bayesian causal mediation analysis with multiple ordered mediators. Statistical Modelling, 19 (6), 634-652. Retrieved from https://doi.org/10.1177%2F1471082x18798067 https://doi.org/10.1177/1471082x18798067

  • Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB (2015). Bayesian data analysis (3rd Edition ed.). New York: Chapman and Hall/CRC. doi: https://doi.org/ 10.1201/b16018

  • Imai K, Tingley D (2010) A general approach to causal mediation analysis. Psychol Methods 15(4):309–334. https://doi.org/10.1037/a0020761

    Article  Google Scholar 

  • Jensen RE, Moinpour CM, Potosky AL, Lobo T, Hahn EA, Hays RD, . . . Eton DT (2016, oct). Responsiveness of 8 patient-reported outcomes measurement information system (PROMIS) measures in a large, community-based cancer study cohort. Cancer, 123 (2), 327–335. Retrieved from https://doi.org/10.1002%2Fcncr.30354 https://doi.org/10.1002/cncr.30354

  • Jensen RE, Potosky AL, Reeve BB, Hahn E, Cella D, Fries J, . . . Moinpour CM (2015, may). Validation of the PROMIS physical function measures in a diverse US population- based cohort of cancer patients. Quality of Life Research, 24 (10), 2333-2344. Retrieved from https://doi.org/10.1007%2Fs11136-015-0992-9 https://doi.org/10.1007/s11136-015-0992-9

  • Judd, C. M., & Kenny, D. A. (1981, oct). Process analysis: Estimating mediation in treatment evaluations. Evaluation Review, 5 (5), 602-619. Retrieved from https://doi.org/10.1177% 2F0193841x8100500502 https://doi.org/10.1177/0193841x8100500502

  • Kim C, Daniels MJ, Hogan JW, Choirat C, Zigler CM(2019, sep). Bayesian methods for multiple mediators: Relating principal stratification and causal mediation in the analysis of power plant emission controls. The Annals of Applied Statistics, 13 (3). Retrieved from https://doi.org/10.1214%2F19-aoas1260 https://doi.org/10.1214/19-aoas1260

  • Mackinnon DP, Dwyer JH (1993). Estimating mediated effects in prevention stud- ies. Evaluation Review, 17 (2), 144–158. Retrieved from https://doi.org/10.1177% 2F0193841x9301700202 https://doi.org/10.1177/0193841x9301700202

  • Miočević M, Gonzalez O, Valente MJ, MacKinnon DP (2017, jul). A tutorial in bayesian po- tential outcomes mediation analysis. Structural Equation Modeling: A Multidisciplinary Jour- nal, 25 (1), 121–136. Retrieved from https://doi.org/10.1080%2F10705511.2017.1342541 https://doi.org/10.1080/10705511.2017.1342541

  • Miočević M, MacKinnon DP, Levy R (2017, apr). Power in bayesian mediation analysis for small sample research. Structural Equation Modeling: A Multidisciplinary Journal, 24 (5), 666-683. Retrieved from https://doi.org/10.1080%2F10705511.2017.1312407 https://doi.org/10.1080/10705511.2017.1312407

  • Park S, Kaplan D (2015). Bayesian causal mediation analysis for group randomized designs with homogeneous and heterogeneous effects: Simulation and case study. Multivariate Behavioral Research, 50 (3), 316–333. Retrieved from https://doi.org/10.1080%2F00273171 .2014.1003770 https://doi.org/10.1080/00273171.2014.1003770

  • Pearl J (2017). Causal diagrams and the identification of causal effects. In Causality (pp. 65–106). Cambridge University Press. Retrieved from https://doi.org/10.1017% 2Fcbo9780511803161.005 https://doi.org/10.1017/cbo9780511803161.005

  • Rijnhart JJM, Twisk JWR, Eekhout I, Heymans MW (2019, jan). Comparison of logistic-regression based methods for simple mediation analysis with a dichotomous outcome variable. BMC Medical Research Methodology, 19 (1). Retrieved from https://doi.org/ 10.1186%2Fs12874-018-0654-z https://doi.org/10.1186/s12874-018-0654-z

  • VanderWeele TJ (2011, jul). Causal mediation analysis with survival data. Epidemiology, 22 (4), 582–585. Retrieved from https://doi.org/10.1097%2Fede.0b013e31821db37e https://doi.org/10.1097/ede.0b013e31821db37e

  • Yu Q, Fan Y, Wu X (2013). General multiple mediation analysis with an application to explore racial disparities in breast cancer survival. Journal of Biometrics & Biostatistics, 05 (02). Retrieved from https://doi.org/10.4172%2F2155-6180.1000189 https://doi.org/10.4172/2155-6180.1000189

  • Yu Q, Li B (2017) mma: An r package for multiple mediation analysis. J Open Res Softw 5:11

    Article  Google Scholar 

  • Yu Q, Li B (2020) A multivariate multiple third-variable effect analysis with an application to explore racial and ethnic disparities in obesity. J Appl Stat 48(4):750–764. https://doi.org/10.1080/02664763.2020.1738359

    Article  MATH  Google Scholar 

  • Yu Q, Li B (2020b). Third-variable effect analysis with multilevel additive models. PLOS ONE, 15 (10), 1-17. Retrieved from https://doi.org/10.1371/journal.pone.0241072 https://doi.org/10.1371/journal.pone.0241072

  • Yu Q, Li B (2022). Statistical methods for mediation, confounding and moderation analysis using r and SAS. Chapman and Hall/CRC. Retrieved from https://doi.org/10.1201% 2F9780429346941 https://doi.org/10.1201/9780429346941

  • Yu Q, Medeiros K, Wu X, Jensen R (2018, apr). Nonlinear predictive models for multiple mediation analysis: With an application to explore ethnic disparities in anxiety and depression among cancer survivors. Psychometrika, 83 (4), 991-1006. Retrieved from https://doi.org/10.1007%2Fs11336-018-9612-2 https://doi.org/10.1007/s11336-018-9612-2

  • Yu Q, Wu X, Li B, Scribner R (2018, sep). Multiple mediation analysis with survival outcomes: With an application to explore racial disparity in breast cancer survival. Statistics in Medicine, 38 (3), 398–412. Retrieved from https://doi.org/10.1002%2Fsim.7977 https://doi.org/10.1002/sim.7977

  • Yuan Y, MacKinnon DP (2009, dec). Bayesian mediation analysis. Psychological Methods, 14 (4), 301-322. https://doi.org/10.1037/a0016972

Download references

Funding

This research was partially supported by the National Institute On Minority Health And Health Disparities of the National Institutes of Health under award number 2R15MD012387-02, by the National Cancer Institute award number 3R01CA26098-02S1, and by the National Institute of Environmental Health Sciences award numbers P42ES013648 and 3P42ES013648-09S2. We acknowledge that part of this research was conducted with high performance computational resources provided by the Louisiana Optical Network Infrastructure.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bin Li.

Ethics declarations

Conflict of interest

There is no financial or non-financial interests that are directly or indirectly related to this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Communicated by Zhi Geng.

Appendix

Appendix

1.1 Bayesian mediation analysis approach for general predictive models

Denote \(\varvec{\beta }\) as the parameters to build f and \(\varvec{\alpha }_k\) the parameters to build \(f_k\). We first fit the posterior predictive models.

  1. 1.

    Given the prior distribution for \(\varvec{\beta },~\pi _0(\varvec{\beta })\), fit the predictive model for Y, \(E(Y)=f(\varvec{M,X}|\varvec{\beta })\).

  2. 2.

    For \(k=1, \ldots , P\), given the prior distribution for \(\varvec{\alpha }_k,~\pi _k(\varvec{\alpha }_k)\), fit the predictive model for \(M_k\), \(E(M_k)=f_k(\varvec{X}|\varvec{\alpha }_k)\).

Algorithm 1

Method 2: The product of differences method Let \(\varvec{z}_i=\{\varvec{X}_i,\varvec{M}_i,Y_i\}\) denote the observed data, where \(i=1,\ldots , n\), \(\varvec{X}_i=(X_{i1},\ldots ,X_{iJ})\) is the vector of J predictors, \(\varvec{M}_i\) is the vector of K third-variables, and \(Y_i\) is the outcome variable.

For \(l=1, \ldots , N\), where N is the posterior sample size,

  1. 1.

    Draw lth sample, \(\varvec{\beta }_l\), from the posterior distribution \(\pi _0(\varvec{\beta }|\varvec{z})\).

  2. 2.

    For \(k=1, \ldots , P\), draw lth sample, \(\varvec{\alpha }_{kl}\), from the posterior distribution \(\pi _k(\varvec{\alpha }_{kl}|\varvec{z})\).

  3. 3.

    For the jth predictor \(X_j,~j=1, \ldots , J\),

  1. (a)

    The lth sample from the posterior distribution of the direct effect of \(X_j\rightarrow Y\) given \(X_j=x_{ij}, i=1, \ldots , n\), is

  • \(\frac{f(\varvec{x}_{i,-j}, x_{ij}+\Delta x_j,\varvec{m}_i|\varvec{\beta }_l)-f(\varvec{x}_{i,-j}, x_{ij},\varvec{m}_i|\varvec{\beta }_l)}{\Delta x_j},\) if \(X_j\) is a continuous variable; \(\varvec{x}_{i,-j}\) denotes the vector \(\varvec{x}_{i}\) with the jth item removed, and \(\Delta x_j\) is the pre-specified changing value in \(X_j\). By default, we set \(\Delta x_j=0.001\).

  • \(f(\varvec{x}_{i,-j}, x_{ij}=1, \varvec{m}_i|\varvec{\beta }_l)-f(\varvec{x}_{i,-j}, x_{ij}=0,\varvec{m}_i|\varvec{\beta }_l),\) if \(X_j\) is binary. \(x_{ij}=0\) refers to the reference group of \(X_j\).

  1. (b)

    For \(k=1, \ldots , P\), the lth sample from the posterior distribution of the indirect effect of \(X_j-M_k\rightarrow Y\) given \(X_j=x_{ij}, i=1, \ldots , n\), is

  • \(\frac{f_k(x_{ij}+\Delta x_j|\varvec{\alpha }_{kl})-f_k(x_{ij}|\varvec{\alpha }_{kl})}{\Delta x_j}\times \frac{f(\varvec{x}_i,\varvec{m}_{i, -k}, m_{ik}+\Delta _m|\varvec{\beta }_l)-f(\varvec{x}_i,\varvec{m}_{i, -k}, m_{ik}|\varvec{\beta }_l)}{\Delta m_k},\) if \(M_k\) is continuous. \(\Delta x_j\) is 1 if \(X_j\) is binary and \(\Delta m_j\) is set at a small value, e.g., \(\Delta m_j=0.001\).

  • \(\frac{f_k(x_{ij}+\Delta x_j|\varvec{\alpha }_{kl})-f_k(x_{ij}|\varvec{\alpha }_{kl})}{\Delta x_j}\times [f(\varvec{x}_i,\varvec{m}_{i, -k}, m_{ik}=1|\varvec{\beta }_l)-f(\varvec{x}_i,\varvec{m}_{i, -k}, m_{ik}=0|\varvec{\beta }_l)],\) if \(M_k\) is binary. \(\Delta m_j\) is set at a small value, e.g., \(\Delta m_j=0.001\).

  1. (c)

    The lth sample from the posterior distribution of the total effect of \(X_j - Y\) given \(X_j=x_{ij}, i=1, \ldots , n\), is the summation of the above direct effect and P indirect effects.

Algorithm 2

Method 3: The resampling method For the jth predictor \(X_j,~j=1, \ldots , J\), at \(X_j=x_{ij}, i=1,\ldots ,n\) and \(l=1, \ldots , N\), where N is the posterior sample size,

  1. 1.

    Draw lth sample, \(\varvec{m}_{l1}\), from the joint posterior distribution of \(\varvec{M}\) given \(\varvec{X}=\varvec{x}_i\).

  2. 2.

    Draw lth sample, \(\varvec{m}_{l2}\), from the joint posterior distribution of \(\varvec{M}\) given \(\varvec{X}=(\varvec{X}_{-j}=\varvec{x}_{i,-j},X_{j}=x_{i,j}+\Delta x_j)\).

  3. 3.

    The lth sample from the posterior distribution for the total effect given \(X_j=x_{ij}\) is

    $$\begin{aligned} te_{ijl}=\frac{f(\varvec{x}_{i,-j},x_{ij}+\Delta x_j,\varvec{m}_{l2})-f(\varvec{x}_{i},\varvec{m}_{l1})}{\Delta x_j}. \end{aligned}$$
  4. 4.

    Draw lth sample, \(\varvec{m}_{l0}\), from the posterior marginal distribution of \(\varvec{M}\).

  5. 5.

    The lth sample from the posterior distribution for the direct effect given \(X_j=x_{ij}\) is

    $$\begin{aligned} de_{ijl}=\frac{f(\varvec{x}_{i,-j},x_{ij}+\Delta x_j,\varvec{m}_{l0})-f(\varvec{x}_{i},\varvec{m}_{l0})}{\Delta x_j}. \end{aligned}$$
  6. 6.

    For \(k=1,\ldots ,P\), the direct effect not from \(M_k\) is

    $$\begin{aligned} de_{ijl,-k}=\frac{f(\varvec{x}_{i,-j},x_{ij}+\Delta x_j,\varvec{m}_{l2,-k},\varvec{m}_{l0,k})-f(\varvec{x}_{i},\varvec{m}_{l1,-k},\varvec{m}_{l0,k})}{\Delta x_j}. \end{aligned}$$

    Therefore, the lth sample from the posterior distribution for the indirect effect of \(X_j-M_k\rightarrow Y\) given \(X_j=x_{ij}\) is

    $$\begin{aligned} ie_{ijl,k}=te_{ijl}-de_{ijl,-k}. \end{aligned}$$

Note that when \(X_j\) is binary, we set all \(x_{ij}\) as 0 (at the reference level) and \(\Delta x_j\) as 1.

1.2 The BUGS model

figure a

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yu, Q., Cao, W., Mercante, D. et al. Bayesian mediation analysis methods to explore racial/ethnic disparities in anxiety among cancer survivors. Behaviormetrika 50, 361–383 (2023). https://doi.org/10.1007/s41237-022-00185-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41237-022-00185-9

Keywords

Navigation