Abstract
The topic of this article is one-sided hypothesis testing on the means of two populations when there is uncertainty as to which population a datum is drawn. Along with each datum, a probability is given as to which of the populations the datum emanated. Such situations arise, for example, in the use of Bayesian imputation methods to assess racial and ethnic disparities with certain survey, health, and financial data. By use of a Bayesian framework and Markov Chain Monte Carlo sampling from the joint posterior distribution of the population means, the probability of a disparity hypothesis is estimated. This approach extends sample size limitations of previous methods given in the literature from a few dozen to well into the thousands. Four methods are developed and compared. Three methods are implemented in R codes and one method in WinBUGS. All the codes are provided in the appendices.
Similar content being viewed by others
References
Albert, J.: Bayesian Computation with R, 2nd edn. Springer, New York (2009)
Albert, J., Hu, J.: Probability and Bayesian Modeling. CRC Press, Boca Raton (2020)
Bolstad, W.M.: Understanding Computational Bayesian Statistics. Wiley, Hoboken (2010)
Carlin, B.P., Louis, T.A.: Bayesian Methods for Data Anaysis, 3rd edn. CRC Press, Boca Raton (2009)
Elkadry, A., McDonald, G.C.: Hypothesis testing when data sources are uncertain. J. Stat. Theory Pract. 14(4), 1–17 (2020). https://doi.org/10.1007/s42519-020-00132-5
Gill, J.: Bayesian Methods: A Social and Behavioral Sciences Approach, 3rd edn. CRC Press, Boca Raton (2015)
Hahn, E.D.: Bayesian Methods for Management and Business: Pragmatic Solutions for Real Problems. Wiley, Hoboken (2014)
Lunn, D., Jackson, C., Best, N., Thomas, A., Spiegelhalter, D.: The BUGS Book; A Practical Introduction to Bayesian Analysis. CRC Press, Boca Raton (2013)
McDonald, G.C.: Statistical testing when the populations from which samples are drawn are uncertain. Health Serv. Outcomes Res. Methodol. 18(3), 155–174 (2018). https://doi.org/10.1007/s10742-018-0182-7
Sawitzki, G.: Computational Statistics: An Introduction to R. CRC Press, Boca Raton (2009)
Woodworth, G.G.: Biostatistics: A Bayesian Introduction. Wiley, Hoboken (2004)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Data for example in Sect. 2, N = 40
This data was created using the following R-Code:
Appendix 2: R code for Laplace calculation in Sect. 2
Appendix 3: R code for MCMC calculations in Sect. 3
Appendix 4: WinBUGS code for MCMC calculations in Sect. 4
Appendix 5: WinBUGS code for MCMC calculations in Sect. 4, common unknown σ
Appendix 6: Some diagnostics from WinBUGS for the model given in “Appendix 5”: Dynamic traces (left), Autocorrelation function (center), and Running Quantiles (right)
Rights and permissions
About this article
Cite this article
McDonald, G.C., Oakley, R.H. Extending computations for disparity testing when data sources are uncertain. Health Serv Outcomes Res Method 23, 207–226 (2023). https://doi.org/10.1007/s10742-022-00286-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10742-022-00286-8