Hypothesis Testing for an Exposure–Disease Association in Case–Control Studies Under Nondifferential Exposure Misclassification in the Presence of Validation Data: Bayesian and Frequentist Adjustments

Karim, Mohammad Ehsanul; Gustafson, Paul

doi:10.1007/s12561-015-9141-9

Published: 08 January 2016

Volume 8, pages 234–252, (2016)

Statistics in Biosciences

Abstract

In epidemiologic studies, measurement error in the exposure variable can reduce the power of hypothesis tests for detecting the effect of exposure on the development of a disease. To adjust for misclassification in hypothesis testing involving a misclassified binary exposure variable, we consider a retrospective case–control scenario under the assumption of nondifferential misclassification. We develop a test under the Bayesian approach, based on a posterior distribution generated by an MCMC algorithm with a normal prior under realistic assumptions. We compare this test with an equivalent likelihood ratio test developed under the frequentist approach, using various simulated settings, both with and without validation data. In our simulations, we vary the sensitivity, specificity, sample size, exposure prevalence, and proportions of unvalidated and validated data. In these scenarios, our simulation study shows that the adjusted model (the model with validation data) is always better than the unadjusted model (the model without validation data). However, we show that an exception is possible in the fixed-budget scenario, where collecting validation data is much more costly. We also show that the Bayesian and frequentist hypothesis testing procedures reach the same conclusions for the scenarios under consideration. The Bayesian approach is, however, computationally more stable in rare-exposure contexts. A real case–control study is used to illustrate the application of the hypothesis testing procedures under consideration.
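To make concrete why nondifferential misclassification costs power, the sketch below applies the standard relation between true and apparent exposure prevalence, p* = Se·p + (1 − Sp)·(1 − p), to cases and controls with the same sensitivity and specificity. It is an illustration only and is not the authors' Bayesian or likelihood ratio procedure; the function names and all numeric values (prevalences, sensitivity, specificity) are hypothetical choices made for this example.

```python
def apparent_prevalence(p, sens, spec):
    """Probability of being classified as exposed when the true
    exposure prevalence is p (standard misclassification relation)."""
    return sens * p + (1.0 - spec) * (1.0 - p)


def odds_ratio(p_case, p_control):
    """Exposure odds ratio comparing cases with controls."""
    return (p_case / (1.0 - p_case)) / (p_control / (1.0 - p_control))


# Hypothetical true exposure prevalences (assumed for illustration).
p_case, p_control = 0.40, 0.20

# Hypothetical classifier quality; nondifferential means the same
# sensitivity and specificity apply to cases and to controls.
sens, spec = 0.80, 0.90

true_or = odds_ratio(p_case, p_control)
observed_or = odds_ratio(
    apparent_prevalence(p_case, sens, spec),
    apparent_prevalence(p_control, sens, spec),
)

print(f"true OR     = {true_or:.3f}")
print(f"observed OR = {observed_or:.3f}")
```

With sensitivity plus specificity above 1, the observed odds ratio lies between 1 and the true odds ratio, so a test based on the misclassified exposure sees a weaker signal than one based on the true exposure.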

Author information

Authors and Affiliations

Epidemiology, Biostatistics and Occupational Health, McGill University, Purvis Hall, 1020 Pine Avenue West, Montreal, QC, H3A 1A2, Canada
Mohammad Ehsanul Karim
Department of Statistics, University of British Columbia, Vancouver, Canada
Paul Gustafson

Corresponding author

Correspondence to Mohammad Ehsanul Karim.

About this article

Cite this article

Karim, M.E., Gustafson, P. Hypothesis Testing for an Exposure–Disease Association in Case–Control Studies Under Nondifferential Exposure Misclassification in the Presence of Validation Data: Bayesian and Frequentist Adjustments. Stat Biosci 8, 234–252 (2016). https://doi.org/10.1007/s12561-015-9141-9

Received: 21 May 2015
Revised: 19 November 2015
Accepted: 27 December 2015
Published: 08 January 2016
Issue Date: October 2016
DOI: https://doi.org/10.1007/s12561-015-9141-9
