As the use of electronic health records (EHR) to estimate treatment effects has become widespread, concern about bias introduced by error in EHR-derived covariates has also grown. While methods exist to address measurement error in individual covariates, little prior research has investigated the implications of using propensity scores for confounder control when the propensity scores are constructed from a combination of accurate and error-prone covariates. We reviewed approaches to account for error in propensity scores and used simulation studies to compare their performance. These comparisons were conducted across a range of scenarios featuring variation in outcome type, validation sample size, main sample size, strength of confounding, and structure of the error in the mismeasured covariate. We then applied these approaches to a real-world EHR-based comparative effectiveness study of alternative treatments for metastatic bladder cancer. This head-to-head comparison of measurement error correction methods in the context of a propensity score-adjusted analysis demonstrated that multiple imputation for propensity scores performs best when the outcome is continuous and regression calibration-based methods perform best when the outcome is binary.
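To make the setting concrete, the following is a minimal simulation sketch, not the authors' implementation. It assumes one error-prone covariate `w` (a noisy version of `x`), one accurate covariate `z`, a continuous outcome, and an internal validation subsample in which `x` is observed. All data-generating values, function names, and the choice to adjust for the logit of the estimated propensity score are illustrative assumptions. The imputation step is a simplified (improper) form of multiple imputation: it draws `x` from a linear model fit in the validation sample that includes treatment and outcome, without redrawing the model parameters.

```python
import numpy as np

def fit_logistic(design, t, n_iter=25):
    """Logistic regression by Newton-Raphson; `design` includes an intercept column."""
    beta = np.zeros(design.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-design @ beta))
        grad = design.T @ (t - p)                      # score vector
        hess = (design * (p * (1 - p))[:, None]).T @ design  # information matrix
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def ps_adjusted_effect(y, t, covs):
    """Fit a propensity model on `covs`, then regress y on treatment and logit(PS)."""
    design = np.column_stack([np.ones_like(y), covs])
    logit_ps = design @ fit_logistic(design, t)
    outcome_design = np.column_stack([np.ones_like(y), t, logit_ps])
    coef = np.linalg.lstsq(outcome_design, y, rcond=None)[0]
    return coef[1]                                     # coefficient on treatment

# Hypothetical data-generating values chosen only for illustration.
rng = np.random.default_rng(2021)
N, N_VAL, TAU, M = 20000, 2000, 1.0, 20

x = rng.normal(size=N)                                 # true covariate
z = rng.normal(size=N)                                 # accurately measured covariate
w = x + rng.normal(size=N)                             # error-prone measurement of x
t = rng.binomial(1, 1.0 / (1.0 + np.exp(-(x + 0.5 * z)))).astype(float)
y = TAU * t + 2.0 * x + z + rng.normal(size=N)         # continuous outcome

val = np.zeros(N, dtype=bool)
val[:N_VAL] = True                                     # validation subsample: x observed

# Benchmark (uses the true x everywhere) and naive analysis (uses w in place of x).
oracle_est = ps_adjusted_effect(y, t, np.column_stack([x, z]))
naive_est = ps_adjusted_effect(y, t, np.column_stack([w, z]))

# Imputation model for x given w, z, treatment, and outcome, fit in the validation sample.
imp_design = np.column_stack([np.ones(N), w, z, t, y])
gamma = np.linalg.lstsq(imp_design[val], x[val], rcond=None)[0]
resid = x[val] - imp_design[val] @ gamma
sigma = resid.std(ddof=imp_design.shape[1])

# Impute x outside the validation sample M times and average the effect estimates.
estimates = []
for _ in range(M):
    x_imp = x.copy()
    n_missing = int((~val).sum())
    x_imp[~val] = imp_design[~val] @ gamma + rng.normal(scale=sigma, size=n_missing)
    estimates.append(ps_adjusted_effect(y, t, np.column_stack([x_imp, z])))
mi_est = float(np.mean(estimates))

print(f"true effect: {TAU:.2f}")
print(f"benchmark:   {oracle_est:.3f}")
print(f"naive:       {naive_est:.3f}")
print(f"imputation:  {mi_est:.3f}")
```

Under these assumptions the naive analysis leaves residual confounding because `w` only partly captures `x`, while the imputation-based estimate should land much closer to the true effect. Variance estimation (for example, Rubin's rules with parameter draws) is omitted from the sketch.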
The authors would like to thank Flatiron Health for providing us with the data for patients with metastatic bladder cancer.
Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Numbers R21CA227613 and K23CA187185. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Conflict of interest
Dr. Mamtani reports having served as a consultant for Seattle Genetics/Astellas. The authors declared no other potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Nandita Mitra and Rebecca Hubbard: Co-senior authors.
Cite this article
Harton, J., Mamtani, R., Mitra, N. et al. Bias reduction methods for propensity scores estimated from error-prone EHR-derived covariates. Health Serv Outcomes Res Method 21, 169–187 (2021). https://doi.org/10.1007/s10742-020-00219-3
- Electronic health record (EHR) data
- Regression calibration
- Propensity score