Misspecification issues in risk adjustment and construction of outcome-based quality indicators

Li, Yue; Dick, Andrew W.; Glance, Laurent G.; Cai, Xueya; Mukamel, Dana B.

doi:10.1007/s10742-006-0014-z

Misspecification issues in risk adjustment and construction of outcome-based quality indicators

Published: 31 January 2007

Volume 7, pages 39–56, (2007)
Cite this article

Health Services and Outcomes Research Methodology Aims and scope Submit manuscript

Yue Li¹,
Andrew W. Dick²,
Laurent G. Glance³,
Xueya Cai⁴ &
…
Dana B. Mukamel⁵

184 Accesses
9 Citations
Explore all metrics

Abstract

Hospital “report cards” reporting risk-adjusted health outcomes are increasingly used to benchmark quality of care. However, risk adjustment methods that do not fully account for the interrelationship between quality, risks and outcomes may lead to biased quality measures. This study aims to determine whether the current approach based on logistic regression and observed-to-expected outcome comparisons (O−E difference or O/E ratio) provides unbiased measures of quality. We first provided a conceptual framework to demonstrate that O−E difference or O/E ratio is inconsistently specified when estimates are based on logistic risk adjustment models. To examine the misspecification issue empirically, risk adjustment was performed based on coronary artery bypass graft (CABG) surgery data from New York’s Cardiac Surgery Reporting System, and quality indicators (QI) of different specifications were calculated for hospital profiling. Computer simulations further explored the issue of misspecified QIs. Results showed that risk-adjusted mortality rates (RAMR) calculated from different QIs identified the same hospital outliers based on 95% confidence intervals, but generated different rank orders for hospitals in both high-quality and low-quality tails of the quality distributions. Simulation results further showed that, compared to O−E and O/E, logistically transformed QIs were superior regarding their abilities to identify hospitals of true extreme rankings, especially when the outcome was less prevalent or the number of patients per hospital was small. Based on our findings, we recommend that analysts consider the use of logistically transformed QI prior to publicly releasing quality rankings using measures based on O−E or O/E.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A 24-step guide on how to design, conduct, and successfully publish a systematic review and meta-analysis in medical research

Article 13 November 2019

Methodological quality (risk of bias) assessment tools for primary and secondary medical studies: what are they and which is better?

Article Open access 29 February 2020

Observational Studies: Uses and Limitations

Notes

We can only observe patient’s actual outcome status O _ij. A hospital’s observed average outcome rate ${\bar {O}_j =\frac{1}{n_j}\sum_{i=1}^{n_j}{(O_{ij})} =\frac{1}{n_j}\sum_{i=1}^{n_j}{(p_{ij})}}$ when n _j is large.
Here we assume that the chance effect is additive. If it is multiplicative, e.g., ${{\bf logit}(p_{ij})={\bf logit}(E_{ij} )\times Q_j \times \varepsilon _{ij}}$, the model is additive after a log-transformation. Further transformation upon the logit-function, however, would make model estimation intractable and is not discussed here.
The NYS model for quality report did not use robust variance estimates. Our analyses indicated that applying hierarchical regression model to the CSRS data did not substantially change the result of hospital profiling.
Normality tests based on simulated data of logit(p _ij)showed that it is approximately normally distributed. We alternatively assumed normal distribution for the error term ${\varepsilon _{ij}}$. The distribution of ${{\bf logit}(p_{ij})}$ and results of the misspecified quality indicator remained unchanged.
The κ measures the level of agreement between two raters evaluating an event on a categorical scale (Landis and Koch 1977). In this study, we defined the event scale as 1 = high-quality outlier, 0 = non-quality outlier, and −1 = low-quality outlier.
In the Bootstrap simulation to calculate the 95% CI of each QI, we changed the dataset for the two hospitals with no death. We changed the death/survival status to death for the patient with highest predicted probability of death in each hospital to ensure that the bootstrapped CI can be defined. Because of this change, the number of identified quality outliers is different from that in the CABG report card based on original data.

References

Ash, A.S., Shwartz, M., Pekoz, E.A.: Comparing outcomes across providers. In: Iezzoni, L.I. (ed.) Risk Adjustment for Measuring Health Care Outcomes. Health Administration Press, Chicago Illinois (2003)
Google Scholar
Chassin, M.R., Hannan, E.L., DeBuono, B.A.: Benefits and hazards of reporting medical outcomes publicly. N. Engl. J. Med. 334, 394–398 (1996)
Article PubMed CAS Google Scholar
Christiansen, C.L., Morris, C.N.: Improving the statistical approach to health care provider profiling. Ann. Intern. Med. 127, 764–768 (1997)
PubMed CAS Google Scholar
Conrad, D.A., Christianson, J.B.: Penetrating the black box financial incentives for enhancing the quality of physician services. Med. Care Res. Rev. 61, 37S–68S (2004)
Article PubMed Google Scholar
Efron, F., Tibshirani, R.J.: An Introduction to the Bootstrap. New York, Chapman & Hall (1993)
Google Scholar
Gatsonis, C., Normand, S.L., Liu, C., Morris, C.: Geographic variation of procedure utilization. A hierarchical model approach. Med. Care 31, YS54–YS59 (1993)
Article PubMed CAS Google Scholar
Glance, L.G., Osler, T., Shinozaki, T.: Effect of varying the case mix on the standardized mortality ratio and W statistic: a simulation study. Chest 117, 1112–1117 (2000)
Article PubMed CAS Google Scholar
Glance L.G., Dick A., Osler T.M., Li Y., Mukamel D.B.: Impact of changing the statistical methodology on hospital and surgeon ranking: the case of the New York State Cardiac Surgery Report Card. Med. Care 44, 311–319 (2006)
Article PubMed Google Scholar
Gould, W., Sribney, W.: Maximum Likelihood Estimation with STATA. College Station, Texas (1999)
Google Scholar
Greene, W.H.: Econometric Analysis. Upper Saddle River, Prentice Hall (2001)
Google Scholar
Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143, 29–36 (1982)
PubMed CAS Google Scholar
Hannan, E.L., Wu, C., DeLong, E.R., Raudenbush, S.W.: Predicting risk-adjusted mortality for CABG surgery: logistic versus hierarchical logistic models. Med. Care 43, 726–735 (2005)
Article PubMed Google Scholar
Iezzoni, L.I.: The risks of risk adjustment. JAMA 278, 1600–1607 (1997)
Article PubMed CAS Google Scholar
Iezzoni, L.I. (ed).: Risk Adjustment for Measuring Health Care Outcomes. Health Administration Press Illinois, Chicago (2003)
Google Scholar
Iezzoni, L.I., Ash, A.S., Shwartz, M., Daley, J., Hughes, J.S., Mackiernan, Y.D.: Judging hospitals by severity-adjusted mortality rates: the influence of the severity-adjustment method. Am. J. Public Health 86, 1379–1387 (1996a)
Article CAS Google Scholar
Iezzoni, L.I., Shwartz, M., Ash, A.S, Hughes, J.S, Daley, J., Mackiernan, Y.D.: Severity measurement methods and judging hospital death rates for pneumonia. Med. Care 34, 11–28 (1996b)
Article CAS Google Scholar
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977)
Article PubMed CAS Google Scholar
Mukamel, D.B., Mushlin, A.I.: The impact of quality report cards on choice of physicians, hospitals and HMOs: – a midcourse evaluation. Jt. Comm. J. Qual. Improve. 27, 20 (2001)
CAS Google Scholar
Mukamel, D.B., Dick, A., Spector, W.D.: Specification issues in measurement of quality of medical care using risk adjusted outcomes. J. Econ. Soc. Meas. 26, 267–281 (2000)
Google Scholar
Mukamel, D.B., Weimer, D.L., Zwanziger, J., Mushlin, A.I.: Quality of cardiac surgeons and managed care contracting practices. Health Services Res. 37, 1129 (2002)
Article Google Scholar
Mukamel, D.B., Watson, N.M., Meng, H., Spector, W.D.: Development of a risk-adjusted urinary incontinence outcome measure of quality for nursing homes. Med. Care 41, 467–478 (2003)
Article PubMed Google Scholar
Mukamel, D.B., Weimer, D.L., Zwanziger, J., Huang-Gorthy, S., Mushlin, A.I.: Quality report cards, selection of cardiac surgeons and racial disparities: a study of the publication of the NYS Cardiac Surgery Reports. Inquiry 41, 435–446 (2004/2005)
Google Scholar
New York State Department of Health: Coronary artery bypass surgery in New York State, 2000–2002. Albany, NY (2004)
Pennsylvania Health Care Cost Containment Council: A Consumer’s Guide to Coronary Artery Bypass Surgery, Vol III. Harrisburg, PA, PH4C (1994)
Google Scholar
Romano, P.S., Zach, A., Luft, H.S., Rainwater, J., Remy, L.L., Campa, D.: The California Hospital Outcomes Project: using administrative data to compare hospital performance. Jt. Comm. J. Qual. Improv. 21, 668–682 (1995)
PubMed CAS Google Scholar
Rosenthal, M.B., Frank, R.G., Li, Z., Epstein, A.M.: Early experience with pay-for-performance: from concept to practice. JAMA 294, 1788–1793 (2005)
Article PubMed CAS Google Scholar
Sacco, W.J., Copes, W.S., Staz, C.F., Smith, J.S., Jr., Buckman, R.F., Jr.: Status of trauma patient management as measured by survival/death outcomes: looking toward the 21st century. J. Trauma 36, 297–298 (1994)
Article PubMed CAS Google Scholar
Schuster, D.P.: Predicting outcome after ICU admission. The art and science of assessing risk. Chest bf 102, 1861–1870 (1992)
PubMed CAS Google Scholar
Shahian, D.M., Normand, S.L., Torchiana, D.F., Lewis, S.M., Pastore, J.O., Kuntz, R.E., Dreyer, P.I.: Cardiac surgery report cards: comprehensive review and statistical critique. Ann. Thorac. Surg. 72, 2155–2168 (2001)
Article PubMed CAS Google Scholar
White H.: A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica 48, 817–830 (1980)
Article Google Scholar
Zaslavsky, A.M.: Statistical issues in reporting quality data: small samples and casemix variation. Int. J. Qual. Health Care 13, 481–488 (2001)
Article PubMed CAS Google Scholar

Download references

Acknowledgements

This project was supported by a grant from the Agency for Healthcare Research and Quality (RO1 HS 13617, Dr. Laurent Glance)

The views presented in this manuscript are those of the authors and may not reflect those of the Agency for Healthcare Research and Quality, of the New York State Department of Health or of the Cardiac Advisory Committee.

Author information

Authors and Affiliations

Department of Medicine, School of Medicine and Biomedical Sciences, ECMC-Clinical Center, CC-163, State University of New York at Buffalo, 462 Grider Street, Buffalo, NY, 14215, USA
Yue Li
The RAND Corporation, Adjunct Professor of Community and Preventive Medicine, The University of Rochester School of Medicine and Dentistry, Rochester, USA
Andrew W. Dick
Department of Anesthesiology, The University of Rochester School of Medicine and Dentistry, Rochester, USA
Laurent G. Glance
Department of Biostatistics, School of Public Health and Health Professions, State University of New York at Buffalo, Buffalo, NY, USA
Xueya Cai
Center for Health Policy Research, University of California, Irvine, USA
Dana B. Mukamel

Authors

Yue Li
View author publications
You can also search for this author in PubMed Google Scholar
Andrew W. Dick
View author publications
You can also search for this author in PubMed Google Scholar
Laurent G. Glance
View author publications
You can also search for this author in PubMed Google Scholar
Xueya Cai
View author publications
You can also search for this author in PubMed Google Scholar
Dana B. Mukamel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yue Li.

Appendix

1.1 Appendix 1: Proof of consistent quality indicator specifications

The O−E difference is consistent with the additive linear probability function below:

$$ \begin{aligned} O_{ij}&=x_{ij} \beta^{\prime}+Q_j +\varepsilon _{ij},\\ &=E_{ij}+Q_j+\varepsilon_{ij}\\ \end{aligned} $$

(4)

in which x _ij represents the set of observed risk factors of the patient, β stands for the effect of risks on patient outcome, Q _j is hospital J ’s quality of care that interacts in an additive fashion with patient’s observed risks and can be estimated as either fixed- or random-effects; ${\varepsilon _{ij}}$ represents the random (chance) effect of unknown factors on patient outcome, and ${E_{ij} =x_{ij} \beta'}$ is patient’s predicted outcome rate. We assume that the expectation (average) of the chance effect over all patients in hospital j equals zero.

That is,

$${\bf EXPECT}(\varepsilon _{ij})=0$$

(5)

If we rearrange Eq. 4 and take expectation over hospital j on both sides, we obtain

$$ \begin{aligned} {\bf EXPECT}(\bar{O}_j-\bar{E}_j)&={\bf EXPECT}\,\left[{\frac{1}{n_j }\sum\limits_{i=1}^{n_j}{(O_{ij})}-\frac{1}{n_j}\sum\limits_{i=1}^{n_j }{(E_{ij})}}\right]\\ &={\bf EXPECT}\left[ {\frac{1}{n_j}\sum\limits_{i=1}^{n_j}{(O_{ij}-E_{ij})}}\right] \\ &={\bf EXPECT}\left[ {\frac{1}{n_j}\sum\limits_{i=1}^{n_j}{(Q_{j}+\varepsilon_{ij})}} \right] \\ &={\bf EXPECT}\left[ {\frac{1}{n_j}\sum\limits_{i=1}^{n_j} {(Q_{j})}} \right]\\ &={\bf EXPECT}(Q_j)\\ &=Q_j^d \\ \end{aligned} $$

because of the assumption in Eq. 5, where Q ^d_j is hospital j’s risk-adjusted QI assuming the difference specification.

Similarly, the O/E ratio is consistent with the linear probability function below:

$$ \begin{aligned} O_{ij}&=x_{ij} \beta'\times Q_j +\varepsilon _{ij},\\ &=E_{ij}\times Q_j+\varepsilon _{ij}\\ \end{aligned} $$

(6)

in which hospital quality Q _j interacts multiplicatively with a patient’s observed risks. If we assume that Eq. 5 also holds here and take expectation over hospital j in Eq. 6, we obtain

$$ \begin{aligned} {\bf EXPECT}(\bar{O}_j /\bar {E}_j)&={\bf EXPECT}\left[ {{\sum\limits_{i=1}^{n_j}{(O_{ij})}}\mathord{\left/ {\vphantom {{\sum\limits_{i=1}^{n_j}{(O_{ij})}}{\sum\limits_{i=1}^{n_j}{(E_{ij} )}}}}\right.\kern-\nulldelimiterspace} {\sum\limits_{i=1}^{n_j}{(E_{ij} )}}}\right]\\ &={\bf EXPECT}\left[ {{\sum\limits_{i=1}^{n_j}{(E_{ij}\times Q_j +\varepsilon _{ij})}} \mathord{\left/{\vphantom {{\sum\limits_{i=1}^{n_j}{(E_{ij}\times Q_j +\varepsilon_{ij})}}{\sum\limits_{i=1}^{n_j}{(E_{ij} )}}}}\right. \kern-\nulldelimiterspace}{\sum\limits_{i=1}^{n_j}{(E_{ij})}}}\right] \\ &={\bf EXPECT}\left[ {{\sum\limits_{i=1}^{n_j}{(E_{ij}\times Q_j )}}\mathord{\left/ {\vphantom {{\sum\limits_{i=1}^{n_j}{(E_{ij}\times Q_j )}} {\sum\limits_{i=1}^{n_j}{(E_{ij})}}}}\right.\kern-\nulldelimiterspace} {\sum\limits_{i=1}^{n_j}{(E_{ij})}}}\right]\\ &={\bf EXPECT}\left[ {{Q_j \times \sum\limits_{i=1}^{n_j}{(E_{ij})}}\mathord{\left/ {\vphantom{{Q_j \times \sum\limits_{i=1}^{n_j}{(E_{ij})}} {\sum\limits_{i=1}^{n_j}{(E_{ij})}}}}\right.\kern-\nulldelimiterspace} {\sum\limits_{i=1}^{n_j}{(E_{ij})}}}\right]\\ &={\bf EXPECT}(Q_j)\\ &=Q_j^r \\ \end{aligned} $$

because of the assumption in Eq. 5, where Q ^r_j is hospital j’s risk-adjusted QI measured by the ratio specification.

1.2 Appendix 2

Before simulating the additive logistic specification (Eq. 2), we estimated additive fixed-effects logistic regression on the empirical CABG data:

$$ {\bf logit}(p_{ij})=x_{ij}\beta'\,+(q_2+q_3+\cdots+q_{36} )+\varepsilon_{ij} $$

in which x _ij is the set of risk factors (Table 3), β is the coefficient to be estimated. The fixed-effects q _j were estimated as the coefficient of dummy variables for hospitals (hospital 1 was omitted, q ₁ = 0). Robust variance estimates (White 1980) were obtained to account for the clustering of patients in hospitals.

After model estimation, we calculated the expected mortality as

$$ {\bf logit}(E_{ij})=(x_{ij}\hat{\beta }')+\sum\limits_{j=1}^{36}{\left({\frac{n_j}{N}\times q_j} \right)}, $$

in which n _j is the number of patients in hospital j, N = 16,120 is the total sample size. Therefore, a patient’s expected outcome was calculated as a linear function of patient risk (${x_{ij} \hat{\beta}^{\prime})}$ plus the weighted-average quality effect assuming that the patient were treated in an average quality hospital in New York. Finally, consistent QI (Q ^ld_j ) was constructed. Estimates from this model were then used for simulating the additive Eq. 2.

Before simulating the multiplicative Eq. 3, we estimated multiplicative fixed-effects logistic regression on the empirical data:

$$ {\bf logit}(p_{ij})=x_{ij}\beta'\times(1+q_2 +q_3+\cdots +q_{36})+\varepsilon _{ij}. $$

For the model to be identified, we normalized the effect of hospital 1 to unit. Maximum likelihood estimation in STATA (Gould and Sribney 1999) was used to obtain robust variance estimates.

The expected mortality was calculated as

$$ {\bf logit}(E_{ij})=(x_{ij} \hat{\beta}')\times\sum\limits_{j=1}^{36} {\,\left({\frac{n_j}{N}\times q_j}\right)} $$

in which the weighted-average quality effect takes a multiplicative form on patient risks. Finally, consistent QI (Q ^lr_j ) was constructed. Estimates based on this model were used for subsequent simulations.

The STATA code for defining the likelihood function of the multiplicative model is listed below:

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Y., Dick, A.W., Glance, L.G. et al. Misspecification issues in risk adjustment and construction of outcome-based quality indicators. Health Serv Outcomes Res Method 7, 39–56 (2007). https://doi.org/10.1007/s10742-006-0014-z

Download citation

Received: 05 April 2006
Revised: 29 September 2006
Accepted: 26 October 2006
Published: 31 January 2007
Issue Date: June 2007
DOI: https://doi.org/10.1007/s10742-006-0014-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Misspecification issues in risk adjustment and construction of outcome-based quality indicators

Abstract

Access this article

Similar content being viewed by others

A 24-step guide on how to design, conduct, and successfully publish a systematic review and meta-analysis in medical research

Methodological quality (risk of bias) assessment tools for primary and secondary medical studies: what are they and which is better?

Observational Studies: Uses and Limitations

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendix

1.1 Appendix 1: Proof of consistent quality indicator specifications

1.2 Appendix 2

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Misspecification issues in risk adjustment and construction of outcome-based quality indicators

Abstract

Access this article

Similar content being viewed by others

A 24-step guide on how to design, conduct, and successfully publish a systematic review and meta-analysis in medical research

Methodological quality (risk of bias) assessment tools for primary and secondary medical studies: what are they and which is better?

Observational Studies: Uses and Limitations

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

1.1 Appendix 1: Proof of consistent quality indicator specifications

1.2 Appendix 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation