# A model to estimate the lifetime health outcomes of patients with Type 2 diabetes: the United Kingdom Prospective Diabetes Study (UKPDS) Outcomes Model (UKPDS no. 68)

## Authors

- First Online:

- Received:
- Accepted:

DOI: 10.1007/s00125-004-1527-z

- Cite this article as:
- Clarke, P.M., Gray, A.M., Briggs, A. et al. Diabetologia (2004) 47: 1747. doi:10.1007/s00125-004-1527-z

- 190 Citations
- 4.8k Views

## Abstract

### Aims/hypothesis

The aim of this study was to develop a simulation model for Type 2 diabetes that can be used to estimate the likely occurrence of major diabetes-related complications over a lifetime, in order to calculate health economic outcomes such as quality-adjusted life expectancy.

### Methods

Equations for forecasting the occurrence of seven diabetes-related complications and death were estimated using data on 3642 patients from the United Kingdom Prospective Diabetes Study (UKPDS). After examining the internal validity, the UKPDS Outcomes Model was used to simulate the mean difference in expected quality-adjusted life years between the UKPDS regimens of intensive and conventional blood glucose control.

### Results

The model’s forecasts fell within the 95% confidence interval for the occurrence of observed events during the UKPDS follow-up period. When the model was used to simulate event history over patients’ lifetimes, those treated with a regimen of conventional glucose control could expect 16.35 undiscounted quality-adjusted life years, and those receiving treatment with intensive glucose control could expect 16.62 quality-adjusted life years, a difference of 0.27 (95% CI: −0.48 to 1.03).

### Conclusions/interpretations

The UKPDS Outcomes Model is able to simulate event histories that closely match observed outcomes in the UKPDS and that can be extrapolated over patients’ lifetimes. Its validity in estimating outcomes in other groups of patients, however, remains to be evaluated. The model allows simulation of a range of long-term outcomes, which should assist in informing future economic evaluations of interventions in Type 2 diabetes.

### Keywords

Blood glucoseDiabetesDiabetes-related complicationsEconomic evaluationLife expectancyQuality-adjusted life yearsRisk equation simulation modelUKPDS### Abbreviations

- CHF
congestive heart failure

- ETDRS
Early Treatment Diabetic Retinopathy Study

- ICD-9
International Classification of Diseases, Ninth Revision

- IHD
ischaemic heart disease

- logMAR
logarithm of the minimum angle of resolution

- MI
myocardial infarction

- QALY
quality-adjusted life years

- UKPDS
United Kingdom Prospective Diabetes Study

## Introduction

Computer simulation models are being used increasingly both to model the progression of Type 2 diabetes and to estimate lifetime outcomes associated with different disease management strategies. These models estimate the future occurrence of diabetes-related complications and quantify outcomes in terms of mean life expectancy or mean quality-adjusted life expectancy. They can also be used to estimate future healthcare costs of patients with Type 2 diabetes, but their main purpose is to estimate the cost-effectiveness of different disease management strategies, especially when evidence of the impact of interventions on surrogate endpoints is limited, or where evidence from clinical trials has to be extrapolated over patients’ lifetimes. Currently, there are at least five simulation models being used in these ways [1, 2, 3, 4, 5].

Existing simulation models have synthesised data from a variety of sources into a series of modules that are used to estimate occurrence of different complications. For example, the first model of the progression of Type 2 diabetes had separate modules for cardiovascular disease, retinopathy, nephropathy and neuropathy [1], and used a probabilistic Monte-Carlo analysis to simulate event histories over the remaining lifetimes of newly diagnosed patients with Type 2 diabetes. While that model represented a landmark in the use of computer simulation to model the progression of the disease, it had several limitations. In particular, it placed considerable reliance on data from a Type 1 diabetes trial [6] and on cardiovascular risk estimates derived from the Framingham cohort study, despite there being only 337 people with Type 2 diabetes in the Framingham study and consequent doubts concerning its predictive accuracy for such patients [7]. The Framingham cardiovascular risk equations for the general population were modified by a dichotomous variable indicating the presence or absence of diabetes. It was also unable to simulate the timing or occurrence of multiple complications and hence could not fully capture the clustering of complications that tend to arise [2]. While subsequent simulation models have been able to address some of these limitations [2], there is a need for a comprehensive integrated system of equations from a large and well-validated diabetes-specific data source to forecast major diabetes-related complications.

Observational analyses using data from the UKPDS have examined the relationship between exposure to glycaemia [8] and levels of systolic BP [9] over time, and the development of macrovascular and microvascular complications. The results of these studies, expressed in terms of the change in relative risk of major complications associated with decrements in HbA_{1}c or systolic BP, were consistent with those obtained with regimens of intensive control of blood glucose and BP in the clinical trial. The UKPDS Risk Engine, a risk calculator for estimating absolute risk of coronary heart disease [10] and stroke [11], has been developed based on factors such as age, sex, race, smoking status, and levels of HbA_{1}c, systolic BP and lipid ratios in the first 2 years after diagnosis of Type 2 diabetes.

The model presented here, called the UKPDS Outcomes Model, develops further the modelling of lifetime outcomes, through a system of equations, to predict the occurrence and timing of seven diabetes-related complications and death, and thereby to calculate life expectancy and quality-adjusted life expectancy for patients with Type 2 diabetes. A key aspect of this model is that it is designed to capture the association between different types of complications at an individual patient level. Complications may be associated not only because they share common known risk factors, but also due to the event-related dependence that arises when one complication substantially increases the likelihood of another [2]. For example, the probability of a patient experiencing congestive heart failure (CHF) or myocardial infarction (MI) is positively associated with systolic BP, but the risk of an MI is higher for patients with a history of CHF due to event-related dependence. To account for these dependencies the model makes use of time-varying risk factors, which also facilitates its application to patient groups at different stages of the disease.

## Subjects and methods

### Study subjects

The UKPDS was a clinical trial evaluating regimens with more intensive control of blood glucose and BP than had been described previously [12]. Briefly, between 1977 and 1991, 5102 patients with newly diagnosed Type 2 diabetes were recruited to the study. The subjects were aged between 25 and 65 years, were subsequently shown to have a fasting plasma glucose of above 6 mmol/l on two occasions, had no recent history of MI, ischaemic heart disease (IHD) or CHF, and had never had more than one major vascular event or a severe concurrent illness that would limit life expectancy. Biochemical measurements were taken, including HbA_{1}c, systolic BP, and lipid and lipoprotein fractions at entry to the study, at randomisation after 3 months of dietary therapy, and in each subsequent year. The systolic BP result was the average of three measurements taken at the same visit. Lipids were included in the model as the ratio of total cholesterol : HDL cholesterol. The methodology for the biochemical measurement of HbA_{1}c in the UKPDS, and standardisation with measures reported in the DCCT has been reported elsewhere [13]. Included in this analysis were results from 3642 patients (white, Asian–Indian and Afro-Caribbean) where annual data on potential risk factors were available. The characteristics of these patients and comparisons with the 3867 patients included in the UKPDS glucose control study have been reported elsewhere [8].

The study design and protocol amendments, which are in accordance with the guidelines of the Declaration of Helsinki as revised in 2000, were approved by the Central Oxford Research Ethics Committee and by the equivalent committees at each centre. Each patient also gave informed witnessed consent.

### Model structure

*-*related complications (fatal or non-fatal MI, other IHD, stroke, heart failure, amputation, renal failure and eye disease measured in terms of blindness in one eye) and death in order to estimate lifetime outcomes and quality-adjusted life expectancy. The model is based on an integrated system of parametric equations used to estimate absolute risk of any of the above complications based on the patient’s characteristics (e.g. age and sex), and time-varying risk factors such as HbA

_{1}c and the patient’s history of complications. Definitions of all risk factors initially included in the model are listed in Table 1. To improve model stability, values for time-varying risk factors (HbA

_{1}c, systolic BP, and total cholesterol : HDL cholesterol) were calculated as the means of the values for the previous 2 years (e.g. the value for these variables in the fourth year was the average of the values from the second and third years), while smoking status was based on 3-year periods from diagnosis of diabetes. To model the long-term dependence [14] between different complications, time-varying covariates indicating whether a patient had had a history of other complications since diagnosis of diabetes were also included in the model. These covariates were set to 0 until an event occurred, and then set to 1 from that point onwards.

Abbreviations, definitions and values of variables included in the model

Abbreviations | Definitions/values | Transformation applied |
---|---|---|

AGE | Age in years at diagnosis of diabetes | AGE—52.59 |

AGE_EVENT | Age in years at occurrence of first diabetes-related event elevating risk of mortality | AGE_EVENT—52.59 |

YEAR | Time since diagnosis of diabetes (years) | |

YEAR 2 | 1 if year = 2; 0 otherwise | |

FEMALE | 1 for female; 0 for male | |

AC | 1 Afro-Caribbean; 0 for Caucasian or Asian Indian | |

SMOK | 1 for current smoker, of tobacco in any form; 0 otherwise | |

SMOK_BASE | 1 for smoker of tobacco in any form after diagnosis of diabetes; 0 otherwise | |

BMI | Body mass index at diagnosis of diabetes (m/kg | BMI—27.77 |

HBA1C | HbA | HBA1C—7.09 |

LHBA1C | HbA | LHBA1C—7.09 |

HBA1C_BASE | HbA | HBA1C_BASE—7.09 |

SBP | Systolic BP (mm Hg), 2-year moving average of yearly values | (SBP—135.09) / 10 |

LSBP | Systolic BP (mm Hg), lagged 1 year values | (LSBP—135.09) / 10 |

SBP_BASE | Systolic BP (mm Hg), after diagnosis of diabetes | (SBP_BASE—135.09) / 10 |

TOTAL:HDL | Total cholesterol : HDL cholesterol ratio, 2-year moving average of yearly values | TOTAL:HDL—5.23 |

LTOTAL:HDL | Total cholesterol : HDL cholesterol ratio, lagged 1 year values. | LTOTAL:HDL—5.23 |

LTOT:HDL_BASE | Total cholesterol : HDL cholesterol ratio, after diagnosis of diabetes | LTOT:HDL_BASE—5.23 |

ATRFIB | 1 for atrial fibrillation at diagnosis of diabetes; 0 otherwise | |

PVD | 1 for peripheral vascular disease at diagnosis of diabetes; 0 otherwise | |

IHD | 1 for history of IHD; 0 otherwise | |

MI_EVENT | 1 for year of first MI event; 0 otherwise | |

MI_POST | 1 for all years after first MI event; 0 otherwise | |

STROKE_EVENT | 1 for year of first stroke event; 0 otherwise | |

CHF | 1 for history of congestive heart failure; 0 otherwise | |

AMP | 1 for history of amputation; 0 otherwise | |

BLIND | 1 for history of blindness in one eye; 0 otherwise | |

RENAL | 1 for history of renal failure; 0 otherwise |

The procedures used to fit equations to the observed events in the UKPDS population, as well as the estimated parameters, are reported in greater detail in the statistical appendix. In brief, each type of diabetes-related event was modelled using one or more equations that included time-varying risk factors. In the case of diabetes-related complications, a Weibull proportional hazards regression was used to model the occurrence of a composite outcome covering both fatal and non-fatal events. The coefficients for risk factors were then estimated using maximum likelihood methods that account for censoring (e.g. due to factors such as loss of follow-up, or death). Risk factors with a* p* value of less than 0.05 were considered statistically significant. Separate equations were used to model diabetes- and non-diabetes-related mortality using a combination of Gompertz and logistic regression equations. Finally, the time paths of four risk factors were also estimated using these data. To aid exposition, we grouped the estimated equations into the following categories.

### Cardiovascular disease

The increased risk of cardiovascular disease among people with Type 2 diabetes is well established and is increasingly recognised as one of the major hazards of Type 2 diabetes. In developed countries, cardiovascular disease accounts for the majority of deaths amongst patients with Type 2 diabetes [15] and is a significant contributor to diabetes-related healthcare costs [16]. Within the cardiovascular disease category we have three Weibull equations to estimate the absolute risk of the following: (i) MI, defined as non-fatal myocardial infarction (International Classification of Diseases, Ninth Revision [ICD-9] code 410), or fatal vascular cardiac event (ICD-9 codes ≥410–≤414.9, or ≥428–≤428.9), or sudden death (ICD-9 codes ≥798–≤798.9); (ii) IHD, defined as an ICD-9 code of ≥411–≤414.9; and (iii) CHF, defined as an ICD-9 code of ≥428–≤428.9. We only recorded IHD and CHF events if they occurred prior to an MI event; a patient who had experienced an MI was not classified subsequently as developing IHD.

### Cerebrovascular disease

People with Type 2 diabetes have a higher risk of stroke, and previous epidemiological studies have found that this risk increases with age, elevated BP, smoking and lipoprotein levels [17]. In our model a single Weibull equation was used to estimate the absolute risk of a first non-fatal stroke (ICD-9 codes ≥430–≤434.9, or 436) or fatal stroke (ICD-9 codes ≥430–≤438.9).

### Amputation

Patients with diabetes have an amputation rate that is many times higher than that of patients without diabetes, and previous studies have found that the primary risk factors for amputation include age, (male) sex, smoking, the presence of peripheral vascular disease, and BP [18]. Our model contains a single Weibull equation to estimate the absolute risk of a first amputation of a digit or limb (ICD-9 codes ≥5.845–≤5.848, or 250.6) or a fatal peripheral vascular event (ICD-9 codes 997.2, 997.6, 250.6 or 440.2).

### Blindness

The model contains a Weibull equation to estimate the risk of blindness in one eye, which is defined as a visual acuity Snellen 6/60 or ETDRS logMAR 1.0 or worse for any reason, persisting for 3 months (ICD-9 codes ≥369–≤369.9).

### Nephropathy

Increasing duration of diabetes, hypertension and poor glycaemic control have been shown to elevate the risk of renal disease [19] and the subsequent risk of mortality [20]. Our model contains a single Weibull equation to predict renal failure, defined as creatinine levels of above 250 µmol/l, not ascribable to any acute inter-current illness (ICD-9 codes 250.3, and ≥585–≤586), and death due to renal failure (ICD-9 codes ≥580–≤593.9).

### Risk factor progression

Previous simulation models of the progression of diabetes have made a variety of assumptions regarding how risk factors such as HbA_{1}c and systolic BP change over time [2, 4]. While carrying forward current values provides a simple means of extrapolation [21, 22], it does not account for the likely change in some of these risk factors over time (e.g. the upward trend in HbA_{1}c levels that many patients experience). In the UKPDS Outcomes Model, we treat the four risk factors (HbA_{1}c, systolic BP, total cholesterol : HDL cholesterol, and smoking status) as panel data (longitudinal data), and fit a random effects model to allow estimation of a time path [23]. While the exact specification differs between risk factors, these equations generally include the value or status of the risk factor at the time a decision was made regarding randomisation in the UKPDS (which took place after a 3-month dietary run-in) [24], the number of years since diagnosis, and the value of the risk factor in the preceding period. For HbA_{1}c an indicator variable for the second year was also included to account for the initial decline in HbA_{1}c observed in the study. Equations for HbA_{1}c, systolic BP, and total cholesterol : HDL cholesterol were based on annual measures of each risk factor, while smoking status (due to its less frequent ascertainment) was based on 3-year periods from diagnosis of diabetes.

### Mortality

People diagnosed with Type 2 diabetes in middle age typically have a life expectancy of between 5 and 10 years shorter than people without diabetes, with heart disease being the major underlying cause of death [25]. Our model contains three equations for estimating absolute risk of mortality. Two of these equations estimate the likelihood of death after the occurrence of several diabetes-related complications that have been shown to elevate the risk of mortality. The first equation, based on logistic regression, estimates the probability of death in the first year in which either an MI, CHF, stroke, amputation or renal failure first occurs (as defined in the Weibull models outlined above). The second equation estimates the risk of diabetes-related mortality of patients with a history of any of these events in all subsequent years. The third mortality equation estimates the risk of death from causes unrelated to diabetes (primarily due to cancers and accidents).

### Using the model to simulate outcomes

The main purpose of the model is to estimate the likely occurrence of the major diabetes-related complications over a lifetime for patients with specified prognostic risk factors, in order to calculate health outcomes such as life expectancy or quality-adjusted life expectancy.

While increases in mean life expectancy are widely recognised as a useful measure of benefit [26], it is also important to encapsulate the potential impact of complications on quality of life. This model uses the quality-adjusted life-years (QALY) method to adjust length of life for quality of life by assigning a value or health utility, on a scale on which 0 represents death and 1 represents full health, for each year of life.

The impact on utility of different diabetes-related complications has been reported in a number of studies, including one study in which the EQ-5D health status questionnaire [27] was used to survey 3192 patients still participating in the UKPDS in 1997 [28]. Based on data from that survey, the mean utility for patients free of microvascular and macrovascular complications was estimated at 0.78. Patients with a history of complications have a lower utility and the following decrements were estimated: −0.055 for an MI; −0.090 for other IHD; −0.164 for stroke; −0.108 for heart failure; −0.280 for amputation; and −0.074 for blindness in one eye. For simplicity and in the absence of sufficient data to estimate empirically, it is assumed that multiple complications have an additive effect on utility, e.g. the utility of a patient who had other IHD and then had an MI would first be decremented by 0.090 and then by a further 0.055. However, the UKPDS Outcomes Model is sufficiently flexible to allow other assumptions to be adopted regarding the impact that different complications (and combinations of complications) have on quality of life.

When estimating QALYs it is important to model the sequence of endpoints for each patient. For example, consider the simulated profiles for two patients who are free of complications: the first patient is predicted to have an amputation in the second year post follow-up, an IHD and an MI in the fourth year post follow-up, and is predicted to die in the fifth year; the second patient is predicted to have IHD in the third year post follow-up, an amputation in the fourth year, and a fatal MI in the fifth year. Using the utility weights reported above, while both patients experience the same set of diabetes-related complications over their remaining lifetimes, the QALY profile is different: the profile of the first patient is equivalent to 2.19 QALYs, and the second is equivalent to 2.66 QALYs. By simulating individual patient histories, the model is able to take these differences into account.

There is a need to account for event-related dependence between some complications, as noted above. The actual event-related dependencies were estimated empirically during the equation-fitting procedure, and included if they had a significance level (*p* value) of less than 0.05. We then checked all significant dependencies against the known epidemiology, as discussed in more detail below. In the model, if a non-fatal event is predicted to have occurred, the patient acquires a history of that event (the time-varying covariate for that endpoint is updated from 0 to 1), and thereafter carries that history in their set of characteristics. In addition, as noted above, the two diabetes-related mortality equations are applied to all patients who have a history of complications that were found to elevate the risk of death (MI, CHF, stroke, amputation or renal failure) from the point at which the complication is predicted to have occurred. These equations deal with the likelihood of immediate death following the first event, and the long-term elevation of risk of death consequent to acquiring a history of these complications. Finally a non-diabetes mortality equation is used to forecast the incidence of death from cancer, accidents or other causes not addressed elsewhere in the model.

If the model predicts that a death has occurred, the years lived and quality-adjusted years lived (life years decremented in each cycle by the reduced quality of life associated with particular complications) by that patient are calculated. If the patient is predicted to have survived that cycle, the risk factor equations are used to update their current risk factor values (e.g. systolic BP) and these are carried forward to the next cycle of the model along with the updated event history.

It is important to note that the order in which the event equations listed in Figure 1 (equations 1 to 10) are evaluated to determine the occurrence of an event is not predetermined. Further, some of these events are competing risks (e.g. if a patient dies within a cycle of the model, they can have no additional events). To take this into account, the equations are run in random order in each cycle.

### Internal validation of the model using the UKPDS population

We tested the consistency of the forecast cumulative incidence of different complications and death to the cumulative incidence calculated using non-parametric (life-table) methods. This test of internal validity [29] was based on the forecast number of events from the second to the twelfth year after diagnosis, using the information from the first year after diagnosis in the 3642 patients used to estimate the model [11].

### Application of the model

We demonstrate two potential applications of the model. Firstly, we used the simulation model to extrapolate outcomes over the remaining lifetime of the 3867 patients randomised to conventional or intensive blood glucose control in the UKPDS. Estimated outcomes are stratified by sex and 5-year age categories for patients over 45 years of age at diagnosis of diabetes. Secondly, we calculated the incremental benefit of a regimen of intensive blood glucose control compared with conventional therapy for patients’ remaining lifetimes in terms of life expectancy and quality-adjusted life expectancy. Previously, in the absence of a full outcomes model, we reported the cost-effectiveness of this intervention only in terms of time free of diabetes-related complications [30] using all patients randomised in that comparison rather than the 3642 used here. Given the illustrative nature of these applications we did not discount outcomes.

### Handling uncertainty

Appropriate handling of uncertainty in complex patient simulation models such as the UKPDS Outcomes Model is not straightforward. In applications involving extrapolation of outcomes of patients who were alive at the end of the study, a combination of bootstrap methods and multiple imputation methods were used. The confidence intervals around the QALY estimates were adjusted to take into account the variance within and across imputed data sets [31]. Our method removes Monte-Carlo error, so that confidence intervals from the outcomes model reflect parameter uncertainty in the model.

## Results

Sample size, functional form, parameters and beta coefficients (SEs) for seven equations to estimate the probability of diabetes-related complications

Eq. 1 | Eq. 2 | Eq. 3 | Eq. 4 | Eq. 5 | Eq. 6 | Eq. 7 | |
---|---|---|---|---|---|---|---|

Complication | IHD | MI | CHF | STROKE | AMP | BLIND | RENAL |

No. of subjects | 3612 | 3642 | 3607 | 3607 | 3642 | 3642 | 3642 |

Functional form | Weibull | Weibull | Weibull | Weibull | Weibull | Weibull | Weibull |

Parameters | Estimate of coefficient (SE) | ||||||

λ | −5.310 (0.174) | −4.977 (0.160) | −8.018 (0.408) | −7.163 (0.342) | −8.718 (0.613) | −6.464 (0.326) | −10.016 (0.939) |

ρ | 1.150 (0.067) | 1.257 (0.060) | 1.711 (0.158) | 1.497 (0.126) | 1.451 (0.232) | 1.154 (0.121) | 1.865 (0.387) |

AGE | 0.031 (0.008) | 0.055 (0.006) | 0.093 (0.016) | 0.085 (0.014) | 0.069 (0.014) | ||

FEMALE | −0.471 (0.143) | −0.826 (0.103) | −0.516 (0.171) | ||||

AC | −1.312 (0.341) | ||||||

SMOK | 0.346 (0.097) | 0.355 (0.179) | |||||

BMI | 0.066 (0.017) | ||||||

HBA1C | 0.125 (0.035) | 0.118 (0.025) | 0.157 (0.057) | 0.128 (0.042) | 0.435 (0.066) | 0.221 (0.050) | |

SBP | 0.098 (0.037) | 0.101 (0.026) | 0.114 (0.056) | 0.276 (0.042) | 0.228 (0.075) | 0.404 (0.106) | |

TOTAL:HDL | 0.113 (0.025) | ||||||

Ln (TOTAL:HDL) | 1.498 (0.202) | 1.190 (0.169) | |||||

PVD | 2.436 (0.521) | ||||||

ATRFIB | 1.428 (0.472) | ||||||

IHD | 0.914 (0.150) | ||||||

CHF | 1.558 (0.202) | 1.742 (0.287) | |||||

BLIND | 1.812 (0.462) | 2.082 (0.551) |

Sample size, functional form, parameters and beta coefficients (SEs) for three equations to estimate the probability of mortality

Eq. 8 | Eq. 9 | Eq. 10 | |
---|---|---|---|

Event | EVENT FATALITY | DIABETES MORTALITY | OTHER DEATH |

No. of subjects | 717 | 584 | 3642 |

Functional form | Logistic | Gompertz | Gompertz |

Parameters | Estimate of coefficient (SE) | ||

λ | −3.251 (0.358) | −5.124 (0.363) | −6.373 (0.162) |

φ | 0.003 (0.038) | 0.154 (0.016) | |

Ln (AGE_EVENT) | 2.772 (0.716) | 4.731 (1.066) | |

AGE × (FEMALE) | 0.081 (0.013) | ||

AGE × (1-FEMALE) | 0.104 (0.012) | ||

SMOK | 0.307 (0.141) | ||

HBA1C | 0.114 (0.053) | ||

TOTAL:HDL | 0.109 (0.047) | ||

MI_EVENT | 2.640 (0.336) | 3.939 (0.275) | |

MI_POST | 1.119 (0.277) | ||

STROKE_EVENT | 1.048 (0.376) | 2.807 (0.408) | |

RENAL | 1.585 (0.315) | ||

AMP | 1.032 (0.377) |

Sample size, functional form, parameters and beta coefficients (SEs) for four equations used to estimate annual risk factor values from diagnosis of diabetes

Eq. 11 | Eq. 12 | Eq. 13 | Eq. 14 | |
---|---|---|---|---|

Risk factor | HBA1C | SBP | TOTAL:HDL | SMOK |

No. of subjects | 3631 | 3592 | 3520 | 3536 |

Type of regression | Panel | Panel | Panel | Logistic |

| 0.64 | 0.65 | 0.44 | |

Parameters | Estimate of coefficient (SE) | |||

α | −0.024 (0.017) | 0.030 (0.014) | −0.021 (0.007) | −4.020 (0.236) |

Ln (YEAR) | 0.144 (0.009) | 0.039 (0.008) | ||

YEAR | −0.203 (0.024) | |||

YEAR_2 | −0.333 (0.050) | |||

AGE | −0.027 (0.008) | |||

FEMALE | −0.489 (0.154) | |||

LHBA1C | 0.759 (0.004) | |||

HBA1C_BASE | 0.085 (0.004) | |||

LSBP | 0.717 (0.004) | |||

SBP_BASE | 0.127 (0.004) | |||

LTOTAL:HDL | 0.526 (0.005) | |||

LTOTAL:HDL_BASE | 0.252 (0.006) | |||

LSMOK | 1.878 (0.211) | |||

SMOK_BASE | 4.879 (0.494) |

_{1}c, systolic BP, total cholesterol : HDL cholesterol and smoking status, Figure 3a–d shows: (i) the actual mean value of the risk factor (for the 3642 patients used to estimate the model); (ii) the predicted values, using the equations given in Table 4, for patients with differing values of the relevant risk factor at the time of the decision regarding randomisation in the UKPDS. For HbA

_{1}c, which is shown in Figure 3a, it is evident that patients with an initial value of between 6% and 8% subsequently follow a time path similar to the mean of the observed values for all patients, while the time path for patients with higher initial values differs from the mean of the observed values for all patients, possibly reflecting more intensive treatment aimed at improving the metabolic control of these high-risk patients.

### Two illustrative applications of the model

Estimated mean life expectancy and mean quality-adjusted life expectancy by age at diagnosis of diabetes in the UKPDS population^{a}

Life expectancy (95% CI [years]) | Quality-adjusted life expectancy (95% CI [QALYs]) | |||
---|---|---|---|---|

Age at diagnosis of diabetes | Men | Women | Men | Women |

45–49 | 25.54 (21.92–29.16) | 26.80 (22.12–31.48) | 18.82 (16.48–21.15) | 20.06 (16.92–23.19) |

50–54 | 21.51 (18.65–24.37) | 23.58 (19.94–27.22) | 15.96 (14.14–17.79) | 17.62 (15.30–19.93) |

55–59 | 17.94 (15.78–20.11) | 20.16 (17.34–22.98) | 13.34 (11.97–14.71) | 15.10 (13.23–16.97) |

60–64 | 13.91 (12.55–15.27) | 17.65 (16.58–19.73) | 10.34 (9.46–11.22) | 13.15 (11.88–14.42) |

Estimated life expectancy and QALYS for UKPDS patients on conventional and intensive blood glucose control

UKPDS regimen | ||||
---|---|---|---|---|

Conventional group | Intensive group | Difference | (95% CI) | |

Life expectancy | ||||

Within trial | 9.88 | 10.00 | 0.12 | (−0.13 to 0.36) |

Over lifetime | 21.98 | 22.35 | 0.37 | (−0.72 to 1.46) |

Quality-adjusted life expectancy | ||||

Within trial | 7.62 | 7.72 | 0.10 | (−0.09 to 0.29) |

Over lifetime | 16.35 | 16.62 | 0.27 | (−0.48 to 1.03) |

## Discussion

Simulation models such as the UKPDS Outcomes Model make it possible to quantify the lifetime impact of chronic diseases and the long-term benefits associated with alternative management strategies. They are therefore an important adjunct to randomised controlled trials and other types of epidemiological study, and may also be of value in disease management and healthcare planning. They are essential for proper evaluation of interventions affecting any of the risk factors incorporated in the model, because they permit users to estimate the likely occurrence, over a lifetime, of the main diabetes-related complications. Only when this has been done can quality-adjusted life expectancy and lifetime treatment costs be fully estimated, and the cost-effectiveness of the intervention calculated.

As noted in the Introduction, a number of other diabetes simulation models have been developed, and the present model has benefited greatly from the insights these have provided.

The UKPDS Outcomes Model has some important advantages over some previous simulation models: (i) we used data from 3642 patients who participated in the UKPDS; (ii) there was a median follow-up period of 10.3 years; (iii) we analysed full clinical histories including biochemical measures and ascertainment of major diabetes-related complications; and (iv) there was a very low loss of data during follow-up. The quantity and quality of these data, and the fact that they have come from a single study, has obviated the need to synthesise evidence from many disparate sources. Indeed, we have also been able to use this patient population to estimate the impact of the same diabetes-related complications on subsequent healthcare costs and on quality of life. These results have been reported elsewhere [28] but in time will be integrated with the outcome model described here to form part of a more general UKPDS diabetes health economics model.

The volume of data available to us also removed many constraints on the modelling approach adopted and allowed us to construct a model based on a set of equations estimated using a consistent methodology. In particular, our use of time-varying covariates allowed us to model the linkages between different diabetes-related complications. These have increasingly been recognised as crucial to such simulation models: as a recent study has noted “...it can be catastrophically inaccurate to simulate each kind of event as if it were statistically independent” [2]. We have attempted to address this issue by incorporating linkages where there is a high degree of statistical significance and supporting clinical and epidemiological evidence. For example, our results for stroke are consistent with those from the Framingham study showing that a history of heart failure and atrial fibrillation increase the risk of subsequent stroke [17]. The model may also be used to simulate outcomes in populations with differing baseline characteristics such as age, smoking status, systolic BP, blood lipids, HbA_{1}c and history of diabetes-related complications.

A number of limitations of the UKPDS Outcomes Model should be acknowledged. Firstly, the model only predicts the first event in any single category of diabetes-related complications, and does not allow series of events such as sequential amputations to be modelled directly. However, this limitation should not be overstated, as: (i) such multiple events in the UKPDS data were relatively infrequent; (ii) subsequent fatal events in specific categories of diabetes-related complications are included in the diabetes-related mortality equation; and (iii) additional post-study monitoring data will in time allow this issue to be revisited.

Secondly, the model does not explicitly incorporate a number of morbidities related to diabetes. In particular, peripheral neuropathy and ulceration may give rise to significant reduction in quality of life, to hospitalisation and to high treatment costs [33]. These conditions are also likely to elevate the risk of subsequent events such as amputation. However, these were not included as major endpoints in the UKPDS and cannot easily be incorporated into the outcomes model. Hypoglycaemia and hyperglycaemia are also excluded. Episodes of these conditions are infrequent and rarely serious, but more serious morbidity could occur in small numbers of patients undertaking intensive glycaemic management.

Thirdly, some complications of diabetes are represented in the model using a single state. For example, the only state representing eye disease in the model is the endpoint of blindness in one eye. Again, this is consistent with the primary analyses of the UKPDS, but it almost certainly fails to fully describe the complex process of disease progression and associated treatments, as well as the potential impact on quality of life. There is scope to develop a more detailed model of the progression of eye disease in future work.

Fourthly, we have briefly described the methods used to estimate confidence intervals around our estimates, but there are many important methodological and computational issues in the generation of these confidence intervals and more generally in the ways uncertainty should be handled in a complex model; these need to be explored in greater detail in the future.

We expect the UKPDS Outcomes Model to develop in a number of ways as further follow-up data become available. The greater length of follow-up will allow the shape of the hazard in the Weibull and Gompertz equations to be estimated with greater precision. The risk factors and linkages contained in the model can be re-examined, especially the equations that are currently based on a relatively small number of events (e.g. only 24 patients had renal failure).

The UKPDS Outcomes Model has been described here in some detail to demonstrate the rationale for our approach, the problems encountered and the remaining limitations. Publication of the coefficients of all equations and the description of the simulation algorithm have been included to facilitate a high degree of transparency and to promote a deeper understanding of the model. In this study we have demonstrated that the UKPDS Outcomes Model is able to simulate event histories that closely match observed outcomes of patients in the UKPDS study, and that it can also be used to extrapolate lifetime outcomes. The next challenge is to test external validity by applying the model to other populations of Type 2 diabetic patients. Until this has been undertaken, caution should be exercised when using the model to predict outcomes in these populations. We look forward to the assessment of the model’s external validity and to its future use in assessing outcomes of therapies for Type 2 diabetes.

## Acknowledgements

The UK Prospective Diabetes Study (UKPDS) Group is composed of the following centres:

Radcliffe Infirmary, Oxford; Royal Infirmary, Aberdeen; General Hospital, Birmingham; St George’s Hospital, London; Hammersmith Hospital, London; City Hospital, Belfast; North Staffordshire Royal Infirmary, Stoke-on-Trent; Royal Victoria Hospital, Belfast; St Helier Hospital, Carshalton; Whittington Hospital, London; Norfolk and Norwich Hospital; Lister Hospital, Stevenage; Ipswich Hospital; Ninewells Hospital, Dundee; Northampton Hospital; Torbay Hospital; Peterborough General Hospital; Scarborough Hospital; Derbyshire Royal Infirmary; Manchester Royal Infirmary; Hope Hospital, Salford; Leicester General Hospital; Royal Devon and Exeter Hospital.

The cooperation of the patients and staff at the UKPDS centres is much appreciated. The UKPDS received funding from the UK Medical Research Council, the British Diabetic Association, the UK Department of Health, the National Eye Institute and the National Institute of Diabetes and Digestive and Kidney Disease (the US National Institutes of Health), the British Heart Foundation, The Wellcome Trust, the Charles Wolfson Charitable Trust, the Clothworkers’ Foundation, the Health Promotion Research Trust, the Alan and Babette Sainsbury Trust, the Oxford University Medical Research Fund Committee. Funding was also provided by pharmaceutical companies including Novo-Nordisk, Bayer, Bristol-Myers Squibb, Hoechst, Lilly, Lipha and Farmitalia Carlo Erba, GlaxoWellcome, SmithKline Beecham, Pfizer, Zeneca, Pharmacia and Upjohn, and Roche provided grants for health economics and epidemiological studies.