Introduction

From a public health standpoint, screening for diabetic eye disease is one of the most cost effective preventative health procedures available [13]. Annual examinations are standard in most diabetic eye-screening programmes and recommended by the World Health Organization and most health authorities and organisations involved in diabetes and ophthalmology [46]. As the prevalence of diabetes mellitus worldwide is rising and with it the total cost of screening, it becomes more important to make screening programmes as effective as possible [7].

With a fixed interval between screening visits, the interval is the same for all, whereas the risk of developing sight-threatening retinopathy (STR) between screening visits is individually variable. Some patients are at high risk of developing STR and requiring treatment before the next screening visit, whereas for others this risk would be low. The fixed screening interval must be geared towards patients at relatively high risk; otherwise they might go blind. The fixed ‘one size fits all’ approach to screening leaves room for improvement. By basing the screening interval on each patient’s risk margin, it should be possible to increase screening frequency for those at high risk and thereby increase their safety and at the same time reduce the screening frequency for patients at low risk and reduce expenditure on healthcare costs and patients’ time (Fig. 1). Indeed, such an approach might apply to a variety of diseases.

Fig. 1
figure 1

The schematic drawings illustrate the conceptual difference between fixed annual screening and the individualised approach to diabetic eye screening. a Annual screening: the group of diabetic patients with variable risk for sight-threatening diabetic retinopathy. Red indicates high risk, yellow, medium and green, low risk. All have fixed 12 month screening intervals. However, their risk of developing sight-threatening diabetic retinopathy within the year is vastly different. This is the ‘one size fits all’ approach. b The individualised screening: again the group of diabetic patients with variable risk is depicted. Here the screening interval is based on the individual risk. Patients at high risk are screened more frequently, up to every 6 months, whereas patients at low risk are screened less frequently, down to every 5 years

In 1991 Dasbach and associates examined various approaches to screening for diabetic retinopathy [8]. In 1993, Kalm and Jonsson [9] proposed variable screening intervals based on risk margins and in 1994 the Icelandic screening programme moved in this direction by lengthening the screening interval to 2 years for diabetic patients without retinopathy [10]. This approach has proven to be safe and effective [11], and has been adopted by many diabetic eye-screening programmes. Classifying risk margins based on broad classification, such as presence or absence of retinopathy, is easily mastered by the busy clinician and is traditional in medicine. This approach can be improved dramatically by taking into account more risk factors in a quantitative way with the help of information technology.

The risk factors for incidence and progression of diabetic retinopathy are well known and include duration and type of diabetes, mean blood glucose or HbA1c, systolic blood pressure, and presence of retinopathy [1218]. On the basis of these risk factors it is possible to calculate the individual risk for the development and progression of diabetic retinopathy. While a quantitative calculation based on five or six variables cannot easily be made by the busy clinician, a mathematical algorithm in a computer can do so in seconds. The individual risk for progression to sight-threatening retinopathy (STR) is a sound foundation for recommending an appropriate screening interval for the individual patient, where the risk margin for developing STR (diabetic macular oedema or proliferative diabetic retinopathy) within the screening interval is pre-determined.

In this study we use a mathematical algorithm based on known risk factors to obtain an individualised risk assessment, which is used to determine screening intervals for diabetic retinopathy. The purpose is to standardise individual risk, use the risk assessment to control screening intervals and make screening programmes more effective.

Methods

A mathematical algorithm to compute the risk of STR was created based on: (1) epidemiological data for the prevalence of diabetic retinopathy from the Icelandic eye-screening database [1821]; and (2) on risk factors from published reports [1217] (Table 1). Individual or personal data was not considered in the creation of the algorithm, whereas individual data from the Aarhus diabetic database was used in the empirical testing (see below).

Table 1 Major risk factors and their risk ratios for progression of diabetic retinopathy used in our project, and citations for relevant epidemiological studies

The risk function was based on the Weibull proportional hazards model. The model has two parts: (1) the baseline cumulative probability of STR; and (2) the relative risks associated with risk factors. It is a standard parametric survival function and has, for example, been used in the SCORE project of the European Society of Cardiology to estimate cardiovascular risk [22]. From this model the baseline survival probability for being free of disease, that is, STR, has the form S 0(t) = exp[−exp(α) t p], where t is the time from onset (duration) of diabetes and exp() is the exponential function. The values of α and p define the shape of the survival function and provide flexibility to accurately describe collected survival data. The baseline survival probability of STR is 1 minus the probability of being free of STR or \( {F_0}(t) = {1} - {S_0}(t) = {1} - { \exp }\left[ { - { \exp }\left( \alpha \right){t^p}} \right] \).

The variables α and p, to define S 0(t), were found separately for type 1 and type 2 diabetes by fitting F 0(t) to epidemiological data on STR in Iceland [1821]. The nlminb optimisation routine in R, for non-linear curve estimation, was used for fitting of F 0(t) with results displayed in Fig. 2 [23]. For type 1 diabetes we found that \( {F_0}(t) = {1} - { \exp }\left[ { - { \exp }\left( { - {7}{.849}} \right){t^{{{2}{.075}}}}} \right] \) and for type 2 we found that \( {F_0}(t) = {1} - \left[ {{ \exp }\left( { - { \exp }\left[ { - {4}{.88}} \right]{t^{{{1}{.170}}}}} \right) - 0.0{52}} \right] \). For type 2 we used an offset of 0.052 in risk because it was estimated that 5.2% of newly diagnosed participants had already developed STR [18].

Fig. 2
figure 2

Prevalence of STR by duration of diabetes. Prevalence data from the Icelandic diabetic population (dashed line/circles) with fitted Weibull cumulative probability curve (solid line). a Prevalence for type 1 diabetes. b Prevalence for type 2 diabetes

Proportional hazards were assumed, as is typical for risk models and was assumed in the analysis of the UK Prospective Diabetes Study (UKPDS) and Wisconsin data [1214, 17], to generate personalised estimates of ‘survival’ free from STR by exponentiating S 0(t) with a linear combination of established risk factors and relative risk variables so that the individual survival was S(t)=S 0(t)exp(linear combination). For type 1 diabetes the linear combination used had the form:

$$ \left( {{\text{Hb}}{{\text{A}}_{{1{\text{c}}}}}\left[ \% \right] - 8} \right) \times 0.1851 + \left( {{\text{sbp}} - 130} \right) \times 0.007813 + \left( {{\text{DRpresent}} - 0.52} \right) \times \left( {1.1 + {\beta_{\text{DR}}}} \right) $$

where β DR = 0.194 for men and β DR = −0.194 for women when DR is present and β DR = 0 otherwise. Here sbp stands for systolic blood pressure and DRpresent = 1 if diabetic retinopathy was present and DRpresent = 0 otherwise. For type 2 diabetes, the linear combination was

$$ \left( {{\text{Hb}}{{\text{A}}_{\text{1c}}}\left[ \% \right] - {8}} \right) \times 0.{38}0{544} + \left( {{\text{sbp}} - {13}0} \right) \times 0.0{43}0{8} + \left( {{\text{DRpresent}} - 0.{33}} \right) \times \left( {0.{89} + {\beta_{\text{DR}}}} \right), $$

where β DR = 0.46 for men and β DR = −0.46 for women if diabetic retinopathy was present and β DR = 0 otherwise. The linear combination was centred on mean values of HbA1c and systolic blood pressure in the Icelandic diabetic population, namely 8% (64 mmol/mol) and 130 mmHg respectively, and the prevalence of diabetic retinopathy for type 1 diabetes was 52%, and for type 2 diabetes was 33%. This was done to calibrate the risk so that F 0(t) defined the average risk in the population.

The risk of developing STR in a time interval Δt, for a retinopathy free individual with given risk factor values and t as the duration of diabetes at time t, was then computed as:

$$ {\text{risk}}\left( {\left. {\Delta t} \right|{\text{disease}}\;{\text{free}}\;{\text{at}}\;t} \right) = {1} - S\left( {t + \Delta t} \right)/S(t) $$

To compute the time interval Δt when the risk would reach a value r, between 0 and 1, the equation:

$$ {\text{risk}}\left( {\left. {\Delta t} \right|{\text{disease}}\;{\text{free}}\;{\text{at}}\;{\text{t}}} \right) = r $$

was solved for Δt. The solution Δt was then used as the recommended maximum screening time for the patient before the risk would reach the risk margin, r.

Through a website, www.risk.is, the algorithm receives clinical data including type and duration of diabetes, HbA1c or mean blood glucose, blood pressure and the presence and grade of retinopathy (see electronic supplementary material [ESM] Fig. 1). These data are used to calculate risk for STR for each individual’s worse eye over a given time span. An acceptable risk margin is defined and the algorithm recommends the screening interval for each patient, i.e. when (s)he should next be seen in the screening clinic. For empirical testing the risk margin was set so that the number of individuals with STR at the next screening visit was equal with our algorithm and fixed annual screening. In the Aarhus database 149 patients out of 5,199 developed STR during their first year in the screening programme. This amounts to 2.9% and the risk margin in our algorithm was set at 3.2% to match this rate. The risk margin is slightly higher because of the difference in mean risk between the Aarhus data and the Icelandic population data.

Empirical testing for safety and efficacy of the algorithm was performed using patient records in the database for diabetic retinopathy at the Department of Ophthalmology, Aarhus University Hospital, Denmark. The database is described in detail by Mehlsen et al. [24, 25]. Prospectively accumulated clinical data, fundus photographs and information on outcome are available for 5,199 patients over 20 years and this allows testing of the algorithm in a prospective manner.

Receiver operating characteristics (ROC) analysis of the diagnostic capacity of the algorithm was performed using the roc package in the statistical program R [23].

For each patient in the database the risk of STR was calculated at the first recorded visit in the database. This risk level was used to determine the recommended screening interval, Δt, for the first interval alone. The clinical outcome as recorded in the database was used to empirically find whether each individual patient developed STR within the recommended interval.

Missing values in the Aarhus database were handled as follows: missing blood pressure values were set at 130 mmHg and HbA1c at 8% (64 mmol/mol). There were missing values for blood pressure for a total of 3,492 patients (67%) out of 5,199 patients. There were missing values for HbA1c for 1,131 patients (22%) out of 5,199 patients. Individual data for duration of diabetes, sex, and presence of diabetic retinopathy or STR were available for all patients.

For clinical use it was decided to impose a 6 month floor and 60 month ceiling to screening intervals.

Results

Testing the algorithm in the Aarhus diabetic database indicates that the mean recommended screening interval is 29 months. The reduction in screening frequency is 59% compared with fixed annual screening; 2.9% of the patients developed STR before the next screening visit.

ESM Fig. 1 shows a computer screen shot and an example of how the algorithm is used for an individual patient. The patient’s clinical data are entered in the panel and the graph shows this patient’s risk of developing STR over time. The algorithm recommends that this patient be screened again after 24 months, as is shown in red on the computer screen.

Figure 3 shows the recommended screening interval for type 1 and 2 diabetes patients with variable risk profiles as determined by their risk factors. The screening interval is limited by the 6 month floor and 60 month ceiling and varies according to individual risk. The risk profiles for individual diabetic patients are highly variable. The mean screening interval is 29 months and the median is 23 months. The first quartile for the screening interval is 11 months and the third quartile is 49 months.

Fig. 3
figure 3

Screening time (months) by type of diabetes, HbA1c, systolic blood pressure, diabetic retinopathy (DR; including proliferative retinopathy [PDR]) and sex. The figure shows the recommended screening interval for patients with variable risk profiles as determined by their risk factors. A 6 month floor and a 60 month ceiling is applied and the screening interval ranges from 6 months in high-risk individuals to 5 years for those at low risk. Recommended screening intervals are shown in (a) for type 1 diabetes patients and in (b) for type 2 diabetes patients. Red, ≤6 months; orange, 7–12 months; yellow, 13–24 months; light green, 25–36 months; dark green, 37–48 months; blue, ≥49 months. To convert values for HbA1c in % to mmol/mol, subtract 2.15 and multiply by 10.929

Figure 4a shows a ROC curve, and the area under the curve was found to be 0.76 (95% CI 0.74, 0.78). This number indicates that there is a 76% probability that a randomly selected patient who develops STR will be given a higher risk score than a randomly selected patient who does not develop STR. Figure 4b shows a calibration graph, showing the observed fraction of STR events within each decile of estimated risk.

Fig. 4
figure 4

a ROC curve, with area under the curve of 0.76 (95% CI 0.74, 0.78). b Calibration graph, showing observed fraction of STR events within each decile of estimated risk. The error bars represent 95% CI

Discussion

This is the first report to present a mathematical algorithm, based on epidemiological data on risk factors for diabetic retinopathy, to determine screening intervals for diabetic eye disease. Empirical testing in the Danish database suggests that this approach gets the same results with fewer screening examinations than the fixed annual screening programmes currently in use. With the standard risk margin the number of screening visits is reduced by 59% compared with annual screening, with the same number of patients developing STR before the next screening visit. This reduces the cost for screening programmes and improves utilisation of healthcare resources.

Regular screening and preventive treatment for diabetic retinopathy is a powerful tool to reduce blindness. In Iceland, the prevalence of blindness within the diabetic population decreased from 2.4% to 0.5% following the onset of screening in 1980, and this is largely attributable to the public health programme [2, 10, 1821, 26]. Comparable success has been seen with similar programmes in Sweden and Denmark [2729].

The list of risk factors includes type and duration of diabetes, HbA1c, systolic blood pressure, sex, and the presence and grade of retinopathy. These factors have been reported to predict the development of retinopathy (Table 1). It is an advantage that the algorithm uses only a few risk factors, which makes it easy to use for the patient or the clinician. Mean blood glucose is the number that appears on the blood sugar meter when turned on and could be used as a substitute for HbA1c in cases where the HbA1c is unknown by the patient. Using a mean weighted HbA1c would be a better and a more accurate way of estimating blood sugar control than using HbA1c at a single point in time, since these values are not constant for an individual patient. In practice we would recommend that physicians calculate a mean of the last three HbA1c measurements for each individual patient to give a recommendation on the screening interval. If an individual patient has HbA1c measurements in a wide range it would also be recommended to go with the worst case scenario and use the highest HbA1c level.

For example, a male patient with type 2 diabetes with duration of 10 years, blood pressure of 130/80 mmHg and no known retinopathy would have a screening interval of 53 months with HbA1c of 7% (53 mmol/mol). This would change to 17 months if HbA1c increased to 10% (86 mmol/mol). For a similar patient with 20 year duration of diabetes, the recommended screening interval would be 47 months with HbA1c of 7% (53 mmol/mol) and only 15 months with HbA1c of 10% (86 mmol/mol). We expect that each diabetic patient be followed up on a regular basis by his diabetologist or general practitioner. Should the patient’s risk factors change over time, the risk analysis and recommendation for the screening interval should be adjusted.

Gross proteinuria is a risk indicator of proliferative diabetic retinopathy in younger-onset patients but the relationship was of borderline statistical significance in a population-based study in Wisconsin [30]. Proteinuria may not be strongly predictive for the development of retinopathy and we do not use proteinuria as a predictor in our algorithm. Somewhat counter-intuitively, smoking status was inversely related to the development of new lesions and progression of established retinopathy in the UKPDS50 study [16]. In spite of this data we opted not to include smoking in the risk calculator due to the deleterious effects of smoking in terms of cardiovascular and lung cancer risk that would far outweigh the retinopathy risk reduction in terms of total morbidity.

This study can be seen as an extension of our earlier studies [10, 11], which show that alternate year screening for diabetic eye disease seems to be safe and effective in diabetic patients without retinopathy and, by doing this, the financial savings are substantial. In this study we have gone further and used information technology and available epidemiological studies to standardise the risk and make individual determinations of screening intervals for diabetic eye disease. This makes screening programmes more focused and less expensive. In the early days of diabetic eye screening, Dasbach [8] suggested that cost effectiveness of screening could be improved by selectively targeting high-risk groups of the diabetic population. This was followed by the work of Kalm and Jonsson [9], who also used modelling work to suggest that diabetic screening intervals could be varied and lengthened based on the risk profile of diabetic patients.

We face a global epidemic of diabetes [31, 32], where the number of patients with diabetes in the world is estimated to increase from 171 million in 2000 to at least 366 million in 2030 [7]. While public health programmes for diabetic eye screening are very effective and indeed highly cost effective [1], they are expensive to operate, owing to the very large number of patients. The cost per screening visit in the ophthalmologist-based Icelandic screening system is approximately 53 euros, and the cost involved in photographic screening programmes in Europe is 30–40 euros. Let us assume that each screening visit in developed countries costs 40 euros and the visit and travel takes 2 working hours of the patient’ time. Annual screening for 1 million diabetic patients would cost 40 million euros and 2 million working hours and this cost could be reduced by 59% with our algorithm. Wild et al. [7], estimate the prevalence of diabetes in the USA to be 18 million in 2000 and 30 million in 2030 and it is reasonable to assume that Europe’s somewhat larger population is similar, and we will estimate the current number at 25 million diabetic patients in each region. In this case, either Europe or the USA would each require about 1 billion euros and 50 million working hours annually to conduct a fixed annual diabetic eye-screening programme. Individualised risk assessment and screening in Europe or the USA could save 580 million euros and 29 million working hours each year in each region. In both cases the benefit of reduced prevalence of blindness would far outweigh the cost of screening [1].

Our algorithm is based on large international studies on diabetic retinopathy [1218], and published literature on the epidemiology of diabetic retinopathy was considered. Our algorithm is based on the Icelandic population and it was tested empirically in a Danish population. Both are white with Nordic heritage. While this gives us reasonable confidence in our conclusion, further testing in other populations would be valuable, particularly in order to extend these results to populations of different ethnicity.

The Aarhus database represents prospectively accumulated data of diabetic patients over 20 years. This database allows empirical testing of the outcome for the algorithm based on actual data from actual patients with recorded data at reported time points. A large prospective study is desirable for further testing of our protocol.

The distribution of screening intervals is shown in Fig. 3. Here, the 6 month floor and 60 month ceiling is applied, which removes the extremes in screening intervals.

The area under the ROC curve in Fig. 4a was 0.76, which represents an acceptable diagnostic capacity and is within the range typical for risk models for cardiovascular disease [33].

Figure 4b demonstrates that the algorithm tends to overestimate the risk, especially for high risk. This may be due to a number of factors. In our algorithm we assume that the risk factors are independent, which may not be exact. Also, we assume that the risk factors have a linear effect over the entire range, which may not apply for high values. Third, some of the patients with high-risk characteristics may have received treatment to improve the risk factors such as blood glucose and blood pressure. Fourth, in a relatively large group of patients, default values had to be applied for blood pressure and this could also be responsible for some errors. And last, the baseline cumulative prevalence in our data may not be directly transferable to a new population. Simple calibration methods, such as suggested by van Houwelingen [34], could be applied to the risk function to refine the adaptation to the population using the algorithm. The observed fraction of STR events by deciles of risk would then be seen to be closer to the diagonal in Fig. 4b. This has no effect on risk ranking in the database and therefore the diagnostic capacity of the algorithm stays the same.

Eye screening and preventive treatment for the 200–300 million diabetic patients in the world would substantially decrease the global burden of blindness [13]. Such screening is still a distant dream, in spite of 20 years of goals and plans by major medical organisations. One reason for the lack of success is the considerable cost of such a programme and the lack of resources, both money and manpower. Our information technology can decrease the costs by more than half and help make global diabetic eye screening economically feasible. The basic approach used in this project could be applied to other disease categories and healthcare delivery. Information technology based on available epidemiological data opens the way to individualised allocation of healthcare resources, which will make healthcare more economical and more effective in general.