Introduction

The plasma glucose concentration, like many other physiological variables, is a tracking variable, i.e. individual values tend to remain in the same region of the population distribution over time. As a consequence, plasma glucose values are good predictors of incident diabetes. Predicting diabetes has dual value: it provides the evidence basis for setting diagnostic thresholds for diabetes (and dysglycaemic states in general) and it suggests pathophysiological mechanisms for the development of hyperglycaemia.

Diagnostic thresholds are defined by convention depending on the kind and level of risk to be prevented. In the case of diabetes, the risk of microvascular and macrovascular complications rises with plasma glucose levels, the relationship being steeper for micro- than macrovascular disease [1]. The current, widely accepted definition of type 2 diabetes uses thresholds for fasting (≥7.0 mmol/l) and 2 h post-load (standard 75 g OGTT) plasma glucose concentration (≥11.1 mmol/l): values exceeding either level are diagnostic [2]. Criteria for lesser degrees of dysglycaemia are less well established. Impaired glucose tolerance (IGT), defined by the 2 h glucose concentration, carries heightened risk of diabetes [35] and, possibly, cardiovascular disease [67]. The pathophysiology of IGT is a combination of insulin resistance, hepatic as well as peripheral, and beta cell dysfunction [8] that is similar to that of diabetes, only of a milder degree. As an obvious antecedent of type 2 diabetes, IGT is identified by a screening OGTT, which costs time, facilities and money. More recently, a new category of impaired glucose regulation (IGR, in WHO terms [9]) was introduced to obviate the investment in OGTT screening, namely impaired fasting glucose (IFG), defined as fasting plasma glucose between 6.1 and 7.0 mmol/l. While clearly a risk condition for type 2 diabetes, IFG is believed to have a partially different pathophysiological basis to that of IGT, being characterised by hepatic rather than peripheral insulin resistance and by more marked beta cell dysfunction [10]. Before this definition was widely accepted, it was changed and the lower threshold for fasting glycaemia lowered to 5.6 mmol/l [11]. Rationale and value of the new IFG definition have been criticised [12].

The present work takes advantage of 7 year follow-up data from the Mexico City Diabetes Study [13], a population-based observational study in a population with a high prevalence of diabetes, to assess the value of categorisation of prediabetic states with respect to incident diabetes. Assessment was done by systematically analysing RR, population-attributable risk (PAR) and clinical phenotype of old and new definitions of IFG and IGT.

Methods

Participants

Data were collected as part of the Mexico City Diabetes Study [13], a population-based survey of diabetes and cardiovascular risk factors. Low-income neighbourhoods in Mexico City were selected and a complete enumeration of these was carried out from November 1989 to October 1992. Among the 15,532 inhabitants of these neighbourhoods, 3,505 eligible individuals (35–64 year-old men and non-pregnant women) were identified. Of these, 2,282 (1,342 women, 940 men) completed baseline medical examinations at the clinic. Participants with type 1 diabetes were excluded. Type 2 diabetes was classified as a fasting plasma glucose concentration ≥7.0 mmol/l or a 2 h plasma glucose concentration ≥11.1 mmol/l after a standard 75 g OGTT [13]. Participants who reported having a history of diabetes and who at the time of their clinical examination were taking oral glucose-lowering agents were also considered to have type 2 diabetes, regardless of their plasma glucose values. Diabetic participants on insulin whose age of onset was ≥40 years or whose BMI was >30 kg/m2 were also considered to have type 2 diabetes. After excluding 319 type 2 diabetic participants at the baseline examination (196 women, 123 men), 49 and 265 of the remaining 1,963 participants had IFG6.1 (fasting glucose 6.1–6.9 mmol/l) and IGT (fasting glucose <7.0 mmol/l and 2 h glucose 7.8–11.1 mmol/l), respectively. Isolated IFG6.1 and isolated IFG5.6 (5.6 mmol/l threshold) were defined as 2 h glucose <7.8 mmol/l and fasting glucose between 6.1 and 6.9 mmol/l or between 5.6 and 6.1 mmol/l, respectively. Isolated IGT, isolated IGT6.1 (6.1 mmol/l threshold) and isolated IGT5.6 (5.6 mmol/l threshold) were defined as 2 h glucose between 7.8 and 11.1 mmol/l and fasting glucose <5.6 mmol/l or between 5.6 and 6.1 mmol/l or 6.1 and 6.9 mmol/l, respectively. Two waves of follow-up were completed 3.25 and 7 years later, when 1,770 and 1,753 participants, respectively, of the original cohort were re-examined.

The protocol was approved by the Ethics Committee of the Centro de Estudios en Diabetes, Centro de Investigacion en Salud Poblacional, Instituto Nacional de Salud Publica, Mexico City. All participants gave informed consent.

Anthropometric measurements

Diabetes in at least one parent or sib was classified as a positive family history of diabetes. Height, weight, waist and hip circumferences, and systolic and diastolic blood pressure were measured as described elsewhere [13].

Biochemical measurements

Blood samples were obtained in the fasting state and 2 h after a standard 75 g oral glucose load. Serum samples were centrifuged at 1,500×g, divided into aliquots and stored at −70°C until assayed. Fasting concentrations of serum insulin, proinsulin, plasma glucose, total cholesterol, LDL-cholesterol, HDL-cholesterol and triacylglycerol, and plasma glucose and insulin concentrations 2 h after an oral glucose load were determined as described elsewhere [13] at baseline and follow-up.

Statistical analysis

Data are presented as mean ± SD. Categorical variables were compared by the χ 2 test, continuous variables by ANOVA, with Bonferroni–Dunn testing of multiple post hoc comparisons. Logistic regression was carried out by defining response as a diagnosis of diabetes at either of the two follow-up visits and non-response as no diabetes at the last visit; results are expressed as OR with 95% CI. For each continuous variable in a multiple model, OR was calculated for 1 SD of the population value of that variable, in order to compare the relative importance of the variables. The PAR (%) was calculated as: 100〈f (RR–1)/{1 + [f (RR–1)]}〉, where f is the prevalence of a given condition and RR is the relative risk of incident diabetes for that condition. When using a multi-level nominal variable for the five mutually exclusive categories of glucose intolerance, a binomial regression model with log-link function was used. For these RRs, PAR was calculated according to the following formula:

$${\text{PAR}}_i = {{p_i \;\left( {{\text{RR}}_i - {\text{1}}} \right)} \mathord{\left/{\vphantom {{p_i \;\left( {{\text{RR}}_i - {\text{1}}} \right)} {\left[ {{\text{1}} + \Sigma _j p_j \left( {{\text{RR}}_j - {\text{1}}} \right)} \right]}}} \right.\kern-\nulldelimiterspace} {\left[ {{\text{1}} + \Sigma _j p_j \left( {{\text{RR}}_j - {\text{1}}} \right)} \right]}}{\text{,}}\;i = {\text{1,}}\; \ldots {\text{,}}\;{\text{5}}$$

The corresponding 95% CIs were computed using bootstrap sampling (10,000 replicates) [14]. Log-binomial regression and bootstrap estimation were performed using Stata 8 (StataCorp (2005) Stata Statistical Software: Release 8; StataCorp, College Station, TX, USA). For individual categories of glucose intolerance (2 × 2 tables), PAR and CI were calculated using R 2.8.0 statistical software [15]. Receiver operating characteristic (ROC) analysis was carried out by standard methods. A p value ≤0.05 was considered statistically significant.

Results

In this relatively young population (mean age 47 years) with a high prevalence of diabetes (13.8% at baseline), IGT was also very prevalent (13.7%), five times more so than IFG6.1 (2.4%). Of participants with IFG6.1, 60% also had IGT, whereas only 11% of those with IGT also had IFG6.1.

The RR of diabetes was 4.01 (95% CI: 3.12–5.14) for participants with IGT and 3.73 (95% CI: 2.18–6.39) for those with IFG6.1, corresponding to a PAR of 29 (95% CI: 26–32) and 6% (95% CI: 5–7), respectively. With the glucose threshold for IFG set at 5.6 mmol/l instead of 6.1 mmol/l, the new IFG group (IFG5.6) counted 176 individuals, for whom the RR of diabetes was 4.28 (95% CI: 3.21–5.71) and the corresponding PAR 23% (95% CI: 20–25).

When considering the six mutually exclusive combinations of fasting and 2 h plasma glucose concentrations (Fig. 1), the clinical phenotype of participants who do not have normal glucose tolerance (NGT) was generally similar, all dysglycaemic groups being older, heavier, hyperinsulinaemic, hyperproinsulinaemic and dyslipidaemic (raised triacylglycerols, reduced HDL-cholesterol) compared with the NGT group (Table 1).

Fig. 1
figure 1

Individual fasting and 2 h plasma glucose concentrations in the non-diabetic segment of the Mexico City Diabetes Study population. The 2 h glucose threshold (7.8 mmol/l) and the two fasting glucose thresholds (5.6 and 6.1 mmol/l) identify six mutually exclusive (dotted lines) categories of glucose tolerance. Pink, iIGT; yellow, iIGT5.6; blue, iIGT6.1; dark blue, NGT; red, iIFG5.6; green, iIFG6.1

Table 1 Clinical phenotype in participants belonging to six mutually exclusive categories of baseline glucose tolerance

Over 7 years of follow-up, 165 participants developed type 2 diabetes (crude incidence rate 1.34% per year). As shown in Table 2, 39% of incident diabetes originated from individuals with 2 h plasma glucose levels ≥ 7.8 mmol/l, while 27% derived from participants with IFG. Of the latter, however, more than two thirds also had IGT, isolated forms of IFG contributing only 7% to all incident diabetes cases. The majority of diabetes converters originated from the pool of NGT individuals.

Table 2 Participants developing diabetes over 7 years in six mutually exclusive categories of baseline glucose tolerance

The RR of incident diabetes in participants with isolated IFG6.1 was not significantly increased compared with that of NGT participants, while in those with isolated IFG5.6 the risk was threefold increased and similar to that of participants with isolated IGT. Relative risk was further increased (six- to sevenfold) in participants with non-isolated IGT, regardless of whether their fasting glucose was 5.6 to 6.1 or 6.1 to 6.9 mmol/l (Fig. 2). The corresponding PARs are plotted in Fig. 3.

Fig. 2
figure 2

Relative risk of developing diabetes in five mutually exclusive categories of glucose intolerance. Risk ratios (plotted on a log scale) are relative to the risk in NGT individuals

Fig. 3
figure 3

PAR of developing diabetes in the five mutually exclusive categories of glucose intolerance seen above (Fig. 2)

In a multiple logistic model adjusting for sex, age, familial diabetes and BMI, fasting and 2 h plasma glucose concentrations (as continuous variables) were independent predictors of incident diabetes, with a somewhat higher OR for 2 h than for fasting glucose (Fig. 4) and an area under the ROC curve for this model of 79.5%. When the same model was run with tertiles of fasting and 2 h glucose simultaneously, only the top tertile of fasting glucose carried a significant independent risk of diabetes, whereas the two top tertiles of 2 h glucose conveyed increasing risk. A fasting glucose between 5.0 and 6.8 mmol/l was associated with a similar risk to that from 2 h glucose between 6.4 and 11.1 mmol/l. When fasting and 2 h glucose were replaced by the multilevel categorical variable describing the five mutually exclusive categories of glucose intolerance (Fig. 1), the area under the ROC curve was 77.3%. An expanded model of incident diabetes included serum proinsulin levels and diastolic blood pressure as independent, positive predictors and 2 h plasma insulin concentrations and HDL-cholesterol levels as negative predictors (Fig. 5).

Fig. 4
figure 4

Multivariate predictive model of incident type 2 diabetes including factors as shown. For continuous variables, odds ratio (plotted on a log scale) was calculated for one standard deviation of each variable

Fig. 5
figure 5

Expanded predictive model of incident type 2 diabetes (2 h plasma insulin and fasting proinsulin concentrations log-transformed). Odds ratios are plotted on a log scale

Discussion

The present analysis demonstrates that: (1) when glucose concentration is used as a continuous variable, both fasting and post-load plasma glucose predict incident diabetes independently of other predictors and of one another; (2) the clinical phenotypes of IFG and IGT are generally similar; (3) isolated IFG6.1 is not a diabetes risk condition; (4) lowering the diagnostic threshold for IFG from 6.1 to 5.6 mmol/l substantially raises RR and PAR; and (5) performing an OGTT captures substantially more diabetes progressors than those identified by IFG5.6. These conclusions need to be seen in context.

Several previous studies have reported on the predictive role of fasting or post-load glucose levels for incident type 2 diabetes, but the two have rarely been assessed together and after controlling for other strong predictors (age, BMI and familial diabetes) [4, 5]. The present analysis shows that an equivalent (i.e. 1 SD of the population distribution) change in fasting and 2 h glucose levels is associated with a 45 and 87% increased risk of diabetes, respectively 7 years later (Fig. 4). From the pathophysiological standpoint, it is hardly surprising that both glucose measures should be predictive independently of one another. In fact, fasting glucose is regulated by endogenous glucose output, under the dual control of pre-hepatic plasma insulin and glucagon levels, and by fasting glucose clearance (mostly by non-insulin-dependent tissues) [16]. In contrast, 2 h plasma glucose levels are principally the result of glucose-induced insulin secretion (in particular, beta cell glucose sensing) and insulin-stimulated glucose clearance [17]. Suppression of endogenous glucose output during the post-glucose period, i.e. hepatic insulin sensitivity, also contributes to limitation of glucose excursions. In addition, as repeatedly demonstrated by studies employing the euglycaemic–hyperinsulinaemic clamp technique coupled with glucose tracers, hepatic and peripheral tissue (mainly skeletal muscle) sensitivity to insulin are correlated with one another across a range of conditions [18]. This correlation is consistent with the fact that fasting and 2 h glucose concentrations are also intercorrelated (with a coefficient of determination of 62% in the present study population). As a result, any prognostic criterion for type 2 diabetes based on both plasma glucose measures will translate into either measure alone being predictive and each making an independent contribution to diabetes risk. The fact that post-load glucose values are stronger predictors than fasting levels in this population also reflects the wider variation in the former (twice as wide) than in the latter (coefficient of variation of 30 vs 14% in our cohort). More specifically, fasting glucose levels in the range 5.0 to 6.8 mmol/l carry an equivalent predictive power to that of 2 h glucose levels in the range 4.9 to 6.4 mmol/l, while higher 2 h values still within the IGT range (6.4–11.1 mmol/l) are much stronger independent predictors.

The use of threshold values of continuous variables to diagnose clinical conditions is practical and time-honoured. The validity of thresholds hinges upon their relation to end-organ damage, complications or mortality. As epidemiological or clinical trial data accumulate, the identification of thresholds improves in accuracy. What also matters for healthcare purposes is the amount of risk in the general population that can be attributed to predictors (PAR) and, therefore, the fraction of morbidity that can be prevented by intervening on risk factors. The present data show that, with the current definitions of all three forms of glucose intolerance (type 2 diabetes, IGT and IFG), the PAR is approximately fivefold higher for IGT than for IFG. As expected, if the IFG threshold is decreased by 9% (from 6.1 to 5.6 mmol/l), the PAR rises considerably. However, even with a lower threshold, IFG5.6 as such accounts for only 19% of diabetes progressors, a much larger proportion (∼40%) being derived from individuals with IGT. On the other hand, if the same decrement were applied to the IGT lower threshold (from 7.8 to 7.1 mmol/l), the PAR would rise to 34% (95% CI: 30–38), i.e. one-third of the total population risk. These calculations, derived as they are from population-based, longitudinal observations in an ethnic group with a rather high diabetes prevalence, serve the sole purpose of providing an objective platform upon which policy decisions should rest. Thus, no single test or threshold is best under all circumstances and for all purposes. If the goal is to extract from the general population as many individuals at risk as possible, then the relevant variable is the PAR. IFG5.6 and IGT would have roughly equivalent sensitivity, the former being a cheaper tool than the latter. If, on the other hand, the objective is to capture persons at high risk with good specificity, then use of a combined criterion (isolated IGT [iIGT]) would maximise RR and PAR relative to the background population (Figs 2 and 3). The final choice must weigh these operating characteristics against such factors as cost, logistics and healthcare agenda. In this respect, ‘isolated’ forms are only of scientific interest (and modest at that).

In a population-based study of 1,342 Dutch participants with a mean age of 60 years, de Vegt et al. [19] also reported similar RR for patients with IGT and IFG6.1 to develop diabetes over 6 years; overlap between the two dysglycaemic states was not, however, analysed, nor was PAR reported; the phenotypic characterisation only included anthropometrics and blood pressure values.

One important finding in our study was that high-risk individuals, be they IFG or IGT by any definition, share a common clinical phenotype, with only minor differences (Table 1). As a result, in an expanded prediction model (Fig. 5), higher serum proinsulin and diastolic blood pressure levels confer additional independent risk (45 and 19%, respectively, for each SD of their population values), while higher serum HDL-cholesterol and 2 h post-glucose plasma insulin levels independently reduce (by 26%) risk of incident diabetes. Again, known pathophysiology underlies these stochastic features. Thus, high serum proinsulin concentrations are reputed to stand for early beta cell dysfunction (incomplete processing of proinsulin to insulin) [20]; hypertension co-segregates with manifest diabetes and higher blood pressure levels co-track insulin resistance [21]; low HDL-cholesterol concentrations cluster with insulin resistance [22]; and lower post-load plasma insulin levels have been shown to predict and precede onset of diabetes in the Mexico City population [23]. Therefore, a screening algorithm for diabetes risk could include not only anthropometric but also other traits of the prediabetic state, with or without [24], plasma glucose measures.