Introduction

figure a

Diabetic kidney disease (DKD) is the cause of most renal failure and impaired renal function in type 1 diabetes mellitus. As such it is a major contributor to the reduced life span in type 1 diabetes [1]. Accordingly, developing drugs to prevent or reverse impaired renal function is an important goal. However, there is wide variation in the rate of renal function decline among those with type 1 diabetes with some people being much more susceptible than others. This makes conducting clinical trials of drugs challenging because, over the typical trial follow-up time, the average loss in renal function is modest [2]. An improved ability to predict which people with diabetes will progress most rapidly would facilitate oversampling of such people into clinical trials, thereby improving trial power.

To this end there have been several large-scale investments into developing biomarkers for renal disease progression in diabetes. However, as we recently reviewed [3], while there are many sporadic reports of single biomarkers, there have been few systematic attempts to harness the potential of high-dimensional biomarker assays for prediction. Furthermore, many studies do not evaluate how well biomarkers improve prediction on top of clinical record data.

In this study we aimed to identify circulating serum biomarkers that are associated with progression of renal disease in individuals with type 1 diabetes mellitus and to assess their usefulness in a prediction setting when either limited or comprehensive sets of clinical characteristics are available. We evaluated the same set of biomarkers in two different cohorts, the Scottish Diabetes Research Network Type 1 Bioresource (SDRNT1BIO) [4] and the Finnish Diabetic Nephropathy study (FinnDiane) [5], to assess reproducibility and generalisability of the results across a range of characteristics. Moreover, we used two different technologies (Luminex and LC electrospray tandem MS [LC-MS/MS]) to obtain a comprehensive coverage of both candidate and discovery biomarkers for renal disease.

The objectives of this study were first, to determine the association of each biomarker measured with eGFR achieved at the end of follow-up and with rapid progression status in both cohorts; and second, to select from these a parsimonious panel of biomarkers that would be helpful in predicting renal function decline in both cohorts.

Methods

Participants

SDRNT1BIO [4] is a prospective cohort study comprising 6127 people with a clinical diagnosis of type 1 diabetes mellitus, representing 25% of all adults with type 1 diabetes in Scotland, recruited between December 2010 and November 2013. At recruitment, clinical measurements and a blood sample were taken. From electronic healthcare records we extracted routine health-related data at biosample date, or the closest up to 18 months before. These were supplemented by direct measures of eGFR and albuminuria from samples collected at recruitment. For this study we selected 859 individuals with eGFR between 30 and 75 ml min−1 [1.73 m]−2 at study biosample date and with at least three prospective eGFR determinations over a period of at least 2 years or incident end-stage renal disease (ESRD).

The FinnDiane study is a prospective nationwide multicentre study comprising more than 8400 adults with type 1 diabetes mellitus covering more than 15% of all the type 1 diabetic individuals in Finland [5]. Individuals participated in the study during a regular visit to their attending physician during which detailed demographic and medical history data were collected with standardised questionnaires. For this study, a subpopulation of 315 participants studied between 1994 and 2011 was included, comprising those who at biosample date had an eGFR between 30 and 75 ml min−1 [1.73 m]−2 and who had micro- or macroalbuminuria. Participants were also required to have an eGFR measurement within half a year from the biosample date and at least three prospective eGFR determinations over a time period of at least 3 years. If the person developed ESRD, three prospective eGFR measurements over a time period of at least 1 year was accepted.

In both studies participants were classified as normo-, micro- or macroalbuminuric at baseline according to their albumin/creatinine ratio (ACR) falling in the intervals 0–3.39 mg/mmol, 3.39–33.9 mg/mmol, or above 33.9 mg/mmol, based on two out of three consecutive measurements before baseline.

Both studies were performed in accordance with the Declaration of Helsinki; all participants gave their written consent and the study protocol was approved by the local ethics committees.

Renal outcomes

eGFR was calculated with the CKD-EPI equation [6] using serum and plasma creatinine values retrieved retrospectively and prospectively from medical records. In SDRNT1BIO, these excluded readings concurrent with hospital admissions.

A summary measure of the historical eGFR was obtained by computing a weighted average of all retrospective eGFR records for each person, with weights inversely related to the amount of time leading to the biosample date. Participants with no retrospective eGFR data had their historical eGFR imputed to study day eGFR.

Achieved eGFR was defined as the median eGFR reading of the last 6 months of follow-up. Initiation of renal replacement therapy (RRT) was considered to indicate an achieved eGFR of 10 ml min−1 [1.73 m]−2 and all subsequent readings were censored.

The decline of renal function was estimated by fitting a simple linear regression model to the serial prospective eGFR determinations of each person. A rapid progressors category was defined by dichotomising prospective linear slopes according to a threshold of an average loss of more than 3 ml min−1 [1.73 m]−2 year−1.

Biomarkers measured and analysed

We measured a total of 297 biomarkers in non-fasting serum samples using: (1) the Luminex platform at the CLIA certified Myriad RBM laboratory (Austin, Texas, USA) to assay 30 protein biomarkers; and (2) LC-MS/MS at the WellChild laboratory (King’s College London, UK) to capture 121 metabolites (including six ratios between pairs of metabolites) and 146 tryptic peptides.

A number of quality control steps were performed in each of the two studies prior to the statistical analysis stage. This led to excluding from the analyses biomarkers considered uninformative: those that reported over 98% of undetectable readings and those that had more than 50% of their values missing at random (19 and 8 in SDRNT1BIO, and 20 and 7 in FinnDiane, respectively). We evaluated reproducibility of biomarker measurements by computing intra-class correlations on duplicate samples from a pilot study on a subset of participants (35 duplicate samples in SDRNT1BIO, 25 in FinnDiane). According to this, we removed biomarkers that had an intra-class correlation <0.4 (41 in SDRNT1BIO and 59 in FinnDiane).

Serum samples were stored at −80°C in SDRNT1BIO and −20°C in FinnDiane. Multiple freeze/thaw cycles and suboptimal storage conditions are known to affect some biomarkers. In a separate study we considered 16 FinnDiane sample aliquots stored at −20°C and −80°C: this allowed us to identify which biomarkers are susceptible to, and which are unaffected by, poor storage conditions. Among the biomarkers that passed the quality control steps above, 24 biomarkers were identified to be affected by storage conditions and removed from further analyses in FinnDiane, while they were still considered for SDRNT1BIO in univariate analyses.

Overall, 191 biomarkers were analysed in SDRNT1BIO (27 proteins, 81 metabolites and ratios, and 83 tryptic peptides) and 167 biomarkers in FinnDiane (22 proteins, 63 metabolites and ratios, and 82 tryptic peptides).

Within each study, left-censored values were imputed to half the detection threshold; right-censored values (only present for N-terminal prohormone of brain natriuretic peptide) were imputed to the largest value reported; values missing at random were imputed to the median.

Univariate analysis

Biomarkers were evaluated independently in linear and logistic regression models adjusted for age, sex, diabetes duration, eGFR at biosample date and length of follow-up (basic covariates). To incorporate more information about past and current status of renal function, we also considered models containing ACR category and the weighted average of historical eGFR and alongside the basic set of covariates. We further adjusted models for BMI, systolic BP (SBP), diastolic BP (DBP), HbA1c, HDL-cholesterol, total cholesterol, smoking status, ACR category and weighted average of historical eGFR (full covariates). All continuous variables were Gaussianised prior to fitting the models, and standardised to zero mean and unit SD. Associations were declared significant at p < 10−4, which is particularly conservative even for the number of tests performed (Bonferroni-corrected threshold would be 0.05/191 = 2.6 × 10−4).

Construction of sparse panels of biomarkers

Only the biomarkers with valid data in both studies were considered for the purpose of the selection of a sparse panel of biomarkers (22 from the Luminex platform and 145 from the LC-MS/MS platform). The two biomarker sets were modelled independently from each other. However, for comparison we also generated panels of biomarkers using both platforms simultaneously, although a panel requiring two platforms would not be cost effective or easily deployable in the clinical setting.

We adopted a Bayesian modelling approach based on hierarchical shrinkage priors [7], in which the clinical covariates used to control for confounding in the models were assigned a Gaussian prior (which induces some shrinkage), while the biomarkers were penalised through the horseshoe prior to promote sparsity [8, 9]. The hierarchical shrinkage approach was implemented using the Stan Bayesian inference framework [10]. See electronic supplementary material (ESM) Methods for more details.

We evaluated the predictive performance of the biomarker models on withdrawn data using 10-fold cross-validation, and compared models including biomarkers to baseline (unpenalised) models that only contain clinical covariates, also fitted using Stan. For each set of baseline covariates used, we reported the difference in log-likelihood (computed on the test observations from 10-fold cross-validation, and expressed in natural log units) between the model with biomarkers and the model including only the clinical covariates. For linear regression models we computed the r2 as the squared Pearson correlation coefficient between observed and predicted outcome. For logistic regression models, besides reporting the area under the receiver operating characteristic curve (AUC), we also presented the expected information for discrimination Λ expressed in bits [11]. This is a better measure of the incremental contribution of biomarkers to the predictive performance, as it captures the amount of additional information that they contain over and beyond the initial set of clinical covariates (see ESM Methods for more details). Computations were done with the R package wevid (version 0.6: https://CRAN.R-project.org/package=wevid).

To recover a sparse model, we then applied a projection approach according to which the high-dimensional posterior draws of the model containing all biomarkers (full model) are projected to lower-dimensional subspaces [12, 13] (see ESM Methods for more details). This procedure allowed us to rank the biomarkers in terms of importance. Each candidate model was then evaluated in terms of their contribution to the predictive performance relative to the performance of the full model, so that we could plot the relative explanatory power obtained by biomarker panels of different sizes.

Results

Participant characteristics

Table 1 reports the summary characteristics for the two cohorts analysed.

Table 1 Cohort characteristics at baseline

The length of follow-up was shorter in SDRNT1BIO as compared with FinnDiane (5.2 vs 8.8 years), the former being a more recently established cohort. FinnDiane participants were generally at a more advanced stage of renal function decline, with starting eGFR being lower despite their younger age, reflecting the fact that these individuals were oversampled for albuminuria. Similarly, the rate of progression of renal decline detectable during follow-up differed between the two cohorts in terms of prospective eGFR slopes (−0.83 vs −2.44 ml min−1 [1.73 m]−2 year−1 in SDRNT1BIO and FinnDiane, respectively) and of rapid progression (22.6% vs 40.3%). ESM Table 1 shows the characteristics of rapid progressors to non-progressors in each cohort. Of note, point estimates for HbA1c and SBP are somewhat higher, and HDL-cholesterol lower, in progressors than non-progressors in both cohorts.

Biomarkers explored

ESM Table 2 shows the full list of biomarkers measured with median, interquartile range (IQR) and range in each of the studies, and reason for removal of a biomarker from the analysis. There are important distributional differences in some of the biomarkers that may be due to depletion caused by suboptimal storage conditions of the FinnDiane samples, and may also reflect the more advanced stage of kidney disease in FinnDiane.

Univariate associations

When modelling achieved eGFR adjusted for age, sex, diabetes duration, eGFR and length of follow-up, 46 and 14 biomarkers were statistically significant in SDRNT1BIO and FinnDiane, respectively, and 12 were significant in both. Table 2 shows remarkable consistency in the strongest associations between the two cohorts, with CD27 antigen (CD27) having the largest effect size in both studies. Effect sizes in FinnDiane, where albuminuria rates were higher, were generally larger than in SDRNT1BIO. Consistent with this, there was some evidence that associations were stronger in those with above rather than below median ACR in SDRNT1BIO (e.g. for CD27: β = −0.31 vs −0.21, p = 0.021 for interaction).

Table 2 Associations of each biomarker (considered separately) with achieved eGFR from linear regression models adjusted for age, sex, duration of diabetes, study day eGFR and length of follow-up

When these models were further adjusted for ACR at baseline, the coefficients were reduced somewhat, but the top associations all remained highly significant (ESM Table 3). After adjusting models for the full set of clinical covariates, the largest effect sizes were slightly reduced mainly due to the addition of the weighted average of historical eGFR to the set of covariates, but nine Luminex proteins and four LC-MS/MS measured peptides or metabolites remained significantly associated with final eGFR (ESM Table 4).

Considering the binary outcome of rapid progression, associations were much weaker and fewer were significant (Table 3), but the rank ordering of the associations was similar to that using final eGFR as the outcome.

Table 3 Associations of each biomarker (considered separately) with rapid progression from logistic regression models adjusted for age, sex, duration of diabetes, study day eGFR and length of follow-up

Panels of biomarkers

Table 4 summarises the cross-validated predictive performance of the linear regression models for prediction of achieved eGFR using all the biomarkers with penalty parameters on top of various clinical covariates. On top of age, sex, diabetes duration, baseline eGFR and length of follow-up, the biomarkers from each of the platforms yielded only modest increments in prediction of final eGFR. The Luminex platform gave a similar increment in performance to the LC-MS/MS platform in SDRNT1BIO and somewhat greater performance in FinnDiane. The r2 increased from 0.47 to 0.58 in SDRNT1BIO, and from 0.33 to 0.48 in FinnDiane when adding the Luminex panel to a model containing age, sex, diabetes duration, eGFR and length of follow-up.

Table 4 Cross-validated performance of models for prediction of achieved eGFR

Similarly, as shown in the AUC data in Table 5, biomarkers from both platforms modestly incremented the prediction of rapid progression of renal disease independently of age, sex, diabetes duration, eGFR and length of follow-up. Both the base model without biomarkers and the model including biomarkers were more predictive in FinnDiane. This is possibly because of longer follow-up and the more advanced stage of renal decline of their participants. Using the metric of expected information for discrimination rather than AUC, the conclusions are similar, i.e. that the biomarkers provide only modest additional predictive information on top of clinical covariates. For example, the addition of both panels to the set of basic covariates provides 0.3 and 0.4 extra bits of information for the prediction of rapid progression in SDRNT1BIO and FinnDiane, respectively. Note that the expected information for discrimination metric shows that the increment in prediction of rapid progression with biomarkers in FinnDiane is similar or slightly greater than in SDRNT1BIO, consistent with the generally stronger univariate associations. See discussion for further consideration of what the expected information for discrimination shows that an AUC does not.

Table 5 Cross-validated performance of models for prediction of rapid progression

By applying the forward selection projection approach, we determined a ranking of the biomarkers within each platform (ESM Table 5). Figure 1 displays the ranking of the first three biomarkers for the two platforms considered separately and jointly, when predicting achieved eGFR in models adjusted for basic clinical covariates. This shows that, in general, the gains in explanatory power flatten out after the first few biomarkers are added to the initial set of clinical covariates. Indeed, for the Luminex platform, 75% of the explanatory power is produced by CD27 and kidney injury molecule 1 (KIM-1) in both cohorts.

Fig. 1
figure 1

Contribution of biomarker sets to prediction of achieved eGFR when starting from a model containing only age, sex, diabetes duration, study day eGFR and length of follow-up (basic covariates) in SDRNT1BIO (a, c, e) and FinnDiane (b, d, f). The sets of biomarkers added were: (a, b) Luminex platform; (c, d) LC-MS/MS platform; (e, f); both platforms together. Note that for (c) to (f) the plotting of the curve was interrupted after 25 biomarkers: by definition, the curve would gradually converge to 1 with the addition of all remaining biomarkers. The names of the ten highest ranking biomarkers in each setting are provided in ESM Table 5. Alb T70, albumin T70; B2M, β-2-microglobulin, Cys-C, cystatin C, HSP60, heat-shock protein 60 kDa; Sial, free sialic acid; THP, Tamm–Horsfall urinary glycoprotein; Trp/Kyn, tryptophan/kynurenine

To consider whether the increment in prediction obtained with these biomarkers would be of practical use, we calculated the enrichment in the rate of rapid progression that would be obtained if the biomarkers were measured in potential entrants to a trial with characteristics similar to the cohort participants. By oversampling into the trial according to varying percentile cut-offs for the risk score from models with and without biomarkers, we could produce the curves of Fig. 2. For example, if the most extreme 25% of participants for biomarkers were oversampled then the expected incidence of rapid progression would go from about 23% to 35% in SDRNT1BIO. By contrast in FinnDiane, where most participants were already albuminuric and rapid progression rates are higher, the enrichment would be less, going from 63% to 73%.

Fig. 2
figure 2

Expected cumulative incidence of rapid progression if a clinical trial was enriched with the top percentile of possible participants according to their risk score in SDRNT1BIO (a) and FinnDiane (b). The baseline model contained only age, sex, diabetes duration, study day eGFR and length of follow-up (red line) or a model augmented with CD27 and KIM-1 (blue line). The observed event rate is represented by the horizontal dashed line

Discussion

The key findings from this study were that, despite a large number of the biomarkers evaluated showing highly significant associations with eGFR and its decline: (1) the marginal increment in prediction using these biomarkers panels is modest; (2) almost all of the predictive information can be obtained using just two biomarkers (CD27 and KIM-1); and (3) using weighted historical eGFR and albuminuria gives similar prediction performance to using the biomarkers. Nonetheless, in some clinical trial settings where historical records are not available, oversampling those with the highest levels of CD27 and KIM-1 may be useful for stratification into the trial.

Our data emphasise that, while identifying biomarkers that are associated with eGFR and its decline is rather straightforward, building a biomarker panel is not as simple as considering the top few strongly associated biomarkers. The creation of biomarker sets that are jointly predictive needs to take account of the correlation between the biomarkers. When this is done we can see that almost all the gains in prediction are from just a few biomarkers, here CD27 and KIM-1.

In the SUMMIT study (Surrogate Markers for Micro-and Macrovascular Hard Endpoints for Innovative Diabetes Tools), we have previously shown that KIM-1 was the most useful of a wide range of biomarkers associated with eGFR loss in type 2 diabetes [14]. KIM-1 is expressed on the apical membrane of kidney proximal tubule cells. It is well established as a urinary marker of acute kidney injury. Serum KIM-1 has previously been reported to predict eGFR decline and incidence of ESRD in type 1 diabetes [15, 16]. Furthermore, a Mendelian randomisation approach utilising genetic data to infer causality suggested that urinary KIM-1 would be a causal risk factor for decline in eGFR [17]. However, it remains an open question whether it is a causal biomarker or not.

CD27 is a member of the tumour necrosis factor (TNF) receptor superfamily and is required for the maintenance of T cell immunity. We are not aware of previous reports of CD27 in DKD. However, other members of this superfamily (TNF receptor [TNFR]1 and TNFR2) have previously been reported as biomarkers of eGFR decline by us and others in type 2 diabetes [18, 19] and type 1 diabetes [20]. While circulating levels of CD27 correlate with structural renal abnormalities [18] and while there is considerable evidence for a role of the immune system and inflammation in DKD, whether these associations of members of the TNF receptor superfamily with DKD are causal or secondary to altered filtration remains unclear.

A key strength of our study was the deliberate use of two cohorts with differing characteristics and rates of disease. The SDRNT1BIO cohort was not oversampled for albuminuria but the FinnDiane cohort set used was. Accordingly, for similar starting eGFR, the rate of progression in FinnDiane was higher. In general, biomarker associations with final eGFR were stronger among those with albuminuria and the increment in prediction gained with biomarkers was slightly greater in the FinnDiane cohort. The differing storage conditions in the two cohorts (−80°C with no freeze/thaw cycles in SDRNT1BIO and −20°C in FinnDiane) allowed us to examine which of these biomarkers are highly sensitive to storage—an important consideration for their practical deployment in clinical settings.

Another key strength of our study is the use of advanced statistical methods that avoid over-optimistic assessments of prediction. These include cross-validation and use of penalty parameters that account for the high number of analytes being evaluated. We also introduce here a new measure of prediction performance—the expected information for discrimination. See [11] for the full derivation and discussion of this measure, but an important advantage is that it scales linearly so one can directly compare the increment in prediction gained from two differing base levels. As shown in Table 5, the AUC increases from 0.51 to 0.63 with both biomarker platforms in SDRNT1BIO, but from 0.70 to 0.73 in FinnDiane. Although the reader might be misled into thinking the increment in prediction is therefore greater in SDRNT1BIO, in fact the increase in information that is needed to increase an AUC from the higher starting point of 0.70 to 0.73 (0.4 bits) is actually slightly greater than that needed to increase an AUC from 0.51 to 0.63 (0.3 bits). The reason is that, although not commonly appreciated, in fact the AUC does not scale linearly and one cannot compare the absolute increment in AUC from two differing base levels.

Another important strength of our study is our transparency with regard to showing the effect of adjustment for baseline characteristics and for historical eGFR data. As we recently reviewed [3], many reports of biomarker usefulness are overstated because the marginal gain in prediction beyond baseline and historical eGFR are not reported. In both cohorts examined, the contribution of the biomarkers became much less important when a full set of covariates are used in the model, a result that reflects what was observed in earlier high-dimensional biomarker studies [14, 19]. Of course, while in a clinical setting current and historical measurements are available, these may not be readily accessible in other situations, such as in the screening phase of a clinical trial. In the latter case it may be hard to capture more than the basic set of clinical covariates used in our study. Accordingly, we have provided enrichment plots to give a sense of the potential use of biomarkers in such a context. The increments in prediction we have found are such that a sparse panel of KIM-1 and CD27 is unlikely to be of use in clinical decision making. However, an important point to appreciate is that even a small enrichment of event rate in trials can be worthwhile in terms of sample size, power and resulting cost.

A limitation of our study is the relatively short follow-up and low progression rates in SDRNT1BIO. However, we emphasise that this cohort is representative of the current reality of progression rates in those with stage 2 and 3 DKD in type 1 diabetes. Furthermore, the follow-up time is similar to the length of many trials allowing the challenge of demonstrating drug effects, and the need to oversample those with most progression to be clearly understood. The effect of this shorter follow-up time is that the observed slope and rapid progressor status are less stable and subject to more misclassification in SDRNT1BIO, which will make associations tend more towards the null. Accordingly the biomarker outcome associations were less strong in SDRNT1BIO. However, this lesser progression would not introduce false positive associations. Another limitation is that we have evaluated the marginal improvement in prediction with biomarkers on top of baseline eGFR for predicting final eGFR. However eGFR itself is an imperfect measure of actual GFR [21]. Where measurements are noisy, increments in prediction with biomarkers may be more than would occur if GFR were directly measured. However, since in most settings where biomarkers might be used GFR is not directly measured, it is the increment on eGFR that is most relevant. Another limitation is that we were not able to assess incremental prediction on top of the validated Kidney Failure Risk equation as data on calcium and phosphate levels were not available in most participants [22].

Conclusions

In summary, despite extensive investigation of a wide range of biomarkers, we can demonstrate only modest increments in prediction of future eGFR. The increment is of a magnitude that may be of use in some trial settings but not in clinical decision making. It remains possible, of course, that other biomarkers, or the same biomarkers assayed on other platforms, might achieve greater performance. We hope that the methodological approach we have adopted and illustrated in this work will be useful to others exploring the use of high-dimensional biomarker platforms in a prediction setting.