Introduction

Chronic kidney disease (CKD) is a global health problem1. CKD patients are widely prevalent, and the number of end-stage kidney disease (ESKD) patients is still increasing. ESKD patients requires cost-prohibitive kidney replacement therapy2. Moreover, CKD patients are highly vulnerable, and the risk of cardiovascular events and death increases with the progression of CKD stages3,4,5,6. Thus, it is critical to predict their risk for the progression to ESKD in CKD patients to avoid these unfavorable situations.

Taking the fact that kidney is one of metabolically-active organs7,8, the metabolic profiling of CKD patients is a promising method to identify new biomarkers for the prognoses of CKD patients. Recent cross-sectional studies have demonstrated the correlations between kidney function and the levels of certain metabolites in CKD patients9,10,11,12,13. The presence or altered levels of certain metabolites was also suggested in ESKD patients14. Population-based studies also demonstrated that the levels of some metabolites were associated with the incidence of CKD15,16 or with the worsening of kidney function, i.e., the decrease in estimated glomerular filtration ratio (eGFR, a calculated product of creatinine, sex, and age)17,18. A nested-cohort study in diabetic patients with early stages of CKD (mainly stage 1 and 2) showed a specific metabolomic profile in those who progressed to ESKD within an observational period19. The importance of metabolic profiling and the aberrance of metabolism in CKD patients have been emphasized20, however, the number of prospective metabolomic studies is limited thus far. Thus, whether certain metabolic changes could predict rapid progression of ESKD in referred CKD patients is largely unknown.

We conducted capillary electrophoresis and liquid chromatography mass spectrometry (CE-MS)-based metabolic profiling21,22 in a prospective longitudinal cohort, which was composed of referred CKD patients who were not on dialysis at entry. We aimed to prove that the levels of a subset of blood metabolites would predict the further worsening of kidney function, with the goal to raise the possibility that the measurements of these metabolites will facilitate the management of referred CKD patients.

Results

We performed a metabolic profiling from a prospective cohort, which consisted of referred CKD patients. Data were generated from 112 participants with median follow-up period of 4.4 years (IQR, 2.4–5.5). The baseline characteristic is shown in Table 1. None of the participants were lost to follow-up. Among these patients, 61 started kidney replacement therapy and 17 died, including 4 who died before initiating kidney replacement therapy.

Table 1 Baseline characteristics of the patients.

Metabolomic analyses were performed using the plasma samples of this cohort. Out of 579 metabolites in the library, a total of 218 metabolites were detected in the plasma of CKD patients. Among them, 129 metabolites, including creatinine and urea nitrogen, were above the detection limits in at least 40% of study subjects and were subjected to the rest of the analyses. The levels of 73 out of 129 metabolites were significantly correlated with eGFR (Figs S1–3): the increased levels of urea and creatinine, which represent kidney function, showed the strongest correlation with the decreased level of eGFR, whereas the levels of the rest of metabolites variably, but less strongly, correlated.

The levels of metabolites were divided into three tertiles and were subjected to the Cox regression analyses. The Holm-corrected multiple testing revealed that the levels of 26 out of 127 metabolites (creatinine and urea were excluded) had predictive values on the composite outcome (a combination of end-stage kidney disease requiring replacement therapy and all-cause of death, Table S1). These metabolites were detected in more than 80% of patients except for 2-hydroxyisobutyrate (67.5%), cytidine (56.4%), and 3-hydroxy-3-methylglutarate (53.0%). We fitted multiple Cox regression analyses to assess the predictive value of the levels of these metabolites by adjusting for eGFR, the level of urinary protein, and other clinical covariates such as the presence of diabetes. After adjustment for the clinical covariates, the levels of 16 out of 26 metabolites remained as significant predictors of the composite outcome when tertiles were analyzed as continuous variables (Table 2). The results were largely similar when analyzed for ESKD (Table S2). Patients in the top tertile of these metabolites had 2.0- to 8.0- fold higher adjusted hazard ratios for the composite outcomes, compared with those in the lowest tertile (Fig. 1 and Tables 2 and S2).

Table 2 Cox regression analysis of the effect of plasma metabolites on the risk of composite outcome.
Figure 1: Forest plot of adjusted hazard ratio of each tertile as categorical variable for composite outcome.
figure 1

First tertile was used as reference.

Kaplan-Meier curve analyses demonstrated that patients with the higher levels of 58 metabolites including creatinine and urea, and the lower levels of 4 metabolites, more frequently reached the composite outcome (Figs 2 and S4). Among them, 16 metabolites identified in the analysis above robustly divided the likelihood of reaching the composite outcome (Fig. 2). Because not a small fraction of this cohort is diabetic patients, we stratified the analyses by its presence (Table S3 and Fig. S5). The levels of some metabolites showed good predictive values specifically in the presence (4-oxopentanoate, glucoronate, 2-hydroxyisobutyrate, 5-oxoproline, pimelate, N-acetylneuraminate, 3-methylhistidine, phthalate, trp, hippurate, and 3-hydroxy-3-methylglutarate) or absence (citramalate and 2,3-pyrinedicarboxylate) of diabetes, although the levels of some showed the predictive values regardless of the presence or the absence of diabetes (isethionate, saccharate, trimethylamine N-oxide, cytidine, gluconate, guanidinosuccinate, and uridine).

Figure 2: Kaplan-Meier survival curves of selected metabolites for composite outcome.
figure 2

Patients with first (thick line), second (dotted line), and third (thin gray line) tertile of levels of metabolites were subjected to these analyses.

Discussion

By applying CE-MS-based metabolic profiling, we identified 16 prognostic metabolites in CKD patients. The risk of developing ESKD was elevated from 2.0- to 8.0-fold in those with higher or lower levels of plasma metabolites. Although the levels of some of the metabolites identified in this study correlated with eGFR, these metabolites are associated with kidney deterioration even when they were adjusted for the baseline kidney function. These findings may support the notion that measurement of these metabolites in CKD patients would be a useful tool to assess the risk of progression to ESKD.

The number of studies using metabolomics in CKD studies as a tool is increasing, which strongly suggests the growing interests of this field9,10,11,12,14,15,16,17,18,19. In their first application of metabolomics on CKD study, Toyohara et al. demonstrated correlations between the levels of 64 plasma metabolites and that of eGFR in CKD patients9. Rhee et al. showed that the levels of 49 metabolites were altered in ESKD patients on hemodialysis compared with age-matched control14. Shah et al. showed significant differences in the levels of some plasma metabolites among various stages of CKD 2–410. Hirayama et al. identified some metabolites whose levels were associated with the presence of nephropathy (assessed by the presence of macroalbuminuria) in diabetic patients11, while Sharma et al. demonstrated that the levels of 12 metabolites were associated with the presence of diabetes in CKD12. By nested-cohort study, Niewczas et al. demonstrated that the levels of some plasma metabolites had significant odds ratios for the incidence of ESKD during the observational period in diabetes patients (CKD stage 1–2, 78%; stage 3, 22%)19. In population-based studies, Rhee et al. and Yu et al. demonstrated that a subset of blood metabolites in study participants (eGFR ≥60 mL/min/1.73 m2) were associated with the onset of incident CKD (defined by eGFR of <60)15,16. Geok et al. demonstated that the levels or the combined ratios of some serum metabolites were associated with the decreased ratio of eGFR17,18. The present study added the important prognostic notion that patients with advanced CKD stages have particular metabolomic profiles that are associated with worse prognoses.

The 16 metabolites identified in the present cohort are located on widely variable metabolic pathways. These pathways include nucleotides (cytidine and uridine), glycolysis (gluconate [also known as gluconic acid] and saccharate [glucaric acid]), amino acids (guanidinosuccinate [guanidinosuccinic acid] and 3-methylhistidine), amino sugar (N-acetylneuramine [N-acetylneuraminic acid] and glucoronate [glucuronic acid]), biotin (pimelate [heptanedioic acid]), glutathione (5-oxoproline [pyroglutamic acid]), and taurine (isethionate [2-hydroxyethanesulfonate]). In addition, the origins or pathways of some metabolites are unidentified (4-Oxopentanoate [levulinic acid, 4-ketovalerate], trimethylamine N-oxide [TMAO], citramalate, 2-hydroxyisobutyrate [alpha-hydroxyisobutyric acid, acetonate], and phthalate [Alizarinate, Naphthalinate]). These metabolites might be originated from extracorporeal metabolisms such as gut flora, dietary, and environmental chemicals, whose importance in CKD patients are rediscovered recently23. The presence of broad pathways behind the identified metabolites in our study may reflect the complicated process of kidney deterioration24.

The association with kidney function have been demonstrated in some of metabolites identified in this study. Guanidinosuccinate, trimethylamine N-oxide, 3-methylhistidine25, cytidine26, and uridine27 are ones of the earliest uremic toxins whose toxicity are identified28,29. Trimethylamine N-oxide is also associated with onset of CKD15. Both cytidine and uridine are pyrimidine-derived nucleosides, and the level of cytidine is known to decrease in kidney from drug-induced kidney injury model rats30. Guanidinosuccinate is an acetyl derivative of the amino sugar neuraminic acid, whereas N-acetylneuramine, another metabolite identified in this study, is a major component of glycoconjugates, including glycoproteins which resides on cellular membranes to mediate several cellular functions. The fact that both guanidinosuccinate and N-acetylneuramine were identified as predictors for kidney deterioration in this study may suggest the potential roles of glycomodulation in the progression of kidney diseases. The fact that the level of 3-Methylhistidine, a component of actin and myosin, is associated with muscle protein breakdown31 may reflect muscle-wasting in kidney diseases32. 5-Oxoproline is known to increase its level in response to worsened kidney function9,33. 5-oxoproline is involved in the metabolism of glutathione, a major antioxidant, in both synthetic and degradative pathways. While our study is in revision, Yu et al. reported that lower levels 5-oxoproline was associated with incidence of CKD16, whereas our data suggested that its higher level was a risk for earlier kidney deterioration. The regulation of 5-oxoproline seems to be complicated and its level should be understood based on their kidney function and other underlying comorbidities. Interestingly, the levels of some metabolites have the potentials to predict prognosis in specific patients (i.e., in the presence or absence of diabetes). Whether these metabolites may reflect prognosis synergically or through specific association with certain underlying pathophysiology needs to be studied.

This study has some limitations that need to be addressed in further studies. Although the present study was strengthened by the longitudinal observation, the limitations exist on relatively small number of the cohort and the dependence only on the baseline characteristics. We performed power analyses using a type I error of 5% and 80% power. These analyses revealed that 19 or 65 patients would be needed for the median (3.62 for uridine) or lowest hazard ratios (2.01 for phthalate) of the selected metabolites in the prediction set, respectively. Therefore, we had enough sample size to determine the association between composite outcomes and these metabolites, though the effects of unselected metabolites might be underestimated. This is especially the case in the presence of statistical processes of multiple comparison or adjustments. The correlations between the metabolites identified in this study and eGFR also complicated the results. Although our results were achieved by adjusting for eGFR, we could not completely rule out residual confounding due to kidney function. We considered that the levels of these metabolites in CKD patients may not be simple reflections of the decreased level of eGFR based on the following observations34,35. First, the Spearman correlation coefficients for the selected metabolites with eGFR were not always the highest ones as shown in Fig. S2. Second, one metabolite (phthalate) was not significantly correlated with eGFR. Third, the coefficients of most of the metabolites were lower than that of hemoglobin (r = 0.58), which is used as a prognostic factor of CKD patients. Therefore, the metabolic profile of CKD patients identified in the present study may rather provide additional information for the progression of kidney disease. The importance of the identified metabolites and the variance in the concentrations of these metabolites need to be validated in future studies. Statistical multiple-comparison might have obscured the true relationship. Finally, we have no mechanistic data to demonstrate the reasons for the changes of the levels of these metabolites. Further mechanistic and causal studies together with validation studies will support our comprehensiveness and clinical application.

In summary, 16 metabolites were related to higher risk of the deterioration of kidney function in advanced CKD patients. Our studied population is based on referred CKD patients, who need immediate and impeding improvement of medical care. Measurement of the metabolites identified herein may facilitate our management of advanced CKD patients through the additional information on the clinical parameters, such as eGFR and urinary protein level. We propose that the metabolomic profiling of CKD patients will potentiate us to perform tailored management of CKD.

Methods

Study population and samples

We prospectively enrolled 118 consecutive patients with CKD stages 3, 4 and 5, who were not on dialysis, from a single nephrology department at Rinku General Medical Centre between August 2005 and January 2009. Baseline blood samples from patients after an overnight fast were collected. Patients with insufficient blood samples were excluded beforehand. This study was approved by the institutional ethics committees of Rinku General Medical Centre and Osaka General Medical Centre, and was adherent to the Declaration of Helsinki. All patients provided written informed consent to participate in the study.

Baseline inclusion criteria included age less than 90 years, no complication of malignancy, and no active infection. Patients with incomplete baseline data (n = 2) and started kidney replacement therapy within 1 month after the enrollment (n = 4) were excluded. Final analysis covered the remaining 112 patients.

Kidney function was evaluated from baseline data at the first visit to our outpatient clinic using the estimated glomerular filtration rate (eGFR) based on the equation for Japanese population36. The formula is as follows: eGFR = 194 × serum creatinine (SCr)−1.094 × age−0.287, where age is in years, SCr is in mg/dL and the glomerular filtration rate (GFR) is in mL/min/1.73 m2 body surface area. The product of this equation for women was multiplied by a correction factor of 0.739. Serum creatinine was measured by enzymatic methods in the same laboratory. Random urine samples (10 mL) were also collected at baseline to measure the ratio of urinary protein to creatinine. Other baseline variables included age, sex, diabetes defined according to the International Classification of Diseases, Tenth Revision (ICD-10) codes E10-E14, systolic blood pressure, diastolic blood pressure, hemoglobin, albumin, calcium, phosphate, and use of renin-angiotensin system inhibitor, beta-blocker, and calcium blocker. Cardiovascular disease included ischemic heart disease (ICD-10 codes I20-I25), heart failure (ICD-10 code I50), and stroke (ICD-10 codes I60-I67).

The primary end points were end-stage kidney disease (ESKD) requiring kidney replacement therapy and a composite of ESKD and all-cause of death.

Patients received regular follow-up care in the outpatient ward. Data were collected from source documentation at the end of 2011. The baseline and follow-up data were collected from hospital medical records and discharge abstracts, outpatient visit records, contact with primary and dialysis care physicians and death certificates. End points were validated by at least two physicians. The follow-up of patients is available with accuracy because i) this facility is the central hospital of the southern area of Osaka prefecture, and there is no other central hospital in this region, and ii) regional partnership with the primary and dialysis care physicians has been well-established.

Metabolite extraction and measurement

Metabolites were measured as previously reported with modification21,37. Briefly, plasma samples (40 μl) were put into 360 μl of methanol which contained internal standards [20 μmol/L each of methionine sulfone and 2-(N-morpholino)-ethanesul-fonic acid]. The homogenate was then mixed with 160 μl of Milli-Q water and 400 μl chloroform and centrifuged at 10,000 g for 5 min at 4 °C. Subsequently, the aqueous 300 μl solution was centrifugally filtered through a 5-kDa cutoff filter (Millipore) to remove proteins. The filtrate was centrifugally concentrated and dissolved in 50 μl Milli-Q water that contained reference compounds (200 μmol/L each of 3-aminopyrrolidine and trimesate) immediately before CE-TOFMS analysis.

Measurement of metabolites was performed using CE-TOMFS21,37. Briefly, CE-TOFMS was carried out using an Agilent CE Capillary Electrophoresis System equipped with 6210 Time-of-Flight mass spectrometer, 1100 isocratic HPLC pump, G1603A CE-MS adapter kit, and G1607A CE-ESI-MS sprayer kit. The system was controlled by Agilent G2201AA ChemStation software version B.03.01 for CE. Data acquisition was performed by Analyst QS Build: 7222 software for Agilent TOF (Applied Biosystems). For cationic metabolites, capillary electrophoreses were performed using a fused silica capillary. The electrolyte was 1 M formic acid. For anionic metabolites, a polymer coated COSMO (+) capillary (Nacalai, 07584-44i) was used. The electrolyte was 50 mM ammonium acetate (pH 8.5). For all analytical modes, inner diameter and total length of capillary are 50 μm and 100 cm, respectively. The applied voltage was set at +30 kV and −30 kV for cation and anion modes and nucleotide mode, respectively. Electrosplay ionisation-TOFMS was operated in the positive ion mode (4 kV), the negative ion mode (3.5 kV), and the negative ion mode (3.5 kV) for cationic metabolites, anionic metabolites, and nucleotides, respectively. Exact mass data were acquired over a 50–1000 m/z range. A mixed solution of the standards were prepared immediately before CE-TOFMS analysis as previously described22. The measured metabolite included amino acids, glycolysis intermediates, nucleotides, and their derivatives. CE exhibits extremely high resolution for charged species and its injection volume is quite low (3–30 nl). Therefore, matrix effects are hardly observed in CE-MS system38,39.

Data processing

Raw data were processed using software (MasterHands) developed in-house as previously described22. The data processing flow consisted of noise filtering, baseline correction, peak detection, and integration of the peak areas from 0.02 m/z-wide sections of the electropherograms. The accurate m/z of each peak was calculated by Gaussian curve-fitting on the m/z domain, and the migration times were normalized to match the detected peaks among the multiple datasets. The peaks were identified by matching m/z values and normalized migration times of corresponding authentic standard compounds. Quantification was performed by comparing the peak areas against a calibration curves generated using internal standardization-techniques to eliminate systematic bias which was derived from injection volume variance and MS sensitivity. Metabolites that were above the detection limit in at least 40% of study subjects were analyzed. We set this cut-off value for two reasons. First, we were afraid higher cut-off value may lead us to underestimate the effects of metabolites, which is against the screening-nature of this study. Second, we consider the possibility that the detection of a certain metabolite in itself may provide a valuable information for the prognosis. The levels of metabolites under detection limits were imputed with minimal value of each metabolite. The levels of metabolites collected here were exposed to the Holm’s multiple comparison as described below.

Statistical analysis

We constructed multivariate Cox proportional hazard models to assess an association between each candidate metabolite and the outcome after adjusting for the baseline covariates. Multiple comparison of the hazard models were performed with the Holm method. Log-rank test was used to assess the equality of survival distribution stratified by the median values. Statistical significance was defined as P < 0.05. Statistical analyses were performed using R environment for statistical computing, version 2.15.240, and STATA statistical software version 11 (STATA Corporation, College Station, TX, USA).

Additional Information

How to cite this article: Kimura, T. et al. Identification of biomarkers for development of end-stage kidney disease in chronic kidney disease by metabolomic profiling. Sci. Rep. 6, 26138; doi: 10.1038/srep26138 (2016).