Introduction

As an abrupt or rapid decline in renal filtration function, acute kidney injury (AKI) is a common condition associated with significant morbidity and mortality. In critically ill patients, those with AKI usually present a worse clinical outcome than their non-AKI counterparts [1]. Although regarded as the standard indicators of kidney function loss, serum creatinine (sCr) level and urine output are recognized as having limitations. On the one hand, sCr cannot accurately reflect the glomerular filtration rate (GFR) in a patient with unsteady state, and urine output is easily affected by water intake and the primary water load of the body [2]. On the other hand, sCr and urine output have limited sensitivity and specificity, and a delayed response to kidney impairment. All these factors suggest that sCr and urine output are not appropriate markers in the early detection of AKI. Thus, an accurate and timely biomarker to predict AKI onset or progression after renal insult is urgently needed.

Interleukin-18 (IL-18), a member of the IL-1 family of cytokines, is synthesized as an inactive 23-kDa precursor by several tissues including monocytes, macrophages, and proximal tubular epithelial cells, and is processed into an active 18.3 kDa cytokine by caspase-1 [3]. It has been demonstrated in some animal studies [4] as a mediator of renal ischemia–reperfusion injury, inducing acute tubular necrosis, and neutrophil and monocyte infiltration of the renal parenchyma. More recently, numerous clinical studies have focused on the diagnostic accuracy of IL-18 level in predicting AKI [57]. With the evidence accumulating, contradictory results have raised concerns about the predictive value of AKI across various settings. Thus, we performed a systematic review and meta-analysis to investigate the diagnostic accuracy of IL-18 level for predicting AKI. Since there is no clear consensus about the appropriate cutoff level of IL-18 to predict AKI and different thresholds have been reported by different studies, the summary receiver operating characteristic (sROC) curve was delineated for the meta-analysis.

Methods

Data sources and search strategy

A computer-based search was performed in MEDLINE, EMBASE, Cochrane Library, Ovid, and Springerlink databases, from inception until November 15, 2013, to identify potentially relevant articles. The search strategy consisted of terms related to AKI (“acute kidney injury” and “acute renal failure”) combined with the term “interleukin-18/interleukin 18/IL-18”. To ensure all articles were located, a secondary search of the articles already retrieved was undertaken, reviewing their reference lists. The preferred reporting items for systematic reviews and meta-analysis (PRISMA) guidelines for the conduct of meta-analyses were followed [8].

Study selection

The titles and abstracts were examined independently by two of the authors (X. L. and J. Y.) to ascertain their relevance and inclusion for further analysis. Any disagreements were settled by consensus using a third opinion (Y. Z.). Inclusion criteria were that studies investigated the diagnostic accuracy of urinary IL-18 to predict AKI and had data which could be extracted into a 2 × 2 table or complete data which could be obtained from the corresponding author by e-mail. The search was restricted to human subjects. Only papers in English were eligible.

Data extraction

The following variables were recorded or recalculated: study population, sample size, age, sex, baseline IL-18 level, cutoff value for urine IL-18, AKI definition, number of patients who developed AKI, timing of obtaining the specimen, specificity, sensitivity, and area under the ROC curve (AUROC) with 95 % confidence interval (CI). Data for IL-18 sample storage and detection method were also collected. Two reviewers independently excluded articles on the basis of the title and abstract following a custom-made standardized table. The true-positive, true-negative, false-positive, and false-negative results of each included study were quantified. Quality assessment was performed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool [9] and independently by the authors (X. L. and J. Y.).

Statistical analysis

The meta-analysis was conducted using sROC curve as described by Rosman [10] and the computation was performed using Meta-Disc1.4. If multiple cutoff points were adopted in any study, the cutoff point (test threshold), sensitivity and specificity in this analysis were selected according to the Youden index. Heterogeneity was quantitatively assessed using the Q statistic. Fixed or random effects models (FEM or REM) were used on the basis of heterogeneity. We performed subgroup analysis to explore the remarkable heterogeneity in the light of different clinical settings.

Results

Search results and study characteristics

Of 179 primary articles searched, 156 were excluded on the basis of abstract and/or title because they were laboratory studies, genetic studies, review/comment articles, animal studies, or irrelevant to the current analysis; 3 were excluded because non-English articles. Of the 20 remaining articles, 5 studies with accurate diagnostic values of IL-18 missing and 3 studies lacking effective data (though we had contacted the corresponding authors to request the missing information) were excluded. As a result, 11 studies [6, 7, 1119] met the inclusion criteria and were selected for the analysis (Fig. 1).

Fig. 1
figure 1

Flowchart representing study selection for systematic review of urine IL-18 as a diagnostic marker for AKI. IL interleukin, AKI acute kidney injury

Characteristics of the included trials are shown in Table 1. Two come from China [6, 11], seven from the USA [7, 13, 14, 1619], one from Taiwan, China [12], and one from Australia [15]. Overall, 2,796 patients were comprised, ranging from newborn to elderly age. Sample sizes were >100 in 7 studies and >1,000 in 1 [14]. Most studies were conducted on critical patients, including those with cardiac surgery (cardiopulmonary bypass) and transplantation, as well as various intensive care patients. Five studies [6, 11, 13, 17, 18] focused on pediatric patients. Males in all studies represented >50 % of the study population. Eight studies defined AKI using acute kidney injury network (AKIN) and/or risk, injury, failure, loss and end-stage renal disease (RIFLE) criteria, two [11, 17] using modified pediatric RIFLE, and one [16] using sCr level which, however, rose to ≥50 % of baseline in the first 72 h. Ten studies harnessed urine IL-18 concentration as the cutoff value and two [11, 15] the ratio of urine IL-18 to urine-creatinine concentration.

Table 1 Characteristics of included studies

Quality assessment

Table 2 lists the methodological quality assessment of the included 11 studies. Ten studies enrolled single or multicenter consecutive patients prospectively, one [7] was retrospective and prospective in design. The study by Chen et al. [12] did not define inclusion and exclusion criteria clearly. The reference standards used in the afore-mentioned studies were creatinine based, which currently is the most widely used standard for evaluating kidney function. The conditions of follow-up were reported in the 10 prospective cohort studies; there were no selective losses in them. Overall, the quality of the studies was suboptimal, without sufficient information to assess all the bias, and uninterpretable, indeterminate or intermediate index test results were not reported. The quality assessment was performed independently by the authors (X. L. and J. Y.).

Table 2 Quality assessment of individual studies

Data extraction and calculation

As the respective number of true-positive, false-positive, true-negative, and false-negative results was not provided by the studies, these indexes were calculated from available sensitivity, specificity, and sample size values. Results are reported in Table 3 which enumerates specimen obtaining times and AKI predictive times of each study. Cutoff and AUROC values are also included in the table. As can be seen, cutoff values for urine IL-18 varied across studies.

Table 3 Sensitivity and specificity of individual studies for urine IL-18 to predict AKI

Diagnostic value of urine IL-18 in AKI prediction

The distribution of accurate estimator in sROC curve floor plan and the Spearman correlation coefficient (r = 0.764, p = 0.006) of logarithmic sensitivity and logarithm (1-specificity) showed that there was a threshold effect in different studies (Fig. 2; Table 4).

Fig. 2
figure 2

The distribution of accurate estimator in sROC curve floor plan. sROC summary receiver operating characteristic

Table 4 Analysis of diagnostic threshold

Figure 3 shows the forest plots and pooled estimates of sensitivity, specificity and diagnostic odds ratios (ORs) respectively. We found a diagnostic OR of 5.11 (95 % CI 3.22–8.12) for urine IL-18 level to predict AKI (Cochran-Q = 28.19, p = 0.0017) with a sensitivity and specificity respectively of 0.51 (heterogeneity Chi-squared 84.53, p < 0.001) and 0.79 (heterogeneity Chi-squared 62.84, p < 0.001) (Fig. 3). Significant non-threshold effect heterogeneity was also disclosed across these studies. As there was no statistically significant between b and 0 (p = 0.1870), the Mantel–Haenszel model was utilized to draw the fitting curve of sROC (Fig. 4). The pooled AUROC was 0.77 (95 % CI 0.71–0.83, Q = 0.71).

Fig. 3
figure 3

Forest plots and pooled estimates of a diagnostic odds ratio (OR), b sensitivity, and c specificity

Fig. 4
figure 4

The fitting curve of sROC

Subgroup analysis (Table 5) showed that urine IL-18 level in pediatric patients (<18 years) was more effective in predicting AKI, with a diagnostic OR of 7.510 (95 % CI 2.988–18.875) compared to 4.652 (2.710–7.987) for the adult group (p = 0.334); the early AKI predictive time (<12 h) subgroup displayed the highest diagnostic OR of 8.176 (95 % CI 2.191–30.507) among the three subgroups (<12, 24 and 48 h, p = 0.037). The pooled estimate diagnostic OR of the admission subgroup was 3.81 (95 % CI 2.08–6.99) and that of the subgroup for other times 7.16 (3.62–14.18). There were no significant differences between the two (p = 0.388). There were significant differences (p = 0.008) between cardiac surgery patients and patients in intensive care unit or coronary care unit patients in the subgroup analysis, with diagnostic ORs of 5.28 (3.59–7.76) and 5.31 (2.59–10.87), respectively.

Table 5 Pooled diagnostic accuracy of IL-18 in various settings

Discussion

The current study presents a meta-analysis of urine IL-18 for predicting AKI development via the sROC analysis approach, both overall and across a range of subgroups. Studies committed to direct comparisons between urine IL-18 and the “gold standard” sCr were included. The finding derived is consistent with previous studies, which lends more force to the use of urine IL-18 as a marker of AKI in clinical practice. The results of this meta-analysis indicate that urine IL-18 had a pooled diagnostic OR of 5.11 and the estimated area under the curve (AUC) of the mean ROC plot was 0.77 (Q = 0.71), with a high heterogeneity in pooled sensitivity and specificity. This suggests that urine IL-18 has a higher chance of early detecting an AKI compared to sCr, but the application of this biomarker in the diagnosis of AKI should be limited to a certain range in the light of limitations intrinsic to such studies. Since pooled results may increase statistical power and lead to more precise estimates of a treatment effect and the pooled random or fixed effect models reflect the between-study variance [20], this may shed light on larger populations.

Due to the presence of a threshold effect, we used sROC curve fitting, the area under ROC and Q index, to merge the data. In the studies included, cutoff values were expressed in two types, urinary concentration or ratio of urine IL-18 and creatinine concentration, and identified as varying across studies (11.25–125 pg/ml). Such differences can be explained by: (1) use of different reagents in these studies though the assay methods were enzyme-linked immunosorbent assay (ELISA); and (2) differences in clinical settings and study population. Therefore, it might be necessary for each center using urine IL-18 level for early AKI diagnosis to define a specific reference range and cutoff value for each clinical setting.

In addition, a significant non-threshold effect heterogeneity exists across these studies. Since these studies were based on different institutions across the world, heterogeneity was inevitable concerning differences in AKI definitions, AKI settings, times for obtaining specimen, and experimental groups admitted to assess the predictive value of urine IL-18. We attempted to use meta-regression to explore the sources of heterogeneity, but failed. So subgroup analysis was performed and proved that urine IL-18 level in pediatric patients and the early AKI predictive time group (<12 h) was more effective in predicting AKI, which might principally account for the heterogeneity. Nevertheless, as the kidney undergoes growth and maturation in neonates who manifest different physiological states due to the abrupt changes at birth, many risk factors are correlated with AKI and the timing of kidney injury remains unknown [21]. The metabolic capacity and compensatory ability of neonates and children are also different from those of adults [22]. Subgroup analysis of age should be more specific if more clinical trials are conducted. Regarding AKI definition, AKIN, RIFLE, AKIN and/or RIFLE, and modified pediatric RIFLE criteria were adopted in the different included studies. The pooled diagnostic OR of AKI within 12 h was greater than that of 24 or 48 h in subgroup analysis. This is useful in clinical processing since preventive strategies can be formulated if AKI can be predicted in advance.

Using sCr level as the “gold standard” of diagnosis of AKI is another limitation of such studies because sCr is not an ideal marker of early loss of glomerular filtration or kidney injury [23]. The best method is to use radio-labeled tracer clearances to define AKI. However, its use in routine clinical practice is restrained because it is invasive, time-consuming and radioactive.

Another limitation lies in IL-18 itself. Cross-sectional studies indicate that urinary IL-18 levels are markedly elevated in patients with acute tubular necrosis compared with healthy controls and a variety of other renal pathologies, including urinary tract infection, chronic renal insufficiency, and pre-renal azotemia [24]. Therefore, the pathophysiology of IL-18 still remains incompletely understood and the true role of IL-18 may be as a mediator of specific injury subtypes rather than as a marker of injury. Though IL-18 can be induced in the proximal tubule after AKI, and released into urine after cleavage by caspase-1 [25, 26], it can also be derived from lung injury and myocardial ischemia, etc. Thus, further studies are required to understand these differences.

Generally speaking, the cost of a single creatinine test is less than a dollar, while that of IL-18 is five times higher. In the course of AKI, biochemical indices need to be monitored and detected many times; given the modest clinical value of IL-18 and its high cost, serum creatinine concentration and change might still be a good indicator rather than IL-18.

Conclusion

In conclusion, this meta-analysis included more than 2,700 patients from 11 studies. The result shows that urine IL-18 level is of diagnostic value for AKI. The diagnostic accuracy for AKI tends to be more effective in pediatric patients and early AKI predictive time. To improve the diagnostic value of urine IL-18, more appropriately designed investigations (e.g. with randomized design and eliminating potential confounders) should be performed.