Introduction

Acute kidney injury (AKI) is a clinical syndrome with a high incidence of extensive renal damage, which can cause abnormalities in renal structure and function. With improvements in the understanding and diagnosis of AKI and the aging of the population, the clinical incidence of AKI has increased significantly in recent years. A systematic review that included 49 million patients identified that AKI occurred in one in five adults and one in three children hospitalized with acute illness1. Furthermore, multiple studies have found that AKI is an important risk factor for higher morbidity and mortality. The overall mortality rate of AKI is close to 25%, and the death rate of severe cases is over 50%. In addition, patients with AKI are at high risk of developing chronic kidney disease (CKD), which places a huge burden on medical and health care2,3. Therefore, it is crucial to address the early and potentially reversible stages of acute kidney injury. Typically, AKI is diagnosed by creatinine-based guidelines that rely solely on guidelines for increased serum creatinine or decreased urine volume4,5. However, serum creatinine concentration does not increase until approximately half of the kidney function is lost, and it may be affected by sex, muscle mass, and the patient’s hydration status6,7. Although elevated blood urea nitrogen and serum creatinine levels are the most widely accepted biomarkers of AKI, their increase may be delayed by 24–72 h following AKI induction.

New biomarkers will help in the early diagnosis of AKI, making the identification of these biomarkers of major clinical interest. Recently, the new biomarkers tissue inhibitor metalloproteinases-2 (TIMP-2) and insulin-like growth factor-binding protein 7 (IGFBP7) have been proposed for the early detection of cardiac surgery-associated acute kidney injury (CSA-AKI)8. Both are involved in the G1 cell cycle, which is a known mechanism of AKI9. In human kidneys, TIMP-2 is expressed in the distal nephron, whereas IGFBP7 is mainly expressed in the proximal tubule, and both markers can be detected in urine samples10. In 2014, the Nephrocheck™ test, which calculates [TIMP-2] × [IGFBP7] concentrations, was developed and approved by the Food and Drug Administration (FDA) for use in intensive care unit (ICU) patients to predict the risk of developing moderate to severe AKI11. Since then, an increasing number of studies have evaluated the clinical application of [TIMP-2] × [IGFBP7] in AKI of different etiologies. Thus, we performed a meta-analysis to determine the clinical value of urinary [TIMP-2] × [IGFBP7] levels in all cause-AKI.

Methods

Search strategy

A complete meta-analysis protocol was constructed in adherence with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) standards. We performed a comprehensive literature search of the MEDLINE, PubMed, and EMBASE databases from 2013 to April 2022 to identify relevant articles. The literature search included the keywords and MeSH (medical subject headings) terms “acute kidney injury”, “acute renal failure”, “acute tubular necrosis”, “TIMP-2”, or “Tissue Inhibitor of Metalloproteinase-2”, “IGFBP7”, “IGF-binding protein 7”, “insulin-like growth factor binding protein 7”, with no language restrictions. The full search strategy is provided in the Supplementary Document. The search strategy was manually adapted according to the citation lists of the retrieved articles for the sensitivity analysis. The reference lists of selected studies were manually searched to identify potentially relevant citations.

Study selection

Two investigators independently evaluated all identified articles for eligibility and inclusion. Any disagreements were resolved by consultation with a third investigator. Although there were no initial language restrictions, for the full-text review and final analysis, we included only articles published in English. Studies included met the following criteria: (1) original study; (2) urinary [TIMP-2] × [IGFBP7] as biomarkers for the early diagnosis of AKI; and (3) studies with mandatory data from which true-positive (TP), false-positive (FP), false-negative (FN), and true-negative (TN) could be found or calculated. The exclusion criteria were as follows: (1) studies from conference abstracts, guidelines, letters, editorials, or reviews; (2) studies without sufficient data for analysis, even after contacting the authors; and (3) studies with duplicate data reported in other studies.

Data extraction and quality assessment

Data on the patients and study characteristics were collected and entered into a database to assess study eligibility. If eligible, a standardized data extraction sheet was used for data extraction. The following data were extracted: first author, year of publication, publication data, study design, population type, age, sample size, test method, timing of measurement, AKI definition, sample storage, TP, FP, FN, TN, sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC). We assessed the methodological quality using the Quality Assessment Tool for Diagnosis Accuracy Studies (QUADAS-2)12. Any discrepancies that arose from the study selection, data extraction, and quality assessment were resolved by discussion to reach a final consensus.

Statistical analysis

All statistical analyses were conducted using Review Manager 5.4 (RevMan; The Cochrane Collaboration, Oxford, UK) and STATA 14.0 software (Stata Corp, LP, College Station, TX, USA). A random-effects model or fixed-effects model was constructed to estimate the pooled sensitivity, specificity, pooled positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR) with 95% CI. The model selection was based on the heterogeneity of the included studies13. Diagnostic accuracy analysis was consistent with the summary receiver operating characteristic (SROC) curve and area under the curve (AUC) of the SROC. The heterogeneity induced by the threshold effect was set at P < 0.05. Heterogeneity in the meta-analysis represented the degree of variation in the study results, and was assessed using Cochran’s Q test and I2 test14. Cochran’s Q test indicated heterogeneity at P < 0.10, and I2 > 50% was considered an indication of significant heterogeneity. A useful predictor of AKI risk was defined as an AUROC of > 0.7 and P < 0.05. The Fagan nomogram was used to calculate post-test probability (PTP). Deeks’ funnel plot asymmetry test was used to check for publication bias using STATA 14.015. All statistical tests were 2-sided, and statistical significance was set at P < 0.05.

Ethics approval and consent to participate

The local Institutional Review Board deemed the study exempt from review.

Results

Search results

A total of 200 articles were preliminarily identified through the search. First, 40 studies were removed after duplicates were identified. We excluded 120 studies by screening the titles and abstracts. After reviewing the full text of the remaining studies, a further 18 publications were excluded: 10 did not have the required data, 4 had no definition of AKI, 3 had unclear measurement time of biomarkers, and 1 had no male subjects. Moreover, two studies were unable to extract a 2 × 2 table data, but only AUROC prediction values16,17. Finally, 20 studies were included in the quantitative analysis (Fig. 1)10,11,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35.

Figure 1
figure 1

Flow chart of study selection.

Study and patient characteristics

All studies were published between 2013 and 2022, and a total of 3625 patients were included in this meta-analysis. Table 1 presents the characteristics of the included studies. One of the included studies was conducted on infants30, and the remaining 19 studies were conducted on adults. The included studies were conducted on patients with different AKI settings. Ten studies focused on patients who had undergone cardiac surgery, 5 included critically ill patients, 2 included emergency department (ED) patients, 2 included trauma patients, and 1 included patients with out-of-hospital cardiac arrest. It is worth noting that these articles had different definitions of AKI. Urinary [TIMP-2] × [IGFBP7] levels were measured by a commercial enzyme-linked immunosorbent assay (ELISA) in two studies, and the remaining 18 studies were measured using the FDA-approved NephroCheck Test.

Table 1 Characteristics of Incloud studies.

Definition of AKI

Three studies used RIFLE (risk, injury, failure, loss, end-stage kidney disease)10,18,20, and the remaining 17 used the Kidney Disease Improving Global Outcomes (KDIGO) criteria for AKI. Most studies used a minimal threshold for AKI (KDIGO stage 1)11,25,26,27,28,29,30,31,32,33,34,35, whereas 5 studies set higher thresholds (KDIGO stage 2–3)19,21,22,23,24.

Quality assessment and publication bias

The quality outcomes of the included studies according to the QUADAS-2 are shown in Fig. 2. Our results revealed that 7 studies did not use a set threshold, resulting in a higher risk in the index test. Concerning the reference criteria, seven studies were assessed as having unclear risk because they did not mention blinding. None of the studies had concerns regarding applicability. Details of the studies are shown in Table 1 and Supplementary Table 1. The Deeks’ funnel plot asymmetry test was used to evaluate publication bias. The finding of P = 0.26 suggests that the likelihood of publication bias is low (Fig. 2).

Figure 2
figure 2

(a) Summary of the methodological quality of the studies. ‘−’ in red and ‘+’ in green mean high risk and low risk, respectively. ‘?’ in yellow means unclear risk. (b) Deek funnel plot asymmetry test for publication bias.

Diagnostic value of urinary [TIMP-2] × [IGFBP7] for AKI prediction

The TP, FP, FN, TN, paired sensitivity and specificity, and the cutoff values of individual studies for urinary [TIMP-2] × [IGFBP7] levels to predict AKI are listed in Supplementary Table 1. In our analysis, the SROC curve did not show a typical ‘shoulder arm-shaped’ style, and the Spearman correlation coefficient was 0.098 (P = 0.682), indicating a lack of a threshold effect. The pooled sensitivity was 0.79 (95% CI 0.72, 0.84) and pooled specificity was 0.70 (95% CI 0.62, 0.76; Fig. 3). Pooled PLR was 2.6 (95% CI 2.1, 3.3), NLR was 0.31 (95% CI 0.23, 0.40), and pooled DOR was 8 (95% CI 6, 13). The AUROC was 0.81 (95% CI 0.78–0.84; Fig. 3). The I2 values for pooled sensitivity and specificity were 75.72 (95% CI 65.24, 86.20) and 93.29 (95% CI 91.29, 95.30), respectively. The Fagan nomogram (Supplementary Fig. 1) and likelihood ratio scatter plot (Supplementary Fig. 2) were used to estimate the diagnostic value and clinical availability of urinary [TIMP-2] × [IGFBP7] for AKI. The pooled pretest probability was 20%. The results showed that when [TIMP-2] × [IGFBP7] was positive, the post-test probability of [TIMP-2] × [IGFBP7] for AKI increased to 39%. In contrast, when the results were negative, the post-test probability of AKI detection dropped to 7%. The observed PLR value of 2.6 indicated that patients with AKI are 2.6 times more likely to have a positive diagnosis than healthy subjects, and an NLR of 0.31 suggested that the combination of TIMP-2 and IGFBP7 is a useful biomarker for the diagnosis of AKI.

Figure 3
figure 3

(a) Forest plot of the pooled sensitivity and specificity of urine [TIMP-2] × [IGFBP7] in predicting acute kidney injury. (b) Summary receiver operating characteristic graph for the included studies. The curve is represented by the straight line; each of the analyzed studies is represented by a circle; the point estimate to which summary sensitivity (SENS) and specificity (SPEC) correspond is represented by the diamond shape, and the respective 95% confidence intervals, by the dashed line, whereas the 95% confidence area in which a new study will be located is represented by the dotted line.

Subgroup analysis

To further investigate the value of [TIMP-2] × [IGFBP7] for the diagnosis of AKI among different age groups, diagnostic criteria, assessment times, clinical settings, and cut-off value. we performed a series of subgroup analyses. As one of the 20 studies enrolled infants, we reanalyzed the data of adults only in the age subgroup analysis. Sensitivity and specificity did not differ from the summary results in the adult group (0.79 and 0.71). The AUROC was 0.82. The 19 studies were divided into two subgroups according to age (65 years), and there was no significant difference in the predictive performance of urinary [TIMP-2] × [IGFBP7] among different age groups. Fifteen studies used all stages of kidney disease as the AKI diagnosis criteria, and five studies adopted the Stage 2–3 KDIGO criteria to define the AKI endpoint. The results suggested that urine [TIMP-2] × [IGFBP7] showed a better trend of sensitivity 0.84 (95% CI 0.73, 0.91), specificity 0.72 (95% CI 0.56, 0.84), and AUROC 0.86 (95% CI 0.83, 0.89) in the Stage 2–3 subgroup than in the all stage subgroup [sensitivity 0.77 (95% CI 0.69, 0.84), specificity 0.69 (95% CI 0.60, 0.77), and AUROC 0.80 (95% CI 0.76, 0.83)]. We divided the studies into three subgroups based on different measurement times (within 12 h, 12–24 h and 24–48 h). There were 9 studies in the “Within 12 h of measurement” subgroup; estimated AUROC was 0.78, while pooled sensitivity and specificity were 0.81 (95% CI 0.73 0.86) and 0.62 (95% CI 0.53, 0.70). There were 4 studies in the “12–24 h of measurement” subgroup; estimated AUROC was 0.80, while pooled sensitivity and specificity were 0.81 (95% CI 0.66, 0.90) and 0.67 (95% CI 0.54, 0.78). Additionally, our subgroup analysis found that seven studies with a measurement time of 24-48 h demonstrated a high level of diagnostic accuracy for AKI (sensitivity = 0.74, specificity = 0.82, AUC = 0.86) and an earlier diagnosis compared to current AKI diagnostic standards. Our study suggests that measuring urinary [TIMP-2] × [IGFBP7] within 24–48 h of admission may enable earlier identification of AKI, potentially improving patient outcomes. Nine studies focused on cardiac surgery patients, divided into “cardiac surgery” subgroups. Five others focused on patients in the ICU and two on patients in the ED and constituted the “ICU and ED” subgroup. Urine [TIMP-2] × [IGFBP7] showed good diagnostic value in patients undergoing cardiac surgery with an estimated AUROC of 0.83 (95% CI 0.79, 0.86). Pooled sensitivity and specificity were 0.75 (95% CI 0.64, 0.83) and 0.78 (95% CI 0.70, 0.84). Subgroup analysis was conducted based on the [TIMP-2] × [IGFBP7] cut-off value in urine, resulting in two groups: cut-off value equal to 0.3 and cut-off value not equal to 0.3. Our findings suggest that the latter group exhibited superior diagnostic performance (sensitivity = 0.78, specificity = 0.76, AUC = 0.82). The results of subgroup analyses are presented in Table 2.

Table 2 Results of subgroup analysis based on different standards.

Discussion

AKI is a common and severe clinical condition. Currently, the diagnosis of AKI is based on serum creatinine level and urine output; however, these diagnoses are insensitive, especially in the early stages of AKI. Serum creatinine levels do not rise until 24 to 72 h after kidney injury, and urine output is less specific and is also affected by diuretics. Over the past decades, many biomarkers, including neutrophil gelatinase-associated lipocalin (NGAL), kidney injury molecule 1 (KIM-1), and interleukin 18 (IL18), have been studied for the early diagnosis of AKI, but their diagnostic sensitivity is relatively low36,37,38. In recent years, urinary [TIMP-2] × [IGFBP7] has shown the best accuracy and stability in adult ICU patients who developed ischemic or nephrotoxic AKI. We conducted this meta-analysis, which included 20 studies (3625 patients) reporting the diagnostic value of urinary [TIMP-2] × [IGFBP7] in patients with different clinical settings of AKI. We found that this biomarker test had good diagnostic accuracy (sensitivity = 0.79, specificity = 0.70, AUC = 0.81) in all-cause AKI. In addition, the pooled DOR was 8 (95% CI 6–13), which suggests that urinary [TIMP-2] × [IGFBP7] had good diagnostic effectiveness for AKI. When the DOR is greater than 1, a higher diagnostic odds ratio indicates better diagnostic performance. For a DOR of less than 1, the reverse is true39. Moreover, it showed a better trend of sensitivity and specificity in patients with severe AKI (KDIGO stage 2–3) and in patients measured after 12 h. This result is similar to that of previous meta-analyses on this topic, which included 10 full-text prospective studies showing a sensitivity of 0.77, specificity of 0.76 and AUC of 0.83 for the early diagnosis of cardiac surgery-associated AKI in urine [TIMP-2] × [IGFBP7]40. There have been many studies in various clinical settings for AKI (e.g. surgery-associated, cardiac arrest-associated, sepsis) and age groups (e.g. older adults vs. younger adults) were included in the analysis.

Our study demonstrated that across various clinical contexts, a cutoff value of 0.3 was selected in 11 studies included in our analysis. When considering a predictive biomarker, it is crucial to establish a cutoff value that differentiates between disease and healthy groups. Unfortunately, we were unable to determine the optimal cutoff points for urine [TIMP-2] × [IGFBP7] due to the observed variation in cutoff values across the included studies. The large Sapphire and Opal cohorts have validated the cutoffs of 0.3 and 2.0 for moderate-to-severe AKI18,19. However, it remains an open question as to how clinicians can effectively utilize this potential in different clinical contexts.

TIMP-2 and IGFBP7 are two biomarkers of G1 phase cell cycle arrest. These factors act by blocking the binding of cyclin and protein kinases through regulation of the P53 and P21 signaling pathways and alteration of the response of cells to inflammatory factors and toxins41. AKI-induced increases in urinary TIMP2 and IFGBP7 are caused by increased filtration, reduced tubule reabsorption, and urinary leakage of TIMP2 and IGFBP7 from proximal tubule cells42. Such physiological and pathological mechanisms lead to increased urinary TIMP2 and IGFBP7 levels in acute kidney injury and their role in the early prediction of AKI. Our studies suggested that urinary [TIMP-2] × [IGFBP7] showed excellent diagnostic value in patients undergoing cardiac surgery, with an estimated AUC of 0.81. The pooled sensitivity and specificity were 0.73 and 0.76. Zaouter et al. found that in a population at risk of developing cardiac surgery-associated AKI (CSA-AKI), urinary [TIMP-2] × [IGFBP7] was not detected in the first postoperative week within the first 24 postoperative hours29. Conversely, in Levante’s study, [TIMP-2] × [IGFBP7] had an AUC of 0.92 for predicting AKI after cardiac surgery, and a sensitivity and specificity of 0.84 and 0.88, respectively, for a cutoff value of 0.343. Several studies have investigated the ability of urinary [TIMP-2] × [IGFBP7] to predict AKI after cardiac surgery. However, due to the different AKI definitions, measurement times, and cutoff values, the results are not consistent; therefore, further large-scale studies are required.

Significant heterogeneity was observed in this meta-analysis. In subgroup analysis, inter-study heterogeneity may be derived from the AKI threshold. The AUC was higher in the KDIGO stage 2–3 subgroup than in all KDIGO stage subgroups (0.86 vs. 0.80). Similarly, Su et al. showed that urine [TIMP-2] × [IGFBP7] had the best test characteristics for KDIGO stage 2–340. However, Jia et al. suggested that the AUC was higher in the KDIGO Stage 1 subgroup44, but there were only three studies in the “KDIGO stage 1” subgroup. AKI diagnosed by the KDIGO stage 1 criteria is affected by a variety of factors, including blood concentration and drugs; thus, it may only reflect pure functional impairment, without a true kidney injury having occured. TIMP-2 and IGFBP7 levels in the urine were elevated only when the kidney was in emergency or damaged conditions. Therefore, urine [TIMP-2] × [IGFBP7] may be more sensitive for predicting severe AKI (KDIGO stage 2 or 3).

Despite significant advances in the epidemiology of AKI, predicting kidney recovery after AKI remains a major clinical challenge. One study has shown that plasma NGAL can predict AKI recovery, but its predictive performance is limited45. In another study, it was shown that the course of urinary [TIMP-2] × [IGFBP7] concentration predicted renal recovery from AKI following cardiac surgery (AUC: 0.79, 95% CI 0.65, 0.92). In contrast to [TIMP-2] × [IGFBP7], urinary NGAL concentration failed to predict renal recovery (AUC: 0.48, 95% CI 0.31, 0.64)20. The correlation between cell cycle arrest biomarkers and AKI outcomes deserves further investigation. Our data are in line with Sakyi’s study that urinary [TIMP-2] × [IGFBP7] showed the best diagnostic performance in predicting KDIGO AKI stage 2 and 3. Moreover, Sakyi et al. also suggested mean level of [TIMP-2] × [IGFBP7] increased as AKI stage increased32. The levels of cell cycle arrest biomarkers ([TIMP-2] × [IGFBP7]) increased with increasing AKI severity, indicating it could be used to monitor the progress of the condition. However, the severity of AKI is associated with a significantly increased incidence of clinically important outcomes such as renal replacement therapy, in-hospital mortality, and persistent renal dysfunction. Consequently, early detection and risk assessment could improve patient outcomes through early intervention and optimized patient management.

The present study had several limitations. First, some of the included studies had small sample sizes, which may have led to an overestimation of urinary [TIMP-2] × [IGFBP7] in the diagnosis of AKI. Second, most of the published literature on [TIMP-2] × [IGFBP7] has focused on its application in adult ICU patients and cardiac surgery patients. Whether [TIMP-2] × [IGFBP7] use can be expanded to other contexts or patient populations, such as pediatric patients, has not been confirmed. Finally, many of the included studies assessed urine [TIMP-2] × [IGFBP7] at a single point in time, without continuous measurements, and lacked predictions of AKI progression and prognosis.

Conclusion

In conclusion, this meta-analysis included updated clinical studies and used more accurate analysis methods to assess the diagnostic value of urine [TIMP-2] × [IGFBP7] compared to the current literature. Our meta-analysis suggests that urinary [TIMP-2] × [IGFBP7] levels have good predictive value as biomarkers of AKI in a wide range of clinical settings. However, whether and how urinary [TIMP-2] × [IGFBP7] can be widely used in the clinical diagnosis of all-cause AKI needs to be studied in different clinical settings, patient populations, and disease spectrum studies in the future.