Background

Acute kidney injury (AKI) is a medical complication associated with significant morbidity and mortality in critically ill patients [13]. AKI is common in critical illness, and severe AKI is associated with up to 60 % hospital mortality [4]. Renal replacement therapy (RRT) within the intensive care unit (ICU) is conducted as either intermittent hemodialysis or continuous renal replacement therapy (CRRT). Traditional indications for RRT require the development of overt clinical manifestations of renal insufficiency, such as acidosis, electrolyte disturbances (most notably hyperkalemia), uremic complications (encephalopathy or pericarditis), and volume overload unresponsive to aggressive medical management. In spite of research and increasing clinical experience with dialysis, the optimal time to initiate RRT in the course of critical illness complicated by AKI is unclear.

The notion of “early” RRT is to initiate dialysis therapy before nitrogenous and other metabolic products accumulate to the degree where they become relatively resistant to therapy [5, 6]. Despite the intuitive rationale for “early” RRT, there is limited evidence to guide clinicians on the optimal time to initiate RRT in critical illness. Neither standard clinical parameters nor research into novel clinical biomarkers has emerged to clearly define an ideal time or clinical picture where the initiation of RRT optimizes patient outcomes. Earlier initiation of RRT must be balanced with potential patient harm associated with RRTs. Protocolled use of hemofiltration for 96 h in patients with septic shock admitted to an ICU regardless of their renal function suggests that “early” RRT can be associated with negative patient outcomes [7]. As a result, research into “early” RRT includes multiple definitions of early that reflect a potpourri of time factors, biochemical markers, and clinical parameters in an attempt to balance the risks of initiating RRT with the benefits expected from supporting renal function during critical illness.

The authors of two earlier meta-analyses pooled available data on this topic to suggest that “early” RRT improves survival in critical illness. Seabra et al. [8] identified 23 studies (5 randomized controlled trials [RCTs]/quasi-RCTs, 1 prospective study, and 17 retrospective cohort studies) and concluded that “early” initiation of RRT was associated with 28 % mortality risk reduction (relative risk [RR] 0.72, 95 % CI 0.64–0.82, p < 0.001). Karvellas et al. [9] identified 15 studies (2 RCTs, 4 prospective studies, and 9 retrospective cohort studies) and reached similar conclusions, reporting a significant improvement in 28-day mortality with “early” RRT (OR 0.45, 95 % CI 0.28–0.72, p < 0.001). However, the overall findings were not congruent with the subgroup analysis of randomized trials (RR 0.64, 95 % CI 0.4–1.05, p = 0.08), where there was a signal that “early” RRT was not associated with a significant survival advantage. This has diminished clinical confidence in the conclusions reached by the earlier meta-analyses, and consequently “early” RRT in critical illness remains a controversial therapeutic intervention.

Since 2012, additional studies have been published that do not support the conclusions of the previous meta-analyses, and this has further diminished the confidence in the previous conclusions that suggested a survival benefit in critical illness associated with “early” RRT. We conducted a systematic review and evidence synthesis to investigate whether “early” versus “late” initiation of RRT in critically ill patients with AKI improves patient survival and selected secondary outcomes for potential signals to suggest that “early” RRT may reduce patient morbidity or enhance illness recovery. Our goal was to identify the highest-quality studies on this topic and use a pooled meta-analysis of these studies to inform our conclusions.

Methods

Search strategy

This study was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [10] (see Additional file 1: Figure S1 for PRISMA checklist). Our null hypothesis was that “early” initiation of RRT does not improve patient survival in critical care patients with AKI. This systematic review was not registered, and a protocol does not exist. The PubMed and Embase databases were searched to identify published articles following four broad themes: AKI, RRT, time of initiation, and critical illness (see Additional file 2: Table S1 for search terms). SK is a National Institutes of Health (NIH) physician and requested the NIH librarian to provide oversight for the search strategy. Our search was limited to English-language-only, full-text primary research publications (including abstracts with full text availability) reporting findings of clinical trials and observational studies (cohort and case-control design) published between January 1985 and November 2015. Studies before 1985 were not actively sought, owing to a low likelihood of relevance to modern RRTs and critical care practices.

Study selection

References were screened and excluded if they were small case reports or observational studies (fewer than 10 subjects), were not focused on critically ill adult patients, did not report mortality data, involved basic science data, or did not clearly distinguish between “early” and “late” groups. This task was divided among the authors. A second evaluation led by the senior author (RLCK) was conducted to evaluate study quality. Studies were designated as being of “high quality” or “low quality.” Studies were assigned a “low-quality” rating if there was no illness severity assessment between cohorts or at the time of randomization (n = 8), significant differences (p < 0.05) between cohort groups (n = 7) at baseline, incomplete basic demographic data at baseline (n = 6) to exclude baseline differences, or a Newcastle-Ottawa Quality Assessment (NOQA) Scale [11] for cohort studies rating less than 7 (n = 6). The senior author (RLCK) was the arbiter in cases of disagreement. Only high-quality studies were included in quantitative meta-analysis of the primary and secondary outcomes.

Primary and secondary outcomes

The primary outcome of interest was mortality at 1 month (pooling outcomes for mortality at 28 or 30 days, depending on what was reported by the primary authors). In addition to mortality, we analyzed selected secondary outcomes, including ICU length of stay (LOS) and hospital LOS. Secondary outcomes were not consistently reported for all studies, and only studies with applicable data were included in our pooled analysis. Weighted means were calculated as a product of the number of patients and mean duration to reach a total and represented as a total of patient-days per study. These values were summed and divided by the total number of patients from all included studies to reach weighted mean duration of LOS for both hospital and ICU LOS metrics. A similar process was used to derive the mean weighted illness severity scores. Other potentially relevant secondary outcomes, including mechanical ventilation requirements, vasopressor requirements, and renal recovery rates, were considered, but these variables were inconsistently reported and commonalities could not be reached among the heterogeneous parameters that were available.

Definition of “early” versus “late”

Early was defined on the basis of criteria used by the original authors in their respective studies. We accepted a broad definition of early based on biochemical markers according to RIFLE classifications (risk, injury, failure, loss of function, and end-stage kidney disease), Acute Kidney Injury Network (AKIN) stages, or time-based cutoffs (e.g., within a defined time from ICU admission or development of a biochemical “start time”). Accepting a broad definition of early was intended to optimize the potential for identifying an effect associated with “early” RRT. A limitation of this approach is that “early” according to one study investigator might be considered “late” by another study investigator. “Late” RRT criteria involved either usual practice or expectant care (i.e., no RRT initiated). “Usual practice” generally involved implementing RRT following the development of classic RRT indications unresponsive to medical management.

Statistical analysis

The quality of cohort trials was assessed using the NOQA Scale (range from 0 to 9, with 9 indicating the highest quality) [11]. The NOQA Scale for cohort studies assesses the domains of population selection, comparability of cohorts, and outcome assessment. A meta-analysis was conducted using the high-quality studies to calculate the pooled OR for mortality at 1 month. A random effects model was used because of the significant heterogeneity between studies on this topic. A random effects model is indicated when study populations differ in ways that could impact the results. Heterogeneity was assessed on the basis of the Q value and I 2 and τ2 statistics. A p value less than 0.05 was considered statistically significant. All analyses were performed using Comprehensive Meta-Analysis version 3.3.070 software (www.meta-analysis.com; Biostat, Englewood, NJ, USA).

Results

The systematic literature search yielded 2405 references that were subsequently refined to 36 studies eligible for inclusion in this meta-analysis (see Additional file 3: Figure S2 for article selection breakdown). These references included 7 RCTs [7, 1217], 10 prospective cohort studies [1827], and 19 retrospective cohort studies [2846]. Only nine studies met our criteria for high quality [7, 12, 1417, 21, 35, 40]. A summary of the fundamental characteristics of all evaluated studies is provided in Table 1.

Table 1 Trial Summary Table by Study Type (n=36)

Primary outcome

The observed pooled crude mortality rates varied significantly between the high- and low-quality studies. Among the high-quality studies, the pooled “early” RRT study group mortality rate was 34.6 % (192 of 555) compared with 40.2 % (196 of 487) in the pooled “late” RRT group. The low-quality studies demonstrated a pooled “early” RRT group mortality rate of 51.3 % (1871 of 3645) compared with 54.3 % (1486 of 2737) in the “late” RRT groups. The most frequently reported measurement of illness severity in the studies we analyzed was the Sequential Organ Failure Assessment (SOFA) score. The SOFA score has been correlated with critical care patient outcomes [47, 48], but it is not as robust as other scoring systems validated in predicting survival (e.g., Acute Physiology and Chronic Health Evaluation II [APACHE2] or Simplified Acute Physiology Score II [SAPS2]) [49]. The mean weighted SOFA scores in the high-quality studies were 10.2 and 10.4 in the “early” and “late” groups, respectively. SOFA scores were reported for 78 % of patients in the high-quality studies. Among the high-quality studies, the SOFA score appeared to correspond with an APACHE2 score of approximately 20 or a SAPS2 score of approximately 53 when these additional illness severity metrics were reported by the principal investigators. Unfortunately, more detailed quantitative evaluation of illness severity using APACHE2 or SAPS2 scores was not possible, owing to heterogeneous reporting methods between investigators and a lack of sufficient data. SOFA scores were reported for 65 % of the patients in the studies assigned low-quality ratings. The mean weighted SOFA scores in the “early” and “late” groups among the low-quality studies were comparable to those for the high-quality studies at 10.0 and 9.2, respectively. No further comments can be made regarding illness severity scores among the low-quality studies, owing to lack of homogeneous and sufficient data. Illness severity scores for all studies are summarized in Table 1.

Pooled analysis of the high-quality studies (n = 9) indicates no mortality benefit with “early” versus “late” RRT, with an OR of 0.665 (95 % CI 0.384–1.153, p = 0.146) (Fig. 1). The bulk of the data in support of “early” RRT rests in the pooled low-quality studies (n = 27), with an OR of 0.471 (95 % CI 0.343–0.649, p < 0.001) (Fig. 2). Similarly to authors of previous meta-analyses, we found very high heterogeneity among studies on this topic. Heterogeneity was highest among the low-quality studies, reflected by a Q value of 163.8, I 2 value of 84 %, and τ2 = 0.495 (p < 0.001). Among the high-quality studies, there continued to be statistically significant heterogeneity, with a Q value of 29.1, I 2 value of 72.5 %, and τ2 = 0.481 (p < 0.001). Subgroup analysis of the high-quality studies according to ICU admission type and surgical [14, 16, 21] versus mixed medical admissions [7, 12, 15, 17, 35, 40] demonstrated no significant subgroup mortality benefits associated with “early” RRT (see Additional file 4: Figure S3a and b for forest plots by ICU admission type). Subgroup analysis among the high-quality studies was also conducted using the definition of early according to time criteria (hours or days) versus biochemical parameters (i.e., rising creatinine, uremia, oliguria) (see Additional file 5: Figure S4a and b for forest plots by biochemical or time definition of early). There were no significant effects observed in pooled mortality trends in studies that defined early by time criteria rather than on the basis of biochemical parameters.

Fig. 1
figure 1

Mortality forest plot of pooled analysis of high-quality studies (n = 9)

Fig. 2
figure 2

Mortality forest plot pooled analysis of low-quality studies (n = 27)

Secondary outcomes

The secondary outcomes analyzed included ICU LOS and hospital LOS. Five of the nine high-quality studies reported ICU LOS data [12, 16, 17, 35, 40]. The mean weighted ICU LOS in the “early” group was 9.4 days (n = 351), compared with 10.8 days (n = 281) in the “late” group. None of the studies reported a significant finding with respect to ICU LOS and “early” RRT. Pooled analysis for ICU LOS also demonstrated no significant change in ICU LOS associated with “early” RRT, with a standard difference in the means of −0.035 (95 % CI −0.196 to 0.127, p = 0.674) using a fixed effects model (Q = 0.598, p = 0.963) (Fig. 3). Hospital LOS was reported in five of nine high-quality studies [12, 16, 17, 21, 40]. The mean weighted hospital LOS in the “early” group was 19.3 days (n = 317), compared with 17.1 days (n = 266) in the “late” group. The pooled hospital LOS data do not reveal any significant difference in hospital LOS using a fixed effects model with a standard difference in the means of 0.040 (95 % CI −0.125 to 0.204, p = 0.638) (Fig. 4).

Fig. 3
figure 3

Forest plot of pooled analysis of standard difference of the means for intensive care unit length of stay (n = 5)

Fig. 4
figure 4

Forest plot of pooled analysis of standard difference of the means for hospital length of stay (n = 5)

Discussion

Despite several studies having been conducted on this topic over the last 30 years, a clear answer regarding the optimal timing of RRT in critical illness remains elusive. Our analysis does not confirm the conclusions of previous meta-analyses on this topic. Four studies [12, 14, 21, 35] in the high-quality group were previously included in the meta-analysis by Karvellas et al. [9], and only one study [12] was included in the meta-analysis by Seabra et al. [8]. The addition of four recently published studies [1517, 40] and one high-quality study that was not previously included in meta-analysis [7] accounts for our results that differ from those of earlier authors. Our conclusions build on the concerns raised by both earlier meta-analyses that the results of cohort trials in favor of “early” RRT were not reproduced in methodologically more rigorous study designs (i.e., RCTs). In our further analysis we did not identify critical illness patient subgroups for whom “early” RRT might be more beneficial. Similarly, how one defines early (according to time or on the basis of biochemical characteristics) does not identify a survival advantage associated with “early” RRT compared with usual care. The optimal timing for initiation of RRT is not clarified on the basis of research evaluated to date.

The strength of our present analysis rests on our extensive literature search and strict classification according to study quality to limit risk of type I hypothesis testing error. Prior meta-analyses relied heavily on retrospective cohort study data that possessed incomplete preintervention data or preexisting significant differences in groups which predisposed the investigators to identify a survival difference attributed to “early” RRT that may have been accounted for by the preintervention population differences. We identified differences in the crude mortality rates between the high- and low-quality studies that are incompletely explained. The crude mortality rate differences may be explained by factors that are not adequately controlled for between the groups before the intervention of “early” versus “late” RRT (e.g., unreported regional institutional differences, variation in intensive care resources, institutional setting variability [academic versus community], or natural history variability of the diseases precipitating critical illness). In cohort trials, a difference in preintervention study groups indicates a critical methodological flaw that precludes deriving conclusions from their results. This is referred to as a type I error in hypothesis testing and may falsely attribute differences in outcomes to the study variable rather than the differences between cohorts that existed before analysis. Among high-quality studies, there was no survival advantage to “early” RRT with an OR of 0.665 (p = 0.146). Any inclusion of the low-quality study data would significantly pull the conclusion in favor of “early” RRT, which would represent fulfillment of a type I statistical hypothesis error. The strength of our work is that we vigorously guarded against this possibility.

Subgroup analysis of the high-quality studies did not reveal a survival benefit associated with either a surgical or medical critical care patient population. This conclusion remained the same regardless of whether early was defined by time or on the basis of biochemical parameters. Our secondary outcome analysis was limited by inconsistent and incomplete data reported across studies. Limited pooled analysis of the available data suggested that there was no significant effect on either ICU or hospital LOS associated with “early” RRT. Incomplete data does not permit us to evaluate additional secondary outcomes of interest (such as requirement for mechanical ventilation or rates of renal recovery) that might also be clinically relevant considerations factored into the decision to initiate RRT in critical illness.

By limiting our analysis to studies meeting high-quality criteria, we dismissed a large volume of research on this topic. A critique of our work is that we discarded studies for methodological shortcomings that others may feel should have been included. Most studies (n = 21) in the low-quality group were excluded for incomplete cohort data or significant preintervention differences between cohort groups. The decision to exclude these trials is less controversial than our decision to exclude cohort trials for an NOQA Scale rating less than 7 (n = 6). This is potentially controversial because the NOQA Scale has received criticism regarding its validity and applicability in meta-analysis cohort trial quality assessment [50]. The NOQA Scale has received positive endorsement from some authors [11], but detailed psychometric properties have not been published in peer-reviewed journals to date. Furthermore, our selection of an NOQA Scale rating less than 7 to identify low quality is arbitrary. Our rationale for selecting this cutoff was that it necessitates that at least one of the three NOQA Scale domains be seriously compromised, and we felt that this represented a significant bias predisposing the study results to committing a type I error pattern. Seabra et al. [8] attempted to assign a quality score to trials (0 = lowest quality to 5 = best quality) to evaluate this domain, but their methodology for score assignment was obscure and was not able to be replicated or directly compared with our methods. In qualitative comparison, the study assigned their top score [12] was included in our quantitative analysis; however, their second highest quality study [13] was excluded due to lack of reported illness severity scores between groups. Including studies with methodological errors does not advance scientific understanding of this topic and has contributed to the discordant findings on it.

Early studies on this topic were small and may have overestimated an effect size associated with “early” RRT based on the small size of the study populations. An example of this problem is the Sugahara et al. study [14], where 14-day mortality within the “early” group was 14 % (2 of 14), compared with 86 % (12 of 14) in the “late” group (p < 0.01). While this study was included in our quantitative analysis, the magnitude of the mortality benefit reported in this trial associated with “early” RRT has not been reproduced by subsequent investigators, for reasons that are not clear. In our review of the ongoing trials on this topic registered with the NIH (www.clinicalTrials.gov), we identified three trials [5153] that may add to knowledge in this area. The methodology of all three active RCTs is roughly similar, with patients randomized from a point in time triggered by the development of biochemical renal injury reflected by a RIFLE grade of “failure” (at least one of rise in creatinine by minimum of 300 %, oliguria less than 0.3 ml/kg/h for 12 h, or anuria lasting more than 12 h). From this biochemical entry point, patients will be randomized to immediate initiation of RRT (goal time to RRT less than 12 h) or standard care (RRT initiated after failure of medical management to temporize metabolic derangements or volume overload). These study designs are similar to the design used by Wald et al. [17], included in our analysis, that was able to separate an “early” group to mean time to RRT of 9.2 h and a “late” RRT group with a mean time to RRT of 32 h after biochemical inclusion criteria were met. Wald et al. [17] did not identify a significant difference in mortality rates between their two groups (p = 0.74). These studies in process will add to the quantity of patients evaluated in this manner and will build on the availability of high-quality data on this topic. By clearly defining routine biochemical criteria associated with acute renal injury, they provide a practical method of renal injury assessment that can be determined by intensivists and nephrologists considering RRT.

Conclusions

The results of our meta-analysis contradict the findings reported by previous authors [8, 9], and we conclude that “early” initiation of RRT in critically ill patients with AKI does not improve survival. This conclusion is derived from the pooled high-quality trial data and excludes data from cohort trials where there were methodological shortcomings that predisposed them to find an effect misattributed to the intervention. Pooled analysis of secondary outcomes did not demonstrate a statistical reduction in ICU or hospital LOS. Additional well-designed RCTs will provide greater confidence in these conclusions as optimal patient care practices progress in critical care. Clinical triggers for the initiation of RRT to optimize patient outcomes have not been clearly identified by current research. Meanwhile, intensivists and nephrologists are encouraged to refrain from lowering their clinical thresholds for implementing RRT in critical care patients with acute renal injury.

Key messages

  • High-quality trial data do not demonstrate improved survival using an “early” RRT approach in critical illness complicated by AKI.

  • Lower-quality trial data demonstrate significantly higher mortality rates and form the basis for the bulk of support for “early” AKI.

  • The optimal time to initiate RRT in critical illness remains undefined.

  • A conservative approach to initiating RRT in critical illness is supported.