Background

Acute lymphoblastic leukemia (ALL) is a hematologic malignancy of bone marrow featured by the overproduction of immature lymphoblasts [1]. It represents 75–80% of childhood acute leukemias and 20% of all leukemias in adults, with approximately 6000 cases diagnosed every year in the United States [1, 2]. Despite evolving treatment protocols, the relapse rate is approximately 15–20% in ALL, and the cure rate is much lower after relapse [3]. These relapses are due to the persistence of residual malignant cells, namely minimal residual disease (MRD), that cannot be detected by the morphological examination of the bone marrow [4]. Great efforts have been made to standardize MRD quantification using real-time polymerase chain reaction (PCR) of immunoglobulin and T-cell receptor (TCR) gene rearrangements, real-time PCR-based detection of fusion gene transcripts [e.g., breakpoint cluster region-Abelson (BCR-ABL)] or breakpoints, and flow cytometric immunophenotyping [5, 6]. MRD allows a more precise assessment of treatment efficacy and reduction of leukemic burden [7]. It has important prognostic and therapeutic implications for adults and children with ALL [8, 9]. The UKALL 2003 trial suggested that MRD risk stratification was helpful in adjusting the treatment intensity [10].

Allogeneic stem cell transplantation (allo-SCT) is the preferred treatment for adults with relapsed disease and children with high-risk relapses [11, 12]. The SCT mortality due to relapse is 30–40% in adults and children. The treatment-related mortality is also 30–40% in adults but lowers in children [13, 14]. A body of evidence indicated a direct correlation of the likelihood of relapse after transplant with the MRD status before transplantation [15, 16]. However, this significant association was not observed in some studies [17,18,19,20]. Also, patients’ age, detection methods, and adjustment of clinical covariates largely varied among different studies [16]. Additionally, the impact of MRD on overall survival (OS) and nonrelapse mortality (NRM) remained uncertain. Therefore, this systematic review and meta-analysis was conducted to explore the impact of MRD prior to transplantation on the prognosis for patients with ALL.

Methods

Search strategy and inclusion criteria

The meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis statement [21]. Studies in PubMed, Embase, and the Cochrane Library were searched from the database inception to March 2016, using the following text and/or medical subject heading terms: (1) “acute lymphoblastic leukemia” or “acute lymphoblastic leukaemia”; (2) “minimal residual disease”; (3) “relapse” or “relapse-free survival” or “leukemia-free survival” or “leukaemia-free survival” or “disease-free survival” or mortality; and (4) transplantation. The search was restricted to publications in the English language. The references of included studies were screened for potentially missing records. This systematic review with meta-analysis was not registered in a trial registry.

Studies considered for inclusion were as follows: (1) reported the detection of bone marrow MRD prior to allo-SCT in patients with ALL; (2) had no limitation in terms of the age of included patients; (3) cohort study, prospective or retrospective; (4) published in English; and (5) presented data on the main outcomes of relapse, relapse-free survival (RFS), event-free survival (EFS), and/or NRM. Disease-free survival (DFS) and leukemia-free survival (LFS) were interpreted as synonymous with RFS.

Study selection and quality assessment

Two independent reviewers (ZLS and XZG) screened the citations for inclusion based on titles and abstracts. Multiple studies involving the same cohort of patients (or duplicate patient populations) were identified and combined. Only the most recent or comprehensive study was selected to avoid double-counting. The Newcastle–Ottawa Scale (NOS) for nonrandomized studies was used to assess the quality of included studies. The items included patient selection (4 points), comparability of cohorts (maximum 2 points), and outcome assessment (maximum 3 points), with a total of 9 points [22]. The data extraction and quality assessment were conducted independently by two authors (LFY and LY). The information was examined and adjudicated independently by an additional author (WWM) referring to the original studies.

Statistical analysis

Time-to-event data were most appropriately analyzed using the hazard ratios (HRs). Thus, the HR and its 95% confidence interval CI) were used as summary effect estimates for these outcomes. Adjusted HRs were directly extracted from the results of multivariate analysis using the Cox regression model. When missing data regarding adjusted HR were encountered, it was indirectly estimated using the Kaplan–Meier (KM) curves guided by the method of Tierney et al [23]. The random-effects model was used for meta-analysis. The heterogeneity between the included studies was assessed using the Cochran Q test and the I2 statistics. A P value less than 0.1 or I2 values >50% was regarded as heterogeneity [24]. Subgroup analyses were performed based on the following clinical variables: study design (retrospective or prospective), region (USA, Europe, or Asia), population (adults or children), MRD assay modality [PCR or flow cytometry (FC)], and source of effect estimates (adjusted by multivariate analysis or unadjusted from KM curves). A sensitivity analysis was performed by excluding the studies one by one. Publication bias was visually explored using funnel plots and statistically assessed using Egger’s and Begg’s tests. All data were synthesized using the STATA 12.0 software and “metan” package (StataCorp LP, TX, USA). Two-tailed P ≤ 0.05 was considered significant for all statistical analyses.

Results

Literature search

A total of 418 studies were identified, including 221 from PubMed, 158 from Embase, and 39 from the Cochrane Library. The 158 duplicate studies were discarded. Also, 64 reviews or meta-analyses and 146 studies on irrelevant topics were removed. Fifty full-text studies were assessed for eligibility. Moreover, 4 studies of autologous stem cell transplantation and 19 studies that did not present the association between pre-transplant MRD and outcomes were discarded. Twenty-seven studies were included in qualitative synthesis. As 6 studies had insufficient data, 21 studies were finally pooled into the meta-analysis. The flow diagram of study selection is shown in Fig. 1.

Fig. 1
figure 1

Study selection process

Study characteristics and quality appraisal

The characteristics of 21 included studies are shown in Table 1. These articles were published between 1998 and 2016, including 9 retrospective studies and 12 prospective studies. The sample size ranged from 29 to 522. Seven studies enrolled mainly adult patients, 11 studies included mainly pediatric patients, and 3 studies comprised a mixture of adults and children. Six studies used FC and 15 studies employed PCR to detect MRD. The quality assessment of included studies is shown in Additional file 1: Table S1. The NOS score of included studies ranged from 5 to 9, and the important items included a representative of MRD (+) patients, comparability, and adequate follow-up duration. Seven studies did not enroll representative patients with ALL, including two studies of relapsed ALL, [25, 26] two studies of Philadelphia-positive ALL, [19, 27] one study of Philadelphia-negative ALL, [28] and two studies of high-risk ALL [18, 29]. For the item of comparability, the score was deducted when the study did not present sufficient adjusted effect estimates [14, 18, 28,29,30,31,32,33]. Only seven studies with 5-year outcomes in the follow-up were considered as the adequate follow-up [14, 18, 34,35,36,37,38].

Table 1 Characteristics of included studies

Relapse

Twelve studies investigated the association between pre-transplantation MRD and cumulative incidence of relapse [18, 19, 27, 29, 31,32,33,34, 36, 37, 39, 40]. HRs from five studies were derived from the KM curves, [18, 29, 31,32,33], and HRs from seven studies were obtained from the multivariate analysis [19, 27, 34, 36, 37, 39, 40]. Patients with positive MRD prior to allo-SCT had a significantly higher rate of relapse compared with those with negative MRD (HR = 3.26; 95% CI 2.23–4.75, P < 0.05) (Fig. 2). Moderate heterogeneity was revealed (I2 = 46%; P < 0.05). Subgroup analyses were conducted according to the following variables: population, design, region, detection method, and adjustment of HR. The pooled results remained statistically significant for all stratified analyses (except for the single Asian study), suggesting the robustness of the relationship. The heterogeneity was low or nonsignificant for the subgroups of adult patients, retrospective studies, United States of America (USA), FC, and adjusted HR, indicating that these factors might explain the observed heterogeneity (Table 2). A sensitivity analysis was further conducted. After each study was sequentially excluded from the pooled analysis, the conclusion was not affected by the exclusion of any specific study.

Fig. 2
figure 2

Forest plot showing the association between pre-transplant MRD and relapse after allo-SCT

Table 2 Subgroup analyses for the outcome of relapse

RFS

Data on RFS were directly obtained from seven studies, [19, 27, 34, 35, 38,39,40] or indirectly calculated from three studies [28, 31, 33]. Pre-transplantation positive MRD was a significant negative predictor of RFS (HR = 2.53; 95% CI 1.67–3.84; P < 0.05) (Fig. 3). Statistically significant heterogeneity was revealed (I2 = 74.1%; P < 0.05).

Fig. 3
figure 3

Forest plot showing the association between pre-transplant MRD and relapse-free survival after allo-SCT

Subgroup analyses were performed, and the results are shown in Table 3. Notably, the pooled results in the subgroups of Asian population and unadjusted HRs were nonsignificant. The heterogeneity remained moderate to high among all subgroups, indicating that none of the stratifying variables could explain the high heterogeneity. In a sensitivity analysis, the conclusion was not affected by the exclusion of any specific study after each study was sequentially excluded from the pooled analysis.

Table 3 Subgroup analyses for the outcome of relapse-free survival

EFS

Eight studies explored the data on EFS, with four on adjusted HRs [25, 26, 35, 41] and four on unadjusted HRs [14, 30, 31, 36]. All studies were conducted in Europe. The pooled analysis revealed a significant correlation of positive MRD before allo-SCT with worse EFS (HR = 4.77; 95% CI 3.31–6.87; P < 0.05) (Fig. 4). No significantly low heterogeneity was shown (I2 = 30.5%; P > 0.1).

Fig. 4
figure 4

Forest plot showing the association between pre-transplant MRD and event-free survival after allo-SCT

In the subgroup analyses, the pooled results remained significant in all subgroups (Table 4). No heterogeneity was shown for the subgroups of pediatric patients, retrospective studies, and FC. In a sensitivity analysis, the conclusion did not change substantially by removing any single study after excluding the included studies one by one.

Table 4 Subgroup analyses for the outcome of event-free survival

OS

Ten studies showed data on OS outcome [19, 28, 29, 32, 35, 37,38,39,40, 42]. The adjusted HRs were directly obtained from four studies [19, 35, 37, 42] and indirectly derived from KM curves in six studies [28, 29, 32, 38,39,40]. The pooled data demonstrated that patients with ALL having positive MRD prior to allo-SCT had a significantly unfavorable OS (HR = 1.98; 95% CI 1.40–2.80; P < 0.05) (Fig. 5). A significant heterogeneity was revealed (I2 = 67.2%; P < 0.01).

Fig. 5
figure 5

Forest plot showing the association between pre-transplant MRD and overall survival after allo-SCT

In stratified analyses, the pooled data for the subgroups of adult patients and Asian population showed no statistical significance (Table 5). The heterogeneity was low in the three subgroups of the population and the subgroups of American or Asian studies. In a subgroup analysis, the exclusion of any specific study did not alter the pooled conclusion significantly.

Table 5 Subgroup analyses for the outcome of overall survival

NRM

Three studies were eligible [19, 25, 40]. Positive MRD prior to transplantation was not associated with a higher rate of NRM (HR = 1.24; 95% CI 0.79–1.96; P = 0.35) (Fig. 6). A high heterogeneity was revealed (I2 = 69.4%; P < 0.05). Subgroup analyses or publication test was not conducted due to a limited number of studies. The three studies were excluded one by one, and the conclusion did not change significantly.

Fig. 6
figure 6

Forest plot showing the association between pre-transplant MRD and non-relapse mortality after allo-SCT

Publication bias

The publication bias for the outcomes of relapse, RFS, EFS, and OS was assessed. Notably, the funnel plots were asymmetrical for the outcome of relapse (Fig. 7a) and RFS (Fig. 7b). In contrast, the plots were symmetrical for the outcomes of EFS (Fig. 7c), and OS (Fig. 7d). When statistically assessed using the Egger’s test, the publication bias was statistically significant for relapse (P = 0.02) and RFS (P = 0.02), but not for EFS (P = 0.20) or OS (P = 0.49). Further, the results of Begg’s tests were examined, showing no publication bias for relapse (P = 0.06), RFS (P = 0.21), EFS (P = 0.39), and OS (P = 0.86).

Fig. 7
figure 7

Funnel plots for the outcomes of relapse (a), relapse-free survival (b), event-free survival (c), and overall survival (d)

Discussion

The prognostic value of pre-transplant MRD was demonstrated in several ways. First, the collected data revealed that patients with positive MRD before allo-SCT had a higher cumulative incidence of relapse in the follow-up. In accordance, the RFS was much shorter for the MRD (+) patients. The EFS was a composited outcome defined as the time from allo-SCT to the first occurrence of relapse or death. Positive MRD before allo-SCT was also an independent negative predictor of EFS. Furthermore, patients with positive pre-transplant MRD were proved to have a significant dismal OS. To the best of my knowledge, this meta-analysis was the first to appraise the role of MRD assessment in the pre-transplant setting in patients with ALL.

The prognostic power of pre-transplant MRD, as well as the sources of heterogeneity, was examined by subgroup analyses. A careful statistical process was conducted with caution that several clinical covariates might interact with the role of MRD and potentially cause bias. Thus, the adjusted estimates were preferred. The most relevant studies paid attention to the well-known confounding factors of disease status (CR1 or CR2), age, sex, and genetic mutations. Notably, the pooled HRs from the adjusted multivariate analysis or crude analysis from KM curves uniformly demonstrated a significant association between pre-transplant MRD and the outcomes of relapse, EFS, and OS. Only for RFS, the crude analysis by three studies failed to show the significant correlation. The included studies using PCR assay mostly amplified the rearranged immunoglobulin and TCR gene segment in the leukemic clone. Only few studies tested the fusion gene of BCR-ABL1. The FC-based assay, by examining the immunophenotypes, was advantageous in terms of rapid process and readily available results. Subgroup analyses demonstrated that the statistical significance for relapse, RFS, EFS, and OS was estimated using either PCR or FC method. The leukemogenic events were different for pediatric and adult patients with ALL [43]. A higher prevalence of unfavorable genetic subtypes, such as the BCR-ABL1 fusion protein, was observed among older patients [2]. The latest National Comprehensive Cancer Network guideline stated that the OS decreased substantially with increased age for patients with ALL [1]. Interestingly, MRD was not found to be a significant predictor of dismal OS for adult patients with ALL. However, MRD prior to transplantation was a constant negative predictor among the outcomes of relapse, RFS, and EFS, regardless of discrepancies in patients’ ages. Compared with prospective studies, retrospective studies relied on data recall or information from previous medical records. However, a significant difference was not observed by analyzing retrospective or prospective studies alone.

Of note, when assessing the outcome of NRM, the MRD status prior to transplantation did not have a significant role. Myeloablative conditioning for MRD-positive patients versus reduced intensity conditioning for patients with undetected MRD might affect the NRM outcomes. A small number of studies might limit the statistical power. Additionally, some other factors might outweigh MRD in predicting NRM. Previous evidence suggested that younger adults had reduced post-transplant mortality. Myeloablation might be not feasible in patients older than 35 years because a higher toxicity was more commonly seen in these recipients [26, 44]. Thus, MRD might only be a subordinate factor for this outcome.

This meta-analysis had several strengths. It included 21 studies with a total of 2323 patients around the world. Comprehensive prognostic outcomes, including relapse, RFS, EFS, OS, and NRM, were evaluated. The prognostic value of MRD was appraised according to different assaying modalities, populations, and study designs. Both adjusted and crude data were presented. Largely, the association between MRD and outcomes remained stable among subgroups. It was confirmed that the detection of MRD was of considerable importance in identifying patients with poor outcome after allo-SCT. MRD was advocated to be a useful molecular biomarker for accurate triage of patients’ =pre-transplantation and preemptive escalation of post-transplant interventions [15].

This study also had several shortcomings. Several studies collected data retrospectively. Some studies had small sample sizes, which might have reduced the statistical power. Patients’ disease status of NRM might have biased this relationship. The inclusion criteria were heterogeneous, and patients were treated using varied chemotherapy protocols. The timing and duration of follow-up were inconsistent. Also, the definitions of assay-specific thresholds and the lack of one universal detection method or testing target were heterogeneous among the included studies. No consensus was reached regarding the standardization of MRD measurement. Furthermore, it failed to give strong justification for providing a quantitative assessment of the influence of pre-transplant MRD. A multitude of confounding factors, such as the use of tyrosine kinase inhibitors, the pre-transplant remission type (CR1 or CR2), donor source, and graft-versus-leukemia, were not sufficiently adjusted in many studies when analyzing the impact of MRD. In fact, these factors were even inconsistent within an individual study [25, 40]. Even for studies that reported adjusted HRs, the degree of adjustment largely varied. The subgroup findings should be considered as exploratory, and thus would need to be tested in original studies. Finally, this study was conducted with summary statistics rather than with individual data, which might have ignored the impact of some covariates on the outcomes at the patient level. The availability of data from individual patients could resolve this problem and increase the power of meta-analysis.

Future studies should aim to decide how best to use the prognostic information of MRD. Several ways can be considered to improve the outcomes for MRD (+) patients at transplantation. Pre-transplantation treatment with non-cross-resistant agents might be helpful in decreasing the residual malignant clone [14, 40]. Preemptive immunotherapy or chemotherapy might be beneficial during the post-transplantation stage [45, 46]. Lankester et al. preliminarily revealed that alloimmune intervention after allo-SCT was feasible in reducing residual leukemic cells [17]. Further, a randomized trial should be performed on patients with ALL in complete remission who had positive MRD and received either allo-SCT or additional novel chemotherapy before transplantation.

Conclusions

In conclusion, this meta-analysis provided evidence that positive MRD prior to allo-SCT was associated with higher relapse and poor survival in patients with ALL. Allo-SCT appeared to be insufficient for some patients with positive MRD at transplantation. The findings of this study suggested the rationale for future studies to prevent relapse and improve survival for this group of high-risk patients.