Introduction

Acute myeloid leukemia (AML) is a relatively rare cancer, but is the most common leukemia in adults in Japan. AML is a highly heterogeneous disease at multiple levels, including its molecular and genetic background, clinical behavior, and sensitivity to chemotherapy. To conduct a scientific clinical study for the treatment of AML, a large number of patients are required to ensure a reliable statistical analysis. The Japan adult leukemia study group (JALSG) was organized in 1987 to conduct a multicenter cooperative study of AML patients in Japan. Although the JALSG consisted of only 14 institutions at its inception, as of April 2012, it included 119 institutions with 94 affiliated hospitals. Since it was formed, the JALSG has conducted six studies in adult AML patients younger than 65 years old, excluding those with acute promyelocytic leukemia, in an effort to establish a standard therapeutic regimen. I herein summarize the JALSG studies of AML, focusing on two trials, AML97 and AML201.

Induction therapy

For more than three decades, the combination of anthracycline and cytarabine (Ara-C) has been a standard induction therapy for AML, and a combination of daunorubicin (DNR) at a dose of 45–50 mg/m2/day for 3 days and Ara-C at a dose of 100–200 mg/m2/day given by continuous infusion for 7 days has generally been used. Many studies have been conducted to attempt to improve the remission rate and to achieve deeper remission. Studies using high-dose Ara-C (HiDAC) found no improvement of the CR rate and a high incidence of treatment-related toxicities [1, 2]. In the late 1980s, however, idarubicin (IDR) was introduced into clinics, and three randomized studies comparing IDR with DNR reported significantly higher CR rates for IDR [35]. Furthermore, a meta-analysis also confirmed that IDR at a dose of 10–12 mg/m2/day for 3 days had a superior effect than DNR at a dose of 45–60 mg/m2/day for 3 days in the achievement of a CR [6]. For this reason, IDR and Ara-C were used as induction therapy in the AML95 [7] and AML97 studies [8]. Both studies yielded CR rates similar to those in the earlier AML87 [9], AML89 [10], and AML92 [11] studies. In these three studies, DNR and other drugs were administered in a response-oriented individualized manner; i.e., additional drugs were given for a few days when the bone marrow at day 8 was not hypoplastic and contained a substantial number of blasts. Therefore, the total doses of DNR administered during the first course of induction therapy were 240–280 mg/m2 given for more than 5–7 days, which was more than the conventional dose of 40–60 mg/m2 given for 3 days. As there had been no direct comparison between a high dose of DNR and the standard dose of IDR (12 mg/m2), we prospectively compared IDR (12 mg/m2 for 3 days) with DNR (50 mg/m2 for 5 days), in combination with Ara-C (100 mg/m2 for 7 days), in the AML201 study [12]. Of the 1064 patients registered, 1057 patients were evaluable. They were randomly assigned to receive either DNR or IDR. The CR rate was 77.5 % (95 % CI 73.8–80.9 %) for the DNR group and 78.2 % (95 % CI, 74.5 %–81.5 %) for the IDR group (P = 0.79). These results showed that the high-dose DNR regimen was not inferior to the IDR regimen. In addition, the 5-year OS was 48 % for the DNR group and 48 % for the IDR group (P = 0.54), and the 5-year relapse-free survival (RFS) rates were 41 and 41 % (P = 0.97), respectively. Consequently, both regimens produced a high CR rate and had good long-term efficacy. However, sepsis occurred more frequently in the IDR group than in the DNR group (8.7 and 4.9 %, P = 0.02) and early death within 60 days occurred more frequently in the IDR group than in the DNR group (4.7 and 2.1 %, P = 0.03). Acute and late-onset cardiotoxicities were reported only in a small number of patients in both groups. We concluded that both regimens can be used, and they are now the standard induction therapy in Japan at this time. Recently, The Eastern Cooperative Oncology Group also reported that a higher dose of DNR (90 mg/m2 for 3 days) improved the CR rate and OS, as were seen in JALSG studies [13]. Given this and JALSG reports, the optimal total dose of DNR is still to be explored but may rest somewhere between 250 and 270 mg/m2.

To identify a better induction therapy for AML, we conducted a phase I trial (AML206) of a combination of gemtuzumab ozogamicin (GO) with the standard induction therapies for patients with relapsed/refractory AML. This study showed that 3 mg/m2 GO combined with Ara-C and the standard dose of IDR or the high dose of DNR can be safely administered [14]. We were planning a phase II trials to investigate the clinical efficacy of this combination therapy, but GO was withdrawn from the US market in 2010 based on data from the Southwest Oncology Group (SWOG) Study S0106, which failed to confirm a clinical benefit and showed a high rate of fatal induction toxicity. More recently, however, the outcomes of the Acute Leukemia French Association (ALFA)-0701 study have been reported [15]. These results showed that the addition of fractionated lower doses (3 mg/m2) of GO to the standard chemotherapy allows the safe delivery of higher cumulative doses and substantially improves CR rate and OS. The optimal dose and schedule should be decided in future trials by leukemia study groups, including the JALSG.

Consolidation chemotherapy

Consolidation therapy is generally given immediately after a patient achieves CR, and its intensity is the same as that for induction therapy. Although intensive consolidation therapy is necessary for curative results, the optimal type and number of cycles have yet to be defined. HiDAC therapy is generally used in the United States and other countries after the landmark Cancer and Leukemia Group B-8525 (CALGB-8525) study [16]. In Japan, however, because HiDAC therapy was not approved by our national medical insurance system until 2001, a standard dose of Ara-C-based combination chemotherapy using non-cross-resistant agents (multiagent CT) was commonly used in early the five studies of adult AML performed by the JALSG. Therefore, we more recently compared conventional multiagent CT with HiDAC therapy in he AML201 trial [17]. In this trial, 781 CR patients were randomly assigned to receive either the HiDAC regimen or the multiagent CT regimen. The 5-year DFS was 43 % for the HiDAC group and 39 % for the multiagent CT group, and the 5-year OS rates were 58 and 56 %, respectively. There were therefore no significant differences in the DFS and OS between the groups. However, among the favorable cytogenetic risk group (n = 218), the 5-year DFS was 57 % for the HiDAC and 39 % for multiagent CT (P = 0.050) (Fig. 1), but the 5-year OS rates were 75 and 66 %, respectively (P = 0.174). After each course of consolidation, the nadir of the WBC count was significantly lower and the day when a WBC less than 1.0 × 109/L was noted was significantly longer in the HiDAC group. The frequency of documented infections was also significantly higher in the HiDAC group, even though the frequency and the number of days of granulocyte colony-stimulating factor (G-CSF) administration were significantly higher in the HiDAC group. However, the subset analysis showed that the higher incidence of documented infection following treatment with the HiDAC regimen was noted only in the intermediate cytogenetic risk group. Death in CR was reported in only a small number of patients in both groups. Therefore, HiDAC therapy can be recommended only for patients with favorable cytogenetics. This limitation of HiDAC therapy is associated with lower medical expenses, because HiDAC therapy and granulocyte colony-stimulating factor administration are costly.

Fig. 1
figure 1

Relapse-free survival by treatment arm and karyotype risk. a Relapse-free survival of CR patients by treatment arm. Predicted 5-year RFS was 43 % for the HiDAC group (n = 389) (red line) and 39 % for the standard dose group (n = 392) (blue line) (P = 0.724). b Disease-free survival by treatment arm for the favorable cytogenetic risk group. Predicted 5-year DFS was 57 % for the HiDAC group (n = 108) (red line) and 39 % for the multiagent CT group (n = 110) (blue line) (P = 0.050). c Disease-free survival by treatment arm for the intermediate cytogenetic risk group. Predicted 5-year DFS was 38 % for the HiDAC group (n = 242) (red line) and 39 % for the multiagent CT group (n = 256) (blue line) (P = 0.403). d Disease-free survival and overall survival by treatment arm for the adverse cytogenetic risk group. Predicted 5-year DFS was 33 % for the HiDAC group (n = 27) (red line) and 14 % for the multiagent CT group (n = 14) (blue line) (P = 0.364)

Hematopoietic stem cell transplantation as consolidation therapy

Allo-HSCT is the most intensive treatment option for AML, but the low relapse rate is usually offset by a high incidence of treatment-related mortality (TRM). Many clinical trials have compared allo-HSCT with chemotherapy as a post-remission therapy for the patients with AML in CR. Some studies have not found a benefit of allo-HSCT on either the DFS or OS [18, 19], and some showed an advantage of allo-HSCT only on the DFS when it was compared with chemotherapy/auto-transplantation [20, 21]. Most of these studies targeted all patients in CR as a single population, without prospective stratification by their prognostic factors. We prospectively compared the effectiveness of allo-HSCT with chemotherapy among CR patients aged 15–50 years, who were stratified into intermediate- or poor risk groups following the JALSG scoring system in the AML97 study [22]. Seventy-three CR patients with and 92 without an HLA-identical sibling were assigned to the donor and non-donor groups. Of the 73 patients in the donor group, 38 (52 %) received allo-HSCT during the first CR and 17 (23 %) after relapse. An intention-to-treat analysis revealed that the relapse incidence was reduced in the donor group (52 vs. 77 %; P = 0.008), and that the DFS was improved (39 vs. 19 %; P = 0.016), but the OS was not significantly different (46 vs. 29 %; P = 0.088). However, an OS benefit was seen in the patients aged 36–50 years (49 vs. 24 %; P = 0.031). Therefore, it is reasonable that allo-HSCT should be performed for intermediate or poor risk patients with an HLA-identical donor when they are in CR.

Maintenance therapy

In contrast to consolidation therapy, maintenance therapy is of low intensity and is prolonged, but the benefits of such therapy remain unclear. The AML87 trial [9] showed that a significantly longer DFS was achieved when 12 courses of moderately intensive maintenance therapy were administered instead of the four courses of maintenance therapy. However, to preserve the quality of life of the patients in the following AML trials (AML89 [10], AML92 [11], and AML95 [7]), six courses of maintenance therapy were used, and demonstrated a better OS than that achieved in the AML87 study. Six courses of maintenance therapy became the standard regimen in Japan thereafter. We conducted the AML97 trial [8] to compare four courses of standard-dose consolidation therapy without maintenance (Arm A) with three courses of consolidation therapy followed by six courses of maintenance therapy (Arm B). In the AML97 trial, 998 CR patients were assigned randomly to either Arm A or Arm B. The 5-year OS rate was 52.4 % for Arm A and 58.4 % for Arm B (P = 0.599), and the 5-year DFS rates were 35.8 and 30.4 % (P = 0.543), respectively. In analyzing the data by risk group, no statistical difference was observed either in the 5-year OS or in the 5-year DFS between the two arms. We concluded that prolonged consolidation and maintenance therapy could be successfully achieved with four courses of consolidation therapy in all AML risk groups. Considering the results of the AML87 [9] and AML97 [8] trial, the benefits of maintenance therapy seemed to be dependent on the intensity of the consolidation given prior to the maintenance therapy. Given the recent use of intensive consolidation, maintenance therapy is therefore considered to be unnecessary. Moreover, avoiding the use of maintenance therapy improves the quality life of adult AML patients.

Salvage therapy

Over 50 % of CR patients eventually relapse within 3 years, despite intensive post-remission chemotherapy. Considering that relapsed AML is not curable by treatment with chemotherapy alone, and that the outcome after allo-HSCT in the second CR is comparable to that after transplantation in the first CR, salvage therapy followed by allo-HSCT is imperative to increase the cure rate for patients with AML. HiDAC is one of the promising salvage therapies, but its efficacy for early relapsed patients is limited. Based on the finding that fludarabine augments the accumulation of Ara-C-triphosphate, investigators at the M. D. Anderson Cancer Center developed a combination therapy with fludarabine, HiDAC and G-SCF (FLAG) [23]. We developed a FLAGM regimen by adding mitoxantrone to FLAG. After determining the safe dosage of the combination therapy in a phase I study [24], we performed a phase II study using FLAGM for relapsed/refractory adult AML patients. Forty-one patients aged 18–64 years (median 52) enrolled in this trial. The FLAGM regimen yielded a high CR rate (73 %) with a relatively low toxicity profile, regardless of the duration of the first CR or the preceding HiDAC regimen, and it appears to be a good salvage therapy for relapsed/refractory AML patients, which enables a high proportion of patients (80 %) to undergo allo-HSCT [25]. Therefore, our FLAGM regimen represents a promising salvage regimen for relapsed/refractory AML in young adults. The JALSG is planning to conduct a large-scale prospective study with FLAGM.

Prognostic factors

Prognostic factors are important not only for predicting prognosis, but also for stratifying patients to determine optimal treatment. This is especially true because of the heterogeneity of AML. Cytogenetics is one of the most important prognostic factors. The karyotype of the leukemic cells can be divided into three groups categorized as favorable, intermediate, and poor risk, based on the specific karyotype [26, 27].

It has recently been reported that mutations or overexpression of specific genes have prognostic significance [28]. In core binding factor (CBF) AML [e.g. AML with t(8;21) or inv(16)], mutations in KIT have been shown to have a potentially unfavorable influence. In cytogenetically normal AML (CN-AML), which is identified in about 50 % of patients within intermediate risk, fms-like tyrosine kinase 3 gene-internal tandem duplications (FLT3-ITD) [29], CCAAT/enhancer binding protein alpha (CEBPA), and nucleophosmin1 (NPM1) mutations are sometimes identified alone or in combination. FLT3-ITD is associated with unfavorable outcome, while CEBPA and NPM1 are associated with favorable outcome if present without FLT3-ITD. The European leukemiaNet therefore proposed a new classification that includes data from a cytogenetic analysis and from a mutation analysis of the NPM1, CEBPA, and FLT3 genes [30]. High expression of the brain and acute leukemia cytoplasmic (BAALC) gene and the ETS-related gene (ERG) has a negative impact on the prognosis of patients with CN-AML.

The JALSG developed a different scoring system (JALSG scoring system) using the karyotype, age, WBC count at presentation, FAB classification, positivity for peroxidase in blasts, performance status (PS), number of induction therapies required to achieve complete remission (CR), which were extracted for achieving CR, DFS, and overall survival (OS) using a multivariate analysis of the results from three studies (AML87 [9], AML89 [10], AML92 [11]) [31]. We prospectively applied the JALSG scoring system to stratify patients in the AML97 study [22], in which allogeneic hematopoietic stem cell transplantation (allo-HSCT) was assigned as consolidation therapy to those with an intermediate or poor risk classification. The OS and DFS of the CR patients in the AML97 study were clearly divided into three groups by the JALSG scoring system. Furthermore, it was shown that the JALSG scoring system more efficiently selected poor responders to chemotherapy in the favorable risk group, and good or poor responders to chemotherapy in the intermediate risk group compared to karyotype-based stratification [32]. We are now planning to create a new JALSG scoring system based on the clinical data from the AML201 study, in which more than 1000 AML patients were enrolled, as the results of the gene mutation analyses are not clinically available.

Conclusion

The JALSG has conducted six randomized, phase III multicenter trials in the 25 years since its formation, and has established the standard induction and post-remission therapy in Japan. The results of these trials are summarized in Table 1 and Fig. 2. Unfortunately, the CR rate has not changed significantly, and the 7-year OS rates in all of the studies, except for the AML201 trial, have ranges from 28.6 to 36.2 % (Table 1; Fig. 1). However, the OS in the AML201 trial was markedly improved, reaching 48.0 %. Table 2 shows the frequency of allo-HSCT in each trial. Over time, allo-HSCT has gradually increased. Notably, about 50 % of the patients enrolled in the AML201 trial underwent allo-HSCT. Considering these data, it is highly possible that the improvement of the OS in the AML201 trial resulted from the large number of patients who underwent allo-HSCT at the primary induction failure and after relapse. These data not only suggest that allo-HSCT provides an OS benefit, but also that our chemotherapy approaches have reached the limit of their efficacy for AML in young adults.

Table 1 Outcome of JALSG studies for AML
Fig. 2
figure 2

Overall survival according to JALSG studies. (allo-HSCT non-censored)

Table 2 Frequency of allo-hematopoietic stem cell transplantation in JALSG studies for AML

To further improve the prognosis of young adult AML patients, given that many genomic alterations in AML have been demonstrated to influence the response to clinical therapies, further efforts to integrate genome-wide molecular alterations with the treatment response and outcome are needed. The JALSG is now conducting the AML209 Genomic Study (AML209-GS) (UMIN 000003432) to clarify the genotypes of patients so that they can be stratified for individual therapies by prospectively and comprehensively analyzing the genetic alterations in 1500 newly diagnosed AML patients. In the AML209-GS study, the AML cells obtained from registered patients are currently being subjected to the analyses of cytogenetics and comprehensive gene mutations such as those of FLT3, NPM1, CEBPA, MLL-PTD, KIT, NRAS, TP53, WT1 and IDH. Furthermore, to evaluate the efficacy and safety of the individual therapies based on the FLT3-ITD or KIT mutation status, the JALSG is conducting the AML209-FLT3-SCT and CBF-AML209-KIT studies. Patients carrying a mutation in FLT3-ITD who achieve CR and are younger than 50 years old are registered in the AML209-FLT3-SCT study, and will receive allo-SCT during the 1st CR. The CBF-AML patients who achieve CR are registered in the CBF-AML20-KIT study, and will be treated with three courses of HiDAC. In this study, three types of KIT mutations are being analyzed and evaluated for their prognostic impacts.

More recently, the JALSG AML209 genome-wide study (AML209-GWS) has been initiated as an accompanying study for the AML209-GS study, in which whole exon or genome sequences, as well as the methylome of AML cells, will be examined to identify the genetic and/or epigenetic status associated with the long-term prognosis and the sensitivity to and adverse events from chemotherapies. These JALSG studies are expected to establish a novel stratification system for treatment decision making, providing a more personalized therapy for Japanese AML patients. We are hopeful that a large number of patients can be recruited, and that these studies can be successfully accomplished in the short-term.