Abstract
The ability to identify a specific type of leukemia using minimally invasive biopsies holds great promise to improve the diagnosis, treatment selection, and prognosis prediction of patients. Using genome-wide methylation profiling and machine learning methods, we investigated the utility of CpG methylation status to differentiate blood from patients with acute lymphocytic leukemia (ALL) or acute myelogenous leukemia (AML) from normal blood. We established a CpG methylation panel that can distinguish ALL and AML blood from normal blood as well as ALL blood from AML blood with high sensitivity and specificity. We then developed a methylation-based survival classifier with 23 CpGs for ALL and 20 CpGs for AML that could successfully divide patients into high-risk and low-risk groups, with significant differences in clinical outcome in each leukemia type. Together, these findings demonstrate that methylation profiles can be highly sensitive and specific in the accurate diagnosis of ALL and AML, with implications for the prediction of prognosis and treatment selection.
Similar content being viewed by others
Introduction
Acute lymphocytic leukemia (ALL) and acute myelogenous leukemia (AML), two common types of human acute leukemia, arise from hematopoietic progenitors of lymphoid or myeloid lineage or from hematopoietic stem cells. The diagnosis of leukemia based on pathological and molecular subtype as well as other histological markers is currently the gold standard for the selection of proper treatment and prognosis stratification.1,2,3 Immunological and molecular-based classifications are also used in the treatment decision-making process for ALL or AML. However, they still lack accuracy, especially in prognosis and survival predictions.
Epigenetic changes such as chromatin modification, microRNA expression changes, and DNA methylation changes have been reported extensively in cancer studies.4 The methylation pattern of CpG sites is an epigenetic regulator of gene expression.5,6 Extensive alterations in DNA methylation have been noted in almost all cancer types, causing changes in gene expression that promote oncogenesis.5,7,8 Both epigenetic and somatic mutations have promise for improving the characterization of malignancy to predict treatment response and prognosis.7,9,10,11 Particular changes in methylation profiles are postulated to be reproducibly found in specific cancer types. In contrast, somatic mutations, with some notable exceptions, typically show neither specificity nor sensitivity for a particular cancer type. Even within commonly mutated genes, individual mutations may be found across tens or hundreds of kilobases, limiting the utility of targeted sequencing of these molecular markers.12,13
Methods for DNA methylation evaluation can be classified into enzyme-based, anti-methylcytosine antibody-based, and bisulfate treatment-based approaches.14 Although each approach provides specific advantages over the others, the bisulfate treatment-based method has been the most widely utilized method due to its reproducibility and single base-pair resolution and the existence of particulate padlock primer-based bisulfate sequencing.15,16 Compared to other bisulfate treatment-based methods, the padlock-based method is more cost-effective, methylation position-specific, and flexible to modification; therefore, it has been commonly utilized for single-base-pair-resolution analysis.17 In our study, a padlock probe set was generated from 729 CpG markers that showed differential methylation values in many cancer types when compared to the corresponding normal tissues.18
Thus, to explore the utility of methylation patterns in differentiating leukemic cancers and improving prognosis, we analyzed the whole-genome methylation profiles of blood samples from patients with ALL and AML and healthy controls. We also used methylation patterns to predict survival in these patients. These markers not only outperformed present-day methods in their high sensitivity and specificity for diagnosis but also demonstrated the effect of stratifying patients with different prognoses.
Results
Characteristics of patients
Clinical characteristics and molecular profiles, including methylation data for our study cohort, were obtained for 194 AML patients, 136 ALL patients, and 754 healthy individuals. The clinical characteristics of the AML and ALL patients in the study cohorts and healthy controls are listed in Table 1.
Genome-wide methylation profiling identifies specific methylation signatures in leukemia
We randomly split the TCGA AML samples, Chinese ALL samples, and normal blood samples of healthy controls into training and validation data sets at a 70/30 ratio using R. We then compared methylation differences between the TCGA AML samples and normal blood samples and between the Chinese ALL samples and normal blood samples in the training data sets using the nearest shrunken centroids method.19 Two sets of CpG sites were then identified and used to differentiate the TCGA AML samples from normal blood samples and the Chinese ALL samples from normal blood samples in the validation data sets. This method of random splitting was repeated 20 times. Tables 2A, 2B, 3A, 3B shows confusion tables describing the performance of these classifiers in differentiating AML and ALL samples from normal blood samples on one of the 20 training and validation data sets. The 20 sets of CpG sites identified through AML-normal comparison revealed four common CpG sites. These four CpG sites were plotted in an unsupervised fashion in AML versus normal blood samples (Fig. 1a). The accuracy of using these four CpG sites for predicting AML leukemia was assessed by the ROC curve (Fig. 1b), which had an AUC of 0.9998.
Similarly, we identified seven common CpG sites through the ALL-normal comparison (Fig. 2a). The accuracy of using these seven CpG sites for predicting ALL leukemia was assessed by the ROC curve (Fig. 2b), which had an AUC of 0.9995. It is worth noting that two common CpG sites (cg05304729 and cg18518074) appeared both in the AML-normal comparison and in the ALL-normal comparison (Figs. 1a, 2a). Taken together, these data demonstrated that differential methylation of CpG sites was able to distinguish the blood of particular leukemia types from normal blood with high specificity and sensitivity (Figs. 1b, 2b). Overall, these results demonstrate the robust nature of these methylation patterns in identifying the presence of a particular type of leukemia.
Methylation profiles can distinguish between different leukemia
We have shown the ability of our method to distinguish between the blood of particular types of leukemia and normal blood samples. We then investigated whether our algorithm was able to distinguish different types of leukemic cancers (ALL and AML) arising from bone marrow. We identified five CpG sites that could be used to differentiate the TCGA AML samples from our Chinese ALL samples (Fig. 3a) and generated confusion tables (Tables 2C, 3C) describing the performance of our classifiers on one of 20 training and validation data sets consisting of the TCGA AML samples and the Chinese ALL cohort samples used in Tables 2A, 2B, 3A, 3B. It is worth noting that among these five CpG sites, one (cg00142402) was also identified in the AML and normal comparison, and two (cg08261841 and cg09247255) were also identified in the ALL and normal comparison. The accuracy of using these five CpG sites for differentiating between AML and ALL can be assessed by the ROC curve (Fig. 3b), which had an AUC of 0.9998. Together, these results demonstrate the efficacy of using methylation patterns for the accurate diagnosis of a cancer histological subtype. The 11 unique CpG sites that could differentiate among TCGA AML, Chinese ALL and normal blood samples are plotted in an unsupervised fashion in Fig. 4.
Methylation profiles predict prognosis and survival rates
We investigated the effect of methylation markers on the survival rate of each leukemia subtype (AML and ALL) based on a semisupervised method.20 Specifically, for each leukemia subtype, the CpG sites in the training data were ranked based on their Cox scores. Thirty-nine CpG sites whose Cox scores exceeded 2.197 (corresponding to the 96th percentile of the AML Cox scores) and 93 CpG sites whose Cox scores exceeded 3.215 (corresponding to the 92nd percentile of the ALL Cox scores) were selected, and their methylation profiles were used to classify 125 AML patients and 102 ALL patients, respectively, into “good survival” or “bad survival” by the 2-means clustering method. The resulting two subgroups for each leukemia subtype (AML and ALL) showed the most significant difference with respect to survival, and from these subgroups, we also obtained two optimal classification models: one contained 20 methylation signatures for the AML subtype, and one contained 23 methylation signatures for the ALL subtype (see the methods section). These two classifiers were then used to classify the 55 AML patients and the 34 ALL patients in the validation cohort. Individual patient survival data were plotted using a Kaplan–Meier curve (Fig. 5). A similar result was also observed in the whole cohort (Fig. S1). These methylation signatures were able to predict highly significant differences in the survival of patients with ALL and AML.
Discussion
Tumor-specific methylation patterns have been widely studied for their potential in cancer diagnosis and prognosis.21,22,23 Due to the high cost of whole-methylome sequencing, targeted specific methylation positions have been more commonly surveyed in tumor methylation marker discovery screening. For example, our previous work on hepatocellular carcinoma utilized a 401 padlock probe set and found ten CpG markers for diagnosis and eight CpG markers for prognosis.16 In this study, we designed a padlock-based bisulfate sequencing method using data from the TCGA database. We demonstrated that differential methylation of CpG sites was able to distinguish the blood from a particular leukemia type from normal blood with high specificity and sensitivity (Tables 2, 3). We also demonstrated our ability to distinguish histologic subtypes of leukemia (ALL and AML) derived from the same tissue in the bone marrow (Tables 2, 3). Furthermore, we showed that methylation patterns can predict survival in ALL or AML patients and revealed subsets of patients with either a significant positive or negative prognosis. This finding raises the possibility that methylation may help to identify relatively benign or aggressive tumors and may aid in decision-making regarding the selection of more or less aggressive treatment and monitoring. DNA methylation patterns likely represent common pathways of carcinogenesis and may be more reproducibly altered in cancers, potentially allowing more robust diagnosis and prognostication than somatic mutations. Indeed, methylation patterns may capture the biological state of a cell more accurately than histopathology or somatic mutations alone.
Our data have significant implications for improving the diagnostic yield for biopsies from patients whose bone marrow biopsy results are inconclusive, which often occurs due to artificial tissue distortion. These results may further be helpful to identify leukemic subtypes in cases in which the tissue yield or quality is inadequate for histology to make an accurate diagnosis, as histology requires preservation of the tissue architecture.24 In fact, it was often a dilemma with biopsies to balance between specimen yield and quality and discomfort or potential complications such as hemorrhage.25,26 Moreover, bone marrow pathological examinations are often relatively time-consuming, and diagnosis based on morphology can be inconclusive or inconsistent depending on the personal experience of pathologists. In contrast, DNA methylation analysis requires only a small amount of tissue to obtain adequate DNA, thus potentially allowing the use of lower quality biopsies. The ability to identify histologic subtypes for these cancers within the bone marrow has important implications because different cancers confer different prognoses and require distinct treatment plans; diagnostic failure or uncertainty may lead to less favorable outcomes and survival.
It may not be surprising that DNA methylation patterns have such differentiating abilities in distinguishing between the blood of subtypes of leukemia and normal blood. It is known that many genes involved in the methylation machinery are mutated in leukemia (TET2, TPMT, and DNMT3A),27,28,29,30,31 therefore leading to significant alteration in methylation patterns.
Recently, a number of prognostic factors have been proposed for AML and ALL, such as clinical features, immunophenotype, and cytogenetic and molecular characteristics.19,32,33,34 The identification of prognostic factors, an improved stratification of risk groups and survival analyses have made it possible to identify the presence of the disease and evaluate treatment outcomes.35,36 However, the clinical utilities of gene mutation analysis, gene expression profiling, and microRNA analysis remain uncertain at this time. Flow cytometry also provides a direct assessment of surface antigen expression profiles on leukemic cells,37 facilitating the rational and individualized selection of targeted immunotherapy strategies. Several advances in flow cytometry, including the availability of new monoclonal antibodies, improved gating strategies, and multiparameter analytic techniques, have all dramatically improved its utility in the diagnosis and classification of leukemia. However, morphologic and differentiation-based classifications of leukemia are limited by their prognostic value, as well as the available monoclonal antibodies.
In this study, we also applied methylation profiling and machine learning analysis to the survival data of ALL and AML patients. Interestingly, we were able to separate each leukemic type we examined into distinct groups with better or worse survival outcomes. These results also support the idea that methylation patterns may offer a more accurate picture of the biological state of a cancer than histology and IHC alone or even somatic mutation analysis. However, we expect that a combination of all of these methods is most likely to offer the most complete and useful information for treating patients with leukemia. One known prognostic factor is the origin from which progenitor leukemic cancer cells are derived from during hematopoiesis, as leukemic cells from more differentiated progenitors carry a better prognosis. Therefore, it would be interesting to see if leukemic patients with better prognosis/survival based on a methylation signature have the characteristics of a more differentiated disease.
Additionally, the blood can be taken from the patients at any time during the course of therapy, which facilitates the use of the methylation profile for dynamic monitoring of the epigenetic changes of leukemic cells instead of repetitive bone marrow biopsies. It also allows for the detection of minimal residual disease and the prediction of the risk of relapse.
In summary, we identified a CpG methylation panel for the diagnosis and prognosis of common leukemia with high sensitivity and specificity. Our results support the potential clinical utility of DNA methylation signatures to distinguish leukemia types and to predict prognosis and outcomes.
Key points
ALL and AML have specific DNA methylation signatures that are associated with cancer-related gene expression regulation.
DNA methylation markers can differentiate AML from ALL.
DNA methylation markers can provide prognosis and survival assessment for AML and ALL patients.
Methods
Patient data
Patient data of the AML training and validation cohorts were obtained from The Cancer Genome Atlas (TCGA). Patient characteristics are summarized in Table 1. Complete clinical, molecular, and histopathological data sets are available at the TCGA website: https://tcga-data.nci.nih.gov/tcga/. Individual institutions that contributed samples coordinated the consent process and obtained informed written consent from each patient in accordance with their respective institutional review boards.
The second independent (Chinese) ALL cohort consisted of patients from Guangzhou Women and Children’s Medical Center, China, and patient characteristics are summarized in Table 1. This project was approved by the IRB of Guangzhou Women and Children’s Medical Center. Informed consent was obtained from all patients. Tumor and normal tissues were obtained as clinically indicated for patient care and were retained for this study with patients’ informed consent.
Data sources
DNA methylation data were obtained from both the TCGA analysis of 485,000 sites generated using the Infinium 450K Methylation Array and the following GSE data set: GSE40279. Methylation profiles for AML cancer types and their corresponding normal blood were analyzed. IDAT format files of the methylation data were generated containing the ratio values of each scanned bead. Using the minfi package from Bioconductor, these data files were converted into a score, referred to as a beta value. Methylation data of the Chinese cohort were obtained by padlock-based bisulfate sequencing of a pancancer marker set and were analyzed as described below.
Generating methylation markers enriched in cancer
We selected 729 previously reported CpG markers that showed differential methylation values in many cancer types when compared to the corresponding normal tissues.18
Classifying samples
For classifying the ALL, AML, and normal blood samples, we applied a supervised learning technique, the “nearest shrunken centroids” procedure of Tibshirani et al.38, which is implemented in the package PAM.39 Specifically, we first mixed the TCGA AML samples, Chinese ALL samples and normal blood samples. Seventy percent of these combined samples were put into the training set, and thirty percent were put into the validation set. We then performed the PAM procedure with 10-fold cross-validation on the training data set and obtained robust classifiers for each AML-normal, ALL-normal, and AML-ALL comparison. These classifiers were then used to classify the validation data. This leave-group-out cross-validation was repeated 20 times.
To predict survival in each leukemia subtype (AML and ALL), we applied a semisupervised method proposed by Bair and Tibshirani.20 Specifically, the patient cohorts were randomly divided into a training set (n = 125 for AML and n = 102 for ALL) and a validation set (n = 55 for AML and n = 34 for ALL). For each CpG site, we fit a univariate Cox proportional hazard regression model with survival outcome and methylation value as predictors using the training data set. These CpG sites were then ranked based on their Cox scores. For a given Cox score cutoff, we obtained a list of CpG sites whose Cox scores exceed the cutoff. Then, we performed 2-means clustering on the training patients and obtained two subgroups for each leukemia subtype. We then conducted log-rank tests on the survival of these two subgroups for each leukemia subtype and applied the nearest shrunken centroids model with cross validation to train a classification model. We examined 100 equally spaced Cox scores between the 90th percentiles of the Cox scores and the maximum of the Cox scores. The optimal Cox score cutoff was chosen such that the resulting two subgroups for each leukemia subtype differed most significantly with respect to survival, and the resulting classification model had the smallest cross validation error. We then used the trained classification models, one for AML and one for ALL, to predict the subgroup to which each patient in the AML and ALL validation sets belonged. The 20 methylation signatures for survival in AML and the 23 methylation signatures for survival in ALL are listed below.
AML: cg01336231, cg01413582, cg01509330, cg02264990, cg02329430, cg02858512, cg03297901, cg03556653, cg04596071, cg05038216, cg06034933, cg08098128, cg13066703, cg17757602, cg18869709, cg19966212, cg20300129, cg23193870, cg23680451, and cg25145765.
ALL: cg01628067, cg03001333, cg04984818, cg05145233, cg05304729, cg05956452, cg06261066, cg09157302, cg14608384, cg15289427, cg15608301, cg15707093, cg16266227, cg18869709, cg19470372, cg19864130, cg20686234, cg21913319, cg24720672, cg24747122, cg24983367, cg26584619, and cg27178401.
In our analysis, we observed four potential types of classification errors.
-
1.
False negative; e.g., ALL blood was identified as normal blood.
-
2.
False positive; e.g., normal blood was identified as ALL or AML blood.
-
3.
Correct sample, incorrect leukemia type; e.g., ALL blood was identified as AML blood.
Tumor DNA extraction
Genomic DNA extraction from normal blood or ALL bone marrow cancer samples was performed with the QIAamp DNA Mini Kit (Qiagen) according to the manufacturer’s recommendations. DNA was stored at −20 °C and analyzed within 1 week of preparation.
Bisulfite conversion of genomic DNA
Up to 1 µg of genomic DNA was converted to bis-DNA using an EZ DNA Methylation-Lightning™ Kit (Zymo Research) according to the manufacturer’s protocol. The resulting bis-DNA had a size distribution of ~200–3000 bp, with a peak around ~500–1000 bp. The efficiency of bisulfite conversion was >99.8%, as verified by deep sequencing of bis-DNA and analyzing the ratio of the C to T conversion of CH (non-CG) dinucleotides.
Determination of DNA methylation levels of the ALL cohort by deep sequencing of bis-DNA captured with molecular-inversion (padlock) probes
A total of 729 CpG markers whose methylation levels significantly differed in any of the comparisons between leukemic and normal tissue were used to design padlock probes for sequencing. Padlock-capture of bis-DNA was based on published techniques and protocols with modifications.17,40,41
Determination of DNA methylation levels by deep sequencing of bis-DNA captured with molecular inversion (padlock) probes
Padlock probes were designed to capture regions containing the CpG markers whose methylation levels significantly differed in comparison between leukemic and normal blood. Padlock-capture of bis-DNA was based on published techniques and protocols with modifications.40,41
Probe design and synthesis
Padlock probes were designed using the ppDesigner software. The average length of the captured region was 100 bp, with the CpG marker located in the central portion of the captured region.
Bis-DNA capture
For this analysis, 100 ng of bisulfite-converted DNA was annealed to padlock probes in 20 µl reactions containing 1× Ampligase buffer (Epicenter). To anneal probes to DNA, 30 s of denaturation at 95 °C was followed by a slow cooling to 55 °C. To fill gaps between annealed arms, the following mixture was added to each reaction: Pfu polymerase (Agilent), 0.5 U of Ampligase (Epicenter) and 250 pmol of each dNTP in 1× Ampligase buffer. After 5 h of incubation at 55 °C, the reactions were denatured for 2 min at 94 °C and snap-cooled on ice. Exonuclease mix (20 U of ExoI and 100 U of ExoIII, both from Epicenter) was added, and single-stranded DNA degradation was carried out at 37 °C for 2 h, followed by enzyme inactivation for 2 min at 94 °C.
Circular products of the above CpG site-specific capture were amplified by PCR with concomitant barcoding of separate samples. Amplification was carried out using primers specific to linker DNA within the padlock probes, one of which contained specific 6 bp barcodes. Both primers contained Illumina next-generation sequencing adapter sequences. PCR of the captured DNA was performed using Phusion Flash Master Mix (Thermo) and a 200 nM final concentration of primers under the following cycle conditions: 10 s @ 98 °C; 8 cycles of 1 s @ 98 °C, 5 s @ 58 °C, and 10 s @ 72 °C; 25 cycles of 1 s @ 98 °C and 15 s @ 72 °C; and 60 s @ 72 °C. PCRs were mixed, and the resulting library was size selected to include effective captures (~230 bp) and exclude “empty” captures (~150 bp) using Agencourt AMPure XP beads (Beckman Coulter). The purity of the libraries was verified by PCR using Illumina flowcell adapter primers (P5 and P7), and the concentrations were determined using the Qubit dsDNA HS assay (Thermo Fisher). Libraries we sequenced using the MiSeq and HiSeq2500 systems (Illumina).
Sequencing data analysis
The sequencing reads were mapped using the software tool bisReadMapper with some modifications. First, UMI were extracted from each sequencing read and appended to read headers within the FASTQ files using a custom script generously provided by D.D. Reads were on-the-fly converted as if all Cs were nonmethylated and mapped to in-silico converted DNA strands of the human genome, also as if all Cs were nonmethylated, using Bowtie 2.42 Original reads were merged and filtered for single UMI, i.e., reads carrying the same UMI were discarded, leaving a single one. Methylation frequencies were extracted for all CpG markers for which padlock probes were designed. Markers with less than 20 reads in any sample were excluded from analysis. This resulted in ~600 CpG markers for which the methylation level was determined with an accuracy of 5% or more.
References
Yeoh, E. J. et al. Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer Cell 1, 133–143 (2002).
Cancer Genome Atlas Research, N. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med. 368, 2059–2074 (2013).
Pui, C. H., Robison, L. L. & Look, A. T. Acute lymphoblastic leukaemia. Lancet 371, 1030–1043 (2008).
Shahjahani, M. et al. Rare cytogenetic abnormalities and alteration of microRNAs in acute myeloid leukemia and response to therapy. Oncol. Rev. 9, 261 (2015).
Jaenisch, R. & Bird, A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet. 33(Suppl), 245–254 (2003).
Vaissiere, T., Sawan, C. & Herceg, Z. Epigenetic interplay between histone modifications and DNA methylation in gene silencing. Mutat. Res. 659, 40–48 (2008).
Baylin, S. B. & Jones, P. A. A decade of exploring the cancer epigenome—biological and translational implications. Nat. Rev. Cancer 11, 726–734 (2011).
Herman, J. G. & Baylin, S. B. Gene silencing in cancer in association with promoter hypermethylation. N. Engl. J. Med. 349, 2042–2054 (2003).
Pleasance, E. D. et al. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 463, 191–196 (2010).
Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905 (2010).
Esteller, M. Cancer epigenomics: DNA methylomes and histone-modification maps. Nat. Rev. Genet. 8, 286–298 (2007).
Koboldt, D. C. et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 22, 568–576 (2012).
Kandoth, C. et al. Mutational landscape and significance across 12 major cancer types. Nature 502, 333–339 (2013).
Lee, E. J., Luo, J., Wilson, J. M. & Shi, H. Analyzing the cancer methylome through targeted bisulfite sequencing. Cancer Lett. 340, 171–178 (2013).
Hao, X. et al. DNA methylation markers for diagnosis and prognosis of common cancers. Proc. Natl Acad. Sci. USA 114, 7414–7419 (2017).
Xu, R. H. et al. Circulating tumour DNA methylation markers for diagnosis and prognosis of hepatocellular carcinoma. Nat. Mater. 16, 1155–1161 (2017).
Deng, J. et al. Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming. Nat. Biotechnol. 27, 353–360 (2009).
Fernandez, A. F. et al. A DNA methylation fingerprint of 1628 human samples. Genome Res. 22, 407–419 (2012).
Mrózek, K. et al. Prognostic significance of the European LeukemiaNet standardized system for reporting cytogenetic and molecular alterations in adults with acute myeloid leukemia. J. Clin. Oncol. 30, 4515–4523 (2012).
Bair, E. & Tibshirani, R. Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol. 2, E108 (2004).
Sandoval, J. et al. A prognostic DNA methylation signature for stage I non-small-cell lung cancer. J. Clin. Oncol. 31, 4140–4147 (2013).
Diaz-Lagares, A. et al. A novel epigenetic signature for early diagnosis in lung cancer. Clin. Cancer Res 22, 3361–3371 (2016).
Gai, W. et al. Liver-specific and colon-specific DNA methylation markers in plasma for investigation of colorectal cancers with or without liver metastases. Clin. Chem. 64, 1239–1249 (2018).
Wilkins, B. S. Pitfalls in bone marrow pathology: avoiding errors in bone marrow trephine biopsy diagnosis. J. Clin. Pathol. 64, 380–386 (2011).
Bain, B. J. Morbidity associated with bone marrow aspiration and trephine biopsy - a review of UK data for 2004. Haematologica 91, 1293–1294 (2006).
Orazi, A. Histopathology in the diagnosis and classification of acute myeloid leukemia, myelodysplastic syndromes, and myelodysplastic/myeloproliferative diseases. Pathobiology 74, 97–114 (2007).
Chou, W. C. et al. TET2 mutation is an unfavorable prognostic factor in acute myeloid leukemia patients with intermediate-risk cytogenetics. Blood 118, 3803–3810 (2011).
Itzykson, R. et al. Impact of TET2 mutations on response rate to azacitidine in myelodysplastic syndromes and low blast count acute myeloid leukemias. Leukemia 25, 1147–1152 (2011).
Shen, Y. et al. Gene mutation patterns and their prognostic impact in a cohort of 1185 patients with acute myeloid leukemia. Blood 118, 5593–5603 (2011).
Grossmann, V. et al. The molecular profile of adult T-cell acute lymphoblastic leukemia: mutations in RUNX1 and DNMT3A are associated with poor prognosis in T-ALL. Genes Chromosomes Cancer 52, 410–422 (2013).
Stanulla, M. et al. Thiopurine methyltransferase (TPMT) genotype and early treatment response to mercaptopurine in childhood acute lymphoblastic leukemia. JAMA 293, 1485–1489 (2005).
Harrison, C. J. Cytogenetics of paediatric and adolescent acute lymphoblastic leukaemia. Br. J. Haematol. 144, 147–156 (2009).
Bhojwani, D. et al. Gene expression signatures predictive of early response and outcome in high-risk childhood acute lymphoblastic leukemia: a Children’s Oncology Group Study. J. Clin. Oncol. 26, 4376–4384 (2008).
Rockova, V. et al. Risk stratification of intermediate-risk acute myeloid leukemia: integrative analysis of a multitude of gene mutation and gene expression markers. Blood 118, 1069–1076 (2011).
Schultz, K. R. et al. Risk-and response-based classification of childhood B-precursor acute lymphoblastic leukemia: a combined analysis of prognostic markers from the Pediatric Oncology Group (POG) and Children’s Cancer Group (CCG). Blood 109, 926–935 (2007).
Santamaría, C. M. et al. Molecular stratification model for prognosis in cytogenetically normal acute myeloid leukemia. Blood 114, 148–152 (2009).
Chevallier, P. et al. Simultaneous study of five candidate target antigens (CD20, CD22, CD33, CD52, HER2) for antibody-based immunotherapy in B-ALL: a monocentric study of 44 cases. Leukemia 23, 806–807 (2009).
Tibshirani, R., Hastie, T., Narasimhan, B. & Chu, G. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. 99, 6567–6572 (2002).
Tibshirani, R., Hastie, T., Narasimhan, B. & Chu, G. Class prediction by nearest shrunken centroids, with applications to DNA microarrays. Stat. Sci. 18, 104–117 (2003).
Porreca, G. J. et al. Multiplex amplification of large sets of human exons. Nat. Methods 4, 931–936 (2007).
Diep, D. et al. Library-free methylation sequencing with bisulfite padlock probes. Nat. Methods 9, 270–272 (2012).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China (Grant 81102248), Science and Technology Plan Projects of Guangdong (Grant 2014A020212695), Natural Science Foundation of Guangdong Province, Major Special Project of Guangzhou Science and Technology and Information Bureau (Grant 122400037), Chinese Academy of Sciences, and Guangzhou Regenerative Medicine and Health Guangdong Laboratory. The results published here are in part based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov/.
Author information
Authors and Affiliations
Contributions
H.J., X.S. and K.Z. designed the research; Z.O., Y.H., M.Y., S.W., G.L., J.Z., R.Z., J.W., Q.Q. and T.P. performed the research; Z.O., X.Z., W.H., L.H., X.G., H.L. and W.W. collected clinical data; L.Z., E.Z., Z.L., G.Z., D.Z., C.Z., I.Z., R.-z.Z., O.L., L.C. and T.S. contributed new reagents/analytic tools; Z.O., H.L., X.C., J.-k.Z. and K.Z. analyzed data and X.C., J.-k.Z., and K.Z. wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jiang, H., Ou, Z., He, Y. et al. DNA methylation markers in the diagnosis and prognosis of common leukemias. Sig Transduct Target Ther 5, 3 (2020). https://doi.org/10.1038/s41392-019-0090-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41392-019-0090-5
- Springer Nature Limited
This article is cited by
-
MethScore as a new comprehensive DNA methylation-based value refining the prognosis in acute myeloid leukemia
Clinical Epigenetics (2024)
-
Dysregulation of plasma circulating microRNAs in all-cause and cause-specific cancers: the Rotterdam Study
Biomarker Research (2024)
-
Epigenetic regulation in hematopoiesis and its implications in the targeted therapy of hematologic malignancies
Signal Transduction and Targeted Therapy (2023)
-
Genome-wide promoter methylation profiling in a cellular model of melanoma progression reveals markers of malignancy and metastasis that predict melanoma survival
Clinical Epigenetics (2022)
-
Cell-free DNA 5-hydroxymethylcytosine is an emerging marker of acute myeloid leukemia
Scientific Reports (2022)