Abstract
Long non-coding RNAs (lncRNAs) represent a novel class of functional RNA molecules with an important emerging role in cancer. To elucidate their potential pathogenetic role in chronic lymphocytic leukemia (CLL), a biologically and clinically heterogeneous neoplasia, we investigated lncRNAs expression in a prospective series of 217 early-stage Binet A CLL patients and 26 different subpopulations of normal B-cells, through a custom annotation pipeline of microarray data. Our study identified a 24-lncRNA-signature specifically deregulated in CLL compared with the normal B-cell counterpart. Importantly, this classifier was validated on an independent data set of CLL samples. Belonging to the lncRNA signature characterizing distinct molecular CLL subgroups, we identified lncRNAs recurrently associated with adverse prognostic markers, such as unmutated IGHV status, CD38 expression, 11q and 17p deletions, and NOTCH1 mutations. In addition, correlation analyses predicted a putative lncRNAs interplay with genes and miRNAs expression. Finally, we generated a 2-lncRNA independent risk model, based on lnc-IRF2-3 and lnc-KIAA1755-4 expression, able to distinguish three different prognostic groups in our series of early-stage patients. Overall, our study provides an important resource for future studies on the functions of lncRNAs in CLL, and contributes to the discovery of novel molecular markers with clinical relevance associated with the disease.
Similar content being viewed by others
Introduction
Chronic lymphocytic leukemia (CLL) is a clinically heterogeneous disease.1, 2 Over the past two decades, several studies have shown a relationship between such heterogeneity and the presence of cellular and molecular markers, such as IGHV mutational status,3 cytogenetic abnormalities and gene mutations.4, 5, 6, 7, 8, 9, 10, 11, 12, 13 In recent years, great attention has been devoted to long non-coding RNA (lncRNA) molecules, a large class of transcripts that represent over half of the mammalian non-coding transcriptome. LncRNAs are involved in many biological processes, such as transcriptional gene regulation, maintenance of genomic integrity, genomic imprinting, cell differentiation or development.14, 15 Their deregulation promotes tumor formation, progression and metastasis in many types of cancer,16 including hematologic malignancies.17, 18 In addition, recent evidence has suggested and/or demonstrated an interplay of the various non-coding species, especially lncRNAs and miRNAs, in cancer development.19 The number of known human lncRNA transcripts is still evolving. To date, approximately 110 000 human lncRNAs are annotated in LNCipedia repository, with many loci generating multiple transcripts.20
The knowledge of the role of lncRNAs in CLL is virtually absent. Garding et al.21 provided insights into the epigenetic mechanisms regulating the expression of the DLEU1 and DLEU2 genes, mapped to the critical region at chromosomal band 13q14.3 which is deleted in over 50% of patients with CLL.9, 22 DLEU2 hosts the miR-15a/16-1 cluster, known for its crucial role in CLL pathogenesis.23 The only additional evidence of the involvement of lncRNAs in CLL suggest a role for NEAT1 and lincRNA-p21 as novel elements of the p53-dependent DNA damage response machinery.24
Herein, we investigated the lncRNA expression profiles in a large prospective series of Binet A CLL patients and in distinct populations of normal B-cells, isolated from tonsils and peripheral blood B-lymphocytes. Additionally, we searched for a putative significant correlation between lncRNAs and miRNAs at expression levels. Finally, lncRNA expression patterns were correlated with prognosis and prognostic markers to outline their possible role in CLL outcome.
Patients and methods
Patients
The study included 217 newly diagnosed CLL patients prospectively enrolled from several Italian institutions in an observational multicenter study (clinicaltrial.gov #NCT00917540), as previously described.25
Peripheral blood mononuclear cells from CLL patients and healthy donors, and B-cell populations from tonsils (i.e. naïve B-cells (N), marginal zone (MZ)-like, germinal center (GC), switched memory (M) B-cells), tonsillar and bone marrow plasma cells were obtained as previously described.25 Highly enriched CD19+ B-cells were characterized for IGHV mutational status, CD38 and ZAP70 expressions, NOTCH1 mutations, and cytogenetic alterations, that is, deletion of chromosome 13q (del13q), 11q (del11q), 17p (del17p) and trisomy chromosome 12 (12+), as previously described.12, 26, 27, 28, 29
Transcriptional analyses
The expression profiles of the 217 samples was generated using GeneChip Gene 1.0 ST Array (Affymetrix Inc., Santa Clara, CA, USA) as previously described,30 and data have been deposited in the NCBI Gene Expression Omnibus database (GSE46261). To detect lncRNA expression, we applied a custom pipeline able to re-annotate the probes included in the Gene 1.0 ST expression array according to the LNCipedia-v3.1 annotation database. MiRNA profiling of the 217 samples was generated using Agilent Human miRNA microarray (G4470B; Agilent Technologies, Santa Clara, CA, USA).31
Principal component analysis was performed by singular value decomposition of the considered data expression matrix using the prcomp function in the stats package, and the results visualized using the plot3d function in the rgl package for R software. Supervised analyses were performed using the Significant Analysis of Microarrays (SAM) software version 5.0032 using the web application provided in the shiny package for R software (https://github.com/MikeJSeo/SAM). The differentially expressed lncRNAs discriminating CLL and normal B-cell counterpart were identified using Prediction Analysis of Microarrays as previously described.33 Hierarchical agglomerative clustering of patients was performed adopting Pearson's correlation and average as distance and linkage methods, respectively. Functional characterization of lncRNAs was performed using the GSEA software v.2.2.134 and the gene sets from Hallmarks, Kegg and Reactome collections. Ranking of coding genes from genes sets used the lncRNAs expression as a continuous phenotype and the Pearson’s correlation as metric. Gene sets were considered significant if the false discovery rate was ⩽0.25 under 1000 permutations.
Statistical analysis
The Cox proportional-hazards model in the Globaltest package for R was used to test the association between lncRNA expression levels, assumed as continuous variables, and progression-free survival (PFS) in terms of time-to-first-treatment as clinical outcome. Globaltest was run with 100 000 permutations on lncRNAs that varied mostly over the data set, namely those whose standard deviation of the expression exceeded the median plus one median absolute deviation of all the standard deviations. The list of significantly associated lncRNA to the PFS was reduced to derive a prognostic lncRNA signature: briefly, k-means clustering was applied to each lncRNA to define high- and low-expressing samples and then test the clinical outcome of the two groups. The survival distributions identified by this approach were tested using Kaplan–Meier estimator and log-rank test, and P-values were calculated according to the standard normal asymptotic distribution (survdiff function of the survival R package). The Benjamini–Hochberg method was applied for multiple testing corrections. Independence between lncRNA models and common CLL prognostic factors (IGHV mutational status, CD38, ZAP70, NOTCH mutation and unfavorable cytogenetic aberrations, namely del11q, del17p and 12+) was assessed using the multivariate Cox proportional-hazards regression procedure by the coxph function in the survival R package. Kendall Tau correlations and Wilcoxon rank-sum tests were applied using standard functions in R base package.
miRNA targets custom predictions were obtained for lncRNA sequences using RNA-22 version 2.0 prediction algorithm,35 which allows customizing input sequences and parameters (https://cm.jefferson.edu/rna22/Interactive/). The RNA-22 perl script was run on the combination of the 286 miRNA and 395 lncRNA FASTA sequences obtained from the correlation analysis (P<0.05 after multiple testing adjustment). The default sensitivity/specificity ratio of 1.032 was chosen, and provided a minimum seed size of seven bases with only one mismatch exception, a minimum number of 12 paired-up bases in heteroduplex with −12 kcal/mol maximum folding energy, and no more than one G-U wobble in the seed region.
Results
LncRNA expression profiles in CLL and in distinct subgroups of normal B-cells
The lncRNAs expression profiles of leukemic cells from a cohort of 217 patients with early stage, Binet A CLL were investigated (Supplementary Table S1)25 and compared with those of different types of normal B-cells. These samples included six peripheral B-cell (pBC), four GC, three N, two MZ, three M, four tonsillar plasma cells and four bone marrow plasma cells.
To detect lncRNA expression, we applied a custom pipeline able to re-annotate the probes included in the Gene 1.0 ST expression array according to the LNCipedia annotation database. Such a strategy allowed us to investigate the expression levels of 1852 well-annotated and specific human lncRNAs, as previously reported.36
Firstly, we searched for the B-cell subset sharing lncRNA expression similarities with CLL cells. Clustering analysis of the 26 normal B-cell subtypes was run on the 141 lncRNAs varying at least 1.5-fold in expression levels from the mean across the data set. Distinct lncRNAs profiles were associated with tonsillar plasma cells, bone marrow plasma cells, pBCs and GC cells, whereas M, MZ and N B-cells presented undistinguishable expression profiles and were considered as part of a single cluster (Figure 1a). Among these five B-cells clusters, multiclass SAM analysis identified 226 differentially expressed lncRNAs (Supplementary Figure S1) that were subsequently used to compare normal B-cell subsets with CLL samples. The principal component analysis analysis revealed that CLL were more similar to tonsillar MZ, M and N B-cells than to GC cells, pBCs, tonsillar plasma cells or bone marrow plasma cells (Figure 1b). This finding is similar to that reported by our previous study in the context of miRNA and small nucleolar RNA (snoRNAs) expression in the same CLL data set.25, 31 Given the similarities of the lncRNA profile among CLL cells, tonsillar MZ, M and N B-cells as a whole were chosen as the 'comparator' B-cells to unravel lncRNA expression alterations in CLL. The classifier was first trained on an unbiased randomly selected cohort of 109 CLL cases (training set), and then applied to a validation set including the 108 remaining samples (Supplementary Table S1). The minimum error corresponded to 24 discriminant lncRNAs expression profiles between CLLs and comparator B-cells, leading to a good classification accuracy when applied to the validation patient set (overall: 99.14%; specificity: 100%; sensitivity: 99.07%) (Figure 1c). Specifically, the 14 lncRNAs that were downregulated in CLL included lnc-TOMM7-1 (showing the lowest expression ratio CLL/comparator) and 8 lncRNAs that are located in chromosomal regions, such as 14q32, 2p and 22q, coding for the highly variable portions of the immunoglobulin genes or for IGHV pseudogenes. The lncRNAs classifier included 10 lncRNAs that were upregulated in CLL samples, among which lnc-SNX29P2-3 and lnc-SEL1L3-6 (see below).
Importantly, the 24-lncRNA classifier signature was validated on a publicly available independent cohort including 9 CLL and 10 normal samples, profiled on the same array.37 Specifically, we verified the predictive power of the 24-lncRNA model using leave-one-out and linear discriminant analysis as the cross-validation procedure and prediction method, respectively, and demonstrated its capability to discriminate robustly CLL from normal B-cell samples (overall accuracy: 100%, Supplementary Figure S2).
LncRNA expression in different CLL prognostic subgroups
CLL patients can be stratified into different prognostic groups based upon the presence/absence of cellular, molecular or cytogenetic markers. Here, we aimed at identifying specific lncRNAs signatures associated with each of these CLL subgroups (specified in Supplementary Table S1). We compared lncRNA expression of IGHV mutated (M-CLL, n=131) vs unmutated (UM-CLL, n=85) patients. The analysis identified 30 differentially expressed lncRNAs (Supplementary Table S2). To reduce biases possibly due to group heterogeneity (i.e., the distribution of cytogenetic alterations within the two IGHV classes), we restricted the analysis to a homogeneous CLL subgroup with del13q as common and sole abnormality, including 31 UM and 73 M-CLLs. This approach led to the identification of 12 differentially expressed lncRNAs (Table 1), 10 of which were already detected in the comparison of unselected M- and U-CLL (Supplementary Table S2). In particular, we identified an upregulation of the lnc-IRF2-3 mapping at 4q35 and lnc-AC004696.1-1, which is located antisense, head to head, to ZNF667 gene at 19q13.
Overall, 12 and 20 lncRNAs distinguished CLL stratified based on ZAP70 or CD38 expression levels, respectively (Supplementary Tables S3 and S4). Among them, five lncRNAs (lnc-IRF2-3, lnc-AC004696.1-1, lnc-TNFRSF13B-5, lnc-C1orf132-1 and lnc-BACH1-1) were common to those identified in the above analysis (Supplementary Figure S3), which is in line with the notion that CD38 and ZAP70 represent in part surrogate markers of IGHV mutational status.
To identify lncRNA expression patterns characterizing the major cytogenetic aberrations, all the FISH 'negative' CLLs were considered as normal counterpart (78 patients). In addition, when del13q occurred with another cytogenetic abnormality, the latter was considered predominant for group inclusion. The results of each analysis are reported in Supplementary Tables S5–S8, while the most significant lncRNAs (top 10 score) are reported in Table 1. In detail, del13q patients showed an upregulation of lnc-SEL1L3-1, lnc-AMZ1-6, lnc-DTNB-2 and lnc-SNX29P2-3 (Supplementary Table S5), which were also downregulated in the comparator normal B-cells with respect to CLL samples (Figure 1c), and the downregulation of lnc-SPRYD7-1, also known as DLEU2. In addition, CLLs with del13q showed the downregulation of lnc-LIPG-3. This lncRNA has three different transcripts, including the independent transcriptional unit SCARNA17 and the non-coding RNA SNHG22, in the intron of which the SCARNA17 is located. Lnc-LIPG-3 is also downregulated in CLL samples with del11q and 12+, both of which showed an upregulation of lnc-AC004696.1-1. Only two lncRNAs were differentially expressed in patients with del17p. In particular, we found the downregulation of lnc-LTBP3-2 located at 11q13 antisense to both NEAT1 and MALAT1, the latter being a well-known putative oncogenic lncRNA found significantly correlated with lnc-LTBP3-2 expression in our CLL data set (R=0.54, P<2.2e-16).
Finally, we investigated the expression of lncRNAs specifically associated with the presence of NOTCH1 mutations (Supplementary Table S9). Again, among the seven lncRNAs differentially expressed, NOTCH1 mutated CLLs showed an upregulation of lnc-IRF2-3, lnc-AC004696.1-1 and lnc-BACH1-1, and a downregulation of lnc-C1orf132-1, all of which were found to be deregulated in CLL subgroups with adverse prognostic factors.
LncRNAs interplay with genes and miRNAs in CLL
To gain evidence of lncRNAs that might influence gene expression, we evaluated the correlation between the expression level of the 1852 lncRNAs and that of the 17 788 coding transcripts unambiguously detected by the arrays. To be confident of predicting lncRNA–gene relationship, we focused on an lncRNAs/mRNAs ratio with a correlation coefficient >0.9 (Table 2), and found three distinct lncRNAs: (i) Lnc-AC004696.1-1 (see above) which positively correlated with its antisense overlapping gene ZNF667, suggesting a potential in cis regulation; (ii) lnc-NDST3-1, downregulated in del13q and del11q CLLs (Table 1), which positively correlated with the expression of RPL18A; and (iii) lnc-cYorf15A.1-2 that highly correlated with the expression of four genes located in sense or antisense orientation on chromosome Y about 7 Mb apart, and with two other genes, RPS4Y1 and ZFY, located on the short arm of chromosome Y. Since lnc-cYorf15A.1-2 and its correlated genes are expressed by males only, the correlation coefficients were recalculated in male patients and found to be below 0.8 for all of the six lncRNA–gene couples.
The relationship between lncRNAs and miRNAs also was studied in all patients of the cohort.31 To test the transcriptional relationships, the correlations between the expression levels of any lncRNA and miRNA were computed first. We identified 861 lncRNA-miRNA couples significantly anti-correlated in our database (q-value<0.01). Then, one of the most common target prediction algorithms (RNA-22) was run on each significantly correlated pair, using the miRNA sequences annotated on miRbase v20 and the LNCipedia lncRNA sequences corresponding to the fragments investigated by the probes on the array. Hence, we identified 11 lncRNA-miRNA anti-correlated couples supported by miRNA target prediction (q-value<0.05; Table 3). Notably, we found the couples miR-331-3p/lnc-LIPG-3 (lnc-LIPG-3 downregulated in samples with del13q, del11q or 12+; Table 1), and miR-574-3p/lnc-LINS-1 (miR-574-3p reported as upregulated in UM-CLL patients).31
Identification of an lncRNA transcriptional profile with clinical relevance
Since the cohort analyzed was recruited in a perspective study on Binet A patients, we could investigate correlations between lncRNA profiles and PFS. Clinical data were available for 209/217 CLL. To this aim, a globaltest was run on the 471 lncRNAs the expression of which varied mostly over the data set. Eight lncRNAs showed significant association with PFS (P<0.001; Supplementary Table S10) and five had a significant predictive value for PFS in univariate analysis (Table 4). The expression levels of lnc-IRF2-3, lnc-AC004696.1-1, lnc-C1orf132-1 and KIAA1755-4 were validated by qRT-PCR, showing a good concordance (Supplementary Figure S4 and Supplementary Table S11). Notably, lnc-KIAA1755-4 corresponds to the SNORA71A that is processed from the intronic region of the non-coding RNA SNHG17 (Supplementary Figure S4d).
Next, our analysis focused on lnc-IRF2-3 that had the highest predictive value for PFS by univariate test and was highly expressed in UM-CLLs. Interestingly, the highest lnc-IRF2-3 expression level was found in the subset of UM-CLL patients with the shortest PFS (Figure 2a). However, its predictive power lost strength (P=0.051) after adjustment for covariate confounders, particularly CD38, likely because high expression levels of CD38 and lnc-IRF2-3 define largely overlapping CLL subgroups (Supplementary Figure S3). In order to improve the robustness of the lncRNAs as PFS predictors, lnc-IRF2-3 was combined with a second lncRNA from the group with the significant predictive value (Table 4). We evaluated each of these lncRNAs in combination with lnc-IRF2-3, based on the following scheme defining groups as (i) high/high, (ii) low/low or (iii) discordant expression levels. Among all the possible combinations tested, the couple lnc-IRF2-3 and lnc-KIAA1755-4 determined the best predictive model. Specifically, a better PFS corresponded to patients with a concomitant low expression of both lncRNAs (a 'low-risk' group, including 122 patients, 56%), whereas a worse PFS was associated with their concomitant high expression ('high-risk' group, 12 patients; Figure 2b). The 'intermediate-risk' group (75 patients) was characterized by the discordant expression of the two lncRNAs. The high-risk group had a hazard ratio of 8.05 (95% CI: 3.82–16.96; median PFS: 862 days), and the intermediate-risk group of 2.27 (95% CI: 1.39–3.69; median PFS: 1678 days), compared with the low-risk group (median PFS not reached). Finally, multivariate regression analysis confirmed the independence of the 2-lncRNA risk model from other known predictive factors in CLL (IGHV mutational status, CD38 and ZAP70 expression, NOTCH mutation and unfavorable chromosomal aberration as covariates; Table 5). Likewise, the 2-lncRNA risk model resulted informative when compared with the recently defined progression-risk score (PRS) based on the integration of clinical, laboratory and biological parameters independently associated with PFS38 (Table 5).
To gain insight into their possible role in CLL pathophysiology, we evaluated in silico whether the modulation of lnc-IRF2-3 or lnc-KIAA1755-4 might correspond to transcriptional signatures possibly associated with functional categories. Specifically, we ranked genes according to decreasing Pearson’s correlation with the expression level of lnc-IRF2-3 or lnc-KIAA1755-4 in the 217 samples using the gene set enrichment analysis software to identify a priori-defined sets of genes showing concordant modulation. We found lnc-IRF2-3 associated with 18 gene sets (1 from Kegg and 17 from Hallmarks data sets; Supplementary Table S12; representative examples in Figure 3a), virtually all related to the metabolism of amino acids, sugars and lipids. Interestingly, lnc-IRF2-3 resulted associated with the gene set linked to primary immunodeficiency, and with genes encoding proteins over-represented on the apical surface of epithelial cells (Figure 3a). Concerning lnc-KIAA1755-4, we identified 30 gene sets (6 from Kegg and 24 from Reactome data sets; Figure 3b and Supplementary Table S13), 10 of which including genes mainly encoding for ribosomal proteins, and thus principally associated with ribosome formation and translational processes. Notably, 12 gene sets highly enriched in genes encoding for histone components are associated with transcriptional processes, chromosomes and telomeres maintenance, and telomeres packaging.
Discussion
This study provided a comprehensive analysis of the transcriptional profile of lncRNAs in a large cohort of early-stage CLL patients.
First, we unraveled a 24-lncRNA signature specifically deregulated in CLL compared with a normal B-cell counterpart. The identification of the normal counterpart is currently a debated issue.37, 39 Although based on a limited number of normal samples, our analysis revealed that the lncRNA expression profile of leukemic cells is more similar to N, MZ and M B-cells than to GC cells, total pBCs or PCs, as previously observed in our previous studies for miRNA and snoRNA expression patterns.25, 31 Therefore, lncRNAs that significantly discriminate CLL cells and N, MZ and M B-cells, may be of relevance in disease pathogenesis. Importantly, the 24-lncRNA model was validated on an independent data set. Among the 24 lncRNAs, we highlighted the downregulation of lnc-TOMM7-1, mapped to chromosome 7p antisense to the interleukin-6 (IL6) gene, which promotes B-cell lineage proliferation and differentiation. Lnc-TOMM7-1 may participate in IL6 transcriptional regulation and therefore may have a pathogenic role, given the potential function of IL6 as an autocrine growth factor in CLL.40 The lncRNAs classifier also included lnc-SNX29P2-3 and lnc-SEL1L3-6 which were upregulated in CLL samples. This finding, contradictory to that found by us in PC dyscrasia where the two lncRNAs are downregulated in pathological samples,36 suggests that the role of lncRNA likely depends on the cellular context.
Despite the homogeneity of the lncRNA expression profile, possibly enhanced by testing samples of patients uniformly at their early disease stages, specific lncRNA signatures were detected in subgroups of CLL stratified according to cellular, molecular and cytogenetic markers. As expected, some lncRNAs were recurrently associated with the presence of adverse prognostic markers. Among them, it is worth mentioning lnc-AC004696.1-1, also known as ZNF667-AS1, the highest expression of which was found in UM-CLLs as also showed by previous studies,41 and was also negatively associated with a short PFS. Interestingly, two recent studies have reported epigenetic silencing of this lncRNA during the immortalization of human mammary epithelial cells as well as in a panel of cancer types including acute myeloid leukemia, diffuse large B-cell lymphoma42 and solid tumors.43 These findings seem contradictory to the situation that we and others41 have found in CLL with adverse prognosis for which high lnc-AC004696.1-1 expression was compatible with a transformed and potentially aggressive condition, again supporting the notion that the role played by lncRNA likely depends on the cellular context. Notably, in our CLL database, lnc-AC004696.1-1 expression highly correlated with that of the ZNF667 gene (Table 2), with which it shares (head to head) a CpG island. A co-regulatory mechanism of the two transcripts needs to be investigated, but it is of interest the evidence that ZNF667 inhibits the expression of the anti-apoptotic gene BAX in rats.44
Albeit it is feasible that lncRNAs are engaged in miRNA mediated interactions,19 their contribution is yet poorly explored in general, and in CLL in particular. To provide insights into this aspect, we studied the expression correlation of lncRNAs and miRNAs in paired samples, with the support of target prediction analysis. Among the 11 pairs reported in Table 3, particularly relevant appear miR-574-3p/lnc-LINS-1 and miR-331-3p/lnc-LIPG-3. MiR-574-3p is upregulated in UM-CLL and is associated with a major risk of disease progression in the same cohort of CLL.31 Considering that miR-574-3p has been described as a tumor suppressor miRNA in different types of solid tumors,45 we can hypothesize that in CLL, lnc-LINS-1 may play a fine-tuning role in the regulatory circuitry including miR-574-3p and its target genes. Considering the other miRNA/lncRNA pair, lnc-LIPG-3 expression was downregulated in samples with del13q, del11q or 12+ (Table 1). While existing data describe miR-331-3p as one of the most expressed miRNA in CLL as compared with normal peripheral CD19+ B-cells,46 our results do not highlight such a great difference with the normal comparators (Supplementary Figure S5) and is in all likelihood due to the different B-cell subpopulation chosen as normal controls. Notably, a network between miR-331-3p and lncRNAs has been already described in gastric cancer, where miR-331-3p has been proven to target the HOTAIR lncRNA that, in turn, functions as a competing endogenous RNA (ceRNA) to regulate HER2 expression by sponging miR-331-3p.47
Finally, we have proposed a 2-lncRNA risk model, based on the expression of lnc-IRF2-3 and lnc-KIAA1755-4, able to stratify our series of early-stage Binet A CLL patients into three different prognostic groups. The model identifies a very high-risk group characterized by the concomitant high expression of both lncRNAs. Notably, the model is independent of the common prognostic markers and of a recently defined progression-risk score.38 Information on lnc-IRF2-3 is still very limited. Ferreira et al.41 reported its high expression in prognostically poor UM-patients (Supplementary Table S2). Conversely, lnc-IRF2-3 is progressively downregulated trough the more aggressive stages of PC dyscrasia.36 Such a discrepancy maybe due to the cellular context as previously hypothesized for lnc-SNX29P2-3 and lnc-SEL1L3-6. Our gene set enrichment analysis results pointed out that in CLL lnc-IRF2-3 expression is associated with 18 gene sets the majority of which (15) are related to the metabolism of amino acids, sugars and lipids. Among the remaining ones, it is of note the gene set linked to primary immunodeficiency given that it includes genes related to CLL biology such as Zap70, CD19, BTK and CD79A genes. In addition, we found a specific association between lnc-IRF2-3 expression and the gene set that included genes related the apical surface of epithelial cells, such as those important for determining cell polarity. Overall these data prompt to further investigation of the functional role of lnc-IRF2-3 in the biology and the progression of the disease. Lnc-KIAA1755-4 is processed from the intronic region of the non-coding RNA SNHG17 and actually corresponds to SNORA71A, thus belonging to a class of molecules essentially localized in the nucleolus where they function as guide RNAs for the post-transcriptional modification of ribosomal RNAs. In particular, SNORA71A is predicted to guide the pseudouridylation of U406 in 18S rRNA.48 The potential clinical relevance of this type of ncRNA family in cancer and CLL have been previously reported by us and others.25, 49 Functional analysis of genes whose expression might be related to lnc-KIAA1755-4 highlighted enrichments in genes encoding for histone components, genes associated with transcriptional processes, chromosome and telomere maintenance, and telomeres packaging, all of which are biological processes frequently affected during cellular immortalization and tumor progression.
Overall, our findings offer a portrait of the lncRNA transcriptional landscape in CLL, ultimately providing insights into the biological mechanisms involving the non-coding fraction of the transcriptome and contributing to suggest novel putative molecular markers associated with high-risk course of the disease.
References
Chiorazzi N, Rai KR, Ferrarini M . Chronic lymphocytic leukemia. N Engl J Med 2005; 352: 804–815.
Zwiebel JA, Cheson BD . Chronic lymphocytic leukemia: staging and prognostic factors. Semin Oncol 1998; 25: 42–59.
Kay NE, Shanafelt TD . Prognostic factors in chronic lymphocytic leukemia. Curr Hematol Malig Rep 2007; 2: 49–55.
Crespo M, Bosch F, Villamor N, Bellosillo B, Colomer D, Rozman M et al. ZAP-70 expression as a surrogate for immunoglobulin-variable-region mutations in chronic lymphocytic leukemia. N Engl J Med 2003; 348: 1764–1775.
Damle RN, Wasil T, Fais F, Ghiotto F, Valetto A, Allen SL et al. Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. Blood 1999; 94: 1840–1847.
Hamblin TJ, Davis Z, Gardiner A, Oscier DG, Stevenson FK . Unmutated Ig V(H) genes are associated with a more aggressive form of chronic lymphocytic leukemia. Blood 1999; 94: 1848–1854.
Byrd JC, Smith L, Hackbarth ML, Flinn IW, Young D, Proffitt JH et al. Interphase cytogenetic abnormalities in chronic lymphocytic leukemia may predict response to rituximab. Cancer Res 2003; 63: 36–38.
Dohner H, Fischer K, Bentz M, Hansen K, Benner A, Cabot G et al. p53 gene deletion predicts for poor survival and non-response to therapy with purine analogs in chronic B-cell leukemias. Blood 1995; 85: 1580–1589.
Dohner H, Stilgenbauer S, Benner A, Leupolt E, Krober A, Bullinger L et al. Genomic aberrations and survival in chronic lymphocytic leukemia. N Engl J Med 2000; 343: 1910–1916.
Haferlach C, Dicker F, Schnittger S, Kern W, Haferlach T . Comprehensive genetic characterization of CLL: a study on 506 cases analysed with chromosome banding analysis, interphase FISH, IgV(H) status and immunophenotyping. Leukemia 2007; 21: 2442–2451.
Stilgenbauer S, Bullinger L, Lichter P, Dohner H . Genetics of chronic lymphocytic leukemia: genomic aberrations and V(H) gene mutation status in pathogenesis and clinical course. Leukemia 2002; 16: 993–1007.
Lionetti M, Fabris S, Cutrona G, Agnelli L, Ciardullo C, Matis S et al. High-throughput sequencing for the identification of NOTCH1 mutations in early stage chronic lymphocytic leukaemia: biological and clinical implications. Br J Haematol 2014; 165: 629–639.
Rossi D, Rasi S, Fabbri G, Spina V, Fangazio M, Forconi F et al. Mutations of NOTCH1 are an independent predictor of survival in chronic lymphocytic leukemia. Blood 2012; 119: 521–529.
Cech TR, Steitz JA . The noncoding RNA revolution-trashing old rules to forge new ones. Cell 2014; 157: 77–94.
Nobili L, Lionetti M, Neri A . Long non-coding RNAs in normal and malignant hematopoiesis. Oncotarget 2016, e-pub ahead of print 11 May 2016 doi:10.18632/oncotarget.9308.
Brunner AL, Beck AH, Edris B, Sweeney RT, Zhu SX, Li R et al. Transcriptional profiling of long non-coding RNAs and novel transcribed regions across a diverse panel of archived human cancers. Genome Biol 2012; 13: 75–87.
Ling H, Vincent K, Pichler M, Fodde R, Berindan-Neagoe I, Slack FJ et al. Junk DNA and the long non-coding RNA twist in cancer genetics. Oncogene 2015; 34: 5003–5011.
Yang G, Lu X, Yuan L . LncRNA: a link between RNA and cancer. Biochim Biophys Acta 2014; 1839: 1097–1109.
Liz J, Esteller M . lncRNAs and microRNAs with a role in cancer development. Biochim Biophys Acta 2006; 1859: 169–176.
Volders PJ, Verheggen K, Menschaert G, Vandepoele K, Martens L, Vandesompele J et al. An update on LNCipedia: a database for annotated human lncRNA sequences. Nucleic Acids Res 2015; 43: D174–D180.
Garding A, Bhattacharya N, Claus R, Ruppel M, Tschuch C, Filarsky K et al. Epigenetic upregulation of lncRNAs at 13q14.3 in leukemia is linked to the In Cis downregulation of a gene cluster that targets NF-kB. PLoS Genet 2013; 9: e1003373.
Gaidano G, Foa R, la-Favera R . Molecular pathogenesis of chronic lymphocytic leukemia. J Clin Invest 2012; 122: 3432–3438.
Cimmino A, Calin GA, Fabbri M, Iorio MV, Ferracin M, Shimizu M et al. miR-15 and miR-16 induce apoptosis by targeting BCL2. Proc Natl Acad Sci USA 2005; 102: 13944–13949.
Blume CJ, Hotz-Wagenblatt A, Hullein J, Sellner L, Jethwa A, Stolz T et al. p53-dependent non-coding RNA networks in chronic lymphocytic leukemia. Leukemia 2015; 29: 2015–2023.
Ronchetti D, Mosca L, Cutrona G, Tuana G, Gentile M, Fabris S et al. Small nucleolar RNAs as new biomarkers in chronic lymphocytic leukemia. BMC Med Genomics 2013; 6: 27–37.
Cutrona G, Colombo M, Matis S, Reverberi D, Dono M, Tarantino V et al. B lymphocytes in humans express ZAP-70 when activated in vivo. Eur J Immunol 2006; 36: 558–569.
Cutrona G, Colombo M, Matis S, Fabbi M, Spriano M, Callea V et al. Clonal heterogeneity in chronic lymphocytic leukemia cells: superior response to surface IgM cross-linking in CD38, ZAP-70-positive cells. Haematologica 2008; 93: 413–422.
Fabris S, Mosca L, Todoerti K, Cutrona G, Lionetti M, Intini D et al. Molecular and transcriptional characterization of 17p loss in B-cell chronic lymphocytic leukemia. Genes Chromosomes Cancer 2008; 47: 781–793.
Morabito F, Cutrona G, Gentile M, Matis S, Todoerti K, Colombo M et al. Definition of progression risk based on combinations of cellular and molecular markers in patients with Binet stage A chronic lymphocytic leukaemia. Br J Haematol 2009; 146: 44–53.
Todoerti K, Agnelli L, Fabris S, Lionetti M, Tuana G, Mosca L et al. Transcriptional characterization of a prospective series of primary plasma cell leukemia revealed signatures associated with tumor progression and poorer outcome. Clin Cancer Res 2013; 19: 3247–3258.
Negrini M, Cutrona G, Bassi C, Fabris S, Zagatti B, Colombo M et al. microRNAome expression in chronic lymphocytic leukemia: comparison with normal B-cell subsets and correlations with prognostic and clinical parameters. Clin Cancer Res 2014; 20: 4141–4153.
Tusher VG, Tibshirani R, Chu G . Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 2001; 98: 5116–5121.
Agnelli L, Bicciato S, Mattioli M, Fabris S, Intini D, Verdelli D et al. Molecular classification of multiple myeloma: a distinct transcriptional profile characterizes patients expressing CCND1 and negative for 14q32 translocations. J Clin Oncol 2005; 23: 7296–7306.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 2005; 102: 15545–15550.
Miranda KC, Huynh T, Tay Y, Ang YS, Tam WL, Thomson AM et al. A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell 2006; 126: 1203–1217.
Ronchetti D, Agnelli L, Taiana E, Galletti S, Manzoni M, Todoerti K et al. Distinct lncRNA transcriptional fingerprints characterize progressive stages of multiple myeloma. Oncotarget 2016; 7: 14814–14830.
Seifert M, Sellmann L, Bloehdorn J, Wein F, Stilgenbauer S, Durig J et al. Cellular origin and pathophysiology of chronic lymphocytic leukemia. J Exp Med 2012; 209: 2183–2198.
Gentile M, Shanafelt TD, Cutrona G, Molica S, Tripepi G, Alvarez I et al. A progression-risk score to predict treatment-free survival for early stage chronic lymphocytic leukemia patients. Leukemia 2015; 30: 1440–1443.
Chiorazzi N, Ferrarini M . Cellular origin(s) of chronic lymphocytic leukemia: cautionary notes and additional considerations and possibilities. Blood 2011; 117: 1781–1791.
Biondi A, Rossi V, Bassan R, Barbui T, Bettoni S, Sironi M et al. Constitutive expression of the interleukin-6 gene in chronic lymphocytic leukemia. Blood 1989; 73: 1279–1284.
Ferreira PG, Jares P, Rico D, Gomez-Lopez G, Martinez-Trillos A, Villamor N et al. Transcriptome characterization by RNA sequencing identifies a major molecular and clinical subdivision in chronic lymphocytic leukemia. Genome Res 2014; 24: 212–226.
Vrba L, Garbe JC, Stampfer MR, Futscher BW . A lincRNA connected to cell mortality and epigenetically-silenced in most common human cancers. Epigenetics 2015; 10: 1074–1083.
Wang ZL, Li B, Piccolo SR, Zhang XQ, Li JH, Zhou H et al. Integrative analysis reveals clinical phenotypes and oncogenic potentials of long non-coding RNAs across 15 cancer types. Oncotarget 2016, e-pub ahead of print 27 April 2016 doi:10.18632/oncotarget.9037.
Jiang L, Wang H, Shi C, Liu K, Liu M, Wang N et al. ZNF667/Mipu1 is a novel anti-apoptotic factor that directly regulates the expression of the rat Bax gene in H9c2 cells. PLoS One 2014; 9: e111653.
Krishnan P, Ghosh S, Wang B, Li D, Narasimhan A, Berendt R et al. Next generation sequencing profiling identifies miR-574-3p and miR-660-5p as potential novel prognostic markers for breast cancer. BMC Genomics 2015; 16: 735–751.
Zanette DL, Rivadavia F, Molfetta GA, Barbuzano FG, Proto-Siqueira R, Silva-Jr WA et al. miRNA expression profiles in chronic lymphocytic and acute lymphocytic leukemia. Braz J Med Biol Res 2007; 40: 1435–1440.
Liu XH, Sun M, Nie FQ, Ge YB, Zhang EB, Yin DD et al. Lnc RNA HOTAIR functions as a competing endogenous RNA to regulate HER2 expression by sponging miR-331-3p in gastric cancer. Mol Cancer 2014; 13: 92–105.
Ganot P, Bortolin ML, Kiss T . Site-specific pseudouridine formation in preribosomal RNA is guided by small nucleolar RNAs. Cell 1997; 89: 799–809.
Williams GT, Farzaneh F . Are snoRNAs and snoRNA host genes new players in cancer? Nat Rev Cancer 2012; 12: 84–88.
Acknowledgements
This work was supported by grants from Associazione Italiana Ricerca sul Cancro (AIRC) 'Special Program Molecular Clinical Oncology - 5 per mille' n. 9980, 2010/15 (to MF, FM, AN and PT), from AIRC IG16722 to AN, from AIRC n.16695 to FM, from AIRC IG14326 to MF and from Italian Ministry of Health 5x1000 funds to GC. SM was supported by AIRC.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no conflict of interest.
Additional information
Supplementary Information accompanies this paper on Blood Cancer Journal website
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Ronchetti, D., Manzoni, M., Agnelli, L. et al. lncRNA profiling in early-stage chronic lymphocytic leukemia identifies transcriptional fingerprints with relevance in clinical outcome. Blood Cancer Journal 6, e468 (2016). https://doi.org/10.1038/bcj.2016.77
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/bcj.2016.77
- Springer Nature Limited
This article is cited by
-
Advances in Epigenetics and Epigenomics in Chronic Lymphocytic Leukemia
Current Genetic Medicine Reports (2019)
-
Long non-coding RNAs defining major subtypes of B cell precursor acute lymphoblastic leukemia
Journal of Hematology & Oncology (2019)
-
Comprehensive off-target analysis of dCas9-SAM-mediated HIV reactivation via long noncoding RNA and mRNA profiling
BMC Medical Genomics (2018)