Abstract
Background
Tumours with no evidence of neuroendocrine transformation histologically but harbouring neuroendocrine features are collectively referred to as non-small cell lung cancer (NSCLC) with neuroendocrine differentiation (NED). Investigating the mechanisms underlying NED is conducive to designing appropriate treatment options for NSCLC patients.
Methods
In the present study, we integrated multiple lung cancer datasets to identify neuroendocrine features using a one-class logistic regression (OCLR) machine learning algorithm trained on small cell lung cancer (SCLC) cells, a pulmonary neuroendocrine cell type, based on the transcriptome of NSCLC and named the NED index (NEDI). Single-sample gene set enrichment analysis, pathway enrichment analysis, ESTIMATE algorithm analysis, and unsupervised subclass mapping (SubMap) were performed to assess the altered pathways and immune characteristics of lung cancer samples with different NEDI values.
Results
We developed and validated a novel one-class predictor based on the expression values of 13,279 mRNAs to quantitatively evaluate neuroendocrine features in NSCLC. We observed that a higher NEDI correlated with better prognosis in patients with LUAD. In addition, we observed that a higher NEDI was significantly associated with reduced immune cell infiltration and immune effector molecule expression. Furthermore, we found that etoposide-based chemotherapy might be more effective in the treatment of LUAD with high NEDI values. Moreover, we noted that tumours with low NEDI values had better responses to immunotherapy than those with high NEDI values.
Conclusions
Our findings improve the understanding of NED and provide a useful strategy for applying NEDI-based risk stratification to guide decision-making in the treatment of LUAD.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Lung cancer is one of the leading contributors to cancer incidence and mortality worldwide [1].According to pathological characteristics, there are two major histological subtypes of lung cancer, small cell lung cancer (SCLC), the most common form of neuroendocrine lung cancer, and non-small cell lung cancer (NSCLC), which accounts for approximately 85% of lung cancer cases and mainly comprises lung squamous cell carcinoma (LUSC) and lung adenocarcinoma (LUAD) [2]. EGFR tyrosine kinase inhibitors (EGFR-TKIs) have become the standard first-line therapy for treating patients with advanced NSCLC harbouring EGFR activating mutations. However, acquired resistance to targeted therapies is inevitable [3]. Histological transformation of LUAD into an aggressive neuroendocrine derivative resembling SCLC is one of the mechanisms underlying acquired resistance to EGFR-TKIs [4]. Thus, investigating the mechanisms underlying neuroendocrine differentiation (NED) is crucial for providing more appropriate individualized treatment recommendations for patients with NSCLC. It has been recognized that neuroendocrine features can be demonstrated by immunohistochemistry or ultrastructural pathology in 10–30% of lung cancers with ordinary non-small cell morphology [5]. These tumours with no histological evidence of neuroendocrine transformation are collectively referred to as non-small cell carcinomas with NED (NSCLC-NED). Some studies have reported that NSCLC-NED was associated with an increased response to chemotherapy, whereas others have not [6, 7].
Immunohistochemistry (IHC) is the main method used to identify NED [8]. The different sensitivities and specificities of neuroendocrine markers, such as chromogranin A (CgA), neural cell adhesion molecule 1 (CD56), synaptophysin (Syp) and neuron-specific enolase (NSE), may lead to significant variation in reported NSCLC-NED incidence and contradictory results about the prognosis of NSCLC-NED [9]. Transcriptomic technologies, especially RNA sequencing, with advantages of being amenable to standardization and avoiding interpretation bias, are more accurate for diagnosing cancer and provide a reliable estimation of protein marker levels in cancer [10, 11]. Hence, integrative analysis based on the transcriptome may enable quantitative assessment of the degree of NED in NSCLC.
Gene expression data obtained from tumour biopsy samples reflect a superposition of cell subpopulations in samples. The methods developed to deconvolute gene expression data from mixed samples are based on nonnegative matrix factorization or other matrix decomposition techniques, and it is difficult to identify all tumour subtypes in a single optimization [12]. The one-class logistic regression (OCLR) machine learning algorithm is a novel one-class method based on logistic regression, which has been demonstrated to be better than its binary predictor counterparts in identifying specific cell types in heterogeneous cell populations [13]. The OCLR algorithm was applied to obtain independent stemness indices in multiple malignancies and revealed patterns of intratumor molecular heterogeneity [14]. It is worth conducting OCLR in the assessment of NED in NSCLC.
Immunotherapy has become a promising tool for cancer treatment, but its efficacy remains limited in most clinical settings [15]. The link between immunotherapy response and neuroendocrine features of NSCLC is unclear. There is still a lack of reliable biomarkers for predicting immunotherapy benefit in NSCLC. In the present study, we used OCLR to build a new model to quantitatively determine the tendency of NSCLC tumours to undergo NED based on mRNA expression data and named it the NED index (NEDI). This study aimed to develop a classification tool and machine-learning algorithm to analyse a spectrum of neuroendocrine features, predict the efficacy of immunotherapy, and guide decision-making regarding the treatment of NSCLC. Our findings may provide novel insights for improving therapeutic efficacy in NSCLC, especially LUAD.
2 Materials and methods
2.1 Acquisition of lung cancer cohorts and data processing
The present study collected lung cancer data from diverse cohorts from four individual platforms: The Cancer Genome Atlas (TCGA), Clinical Proteomic Tumor Analysis Consortium (CPTAC), Cancer Cell Line Encyclopedia (CCLE) and Gene Expression Omnibus (GEO). We downloaded the preprocessed mRNA expression matrices of 185 lung cancer cell lines (50 SCLC cell lines and 135 NSCLC cell lines) from the CCLE data portal (http://www.broadinstitute.org/ccle) [16] to establish a neuroendocrine features model by applying the OCLR algorithm. The mRNA expression profiles, clinical information, proteomic data and somatic mutation data of all TCGA-LUAD cohorts and TCGA-LUSC cohorts were obtained through the TCGA data portal (https://portal.gdc.cancer.gov/). The expression profile and clinical information of LUAD and LUSC from CPTAC [17, 18] and SCLC data from the U Cologne SCLC cohort [19] were downloaded from cBioPortal (https:// www. cbioportal.org). In addition, we retrieved and obtained lung cancer datasets from the GEO database, including the GSE159857 cohort [20], GSE60644 cohort [21], GSE9074 cohort (Kano M et al.), GSE118131 cohort [22],GSE64322 cohort [23], GSE1307 cohort [24], GSE31625 [25], GSE31210 cohort [26], GSE50081 cohort [27], GSE72094 cohort [28], GSE91061 cohort [29], GSE13213 cohort [30], GSE126044 cohort [31], GSE42127 cohort [32] and GSE135222 [33]. Detailed information about the public datasets used in this study were shown in Table S1. To reduce the systematic error in the different datasets, the gene expression TPM values of RNA-seq were performed log2 transformation using a pseudo-count of 1; log2(TPM + 1) and all microarray data included in this study were log2 transformed and then normalized to [0,1]. The normalization formula was X' = (X-Xmin)/(Xmax-Xmin), where X is the processed microarray value of gene expression [34].
2.2 Cell lines and Western blot analysis
Human lung adenocarcinoma cell lines A549, HCC827 and PC9 were purchased from China Center for Type Culture Collection (CCTCC, Wuhan, China) and human small cell lung cancer cell line H524 was purchased from Procell (Procell, Wuhan, China). HCC827, PC9 and H524 were cultured in RPMI 1640 medium (Gibco, USA) with 10% fetal bovine serum (FBS), 100 U/ml penicillin and 100 U/ml streptomycin. A549 was cultured in DMEM medium (Gibco, USA) with 10% fetal bovine serum (FBS) and penicillin–streptomycin. The whole cell lysates were derived from RIPA lyzed cells and the protein concentrations were determined by the BCA kit (Beyotime). Then, the protein was run on an SDS-PAGE gel and transferred to nitrocellulose. Nitrocellulose membranes were blocked in 5% bovine serum albumin (BSA) and probed with antibodies overnight. Anti-GAPDH, CD56, and Syp were purchased from Cell Signaling Technology; Anti- CgA were purchased from Proteintech. Secondary antibodies conjugated to horseradish peroxidase were followed by enhanced chemiluminescence (Thermo Fisher Scientific, Waltham, Massachusetts, USA).
2.3 Calculation of the NED index (NEDI) for NSCLC
To calculate a NEDI based on mRNA expression, we built a novel model using OCLR trained on lung neuroendocrine cancer from the CCLE data portal using R package “gelnet” [13]. To ensure compatibility between the different cohorts, we first mapped the list of common genes at the intersection of three cohorts (CCLE, TCGA-LUAD and CPTAC-LUAD), and the resulting matrix containing 13,279 mRNAs expression values (log2(TPM + 1)) across all available samples of SCLC and NSCLC cell lines from the CCLE data portal was further analysed. We mean-centred the mRNA expression data and then applied OCLR to only the SCLC samples. When the signature was obtained, it could be applied to score other samples. For mRNA expression data, we calculated Spearman correlations between the model’s weight vector (Table S2) and the new sample’s expression profile (such as NSCLC samples from TCGA or GEO datasets). Then, the Spearman correlations were subsequently mapped to the [0,1] range by using a linear transformation that subtracted the minimum and divided by the maximum and were defined as NEDI.
2.4 Evaluation of the NEDI with somatic variants in TCGA-LUAD
In this analysis, we aimed to investigate the differences in somatic mutations between the NEDI-low and high subgroups according to the median NEDI and further detected the SNVs, SNPs, and INDELs using R software. We downloaded and prepared somatic variants with the Mutation Annotation Format (MAF) and used the R package “maftools” to perform the visualization [35]. The tumour mutation burden (TMB) referred to number of the somatic genes, base substitutions, insertions or deletions, and we calculated the TMB as (mutation frequency with the number of variants)/the length of exons (38 million) for each TCGA-LUAD sample. The associations between NEDI and TMB data were calculated based on Pearson correlation coefficients. Co-occurring and mutually exclusive mutations were identified using the R package “CoMEt” [36]. Fisher’s exact test was used to identify the differential mutation pattern, and genes with a P value lower than 0.05 were defined as differentially mutated genes.
2.5 TME construction and immunotherapy response prediction
Immune-related gene signatures were calculated using the single-sample gene-set enrichment analysis (ssGSEA) algorithm through R package “GSVA” [37]. The annotated gene set file (msigdb.v6.2.symbols.gmt) was adopted for our analysis as the reference, and the significance was also based on the threshold of P < 0.05. The correlations between the NEDI and all 29 immune-related gene signatures were calculated based on the Pearson correlation coefficients. The stromal, immune, and estimate scores of each sample were estimated using the ESTIMATE algorithm by the R package “estimate”[38]. GSEA was performed by the R package “fgsea”. The Cancer Immunome Atlas (TCIA) web tool provides the results of comprehensive immunogenomic analyses (https://tcia.at/) [39]. Tumour immunogenicity was quantitatively scored from 0 to 10 and was named the immunophenoscore (IPS). Next, we also used the ssGSEA algorithm to compare the T-cell inflamed gene expression profile (GEP) and innate anti-PD-1 resistance (IPRES), which could predict melanoma response to ICB treatment between the NEDI-low and high subgroups. Furthermore, the unsupervised subclass mapping (Submap) method was applied to evaluate the expression similarity of LUAD patients in the NEDI-low and high subgroups with different immunotherapy outcomes [40]. If a pair's expression profiles shared more similarity, their clinical outcomes were more likely to be similar [41].
2.6 Assessment of sensitivity to chemotherapeutic and molecular drugs
To estimate the NEDI in predicting the response to chemotherapeutic and molecular drugs, the R package “pRRophetic” [42] was applied to calculate the half-maximal inhibitory concentration (IC50) of samples between the NEDI-low and high subgroups. The IC50 values of the NEDI-low and high subgroups were compared with the Mann–Whitney test.
2.7 Screening of potential compounds targeting the NEDI signature in LUAD
The Connectivity Map (CMap) [43] dataset is a comprehensive resource for exploring the underlying associations among genes, chemicals, and biological conditions. We wanted to utilize the CMap to screen potential candidate compounds that target crosstalk associated with LUAD in the NEDI-low and high subgroups. We calculated a list of differentially expressed genes between NEDI-low and high subgroups and selected the top 150 to query the dataset. The query results were calculated and ranked with scores from -100 to 100. Overall, we selected the relative compounds that had an enrichment score ≥ 70, which might be capable of targeting the NSCLC-NED.
2.8 Establishing and evaluating the prognosis prediction model
Survival time and status were used to evaluate the prognosis of LUAD patients. The workflow to explore the prognostic signatures in LUAD consists of four steps: (1) Genes were sorted based on a median absolute deviation (MAD) value > 0.5 and a gene co-expression network was constructed using the R package “WGCNA” [44] to analyse and identify gene modules strongly associated with the NEDI in TCGA-LUAD. The 2653 ranked genes were screened and used for further analysis. (2) Univariate Cox proportional hazards regression was used to assess the individual effect of every alteration using the R package “survival”, and then the features with a P value less than 0.05 were selected for further analysis. (3) Lasso regression model was used to obtain the genetic variable-based prediction model and to generate the risk scores for all samples using the R package “survival”. (4) The risk score of each patient was predicted based on the training model by using the “predict” function included in the R package “survival”. In our analysis, 9 genes (CXCL17, DKK1, KRT8, IRX5, ENPP5, RHOV, MELTF, ESYT3, PLEK2) retained their Lasso regression coefficients. LUAD cohorts from GEO (GSE72094, GSE31210 and GSE50081) were used as validation sets. Based on the gene expression values and Lasso coefficients, a risk score for each sample was calculated as follows:
Then, we conducted survival analyses of LUAD patients with respect to risk level. To further improve the prediction ability, we constructed a nomogram by integrating stage and risk score by R package “survival” and “rms” [45]. Time-dependent receiver operating characteristic (time-ROC) analysis and time-dependent concordance index (C-index) were used to validate the best accuracy of nomogram compared with other prognostic factors by R package “pec” and “riskRegression” [46, 47]. The time-ROC and C-index were calculated to evaluate the performance of the prognosis prediction model.
2.9 Statistical analysis
Student’s t test was used to compare two groups with normally distributed data, while the Mann–Whitney test was conducted to compare groups with nonnormally distributed data. For paired samples, the paired t-test was utilized. Comparisons between multiple groups with normally distributed data were carried out using an analysis of variance (ANOVA). Differential analysis was performed based on the R package “limma” [48], and Kaplan–Meier analysis was conducted via the R package “survival” by using the optimal cut-off points. Analyses were conducted using GraphPad V.9.4.1 software and R V.4.2.1 software, and statistical significance was defined as P < 0.05. The information about key software and algorithms used in this study are provided in Table S3.
3 Results
3.1 Development and validation of the mRNA expression-based one-class predictor to quantitatively evaluate neuroendocrine features in NSCLC
To define the signatures of NSCLC tumours with neuroendocrine features, we first determined the protein levels of neuroendocrine markers, including CgA, CD56, and Syp, in NSCLC cells, and we found that these markers were undetected in A549, HCC827 and PC9 human lung adenocarcinoma cells, except for CgA (Fig. 1a). This result indicated that it is difficult to evaluate neuroendocrine features at the neuroendocrine marker protein level. Hence, we performed an integrative analysis based on the transcriptome to quantify the degrees of NED in lung cancer. Public lung cancer datasets for which transcriptomic profiles were available, including TCGA, CPTAC, CCLE and GEO, were analysed. For the mRNA expression-based model, to ensure compatibility between the different cohorts, we first mapped the list of common genes at the intersection of three cohorts, including CCLE, TCGA-LUAD and CPTAC-LUAD, and the resulting matrix containing 13,279 mRNAs expression values across all available samples was further analysed (Figure S1a). Traditional supervised approaches deconvolute gene expression data from mixed samples into a set of distinct cell types based on two or more classes to train models. However, there is no definitive negative class, and it is unclear how the classes in the negative set should be weighted for neuroendocrine features in NSCLC. Therefore, we developed a NED index (NEDI) using the OCLR algorithm trained on 50 SCLC cells (gene expression datasets from CCLE database), a particular known pulmonary neuroendocrine cell type that exhibits neuroendocrine features, to identify whether neuroendocrine features are present at some appreciable level in an NSCLC tumour sample that may contain numerous cell types (Fig. 1b). The OCLR-based NEDI ranges from low (zero) to high (one) NED and was applied to multiple lung cancer cohorts. The NEDI was validated by applying it to an external dataset composed of both neuroendocrine cancers, including large-cell neuroendocrine carcinoma, typical carcinoid and SCLC, and non-neuroendocrine lung cancers (GSE1037) [24, 49]. As anticipated, lung cancer samples showed different degrees of neuroendocrine phenotypes, and LUAD samples displayed lower NEDI values than neuroendocrine tumour types (Fig. 1c). Within these, individual tumour samples showed heterogeneous neuroendocrine features. Moreover, the NEDI values in different lung cancer cohorts with distinct histological types were further calculated, and all neuroendocrine samples exhibited higher NEDI values than samples from LUAD and LUSC (Fig. 1d). Interestingly, primary tumour LUAD samples exhibited neuroendocrine features that were similar to those of non-tumour samples obtained from matched adjacent normal tissues of origin (Figure S1b). Moreover, we found that the neuroendocrine index using the OCLR algorithm trained on large-cell neuroendocrine carcinomas (LCNC) or carcinoids (CC) could not exactly distinguish neuroendocrine and non-neuroendocrine carcinomas (Figure S1c, d).
Neuroendocrine transformation is one of the mechanisms of acquired resistance to EGFR-TKIs in LUAD. Therefore, we further analysed the NEDI by scoring index values of NSCLC samples that are characterized by different molecular changes associated with TKI resistance (GSE64322), and we found that TKI-resistant NSCLC samples attained a higher NEDI than TKI-sensitive NSCLC samples (Fig. 1e). A similar result was also observed in LUAD with resistance to erlotinib (GSE31625) (Figure S1e). These results suggested that NSCLC samples exhibited NED during TKI resistance even in the absence of detectable histological transformation into SCLC. In addition, we noted that a high NEDI was associated with higher gene expression of neuroendocrine markers, including CgA, CD56, Syp and NSE in TCGA-LUAD and TCGA-LUSC (Figure S1f-m).
In summary, we developed and validated an mRNA expression-based one-class predictor trained on SCLC cells to quantitatively evaluate neuroendocrine features in NSCLC, and lung cancer samples stratified by the index were used for further integrative analyses.
3.2 Selected molecular and clinical outcomes associated with the NEDI in LUAD
We further investigated the association between the NEDI and the known clinical and molecular features of LUAD. We observed no significant association between the NEDI and clinical features, including gender, age, TNM classification and clinical stage (Fig. 2a). Interestingly, we found that the NEDI was highest in LUAD.2 (Fig. 2b), one of the molecular subtypes of LUAD previously defined in TCGA and known to exhibit higher tumour ploidy and mutation rates and to frequently harbour TP53 mutations [50]. Biallelic inactivation of TP53 and RB1 has been reported to be associated with several key oncogenic processes in SCLC [19]. We found that a high NEDI was strongly associated with somatic mutations in the TP53 and RB1 genes, which are potent tumour suppressors in LUAD (Fig. 2c). This result indicated that TP53 and RB1 could be drivers of the dedifferentiation of neuroendocrine in LUAD. EGFR mutation is a well-known driver in lung cancer [51]. However, we did not find a correlation between the NEDI and EGFR mutation status (Figure S2a-c). These results suggested that aberrant pathway alterations mediated by genetic disorder play an important role in regulating the NED of LUAD and remain to be further investigated.
SCLC is strongly associated with exposure to tobacco carcinogens [52]. Therefore, we evaluated the association between the NEDI and smoking status, and we found that current smokers had a higher NEDI than former smokers and non-smokers (Fig. 2d). This result suggested that the NED of LUAD might be activated in response to smoking. It has been shown that NED is related to the variable prognosis of NSCLC in different studies, with sometimes conflicting results [53]. However, we observed that a higher NEDI correlated with better prognosis in the TCGA-LUAD cohort in the present study (Fig. 2e). Similar results were obtained in other GEO-LUAD cohorts (GSE31210, GSE72094 and GSE50081) (Fig. 2f–h). These findings indicated that the NEDI based on transcriptomic data has a better prognostic value for LUAD than IHC.
3.3 Correlation of the Mutational Landscape and NED in LUAD
As previously mentioned, genetic disorders might play an important role in regulating NED in LUAD. Therefore, we systematically further investigated the genomic disparity between the NEDI-low and high subgroups, which were divided according to the median NEDI value, in LUAD based on WES data from the TCGA-LUAD cohort. Somatic mutations, including single-nucleotide polymorphisms (SNPs), single-nucleotide variants (SNVs), insertions (INSs), and deletions (DELs), were analysed with the R package “maftools”. As expected, we found a strong association between the NEDI and a higher tumour mutation burden (TMB) (Fig. 3a). We noted that the genomic variants were highly enriched in missense mutations in both the NEDI-low and high subgroups (approximately 85%), and samples in the NEDI-high subgroup exhibited a significantly larger number of variants than those in the NEDI-low subgroup (Fig. 3b). For SNVs, C > A was the most common type of variant, and all SNVs were enriched in tumour samples with a high NEDI (Figure S3a). In addition, we found that a high NEDI was associated with transversion (Tv) in all SNVs; however, an adverse association between the NEDI and transition (Ti) in all SNVs was detected (Figure S3b). We also found that SNPs were enriched in the NEDI-high subgroup (Figure S3c). Despite the significant differences in the variant number of the somatic mutations between the NEDI-low and high subgroups, the constituent ratio of each mutation type occupied among all variants remained almost unchanged (Figure S3d-g), indicating that genomic disparity in variant number was not caused by a shift in the type of mutation.
Moreover, we found that 25 genes were mutated in more than 20% of the samples in the NEDI-high subgroup, while only 10 genes met this criterion in the NEDI-low subgroup (Fig. 3c). Interestingly, TP53, TTN, CSMD3 and MUC16 occupied the top four positions in both subgroups. In addition, 578 differentially mutated genes were detected using Fisher’s exact test (Table S4). We further performed an enrichment analysis of known oncogenic pathways based on these differentially mutated genes and found that 7 canonical pathways, the RTK-RAS, Notch, β-catenin/Wnt, Hippo, PI-3-Kinase/Akt, cell cycle and TP53 signalling pathways, were significantly enriched (Fig. 3d). This suggested that these oncogenic pathway alterations mediated by genomic mutation, especially RTK-RAS signalling pathway alterations, might be related to the regulation of NED in LUAD.
Moreover, we investigated the co-occurring and exclusive mutations of the top 25 most frequently mutated genes by using the CoMEt algorithm. Compared with the pervasive co-occurrence landscape, there were three unique cases (KRAS-TP53, KRAS-CSMD3 and KRAS-TNR) that exhibited mutually exclusive mutations in the NEDI-high subgroup (Fig. 3e), suggesting that these pairs of genes shared the same pathways and probably had redundant effects on regulating the NED of LUAD.
Taken together, these results suggested that the mutations played a meaningful role in the NED of LUAD.
3.4 Biological processes associated with NED in LUAD
To investigate the biological processes associated with NED in LUAD, pathway enrichment analysis (gene set enrichment analysis, GSEA) was performed on differentially expressed genes (DEGs) between the NEDI-low and high subgroups in the TCGA-LUAD cohort. Notably, cell-cycle-related cancer hallmark gene sets, such as E2F_TARGETs, G2M_CHECKPOINT and MITOTIC_SPINDLE, significantly contributed to the positive side of the NED signature (Fig. 4a). This suggests that the progression of the cell cycle orchestrated by numerous checks and balances might be distinctly primed for NED. However, immune-related cancer hallmark gene sets, such as IL6_JAK_STAT3_SIGNALING, INTERFERON_ ALPHA_RESPONSE and INFLAMMATORY_RESPONSE, were significantly enriched on the negative side of the NED signature (Fig. 4a). This negative enrichment implied that sufficient immune cell infiltration might be exhibited within tumour samples with low NED, and those patients might benefit from immunotherapy.
To further delineate potential molecular events underlying NED in LUAD, the top 150 upregulated and downregulated genes in the NEDI-high subgroup compared with the NEDI-low subgroup in the TCGA-LUAD and 3 GEO-LUAD cohorts were subjected to the CMap (http://www.broad.mit.edu/cmap/) analysis, a systematic approach to investigate the functional connections among diseases, genetic perturbations, and drug actions. CMap mode-of-action (MoA) analysis revealed 38 mechanisms of action shared by 49 compounds with the highest prediction scores. In particular, 5 compounds (ABT-751, flubendazole, nocodazole, vincristine, vinorelbine) shared the MoA of tubulin inhibitors (Fig. 4b, c), suggesting that microtubule-based functions, such as cell migration, cell division, and intracellular trafficking, might be involved in NED and that microtubule-targeting agents, such as taxanes (paclitaxel and docetaxel) and vinca alkaloids (vincristine), might mediate the dedifferentiation of neuroendocrine in LUAD.
To investigate the association between the NEDI and these malignant features, including rapid proliferation [54], angiogenesis [55] and active EMT(obtained from Gene Ontology (GO): 0001837), we quantified the differentiation ability of tumours in the cell cycle, angiogenesis and EMT by the ssGSEA algorithm and found pronounced positive correlations of the NEDI with the cell cycle and negative correlations of the NEDI with angiogenesis in TCGA-LUAD and GEO-LUAD cohorts(GSE13213, GSE50081 and GSE72094) (Fig. 4d, e and Figure S4a,b). However, no significant association of the NEDI with EMT was observed (Fig. 4f and Figure S4c). It has been reported that cancer progression involves a gradual loss of a differentiated phenotype and an acquisition of stem-cell-like features [14]. Therefore, we also tested the association of the NEDI with the stemness index. We observed a positive correlation between the NEDI and the stemness index (Fig. 4g and Figure S4d), suggesting that LUAD samples with high levels of neuroendocrine features exhibited progenitor and stem-cell-like features, which might display the potential for histological transformation into SCLC. These results indicated that with more active NED, there was more rapid cell proliferation and reduced angiogenesis within microenvironments in LUAD and that LUAD samples with a high NEDI might be more susceptible to chemotherapy and less sensitive to antiangiogenic agents.
In conclusion, microtubule-based functions, such as cell migration, cell division, and intracellular trafficking, might be involved in NED, and tumours with strong potential for NED were generally accompanied by more rapid cell proliferation and reduced angiogenesis within microenvironments in LUAD.
3.5 Association of the NED index with immunotherapy response in NSCLC
To assess the relationships between NED and the tumour immune micro-environment, we analysed 29 immune-related pathways in the NEDI-low and NEDI-high subgroups by applying the ssGSEA algorithm. We observed that higher NEDI values were significantly associated with reduced immune cell infiltration and immune effector molecule expression in the TCGA-LUAD and GEO-LUAD cohorts (GSE13213, GSE50081 and GSE72094) (Fig. 5a and Figure S5a). Moreover, we found that the NEDI-low subgroup had higher stromal scores, immune scores and ESTIMATE scores calculated using the ESTIMATE algorithm (Fig. 5b and Figure S5b). These results showed that the NEDI-low subgroup was more likely to have an inflamed immune microenvironment. We also found an association of higher NEDI values with lower PD-L1 protein expression in the TCGA-LUAD dataset (Fig. 5c). We expected that tumours with low NED would be susceptible to immune checkpoint blockade (ICB) treatment because they have sufficient immune cell infiltration and preexisting upregulation of the PD-L1 pathway.
We further analysed the probability of ICB response in the NEDI-low and NEDI-high subgroups through SubMap analysis by analysing the NEDI in a published melanoma immunotherapy dataset [56].The results showed that the NEDI-low subgroup was likely to have a favourable response to anti-PD-1 therapy. It has been reported that the T-cell inflamed gene expression profile (GEP) and innate anti-PD-1 resistance gene signature (IPRES) can predict tumour response to ICB therapy [57, 58]. The T-cell inflamed GEP and IPRES were enriched in the NEDI-low subgroup in comparison to the NEDI-high subgroup (Fig. 5e and Figure S5c, d). We then assessed the difference in TCIA immunotherapy scores between the NEDI-low and NEDI-high subgroups and found that patients with lower NEDI values had higher TCIA immunotherapy scores in the TCGA-LUAD cohort (Figure S5e).
Finally, the value of the NEDI for predicting immunotherapy response was validated in three external NSCLC datasets (GSE91061, GSE126044 and GSE135222) with full clinical information about immunotherapy [29, 31, 33]. We found that NSCLC patients with low NEDI values were more likely to have considerable therapeutic benefits and clinical response to anti-PD-1/L1 immunotherapy than with those with high NEDI values (Fig. 5f–h). In particular, patients with low NEDI values exhibited longer progression-free survival (PFS) (Fig. 5i). These results indicated that tumours with low NED are more likely to respond to immunotherapy than those with high NED and that the NEDI can be used to predict the clinical response of LUAD to immunotherapy.
3.6 The NEDI can serve as a predictive indicator of resistance to distinct chemotherapeutic agents
Chemotherapy is one of the standard treatments for LUAD patients. To further determine the value of the NEDI to guide treatment for LUAD, we used the “pRRophetic” algorithm to predict the response of LUAD samples to chemotherapeutic agents in multiple LUAD datasets (TCGA-LUAD, GSE31210, GSE13213 and GSE72094). We observed that LUAD samples with higher NEDI values displayed more sensitivity to etoposide, one of the most active agents for SCLC, but less sensitivity to paclitaxel than LUAD samples with lower NEDI values (Fig. 6a, b). Furthermore, 39 LUAD patients (GSE42127) who received adjuvant chemotherapy (mainly taxanes plus carboplatin) were analysed [32], and we observed that patients with higher NEDI values exhibited much worse prognoses (Figure S6). These results implied that paclitaxel-based regimens might not be effective in LUAD patients with high NEDI values and that etoposide-based comprehensive therapy would be a better alternative.
In summary, these results suggested that the NEDI can serve as a predictive indicator of resistance to distinct chemotherapeutic agents and that etoposide-based chemotherapy might be more effective in the treatment of LUAD with a high level of NED. However, clinical trials should be further conducted.
3.7 Development and validation of a predictive nomogram based on the NEDI for LUAD
To construct the NEDI-based prognostic model, 2693 mRNAs with a median absolute deviation (MAD) > 0.5 within the list of common genes were applied to perform WGCNA in the TCGA-LUAD cohort to find the key modules associated with the NEDI. The thresholding power of β = 5 (scale-free R2 = 0.9) was selected to construct a scale-free network (Figure S7a-c). Of the 7 modules identified, the turquoise module showed the highest positive correlation with the NEDI (Fig. 7a). Herein, 774 genes within the turquoise module were used to perform univariate Cox regression analysis, resulting in the identification of 385 survival-related genes. Next, those genes were selected for the Lasso Cox regression analysis, a recommended dimensionality-reduction method for the regression of high-dimensional data [59]. Finally, a NEDI-based risk model was established by calculating risk scores on the basis of the expression of 9 pivotal genes (CXCL17, DKK1, KRT8, IRX5, ENPP5, RHOV, MELTF, ESYT3, PLEK2) and respective coefficients for each patient (Table S5). Using the optimal cut-off points as the threshold, patients in the TCGA-LUAD cohort were divided into low- and high-risk subgroups, and the Kaplan‒Meier survival curves showed that patients with lower risk scores had better clinical outcomes (Fig. 7b). Moreover, the prognostic value was validated in GEO-LUAD datasets, including GSE72094, GSE31210 and GSE50081 (S7d-f). In addition, risk score and clinical stage were identified as prognostic factors after univariable Cox regression analysis and were used to construct a comprehensive nomogram, which can provide clinicians with the prognosis of patients based on their total points (Fig. 7c and Figure S7g). The calibration curves of 1- 3 and 5-year OS prediction were close to the ideal performance in both the training (TCGA-LUAD, C-index = 0.712) and external validation cohorts (GSE72094, C-index = 0.703, and GSE31210, C-index = 0.71 and GES50081, C-index = 0.659) (Fig. 7d–g). In addition, time-dependent ROC analysis demonstrated that the nomogram exhibited a much more powerful capacity for survival prediction than the NEDI and clinical stage, with an average AUC above 0.75 during a follow-up of 4 years (Fig. 7h). In conclusion, our results imply that the NEDI-based prognostic model can guide clinical treatment and predict prognosis to a certain extent in patients with LUAD.
4 Discussion
Neuroendocrine features can be found in 10–30% of NSCLC tumours with no evidence of neuroendocrine transformation histologically, and these tumours are collectively referred to as NSCLC with NED [5]. In contrast to conventional histological classifications, NSCLC-NED can be identified by IHC or ultrastructural pathology examination rather than light microscopy [6, 7]. Distinct studies have been carried out to explore the treatment response and prognosis of NSCLC-NED, but the results are contradictory. Some of these contradictory findings may be particularly related to the different stages of disease, treatments received and antibodies used in IHC [53, 60, 61]. Despite these advances, little is known about NSCLC-NED therapy, which leads to uncertainty about appropriate treatments and prognostic implications for clinicians.
Gene expression data obtained from tumour biopsy samples reflect the overall state of multiple cell subpopulations in a sample. Traditional methods based on nonnegative matrix factorization or other matrix decomposition techniques for collecting gene expression data have difficulty identifying all tumour cell subtypes in a single optimization [12]. The OCLR machine learning algorithm, which is a novel one-class method based on logistic regression, has been demonstrated to have an advantage over its binary predictor counterparts in identifying specific cell types in heterogeneous cell populations [13]. In the present study, we chose the one-class framework because it is robust in the absence of a “negative” class. The NED index (NEDI) to quantify the degree of NED in NSCLC was established by the OCLR algorithm trained on 50 SCLC cell lines with the expression patterns of 13,279 common genes. Our results showed that the NEDI was positively associated with the level of neuroendocrine features, and this model was validated in different independent lung cancer cohorts. Application of the NEDI revealed a high degree of intratumor heterogeneity with respect to neuroendocrine features and provided insight into quantifying the degree of NED in NSCLC. To the best of our knowledge, this is the first study to develop a classification tool and machine-learning algorithm for the analysis of a spectrum of neuroendocrine features in NSCLC based on transcriptome data.
In this study, we initially analysed the relationship between the NEDI and molecular and clinical features and found a strong association between the NEDI and LUAD.2, one of the molecular subtypes of LUAD previously defined by TCGA and known to exhibit higher tumour ploidy and mutation rates and to frequently harbour TP53 mutations [50]. We found that a high NEDI was strongly correlated with somatic mutations in the TP53 and RB1 genes and tobacco carcinogens, which have been reported to be associated with the development of SCLC, the most common form of neuroendocrine lung cancer. These results further confirmed that the NEDI could be used to quantitatively determine the degree of NED in NSCLC.
Different mutational and immune landscapes were also observed between groups with different levels of NED (categorized based on the NEDI). We found that LUAD samples with higher NEDI values showed more genomic variants, and there was a strong association between the NEDI and TMB. It has been shown that patients with tumours with a high TMB are likely to benefit from immunotherapy [62]. Unexpectedly, a higher NEDI was associated with reduced immune cell infiltration and lower PD-L1 expression in LUAD despite the high TMB in our results. The therapeutic benefits and clinical response to anti-PD-1/L1 immunotherapy in LUAD patients with different NEDI values were predicted by calculating the T-cell inflamed GEP and IPRES and performing SubMap analysis. Moreover, the model was validated in three independent datasets of lung cancer immunotherapy. Our results showed that LUAD patients with high NEDI values were less likely to respond to ICB treatments. Our findings indicated that the NEDI might help predict the efficacy of immunotherapy and that TMB might not be a fully reliable biomarker for predicting the response to immunotherapy in LUAD patients.
Furthermore, we performed GSEA to investigate the biological processes associated with NED in LUAD and found that cell-cycle-related cancer hallmark gene sets significantly contributed to the positive side and immune-related cancer hallmark gene sets were significantly enriched on the negative side of the NED signature. This further confirmed that sufficient immune cell infiltration might exist within LUAD samples with low degrees of NED and that those patients might benefit from immunotherapy. Moreover, CMap analysis using the gene expression signatures of tumour samples with different NEDI values was performed and showed that tubulin inhibitors were most likely to be effective based on their MoA, indicating that microtubule-based functions, such as cell migration, cell division, and intracellular trafficking, might be involved in NED. In addition, we found that tumours with strong potential for NED generally had more rapid cell proliferation and reduced angiogenesis within microenvironments in LUAD, suggesting that LUAD samples with high NEDI values might be more susceptible to chemotherapy and less sensitive to antiangiogenic agents, which might result in better prognoses in patients with LUAD with high degrees of NED.
Neuroendocrine features arise from progenitor cells that accumulate mutations in oncogenic pathways or from differentiation of non-neuroendocrine cancer cells into cancer cells with neuroendocrine features via alteration of core signalling pathways that regulate the neuroendocrine phenotypes in NSCLC. In this study, we observed that primary LUAD tumour samples exhibited neuroendocrine features that were similar to those of nontumour samples obtained from matched adjacent normal tissues of origin, suggesting that the cancer cells had inherent neuroendocrine features rather than the features developing because of differentiation induced by the tumour microenvironment in NSCLC. However, further validation using other genomic profiles and laboratory experiments is required to validate the findings.
Stem-cell-like features are defined as the potential for self-renewal and differentiation in a cell of origin that possesses the ability to give rise to all cell types in an adult organism [63]. In our results, we found that EGFR-TKI-resistant samples had a higher NEDI than TKI-sensitive samples from patients with LUAD with EGFR mutations. Moreover, the NEDI was positively correlated with stemness features in LUAD. These results suggest that LUAD samples with high levels of neuroendocrine features exhibit progenitor and stem-cell-like features and high potential for histological transformation into SCLC, which may result in acquired resistance to EGFR-TKIs.
More interestingly, we observed that LUAD samples with higher NEDI values displayed more sensitivity to etoposide, one of the most active agents used for small-cell lung cancer therapy, but less sensitivity to paclitaxel (microtubule-targeting agents) by using the “pRRophetic” algorithm. Consistently, a much worse prognosis in the higher NEDI subgroup was validated in LUAD patients who received adjuvant chemotherapy with a taxane. This may be partly explained by the dedifferentiation of neuroendocrine cells induced by microtubule-targeting agents, which results in reduced cell cycle progression and less sensitivity to chemotherapeutic agents. It also indicated that paclitaxel-based regimens might not be effective in LUAD patients with high NEDI values and that etoposide-based combination therapy would be a better alternative. Chemotherapy has been introduced into the standard treatment for LUAD patients with acquired resistance to EGFR-TKIs [3]. Hence, etoposide-based regimens might be more effective in LUAD patients with acquired resistance to EGFR-TKIs.
Different sensitivities and specificities of the agents used in IHC analysis may lead to significant variation in reported incidence and underly the contradictory results about the prognosis of NSCLC-NED [60]. An integrative analysis based on the transcriptome would provide a reliable estimation of markers for cancer with advantages of being amenable to standardization and avoiding interpretation bias [10, 11]. Hence, the mRNA-based NEDI model would be more reliable for the evaluation of NED in NSCLC than IHC analysis. However, different transcriptome analysis platforms utilize various equipment types and physical principles for detecting output signals, as well as different library preparation protocols, leading to substantial batch effects and incompatibility problems [64, 65]. Moreover, contradictory mRNA expression results have been obtained from the use of various sources of clinical biomaterials; for example, high-integrity RNAs may be obtained from fresh tissue specimens rather than formalin-fixed paraffin-embedded tissue samples [66]. Hence, the mRNA-based NEDI model has some limitations in clinical utility and thus should be validated and modified in subsequent studies.
We found that LUAD patients with a higher NEDI exhibited a better prognosis in most LUAD datasets, indicating that the NEDI (calculated based on transcriptomic data) exhibited high prognostic value for LUAD. By applying the NEDI model, a prognostic NED-related signature was established and tested in the TCGA-LUAD cohort. A NEDI-based risk model was established by calculating risk scores on the basis of the expression of 9 pivotal genes. We found that patients with lower risk scores showed better clinical outcomes. In addition, the risk score and clinical stage exhibited powerful prognostic capacity and were incorporated into a comprehensive nomogram that achieved accurate survival prediction.
Although this study achieved comprehensive investigation of the mutational landscapes, altered pathways and immune characteristics of subgroups with different NEDI values and suggested potential personalized treatments for patients with LUAD based on the NEDI model, future studies including evaluations of molecular mechanisms and clinical trials should be conducted to validate our findings and ensure the clinical utility of the NEDI as a biomarker. These issues are worthy of further study.
5 Conclusion
Collectively, our findings shed some light on the field of NED and we developed a new index based on the OCLR algorithm to quantitatively evaluate neuroendocrine features in LUAD. Our study provides a strategy for applying NEDI-based risk stratification to guide decision-making regarding immunotherapy and chemotherapy.
Data availability
All data generated or analysed during this study are included in this published article and its supplementary information files. Lung cancer transcriptomics datasets analyzed in this research can be found at: The Cancer Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/), Clinical Proteomic Tumor Analysis Consortium (CPTAC) (https://www.cbioportal.org), Cancer Cell Line Encyclopedia (CCLE) (http://www.broadinstitute.org/ccle), and Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo).
Code availability
The statistics and analysis of data were based on open source software programming (R project, https://www.r-project.org/) and the information about key software and algorithms used in this study are provided in Table S3.
Abbreviations
- NSCLC:
-
Non-small cell lung cancer
- OCLR:
-
One-class logistic regression
- NED:
-
Neuroendocrine differentiation
- NEDI:
-
Neuroendocrine differentiation index
- SCLC:
-
Small cell lung cancer
- LUAD:
-
Lung adenocarcinoma
- LUSC:
-
Lung squamous cell carcinoma
- EGFR-TKIs:
-
Epidermal growth factor receptor -tyrosine kinase inhibitors
- NSCLC-NED:
-
Non-small cell carcinomas with neuroendocrine differentiation
- IHC:
-
Immunohistochemistry
- CgA:
-
Chromogranin A
- CD56:
-
Neural cell adhesion molecule 1
- Syp:
-
Synaptophysin
- NSE:
-
Neuron-specific enolase
- TCGA:
-
The Cancer Genome Atlas
- CPTAC:
-
Clinical Proteomic Tumor Analysis Consortium
- CCLE:
-
Cancer Cell Line Encyclopedia
- GEO:
-
Gene Expression Omnibus
- BSA:
-
Bovine serum albumin
- MAF:
-
Mutation Annotation Format
- TME:
-
Tumor microenvironment
- TMB:
-
Tumour mutation burden
- ssGSEA:
-
Single-sample gene-set enrichment analysis
- IPS:
-
Immunophenoscore
- T-cell inflamed GEP:
-
T-cell inflamed gene expression profile
- IPRES:
-
Innate anti-PD-1 resistance
- Submap:
-
Subclass mapping
- CMap:
-
Connectivity Map
- LCNC:
-
Large-cell neuroendocrine carcinomas
- CC:
-
Carcinoids
- SNPs:
-
Single-nucleotide polymorphisms
- SNVs:
-
Single-nucleotide variants
- INSs:
-
Insertions
- DELs:
-
Deletions
- DEGs:
-
Differentially expressed genes
- EMT:
-
Epithelial mesenchymal transition
References
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209–49.
Thai AA, Solomon BJ, Sequist LV, Gainor JF, Heist RS. Lung cancer. Lancet. 2021;398:535–54.
Passaro A, Janne PA, Mok T, Peters S. Overcoming therapy resistance in EGFR-mutant lung cancer. Nat Cancer. 2021;2:377–91.
Quintanal-Villalonga A, Taniguchi H, Zhan YA, Hasan MM, Chavan SS, Meng F, Uddin F, Manoj P, Donoghue MTA, Won HH, et al. Multiomic analysis of lung tumors defines pathways activated in neuroendocrine transformation. Cancer Discov. 2021;11:3028–47.
Slodkowska J. The value of immunohistochemical identification of neuroendocrine differentiation in non small cell lung carcinoma. Rocz Akad Med Bialymst. 1997;42(Suppl 1):23–7.
Graziano SL, Mazid R, Newman N, Tatum A, Oler A, Mortimer JA, Gullo JJ, DiFino SM, Scalzo AJ. The use of neuroendocrine immunoperoxidase markers to predict chemotherapy response in patients with non-small-cell lung cancer. J Clin Oncol. 1989;7:1398–406.
Skov BG, Sorensen JB, Hirsch FR, Larsson LI, Hansen HH. Prognostic impact of histologic demonstration of chromogranin A and neuron specific enolase in pulmonary adenocarcinoma. Ann Oncol. 1991;2:355–60.
Slodkowska J, Zych J, Szturmowicz M, Demkow U, Rowinska-Zakrzewska E, Roszkowski-Sliz K. Neuroendocrine phenotype of non-small cell lung carcinoma: immunohistological evaluation and biochemical study. Int J Biol Markers. 2005;20:217–26.
Liu Y, Xu J, Jiao Y, Hu Y, Yi C, Li Q, Tong Z, Wang X, Hu L, Xiao Q, et al. Neuroendocrine differentiation is a prognostic factor for stage II poorly differentiated colorectal cancer. Biomed Res Int. 2014;2014: 789575.
Conroy JM, Pabla S, Nesline MK, Glenn ST, Papanicolau-Sengos A, Burgher B, Andreas J, Giamo V, Wang Y, Lenzo FL, et al. Next generation sequencing of PD-L1 for predicting response to immune checkpoint inhibitors. J Immunother Cancer. 2019;7:18.
Sorokin M, Ignatev K, Poddubskaya E, Vladimirova U, Gaifullin N, Lantsov D, Garazha A, Allina D, Suntsova M, Barbara V, Buzdin A. RNA sequencing in comparison to immunohistochemistry for measuring cancer biomarkers in breast cancer and lung cancer specimens. Biomedicines. 2020;8:114.
Brunet JP, Tamayo P, Golub TR, Mesirov JP. Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci U S A. 2004;101:4164–9.
Sokolov A, Paull EO, Stuart JM. One-class detection of cell states in tumor subtypes. Pac Symp Biocomput. 2016;21:405–16.
Malta TM, Sokolov A, Gentles AJ, Burzykowski T, Poisson L, Weinstein JN, Kaminska B, Huelsken J, Omberg L, Gevaert O, et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell. 2018;173(338–354): e315.
Hiam-Galvez KJ, Allen BM, Spitzer MH. Systemic immunity in cancer. Nat Rev Cancer. 2021;21:345–59.
Ghandi M, Huang FW, Jane-Valbuena J, Kryukov GV, Lo CC, McDonald ER 3rd, Barretina J, Gelfand ET, Bielski CM, Li H, et al. Next-generation characterization of the Cancer Cell Line Encyclopedia. Nature. 2019;569:503–8.
Gillette MA, Satpathy S, Cao S, Dhanasekaran SM, Vasaikar SV, Krug K, Petralia F, Li Y, Liang WW, Reva B, et al. Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma. Cell. 2020;182(200–225): e235.
Satpathy S, Krug K, Jean Beltran PM, Savage SR, Petralia F, Kumar-Sinha C, Dou Y, Reva B, Kane MH, Avanessian SC, et al. A proteogenomic portrait of lung squamous cell carcinoma. Cell. 2021;184(4348–4371): e4340.
George J, Lim JS, Jang SJ, Cun Y, Ozretic L, Kong G, Leenders F, Lu X, Fernandez-Cuesta L, Bosco G, et al. Comprehensive genomic profiles of small cell lung cancer. Nature. 2015;524:47–53.
Thaiparambil J, Dong J, Grimm SL, Perera D, Ambati CSR, Putluri V, Robertson MJ, Patel TD, Mistretta B, Gunaratne PH, et al. Integrative metabolomics and transcriptomics analysis reveals novel therapeutic vulnerabilities in lung cancer. Cancer Med. 2022. https://doi.org/10.1002/cam4.4933.
Karlsson A, Jonsson M, Lauss M, Brunnstrom H, Jonsson P, Borg A, Jonsson G, Ringner M, Planck M, Staaf J. Genome-wide DNA methylation analysis of lung carcinoma reveals one neuroendocrine and four adenocarcinoma epitypes associated with patient outcome. Clin Cancer Res. 2014;20:6127–40.
Laddha SV, da Silva EM, Robzyk K, Untch BR, Ke H, Rekhtman N, Poirier JT, Travis WD, Tang LH, Chan CS. Integrative genomic characterization identifies molecular subtypes of lung carcinoids. Cancer Res. 2019;79:4339–47.
Niederst MJ, Sequist LV, Poirier JT, Mermel CH, Lockerman EL, Garcia AR, Katayama R, Costa C, Ross KN, Moran T, et al. RB loss in resistant EGFR mutant lung adenocarcinomas that transform to small-cell lung cancer. Nat Commun. 2015;6:6377.
Fujiwara T, Hiramatsu M, Isagawa T, Ninomiya H, Inamura K, Ishikawa S, Ushijima M, Matsuura M, Jones MH, Shimane M, et al. ASCL1-coexpression profiling but not single gene expression profiling defines lung adenocarcinomas of neuroendocrine nature with poor prognosis. Lung Cancer. 2012;75:119–25.
Balko JM, Potti A, Saunders C, Stromberg A, Haura EB, Black EP. Gene expression patterns that predict sensitivity to epidermal growth factor receptor tyrosine kinase inhibitors in lung cancer cell lines and human lung tumors. BMC Genomics. 2006;7:289.
Okayama H, Kohno T, Ishii Y, Shimada Y, Shiraishi K, Iwakawa R, Furuta K, Tsuta K, Shibata T, Yamamoto S, et al. Identification of genes upregulated in ALK-positive and EGFR/KRAS/ALK-negative lung adenocarcinomas. Cancer Res. 2012;72:100–11.
Der SD, Sykes J, Pintilie M, Zhu CQ, Strumpf D, Liu N, Jurisica I, Shepherd FA, Tsao MS. Validation of a histology-independent prognostic gene signature for early-stage, non-small-cell lung cancer including stage IA patients. J Thorac Oncol. 2014;9:59–64.
Schabath MB, Welsh EA, Fulp WJ, Chen L, Teer JK, Thompson ZJ, Engel BE, Xie M, Berglund AE, Creelan BC, et al. Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene. 2016;35:3209–16.
Riaz N, Havel JJ, Makarov V, Desrichard A, Urba WJ, Sims JS, Hodi FS, Martin-Algarra S, Mandal R, Sharfman WH, et al. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell. 2017;171(934–949): e916.
Tomida S, Takeuchi T, Shimada Y, Arima C, Matsuo K, Mitsudomi T, Yatabe Y, Takahashi T. Relapse-related molecular signature in lung adenocarcinomas identifies patients with dismal prognosis. J Clin Oncol. 2009;27:2793–9.
Cho JW, Hong MH, Ha SJ, Kim YJ, Cho BC, Lee I, Kim HR. Genome-wide identification of differentially methylated promoters and enhancers associated with response to anti-PD-1 therapy in non-small cell lung cancer. Exp Mol Med. 2020;52:1550–63.
Hight SK, Mootz A, Kollipara RK, McMillan E, Yenerall P, Otaki Y, Li LS, Avila K, Peyton M, Rodriguez-Canales J, et al. An in vivo functional genomics screen of nuclear receptors and their co-regulators identifies FOXA1 as an essential gene in lung tumorigenesis. Neoplasia. 2020;22:294–310.
Jung H, Kim HS, Kim JY, Sun JM, Ahn JS, Ahn MJ, Park K, Esteller M, Lee SH, Choi JK. DNA methylation loss promotes immune evasion of tumours with high mutation and copy number load. Nat Commun. 2019;10:4278.
Song LR, Weng JC, Li CB, Huo XL, Li H, Hao SY, Wu Z, Wang L, Li D, Zhang JT. Prognostic and predictive value of an immune infiltration signature in diffuse lower-grade gliomas. JCI Insight. 2020;5:e133811.
Mayakonda A, Lin DC, Assenov Y, Plass C, Koeffler HP. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 2018;28:1747–56.
Leiserson MD, Wu HT, Vandin F, Raphael BJ. CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer. Genome Biol. 2015;16:160.
Hanzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7.
Yoshihara K, Shahmoradgoli M, Martinez E, Vegesna R, Kim H, Torres-Garcia W, Trevino V, Shen H, Laird PW, Levine DA, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013;4:2612.
Charoentong P, Finotello F, Angelova M, Mayer C, Efremova M, Rieder D, Hackl H, Trajanoski Z. Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep. 2017;18:248–62.
Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP. GenePattern 2.0. Nat Genet. 2006;38:500–1.
Hoshida Y, Brunet JP, Tamayo P, Golub TR, Mesirov JP. Subclass mapping: identifying common subtypes in independent disease data sets. PLoS ONE. 2007;2: e1195.
Geeleher P, Cox N, Huang RS. pRRophetic: an R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS ONE. 2014;9: e107468.
Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet JP, Subramanian A, Ross KN, et al. The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science. 2006;313:1929–35.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559.
Nunez E, Steyerberg EW, Nunez J. Regression modeling strategies. Rev Esp Cardiol. 2011;64:501–7.
Gerds TA, Cai T, Schumacher M. The performance of risk prediction models. Biom J. 2008;50:457–79.
Mogensen UB, Ishwaran H, Gerds TA. Evaluating random forests for survival analysis using prediction error curves. J Stat Softw. 2012;50:1–23.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43: e47.
Jones MH, Virtanen C, Honjoh D, Miyoshi T, Satoh Y, Okumura S, Nakagawa K, Nomura H, Ishikawa Y. Two prognostically significant subtypes of high-grade lung neuroendocrine tumours independent of small-cell and large-cell neuroendocrine carcinomas identified by gene expression profiles. Lancet. 2004;363:775–81.
Cancer Genome Atlas Research N: Comprehensive molecular profiling of lung adenocarcinoma. Nature 2014, 511:543–550.
Oxnard GR, Hu Y, Mileham KF, Husain H, Costa DB, Tracy P, Feeney N, Sholl LM, Dahlberg SE, Redig AJ, et al. Assessment of resistance mechanisms and clinical implications in patients with EGFR T790M-positive lung cancer and acquired resistance to osimertinib. JAMA Oncol. 2018;4:1527–34.
Rudin CM, Brambilla E, Faivre-Finn C, Sage J. Small-cell lung cancer. Nat Rev Dis Primers. 2021;7:3.
Howe MC, Chapman A, Kerr K, Dougal M, Anderson H, Hasleton PS. Neuroendocrine differentiation in non-small cell lung cancer and its relation to prognosis and therapy. Histopathology. 2005;46:195–201.
Tirosh I, Izar B, Prakadan SM, Wadsworth MH 2nd, Treacy D, Trombetta JJ, Rotem A, Rodman C, Lian C, Murphy G, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352:189–96.
Senbabaoglu Y, Gejman RS, Winer AG, Liu M, Van Allen EM, de Velasco G, Miao D, Ostrovnaya I, Drill E, Luna A, et al. Tumor immune microenvironment characterization in clear cell renal cell carcinoma identifies prognostic and immunotherapeutically relevant messenger RNA signatures. Genome Biol. 2016;17:231.
Roh W, Chen PL, Reuben A, Spencer CN, Prieto PA, Miller JP, Gopalakrishnan V, Wang F, Cooper ZA, Reddy SM, et al. Integrated molecular analysis of tumor biopsies on sequential CTLA-4 and PD-1 blockade reveals markers of response and resistance. Sci Transl Med. 2017. https://doi.org/10.1126/scitranslmed.aah3560.
Auslander N, Zhang G, Lee JS, Frederick DT, Miao B, Moll T, Tian T, Wei Z, Madan S, Sullivan RJ, et al. Robust prediction of response to immune checkpoint blockade therapy in metastatic melanoma. Nat Med. 2018;24:1545–9.
Hugo W, Zaretsky JM, Sun L, Song C, Moreno BH, Hu-Lieskovan S, Berent-Maoz B, Pang J, Chmielowski B, Cherry G, et al. Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma. Cell. 2016;165:35–44.
Tibshirani R. The lasso method for variable selection in the Cox model. Stat Med. 1997;16:385–95.
Schleusener JT, Tazelaar HD, Jung SH, Cha SS, Cera PJ, Myers JL, Creagan ET, Goldberg RM, Marschke RF Jr. Neuroendocrine differentiation is an independent prognostic factor in chemotherapy-treated nonsmall cell lung carcinoma. Cancer. 1996;77:1284–91.
Gajra A, Tatum AH, Newman N, Gamble GP, Lichtenstein S, Rooney MT, Graziano SL. The predictive value of neuroendocrine markers and p53 for response to chemotherapy and survival in patients with advanced non-small cell lung cancer. Lung Cancer. 2002;36:159–65.
Sha D, Jin Z, Budczies J, Kluck K, Stenzinger A, Sinicrope FA. Tumor mutational burden as a predictive biomarker in solid tumors. Cancer Discov. 2020;10:1808–25.
Ge Y, Gomez NC, Adam RC, Nikolova M, Yang H, Verma A, Lu CP, Polak L, Yuan S, Elemento O, Fuchs E. Stem cell lineage infidelity drives wound repair and cancer. Cell. 2017;169(636–650): e614.
Borisov N, Shabalina I, Tkachev V, Sorokin M, Garazha A, Pulin A, Eremin II, Buzdin A. Shambhala: a platform-agnostic data harmonizer for gene expression data. BMC Bioinformatics. 2019;20:66.
Borisov N, Suntsova M, Sorokin M, Garazha A, Kovalchuk O, Aliper A, Ilnitskaya E, Lezhnina K, Korzinkin M, Tkachev V, et al. Data aggregation at the level of molecular pathways improves stability of experimental transcriptomic and proteomic data. Cell Cycle. 2017;16:1810–23.
Suntsova M, Gaifullin N, Allina D, Reshetun A, Li X, Mendeleeva L, Surin V, Sergeeva A, Spirin P, Prassolov V, et al. Atlas of RNA sequencing profiles for normal human tissues. Sci Data. 2019;6:36.
Funding
This work was supported by the National Natural Science Foundation of China (81702767 to L.Y), Hubei Provincial Natural Science Foundation of China (2021CFB525 to Y.Y.D), Scientific Research Foundation for Talented Scholars, Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science (4736 to L.Y) and Shiyan Science and Scientific Research Project (22Y66 to X.L.C).
Author information
Authors and Affiliations
Contributions
L.Y (Corresponding author, Department of Oncology, Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, Hubei 441021, P.R.China. Email address: studyong@outlook.com) conceived and designed the study; L.H, H.Y, L.Z.T, G.L.P and W.Y performed most of the computational analysis; L.Y and L.H prepared the illustrations and wrote the manuscript; Y.T.N, Y.Y.D, Q.P and X.L.C helped in the computational analysis procedure and draft preparation; All authors approved the final version of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, H., Han, Y., Liu, Z. et al. Depiction of neuroendocrine features associated with immunotherapy response using a novel one-class predictor in lung adenocarcinoma. Discov Onc 14, 71 (2023). https://doi.org/10.1007/s12672-023-00693-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12672-023-00693-4