Identification of a robust functional subpathway signature for pancreatic ductal adenocarcinoma by comprehensive and integrated analyses

Wang, Ping; Zhang, Chunlong; Li, Weidong; Zhai, Bo; Jiang, Xian; Reddy, Shiva; Jiang, Hongchi; Sun, Xueying

doi:10.1186/s12964-020-0522-4

Identification of a robust functional subpathway signature for pancreatic ductal adenocarcinoma by comprehensive and integrated analyses

Research
Open access
Published: 02 March 2020

Volume 18, article number 34, (2020)
Cite this article

Download PDF

You have full access to this open access article

Cell Communication and Signaling Aims and scope Submit manuscript

Identification of a robust functional subpathway signature for pancreatic ductal adenocarcinoma by comprehensive and integrated analyses

Download PDF

Ping Wang^1,2^na1,
Chunlong Zhang³^na1,
Weidong Li^1,4,
Bo Zhai^1,4,
Xian Jiang¹,
Shiva Reddy⁵,
Hongchi Jiang¹ &
…
Xueying Sun ORCID: orcid.org/0000-0002-8065-8747¹

1925 Accesses
1 Citation
4 Altmetric
Explore all metrics

Abstract

Background

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal malignancy and its mortality continues to rise globally. Because of its high heterogeneity and complex molecular landscapes, published gene signatures have demonstrated low specificity and robustness. Functional signatures containing a group of genes involved in similar biological functions may display a more robust performance.

Methods

The present study was designed to excavate potential functional signatures for PDAC by analyzing maximal number of datasets extracted from available databases with a recently developed method of FAIME (Functional Analysis of Individual Microarray Expression) in a comprehensive and integrated way.

Results

Eleven PDAC datasets were extracted from GEO, ICGC and TCGA databases. By systemically analyzing these datasets, we identified a robust functional signature of subpathway (path:00982_1), which belongs to the drug metabolism-cytochrome P450 pathway. The signature has displayed a more powerful and robust capacity in predicting prognosis, drug response and chemotherapeutic efficacy for PDAC, particularly for the classical subtype, in comparison with published gene signatures and clinically used TNM staging system. This signature was verified by meta-analyses and validated in available cell line and clinical datasets with chemotherapeutic efficacy.

Conclusion

The present study has identified a novel functional PDAC signature, which has the potential to improve the current systems for predicting the prognosis and monitoring drug response, and to serve a linkage to therapeutic options for combating PDAC. However, the involvement of path:00982_1 subpathway in the metabolism of anti-PDAC chemotherapeutic drugs, particularly its biological interpretation, requires a further investigation.

Video Abstract

A multi-gene signature predicts outcome in patients with pancreatic ductal adenocarcinoma

Article Open access 03 December 2014

Integrative analysis of gene expression profiles reveals specific signaling pathways associated with pancreatic duct adenocarcinoma

Article Open access 27 April 2018

Open-source curation of a pancreatic ductal adenocarcinoma gene expression analysis platform (pdacR) supports a two-subtype model

Article Open access 10 February 2023

Background

Pancreatic ductal adenocarcinoma (PDAC) is the fourth leading cause of cancer-related deaths worldwide and is predicted to be the second in the United States and Europe by 2030 [1, 2]. PDAC is regarded as a devastating malignancy due to its aggressive nature, presenting at an advanced stage and resistance to most treatment modalities, resulting in an overall 5-year survival rate at 9% [3], which is the lowest 5-year survival rate among all solid malignancies [4]. Such a poor outcome highlights an urgent need for seeking novel biomarkers to predict survival and monitor therapy response, which may also provide a more precise link to therapeutic options for combating PDAC.

PDAC has a very complex molecular landscape [5]. Efforts in deeply analyzing datasets have led to the discovery of potential PDAC gene signatures, which contain various numbers of distinguishable genes [6,7,8,9,10,11,12,13,14,15,16]. However, these reported signatures have few overlapping component genes with different functions, raising questions about their biological relevance, clinical significance and universal application for the management of PDAC. Each of them only reflects a specific biological trait because of cancer genetic instability, profusion of gene expression and diverse molecular subtyping, given that a high degree of heterogeneity among individuals and even within the same PDAC tumor [17, 18]. On the other hand, functions of genes and pathways explain the major features of pancreatic tumorigenesis and progression [19], thus functional signatures may display more robust performance since they contain a group of genes involved in similar biological functions [10, 20]. In order to excavate functional mechanism-anchored signatures, an analytical method called Functional Analysis of Individual Microarray Expression (FAIME) has been developed, which converts the transcriptomic information into molecular functional profiles [21]. By employing FAIME, we and others have identified several functional signatures for lung cancer [22], melanoma [23] and metabolic disorders [24]. We, therefore, designed the present study aiming at seeking potential functional signatures for PDAC by analyzing maximal number of datasets extracted from available public databases with FAIME in a comprehensive and integrated way.

Materials and methods

Datasets

Seven datasets extracted from databases of Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/) by using appropriate searching strategies (Supplementary Figure S1 and S2) were used as training sets, and 3 datasets from International Cancer Genome Consortium (ICGC) database (http://icgc.org/) and one from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/) were used as test sets (Supplementary Table S1).

Cell line datasets contained profiles of mRNA expression and drug sensitivity data of 44 and 32 human PDAC cell line samples were extracted from databases of Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) up to March of 2019, respectively.

Resources of pathways and subpathways

The pathway graphs were obtained from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [25] by using an R-based package called SubpathwayMiner [26] and converted into undirected graphs, where genes were represented by nodes. The subpathway graphs were defined based on the distance similarity rule [26] so that the distance of any two gene nodes was no larger than the cutoff k (default cutoff k = 3). Finally, a total of 300 pathways and 1773 subpathways were included in the study.

Methods of comprehensive and integrated analyses

Comprehensive and integrated analyses were performed at levels of gene, subpathway and pathway by using various combinations of training sets, which consisted of 5, 6 or 7 training datasets. An example of the procedure for a combination of 5 datasets at the level of subpathway is shown in Fig. 1. The activities of each pathway and subpathway were evaluated by using a method of FAIME with modification [21].

Firstly, all the expressed genes (N_g) from each sample were ranked in a descending order according to their expression levels, and the exponential decreasing weights (w) were calculated for the ordered genes (w_{g, s}) by using Formula (1) as follows:

$$ {w}_{g,s}=\left({r}_{g,s}\right)\bullet \left({e}^{\frac{r_{g,s}}{\mid N\mid }}\right) $$

(1)

where r_{g, s} is the rank for gene g in sample s, and ∣N∣, the total number of genes in the sample. For analyzing the subpathway graph i, a component-set subpath_i indicates that it satisfies component ∈ subpath_i and N/subpath_i, all the other components not included in the subpathway graph i. The score of subpathway i activity ($ sP{A}_{subpat{h}_{i,s}} $) was calculated by using Formula (2) as follows:

$$ sP{A}_{subpat{h}_{i,s}}=\frac{1}{\mid subpat{h}_i\mid}\sum \limits_{g\in subpat{h}_i}\left({w}_{g,s}\right)-\frac{1}{\mid N/ subpat{h}_i\mid}\sum \limits_{g\in N/ subpat{h}_i}\left({w}_{g,s}\right) $$

(2)

The normalized score of subpathway activity ($ sPA\_ nor{m}_{subpat{h}_{i,s}} $) was calculated by using Formula (3) as follows:

$$ sPA\_ nor{m}_{subpat{h}_{i,s}=}\frac{sP{A}_{subpat{h}_{i,s}}-\overline{sP{A}_{subpat{h}_i}}}{S\left( sP{A}_{subpat{h}_i}\right)} $$

(3)

where $ \overline{sP{A}_{subpat{h}_i}} $ is the mean score of subpathway activity in all analyzed samples, and $ S\left( sP{A}_{subpat{h}_i}\right) $, standard deviation.

The P-value of each subpathway in each training set was calculated by using a univariate Cox, and an integrated prognostic score (ipScore) of each subpathway in various combinations of training sets was calculated by using Formula (4) as follows:

$$ ipScore=\frac{\sum \limits_i^N-\log 10\left(P- valu{e}_i\right)}{N} $$

(4)

where N (=5, 6 or 7) is the number of training sets in combinations.

The ipScore for each pathway was calculated and normalized to form the pathway activity matrix by using the same method as described above. And for gene level analysis, a univariate cox was performed based on the gene expression level, and the ipScore for each gene was calculated according to the Formula (4).

Meta-analyses

The software STATA (version 14) was employed to evaluate hazard ratio (HR), and a funnel plot, the publication bias. Heterogeneity was assessed by using the I² statistic according to the Cochrane handbook for systematic reviews of interventions and I² > 50% indicates the existence of substantial heterogeneity. A fixed-effect model was used to summarize the results when I² < 50%, otherwise, a random-effect model was used. The possible source of heterogeneity was evaluated by sensitive analysis.

Statistical analyses

A univariate Cox method was employed to evaluate the correlation between the signature and the survival. A K-mean clustering method (K = 2) was performed for analyzing multiple variable signatures. A log-rank test was used to compare the difference in survival between the two groups. Statistical analyses for hierarchical cluster, spearman correlation, hypergeometric test, wilcoxon rank sum test and Cox proportional hazards were performed by using an R software package (Version 3.1.0). P-value < 0.05 is considered statistically significant.

Results

Excavation of gene, subpathway and pathway signatures

An ipScore of each biomarker was calculated in 29 different combinations (each contained 5, 6 or 7 datasets) of the 7 training sets. Based on their ranks, top 30 signatures obtained from each combination were regrouped into serial 28 sets, which contained 3–30 signatures (1st-3rd, 1st-4th, 1st-5th … ...1st-30th), respectively, and were further assessed in the 4 test sets (Fig. 2a). A high heterogeneity existed among different combinations, but functional signatures showed a higher robustness than gene signatures (Fig. 2a).

The appearing frequency of each signature from each combination was counted in the other 28 combinations at levels of gene, subpathway and pathway (Analytical data File 1–3). Top counted signatures showed cumulative effects as assessed in the test sets, the predictive capacity became more robust when the number of signatures was ≥10 (Supplementary Figure S3).

Based on their appearing frequency (cut-off ≥20) in all the 29 combinations, 9 genes (Fig. 2b), 15 subpathways (Fig. 2c) and 22 pathways (Fig. 2d) were selected as candidate signatures, whose relevance with PDAC was further examined by using datasets of 3000 cancer-related and 250 PDAC-related genes derived from Genetic Association Database (GAD). None of 9 gene signatures are PDAC-related (Fig. 2b), in accordance with their poor predictive ability (Fig. 2a). By analyzing the biological functions, commonalities, intersection points and their subsidiary relationship, we chose three pairs (path:00980_2/path:00980, path:00982_1/path:00982 and path:00477_1/path:00477) for further analyses because they were shown to be associated with cancer and/or PDAC at both pathway and subpathway levels (Fig. 2c and d).

Identification of the path:00982_1 subpathway signature

We next analyzed whether these three pairs shared common genes at the levels of subpathway and pathway. As shown in Fig. 3a, the path:00980_2 subpathway covered all the genes of path:00982_1 subpathway, while path:00980 pathway covered most genes (65/73, 89.04%) of path:00982 pathway; but neither path:00477_1 subpathway nor path:04711 pathway shared any genes with the other two pairs. The prognostic capacity of each signature was analyzed in each dataset by using a univariate Cox analysis, which showed that the path:00980_2/path:00980 and path:00982_1/path:00982 signatures had higher predictive capacities than path:00477_1/path:00477 signatures (Fig. 3b). Based on the above comprehensive and integrated analyses and considering the number of genes and the overall predictive capacity, we finally selected the path:00982_1 signature (Supplementary Figure S4 and Supplementary Figure S2) for further analysis.

The path:00982_1 signature is a protective signature for PDAC

Meta-analyses showed that the path:00982_1 signature was a significantly protective factor for PDAC with an overall pooled HR of 0.82 (95% confidence interval [CI] 0.77, 0.89; p < 0.001) (Fig. 4a). The funnel was generally symmetrical without obvious publication biases, indicating the results of meta-analyses were credible (Fig. 4b). However, the heterogeneity (I² = 71.4%) was high as analyzed by using sensitivity analyses. The overall pooled estimate could be reduced by excluding GSE79668 and TCGA datasets, and in particular, exclusion of TCGA dataset made the overall pooled estimate even closer to the lower CI limit (Fig. 4c). After further investigating the detailed techniques employed for generating gene expression profiles of each dataset, we found that RNA-seq (RNA sequencing) techniques were used in GSE79668 and TCGA datasets. We thus classified all the datasets into microarray and RNA-seq subgroups. By using meta-analyses, we found that the microarray subgroup had an overall pooled HR of 0.71 (95% CI 0.61, 0.83; p = 0.053) and an I² of 51.8%, indicating a low heterogeneity; however, the RNA-seq subgroup had an overall pooled HR of 0.93 (95% CI 0.74, 1.16; p = 0.007) and an I² of 75.4%, indicating a high heterogeneity in this subgroup (Fig. 4d). Based on the above results, we postulate that the heterogeneity is caused by RNA-seq techniques.

In addition, by using multivariate Cox analyses, we found that the path:00982_1 signature was an independent predictive factor as examined in all the available datasets, which included clinical information of age, gender, ethnicity, lymph nodes, grade, maximum tumor dimension, TNM (tumor, lymph nodes & metastasis) stages, N classification, molecular subtype (classical and basal) and history of diabetes [27,28,29] (Analytical data File 4).

The path:00982_1 signature displays a higher prognostic capacity for the classical subtype

By adopting a published classification [30], we stratified PDAC patients into classical, quasi-mesenchymal (QM-PDA) and exocrine-like subtypes. Except for GSE57495 and GSE79668 datasets (Supplementary Figure S5), PDAC patients could be classified into three subtypes in the other 9 datasets (Fig. 5). The path:00982_1 signature demonstrated a significant predictive capacity for the classical subtype in 7 datasets (exclusive of GSE71729 and TCGA) (Fig. 5). By using another classification [31], we stratified PDAC patients into classical, basal-like and “others” subtypes. The path:00982_1 signature demonstrated a significant predictive capacity for the classical subtype in GSE21501, GSE28735, GSE62452 and ICGC.CA.seq datasets (Supplementary Figure S6).

The path:00982_1 signature appears to be associated with the efficacy of chemotherapy for PDAC

The path:00982_1 subpathway belongs to the drug metabolism-cytochrome P450 (CYP) pathway, which is responsible for drug response and the survival of PDAC patients [32,33,34]. We therefore explored its intervention with anti-PDAC drugs contained in the standard chemotherapeutic regimens FOLFIRINOX (folinic acid-fluorouracil-irinotecan-oxaliplatin) and gemcitabine plus nab-paclitaxel [35, 36]. Since the data of oxaliplatin and nab-paclitaxel were unavailable, we were only able to analyze the data of half maximal inhibitory concentration (IC₅₀) of irinotecan, gemcitabine, cisplatin (belonging to platinum-based drugs as oxaliplatin) and 5-fluorouracil in PDAC cell lines derived from CCLE and GDSC databases (Analytical data File 5 and 6). A negative correlation was found between the IC₅₀ of each drug and the activity of path:00982_1 subpathway in PDAC cells of classical subtype though it was moderate possibly because of small number of samples (Fig. 6a). We next employed a permutation analysis, in which the same number of samples of classical subtype were randomly selected from total samples, and the correlation between the path:00982_1 activity and the IC₅₀ of each drug was calculated for 10,000 times. The number of times (N) was counted when the correlation value was less than the real correlation value, and P-value was calculated by using a formula (N/10000). The results indicated that the real correlation for irinotecan and gemcitabine was significant (P = 0.0248 and P = 0.0265, respectively) but not for cisplatin or 5-fluorouracil (P = 0.1546 and P = 0.0934, respectively) in PDAC cells of classical subtype.

We next searched for available clinical chemotherapy data in all the datasets and were only able to extract 63 and 50 cases from the ICGC.CA.seq and TCGA datasets, respectively. Patients were classified into subgroups depending on tumor responses, complete response (CR), partial response (PR), stable disease (SD) and progressive disease (PD). As shown in Fig. 6b, the path:00982_1 activity in CR + SD subgroups was significantly higher than that in PD subgroup extracted from ICGC.CA.seq dataset, in which patients received the first-line chemotherapy. The path:00982_1 activity was slightly higher in SD + PR subgroups than PD subgroup, in which patients received the second-line chemotherapy, but the difference did not reach significance (Fig. 6b). Among cases extracted from TCGA dataset, the difference in path:00982_1 activity between PD and CR subgroups receiving gemcitabine or between PD and CR + PR + SD subgroups receiving gemcitabine/FOLFIRINOX was not significant (Fig. 6c). Because the number of samples that contained intact data was too small, we were unable to stratify these patients into molecular subtypes for further analysis.

Discussion

Here we report a functional path:00982_1 subpathway signature, which displays a robust and significant capacity in predicting survival, drug response and chemotherapeutic efficacy of PDAC, particularly those of classical subtype, accounting for 48.5% of all PDAC subtypes (Fig. 5). To our knowledge, this may be the first functional signature identified from a systematic study of the largest number of PDAC datasets involving comprehensive and integrated analyses with FAIME.

The TNM staging system is a globally recognized standard for classifying the extent of spread of cancer and is widely accepted for predicting the prognosis and guiding treatment options for PDAC. The N classification of the 8th Edition of the American Joint Committee on Cancer (AJCC) scheme for PDAC, particularly recently proposed LNR (lymph node ratio)-based N classification for respectable PDAC, has been shown to more accurately predict patient response [27,28,29]. Therefore, we employed a multivariate Cox analysis to compare the predicting power of our signature with this clinical system. As shown in Analytical data File 4, only N classification in GSE21501 dataset, TNM staging in GSE57495 dataset and number of positive lymph nodes in TCGA dataset were significantly correlated with the prognosis. The results indicate that this clinical system needs further optimization in predicting the prognosis of PDAC, in accordance with a previous study [30]. In comparison, the path:00982_1 signature displayed a significant predictive capacity in 5 datasets (P < 0.05) and a marginally significant predictive ability in 2 datasets (0.1 < P > 0.05) in this analysis (Analytical data File 4).

Until now, 11 studies on the identification of gene signatures in PDAC have been published [6,7,8,9,10,11,12,13,14,15,16]. By using multivariate Cox methods, we retrospectively analyzed the predictive capacity of these signatures in the present 11 datasets, in comparison with the path:00982_1 signature. The results showed that the path:00982_1 signature was more robust than any of the published gene signatures (Supplementary Table S3). For instance, the most powerful signature reported by Haider, et al. [8] among all the published gene signatures was shown to be significant in 6 datasets, while the path:00982_1 signature was significant in 7 datasets. In addition, our study has used 7 training sets and 4 test sets, while maximal 4 datasets including only one test set were used in any of the above 11 published studies. More advantageously the path:00982_1 signature was verified for different PDAC molecular subtypes and further validated in cell line and clinical datasets with chemotherapeutic efficacy [35, 36].

Chemotherapy plays an important role in the management of PDAC because of its aggressive nature and being diagnosed at an advanced stage [36, 37]. The path:00982_1 subpathway is located at the downstream of CYPs (Supplementary Figure S7), which constitute a large enzyme family that account for about 75% of the total drug metabolism [38]. Therefore, we analyzed the correlation of clinically used anti-PDAC chemotherapeutic drugs and the activity of path:00982_1 subpathway in available cell line and clinical datasets. The correlation between IC₅₀ of each drug (irinotecan, gemcitabine, cisplatin and 5-fluorouracil) and path:00982_1 activity was only moderate though the correlation was higher in classical subtype than in the overall samples. To further analyze the data, we adopted a permutation analysis, which confirmed that the real correlation for irinotecan and gemcitabine was significant but not for cisplatin or 5-fluorouracil in PDAC cells of classical subtype. We next analyzed the path:00982_1 activity in PDAC patients, who were classified based on tumor response to chemotherapy. A significant result was found in patients receiving the first-line chemotherapy but not the second-line chemotherapy in the ICGC.CA.seq dataset, and not in TCGA dataset as well. Because that the path:00982_1 subpathway is composed of four groups of enzymes (Supplementary Table S2) [25], we further studied these enzymes and tried to seek the association with the above chemotherapeutic drugs. CYP2A6 participates in the metabolism of fluorouracil and CYP3A4 is involved with irinotecan pharmacokinetics [39], CYP2A6 is associated with the efficacy of SOX (S-1 plus oxaliplatin) regimen [40], and CYP4F2 partakes in the metabolism of gemcitabine [41]. However, majority of molecules in this subpathway are unable individually to exhibit a significant predictive ability (Analytical Data File 7) by using a univariate Cox method to evaluate the correlation between each gene and the survival of PDAC patients. The unexpected results may imply that this functional signature should be treated as an integrated enzyme complex, in which the 31 enzymes interact each other and work jointly to generate a biological function. The present results also emphasize the necessity of exploring functional signatures, rather than individual genes for PDAC with a high degree of heterogeneity [17, 18]. However, the role of path:00982_1 subpathway in the metabolism of anti-PDAC chemotherapeutic drugs, particularly its biological interpretation, requires further investigation.

The present study has several limitations, which need to be coped in the future. One is that the identified signature has not been verified in low-throughput experiments and the key nodes involved in this subpathway signature needs to be further mined. Another limitation is that the patients were not stratified into PDAC subtypes due to the small number of samples in available clinical datasets that contained intact profiles of chemotherapy efficacy, survival and gene expression, which may be the reason why the correlation between path:00982_1 activity and tumor response to chemotherapy was not shown to be significant. Finally, the predictive ability of this signature was not exhibited in 4 out of 11 datasets possibly because of a high inter-study heterogeneity resulting from the sample processing, diverse molecular subtyping and particularly RNA-seq techniques. The results also suggest that FAIME may not be suitable for analyzing those datasets when the gene expression profiles were generated by using RNA-seq techniques possibly because that FAIME was developed for microarray expression profiles.

Conclusion

In summary, the present study has identified a novel robust functional signature, which displays a more powerful capacity in predicting the survival and chemotherapy response for patients with PDAC of classical subtype than the published gene signatures and the TNM staging system. This discovery may have an impact to some extent on clinical PDAC practice in the future in three aspects. Firstly, PDAC patients, particularly those of classical subtype, could be selected based on the activity of this subpathway so that chemotherapeutic regimens would be precisely and effectively targeted to those with higher path:00982_1 subpathway activity. Secondly, the signature could be used to improve the current systems for predicting the prognosis and monitoring drug response. Finally, interventions that increase the activity of this subpathway may be applied together with anti-PDAC drugs so that the efficacy of current chemotherapy may be improved. However, the present study has several limitations as mentioned above, and the involvement of path:00982_1 subpathway in the metabolism of anti-PDAC chemotherapeutic drugs, particularly its biological interpretation, requires further investigation.

Availability of data and materials

The data generated are included in the manuscript and supplementary data. Analytical data including 7 files (Named File 1–7) are available at Mendeley Data (DOI: https://doi.org/10.17632/987jp9w76f.1).

Abbreviations

CCLE:: Cancer Cell Line Encyclopedia
FAIME:: Functional Analysis of Individual Microarray Expression
GAD:: Genetic Association Database
GDSC:: Genomics of Drug Sensitivity in Cancer
GEO:: Gene Expression Omnibus
ICGC:: International Cancer Genome Consortium
KEGG:: Kyoto Encyclopedia of Genes and Genomes
PDAC:: Pancreatic ductal adenocarcinoma
TCGA:: The Cancer Genome Atlas

References

Ryan DP, Hong TS, Bardeesy N. Pancreatic adenocarcinoma. N Engl J Med. 2014;371:2140–1.
Article Google Scholar
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. 2018;68:7–30.
Article Google Scholar
American Cancer Society. Cancer Facts and Figures 2019. Atlanta: American Cancer Society; 2019.
Google Scholar
Moffat GT, Epstein AS, O'Reilly EM. Pancreatic cancer-a disease in need: optimizing and integrating supportive care. Cancer. 2019;125:3927–35.
Article Google Scholar
The Cancer Genome Atlas Research Network. Integrated Genomic Characterization of Pancreatic Ductal Adenocarcinoma. Cancer Cell. 2017;32:185–203.e113.
Birnbaum DJ, Finetti P, Lopresti A, Gilabert M, Poizat F, Raoul JL, Delpero JR, Moutardier V, Birnbaum D, Mamessier E, Bertucci F. A 25-gene classifier predicts overall survival in resectable pancreatic cancer. BMC Med. 2017;15:170.
Article Google Scholar
Chen DT, Davis-Yadley AH, Huang PY, Husain K, Centeno BA, Permuth-Wey J, Pimiento JM, Malafa M. Prognostic fifteen-gene signature for early stage pancreatic ductal adenocarcinoma. PLoS One. 2015;10:e0133562.
Article Google Scholar
Haider S, Wang J, Nagano A, Desai A, Arumugam P, Dumartin L, Fitzgibbon J, Hagemann T, Marshall JF, Kocher HM, et al. A multi-gene signature predicts outcome in patients with pancreatic ductal adenocarcinoma. Genome Med. 2014;6:105.
Article Google Scholar
Kirby MK, Ramaker RC, Gertz J, Davis NS, Johnston BE, Oliver PG, Sexton KC, Greeno EW, Christein JD, Heslin MJ, et al. RNA sequencing of pancreatic adenocarcinoma tumors yields novel expression patterns associated with long-term survival and reveals a role for ANGPTL4. Mol Oncol. 2016;10:1169–82.
Article CAS Google Scholar
Ma S, Kosorok MR, Huang J, Dai Y. Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis. BMC Med Genet. 2011;4:5.
Google Scholar
Newhook TE, Blais EM, Lindberg JM, Adair SJ, Xin W, Lee JK, Papin JA, Parsons JT, Bauer TW. A thirteen-gene expression signature predicts survival of patients with pancreatic cancer and identifies new genes of interest. PLoS One. 2014;9:e105631.
Article Google Scholar
Raman P, Maddipati R, Lim KH, Tozeren A. Pancreatic cancer survival analysis defines a signature that predicts outcome. PLoS One. 2018;13:e0201751.
Article Google Scholar
Shi G, Zhang J, Lu Z, Liu D, Wu Y, Wu P, Yin J, Yuan H, Zhu Q, Chen L, et al. A novel messenger RNA signature as a prognostic biomarker for predicting relapse in pancreatic ductal adenocarcinoma. Oncotarget. 2017;8:110849–60.
PubMed PubMed Central Google Scholar
Shi XH, Li X, Zhang H, He RZ, Zhao Y, Zhou M, Pan ST, Zhao CL, Feng YC, Wang M, et al. A five-microRNA signature for survival prognosis in pancreatic adenocarcinoma based on TCGA data. Sci Rep. 2018;8:7638.
Article Google Scholar
Stratford JK, Bentrem DJ, Anderson JM, Fan C, Volmar KA, Marron JS, Routh ED, Caskey LS, Samuel JC, Der CJ, et al. A six-gene signature predicts survival of patients with localized pancreatic ductal adenocarcinoma. PLoS Med. 2010;7:e1000307.
Article Google Scholar
Wang WY, Hsu CC, Wang TY, Li CR, Hou YC, Chu JM, Lee CT, Liu MS, Su JJ, Jian KY, et al. A gene expression signature of epithelial tubulogenesis and a role for ASPM in pancreatic tumor progression. Gastroenterology. 2013;145:1110–20.
Article CAS Google Scholar
Makohon-Moore AP, Zhang M, Reiter JG, Bozic I, Allen B, Kundu D, Chatterjee K, Wong F, Jiao Y, Kohutek ZA, et al. Limited heterogeneity of known driver gene mutations among the metastases of individual patients with pancreatic cancer. Nat Genet. 2017;49:358–66.
Article CAS Google Scholar
Waddell N, Pajic M, Patch AM, Chang DK, Kassahn KS, Bailey P, Johns AL, Miller D, Nones K, Quek K, et al. Whole genomes redefine the mutational landscape of pancreatic cancer. Nature. 2015;518:495–501.
Article CAS Google Scholar
Jones S, Zhang X, Parsons DW, Lin JC, Leary RJ, Angenendt P, Mankoo P, Carter H, Kamiyama H, Jimeno A, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–6.
Article CAS Google Scholar
Chang YH, Chen CM, Chen HY, Yang PC. Pathway-based gene signatures predicting clinical outcome of lung adenocarcinoma. Sci Rep. 2015;5:10979.
Article CAS Google Scholar
Yang X, Regan K, Huang Y, Zhang Q, Li J, Seiwert TY, Cohen EE, Xing HR, Lussier YA. Single sample expression-anchored mechanisms predict survival in head and neck cancer. PLoS Comput Biol. 2012;8:e1002350.
Article CAS Google Scholar
Zhang C, Li C, Xu Y, Feng L, Shang D, Yang X, Han J, Sun Z, Li Y, Li X. Integrative analysis of lung development-cancer expression associations reveals the roles of signatures with inverse expression patterns. Mol BioSyst. 2015;11:1271–84.
Article CAS Google Scholar
Zhang CL, Xu YJ, Yang HX, Xu YQ, Shang DS, Wu T, Zhang YP, Li X. sPAGM: inferring subpathway activity by integrating gene and miRNA expression-robust functional signature identification for melanoma prognoses. Sci Rep. 2017;7:15322.
Article Google Scholar
Zhang Y, Zhang X, Shi J, Tuorto F, Li X, Liu Y, Liebers R, Zhang L, Qu Y, Qian J, et al. Dnmt2 mediates intergenerational transmission of paternally acquired metabolic disorders through sperm small non-coding RNAs. Nat Cell Biol. 2018;20:535–40.
Article CAS Google Scholar
Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Article CAS Google Scholar
Li C, Li X, Miao Y, Wang Q, Jiang W, Xu C, Li J, Han J, Zhang F, Gong B, Xu L. SubpathwayMiner: a software package for flexible identification of pathways. Nucleic Acids Res. 2009;37:e131.
Article Google Scholar
Allen PJ, Kuk D, Castillo CF, Basturk O, Wolfgang CL, Cameron JL, Lillemoe KD, Ferrone CR, Morales-Oyarvide V, He J, et al. Multi-institutional validation study of the American joint commission on Cancer (8th edition) changes for T and N staging in patients with pancreatic adenocarcinoma. Ann Surg. 2017;265:185–91.
Article Google Scholar
van Roessel S, Kasumova GG, Verheij J, Najarian RM, Maggino L, de Pastena M, Malleo G, Marchegiani G, Salvia R, Ng SC, et al. International Validation of the Eighth Edition of the American Joint Committee on Cancer (AJCC) TNM Staging System in Patients With Resected Pancreatic Cancer. JAMA Surg. 2018;153:e183617.
Article Google Scholar
Li HJ, Chen YT, Yuan SQ. Proposal of a modified American joint committee on Cancer staging scheme for resectable pancreatic ductal adenocarcinoma with a lymph node ratio-based N classification: a retrospective cohort study. Medicine (Baltimore). 2018;97:e12094.
Article Google Scholar
Collisson EA, Sadanandam A, Olson P, Gibb WJ, Truitt M, Gu S, Cooc J, Weinkle J, Kim GE, Jakkula L, et al. Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy. Nat Med. 2011;17:500–3.
Article CAS Google Scholar
Moffitt RA, Marayati R, Flate EL, Volmar KE, Loeza SG, Hoadley KA, Rashid NU, Williams LA, Eaton SC, Chung AH, et al. Virtual microdissection identifies distinct tumor- and stroma-specific subtypes of pancreatic ductal adenocarcinoma. Nat Genet. 2015;47:1168–78.
Article CAS Google Scholar
Ashida R, Okamura Y, Ohshima K, Kakuda Y, Uesaka K, Sugiura T, Ito T, Yamamoto Y, Sugino T, Urakami K, et al. The down-regulation of the CYP2C19 gene is associated with aggressive tumor potential and the poorer recurrence-free survival of hepatocellular carcinoma. Oncotarget. 2018;9:22058–68.
Article Google Scholar
Gandhi AV, Saxena S, Relles D, Sarosiek K, Kang CY, Chipitsyna G, Sendecki JA, Yeo CJ, Arafat HA. Differential expression of cytochrome P450 omega-hydroxylase isoforms and their association with clinicopathological features in pancreatic ductal adenocarcinoma. Ann Surg Oncol. 2013;20(Suppl 3):S636–43.
Article Google Scholar
Noll EM, Eisen C, Stenzinger A, Espinet E, Muckenhuber A, Klein C, Vogel V, Klaus B, Nadler W, Rosli C, et al. CYP3A5 mediates basal and acquired therapy resistance in different subtypes of pancreatic ductal adenocarcinoma. Nat Med. 2016;22:278–87.
Article CAS Google Scholar
Chiaravalli M, Reni M, O'Reilly EM. Pancreatic ductal adenocarcinoma: state-of-the-art 2017 and new therapeutic strategies. Cancer Treat Rev. 2017;60:32–43.
Article Google Scholar
Garrido-Laguna I, Hidalgo M. Pancreatic cancer: from state-of-the-art treatments to promising novel therapies. Nat Rev Clin Oncol. 2015;12:319–34.
Article CAS Google Scholar
Kleeff J, Korc M, Apte M, La Vecchia C, Johnson CD, Biankin AV, Neale RE, Tempero M, Tuveson DA, Hruban RH, Neoptolemos JP. Pancreatic cancer. Nat Rev Dis Primers. 2016;2:16022.
Article Google Scholar
Guengerich FP. Cytochrome p450 and chemical toxicology. Chem Res Toxicol. 2008;21:70–83.
Article CAS Google Scholar
Mathijssen RH, de Jong FA, van Schaik RH, Lepper ER, Friberg LE, Rietveld T, de Bruijn P, Graveland WJ, Figg WD, Verweij J, Sparreboom A. Prediction of irinotecan pharmacokinetics by use of cytochrome P450 3A4 phenotyping probes. J Natl Cancer Inst. 2004;96:1585–92.
Article CAS Google Scholar
Yang L, Zou S, Shu C, Song Y, Sun YK, Zhang W, Zhou A, Yuan X, Yang Y, Hu S. CYP2A6 polymorphisms associate with outcomes of S-1 plus Oxaliplatin chemotherapy in Chinese gastric Cancer patients. Genomics Proteomics Bioinformatics. 2017;15:255–62.
Article Google Scholar
Wang Y, Li Y, Lu J, Qi H, Cheng I, Zhang H. Involvement of CYP4F2 in the metabolism of a novel monophosphate Ester Prodrug of gemcitabine and its interaction potential in vitro. Molecules. 2018;23. https://doi.org/10.3390/molecules23051195..

Download references

Acknowledgements

Not applicable.

Funding

This research was partially supported by National Key Research and Development Program of China (2017YFC1308602), National Natural Scientific Foundation of China (31701145, 81472321, 81703141, 81401975 and 81703055), and Heilongjiang Provincial Department of Science and Technology in China (GX18C010).

Author information

Ping Wang and Chunlong Zhang contributed equally to this work.

Authors and Affiliations

The Hepatosplenic Surgery Center, the First Affiliated Hospital of Harbin Medical University, Harbin, 150001, China
Ping Wang, Weidong Li, Bo Zhai, Xian Jiang, Hongchi Jiang & Xueying Sun
Department of Interventional Radiology, the Third Affiliated Hospital of Harbin Medical University, Harbin, 150086, China
Ping Wang
College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
Chunlong Zhang
Department of General Surgery, the Fourth Affiliated Hospital of Harbin Medical University, Harbin, 150001, China
Weidong Li & Bo Zhai
Department of Molecular Medicine & Pathology, Faculty of Medical and Health Sciences, the University of Auckland, Auckland, 1142, New Zealand
Shiva Reddy

Authors

Ping Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chunlong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Weidong Li
View author publications
You can also search for this author in PubMed Google Scholar
Bo Zhai
View author publications
You can also search for this author in PubMed Google Scholar
Xian Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Shiva Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Hongchi Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Xueying Sun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XS designed and supervised the study. PW, WL and BZ analyzed and interpreted the data. CZ performed the bioinformatics analyses and was a major contributor in writing the manuscript. XJ, SR and HJ performed the biological evaluation. PW, CZ and XS wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xueying Sun.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Descriptive summary of datasets used in the study. Table S2. Genes of the 00982_1 subpathway. Table S3. Predictive power of published gene signatures. Figure S1. Dataset Search strategy in GEO database. Figure S2. Flow diagram of dataset selection strategies. Figure S3. Accumulative predictive abilities of signatures. Figure S4. Genes in the path:00982_1 subpathway. Figure S5. Collision classification for GSE57495 and GSE79668 datasets. Figure S6. Prognostic capacity of path:00982_1 signature for classical subtype. by Moffitt classification. Figure S7. Association of path:00982_1 subpathway with other pathways.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Wang, P., Zhang, C., Li, W. et al. Identification of a robust functional subpathway signature for pancreatic ductal adenocarcinoma by comprehensive and integrated analyses. Cell Commun Signal 18, 34 (2020). https://doi.org/10.1186/s12964-020-0522-4

Download citation

Received: 14 November 2019
Accepted: 29 January 2020
Published: 02 March 2020
DOI: https://doi.org/10.1186/s12964-020-0522-4

Identification of a robust functional subpathway signature for pancreatic ductal adenocarcinoma by comprehensive and integrated analyses

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

A multi-gene signature predicts outcome in patients with pancreatic ductal adenocarcinoma

Integrative analysis of gene expression profiles reveals specific signaling pathways associated with pancreatic duct adenocarcinoma

Open-source curation of a pancreatic ductal adenocarcinoma gene expression analysis platform (pdacR) supports a two-subtype model

Background

Materials and methods

Datasets

Resources of pathways and subpathways

Methods of comprehensive and integrated analyses

Meta-analyses

Statistical analyses

Results

Excavation of gene, subpathway and pathway signatures

Identification of the path:00982_1 subpathway signature

The path:00982_1 signature is a protective signature for PDAC

The path:00982_1 signature displays a higher prognostic capacity for the classical subtype

The path:00982_1 signature appears to be associated with the efficacy of chemotherapy for PDAC

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary information

Additional file 1: Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation