Co-expression analysis of pancreatic cancer proteome reveals biology and prognostic biomarkers

Mantini, G.; Vallés, A. M.; Le Large, T. Y. S.; Capula, M.; Funel, N.; Pham, T. V.; Piersma, S. R.; Kazemier, G.; Bijlsma, M. F.; Giovannetti, E.; Jimenez, C. R.

doi:10.1007/s13402-020-00548-y

Original paper
Open access
Published: 29 August 2020

Volume 43, pages 1147–1159, (2020)
Cellular Oncology


G. Mantini ORCID: orcid.org/0000-0002-2169-5468^1,2,
A. M. Vallés¹^na1,
T. Y. S. Le Large^1,3,4^na1,
M. Capula²,
N. Funel⁵,
T. V. Pham¹,
S. R. Piersma¹,
G. Kazemier⁴,
M. F. Bijlsma^5,6,
E. Giovannetti ORCID: orcid.org/0000-0002-7565-7504^1,2 &
…
C. R. Jimenez¹

Abstract

Purpose

Despite extensive biological and clinical studies, including comprehensive genomic and transcriptomic profiling efforts, pancreatic ductal adenocarcinoma (PDAC) remains a devastating disease, with a poor survival and limited therapeutic options. The goal of this study was to assess co-expressed PDAC proteins and their associations with biological pathways and clinical parameters.

Methods

Correlation network analysis is emerging as a powerful approach to infer tumor biology from omics data and to prioritize candidate genes as biomarkers or drug targets. In this study, we applied a weighted gene co-expression network analysis (WGCNA) to the proteome of 20 surgically resected PDAC specimens (PXD015744) and confirmed its clinical value in 82 independent primary cases.

Results

Using WGCNA, we obtained twelve co-expressed clusters with a distinct biology. Notably, we found that one module enriched for metabolic processes and epithelial-mesenchymal-transition (EMT) was significantly associated with overall survival (p = 0.01) and disease-free survival (p = 0.03). The prognostic value of three proteins (SPTBN1, KHSRP and PYGL) belonging to this module was confirmed using immunohistochemistry in a cohort of 82 independent resected patients. Risk score evaluation of the prognostic signature confirmed its association with overall survival in multivariate analyses. Finally, immunofluorescence analysis confirmed co-expression of SPTBN1 and KHSRP in Hs766t PDAC cells.

Conclusions

Our WGCNA analysis revealed a PDAC module enriched for metabolic and EMT-associated processes. In addition, we found that three of the proteins involved were associated with PDAC survival.

1 Introduction

Pancreatic ductal adenocarcinoma (PDAC) is the most common tumor type of the pancreas with a five-year survival rate not exceeding 8% [1]. A lack of reliable markers for early diagnosis, as well as its aggressive metastatic spread are the main causes of this extremely poor survival rate [2, 3]. The development of next-generation sequencing (NGS) has enabled detailed analyses of genomic aberrations and dysregulated gene expression patterns that underlie tumor development and progression, with KRAS, TP53, CDKN2A and SMAD4 as major oncogenic drivers of this disease. As yet, a comprehensive proteomic analysis of clinical PDAC samples is missing.

In recent years, multiple statistical methods and freely available bioinformatics tools have been developed that can extrapolate important features from high-throughput data, e.g. pinpointing genes associated with clinical parameters such as cancer status or patient survival [4]. In this context, networks based on co-expression data [5] have extensively been used to identify densely interconnected genes associated with phenotypic traits. Most of the available algorithms have been applied to microarray- and RNAseq-based expression data [6, 7]. Using these approaches Tang et al. [8], for example, identified new prognostic markers in breast cancer. Additionally, these approaches have been used to search for potential therapeutic targets in small-cell lung carcinoma [9]. More recently, an integrative analysis of co-expression networks from proteomics and transcriptomics data in Alzheimer’s disease revealed protein-specific networks in both asymptomatic and symptomatic patients [10, 11].

Weighted gene co-expression network analysis (WGCNA) assumes that the strength of node-to-node connections is best quantified by measures derived from their correlations. In co-expression networks for biological data, we refer to nodes as “genes” or “proteins”. A glossary of network-related terms is reported in Table 1. Constructing co-expression networks is an effective way to characterize correlation patterns among nodes and to infer new biological functions of groups of densely interconnected nodes called “modules”. Modules can be related to external sample traits such as patient survival, recurrence and disease/health state, in order to discover biomarkers or therapeutic targets. Each module is summarized by its Module Eigenprotein (ME), the most representative vector of values for that module (with a size of 1 × 20 in the current cohort), and it is this vector that is related to the external sample traits. In summary, the goals of a WGCNA analysis are: (i) establishment of real associations between proteins (instead of associations based on previous findings), (ii) identification of pathways specific for the dataset under analysis, (iii) association of modules with external information in order to identify biologically meaningful modules and (iv) identification of key drivers in relevant modules that may serve as candidate biomarkers and/or therapeutic targets.

Table 1 Glossary of network-related terms

Cancer proteomics aims to uncover the molecular basis of this devastating disease and to elucidate associated pathway features that cannot be detected by transcriptomics analyses. In this study, we report a PDAC proteomics analysis based on mass spectrometry (MS) data coupled to WGCNA to define networks of highly correlated proteins with specific functions associated with patient prognosis. We show that one module strongly features metabolic pathways and mesenchymal (EMT) signatures. This co-expression module was found to be significantly associated with disease-free survival (DFS) and overall survival (OS). The prognostic value of three key proteins in this module was validated in an independent cohort of 82 patients. These three proteins individually or in combination were able to predict patient survival.

2 Material and methods

2.1 Patient samples

Approval from the Local Medical Ethical committee at the VU University Medical Center was received (#14038). All patients provided informed consent for tissue sampling, clinical data analysis, and molecular analysis. Snap-frozen tumor samples from 39 patients included between January 2014 and November 2015 were evaluated by the Department of Pathology (Amsterdam UMC, Amsterdam). After pathological revision, 20 samples were eligible for further analysis. A minimum of 5–10% tumor surface was needed for further processing in this study. Clinical parameters were collected prospectively, whereas OS and DFS data were obtained from electronic patient records. One patient was censored for OS analysis, since this patient succumbed to complications after surgery, defined as mortality within 60 days after surgery.

2.2 Protein isolation from bulk tumor tissue and sample preparation for mass spectrometry

Protein isolation was performed as previously described [12]. Briefly, protein lysates were separated on pre-cast 4%–12% gradient gels using the NuPAGE SDS-PAGE system (Invitrogen, Carlsbad, CA, USA). Gels were fixed in 50% ethanol/3% phosphoric acid solution, stained with Coomassie brilliant blue G-250 and then washed and dehydrated in 50 mM ammonium bicarbonate (ABC) once and 50 mM ABC/50% acetonitrile (ACN) twice. Gel lanes were cut into five bands, with each band sliced further into approximately 1 mm³ cubes. The gel cubes were washed and dehydrated once in 50 mM ABC and twice in 50 mM ABC/50% ACN. Subsequently, the gel cubes were reduced in 10 mM DTT/50 mM ABC at 56 °C for 1 h, after which the supernatants were removed and the gel cubes were alkylated in 50 mM iodoacetamide/50 mM ABC for 45 min at room temperature in the dark. Next, the gel cubes were washed with 50 mM ABC/50% ACN, dried in a vacuum centrifuge at 50 °C for 10 min and covered with trypsin solution (Promega, 6.25 ng/ml in 50 mM ABC). Following rehydration with trypsin solution and removal of excess trypsin, the gel cubes were covered with 50 mM ABC and incubated overnight at 25 °C. Peptides were extracted from the gel cubes with 1% formic acid (FA) (once) and 5% FA/50% ACN (twice). All extracts were pooled and stored at −20 °C until use. Prior to liquid chromatography-mass spectrometry (LC-MS), the extracts were concentrated in a vacuum centrifuge at 60 °C, after which volumes were adjusted to 50 μl with 0.05% FA and filtered through a 0.45 μm spin filter into LC autosampler vials [13].

2.3 NanoLC-MS/MS proteomic analysis and database searching

NanoLC-MS/MS analysis was performed as previously described [14]. In brief, peptides were separated using an Ultimate 3000 nanoLC system (Dionex LC-Packings, Amsterdam, The Netherlands) equipped with a 40 cm × 75 μm internal diameter (ID) fused silica column custom packed with 1.9 μm 120 Å ReproSil Pur C18 aqua (Dr Maisch GmbH, Ammerbuch-Entringen, Germany). The samples were injected by gel band, starting with gel band 1 at the top of the gel for all samples, followed by gel band 2, until the final gel band 5. The experiment was considered as one continuous injection series with a blank injection at the start of the experiment. After injection, peptides were trapped at 6 μl/min on a 1 cm × 100 μm ID trap column packed with 5 μm 120 Å ReproSil C18 aqua at 2% buffer B (buffer A: 0.05% formic acid in MQ; buffer B: 80% acetonitrile + 0.05% formic acid in MQ) and separated at 300 nl/min in a 10–40% buffer B gradient for 75 min (100 min inject-to-inject). Eluting peptides were ionized at a potential of +2 kV into a Q Exactive mass spectrometer (Thermo Fisher, Bremen, Germany). Intact masses were measured at a resolution of 70,000 (at m/z 200) in the Orbitrap using an AGC target value of 3E6 charges. The top 10 peptide signals (charge states 2+ and higher) were submitted to MS/MS in the HCD (higher-energy collisional dissociation) cell (1.6 amu isolation width, 25% normalized collision energy). MS/MS spectra were acquired at a resolution of 17,500 (at m/z 200) in the Orbitrap using an AGC target value of 1E6 charges, a maximum injection time (maxIT) of 60 ms and an underfill ratio of 0.1%. Dynamic exclusion was applied with a repeat count of 1 and an exclusion time of 30 s. MS/MS spectra were searched against a Swissprot reference proteome FASTA file (release January 2018, 42,258 entries, canonical and isoforms, no fragments), using MaxQuant version 1.6.0.16 [15]. Enzyme specificity was set to trypsin and up to two missed cleavages were allowed. 
Cysteine carboxamidomethylation (Cys, +57.021464 Da) was treated as fixed modification and methionine oxidation (Met, +15.994915 Da) and N-terminal acetylation (N-terminal, +42.010565 Da) as variable modifications. Peptide precursor ions were searched with a maximum mass deviation of 4.5 ppm and fragment ions with a maximum mass deviation of 20 ppm. Peptide and protein identifications were filtered at a false discovery rate (FDR) of 1% using the decoy database strategy. Proteins that could not be differentiated based on MS/MS spectra alone were grouped into protein groups (default MaxQuant settings). A protein was considered identified when at least 1 unique peptide was identified in one sample at high confidence (peptide and protein FDR < 1%). Searches were performed with the label-free quantification option selected. Proteins detected were quantified based on MaxQuant (version 1.6.0.16) output data. Label-free quantification (LFQ) intensities were filtered to remove contaminants and only proteins with observations across all samples were retained. The MS proteomics data have been deposited to the ProteomeXchange Consortium via PRIDE [16] with accession number PXD015744.
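The final filtering step, retaining only proteins observed in every sample, can be illustrated with a minimal Python sketch. It assumes that a missing observation is encoded as a zero or absent LFQ intensity; the protein names, the intensities and the function name keep_complete_proteins are hypothetical and are not taken from the deposited data.

```python
def keep_complete_proteins(lfq):
    """Keep proteins that have a positive LFQ intensity in every sample.

    lfq maps a protein identifier to a list of per-sample intensities.
    Assumption: a missing value is stored as 0 or None.
    """
    return {
        protein: values
        for protein, values in lfq.items()
        if all(v is not None and v > 0 for v in values)
    }


# Hypothetical intensities for three samples.
lfq = {
    "PROT_A": [1.2e7, 9.8e6, 1.1e7],
    "PROT_B": [3.4e6, 0.0, 2.9e6],    # not observed in sample 2
    "PROT_C": [5.5e5, 6.1e5, None],   # not observed in sample 3
}

complete = keep_complete_proteins(lfq)
print(sorted(complete))
```

Restricting the network to complete observations avoids imputing values before correlations are computed, at the cost of discarding proteins with any missing measurement.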

2.4 Weighted gene correlation network analysis (WGCNA) and functional enrichment of identified modules

A protein co-expression network is an undirected graph, where each node corresponds to a protein, and each edge connects a pair of proteins that are significantly correlated [17]. The key concept in WGCNA is “connectivity”. Connectivity describes direct and indirect relationships between two proteins/genes in networks [18]. This metric has previously been used, for example, for drug prioritization in breast cancer by Neidlin et al. [19].

To investigate co-expressed proteins in resected patient PDAC samples, we used the WGCNA package [6] in R version 3.5.0. WGCNA defines modules as groups of densely interconnected proteins [18]. In unweighted networks the only information given is whether two proteins are correlated (“yes” or “no”), while weighted networks also convey the strength of each correlation. To suppress random noise and emphasize strong correlations, the user must choose a threshold. In the WGCNA package, and therefore in this study, this choice was made by applying the scale-free topology criterion [20] using a soft threshold, also known as “beta power”. Different soft thresholds were tested (from 1 to 20) and a power of 10 was judged sufficient to obtain a network approximating a scale-free topology (correlation = 0.90), as shown in Supplementary Fig. S1. More explicitly, the adjacency matrix is obtained by raising the correlation value between two proteins to the power threshold β (Formula 1):

$$ {a}_{(ij)}={s}_{(ij)}^{\beta } $$

where a_(ij) is the weighted adjacency between proteins i and j, defined by raising the co-expression similarity s_(ij) to a power β. Finally, modules are obtained by applying a tree-cut threshold to the branches of the hierarchical clustering dendrogram. In this study, a dynamic cut-tree method was chosen. Briefly, the algorithm starts by obtaining a few large clusters by the static tree cut and then implements an iterative process of cluster decomposition and combination. The iteration stops only when the number of clusters becomes stable [21]. The dynamic tree method is essential to avoid relatively small modules. In this study, the minimum module size was set to 20 and the cut height was set to 0.998 automatically by the WGCNA tool. Each module was summarized by a vector of values (1 × 20 in this analysis, where 20 is the number of samples) that was called Module Eigenprotein (ME) and corresponded to the first principal component of the given module [22]. The final network was defined using the weighted option and threshold = 0.02 based on the value range of the data.
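As an illustration of Formula 1, the following Python sketch computes a soft-thresholded adjacency for a toy dataset. It is not the WGCNA R implementation. It assumes an unsigned network in which the similarity s_(ij) is the absolute Pearson correlation, uses β = 10 as in this study, and defines connectivity as the sum of a protein's adjacencies to all other proteins. The protein names P1 to P4 and their values are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equally long abundance profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def adjacency(profiles, beta=10):
    """Unsigned soft-threshold adjacency: a_ij = |cor(i, j)| ** beta."""
    names = list(profiles)
    return {
        (i, j): abs(pearson(profiles[i], profiles[j])) ** beta
        for i in names
        for j in names
        if i != j
    }


def connectivity(adj, name):
    """Sum of the adjacencies between one protein and all others."""
    return sum(weight for (i, _), weight in adj.items() if i == name)


# Hypothetical abundance profiles across four samples.
profiles = {
    "P1": [1.0, 2.0, 3.0, 4.0],
    "P2": [2.0, 4.0, 6.0, 8.0],   # perfectly correlated with P1
    "P3": [4.0, 3.0, 2.0, 1.0],   # perfectly anti-correlated with P1
    "P4": [1.0, 3.0, 2.0, 4.0],   # correlation of 0.8 with P1
}

adj = adjacency(profiles)
print(adj[("P1", "P4")], connectivity(adj, "P1"))
```

Raising the correlation to a high power leaves strong correlations nearly intact while pushing moderate ones toward zero: a correlation of 0.8 yields an adjacency of roughly 0.11, whereas a correlation of 1 or −1 yields 1 in an unsigned network.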

The GSEA of the modules was performed by mapping the proteins to gene names and submitting the gene list of each module to the GSEA Broad Institute browser [23]. Gene sets were ranked by significant p value and number of overrepresented genes. We adopted GSEA to perform functional enrichment analysis for each subnetwork based on GO biological process (BP) terms, cellular components (CC), hallmarks of cancer (HC) and transcription factor binding sites (TFBS). Each gene set was ranked using the FDR score and the number of overlapping genes between the module and the gene set. Moreover, the top 5 over-represented terms of each module were subjected to STRING analysis, in order to find the best descriptive biology for each specific module.
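Overlap-based enrichment of this kind is commonly scored with a hypergeometric tail probability. The Python sketch below shows that calculation under this assumption; it may differ from the exact procedure and multiple-testing correction applied by the web tool. The function name overlap_p_value and the numbers in the example are hypothetical.

```python
from math import comb


def overlap_p_value(universe, gene_set, module, overlap):
    """Probability of seeing at least `overlap` shared genes by chance.

    universe: number of genes in the background
    gene_set: number of background genes annotated to the gene set
    module:   number of genes in the module
    overlap:  number of module genes that belong to the gene set
    """
    total = comb(universe, module)
    upper = min(gene_set, module)
    favourable = sum(
        comb(gene_set, k) * comb(universe - gene_set, module - k)
        for k in range(overlap, upper + 1)
    )
    return favourable / total


# Hypothetical example: 10 background genes, a gene set of 4,
# a module of 3 genes, of which 2 fall in the gene set.
p = overlap_p_value(universe=10, gene_set=4, module=3, overlap=2)
print(p)
```

In practice the p values of all tested gene sets would then be adjusted for multiple testing to obtain the FDR scores used for ranking.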

2.5 Survival analysis and meta-analysis

Clinical data of patients undergoing resection were obtained from electronic patient records and referral hospitals. Survival data were obtained from government registration. The ME for each module was then correlated to DFS and OS. Given a trait T and a Module Eigenprotein (ME), correlation or univariate regression models can be used to measure the extent of their association. Modules with a high trait significance may underlie biological pathways associated with the sample trait. Meta-analysis was carried out by applying univariate Cox regression, multivariate Cox regression and log-rank tests on our proteomics dataset and on two different transcriptomics datasets (TCGA [24] and Moffitt et al. [25]). Prognostic marker candidates were ranked based on the number of significant p values obtained from the above-mentioned statistical tests. Kaplan-Meier curves were plotted using the “survminer” package in R.
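To make the notion of a Kaplan-Meier curve with censored observations concrete, here is a minimal pure-Python sketch of the product-limit estimator. It is not the survminer implementation used in the study, and the follow-up times and event indicators are hypothetical.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of the survival function.

    times:  follow-up time per patient
    events: 1 if the event (death or recurrence) was observed, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        observed = 0
        leaving = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            observed += data[i][1]
            leaving += 1
            i += 1
        if observed:
            survival *= 1 - observed / at_risk
            curve.append((t, survival))
        # Censored patients leave the risk set without lowering the curve.
        at_risk -= leaving
    return curve


# Hypothetical follow-up in months; 0 marks a censored patient.
times = [5, 8, 8, 12, 20]
events = [1, 1, 0, 1, 0]
curve = kaplan_meier(times, events)
print(curve)
```

A censored patient contributes to the number at risk up to the censoring time but never produces a downward step, which is how a patient censored for OS analysis is handled.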

2.6 Immunohistochemical validation of prognostic markers in an independent cohort

Immunohistochemistry (IHC) of tissue microarrays (TMAs) was evaluated as previously described [26]. In brief, FFPE tissues from resected patients were selected and combined in TMAs, including four representative cores from four different tumor areas for each patient. IHC staining of KHSRP, SPTBN1 and PYGL was performed according to the manufacturer’s protocols. Anti-KHSRP monoclonal antibody (1:200, anti-KHSRP rabbit ab150393 Abcam), anti-SPTBN1 monoclonal antibody (1:500, anti-SPTBN1 mouse MA3–062, Invitrogen) and anti-PYGL polyclonal antibody (1:150, anti-PYGL rabbit ab198268) were used. Visualization was obtained using a BenchMark Special Stain Automation system (Ventana Medical Systems, Export, USA). Protein staining was evaluated by a molecular pathologist, assessing the amount of tumor and tissue loss, background and overall interpretability. Cytoplasmic immunostaining intensity was classified into four grades: 0 (absent), 1 (weak), 2 (moderate) and 3 (strong), for both SPTBN1 and PYGL. To reduce the scoring complexity, samples were defined as “with high expression” when the staining score was >2 in at least 50% of the tumor cells. The nuclear immunostaining intensity of KHSRP was classified into two grades: 0 (absent) and 1 (present). All patients provided written informed consent for the storage and analysis of their tumor material and survival data. This study was approved by the Local Ethics Committee of the University of Pisa (Ethics approval #3909, July 3rd, 2013).

Univariate and multivariate analyses were performed using a Cox regression model. Proteins with HR (Hazard Ratio) < 1 were considered protective and those with HR > 1 were defined as non-protective. Meanwhile, proteins with p values < 0.05 were considered statistically significant. A risk score method was used to assess the association of the three prognostic markers with OS in a multivariate analysis. The risk score was evaluated by combining the TMA scores of prognostic proteins weighted by their regression coefficients from univariate Cox regression (Formula 2).

$$ \mathrm{Risk\ score}=\sum \limits_{i=1}^{n}{\mathrm{TMA}}_{\mathrm{score},i}\times {\beta}_{i} $$

where n is the number of prognostic proteins, TMA_(score,i) is the TMA score for protein i, and β_i is the regression coefficient of protein i in the univariate Cox regression analysis.
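A short Python sketch of Formula 2 follows. The regression coefficients and TMA scores are invented for illustration only and are not the values estimated in this study; they merely follow the convention that a negative coefficient (HR < 1) marks a protective protein and a positive coefficient (HR > 1) a non-protective one.

```python
import math


def risk_score(tma_scores, coefficients):
    """Sum of TMA scores weighted by univariate Cox coefficients."""
    return sum(tma_scores[protein] * beta for protein, beta in coefficients.items())


# Hypothetical univariate Cox coefficients (log hazard ratios).
coefficients = {"SPTBN1": -0.5, "KHSRP": -0.4, "PYGL": 0.6}

# Hazard ratio per protein: HR = exp(beta).
hazard_ratios = {p: math.exp(b) for p, b in coefficients.items()}

# Hypothetical dichotomized TMA scores (1 = high expression, 0 = low).
patient_a = {"SPTBN1": 1, "KHSRP": 1, "PYGL": 0}
patient_b = {"SPTBN1": 0, "KHSRP": 0, "PYGL": 1}

score_a = risk_score(patient_a, coefficients)
score_b = risk_score(patient_b, coefficients)
print(score_a, score_b)
```

With these illustrative numbers, the patient expressing the two protective markers obtains the lower score, so patients can be ranked or split into risk groups by their score before survival is compared.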

Group comparisons were evaluated using the unpaired nonparametric Mann-Whitney U test or unpaired Student’s t test. Fisher’s exact test was used for categorical analysis. Correlations with clinicopathological characteristics, including DFS and OS, were tested using Kaplan-Meier curves and the log-rank test, as described above.

2.7 Cell culture

Hs766t cells (ATCC, Manassas, USA) were grown in DMEM (Lonza, Verviers, Belgium) supplemented with 10% heat-inactivated fetal bovine serum and 1% penicillin-streptomycin (10,000 U/ml, Gibco, Gaithersburg, MD, USA). Cells were kept at 37 °C in an atmosphere of 5% CO₂ in 75 cm² tissue culture flasks (Greiner Bio-One GmbH, Frickenhausen, Germany) and, for all the experimental procedures, harvested using trypsin-EDTA (Sigma, Zwijndrecht, The Netherlands) in their exponentially growing phase. Cells were tested within the last 3 months by microscopic morphology check and growth curve analysis according to the Cell Line Verification Test Recommendations (ATCC-Technical Bulletin No. 8, 2008). Periodic assays were carried out to detect mycoplasma contamination, and the identity of the cells was confirmed by PCR profiling using short tandem repeats (STR).

2.8 Immunofluorescence assay

Immunofluorescence analysis was performed according to a previously established protocol [27]. Briefly, cells were seeded in a Chamber-Slides System (Lab-Tek, Thermo Fisher Scientific, Waltham, USA) at a density of 5000 cells/well and allowed to attach overnight. Next, co-expression of KHSRP and SPTBN1 was evaluated in Hs766t PDAC cells, stained simultaneously with an anti-KHSRP monoclonal antibody (1:400, anti-KHSRP rabbit ab150393 Abcam) followed by an Alexa Fluor 535 anti-rabbit antibody (Red; 1:70), and an anti-SPTBN1 monoclonal antibody (1:100, anti-SPTBN1 mouse MA3–062, Invitrogen) followed by an Alexa Fluor 488 anti-mouse antibody (Green; 1:70). Nuclear DNA was stained with 4′,6-diamidino-2-phenylindole (DAPI). Images were captured using a Zeiss Laser Scanning Microscope, then processed and merged using Axiovision 4.1 software (Zeiss Microimaging, Thornwood, USA). In vitro experiments were performed with a minimum of three biological replicates, evaluating at least 100 cells.

3 Results

3.1 PDAC tissue proteomics and co-expression analysis

To obtain proteome level insight into PDAC cells, we used in-depth proteomics based on label-free nanoLC-MS/MS of gel-fractionated proteins to generate proteomic profiles of a cohort of 20 patients. The clinical characteristics of the selected patients are listed in Supplementary Table S1. We ensured equal protein loading of the samples to obtain optimal results (Supplementary Fig. S2). The obtained dataset consisted of 5667 proteins (contaminants removed) encoded by 5494 genes. Unsupervised clustering using all proteins did not reveal any specific grouping of the samples (Supplementary Fig. S3). The proteome dataset was subsequently used to establish a PDAC protein network. To obtain robust co-expression networks, we restricted the analysis to 993 proteins identified in all samples (Supplementary Table S2). Subsequent co-expression analysis yielded 12 consensus modules (Fig. 1), that were subsequently analyzed by GSEA to characterize the associated biology. Each module was annotated with gene sets and clinically relevant information. A complete list of genes associated with the modules is presented in Supplementary Table S3.

The modules covered a wide range of biological terms, and the most frequently occurring terms were those implicated in metabolic processes in context of the mitochondrial compartment (black, green, magenta, turquoise and yellow modules). Furthermore, five modules (blue, green, green yellow, grey and yellow) consisted predominantly of immune system and defense response, probably regulated by STAT3 and ETS2 transcription factors (shown in the TFBS column in Fig. 1), while one module (brown) was linked to coagulation and platelet activation. Four modules were associated with epithelial-to-mesenchymal transition (EMT) processes (black, magenta, pink and purple modules). One module was enriched for transcription factors with STAT5A and E12 binding sites. However, these binding sites were predicted based on the binding regions present in the targets. The transcription factors did not show over-expression in our PDAC cohort.

3.2 Modules as potential prognostic markers for pancreatic cancer

The rationale behind the correlation network approach is to use the network language, which is particularly intuitive to biologists and allows for simple social network analogies. This method indeed allows the detection of biologically meaningful communities in the network and the study of relationships between them, helping the user to define interesting modules associated to external traits. Since co-expressed protein modules were identified and associated with hallmarks of cancer, we hypothesized that some modules may harbor potential markers for PDAC prognosis. Indeed, the magenta module, which presents EMT and glycolysis pathway components, exhibited positive and significant correlations with DFS and OS in our cohort (Fig. 1). Subsequently, we explored the network biology of the prognostic co-expression module and found that this module is involved in metabolism, as can be inferred from the presence of PYGL, SOD2, GSR, GSS, PKM2, DDAH1 and TST, as well as EMT through ENO2, PLOD1 and FMOD (Fig. 2). Moreover, factors exclusively related to EMT were: COL12A1, TPM4, THBS1, FN1, POSTN, COMP, THBS2, CALU and FBLN2 (Fig. 2).

3.3 The Magenta module comprises candidate prognostic biomarkers for resected PDAC

For each module, we obtained the Module Eigenprotein (ME) for further survival association analyses. Only one module was significantly associated with DFS and OS in our proteomics cohort. The same protein signature was tested for OS association with transcriptomics data of the TCGA-PAAD project. The p values for the overall tests (i.e., likelihood ratio, Wald and log-rank score) were 0.007, 0.01 and 0.01, respectively, indicating that the same gene signature is also significantly associated with OS in the transcriptomics data. Through subsequent investigation of epigenetic alterations of those genes, we found that oxidative stress and ECM-EMT related genes were not methylated. Therefore, we conclude that epigenetic inhibition of these genes was not prevalent. Additionally, we explored whether it was possible to refine the prognostic signature list by analyzing two publicly available independent transcriptomics datasets [24, 25]. To this end, different statistical tests were applied to prioritize the prognostic candidates, and the genes were ranked based on the frequency of significant observations among the tests (Supplementary Table S4). Our analysis revealed three potential top candidate biomarkers linked with prognosis: scaffold membrane protein spectrin beta chain, non-erythrocytic-1 (SPTBN1), splicing regulatory protein KHSRP and glycogen phosphorylase (PYGL) (Fig. 3). SPTBN1 is an actin-crosslinking protein that links the plasma membrane to the actin cytoskeleton. KHSRP is a multifunctional RNA-binding protein implicated in transcription, pre-mRNA splicing and mRNA localization to control important cellular processes such as metabolism, immune response, proliferation and differentiation. PYGL is a crucial phosphorylase that catalyzes the release of glucose molecules from glycogen, the major carbohydrate storage source. Cells under hypoxic conditions accelerate glycogen metabolism for an optimal glucose utilization (Warburg effect). 
Thus, hypoxic cancer cells (as pancreatic cancer cells typically are) require PYGL for glycolysis and glycogen degradation [28]. Based on genetic data from the TCGA consortium we found that alterations in PYGL can discriminate patient survival (p value < 0.001) even though the number of samples for short survival was relatively small. Interestingly, we found that high expression of SPTBN1 was associated with good prognosis in the proteomics data, but with poor prognosis in the transcriptomics data (Supplementary Fig. S4). Correlations between mRNA and protein data have been extensively studied and debated in the past years [29,30,31], including in two recent large-scale clinical cancer proteogenomics studies. A more recent study by Vasaikar and colleagues [32] showed that enzymes belonging to the tricarboxylic acid (TCA) cycle may be universally decreased at the protein level, but not at the mRNA level. This suggests a protein-level adaptation driving a strong Warburg effect in microsatellite instable (MSI) colorectal cancer. In agreement with this observation, the module to which SPTBN1 belongs in our study is strongly enriched for metabolic genes. More specifically, these genes are involved in glycolysis (PYGL, PKM, ENO2), which precedes the TCA cycle. Another study by Mertins and colleagues [33] on breast cancer showed that despite a C-terminal truncation of GATA3, its protein expression level did not decrease, suggesting the occurrence of post-translational modification. Furthermore, these researchers found that signaling pathways such as PS1, ion channel transport and proteasome and basic cellular mechanism pathways, including ribosome, mRNA splicing, glycosylphosphatidylinositol biosynthesis and RNA polymerase, were enriched for negative correlations between mRNA and protein levels when compared to copy number alterations. Overall, these findings suggest that post-translational modifications are more prone to occur in some pathways than in others. 
Since SPTBN1 has also been shown to carry genetic alterations in hepatocellular carcinoma patients with a short OS [34], this may be a starting point for future investigations on SPTBN1 mutations in PDAC patients.

3.4 Validation of KHSRP, SPTBN1 and PYGL as prognostic candidates for resected PDAC

We used WGCNA with unsigned networks. This means that the proteins in our modules can show either positive or negative correlations and that poor or good prognostic markers can fall in the same module because they are associated with the same biology. The top 3 prognostic markers, SPTBN1, KHSRP and PYGL, were chosen for subsequent IHC validation in an independent cohort of 82 resected PDAC patients (Fig. 4). Representative IHC images of tumor cores from two selected patients with highly divergent survival times and their SPTBN1, KHSRP and PYGL expression patterns are shown in Fig. 4a. In line with the proteome data, we found that SPTBN1 and KHSRP correlated with each other and were overexpressed in patients with good prognosis, while PYGL, which anti-correlates with KHSRP and SPTBN1, correlated with poor prognosis. All three proteins had a significant prognostic value for OS (Fig. 4b) and PFS (Supplementary Fig. S5). Moreover, the signature of the three proteins taken together successfully predicted patient prognosis with p = 0.0025 (Supplementary Fig. S6). Co-expression of SPTBN1 and KHSRP was further confirmed by immunofluorescence in Hs766t cells. KHSRP (nuclear protein) and SPTBN1 (cytoplasmic protein) were clearly co-expressed (Fig. 4d) in the nucleus and in the cytoplasmic compartment, respectively.

Finally, univariate and multivariate Cox regression models were used to assess the association of the three prognostic markers with OS. We found that SPTBN1, KHSRP and PYGL maintained significance in univariate and multivariate analyses when correcting for external factors. To assess whether the three proteins together were significantly associated with OS in a multivariate analysis, a risk score was evaluated, showing that the prognostic signature of these three proteins was highly associated with OS in this independent cohort (Table 2).

Table 2 Univariate and multivariate analysis of prognostic markers for resected PDAC

4 Discussion

In the present study, we generated a comprehensive proteome dataset of 20 resected PDAC specimens and applied a weighted gene co-expression network analysis (WGCNA) to the data. WGCNA is a user-friendly and comprehensive software tool that has already been applied to several clinical features including brain cancer [35], diabetes [36] and chronic fatigue [37]. We focused on co-expression network analysis to infer biological functions and novel prognostic PDAC biomarkers. We reported a proteome dataset of 5667 proteins comprising 993 proteins identified in all samples giving rise to twelve modules in total. Protein co-expression modules were linked to well-known PDAC hallmarks of cancer such as axon-guidance, EMT, oxidative phosphorylation, MYC targets and KRAS signalling, as well as potential new relationships to biological processes. Importantly, one module was found to be significantly associated with survival. This module, called “magenta”, was functionally enriched for glycolysis, EMT, apoptosis and reactive oxidative stress, highlighting a possible interplay between these biological processes.

Despite considerable experimental and computational modeling efforts, the role of EMT in cancer is still not fully understood [38]. In particular, the connection of EMT to various properties of cancer cells such as stemness, drug resistance, metabolism and metastasis is heavily debated [39, 40]. In our current study, four modules (black, magenta, pink and purple) showed overrepresentation of different sets of EMT genes, correlating with metabolic pathways, which suggests that cell metabolism can influence the EMT state or vice versa. Previously, tumor metabolism has been found to be associated with EMT [40, 41], illustrating the complexity of the interplay between EMT and metabolic reprogramming. Interestingly, these four modules were regulated by different transcription factors. In the magenta module, transcription factor binding sites (TFBS) for STAT5 and E12 were noted. STAT5 has been shown to be overexpressed during EMT, and aberrant activity of this transcription factor has been found to induce mitochondrial dysfunction and reactive oxygen species (ROS) formation, leading to DNA damage [42]. In addition, E12 has been found to be associated with repression of E-cadherin (and thus EMT) in mouse models [43].

Of note, all modules with SP1 as transcription factor binding site (yellow, turquoise, red, green, blue, black) were found to be associated with MYC targets, as previously described [44, 45]. SP1 has been shown to regulate the expression of thousands of genes implicated in the control of a diverse array of cellular processes, such as growth [44], differentiation [46], apoptosis [44], angiogenesis [47] and immune response [48]. These cellular processes are all linked to the proteomic modules of our cohort that present SP1 as a putative transcription factor.

The magenta module comprised three prognostic markers, SPTBN1, KHSRP and PYGL, which were subsequently validated in an independent cohort of 82 patients. These markers may be used in the future to evaluate and predict clinical responses of patients with resected PDAC. SPTBN1 is a dynamic intracellular non-pleckstrin homology-domain protein, which plays important roles in cellular shape formation, protection of membranes against stress, positioning of transmembrane proteins, and molecular trafficking. Spectrin is made up of four subunits. Among these, the beta subunits are responsible for most of the binding activity and for its role as a transforming growth factor-β signal-transducing adapter protein that is necessary to form Smad3/Smad4 complexes [49]. KHSRP (KH-Type Splicing Regulatory Protein) controls important cellular processes such as proliferation, differentiation and metabolism. KHSRP (also known as FBP2) is a factor interacting with an enhancer element upstream of the c-MYC oncogene promoter [50]. In the past twenty years, additional roles of KHSRP in post-transcriptional control of gene expression have been discovered, with implications for pre-mRNA splicing [51], mRNA decay [52] and microRNA biogenesis [53]. PYGL catalyzes the degradation of glycogen [54] and is responsible for maintaining blood glucose homeostasis by regulating the release of glucose 1-phosphate from liver glycogen stores [55].

Importantly, transcriptomics data of all three biomarkers revealed significant associations with survival. Interestingly, SPTBN1 could be defined as a good prognostic marker based on the proteomics as well as the protein-based IHC data, while it was associated with poor prognosis based on the transcriptomics data (Supplementary Fig. S4A). Systematic studies have revealed multiple processes underlying the “non-correlation” of mRNA expression and protein concentration levels [56]. These include (i) specific translation rates due to, e.g., upstream open reading frames (uORFs) [57] or internal ribosome entry sites (IRES), (ii) translation rate modulation due to the binding of regulatory proteins or micro-RNAs [58], and (iii) modulation of protein half-life involving the complex ubiquitin-proteasome pathway [59] or autophagy, all of which may influence protein concentrations independent of transcript levels.

Although we captured three new prognostic biomarkers and the biology associated with these, there are some limitations to our study that need to be noted. Due to the high heterogeneity of PDAC and the limited number of samples, we were not able to delineate proteomics-based PDAC subtypes. Exploring correspondence or correlation with known PDAC subtypes is challenging due to the lack of PDAC subtypes based on proteomics data. Furthermore, because of the limited number of samples, this study should be considered as a first exploratory analysis and its prognostic relevance needs to be validated in additional clinical studies.

Taken together, our data indicate that an EMT-metabolic module is associated with the prognosis of PDAC patients after surgical resection and that the module’s proteins SPTBN1, KHSRP and PYGL may serve as potential prognostic biomarkers. Our results also show that co-expression networks are able to extrapolate tumor-specific biology as well as biological mechanisms empowering prognostic marker discovery, even with a limited number of samples.

Abbreviations

WGCNA: Weighted gene co-expression network analysis
PDAC: Pancreatic ductal adenocarcinoma
ME: Module eigenprotein
EMT: Epithelial-mesenchymal transition
OX-PHOS: Oxidative phosphorylation
TFBS: Transcription factor binding site
OS: Overall survival
DFS: Disease-free survival

References

P. Rawla, T. Sunkara, V. Gaduputi, Epidemiology of pancreatic cancer: Global trends, etiology and risk factors. World J. Oncol. 10, 10–27 (2019)
A. Maitra, R.H. Hruban, Pancreatic cancer. Annu. Rev. Pathol. Mech. Dis. 3, 157–188 (2008)
A. Vincent, J. Herman, R. Schulick, R.H. Hruban, M. Goggins, Pancreatic cancer. Lancet 378, 607–620 (2011)
C. Wu, F. Zhou, J. Ren, X. Li, Y. Jiang, S. Ma, A selective review of multi-level Omics data integration using variable selection. High-Throughput 8, 4 (2019)
A.-L. Barabási, R. Albert, Emergence of scaling in random networks. Science 286, 509–512 (1999)
P. Langfelder, S. Horvath, WGCNA: An R package for weighted correlation network analysis. BMC Bioinformatics 9, 559 (2008)
A.A. Margolin, I. Nemenman, K. Basso, C. Wiggins, G. Stolovitzky, R.D. Favera, A. Califano, ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics 7 Suppl 1:S7 (2006)
J. Tang, D. Kong, Q. Cui, K. Wang, D. Zhang, Y. Gong, G. Wu, Prognostic genes of breast Cancer identified by gene co-expression network analysis. Front. Oncol. 8, 374 (2018)
H. Nakamura, K. Fujii, V. Gupta, H. Hata, H. Koizumu, M. Hoshikawa, S. Naruki, Y. Miyata, I. Takahashi, T. Miyazawa, H. Sakai, K. Tsumoto, M. Takagi, H. Saji, T. Nishimura, Identification of key modules and hub genes for small-cell lung carcinoma and large-cell neuroendocrine lung carcinoma by weighted gene co-expression network analysis of clinical tissue-proteomes. PLoS One 14, e0217105 (2019)
Q. Zhang, C. Ma, M. Gearing, P.G. Wang, L.-S. Chin, L. Li, Integrated proteomics and network analysis identifies protein hubs and network alterations in Alzheimer’s disease. Acta Neuropathol. Commun. 6, 19 (2018)
N. T. Seyfried, E. B. Dammer, V. Swarup, D. Nandakumar, D. M. Duong, L. Yin, Q. Deng, T. Nguyen, C. M. Hales, T. Wingo, J. Glass, M. Gearing, M. Thambisetty, J. C. Troncoso, D. H. Geschwind, J. J. Lah, A. I. Levey, A Multi-network approach identifies protein-specific co-expression in asymptomatic and symptomatic Alzheimer’s disease. Cell. Syst. 4, 60–72.e4 (2017)
F. Böttger, E.A. Semenova, J.-Y. Song, G. Ferone, J. van der Vliet, M. Cozijnsen, R. Bhaskaran, L. Bombardelli, S.R. Piersma, T.V. Pham, C.R. Jimenez, A. Berns, Tumor heterogeneity underlies differential cisplatin sensitivity in mouse models of small-cell lung cancer. Cell. Rep. 27, 3345–3358.e4 (2019)
S.R. Piersma, J.C. Knol, I. de Reus, M. Labots, B.K. Sampadi, T.V. Pham, Y. Ishihama, H.M.W. Verheul, C.R. Jimenez, Feasibility of label-free phosphoproteomics and application to base-line signaling of colorectal cancer cell lines. Proteome Quest Understand Biol. Dis. HUPO 2014, 247–258 (2015)
M. de Wit, H. Kant, S.R. Piersma, T.V. Pham, S. Mongera, M.P.A. van Berkel, E. Boven, F. Pontén, G.A. Meijer, C.R. Jimenez, R.J.A. Fijneman, Colorectal cancer candidate biomarkers identified by tissue secretome proteome profiling. J. Proteome 99, 26–39 (2014)
J. Cox, M. Mann, MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367 (2008)
J.A. Vizcaíno, E.W. Deutsch, R. Wang, A. Csordas, F. Reisinger, D. Ríos, J.A. Dianes, Z. Sun, T. Farrah, N. Bandeira, P.-A. Binz, I. Xenarios, M. Eisenacher, G. Mayer, L. Gatto, A. Campos, R.J. Chalkley, H.-J. Kraus, J.P. Albar, S. Martinez-Bartolomé, R. Apweiler, G.S. Omenn, L. Martens, A.R. Jones, H. Hermjakob, ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nat. Biotechnol. 32, 223–226 (2014)
J.M. Stuart, A gene-Coexpression network for global discovery of conserved genetic modules. Science 302, 249–255 (2003)
B. Zhang, S. Horvath, A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 4, Article 17 (2005)
M. Neidlin, S. Dimitrakopoulou, L.G. Alexopoulos, Multi-tissue network analysis for drug prioritization in knee osteoarthritis. Sci. Rep. 9, 15176 (2019)
R. Albert, Scale-free networks in cell biology. J. Cell Sci. 118, 4947–4957 (2005)
P. Langfelder, B. Zhang, S. Horvath, Defining clusters from a hierarchical cluster tree: The dynamic tree cut package for R. Bioinformatics 24, 719–720 (2008)
P. Langfelder, S. Horvath, Eigengene networks for studying the relationships between co-expression modules. BMC Syst. Biol. 1, 54 (2007)
A. Subramanian, P. Tamayo, V.K. Mootha, S. Mukherjee, B.L. Ebert, M.A. Gillette, A. Paulovich, S.L. Pomeroy, T.R. Golub, E.S. Lander, J.P. Mesirov, Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005)
R.L. Grossman, A.P. Heath, V. Ferretti, H.E. Varmus, D.R. Lowy, W.A. Kibbe, L.M. Staudt, Toward a shared vision for cancer genomic data. N. Engl. J. Med. 375, 1109–1112 (2016)
R.A. Moffitt, R. Marayati, E.L. Flate, K.E. Volmar, S.G.H. Loeza, K.A. Hoadley, N.U. Rashid, L.A. Williams, S.C. Eaton, A.H. Chung, J.K. Smyla, J.M. Anderson, H.J. Kim, D.J. Bentrem, M.S. Talamonti, C.A. Iacobuzio-Donahue, M.A. Hollingsworth, J.J. Yeh, Virtual microdissection identifies distinct tumor- and stroma-specific subtypes of pancreatic ductal adenocarcinoma. Nat. Genet. 47, 1168–1178 (2015)
E. Giovannetti, Q. Wang, A. Avan, N. Funel, T. Lagerweij, J.-H. Lee, V. Caretti, A. van der Velde, U. Boggi, Y. Wang, E. Vasile, G.J. Peters, T. Wurdinger, G. Giaccone, Role of CYB5A in pancreatic cancer prognosis and autophagy modulation. J. Natl. Cancer Inst. 106 (2014)
O. Firuzi, P.P. Che, B. El Hassouni, M. Buijs, S. Coppola, M. Löhr, N. Funel, R. Heuchel, I. Carnevale, T. Schmidt, G. Mantini, A. Avan, L. Saso, G.J. Peters, E. Giovannetti, Role of c-MET inhibitors in overcoming drug resistance in spheroid models of primary human pancreatic cancer and stellate cells. Cancers 11, 638 (2019)
E. Favaro, K. Bensaad, M.G. Chong, D.A. Tennant, D.J.P. Ferguson, C. Snell, G. Steers, H. Turley, J.-L. Li, U.L. Günther, F.M. Buffa, A. McIntyre, A.L. Harris, Glucose utilization via glycogen phosphorylase sustains proliferation and prevents premature senescence in cancer cells. Cell. Metab. 16, 751–764 (2012)
Y. Liu, A. Beyer, R. Aebersold, On the dependency of cellular protein levels on mRNA abundance. Cell 165, 535–550 (2016)
M. Wilhelm, J. Schlegl, H. Hahne, A.M. Gholami, M. Lieberenz, M.M. Savitski, E. Ziegler, L. Butzmann, S. Gessulat, H. Marx, T. Mathieson, S. Lemeer, K. Schnatbaum, U. Reimer, H. Wenschuh, M. Mollenhauer, J. Slotta-Huspenina, J.-H. Boese, M. Bantscheff, A. Gerstmair, F. Faerber, B. Kuster, Mass-spectrometry-based draft of the human proteome. Nature 509, 582–587 (2014)
A. Franks, E. Airoldi, N. Slavov, Post-transcriptional regulation across human tissues. PLoS Comput. Biol. 13, e1005535 (2017)
S. Vasaikar, C. Huang, X. Wang, V.A. Petyuk, S.R. Savage, B. Wen, Y. Dou, Y. Zhang, Z. Shi, O.A. Arshad, M.A. Gritsenko, L.J. Zimmerman, J.E. McDermott, T.R. Clauss, R.J. Moore, R. Zhao, M.E. Monroe, Y.-T. Wang, M.C. Chambers, R.J.C. Slebos, K.S. Lau, Q. Mo, L. Ding, M. Ellis, M. Thiagarajan, C.R. Kinsinger, H. Rodriguez, R.D. Smith, K.D. Rodland, D.C. Liebler, T. Liu, B. Zhang, A. Pandey, A. Paulovich, A. Hoofnagle, D.R. Mani, D.W. Chan, D.F. Ransohoff, D. Fenyo, D.L. Tabb, D.A. Levine, E.S. Boja, E. Kuhn, F.M. White, G.A. Whiteley, H. Zhu, H. Zhang, I.-M. Shih, J. Bavarva, J. Whiteaker, K.A. Ketchum, K.R. Clauser, K. Ruggles, K. Elburn, L. Hannick, M. Watson, M. Oberti, M. Mesri, M.E. Sanders, M. Borucki, M.A. Gillette, M. Snyder, N.J. Edwards, N. Vatanian, P.A. Rudnick, P.B. McGarvey, P. Mertins, R.R. Townsend, R.R. Thangudu, R.C. Rivers, S.H. Payne, S.R. Davies, S. Cai, S.E. Stein, S.A. Carr, S.J. Skates, S. Madhavan, T. Hiltke, X. Chen, Y. Zhao, Y. Wang, Z. Zhang, Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities. Cell 177, 1035–1049.e19 (2019)
NCI CPTAC, P. Mertins, D.R. Mani, K.V. Ruggles, M.A. Gillette, K.R. Clauser, P. Wang, X. Wang, J.W. Qiao, S. Cao, F. Petralia, E. Kawaler, F. Mundt, K. Krug, Z. Tu, J.T. Lei, M.L. Gatza, M. Wilkerson, C.M. Perou, V. Yellapantula, K. Huang, C. Lin, M.D. McLellan, P. Yan, S.R. Davies, R.R. Townsend, S.J. Skates, J. Wang, B. Zhang, C.R. Kinsinger, M. Mesri, H. Rodriguez, L. Ding, A.G. Paulovich, D. Fenyö, M.J. Ellis, S.A. Carr, Proteogenomics connects somatic mutations to signalling in breast cancer. Nature 534, 55–62 (2016)
J. Chen, S. Zaidi, S. Rao, J.-S. Chen, L. Phan, P. Farci, X. Su, K. Shetty, J. White, F. Zamboni, X. Wu, A. Rashid, N. Pattabiraman, R. Mazumder, A. Horvath, R.-C. Wu, S. Li, C. Xiao, C.-X. Deng, D.A. Wheeler, B. Mishra, R. Akbani, L. Mishra, Analysis of genomes and transcriptomes of hepatocellular carcinomas identifies mutations and gene expression changes in the transforming growth factor-β pathway. Gastroenterology 154, 195–210 (2018)
S. Horvath, B. Zhang, M. Carlson, K.V. Lu, S. Zhu, R.M. Felciano, M.F. Laurance, W. Zhao, S. Qi, Z. Chen, Y. Lee, A.C. Scheck, L.M. Liau, H. Wu, D.H. Geschwind, P.G. Febbo, H.I. Kornblum, T.F. Cloughesy, S.F. Nelson, P.S. Mischel, Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target. Proc. Natl. Acad. Sci. USA 103, 17402–17407 (2006)
M.P. Keller, Y. Choi, P. Wang, D. Belt Davis, M.E. Rabaglia, A.T. Oler, D.S. Stapleton, C. Argmann, K.L. Schueler, S. Edwards, H.A. Steinberg, E. Chaibub Neto, R. Kleinhanz, S. Turner, M.K. Hellerstein, E.E. Schadt, B.S. Yandell, C. Kendziorski, A.D. Attie, A gene expression network model of type 2 diabetes links cell cycle regulation in islets with diabetes susceptibility. Genome Res. 18, 706–716 (2008)
C. Priami, Algorithmic systems biology. Commun. ACM 52, 80–88 (2009)
T. Brabletz, R. Kalluri, M.A. Nieto, R.A. Weinberg, EMT in cancer. Nat. Rev. Cancer 18, 128–134 (2018)
K. Weidenfeld, D. Barkan, EMT and Stemness in tumor dormancy and outgrowth: Are they intertwined processes? Front. Oncol. 8, 381 (2018)
C. Seliger, P. Leukel, S. Moeckel, B. Jachnik, C. Lottaz, M. Kreutz, A. Brawanski, M. Proescholdt, U. Bogdahn, A.-K. Bosserhoff, A. Vollmann-Zwerenz, P. Hau, Lactate-modulated induction of THBS-1 activates transforming growth factor (TGF)-beta2 and migration of glioma cells in vitro. PLoS One 8, e78935 (2013)
G.V. Røsland, S.E. Dyrstad, D. Tusubira, R. Helwa, T.Z. Tan, M.L. Lotsberg, I.K.N. Pettersen, A. Berg, C. Kindt, F. Hoel, K. Jacobsen, A.J. Arason, A.S.T. Engelsen, H.J. Ditzel, P.E. Lønning, C. Krakstad, J.P. Thiery, J.B. Lorens, S. Knappskog, K.J. Tronstad, Epithelial to mesenchymal transition (EMT) is associated with attenuation of succinate dehydrogenase (SDH) in breast cancer through reduced expression of SDHC. Cancer Metab. 7, 6 (2019)
C. Moser, P. Ruemmele, S. Gehmert, H. Schenk, M.P. Kreutz, M.E. Mycielska, C. Hackl, A. Kroemer, A.A. Schnitzbauer, O. Stoeltzing, H.J. Schlitt, E.K. Geissler, S.A. Lang, STAT5b as molecular target in pancreatic cancer: inhibition of tumor growth, angiogenesis, and metastases. Neoplasia 14, 915–925 (2012)
M.A. Pérez-Moreno, A. Locascio, I. Rodrigo, G. Dhondt, F. Portillo, M.A. Nieto, A. Cano, A new role for E12/E47 in the repression of E-cadherin expression and epithelial-mesenchymal transitions. J. Biol. Chem. 276, 27424–27431 (2001)
J. Kaczynski, T. Cook, R. Urrutia, Sp1- and Krüppel-like transcription factors. Genome Biol. 4, 206 (2003)
S. Kyo, M. Takakura, T. Taira, T. Kanaya, H. Itoh, M. Yutsudo, H. Ariga, M. Inoue, Sp1 cooperates with c-Myc to activate transcription of the human telomerase reverse transcriptase gene (hTERT). Nucleic Acids Res. 28, 669–677 (2000)
O.G. Opitz, A.K. Rustgi, Interaction between Sp1 and cell cycle regulatory proteins is important in transactivation of a differentiation-related gene. Cancer Res. 60, 2825 (2000)
N.M. Mazure, M.C. Brahimi-Horn, J. Pouysségur, Protein kinases and the hypoxia-inducible factor-1, two switches in angiogenesis. Curr. Pharm. Des. 9, 531–541 (2003)
K. Jones, J. Kadonaga, P. Luciw, R. Tjian, Activation of the AIDS retrovirus promoter by the cellular transcription factor, Sp1. Science 232, 755–759 (1986)
S. Chen, J. Li, P. Zhou, X. Zhi, SPTBN1 and cancer, which links? J. Cell. Physiol. 235, 17–25 (2020)
T. Davis-Smyth, R.C. Duncan, T. Zheng, G. Michelotti, D. Levens, The far upstream element-binding proteins comprise an ancient family of single-strand DNA-binding Transactivators. J. Biol. Chem. 271, 31679–31687 (1996)
H. Min, C.W. Turck, J.M. Nikolic, D.L. Black, A new regulatory protein, KSRP, mediates exon inclusion through an intronic splicing enhancer. Genes Dev. 11, 1023–1036 (1997)
P. Briata, C.-Y. Chen, A. Ramos, R. Gherzi, Functional and molecular insights into KSRP function in mRNA decay. Biochim. Biophys. Acta 1829, 689–694 (2013)
R. Gherzi, C. Chen, M. Trabucchi, A. Ramos, P. Briata, The role of KSRP in mRNA decay and microRNA precursor maturation. Wiley Interdiscip. Rev. RNA 1, 230–239 (2010)
B. Burwinkel, H.D. Bakker, E. Herschkovitz, S.W. Moses, Y.S. Shin, M.W. Kilimann, Mutations in the liver glycogen phosphorylase gene (PYGL) underlying glycogenosis type VI (hers disease). Am. J. Hum. Genet. 62, 785–791 (1998)
J.L. Ekstrom, T.A. Pauly, M.D. Carty, W.C. Soeller, J. Culp, D.E. Danley, D.J. Hoover, J.L. Treadway, E.M. Gibbs, R.J. Fletterick, Y.S.N. Day, D.G. Myszka, V.L. Rath, Structure-activity analysis of the purine binding site of human liver glycogen phosphorylase. Chem. Biol. 9, 915–924 (2002)
C.J. McManus, G.E. May, P. Spealman, A. Shteyman, Ribosome profiling reveals post-transcriptional buffering of divergent gene expression in yeast. Genome Res. 24, 422–430 (2014)
K. Wethmar, J.J. Smink, A. Leutz, Upstream open reading frames: Molecular switches in (patho)physiology. BioEssays 32, 885–893 (2010)
L.W. Barrett, S. Fletcher, S.D. Wilton, Regulation of eukaryotic gene expression by the untranslated gene regions and other non-coding elements. Cell. Mol. Life Sci. 69, 3613–3634 (2012)
Y.-C. Tang, A. Amon, Gene copy-number alterations: A cost-benefit analysis. Cell 152, 394–405 (2013)

Acknowledgments

The research reported in this publication was supported by the Dutch Cancer Society (KWF project #10212 and #11957, The Netherlands), an AIRC Start-Up grant (Italy), the CCA Foundation (Amsterdam, the Netherlands) and Fondazione Pisana Per La Scienza (Italy).

Funding

KWF grant Dutch Cancer Society (#10212 and #11957) (CJ, EG, MB), Italian Association for Cancer Research AIRC/Start-Up grant, Italy (EG), Fondazione Pisana Per La Scienza, Italy (EG).

Author information

A. M. Vallés and T. Y. S. Le Large contributed equally to this work.

Authors and Affiliations

Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Medical Oncology, Cancer Center Amsterdam, Amsterdam, The Netherlands
G. Mantini, A. M. Vallés, T. Y. S. Le Large, T. V. Pham, S. R. Piersma, E. Giovannetti & C. R. Jimenez
Fondazione Pisana Per La Scienza, Pisa, Italy
G. Mantini, M. Capula & E. Giovannetti
Amsterdam UMC, Univ of Amsterdam, Laboratory for Experimental Oncology and Radiobiology, Amsterdam, The Netherlands
T. Y. S. Le Large
Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Surgery, Amsterdam, The Netherlands
T. Y. S. Le Large & G. Kazemier
U.O. Anatomia ed Istologia Patologica II, Azienda Ospedaliero Universitaria Pisana, Pisa, Italy
N. Funel & M. F. Bijlsma
Oncode Institute, Amsterdam, The Netherlands
M. F. Bijlsma

Contributions

GM performed bioinformatics analyses. AV supervised network biology mining. TLL performed sample preparations for LC-MS/MS. SP performed LC-MS/MS. TP acquired and processed mass spectrometry data. TLL and EG were involved in the selection of patient material and clinical data collection. JK, SP, CJ, TV and were responsible for experimental design and mass spectrometry. EG, MB and CRJ were involved in experimental design and manuscript preparation. NF and EG performed IHC validations. NF and MC performed immunofluorescence validations. CRJ and EG coordinated and supervised the study. All the authors critically reviewed the manuscript.

Corresponding authors

Correspondence to E. Giovannetti or C. R. Jimenez.

Ethics declarations

Conflict of interest

The authors declare no conflict of interests. MFB has received research funding from Celgene and has acted as a consultant for Servier. Neither were involved in the drafting of this manuscript, nor the design of this study.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(DOCX 60 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Mantini, G., Vallés, A.M., Le Large, T.Y.S. et al. Co-expression analysis of pancreatic cancer proteome reveals biology and prognostic biomarkers. Cell Oncol. 43, 1147–1159 (2020). https://doi.org/10.1007/s13402-020-00548-y

Accepted: 30 June 2020
Published: 29 August 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s13402-020-00548-y
