Background

Pancreatic cancer is one of the most aggressive cancers, with a very low 5-year survival rate (6%) [1]. One of the major reasons for this dismal outcome is that, in most cases, the disease is detected at an advanced stage. The situation is further complicated by the lack of efficient early detection methods. Circulating biomarkers are generally preferred as screening tools along with various imaging methods, and CA 19–9 is the only circulating biomarker currently used for pancreatic ductal adenocarcinoma (PDAC). However, its sensitivity and specificity are not sufficient for use as an early detection biomarker of PDAC [2]. Recently, circulating nucleic acids such as tumor DNA, mRNA and non-coding RNAs (ncRNAs) have been explored for their biomarker potential in cancer and other diseases [3]. MicroRNAs (miRNAs) are the most well studied of the ncRNAs. They are ~ 22 nt long, single-stranded ncRNAs capable of post-transcriptional regulation of gene expression. Altered expression of miRNAs in tissues, followed by deregulation of their target genes, has been implicated in the development and progression of various cancers. Further studies have established that miRNAs are secreted into body fluids, primarily packaged into exosomes [4]. Circulating or exosome-bound miRNAs have therefore been proposed to carry organ- or disease-specific signatures, and many studies have been initiated to evaluate the potential of circulating miRNAs as non-invasive biomarkers for specific diseases.

Similar efforts have been made in pancreatic cancer. Independent studies have identified miR-21, miR-210, miR-155, miR-200a/b, miR-196a/b etc. in serum or plasma of PDAC patients, as well as different miRNAs in tumour tissues [5,6,7]. However, the major drawbacks of these studies are that few miRNAs are common between them and that not all of them compared serum/plasma expression with the corresponding changes in pancreatic tumour tissue. Furthermore, the small sample sizes of individual studies are also a problem. We therefore decided to perform a meta-analysis in which results from multiple individual studies were combined in order to increase statistical power. We also performed a serum miRNA microarray experiment to determine relative miRNA expression changes between normal individuals and PDAC patients in our own cohort, and its results were included in the meta-analysis along with other serum datasets. We then carried out a second meta-analysis to identify the key miRNA changes present in pancreatic tumor tissue and compared the results of the two meta-analyses to derive the PDAC-specific meta-signature of miRNAs in the serum of these patients. The exosomal contribution to the serum miRNA pool was further assessed by comparing our findings with results already available in databases such as Exocarta and miRandola [8, 9]. Finally, because these miRNAs are altered in both tumor tissue and serum, we investigated the consequences of these alterations by identifying target genes and relevant pathways through which the pathogenesis could be explained.

Methods

Patients and blood collection

The pancreatic ductal adenocarcinoma patients were recruited from the School of Digestive and Liver Diseases, Institute of Post Graduate Medical Education and Research (IPGME&R), and the age-matched normal individuals were healthy volunteers recruited from Indira Gandhi Memorial Hospital. Approval was obtained from the Institutional Ethics Committees of all the institutions, and written informed consent was obtained from all recruits prior to the study. From each recruit, 5 ml of peripheral venous blood was collected in vacutainer serum tubes (BD, USA) and processed within 1 h of blood sampling. After the clot had formed, the tubes were centrifuged at 1500×g for 10 min, and the supernatant was transferred into a new tube and centrifuged again at 2000×g for 10 min. Serum was collected and stored frozen in aliquots at − 80 °C. Only samples without any indication of haemolysis at any stage of serum preparation were used in the study.

Serum RNA isolation and miRNA microarray

Total serum RNA enriched for small RNAs was isolated using the miRNeasy kit (Qiagen) following the glycogen method, and the Affymetrix miRNA 4.0 platform was used for the microarray. Probe hybridization was performed at 48 °C for 16 h at 60 rpm. An Affymetrix 3000 7G scanner was used for data acquisition.

Selection of datasets

Datasets were searched in GEO and ArrayExpress using the keywords ‘Pancreatic Cancer’, ‘miRNA profiling’ and ‘microarray’. Datasets using ‘serum’ as source were included in “Serum” datasets group and similarly datasets using ‘tissue’ as source were included in “Tissue” datasets group. In “Tissue” datasets group, only those datasets were selected where the tissue samples were fresh frozen by liquid N2 or stabilized by lysis buffer. Datasets were excluded if data was obtained from some other source e.g. blood, saliva etc. From the datasets, ‘Pancreatic Cancer’ samples were defined as ‘cases’ and ‘healthy control’ samples were defined as ‘controls’.

Processing of datasets

Datasets were processed individually, and unsupervised analysis was performed in R. Processing included normalization by a method appropriate to the platform when raw data were used; this step was skipped for datasets that were already processed. Unsupervised hierarchical clustering and PCA were used to identify and remove outlier samples.
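The outlier screening described above was carried out in R with hierarchical clustering and PCA. Purely as an illustration, the Python sketch below uses a simpler stand-in: it flags a sample whose median Pearson correlation with all other samples is low. The function names, the 0.8 threshold and the toy expression values are assumptions made for this example and are not taken from the original analysis.

```python
from math import sqrt
from statistics import mean, median


def pearson(x, y):
    """Pearson correlation between two equally long lists of numbers."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def flag_outlier_samples(profiles, min_median_corr=0.8):
    """Return sample names whose median correlation with the rest is low.

    profiles: dict mapping sample name -> list of log2 expression values,
    with miRNAs in the same order for every sample.
    min_median_corr: hypothetical threshold, chosen for illustration only.
    """
    flagged = []
    for name, values in profiles.items():
        corrs = [pearson(values, other)
                 for other_name, other in profiles.items()
                 if other_name != name]
        if median(corrs) < min_median_corr:
            flagged.append(name)
    return flagged


# Toy data: three concordant samples and one deliberately discordant one.
profiles = {
    "case_1": [8.1, 5.2, 3.3, 9.0, 6.1],
    "case_2": [8.0, 5.0, 3.5, 9.2, 6.0],
    "ctrl_1": [7.9, 5.4, 3.1, 8.8, 6.3],
    "ctrl_2": [3.0, 9.1, 8.7, 2.9, 5.0],
}
outliers = flag_outlier_samples(profiles)
print(outliers)
```

In practice, a flagged sample would be inspected on the clustering dendrogram and the PCA plot before being removed, since a correlation threshold alone cannot separate technical outliers from real biological differences.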

Meta-analysis

After the initial processing, the rank product method was used for the meta-analysis, as implemented in the R Bioconductor package ‘RankProd’ [10,11,12]. This package can combine datasets of different origins (meta-analysis) to increase the power of identification. The expression data obtained after normalization and outlier removal for each dataset were merged into a combined expression data file, and the origin and disease status of the samples were specified in a second file. Both files were used as input to RankProd to obtain DEmiRs based on the percentage of false prediction (PFP). A cut-off of PFP < 0.05 was used. This method was applied to the serum and tissue dataset groups separately to obtain lists of differentially regulated miRNAs in serum and tissue, respectively.
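To make the idea behind the rank product statistic concrete, the following Python sketch ranks miRNAs by log fold change within each dataset, combines the ranks across datasets as a geometric mean, and estimates PFP by permuting ranks within each dataset. It is a simplified illustration and not the RankProd implementation: the real package works on pairwise case/control ratios and has its own permutation scheme. All function names and the toy values are assumptions.

```python
import random
from math import exp, log
from statistics import mean


def log_fold_changes(dataset):
    """dataset: dict miRNA -> (case_values, control_values), log2 scale."""
    return {m: mean(case) - mean(ctrl) for m, (case, ctrl) in dataset.items()}


def ranks_descending(scores):
    """Rank 1 is the largest score, i.e. the most upregulated miRNA."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {m: i + 1 for i, m in enumerate(ordered)}


def rank_products(rank_tables):
    """Geometric mean of each miRNA's ranks across all datasets."""
    k = len(rank_tables)
    return {m: exp(sum(log(t[m]) for t in rank_tables) / k)
            for m in rank_tables[0]}


def estimate_pfp(rank_tables, n_perm=500, seed=0):
    """Permutation estimate of the percentage of false prediction (PFP)."""
    rng = random.Random(seed)
    observed = rank_products(rank_tables)
    mirnas = list(observed)
    null_values = []
    for _ in range(n_perm):
        shuffled = []
        for table in rank_tables:
            perm = list(table.values())
            rng.shuffle(perm)  # break the miRNA-to-rank link in this dataset
            shuffled.append(dict(zip(mirnas, perm)))
        null_values.extend(rank_products(shuffled).values())
    pfp = {}
    ordered = sorted(mirnas, key=observed.get)
    for position, m in enumerate(ordered, start=1):
        # Expected number of null rank products at least as extreme.
        expected_false = sum(v <= observed[m] for v in null_values) / n_perm
        pfp[m] = min(1.0, expected_false / position)
    return observed, pfp


# Toy input: three datasets of different origin, four hypothetical miRNAs.
datasets = [
    {"miR-A": ([9.0, 9.2], [6.0, 6.1]), "miR-B": ([5.0, 5.1], [5.0, 5.2]),
     "miR-C": ([4.0, 4.1], [6.0, 6.2]), "miR-D": ([7.0, 7.3], [6.5, 6.4])},
    {"miR-A": ([8.5, 8.7], [6.2, 6.0]), "miR-B": ([5.5, 5.3], [5.0, 5.0]),
     "miR-C": ([3.9, 4.2], [5.8, 6.0]), "miR-D": ([6.9, 7.0], [6.8, 7.0])},
    {"miR-A": ([9.5, 9.1], [7.0, 7.2]), "miR-B": ([5.2, 5.0], [5.1, 5.3]),
     "miR-C": ([4.5, 4.4], [6.1, 6.3]), "miR-D": ([7.4, 7.6], [6.6, 6.8])},
]
rank_tables = [ranks_descending(log_fold_changes(d)) for d in datasets]
observed, pfp = estimate_pfp(rank_tables)
for m in sorted(observed, key=observed.get):
    print(m, round(observed[m], 3), round(pfp[m], 3))
```

Because ranks are computed within each dataset before being combined, differences in platform and intensity scale between studies do not enter the statistic directly. Ranking in ascending order instead would give the corresponding list of downregulated miRNAs.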

DEmiR selection

The differentially expressed miRNAs obtained in serum and tissue were then compared with each other using Venny v2.1 [13]. The miRNAs present in both lists were considered likely to originate from pancreatic tissue and to be secreted into serum, and were selected for further study. Selected miRNAs were additionally checked for being reported in the Pancreatic Expression Database (PED), a database which catalogues various biomolecules, such as RNA and protein, associated with pancreatic cancer as reported in the published literature [14, 15]. We also checked whether the selected miRNAs were secreted in exosomes using the databases Exocarta and miRandola, as described in Fig. 1. Exocarta is an exosome database which lists the contents of exosomes across different species [8, 16,17,18]. miRandola is a database containing manually curated information on different types of extracellular circulating non-coding RNA [9].
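The comparison described above amounts to a direction-aware set intersection followed by a lookup in an exosome catalogue. The Python sketch below illustrates this with placeholder miRNA names. The function names and the toy inputs are assumptions, and the actual comparison was done with Venny and the online databases.

```python
def shared_by_direction(serum, tissue):
    """Return miRNAs deregulated in the same direction in both lists.

    serum, tissue: dict mapping miRNA name -> "up" or "down".
    """
    shared = {"up": set(), "down": set()}
    for mirna, direction in serum.items():
        if tissue.get(mirna) == direction:
            shared[direction].add(mirna)
    return shared


def annotate_exosomal(mirnas, exosome_catalogue):
    """Mark each miRNA as present (True) or absent (False) in the catalogue."""
    return {m: (m in exosome_catalogue) for m in sorted(mirnas)}


# Placeholder names, not results from the study.
serum = {"miR-x1": "up", "miR-x2": "up", "miR-x3": "down", "miR-x4": "down"}
tissue = {"miR-x1": "up", "miR-x2": "down", "miR-x3": "down", "miR-x5": "up"}
exosome_catalogue = {"miR-x1", "miR-x9"}

shared = shared_by_direction(serum, tissue)
exosomal = annotate_exosomal(shared["up"] | shared["down"], exosome_catalogue)
print(shared)
print(exosomal)
```

Comparing the up and down lists separately means that a miRNA raised in serum but lowered in tissue (miR-x2 in the toy data) is not counted as shared.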

Fig. 1 Schematic flowchart representing the study design followed in the study to identify serum specific miRNAs altered in PDAC

Target identification and selection

Experimentally validated targets for the selected miRNAs were identified using miRNet [19, 20], a web tool which provides statistical and functional support for miRNA studies. Based on the expression status of the target genes, a hypergeometric test was performed to identify miRNAs enriched for target genes regulated in the reciprocal direction. Only for those miRNAs found to be statistically significant were target genes extracted whose expression changed in the direction opposite to that of the miRNA. A miRNA-gene network was then created in Cytoscape using miRNAs as source nodes and genes as target nodes [21].
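As an illustration of the hypergeometric test mentioned above, the Python sketch below computes the upper-tail probability of seeing at least k reciprocally regulated genes among the n targets of one miRNA, given K such genes in a background of N genes. How the background and the counts were defined in the original analysis is not stated, so the parameterization, the function name and the example numbers are assumptions.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) for a hypergeometric draw.

    N: number of genes in the background (assumed: genes with known
       expression status in PDAC)
    K: background genes regulated opposite to the miRNA
    n: validated targets of the miRNA that lie in the background
    k: targets that are regulated opposite to the miRNA
    """
    total = comb(N, n)
    upper = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i)
                     for i in range(k, upper + 1))
    return favourable / total


# Hypothetical counts for one upregulated miRNA.
p_value = hypergeom_upper_tail(k=30, N=2000, K=400, n=80)
print(p_value)
```

A small p-value indicates that the targets of the miRNA contain more reciprocally regulated genes than expected by chance, which is the enrichment criterion described in the text.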

Biological annotation of genes

Biologically relevant pathways were identified using the web tools Enrichr and GeneMANIA. Enrichr is an open-source, freely available enrichment analysis web tool [22, 23]. We used it to identify GO and KEGG pathways using our genes of interest as input. Independently, we used GeneMANIA to identify the functions of our genes of interest. GeneMANIA is a user-friendly web interface for predicting the function of genes using databases such as GEO, BioGRID and Pathway Commons [24, 25]. Network analysis of the miR-gene interactions present in significantly enriched KEGG pathways was performed using miRNet.

Identification of transcription factors

A list of transcription factors was obtained from TcoF-DB v2 [26, 27] and compared with the upregulated and downregulated genes to identify deregulated genes which could also act as transcription factors. The result was validated using the transcription factor databases TRRUST v2 [28, 29] and TF2DNA [30]; a gene was considered a transcription factor only when it was present in at least two of the three databases.
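The two-out-of-three rule described above can be expressed as a simple vote count across the databases. The following Python sketch shows one way to do this. The gene symbols and database contents are placeholders, and the function name is an assumption.

```python
from collections import Counter


def consensus_tfs(genes, databases, min_support=2):
    """Return genes listed as transcription factors in enough databases.

    genes: iterable of deregulated gene symbols
    databases: dict mapping database name -> set of TF gene symbols
    min_support: minimum number of databases that must list the gene
    """
    gene_set = set(genes)
    votes = Counter()
    for tf_list in databases.values():
        for gene in gene_set & set(tf_list):
            votes[gene] += 1
    return sorted(g for g, v in votes.items() if v >= min_support)


# Placeholder content, not the real database entries.
databases = {
    "TcoF-DB": {"GENE1", "GENE2", "GENE3"},
    "TRRUST": {"GENE2", "GENE3", "GENE4"},
    "TF2DNA": {"GENE3", "GENE5"},
}
deregulated = ["GENE1", "GENE2", "GENE3", "GENE6"]
tfs = consensus_tfs(deregulated, databases)
print(tfs)
```

Requiring agreement between at least two sources reduces the chance that an annotation error in a single database is carried into the final list.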

Results

Description of selected datasets and overall plan

We performed two meta-analyses in this study. The first used three miRNA microarray datasets to determine the miRNA meta-signature in the serum of PDAC patients compared with normal individuals. The second used five miRNA microarray datasets to determine miRNA deregulation in PDAC tumor tissues compared with normal pancreas. Table 1 summarizes these datasets. The datasets used in the first meta-analysis were PC_NAN_SG1 (our experimental result, accession number GSE140196), GSE59856 [31] and GSE85589 (unpublished result), all of which complied with the selection criteria described in the ‘Methods’ section. The first dataset (PC_NAN_SG1) was the result of our own miRNA microarray experiment, in which serum samples from 2 PDAC patients and 2 normal individuals were compared for differential miRNA expression using the Affymetrix Multispecies miRNA-4.0 Array. The second dataset (GSE59856) compared serum miRNA expression levels in 100 pancreatic cancer patients and 150 healthy normal individuals using the 3D-Gene Human miRNA platform, while the third dataset (GSE85589) evaluated the serum miRNA expression profile in 80 pancreatic cancer patients and 18 healthy normal individuals using the same array platform as ours.

Table 1 Information on datasets used in this study

On the other hand, five datasets were selected following the inclusion and exclusion criteria for meta-analysis of miRNAs deregulated in pancreatic tumor tissues. GSE24279 was the first dataset comparing miRNA expression profile from 136 pancreatic tumor tissues and 22 normal pancreas using array platform from Febit [32]. While the second dataset, GSE32678, compared the same from 18 pancreatic cancer patients and 4 normal individuals using miRCURY LNA array [33], the third dataset (GSE41369) used Nanostring nCounter array platform with samples from 9 pancreatic cancer patients and 7 normal individuals [34]. The last two datasets (GSE43796 and E-MTAB-753) used Agilent and Affymetrix GeneChip miRNA array respectively with sample sizes of 6 Pancreatic cancer/ 3 normal and 17 pancreatic cancer/ 17 normal respectively [35, 36].

The overall plan of the study is described in Fig. 1. The top-ranked miRNAs from both meta-analyses were compared to obtain the list of common miRNAs, which might represent the fraction of miRNAs deregulated in pancreatic tumors and secreted into the circulation. Furthermore, target mRNAs of these miRNAs were obtained, and a miR-gene interaction list was derived based on their expression in pancreatic tumors. Subsequently, using this miR-gene interaction information, network analysis was performed and perturbed biological pathways were identified.

Cluster analysis of the datasets

Unsupervised hierarchical cluster analysis was performed for all the datasets used for serum and tissue miRNA analysis. Principal component analysis (PCA) was also performed, followed by removal of outlier samples from these datasets in order to minimize non-specific effects during further processing. Figure 2a shows the PCA of the serum microarray datasets, while Fig. 2b shows the same for the tissue miRNA datasets. In both panels, normal individuals and pancreatic cancer patients fall into separate clusters.

Fig. 2 Principal component analysis of case and control samples from the datasets used for meta-analysis. a Three datasets with serum miRNA expression profiling and b Five datasets with tissue miRNA expression profiling

miRNA meta-signature of pancreatic ductal adenocarcinoma

Using the rank product method, differentially expressed miRNAs (DEmiRs) were identified. DEmiRs were selected based on their percentage of false prediction, or PFP (cut-off PFP < 0.05), giving a meta-signature of miRNAs in pancreatic cancer. This meta-signature has two components, as we performed two different meta-analyses. Our analysis of serum miRNAs differentially expressed in PDAC patients, based on three datasets, yielded 406 upregulated and 401 downregulated miRNAs. The result is shown in Fig. 3a, where the number of genes (miRNAs) is plotted against the estimated PFP and the miRNAs identified are shown in red. Similarly, the meta-analysis of five datasets with tissue miRNA profiling yielded 77 upregulated and 53 downregulated miRNAs, as shown in Fig. 3b. Lists of all differentially expressed miRNAs (both upregulated and downregulated) in serum and tissue are provided in Additional file 2: Table S1a, b and Additional file 3: Table S2a, b, respectively. The analysis package itself accounts for the heterogeneity between the samples.

Fig. 3 Selection of miRNAs within PFP cut-off of 0.05. Number of genes (miRNAs) in x-axis is plotted against estimated PFP (percentage of false prediction) in y-axis. a shows results from serum and b shows results from tissue. Red colour in figure represents genes falling within PFP cut-off of 0.05

Identification of miRNAs secreted in serum of PDAC patients

A subset of the serum miRNAs found to be characteristically altered in PDAC patients is expected to come directly from the pancreatic tumor itself, either through exosomal secretion or from lysed tumour cells. We were interested in identifying this fraction in order to obtain a true picture of tumour-derived miRNAs in serum. Our two meta-analyses yielded two different lists of differentially expressed miRNAs, in serum and in pancreatic tumour tissue of PDAC patients. We compared the upregulated and downregulated miRNAs between these lists separately. This comparison resulted in 17 upregulated and 14 downregulated miRNAs common to both lists, which are therefore believed to reflect tissue expression changes in the serum of the patients (Table 2). We further undertook an extensive literature search for evidence of the involvement of these miRNAs in pancreatic cancer [36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70]. While the involvement of many of them in PDAC had already been reported, miRNAs such as let-7f-5p, miR-1260b, miR-1914-3p, miR-30e-5p, miR-3137 and miR-3652 were found to be deregulated in PDAC for the first time. Moreover, we wanted to know how many of these secretory miRNAs are part of exosomal cargo. Comparison with the miRNAs reported in ExoCarta and miRandola showed that, apart from miR-3162-5p, all the other 30 miRNAs were detectable as exosomal cargo in serum in different experimental set-ups. We thus generated a list of upregulated and downregulated miRNAs that are altered in pancreatic tumor tissue and most likely secreted into the serum of the patients through exosomes.

Table 2 Information on miRNAs, differentially expressed in both serum and tissues, as found in this study

Identification of validated target genes of selected miRNAs

The set of miRNAs altered in PDAC, as obtained from our combined meta-analysis, is likely to perform important functions in the development and progression of the disease. The first step in elucidating their role is to identify the genes targeted by these miRNAs. We focused only on experimentally validated targets and chose the web tool miRNet, which provides experimentally validated target information derived from high-throughput experiments such as CLASH, PAR-CLIP and microarray, as well as from qPCR and reporter assays. We found validated target information for 21 of the 31 deregulated miRNAs. A total of 5935 validated targets (2920 genes targeted by 11 upregulated miRNAs and 4340 genes targeted by 10 downregulated miRNAs) were identified. The entire lists of upregulated and downregulated target genes are shown in Additional file 4: Table S3a and b, respectively. Thus, we obtained a list of experimentally validated targets for 21 deregulated miRNAs specific for PDAC, to be explored further for their involvement in the disease.

Selection of miR-gene pairs

The list of experimentally validated target genes was further investigated for deregulation in PDAC. It is well recognized that miRNA regulation of gene expression differs greatly between tissues and diseases, so not all target genes of the altered miRNAs are necessarily involved in PDAC. Furthermore, even if the expression of a target gene is altered in pancreatic cancer, it might not be an actual target of that particular miRNA unless their expression is inversely correlated. Hence, we used the Pancreatic Expression Database (PED) to select the specific subset of target genes which are reported to be upregulated or downregulated in PDAC and whose expression is inversely correlated with that of the miRNAs. A hypergeometric test was conducted, and all 21 miRNAs were found to be statistically significant (Table 3). Next, target genes whose expression changed in the direction opposite to that of the miRNA were extracted for those miRNAs. We obtained 1149 such miRNA-gene pairs. The top 300 interactions (150 upregulated miR - downregulated target, 150 downregulated miR - upregulated target) were used to construct a miR-gene interaction network in Cytoscape, giving a holistic view of the concerted interaction of different miRNAs targeting genes relevant to PDAC (Figs. 4 and 5). All the miR-gene interaction pairs can be found in Additional file 5: Table S4.
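The selection of inversely regulated miRNA-gene pairs can be sketched as a filter over the validated target lists, as in the Python example below. The names used are placeholders and the function name is an assumption. The criterion used in the study to rank the top 300 interactions is not described in the text, so ranking is deliberately left out of the sketch.

```python
def inverse_pairs(mirna_direction, targets, gene_direction):
    """Keep (miRNA, gene) pairs regulated in opposite directions.

    mirna_direction: dict miRNA -> "up" or "down"
    targets: dict miRNA -> iterable of validated target genes
    gene_direction: dict gene -> "up" or "down" (for example, from an
        expression database); genes missing from it are skipped
    """
    pairs = []
    for mirna, direction in mirna_direction.items():
        wanted = "down" if direction == "up" else "up"
        for gene in targets.get(mirna, ()):
            if gene_direction.get(gene) == wanted:
                pairs.append((mirna, gene))
    return sorted(pairs)


# Placeholder inputs, not data from the study.
mirna_direction = {"miR-x1": "up", "miR-x3": "down"}
targets = {"miR-x1": ["GENE1", "GENE2"], "miR-x3": ["GENE2", "GENE3", "GENE4"]}
gene_direction = {"GENE1": "down", "GENE2": "up", "GENE3": "up"}

pairs = inverse_pairs(mirna_direction, targets, gene_direction)
print(pairs)
```

Each retained pair can then be exported as one edge of a network, with the miRNA as source node and the gene as target node.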

Table 3 Results from Hypergeometric test used to find miRNAs enriched with Target genes in inverse direction of expression
Fig. 4 Interaction network between downregulated miRNAs and their target genes. miR-gene interaction network with downregulated miRNAs and their upregulated target genes. Colour scale is in increasing order of LFC from green to red i.e. green is downregulated and red is upregulated. Oval shape represents miRNA, rectangle represents target genes and triangles represent transcription factors which are being targeted by miRNAs

Fig. 5 Interaction network between upregulated miRNAs and their target genes. miR-gene interaction network with upregulated miRNAs and downregulated target genes. Colour scale is in increasing order of LFC from green to red i.e. green is downregulated and red is upregulated. Oval shape represents miRNA, rectangle represents target genes and triangles represent transcription factors which are being targeted by miRNAs

Analysis of biological processes and pathways

Genes do not function in isolation; rather, gene products cross-talk with each other as parts of pathways regulating specific biological processes. Hence, to gain a comprehensive understanding of the role of the deregulated genes in the disease pathophysiology, we need to study them together to learn how their alteration could perturb these pathways. We used multiple web tools for this analysis. One of them, Enrichr, performs enrichment analysis, on the basis of which significantly associated gene ontology biological processes and KEGG pathways were selected (adjusted p-value < 0.05). GeneMANIA is another web tool, which similarly predicts the functions of genes and was used for an analogous purpose. Tables 4, 5 and 6 show representative biological processes (GO), pathways (KEGG) and functions (GeneMANIA), respectively, along with the genes involved in them. The complete gene annotation can be found in Additional file 6: Table S5a, b and c. Comparison of the pathways and biological processes among these three categories highlights some important aspects. The extracellular matrix (ECM) emerges as a very important component, and alterations in pathways such as ECM-receptor interaction, focal adhesion and proteoglycan composition indicate the importance of tumour-stroma interaction. A second cluster comprised classical signaling pathways such as TP53, HIF-1, PI3K-Akt, Jak-STAT, FoxO, mTOR, TGF-beta and NF-kB, which are altered in most cancers. We also found alterations in the AGE-RAGE signaling pathway, the insulin resistance signaling pathway and the prolactin signaling pathway, which have been less discussed in pancreatic cancer. Interestingly, there were a few pancreas-specific pathways, some of which were clearly indicative of a chronic inflammatory process developing into malignancy. We further performed miR-gene network analysis using miRNet and selected the most important KEGG pathways, shown in Fig. 6. The main network from which the sub-networks were extracted can be found in Additional file 1: Figure S1.
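The adjusted p-value threshold mentioned above implies a multiple-testing correction across all pathways tested. The Python sketch below shows a Benjamini-Hochberg adjustment, which is assumed here to be the kind of correction behind the adjusted p-values reported by the enrichment tool. The pathway labels and raw p-values are invented for the example.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw enrichment p-values for four pathways.
raw = {"pathway_A": 0.01, "pathway_B": 0.04,
       "pathway_C": 0.03, "pathway_D": 0.005}
names = list(raw)
adjusted = dict(zip(names, benjamini_hochberg([raw[n] for n in names])))
significant = sorted(n for n, p in adjusted.items() if p < 0.05)
print(adjusted)
print(significant)
```

With many pathways tested at once, filtering on the adjusted rather than the raw p-value keeps the expected proportion of false discoveries among the selected pathways under control.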

Table 4 List of 20 significant ‘GO Biological process’ that were obtained using miRNA-targeted genes in Pancreatic Cancer
Table 5 List of 20 significant ‘KEGG pathways’ that were obtained using miRNA-targeted genes in Pancreatic Cancer
Table 6 List of 20 significant ‘GeneMANIA functions’ that were obtained using miRNA-targeted genes in Pancreatic Cancer
Fig. 6 Sub-networks showing miRNA-gene interactions. Sub-networks depicting miRNA-gene interactions encompassing some of the significantly enriched KEGG pathways; red circle denotes up-regulated genes; green circles denote down-regulated genes and squares represent miRNA

Transcription factors being targeted by DEmiRs

To gain further insight into the functions of the miRNA target genes, we asked how many of them are themselves transcription factors (TFs). The rationale is that TFs can alter the expression of an array of genes, leading to a strong regulatory effect. Targeting of a TF by a miRNA that is significantly altered in a disease should therefore have a much broader impact on downstream pathways and on the functioning of the cell. Hence, regulation of TF expression by miRNAs plays a very important role in the disease process. We used different web tools (as described in Methods) to determine how many of the pancreatic cancer specific targets of the DEmiRs could also act as transcription factors. We identified 38 upregulated genes and 8 downregulated genes which are predicted to act as transcription factors according to TcoF-DB v2, TRRUST v2 and TF2DNA, shown in Tables 7 and 8. We also identified 15 upregulated genes and 5 downregulated genes acting as transcription cofactors according to TcoF-DB v2, as shown in Additional file 7: Table S6.

Table 7 List of downregulated genes (targeted by upregulated miRNAs) which are reported to act as transcription factors in at least two of the three TF databases, i.e. TF/TcoF DB, TRRUST and TF2DNA
Table 8 List of upregulated genes (targeted by downregulated miRNAs) which are reported to act as transcription factors in at least two of the three TF databases, i.e. TF/TcoF DB, TRRUST and TF2DNA

Discussion

Circulating miRNAs have attracted much attention because of their potential to function as disease-specific, non-invasive biomarkers. It is not always necessary to know the origin or tissue specificity of a miRNA to designate it as a biomarker for a specific disease. However, the subset of circulating miRNAs in serum or plasma which is actually secreted from the diseased tissue or organ provides additional information about the regulation of gene expression within that tissue, in addition to serving as a biomarker. Unfortunately, few attempts have been made to combine this information in pancreatic ductal adenocarcinoma. Many research papers, reviews and meta-analyses have separately analyzed the altered miRNA profile in serum and in tumour tissue of PDAC patients [5,6,7], but none has compared the two, and individual studies also differed considerably in their findings. While miR-21, miR-155, miR-1290, miR-210, miR-217, miR-141, miR-222, miR-196a, miR-494, miR-148b, miR-216, miR-375, miR-29c, miR-96 etc. were among the most frequently deregulated in tumour tissues, miR-10b, miR-18a, miR-20a, miR-21, miR-139-3p, miR-185, miR-210 and miR-196a were frequently altered in the serum of patients. To address this discrepancy between studies in differentially expressed serum and tissue miRNAs, we chose to focus on the fraction of serum miRNAs which is most likely secreted from the pancreatic tumour itself. Hence, the common miRNAs we find here have functional importance in the tissue and also have the potential to function as serum-based biomarkers for the detection of PDAC.

The variation between individual studies could primarily be attributed to differences in sample size, sample processing methods, populations, choice of platform etc. The normalization method for each dataset was selected based on the recommendations of the platform manufacturer and on the methods commonly used or preferred by researchers working with that platform. Combining all the samples and analyzing them with the same statistical methodology should deliver results that are more accurate than those of the individual analyses. The ‘RankProd’ package accounts for the variation arising from multiple studies, thereby addressing clinical heterogeneity. Furthermore, the identification of novel miRNAs deregulated in PDAC could be attributed to the meta-analysis: a strength of meta-analysis is that it can detect miRNAs which could not be detected in individual studies because of low sample size.

We selected experimentally validated targets only and did not further analyze miRNAs without any experimentally validated targets. The number of miRNAs was thereby reduced from 31 to 21, and Table 3 lists these 21 miRNAs, which can be designated the ‘meta-signature’ of miRNAs for pancreatic cancer detectable in both serum and tumour tissue. Apart from miR-4317, all of the up-regulated miRNAs selected in our meta-analysis have already been reported to be overexpressed in PDAC. While miR-103a-3p has been shown to act as a ‘driver’ of pancreatic cancer, the roles of miR-191-5p, miR-210-3p, miR-23a-3p, miR-103a-3p, miR-92a and miR-320a/b have also been functionally characterized at the molecular level. miR-210-3p has been shown to regulate the interaction between pancreatic stellate cells and pancreatic cancer cells, a phenomenon that is very important for the development of PDAC. miR-320a has been shown to promote 5-FU resistance in pancreatic cancer cells by targeting PDCD4 and, most interestingly, miR-320b overexpression has been correlated with late-stage chronic pancreatitis, linking this chronic inflammatory disease of the pancreas to PDAC. Similarly, miR-23a-3p and miR-103a-3p have been shown to target Epithelial Splicing Regulatory Protein 1 and the oncogene GPRC5A, respectively. miR-99a-5p has been found to be overexpressed in chemotherapy-resistant PDAC [71]. Although miR-652-3p is overexpressed and responsible for proliferation and metastasis in other cancers [72], one report showed its downregulation in PDAC [73]. Our identification of overexpressed miR-652-3p therefore calls for more experimental insight into the mechanism. miR-92a-3p is a known oncogene in other cancers, and its role in PDAC has only been shown in pancreatic cancer cells, where it targets the JNK signaling pathway inhibitor DUSP10 and promotes JNK signaling and tumorigenesis. Here, however, we show upregulation of miR-92a in PDAC patients for the first time. Likewise, there are no previous reports of the involvement of miR-423-3p in PDAC, and the roles of miR-4317, miR-423-3p and miR-92a-3p in PDAC should be explored in further detail.

On the other hand, we also observed down-regulation of known tumour suppressor miRNAs in pancreatic cancer, such as let-7f-5p, miR-16-5p, miR-126-3p and miR-30d-5p. How the down-regulation of these miRNAs could affect tumourigenesis in the pancreas has already been functionally characterized for many of them: miR-126-3p targets ADAM9, KRAS and the CRK oncogene, while miR-26a suppresses cyclin E2 mediated progression of the cell cycle in pancreatic cancer. Furthermore, we observed reduced expression of three other miRNAs which are known to function as tumour suppressors in other cancers [74,75,76,77,78]; however, for each of them there was one report of up-regulation in pancreatic cancer tissues or cell lines, which requires additional exploration. Interestingly, we further report for the first time that miR-30b-5p and miR-30a-5p, which are known tumour suppressors in other cancers, are also down-regulated in pancreatic cancer.

Moreover, it was of concern that all the targets of one particular miRNA might not actually get targeted in PDAC. Hence, important aspect was to identify the target genes of these 21 miRNAs which actually are deregulated in PDAC in the inversely correlated direction with miRNAs. Pancreatic Expression Database was our choice and after comparing the results with the database entries, we constructed a miRNA-gene interaction table recording all possible interaction of DEmiRs with their DE-Targets in PDAC (Additional file 5: Table S4). However, identifying the miRNA-target interaction alone doesn’t explain the biology completely. The most important part was to identify the key pathways where those genes contributed and deregulation of those genes resulted in perturbation of the pathways. We used EnrichR and GeneMANIA and got almost similar pathways altered and enriched in both cases. Among them, pathways like AGE-RAGE pathways, prolactin signaling pathway and insulin resistance signaling pathway were of much interest as they were not explored in PDAC in much detail. Advanced glycation end products (AGE) are complex group of compounds and interaction of AGEs with their specific receptors (RAGEs) has important functional implications. RAGE has been found to be overexpressed in multiple cancers like colorectal, lung, oral, brain, prostate, melanoma, lymphomas and ovarian cancers [79]. RAGE is known as a multi-ligand receptor, as not only AGE but several other ligands like, HMGB1, and S-100 (calgranulins) etc. also bind to RAGE. RAGE activation is known to promote tumor vasculature, tumor growth and invasion through alteration of various pathways [80]. RAGE signaling has also been implicated earlier in pancreatic cancer development where loss of RAGE function inhibited the development of PDAC in mouse models [81,82,83]. The immuno-histochemical analysis confirmed the expression of RAGE and its other ligands S100P, S100A4, and HMGB-1 in human PDAC [84]. 
However, the interaction of AGEs with RAGE and its role in pancreatic cancer have not been explored in much detail. Prolactin is a peptide hormone secreted by the anterior pituitary gland. Its close relationship to growth hormone and its signaling through the tumour-promoting JAK-STAT pathway strengthen the claim that prolactin has tumorigenic properties. The role of prolactin and the prolactin receptor has been studied most extensively in breast and prostate cancer. Prolactin signaling has also been implicated in hepatocellular carcinoma, colorectal cancer, ovarian cancer and endometrial cancer [85]. Interestingly, prolactin signaling has been shown to promote metastasis by inducing cell motility and to confer resistance to chemotherapeutic agents on cancer cells [86,87,88,89]. Furthermore, prolactin signaling has been found to facilitate pancreatic beta cell development and acinar cell growth [90]. However, its role in pancreatic cancer has not been discussed before, and we report the involvement of prolactin signaling in pancreatic ductal adenocarcinoma for the first time. Another interesting pathway that emerged from our analysis was the insulin resistance signaling pathway. Insulin resistance (IR) and type II diabetes mellitus (T2DM) have been associated with different types of malignancy, including hepatic, colorectal, breast, endometrial and pancreatic cancers [91]. Interestingly, IR and T2DM play a protective role in prostate cancer [92]. A hypothesis linking IR to pancreatic cancer has existed for quite some time, but little work had been done in this area until a recent study using a cohort of approximately 29,000 patients identified IR as a risk factor for PDAC [93]. Hyperinsulinemia associated with IR, together with the known mitogenic activity of insulin, could be responsible for this process. Our results support this hypothesis and provide molecular evidence for the claim that IR predisposes a patient to developing PDAC.
Therefore, our identification of conventional cancer-related pathways, as well as some interesting pathways not previously reported in PDAC, not only helps to elaborate the cellular mechanism of action but also opens up avenues to interrogate and target key molecules in the altered pathways for better management of the disease in future.
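The inverse-direction filter described above, which keeps a miRNA-target pair only when the target gene is deregulated in PDAC in the direction opposite to that of the miRNA, can be illustrated with a minimal sketch. This is not the pipeline used in the study: the function name, the "up"/"down" encoding and the toy identifiers (miR-A, GENE1 and so on) are all assumptions introduced purely for illustration, and no real expression data are shown.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical encoding: each miRNA or gene is labelled "up" or "down".
Direction = str
Pair = Tuple[str, str]  # (miRNA, target gene)


def inverse_pairs(
    mirna_dir: Dict[str, Direction],
    gene_dir: Dict[str, Direction],
    pairs: Iterable[Pair],
) -> List[Tuple[str, str, Direction, Direction]]:
    """Keep validated miRNA-target pairs whose directions are opposite."""
    kept = []
    for mirna, gene in pairs:
        m = mirna_dir.get(mirna)
        g = gene_dir.get(gene)
        # Skip pairs where either partner is not differentially expressed.
        if m is None or g is None:
            continue
        # Retain only up/down or down/up combinations.
        if {m, g} == {"up", "down"}:
            kept.append((mirna, gene, m, g))
    return kept


# Toy input with placeholder names (not real results).
toy_mirnas = {"miR-A": "down", "miR-B": "up"}
toy_genes = {"GENE1": "up", "GENE2": "up", "GENE3": "down"}
toy_pairs = [
    ("miR-A", "GENE1"),  # down/up: kept
    ("miR-B", "GENE2"),  # up/up: dropped
    ("miR-B", "GENE3"),  # up/down: kept
    ("miR-A", "GENE9"),  # gene not deregulated: dropped
]

for row in inverse_pairs(toy_mirnas, toy_genes, toy_pairs):
    print(row)
```

In this sketch, pairs whose target is absent from the differential expression list are discarded rather than treated as unchanged, which mirrors the requirement that both partners be deregulated.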

Furthermore, because transcription factors (TFs) drive changes in the expression of whole sets of genes, TFs that are miRNA targets are important for their ability to regulate many genes simultaneously. We wanted to find out how many of our deregulated genes were transcription factors, and careful investigation identified 46 such genes. As stated in the selection criteria, these genes have already been shown to be expressed in PDAC, mainly by high-throughput studies. We further explored them in detail for their functional significance in PDAC or, where reported, in other cancers. HNF1B and MYBL2 were also reported to be down-regulated in renal cell carcinoma, ovarian cancer and myeloid malignancies [94,95,96]. The unfolded protein response (UPR) is an important component of the endoplasmic reticulum (ER) stress response, and its most important function is to maintain the balance between recovery from the stress and cell death as its consequence [97]. ATF6 and XBP1 are two of the most important factors mediating the UPR, and their down-regulation could induce tumour cell death in aggressive cancers like PDAC, thereby acting as a regulatory mechanism in the overall process of tumourigenesis. A similar situation was seen for HOXC11, where down-regulation of the gene is known to suppress tumour formation; this could be considered the body's balancing act to check tumour growth [98, 99]. There are no reports on the role of the ESRRG, HES6 and NFIC transcription factors in PDAC. However, data from The Human Protein Atlas (https://www.proteinatlas.org/) show that reduced expression of each of these genes is associated with poor survival, indicative of disease aggressiveness [100]. Similarly, investigation of the up-regulated transcription factors identified members of the Wnt, TGF-β and NF-κB signaling pathways, which are known promoters of pancreatic carcinogenesis.
Interestingly, we identified several transcription factors, such as TRPS1, NFAT5, FoxF2, ELF4 and RLF, which have not previously been reported to be involved in PDAC but have been shown to induce EMT, angiogenesis or tumour proliferation in other organs. Among them, ELF4 has been found to be expressed during pancreatic development [101], and RLF overexpression has been correlated with poor survival in pancreatic cancer patients [100], strengthening the possibility that they are involved in the pathogenesis of PDAC, which could be explored further.

Our study also has some limitations. For the separate meta-analyses of miRNAs altered in serum and in tumour tissue, we focused only on studies in which miRNA expression was investigated using microarrays. We excluded small RNA sequencing results, thereby losing some probable candidates. However, when we examined the small RNA sequencing results, we observed that a large proportion of the identified miRNAs had no experimentally validated targets at all. As our objective was to focus only on experimentally validated targets, we are unlikely to have missed many candidates by concentrating on microarray results. Furthermore, the two types of data could not have been combined in a single analysis; the small RNA sequencing datasets would have required a separate meta-analysis, adding further complexity to the study. Another limitation is the lack of experiments to validate our findings in relevant cell lines or patient samples. However, the meta-analysis and the subsequent analyses were performed with reasonable stringency and were cross-validated against miRNet and PED. Above all, functional validation of most of the miRNAs in the 21-miRNA meta-signature has already been carried out in pancreatic cancer, as evidenced by our extensive literature search. Hence, we excluded experimental validation from this study and plan to perform it in detail in a subsequent one.

Conclusion

Here, we first identify a 21-miRNA meta-signature of PDAC that is altered in tumour tissue and also secreted into serum. We further demonstrate that, apart from their possible role as biomarkers, these miRNAs are also responsible for disease pathophysiology through deregulation of important pathways within the cell, mediated by PDAC-specific target genes.