Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression

Li, Yin; Vongsangnak, Wanwipa; Chen, Luonan; Shen, Bairong

doi:10.1186/1755-8794-7-S1-S3

Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression

Research
Open access
Published: 08 May 2014

Volume 7, article number S3, (2014)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Genomics Aims and scope Submit manuscript

Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression

Download PDF

Yin Li¹,
Wanwipa Vongsangnak¹,
Luonan Chen² &
…
Bairong Shen¹

4072 Accesses
19 Citations
Explore all metrics

Abstract

Background

Prostate cancer is one of the most common complex diseases with high leading cause of death in men. Identifications of prostate cancer associated genes and biomarkers are thus essential as they can gain insights into the mechanisms underlying disease progression and advancing for early diagnosis and developing effective therapies.

Methods

In this study, we presented an integrative analysis of gene expression profiling and protein interaction network at a systematic level to reveal candidate disease-associated genes and biomarkers for prostate cancer progression. At first, we reconstructed the human prostate cancer protein-protein interaction network (HPC-PPIN) and the network was then integrated with the prostate cancer gene expression data to identify modules related to different phases in prostate cancer. At last, the candidate module biomarkers were validated by its predictive ability of prostate cancer progression.

Results

Different phases-specific modules were identified for prostate cancer. Among these modules, transcription Androgen Receptor (AR) nuclear signaling and Epidermal Growth Factor Receptor (EGFR) signalling pathway were shown to be the pathway targets for prostate cancer progression. The identified candidate disease-associated genes showed better predictive ability of prostate cancer progression than those of published biomarkers. In context of functional enrichment analysis, interestingly candidate disease-associated genes were enriched in the nucleus and different functions were encoded for potential transcription factors, for examples key players as AR, Myc, ESR1 and hidden player as Sp1 which was considered as a potential novel biomarker for prostate cancer.

Conclusions

The successful results on prostate cancer samples demonstrated that the integrative analysis is powerful and useful approach to detect candidate disease-associate genes and modules which can be used as the potential biomarkers for prostate cancer progression. The data, tools and supplementary files for this integrative analysis are deposited at http://www.ibio-cn.org/HPC-PPIN/

Identification of prognostic markers of high grade prostate cancer through an integrated bioinformatics approach

Article 28 August 2017

Identification of key regulators in prostate cancer from gene expression datasets of patients

Article Open access 11 November 2019

Identification of key pathways and genes in PTEN mutation prostate cancer by bioinformatics analysis

Article Open access 02 December 2019

Background

Prostate cancer is the second leading cause of morbidity and mortality in men [1, 2]. In recent years, the incidence rate of prostate cancer has dramatically increased [3], and this is largely because of lack of diagnosis and treatment of the disease at the early stage [4]. Thus, the successful clinical biomarkers for early diagnosis of the presence of prostate cancer become very urgent to reduce the death risk of the prostate cancer [5, 6].

In the post-genomics era, there is an explosion of biological data and information generated from high-throughput technologies which have rapidly provided an unprecedented multi-level omics data [7]. Such transcriptomics, referred to as gene expression profiling can now comprehensively survey the entire human genomics. Moreover, enormous efforts have been made to identify biomarkers for various cancers by the analysis of different transcriptomics data [8–12]. As an example reported by our previous study, integrative transcriptomics data could be used to identify putative novel prostate cancer associated pathways, such as Endothelin-1/EDNRA trans-activation of EGFR pathway which would provide essential information for development of network biomarkers and individualized therapy strategy for prostate cancer [11–13]. Looking at the other relevant studies for cancer transcriptomics, a large scale expression study presented by Wang et al. identified a set of gene markers for prediction of metastasis for breast cancer [14] and followed by Chari et al. demonstrated an approach based on multiple concerted disruptions (MCD) analysis and identified genes and pathways in cancer [15]. Furthermore, transcriptomics could be used to identify metabolic biomarkers through alterative metabolic pathways at different cancer phases [16]. Concerning on the other levels of omics, proteomics in context of protein-protein interaction network could also be used to characterize and diagnose a pathological process [17]. As clearly reported by Ideker and Sharan [18], the indicating genes as biomarkers in complex diseases tend to cluster together on well-connected proteins interaction sub-networks. In following years, Chuang et al. also showed that it could be useful to extract co-expressed functional sub-networks for metastasis of breast cancer through integrating transcriptomics data with protein-protein interaction to obtain higher classification accuracy [19]. Later, Taylor et al. studied the altered protein interaction modularity to predict breast cancer progression by examining the biochemical structure of the interactome [20]. Besides, there were similar studies for analysis of sub-networks and/or hub proteins which had been helpful for the understanding of the metastasis of cancer at the molecular level [18].

Focusing on prostate cancer, there were some reports on identifying disease-related gene modules, sub-networks or dysfunctional pathways focused on global characteristics of interactome together with gene expression data by different novel algorithms and methods development [21–23]. Nonetheless, there are still few studies on identification of prostate cancer biomarkers for early detection of the presence as well as disease progression [20]. The relationships among the potential prostate cancer genes and associated functions as well as pathways are still poorly characterized, such as how they interacted and regulated with each other, also what they act within the network modules. These investigations are warranted for a comprehensive understanding of the molecular mechanisms underlying prostate cancer progression. Hence, it is a challenge to perform an integrative analysis of different data, which can be gene expression profiling, protein-protein interaction (PPI) data, pathway information, and clinical information, that can offer different perspectives on the biological problems in prostate cancer and further identification of potential biomarkers [24, 25].

In this study, we therefore aim to reveal candidate disease- associated genes and biomarkers for prostate cancer progression by integrative gene expression profiling and network analysis at a systematic level. We first reconstructed human prostate cancer protein-protein interaction network and used this network as a scaffold for further integrative analysed with gene expression data of prostate cancer. Here, analysis of gene expression profiling of prostate cancer was performed at different disease phases. Through modular analysis, the different modules associated with disease phases were then identified. Last but not least, we could identify significant genes through these modules which were supposed to be the gene expression signatures with highly relevant to specific phases of prostate cancer. Once the common genes identified in each of different modules were overlapped, expectedly these common genes were beneficial for uncovering of novel prostate cancer-related pathways and transcription factors which could be candidate biomarkers for prostate cancer progression. Our study hereby demonstrated a practical workflow for integrative analysis of prostate cancer at the systematic level. For the genome-wide studies, this will be a basic effort for future development and evolution in aspects of the translational biomedical informatics, which ultimately intend to improve patient outcomes and diagnostics with omics dataset through integrative systems biology [26].

Methods

Human prostate cancer protein interaction network reconstruction and annotation

The human prostate cancer protein-protein interaction network (HPC-PPIN) was initially reconstructed in order to be further used for integrative analysis as a diagram illustrated in Figure 1. To reconstruct the HPC-PPIN, we used two different types of datasets. The first dataset was the genes associated in prostate cancer derived from a collection of prostate cancer databases and other relevant resources (e.g. Dragon Database of Genes associated with Prostate Cancer (DDPC) [27], GeneGo [28], OMIM [29], KEGG [30], PGDB [31], CCDB [32], and Gene Ontology (GO) [33]).

For the second type of the dataset, it was the human protein-protein interactions data (Homo sapiens) which was downloaded from the BioGRID database [34]. Concerning on annotation of the HPC-PPIN, we used the Database for Annotation, Visualization and Integrated Discovery (DAVID) system [35, 36]. At the beginning, functional annotation clustering tool of DAVID system was applied to group annotated genes within HPC-PPIN across three GO processes underlying molecular function, biological process, and cellular component. Among three GO processes, this tool was then used to identify the enriched GO terms. In order to annotate detailed functions in context of pathways underlying metabolism, cellular process, environmental information process and genetics information process, KEGG database was used (http://www.genome.jp/kegg/pathway.html).

Prostate cancer gene expression data collection and analysis

The gene expression profiles based different platform arrays from different stages of prostate cancer (i.e. disease stages I, II, II, IV) were collected from various laboratories. Table 1 lists available information of collected gene expression profiles (431 samples) of prostate cancer progression. Since only fewer samples are available in stage I than other disease stages, stages I and II were combined into one phase (Table 1). All expression datasets were analysed for gaining statistics values. The statistical processing methods were invoked through the limma (Linear Models for Microarray Data) package in R [37, 38] and scripting under R version 2.9.0 (R Development Core Team). The limma package [37] was applied to perform moderated Student's t-test between all possible pairwise disease phases comparison i.e., early-middle phases, middle-late phases, and early-late phases, to determine significantly differential gene expression. Empirical Bayesian statistical method was applied to moderate the standard errors within each gene and then the Benjamini-Hochberg's method was applied to adjust the multi-testing [39], as well as to obtain the adjusted p-value.

Table 1 Gene expression profiles of prostate cancer used for integrative analysis#

Full size table

Modular analysis for prostate cancer progression

In order to perform modular analysis for study of disease progression, three main steps were necessarily performed. At the first step, the analysed gene expression data previously derived from pairwise disease phase comparison of prostate cancer was integrated with the reconstructed HPC-PPIN. Hereafter core sub-networks analysis and overlapping analysis as second and the third steps were then performed, respectively. Regarding on the core sub-networks analysis, they were investigated for which were shown highly active scores and top ranks based on the greedy algorithm. In this investigation, the greedy algorithm was selected for searching the core sub-networks in a large network of interactions from any pairwise disease phases comparison, where refers to a connected sub-graph of the interactome that has high significance of differential expression values [19]. To elaborate how the greedy algorithm used, originally the adjusted p-value derived from any pairwise disease phases comparison was converted to the readily form of z-score by using the inverse normal cumulative distribution (θ^-1) for scoring and ranking [40]. Afterwards the greedy algorithm by jActiveModules (jAM) plug-in as implemented in the Cytoscape [41, 42] was used to investigate and extract the significant core sub-networks under threshold of three iterations and top ten ranks. Through the end, the list of top ten ranks were merged together to gain a final core sub-network which represented for each of pairwise disease phases comparison and for each of gene expression profile. Notably, jAM was chosen as a basis for this investigation because it is a fashionable method, based on a survey of the current literature. There are several successful cases where jAM has been applied to extract the significant core sub-networks, for examples in fruit fly Drosophila [43], yeast S. cerevisiae [44], worm C. elegans [45] and human H. sapiens [19, 46].

To finalize the modular analysis, the overlapping analysis was carried out. The overlapping analysis at gene level was applied to show the number of enriched genes shared by all gene expression profiles (see Table 1) calculated based on core sub-networks analysis. For example, considering each of a final core sub-network retrieved from each of pairwise disease phase comparison analysing across all gene expression profiles, the overlapping percentage of genes was calculated between any two of the final core sub-networks derived from any two of the gene expression profiles. For the formula of the overlapping analysis, we defined the number of genes

in the final core sub-network1 as (CS¹) and the final core sub-network2 as (CS²). The overlapping percentage between the final core sub-networks was designated as the number of overlapping genes (G) divided by the number of genes in the union of (CS¹) and (CS²) with subtracted (G). It can be calculated as follows in following formula (1):

O v e r l a p p i n g p e r c e n t a g e = \frac{G}{C S^{1} + C S^{2} - G} \times 100 %

(1)

After overlapping analysis, as a result the overlapping percentage across all gene expression profiles was obtained for each of pairwise disease phase comparison. Towards all possible pairwise disease phases comparison (i.e. early-middle phases, middle-late phases, and early-late phases), three different modules associated with disease progression were eventually identified. It is very possible that each of these three modules plays important roles in dynamic changes of molecular interactions at a specific phase of the disease progression. The identified unique genes in each module were regarded as signatures at a specific phase of prostate cancer. The identified common genes in all three modules were regarded as candidate disease-associated genes.

Identified candidate prostate cancer associated genes as putative module biomarker

To validate the identified prostate cancer associated genes as putative module biomarker, we used them as a module biomarker to discriminate between control and prostate cancer samples. Support vector machine (SVM) regression proposed by Cortes and Vapnik [47] was selected due to its attractive features and high performances [48–50] for applying to the expression values of the predicted prostate cancer associated genes from the module biomarker to distinguish prostate cancer from controls. The Receiver Operating Characteristic (ROC) curve and the area under curve (AUC) were used to evaluate the efficiency of classification [51–53]. Two R packages, namely kernlab [54] and ROCR [55], were applied to build the SVM classifier and produced the ROC curves.

Validation of candidate prostate cancer associated genes by statistical methods

For the validation of candidate prostate cancer associated genes, known related genes obtained from the Cancer Gene Census database [56] (accessed on December 6, 2012), Genetic Association Database (GAD) [57] (accessed on October 27, 2012) and AnimalTFDB [58] (accessed on December 7, 2012) were used. A hypergeometric test was applied to estimate the enrichment of these candidate prostate cancer associated genes compared to the known cancer related genes. The equation of the hypergeometric test is shown as follows in (2):

P (X \geq x) = 1 - \sum_{k = 0}^{x - 1} \frac{(\begin{matrix} M \\ k \end{matrix}) (\begin{matrix} N - M \\ n - k \end{matrix})}{(\begin{matrix} N \\ n \end{matrix})}

(2)

In the above equation, N and M represents the number of genes in the expression profiles and the number of known cancer genes respectively, n and k are the number of the candidate prostate cancer associated genes that we identified, and the number of common entries between them, respectively. P represents the statistical significance of the enrichment. Random sampling was used to test the statistical significance and the same number of known cancer genes was randomly selected from Cancer Gene Census database [56], Genetic Association Database (GAD) [57] and AnimalTFDB [58] to assess the statistically significance of these known cancer genes included in the previous results. At first, the same number of genes as the candidate prostate cancer associated genes was randomly selected from the reconstructed HPC-PPIN. Subsequently, the number of known cancer genes included in the random samples was then counted. Afterwards, random sampling was repeated 10⁶ times. Then, the p-value of the candidate prostate cancer associated genes was defined as the probability that one random sampling might contain a greater or equal number of known cancer genes than in our study samples.

Functional and pathway enrichment analysis

The GeneGo, which is a commercial integrated knowledge database [59], was used for analysis of functional and pathway enrichment. The statistical significance value was calculated using hypergeometric distribution and false discovery rate (FDR) method (p value < 0.05).

Results and discussion

Reconstructed HPC-PPIN and its functional annotation

The HPC-PPIN was reconstructed from different prostate cancer databases and other relevant resources along with one directional interaction and repeat interactions removed, hereafter resulting in 5,827 interactions among 1,100 proteins. The characteristics of the reconstructed HPC-PPIN are shown in Table 2. Additional file 1(Figure S1) illustrates the distributing numbers of prostate cancer- related genes from different databases with assigned and unassigned in HPC-PPIN.

Table 2 Number of genes, proteins and interactions between pairwise disease phases comparison#

Full size table

Concerning on the annotated functions of HPC-PPIN in DAVID system, we found that the major of GO terms involved in biological process (36.78%) as illustrated in Figure 2. Considering to KEGG categories, we found that the major annotated functions were involved in genetics information process in the category of transcription regulation (41%). The results are shown in Figure 3. The reconstructed HPC-PPIN with annotated functions is shown in Additional file 2. A graphic representation of HPC-PPIN by Cytoscape [42] is presented in Additional file 3.

Modules involved in prostate cancer progression

As described in modular analysis for prostate cancer progression (Section in Materials and Methods), three different modules were obtained. The results are shown in Figure 4. As presented in the Table 2, noticeably the third module underlying early-late phases comparison contained the maximum number of genes (266 genes), in contrast to the second module underlying middle-late phases comparison contained the minimum number of genes (193 genes). This suggests that the third module has additional gene expression signature changes (82 genes) than the other two modules identified for 56 genes in the first module and 30 genes in the second module. These can be explained that cancer cells possibly develop new mechanisms and regulations for cell proliferation from an initial stage and further enhance tumour metastasis with degenerative disease. Additional file 1 (Table S1) lists all unique gene expression signatures identified in each module.

To further elaborate functions of unique gene expression signatures, literature search using PubMed was performed. Our finding clearly showed that unique gene expression signatures play important roles in progression of prostate cancer at a specific phase. For examples, SMAD3 and TGFB2 were reported as androgen- independent prostate cancer-specific genes [60] which were found in a specific expression of early-late phase. In addition, there were more unique gene expression signatures in early-late phase, for instances PTEN, BRAF, DDX5, NCOA4, WHSC1, CCND2, CDH11, ERCC5, FANCD2, LIFR, MAF, RAF1, and TOP1. Examples of unique gene expression signatures in middle-late phase, we found TP53 and RB1 which were reported as tumour suppressor genes. Growing evidences were also shown in transcription factor, such as STAT3 which was identified only in early-middle phase.

Regarding on pathway enrichment analysis associated in prostate cancer, interestingly Transcription Androgen Receptor nuclear signaling was found to be the enriched pathway as illustrated in Figure 5. Obviously, Transcription factor AR plays an important role in Transcription relationship which was found in all three modules and appeared to be a hub for regulating a lot of genes in this pathway. We also showed the other enriched pathways as shown in Additional file 1 (Fig. S2), for example Development Epidermal Growth Factor Receptor (EGFR) signaling pathway as presented in Additional file 1 (Fig. S3). As known, EGFR signalling pathway regulates cell proliferation, cell differentiation, cell cycle, and cell migration. Undoubtedly, EGFR pathway therefore becomes a part of a complex network that has been an interested target for effective cancer therapies [61, 62]. From this study, the results also showed consistency with our previous work [12].

Candidate disease-associated genes in prostate cancer progression and their statistical significance

In order to identify candidate disease-associated genes in prostate cancer progression, the gene members in each module were overlapped among three modules. With repeat interactions removed, 94 genes were found as the common members and regarded as the candidate disease-associated genes (see Figure 3 and Table 2). For biological interpretation, functional enrichment analysis of our candidate disease-associated genes was conducted using GeneGo [28]. Based on different sub-cellular localizations, namely extracellular, membrane, cytoplasm, and nucleus, a major fraction of 94 candidate disease-associated genes was enriched in the nucleus and different functions were mostly encoded for transcription factors as illustrated in Figure 6.

Additionally, we compared the candidate disease-associated genes with public databases as shown in Table 3. As a result, 23 out of 94 genes were found in Cancer Gene Census database, and 22 out of 94 genes were identified as prostate cancer- related genes from GAD, as well as 18 out of 94 genes were recognized as transcription factors from AnimalTFDB. Concerning on high degree of interactions (≥10 interacted genes), 15 out of 23 genes (65.2%) were found to be hubs when Cancer Gene Census database was used. For other databases, GAD and AnimalTFDB showed 18 out of 22 genes (81.8%) and 12 out of 18 genes (66.7%) were found to be hubs, respectively. These results suggest that cancer-related genes and transcription factors likely showed to be the hub genes. In addition, self-interacting genes tended to be cancer-related genes and transcription factors (shown in Figure 4). As evidences subsequently presented, 32 out of 94 genes as self-interacting genes were found to be 10 cancer-related genes based Cancer Gene Census database, 9 prostate cancer-related genes based GAD, and 9 transcription factors based AnimalTFDB.

Table 3 Summary of statistical significance of candidate disease-associated genes in prostate cancer progression#

Full size table

To further assess statistical significance of 94 candidate disease-associated genes, the Cancer Gene Census database [56], GAD [57], and AnimalTFDB [58] were also used. 23 out of 94 genes underlying 488 genes which have been reported to be related to cancer in Cancer Gene Census database [56], we further investigated whether these genes could be randomly obtained. Statistical significance was checked using hypergeometric distribution and 106 times random simulation. The results showed that two significant p-values of 6 × 10^-4 and < 3 × 10^-4 were obtained, respectively. These indicate that the candidate cancer genes are enriched among known cancer-related genes and cannot be obtained randomly. For GAD [57], 22 out of 94 genes underlying 309 genes reported to be related to prostate cancer. The statistical significance was similarly checked. Two significant p-values of 7 × 10^-3 from a hypergeometric distribution and7 × 10^-3 from random simulation were obtained. Once using the AnimalTFDB [58], 18 out of 94 genes underlying 1,457 genes, which have been reported to be related to transcription factors. Statistical significance was similarly checked. As a result, two significant p-values of 9 × 10^-3 and 7 × 10^-3 were obtained with a hypergeometric distribution and random simulation, respectively.

Validation of candidate disease-associated genes regarded as potential module biomarker

To further validate the ability of the candidate disease- associated genes to distinguish cancer samples from controls, the gene expression dataset in series of GSE6919 for prostate cancer obtained from GEO database (http://www.ncbi.nlm.nih.gov/geo/) and the independent gene expression dataset [63] were used as the tested datasets. Here, we hypothesized that if our candidate disease-associated genes can successfully distinguish cancer samples from control samples in these tested datasets, they can be further shown to be related to prostate cancer and regarded as a potential module biomarker. Moreover, we compared our results with those obtained with a public biomarker set for prostate cancer [64], which were derived from differential gene expression. Five-fold cross validation was used to assess the performance based on different biomarkers and the SVM regression was used as the classifier. Figure 7 and 8 show the ROC curves obtained with our candidate disease-associated genes as module biomarker and known biomarkers individually for these two tested datasets. In addition, we also show AUC (area under curve) to provide the statistical summary of the performance of the classification over the entire range of sensitivity and specificity. In Figure 7, for the GSE6919 gene expression dataset, our module biomarker shows AUC of 91.44% and known biomarkers show AUC of 84.05%. In Figure 8 for the independent gene expression dataset [63], our module biomarker and known biomarkers show AUC of 92.85% and 86.45%, respectively. These results confirm that our identified putative prostate cancer associated module biomarker performs well with respect to the known biomarkers; therefore it could be potentially applied to predict prostate cancer progression.

Transcription factor Sp1 as a novel biomarker for prostate cancer

Towards candidate disease-associated genes as the potential module biomarker, interestingly, we found 18 key transcription factors which had a major fraction involved in Transcriptional regulation. Accordingly, it is possible that these key transcription factors probably regulate a large number of genes and are called potential candidates to be biomarkers for prostate cancer. This is based on the concept that transcription factors are the drivers of the potential regulation of genes in prostate cancer, and thus are relevant for use as biomarkers [65].

In order to identify potential candidates to be biomarkers for prostate cancer, we initially mapped 94 candidate disease-associated genes to GeneGo which invoked an appropriate algorithm to build networks relevant to active data, such as our gene list in a straightforward manner depending on the task. Later, we chose Transcription regulation workflow from GeneGo which generated sub-networks centred on transcription factors. Sub-networks were then ranked by p-values and interpreted in terms of gene ontology. Afterwards, a few of sub-networks containing receptors with direct ligands from our datasets and their closet transcription factors that directly targeted the objects with these datasets were generated. To the end, we could identify potential transcription factors regarded as candidates to be biomarkers for prostate cancer which had a Transcription regulation relationship with the regulated candidate disease-associated genes.

Successfully, we found Myc, AR, ESR1 and p53 as potential transcription factors which were possibly regarded as biomarkers for prostate cancer as shown in Additional file 1 (Figure S4) [66–69]. Surprisingly, our identification showed that Specificity Protein 1 (Sp1) was a hidden key transcription factor involved in regulation of gene expression in early development of human prostate cancer [70]. We found that transcription factor Sp1 directly regulated a lot of candidate disease- associated genes, and also had indirect effect with the remaining genes. The result shows in Figure 9. Focusing on prostate cancer studies, several reports have shown that transcription factor Sp1 regulates some important genes like androgen receptor (AR) and TGF-β [71–73]. Moreover, transcription factor Sp1 has also been found as a new biomarker that could identify a subset of pancreatic ductal adenocarcinoma with aggressive clinical behaviour. It can be used at initial diagnosis of pancreatic adenocarcinoma to identify patients with an increased probability of cancer metastasis and much shortened overall survival [74]. As many articles reported that transcription factor Sp1 plays an important role with clinical behaviour and it is identified as a hub around different transcriptional changes. We therefore propose that transcription factor Sp1 is probably a novel candidate diagnosis biomarker related to prostate cancer. We expect that the future application of transcription factor Sp1 as a biomarker for prostate cancer may improve clinical management.

Conclusions

In summary, we proposed an integrative analysis based on the gene expression profiles and the reconstructed protein-protein interaction network for prostate cancer, in contrast to the conventional methods of examining differential genes expression or proteins expression. In particular, this study was more intensive analysis on modular analysis for investigating the progression of different disease phases of prostate cancer. The achieved significant modules resulted in the identification of the candidate disease-associated genes which were consequently regarded as potential module biomarker. It can be effectively used as the promising feature to distinguish between control and disease samples. Regarding on functional analysis of candidate disease-associated genes, interestingly a major fraction of genes was enriched in the nucleus and different functions were encoded for transcription factors. Concerning on pathway enrichment analysis, Transcription Androgen Receptor (AR) nuclear signaling and Epidermal Growth Factor Receptor (EGFR) signalling pathway were clearly shown to be the pathway targets for prostate cancer progression. Transcription factor AR plays an important role in Transcription relationship and acts as a hub for regulating a lot of genes in the Transcription AR nuclear signaling. EGFR signalling regulates cell proliferation, cell differentiation, cell cycle, and cell migration and therefore it has been a potential interested target for effective cancer therapies. Last but not least, we successfully found an interesting transcription factor Sp1 which could be regarded as a potential novel biomarker for prostate cancer. For a future work, we will further study the experimental validation of potential disease genes and pathways during prostate cancer progression.

References

Dasgupta S, Srinidhi S, Vishwanatha JK: Oncogenic activation in prostate cancer progression and metastasis: Molecular insights and future challenges. J Carcinog. 2012, 11: 4-10.4103/1477-3163.93001.
Article PubMed PubMed Central Google Scholar
Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D: Global cancer statistics. CA Cancer J Clin. 2011, 61 (2): 69-90. 10.3322/caac.20107.
Article PubMed Google Scholar
Tricoli JV, Schoenfeldt M, Conley BA: Detection of prostate cancer and predicting progression: current and future diagnostic markers. Clin Cancer Res. 2004, 10 (12 Pt 1): 3943-3953.
Article PubMed Google Scholar
Thun MJ, DeLancey JO, Center MM, Jemal A, Ward EM: The global burden of cancer: priorities for prevention. Carcinogenesis. 31 (1): 100-110.
Lim JE, Hong KW, Jin HS, Kim YS, Park HK, Oh B: Type 2 diabetes genetic association database manually curated for the study design and odds ratio. BMC Med Inform Decis Mak. 2010, 10: 76-10.1186/1472-6947-10-76.
Article PubMed PubMed Central Google Scholar
Lapointe J, Li C, Higgins JP, van de Rijn M, Bair E, Montgomery K, Ferrari M, Egevad L, Rayford W, Bergerheim U, Ekman P, DeMarzo AM, Tibshirani R, Botstein D, Brown PO, Brooks JD, Pollack JR: Gene expression profiling identifies clinically relevant subtypes of prostate cancer. Proc Natl Acad Sci USA. 2004, 101 (3): 811-816. 10.1073/pnas.0304146101.
Article PubMed CAS PubMed Central Google Scholar
Collins FS, Morgan M, Patrinos A: The Human Genome Project: lessons from large-scale biology. Science. 2003, 300 (5617): 286-290. 10.1126/science.1084564.
Article PubMed CAS Google Scholar
Kallioniemi O: Functional genomics and transcriptomics of prostate cancer: promises and limitations. BJU Int. 2005, 96 (Suppl 2): 10-15.
Article PubMed CAS Google Scholar
Jiang J, Cui W, Vongsangnak W, Hu G, Shen B: Post genome-wide association studies functional characterization of prostate cancer risk loci. BMC Genomics. 2013, 14 (Suppl 8): S9-10.1186/1471-2164-14-S8-S9.
Article PubMed PubMed Central Google Scholar
Hu Y, Li J, Yan W, Chen J, Li Y, Hu G, Shen B: Identifying novel glioma associated pathways based on systems biology level meta-analysis. BMC systems biology. 2013, 7 (Suppl 2): S9-10.1186/1752-0509-7-S2-S9.
Article PubMed PubMed Central Google Scholar
Tang Y, Yan W, Chen J, Luo C, Kaipia A, Shen B: Identification of novel microRNA regulatory pathways associated with heterogeneous prostate cancer. BMC systems biology. 2013, 7 (Suppl 2): S6-10.1186/1752-0509-7-S2-S6.
Article PubMed PubMed Central Google Scholar
Wang Y, Chen J, Li Q, Wang H, Liu G, Jing Q, Shen B: Identifying novel prostate cancer associated pathways based on integrative microarray data analysis. Comput Biol Chem. 2011, 35 (3): 151-158. 10.1016/j.compbiolchem.2011.04.003.
Article PubMed CAS Google Scholar
Chen J, Wang Y, Shen B, Zhang D: Molecular signature of cancer at gene level or pathway level? Case studies of colorectal cancer and prostate cancer microarray data. Comput Math Methods Med. 2013, 2013: 909525-
PubMed PubMed Central Google Scholar
Wang Y, Klijn JG, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J, Jatkoe T, Berns EM, Atkins D, Foekens JA: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005, 365 (9460): 671-679. 10.1016/S0140-6736(05)17947-1.
Article PubMed CAS Google Scholar
Chari R, Coe BP, Vucic EA, Lockwood WW, Lam WL: An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer. BMC Syst Biol. 4: 67-
Nam H, Chung BC, Kim Y, Lee K, Lee D: Combining tissue transcriptomics and urine metabolomics for breast cancer biomarker identification. Bioinformatics. 2009, 25 (23): 3151-3157. 10.1093/bioinformatics/btp558.
Article PubMed CAS Google Scholar
Fradet Y: Biomarkers in prostate cancer diagnosis and prognosis: beyond prostate-specific antigen. Curr Opin Urol. 2009, 19 (3): 243-246. 10.1097/MOU.0b013e32832a08b5.
Article PubMed Google Scholar
Ideker T, Sharan R: Protein networks in disease. Genome Res. 2008, 18 (4): 644-652. 10.1101/gr.071852.107.
Article PubMed CAS PubMed Central Google Scholar
Chuang HY, Lee E, Liu YT, Lee D, Ideker T: Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007, 3: 140-
Article PubMed PubMed Central Google Scholar
Taylor IW, Linding R, Warde-Farley D, Liu Y, Pesquita C, Faria D, Bull S, Pawson T, Morris Q, Wrana JL: Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat Biotechnol. 2009, 27 (2): 199-204. 10.1038/nbt.1522.
Article PubMed CAS Google Scholar
Guo Z, Wang L, Li Y, Gong X, Yao C, Ma W, Wang D, Zhu J, Zhang M, Yang D, Rao S, Wang J: Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network. Bioinformatics. 2007, 23 (16): 2121-2128. 10.1093/bioinformatics/btm294.
Article PubMed CAS Google Scholar
Ma H, Schadt EE, Kaplan LM, Zhao H: COSINE: COndition-SpecIfic sub-NEtwork identification using a global optimization method. Bioinformatics. 2011, 27 (9): 1290-1298. 10.1093/bioinformatics/btr136.
Article PubMed CAS PubMed Central Google Scholar
Qiu YQ, Zhang S, Zhang XS, Chen L: Detecting disease associated modules and prioritizing active genes based on high throughput data. BMC Bioinformatics. 2010, 11: 26-10.1186/1471-2105-11-26.
Article PubMed PubMed Central Google Scholar
Fortney K, Jurisica I: Integrative computational biology for cancer research. Hum Genet. 130 (4): 465-481.
Joyce AR, Palsson BO: The model organism as a system: integrating 'omics' data sets. Nat Rev Mol Cell Biol. 2006, 7 (3): 198-210. 10.1038/nrm1857.
Article PubMed CAS Google Scholar
Brahmachari SK: Introducing the medical bioinformatics in Journal of Translational Medicine. J Transl Med. 10: 202-
Maqungo M, Kaur M, Kwofie SK, Radovanovic A, Schaefer U, Schmeier S, Oppon E, Christoffels A, Bajic VB: DDPC: Dragon Database of Genes associated with Prostate Cancer. Nucleic Acids Res. 39 (Database): D980-985.
Ekins S, Bugrim A, Brovold L, Kirillov E, Nikolsky Y, Rakhmatulin E, Sorokina S, Ryabov A, Serebryiskaya T, Melnikov A, Metz J, Nikolskaya T: Algorithms for network analysis in systems-ADME/Tox using the MetaCore and MetaDrug platforms. Xenobiotica. 2006, 36 (10-11): 877-901. 10.1080/00498250600861660.
Article PubMed CAS Google Scholar
Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2005, 33 (Database): D514-517.
PubMed CAS PubMed Central Google Scholar
Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004, 32 (Database): D277-280.
Article PubMed CAS PubMed Central Google Scholar
Li LC, Zhao H, Shiina H, Kane CJ, Dahiya R: PGDB: a curated and integrated database of genes related to the prostate. Nucleic Acids Res. 2003, 31 (1): 291-293. 10.1093/nar/gkg008.
Article PubMed CAS PubMed Central Google Scholar
Agarwal SM, Raghav D, Singh H, Raghava GP: CCDB: a curated database of genes involved in cervix cancer. Nucleic Acids Res. 39 (Database): D975-979.
Shoop E, Casaes P, Onsongo G, Lesnett L, Petursdottir EO, Donkor EK, Tkach D, Cosimini M: Data exploration tools for the Gene Ontology database. Bioinformatics. 2004, 20 (18): 3442-3454. 10.1093/bioinformatics/bth425.
Article PubMed CAS Google Scholar
Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, 34 (Database): D535-539.
Article PubMed CAS PubMed Central Google Scholar
Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009, 4 (1): 44-57.
Article PubMed Google Scholar
Huang da W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009, 37 (1): 1-13. 10.1093/nar/gkn923.
Article PubMed PubMed Central Google Scholar
Smyth GK, Michaud J, Scott HS: Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics. 2005, 21 (9): 2067-2075. 10.1093/bioinformatics/bti270.
Article PubMed CAS Google Scholar
Dudoit S, Gentleman RC, Quackenbush J: Open source software for the analysis of microarray data. Biotechniques. 2003, 45-51. Suppl
Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I: Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001, 125 (1-2): 279-284. 10.1016/S0166-4328(01)00297-2.
Article PubMed CAS Google Scholar
Pavlopoulos GA, Secrier M, Moschopoulos CN, Soldatos TG, Kossida S, Aerts J, Schneider R, Bagos PG: Using graph theory to analyze biological networks. BioData Min. 4: 10-
Ideker T, Ozier O, Schwikowski B, Siegel AF: Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics. 2002, 18 (Suppl 1): S233-240. 10.1093/bioinformatics/18.suppl_1.S233.
Article PubMed Google Scholar
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13 (11): 2498-2504. 10.1101/gr.1239303.
Article PubMed CAS PubMed Central Google Scholar
Bauer CR, Epstein AM, Sweeney SJ, Zarnescu DC, Bosco G: Genetic and systems level analysis of Drosophila sticky/citron kinase and dFmr1 mutants reveals common regulation of genetic networks. BMC systems biology. 2008, 2: 101-10.1186/1752-0509-2-101.
Article PubMed PubMed Central Google Scholar
Stuart GR, Copeland WC, Strand MK: Construction and application of a protein and genetic interaction network (yeast interactome). Nucleic Acids Res. 2009, 37 (7): e54-10.1093/nar/gkp140.
Article PubMed PubMed Central Google Scholar
Meyer JN, Boyd WA, Azzam GA, Haugen AC, Freedman JH, Van Houten B: Decline of nucleotide excision repair capacity in aging Caenorhabditis elegans. Genome Biol. 2007, 8 (5): R70-10.1186/gb-2007-8-5-r70.
Article PubMed PubMed Central Google Scholar
Mayer ML, Sheridan JA, Blohmke CJ, Turvey SE, Hancock RE: The Pseudomonas aeruginosa autoinducer 3O-C12 homoserine lactone provokes hyperinflammatory responses from cystic fibrosis airway epithelial cells. PLoS One. 2011, 6 (1): e16246-10.1371/journal.pone.0016246.
Article PubMed CAS PubMed Central Google Scholar
Cortes C VV: Support-vector networks. Mach Learn. 1995, 20: 273-297.
Google Scholar
Ng KL, Mishra SK: De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures. Bioinformatics. 2007, 23 (11): 1321-1330. 10.1093/bioinformatics/btm026.
Article PubMed CAS Google Scholar
Rice SB, Nenadic G, Stapley BJ: Mining protein function from text using term-based support vector machines. BMC Bioinformatics. 2005, 6 (Suppl 1): S22-10.1186/1471-2105-6-S1-S22.
Article PubMed PubMed Central Google Scholar
Son YJ, Kim HG, Kim EH, Choi S, Lee SK: Application of support vector machine for prediction of medication adherence in heart failure patients. Healthc Inform Res. 2010, 16 (4): 253-259. 10.4258/hir.2010.16.4.253.
Article PubMed PubMed Central Google Scholar
Wen Z, Liu ZP, Liu Z, Zhang Y, Chen L: An integrated approach to identify causal network modules of complex diseases with application to colorectal cancer. J Am Med Inform Assoc. 2012
Google Scholar
Liu X, Liu ZP, Zhao XM, Chen L: Identifying disease genes and module biomarkers by differential interactions. J Am Med Inform Assoc. 2012, 19 (2): 241-248. 10.1136/amiajnl-2011-000658.
Article PubMed PubMed Central Google Scholar
Liu JJ, Xiang Y: In silico mining and PCR-based approaches to transcription factor discovery in non-model plants: gene discovery of the WRKY transcription factors in conifers. Methods Mol Biol. 2011, 754: 21-43. 10.1007/978-1-61779-154-3_2.
Article PubMed CAS Google Scholar
Karatzoglou BJ: kernlab - an S4 package for kernel methods in R. J Stat Softw. 2004
Google Scholar
Sing T, Sander O, Beerenwinkel N, Lengauer T: ROCR: visualizing classifier performance in R. Bioinformatics. 2005, 21 (20): 3940-3941. 10.1093/bioinformatics/bti623.
Article PubMed CAS Google Scholar
Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR: A census of human cancer genes. Nat Rev Cancer. 2004, 4 (3): 177-183. 10.1038/nrc1299.
Article PubMed CAS PubMed Central Google Scholar
Becker KG, Barnes KC, Bright TJ, Wang SA: The genetic association database. Nat Genet. 2004, 36 (5): 431-432. 10.1038/ng0504-431.
Article PubMed CAS Google Scholar
Zhang HM, Chen H, Liu W, Liu H, Gong J, Wang H, Guo AY: AnimalTFDB: a comprehensive animal transcription factor database. Nucleic Acids Res. 2012, 40 (Database): D144-149.
Article PubMed CAS PubMed Central Google Scholar
Liu G, Ding M, Chen J, Huang J, Wang H, Jing Q, Shen B: Computational analysis of microRNA function in heart development. Acta Biochim Biophys Sin (Shanghai). 42 (9): 662-670.
Li TQ, Feng CQ, Zou YG, Shi R, Liang S, Mao XM: Literature-mining and bioinformatic analysis of androgen-independent prostate cancer-specific genes. Zhonghua Nan Ke Xue. 2009, 15 (12): 1102-1107.
PubMed CAS Google Scholar
Seshacharyulu P, Ponnusamy MP, Haridas D, Jain M, Ganti AK, Batra SK: Targeting the EGFR signaling pathway in cancer therapy. Expert Opin Ther Targets. 16 (1): 15-31.
Teixeira AL, Gomes M, Medeiros R: EGFR signaling pathway and related- miRNAs in age-related diseases: the example of miR-221 and miR-222. Front Genet. 3: 286-
Dhanasekaran SM, Barrette TR, Ghosh D, Shah R, Varambally S, Kurachi K, Pienta KJ, Rubin MA, Chinnaiyan AM: Delineation of prognostic biomarkers in prostate cancer. Nature. 2001, 412 (6849): 822-826. 10.1038/35090585.
Article PubMed CAS Google Scholar
Larkin SE, Holmes S, Cree IA, Walker T, Basketter V, Bickers B, Harris S, Garbis SD, Townsend PA, Aukim-Hastie C: Identification of markers of prostate cancer progression using candidate gene expression. Br J Cancer. 2012, 106 (1): 157-165. 10.1038/bjc.2011.490.
Article PubMed CAS PubMed Central Google Scholar
Kaur M, MacPherson CR, Schmeier S, Narasimhan K, Choolani M, Bajic VB: In Silico discovery of transcription factors as potential diagnostic biomarkers of ovarian cancer. BMC systems biology. 2011, 5: 144-10.1186/1752-0509-5-144.
Article PubMed PubMed Central Google Scholar
Chng KR, Cheung E: Sequencing the transcriptional network of androgen receptor in prostate cancer. Cancer Lett. 2012
Google Scholar
Kohli M, Qin R, Jimenez R, Dehm SM: Biomarker-based targeting of the androgen-androgen receptor axis in advanced prostate cancer. Adv Urol. 2012, 2012: 781459-
Article PubMed PubMed Central Google Scholar
Verma MP, P , Verma M: Biomarkers in Prostate Cancer Epidemiology. Cancers. 2011, 3: 3773-3798. 10.3390/cancers3043773.
Article PubMed PubMed Central Google Scholar
Willard SS, Koochekpour S: Regulators of gene expression as biomarkers for prostate cancer. Am J Cancer Res. 2012, 2 (6): 620-657.
PubMed CAS PubMed Central Google Scholar
Yeh HY, Cheng SW, Lin YC, Yeh CY, Lin SF, Soo VW: Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency. BMC Med Genomics. 2009, 2: 70-10.1186/1755-8794-2-70.
Article PubMed PubMed Central Google Scholar
Sankpal UT, Goodison S, Abdelrahim M, Basha R: Targeting Sp1 transcription factors in prostate cancer therapy. Med Chem. 7 (5): 518-525.
Eisermann K, Broderick CJ, Bazarov A, Moazam MM, Fraizer GC: Androgen up-regulates vascular endothelial growth factor expression in prostate cancer cells via an Sp1 binding site. Mol Cancer. 2013, 12: 7-10.1186/1476-4598-12-7.
Article PubMed CAS PubMed Central Google Scholar
Yuan H, Gong A, Young CY: Involvement of transcription factor Sp1 in quercetin-mediated inhibitory effect on the androgen receptor in human prostate cancer cells. Carcinogenesis. 2005, 26 (4): 793-801. 10.1093/carcin/bgi021.
Article PubMed CAS Google Scholar
Jiang NY, Woda BA, Banner BF, Whalen GF, Dresser KA, Lu D: Sp1, a new biomarker that identifies a subset of aggressive pancreatic ductal adenocarcinoma. Cancer Epidemiol Biomarkers Prev. 2008, 17 (7): 1648-1652. 10.1158/1055-9965.EPI-07-2791.
Article PubMed CAS Google Scholar
Agarwal SM, Raghav D, Singh H, Raghava GP: CCDB: a curated database of genes involved in cervix cancer. Nucleic Acids Res. 2011, 39 (Database): D975-979. 10.1093/nar/gkq1024.
Article PubMed CAS PubMed Central Google Scholar
Varambally S, Yu J, Laxman B, Rhodes DR, Mehra R, Tomlins SA, Shah RB, Chandran U, Monzon FA, Becich MJ, Wei JT, Pienta KJ, Ghosh D, Rubin MA, Chinnaiyan AM: Integrative genomic and proteomic analysis of prostate cancer reveals signatures of metastatic progression. Cancer Cell. 2005, 8 (5): 393-406. 10.1016/j.ccr.2005.10.001.
Article PubMed CAS Google Scholar
Tomlins SA, Mehra R, Rhodes DR, Cao X, Wang L, Dhanasekaran SM, Kalyana-Sundaram S, Wei JT, Rubin MA, Pienta KJ, Shah RB, Chinnaiyan AM: Integrative molecular concept modeling of prostate cancer progression. Nat Genet. 2007, 39 (1): 41-51. 10.1038/ng1935.
Article PubMed CAS Google Scholar
Kim JH, Dhanasekaran SM, Prensner JR, Cao X, Robinson D, Kalyana- Sundaram S, Huang C, Shankar S, Jing X, Iyer M, Hu M, Sam L, Grasso C, Maher CA, Palanisamy N, Mehra R, Kominsky HD, Siddiqui J, Yu J, Qin ZS, Chinnaiyan AM: Deep sequencing reveals distinct patterns of DNA methylation in prostate cancer. Genome Res. 2011, 21 (7): 1028-1041. 10.1101/gr.119347.110.
Article PubMed CAS PubMed Central Google Scholar
Nanni S, Priolo C, Grasselli A, D'Eletto M, Merola R, Moretti F, Gallucci M, De Carli P, Sentinelli S, Cianciulli AM, Mottolese M, Carlini P, Arcelli D, Helmer-Citterich M, Gaetano C, Loda M, Pontecorvi A, Bacchetti S, Sacchi A, Farsetti A: Epithelial-restricted gene profile of primary cultures from human prostate tumors: a molecular approach to predict clinical behavior of prostate cancer. Mol Cancer Res. 2006, 4 (2): 79-92. 10.1158/1541-7786.MCR-05-0098.
Article PubMed CAS Google Scholar
Welsh JB, Sapinoso LM, Su AI, Kern SG, Wang-Rodriguez J, Moskaluk CA, Frierson HF, Hampton GM: Analysis of gene expression identifies candidate markers and pharmacological targets in prostate cancer. Cancer Res. 2001, 61 (16): 5974-5978.
PubMed CAS Google Scholar
Wang Y, Xia XQ, Jia Z, Sawyers A, Yao H, Wang-Rodriquez J, Mercola D, McClelland M: In silico estimates of tissue components in surgical samples based on expression profiling data. Cancer Res. 2010, 70 (16): 6448-6455. 10.1158/0008-5472.CAN-10-0021.
Article PubMed CAS PubMed Central Google Scholar
Jia Z, Wang Y, Sawyers A, Yao H, Rahmatpanah F, Xia XQ, Xu Q, Pio R, Turan T, Koziol JA, Goodison S, Carpenter P, Wang-Rodriguez J, Simoneau A, Meyskens F, Sutton M, Lernhardt W, Beach T, Monforte J, McClelland M, Mercola D: Diagnosis of prostate cancer using differentially expressed genes in stroma. Cancer Res. 2011, 71 (7): 2476-2487. 10.1158/0008-5472.CAN-10-2585.
Article PubMed CAS PubMed Central Google Scholar
Chen JH, He HC, Jiang FN, Militar J, Ran PY, Qin GQ, Cai C, Chen XB, Zhao J, Mo ZY, Chen YR, Zhu JG, Liu X, Zhong WD: Analysis of the specific pathways and networks of prostate cancer for gene expression profiles in the Chinese population. Med Oncol. 2012, 29 (3): 1972-1984. 10.1007/s12032-011-0088-5.
Article PubMed CAS Google Scholar
Chandran UR, Ma C, Dhir R, Bisceglia M, Lyons-Weiler M, Liang W, Michalopoulos G, Becich M, Monzon FA: Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process. BMC Cancer. 2007, 7: 64-10.1186/1471-2407-7-64.
Article PubMed PubMed Central Google Scholar
Yu YP, Landsittel D, Jing L, Nelson J, Ren B, Liu L, McDonald C, Thomas R, Dhir R, Finkelstein S, Michalopoulos G, Becich M, Luo JH: Gene expression alterations in prostate cancer predicting tumor aggression and preceding development of malignancy. J Clin Oncol. 2004, 22 (14): 2790-2799. 10.1200/JCO.2004.05.158.
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Subir Kumar Nandy for assisting in human prostate cancer protein interaction network reconstruction. BS and YL were supported by National Natural Science Foundation of China (NSFC) (grant no. 31170795 and no. 91230117). WV was supported by National Natural Science Foundation of China (NSFC) (grant no. 31200989).

Declarations

The publication costs for this article were funded by National Natural Science Foundation of China (NSFC) (grant no. 31170795 and no. 91230117).

This article has been published as part of BMC Medical Genomics Volume 7 Supplement 1, 2014: Selected articles from the 3rd Translational Bioinformatics Conference (TBC/ISCB-Asia 2013). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcmedgenomics/supplements/7/S1.

Author information

Authors and Affiliations

Center for Systems Biology, Soochow University, Suzhou, 215006, China
Yin Li, Wanwipa Vongsangnak & Bairong Shen
Key Laboratory of Systems Biology, Chinese Academy of Sciences, Shanghai, 200031, China
Luonan Chen

Authors

Yin Li
View author publications
You can also search for this author in PubMed Google Scholar
Wanwipa Vongsangnak
View author publications
You can also search for this author in PubMed Google Scholar
Luonan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Bairong Shen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bairong Shen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

YL performed integrative data analysis. WV analyzed the gene expression data. YL and WV wrote the paper. and wrote the paper. BS and LC conceived and designed the overall study and revised the manuscript.

Yin Li, Wanwipa Vongsangnak and Bairong Shen contributed equally to this work.

Electronic supplementary material

Additional file 1: Supplementary file 1. All the Supplementary Figures and Tables mentioned in the paper (DOC 6 MB)

12920_2014_463_MOESM2_ESM.xlsx

Additional file 2: Supplementary file 2. The reconstructed HPC-PPIN with annotated functions is shown in it. (XLSX 565 KB)

12920_2014_463_MOESM3_ESM.xls

Additional file 3: Supplementary file 3. A graphic representation of HPC-PPIN by Cytoscape is presented in it. The file is in .cys format which can be opened by Cytoscape software. (XLS 894 KB)

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Li, Y., Vongsangnak, W., Chen, L. et al. Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression. BMC Med Genomics 7 (Suppl 1), S3 (2014). https://doi.org/10.1186/1755-8794-7-S1-S3

Download citation

Published: 08 May 2014
DOI: https://doi.org/10.1186/1755-8794-7-S1-S3

Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Identification of prognostic markers of high grade prostate cancer through an integrated bioinformatics approach

Identification of key regulators in prostate cancer from gene expression datasets of patients

Identification of key pathways and genes in PTEN mutation prostate cancer by bioinformatics analysis

Background

Methods

Human prostate cancer protein interaction network reconstruction and annotation

Prostate cancer gene expression data collection and analysis

Modular analysis for prostate cancer progression

Identified candidate prostate cancer associated genes as putative module biomarker

Validation of candidate prostate cancer associated genes by statistical methods

Functional and pathway enrichment analysis

Results and discussion

Reconstructed HPC-PPIN and its functional annotation

Modules involved in prostate cancer progression

Candidate disease-associated genes in prostate cancer progression and their statistical significance

Validation of candidate disease-associated genes regarded as potential module biomarker

Transcription factor Sp1 as a novel biomarker for prostate cancer

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

Additional file 1: Supplementary file 1. All the Supplementary Figures and Tables mentioned in the paper (DOC 6 MB)

12920_2014_463_MOESM2_ESM.xlsx

12920_2014_463_MOESM3_ESM.xls

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation