Background

Colorectal cancer (CRC) is the third most common cancer and the fourth-leading cause of cancer death worldwide [1]. Currently, the TNM classification of malignant tumours based on clinicopathological staging remains the standard for CRC prognostication [2]. Many gene expression profiling (GEP) studies on prognosis of CRC have been performed in the last decade using microarray technology, with the aim to identify a gene expression profile to discriminate more aggressive from less aggressive CRC. However, due to their low overlap in the identified genes, no reliable signature useful in the clinical practice has been found. Recently, we published a systematic pathway-based enrichment analysis of 23 independent GEP studies on prognosis of CRC [3]. This analysis indicated the mitochondrial oxidative phosphorylation (OXPHOS) chain as a significantly and consistently overrepresented prognostic category for CRC.

Already several decades ago, it was suggested that impaired oxidative metabolism may cause malignant growth [4]. In cancer cells, there is an enhanced glucose use, slowing the rate of the tricarboxylic acid cycle and oxidative phosphorylation and increasing glycolysis, as the way to generate energy in form of adenosine triphosphate (ATP), despite aerobic conditions [5]. This assumption, known as Warburg’s hypothesis, has been rediscovered by a broad range of experimental approaches showing interaction of mitochondrial metabolism and tumour growth [68]. Additionally, germline mutations in the mitochondrial succinate dehydrogenase (complex II of the OXPHOS chain) subunits SDHD, SDHC, and SDHB are a frequent cause of paragangliomas of the head and neck and of phaeochromocytomas [9].

Thus, the OXPHOS chain appears as a promising candidate pathway for CRC progression. The OXPHOS system, whose proteins are encoded by both nuclear and mitochondrial DNA (mtDNA), consists of five major protein complexes named complex I, II, III, IV, and V, localized on the inner mitochondrial membrane. Its main function is the generation of ATP [10]. We therefore selected putatively functional single nucleotide polymorphisms (SNPs) in the genes from the pathway, and investigated their possible role in the progression of CRC and the prognosis of patients in a group of 613 German CRC patients.

Methods

Study population

The 613 patients included in the study belonged to the population-based PopGen project in Schleswig-Holstein (Germany) [11]. All individuals were of German origin, i.e. with both parents born in Germany. Patients fulfilling either the clinical Amsterdam or Bethesda criteria for hereditary non-polyposis colorectal cancer (HNPCC) were excluded from the study, as were patients with a history of malignant disease or inflammatory bowel disease. For a full description of the characteristics of the 613 patients see Table 1 and Castro et al. [12]. The work has been approved by the corresponding ethical committees and subjects gave informed consent to the work.

Table 1 Characteristics of the 613 CRC patients at the time of diagnosis

Gene and SNP selection

Candidate genes in the OXPHOS pathway (map00190 in the KEGG database [13]) were selected based on a recently published meta-analysis of GEP studies on CRC prognosis [3]. Three out of the seven genes reported in two GEP studies had SNPs in the putative regulatory regions (ATP5C1 from complex V, COX5B and COX6B1 from complex IV). Two additional genes with SNPs in putative regulatory regions, each reported in one GEP study, were also included, namely NDUFS2 from complex I and UQCRB from complex III. With these selection criteria, the four complexes from the OXPHOS chain which allow flowing of protons between the mitochondrial matrix and the intermembrane space, resulting into energy generation in form of ATP, were included in the study. Finally, SNPs in the genes GAPDH and HSPD1 were also selected, since both genes might define the cellular bioenergetic activity of the mitochondria [6].

SNPs within the set of candidate genes (Table 2 and Additional file 1: Table S1) were selected according to the following criteria: minor allele frequency (MAF) > 0.05 in HapMap CEPH population (Utah residents with ancestry from northern and western Europe - CEU), and location in potentially regulatory gene regions (5′ and 3′-UTR, promoter or non-synonymous coding SNPs). In regions with several SNPs in high linkage disequilibrium (LD), tagSNPs were selected for genotyping. LD between the SNPs was calculated using Haploview [14]. The promoter SNP rs2802460 in ATP5C1 was reported to regulate expression of the gene in lymphoblastoid cell lines (eSNPs) on a genome-wide association study of global gene expression [15]. Predictions of the possible impact of amino acid substitutions on the structure and function of the protein were performed with PolyPhen2 [16].

Table 2 SNPs evaluated in the study

Genotyping

The 613 DNAs used in the study were extracted from peripheral venous blood using the FlexiGene chemistry (Qiagen, Hilden, Germany) according to the manufacturer’s protocols. From them, 272 (44.4%) samples were subjected to genome-wide amplification using the illustra GenomiPhi V2 DNA Amplification Kit (GE Healthcare, Freiburg, Germany) according to manufacturer’s instructions. Genotyping was performed using 5 ng of either genomic or whole-genome amplified DNA in 384-well plate format, using KASPar assays on demand (KBiosciences, Hoddesdon, UK) and following the manufacturer’s protocols. Average genotyping call rate was 98.8%. All polymorphisms were in Hardy-Weinberg equilibrium.

Statistical analysis

SAS software version 9.2 (SAS Institute) was used in all analysis. Unadjusted association between the genotyped markers and patient characteristics were evaluated by chi-square test. Odds ratios (ORs) with 95% confidence intervals (CIs) were estimated using logistic regression. Effect of the different genotypes on CRC survival was evaluated using Kaplan-Meier method and was compared using log-rank testing. Analysis of different parameters for prognostic significance was done by univariate and multivariate Cox proportional hazard models. P values < 0.05 were considered statistically significant. Follow-up time was calculated from the date of CRC diagnosis to the CRC specific death or death by any cause or to the end of follow-up (date of last contact with the treating physician). Taking into account the possible biological effects of the selected genes/variants on cancer progression and a possible correlation between the variants, correction for multiple comparisons was excluded.

Results

SNP selection

A total of 29 SNPs were identified in the regions of interest (5′- and 3′-UTR, promoter and non-synonymous coding SNPs) of the seven genes NDUFS2, UQCRB, COX5B, COX6B1, ATP5C1, GAPDH and HSPD1 ( Additional file 1: Table S1). Out of them, 18 SNPs were selected for the association studies, covering all gene regions with a putative regulatory function.

Association with survival

The two polymorphisms in the 3′-UTR region of UQCRB (complex III) showed association with CRC survival (Table 3 and Additional file 1: Table S2). The marker rs7836698 was statistically significantly associated with a decreased risk of death due to CRC (HR = 0.53, 95% CI 0.31–0.91 for TT), or due to any cause (HR = 0.52, 95% CI 0.32–0.85). For rs10504961, only the association with overall survival was statistically significant (HR = 0.64, 95% CI 0.42–0.99 for TT). Interestingly, for both UQCRB polymorphisms, the TT carriers were more likely to develop tumours in the colon than in the rectum, with ORs of 2.02 (95% CI 1.24–3.28) for rs7836698 (TT vs. CC) and 1.74 (95% CI 1.08–2.78) for rs10504961 (TT vs. CC). When survival was analysed according to the cancer site, both SNPs showed association exclusively in the group of patients with cancer located in the colon (HR for overall survival 0.42, 95% CI 0.22–0.82 for rs7836698 TT carriers and 0.46, 95% CI 0.25–0.83 for rs10504961 TT carriers), but not in the rectum (Table 3). The Kaplan-Meier survival curves representing the survival rates of the patients according to their genotypes, as well as the P values for the log-rank test are presented in Figure 1.

Table 3 Association of two polymorphisms in UQCRB (complex III) with survival
Figure 1
figure 1

Kaplan-Meier estimates of CRC specific survival (months) according to UQCRB genotypes, (a) rs7836698, all CRC patients (n = 606), log rank p value = 0.07, (b) rs7836698, all patients diagnosed with colon cancer (n = 324), log rank p value = 0.05, (c) rs10504961, all CRC patients (n = 606), log rank p value = 0.55, and (d) rs10504961, all patients diagnosed with colon cancer (n = 324), log rank p value = 0.06.

We examined also whether inclusion of other variables associated with CRC survival affected the parameter estimates for the UQCRB polymorphisms. The risk of death for carriers of the TT genotype at rs7836698 remained statistically significantly decreased after adjustment for age at diagnosis and TNM stage. Among patients with tumour located in the colon, the hazard ratios were similar both in the uni- and multivariate analyses for rs7836698 and rs10504961, although the statistical significance was sometimes lost.

Association with CRC progression

Three of the 16 SNPs investigated showed an association with one of the different components that form the UICC stage (T, N and M) in a dominant model. Two SNPs in COX6B1 (complex IV) showed an association with the presence of affected lymph nodes (rs6510502, OR = 1.75, 95% CI 1.20–2.57; rs10420252, OR = 1.68, 95% CI 1.11–2.53). The polymorphism rs6510502 was also associated with distant metastasis (OR = 1.67, 95% CI 1.09–2.56). The marker rs7971637 in GAPDH was associated with tumour stage T3/T4 (OR = 1.60, 95% CI 1.11–2.30).

Discussion

In a homogeneous Northern German population of 613 CRC patients, we observed several associations of polymorphisms in the OXPHOS chain with survival and progression of the disease. Mainly, two SNPs in the 3′-UTR region of UQCRB (complex III), rs7836698 and rs10504961, were associated with survival. The two polymorphisms were in relatively strong linkage disequilibrium (D′ = 0.98, r2 = 0.65 in the 613 samples), explaining the association observed with the two SNPs. Added to that, the two studied polymorphisms in COX6B1 (complex IV), rs6510502 in the promoter and rs10420252 in the 5′-UTR, were associated with lymph node metastasis, and rs6510502 also with distant metastasis. The two markers were in high LD (D′ = 0.97, r2 = 0.71 in the 613 samples).

Patients carrying the TT genotype for any of the two SNPs in the UQCRB gene had an increased CRC survival. Interestingly, the association appeared to be restricted to the group of patients with tumour located in the colon. Multivariate analyses confirmed that the association was independent of other variables associated with CRC survival, indicating that both markers might act as independent prognostic markers. In the group of patients with tumour located in the colon the hazard ratios remained low, even though the significance sometimes disappeared when other variables were included. Our results may partially explain the previously reported higher familial risk for colon than for rectal cancer, which was shown to decrease gradually from proximal colon to distal colon and to rectum [17]. Also for survival, differences have been observed between colon and rectal cancer, with rectal cancer having somewhat better survival compared to colon cancer [18].

UQCRB is one of the ten nucleus-encoded subunits of the complex III in the mitochondrial respiratory chain, the ubiquinol-cytochrome c oxidoreductase. In a genome-wide analysis of pancreatic cancer with CGH (comparative genomic hybridization) arrays, a DNA copy number variation was identified at the UQCRB locus, and this gene was suggested as a potential diagnostic marker or a therapeutic target [19]. Recently, it has been demonstrated that UQCRB plays a crucial role in the oxygen sensing mechanism that regulates responses to hypoxia. Using the small molecule terpestacin to inhibit UQCRB, tumour angiogenesis was blocked in vivo [20]. COX6B1 is one of the ten nucleus-encoded subunits of the complex IV in the mitochondrial respiratory chain, the cytochrome c oxidase. Mutations in this gene have been linked to severe infantile encephalomyopathy, a phenotype associated with cytochrome c oxidase deficiency [21]. Our results suggest that genetic variation in the regulatory regions of genes in the OXPHOS chain may associate with CRC progression. Whether such variation affects the gene expression pattern, impairs the function of mitochondria and leads to cancer development and progression remains to be elucidated.

One of the polymorphisms we investigated, rs2802460, located on the promoter of ATP5C1 (complex V), was reported to regulate expression of the gene in lymphoblastoid cell lines in a genome-wide association study of global gene expression [15]. However, this SNP did not show any association with CRC survival or disease progression. A recent report has evaluated the role of 376 tagSNPs (one included in our study, rs11576415 in NDUFS2) in 78 nuclear-encoded mitochondrial genes (including ATP5C1, NDUFS2, and UQCR) in prostate cancer risk, without clear evidence of association [22]. Another recent large association study included variants from 90 genes involved in oxidative phosphorylation (including ATP5C1, COX5B, COX6B1, GAPDH, and NDUFS2) and tested their association with epithelial ovarian cancer risk; no indication of association was found [23]. To our knowledge, this is the first association study of variants in the respiratory chain with CRC survival and progression.

Conclusions

Our findings suggest a possible influence of genes in the OXPHOS chain in CRC survival and progression, and support the newly discovered Warburg’s hypothesis. This is the first report of a possible role of genes in the respiratory chain in the prognosis of CRC patients, but further independent studies are clearly needed to elucidate the validity of the results.