Background

Colorectal cancer (CRC) remains major cancer worldwide [1]. Although numerous epidemiological and biological studies have revealed risk/protective factors for CRC, present knowledge is still insufficient to allow the disease to be overcome, and the struggle to elucidate mechanisms is ongoing.

Recently, several a number of genome-wide association studies (GWAS) have revealed an association between variants on chromosome 8q24 and several sites of cancer, including CRC [211]. Each study showed that rs6983267 resides in 128.47-128.54 MB on Chromosome 8, denoted as 'region 3,' [7] and consistently associated with CRC [6, 9, 12]. This association was confirmed in a subsequent large-scale replication study in Caucasians [1318]. Most of these CRC GWASs were conducted in Caucasian populations, however, and the data available for Asian populations is limited especially about possible gene-environment interaction [6, 19].

The aim of the present case-control study was to clarify the impact of rs6983267 on CRC risk in a Japanese population. In addition, we explored the gene-environmental interaction between potential confounders and rs6983267.

Methods

Subjects

Cases were 481 patients who were histologically diagnosed with CRC (245 with colon cancer, 231 with rectum cancer) between January 2001 and November 2005 at Aichi Cancer Center Hospital (ACCH) and who had no prior history of cancer. Controls were first-visit outpatients at ACCH during the same periods who were confirmed to have no cancer or a prior history of neoplasm. Controls were randomly selected and matched for sex and age (± 4 years) with a 1:2 case-control ratio (n = 962). The subjects were selected from the database of the Hospital-based Epidemiologic Research Program at Aichi Cancer Center (HERPACC). The framework of HERPACC has been described elsewhere [20, 21]. Briefly, all outpatients aged 20-79 years were asked at first visit to fill out a questionnaire regarding their lifestyle and provided 7 ml of blood. A trained interviewer checked the completion of each questionnaire. Approximately 95% of eligible subjects completed the questionnaire and 55% provided blood samples. Some 30% of first-visit outpatients were diagnosed at ACCH as having cancer. Under the assumption that the non-cancer population within HERPACC will visit ACCH if they develop cancer in the future, we defined non-cancer first-visit outpatients as those from among whom such cases may arise. Our previous study confirmed that the lifestyle patterns of first-visit outpatients matched the profile of a group randomly selected from the general population of Nagoya City, conferring external validity on the study [22]. Written informed consent was obtained from all subjects and the ethics committee of ACC approved the study.

Determination of the 8q24 loci genotype

DNA of each subject was extracted from the buffy coat fraction with a Blood Mini Kit (Qiagen K.K., Tokyo, Japan) and assessed using the polymerase chain reaction (PCR) TaqMan method [23] with the 7500 Fast Real-time PCR system (Applied Biosystems, Foster City, CA, USA). The probes used were specifically designed for rs6983267 and rs10090154 in 8q24. rs10090154 in the 8q24 'region 1' [7] was chosen because it showed a significant association for a Japanese population in Hawaii [6]. The quality of genotyping was assessed by duplicate analysis of 5% of random samples, with an agreement rate of 100%.

Exposure data

Cumulative smoking dose was evaluated as pack-years, the product of the number of packs consumed per day and years of smoking. Smoking habit was classified into the three categories of never, pack-years < 20 (low-moderate) and ≥ 20 pack years (heavy). Consumption of types of alcoholic beverages (Japanese sake, beer, shochu, whiskey and wine) per occasion was determined with reference to the average number of drinks per day, which was then converted into a Japanese sake (rice wine) equivalent (one unit sake = 23 g ethanol) [24]. Daily ethanol consumption was estimated as the product of the frequency of alcohol beverage and average ethanol consumption occasion, and drinking habit was classified into the four categories of non-drinker, low (< 5 g/day), moderate (< 23 g/day) and heavy (≥ 23 g/day). Consumption of folate was determined using a semi-quantitative food frequency questionnaire (SQFFQ) as described in detail elsewhere [25]. Briefly, the SQFFQ consisted of 47 single food items with frequencies in the eight categories of never or seldom, 1-3 times/month, 1-2 times/week, 3-4 times/week, 5-6 times/week, once/day, twice/day, and 3+ times/day. Average daily intake of nutrients was estimated by multiplying the food intake (in grams) or serving size by the nutrient content per 100 grams of food as listed in the Standard Tables of Food Composition in Japan, 5th edition. Consumption of supplemental folate was not considered in total consumption because the questionnaire for multi-vitamins was not quantitative. Energy-adjusted intake of nutrients was calculated by the residual method [26]. The SQFFQ was validated by reference to a 3-day weighted dietary record as a standard, which showed the reproducibility and validity to be acceptable [27, 28]. The de-attenuated correlation coefficients for energy-adjusted intakes of folate were 0.36 in men and 0.38 in women. Body mass index (BMI) was calculated as the self-reported weight (kilograms) divided by the square of self-reported height (meters). A family history of CRC in first-degree relatives was based on self-reporting, as described elsewhere [29]. The questionnaire also covered the regularity of physical exercise: subjects were asked to report the frequency and intensity of recreational exercise, with average daily exercise hours in any intensity calculated and categorized into the three levels of none, and < 0.5 and ≥ 0.5 hours/day.

Statistical analysis

Odds ratios (ORs) and 95% confidence intervals (CIs) for assessment of the impact of each 8q24 locus, included in the model as an ordinal score (1 to 3), were calculated using multivariable conditional logistic regression models. We explored two models: model 1 was a crude model; model 2 included age and sex plus potential confounders as indicator variables. Confounders considered in model 2 were smoking status (never, former, current moderate, and heavy), drinking habit (non, low, moderate, and heavy), folate consumption by tertile (T1-3), BMI (< 22.5, 22.5 - 24.9, 25.0-27.4 and ≥ 27.5 kg/m2), family history of colorectal cancer (yes or no), and regular exercise (none, < 0.5 hour/day, and ≥ 0.5 hour/day). Interactions between rs6983267 assuming linear effect of allele and potential confounders similarly assuming linear effect were assessed in multivariable unconditional logistic regression models to avoid the dropping of subjects in conditional logistic regression models. To assess possible discrepancies between expected and observed haplotypes, accordance with the Hardy-Weinberg equilibrium (HWE) was checked for controls with the χ2 test. Statistical analyses were performed using STATA version 10 (Stata, College Station, TX), with P-values < 0.05 considered statistically significant.

Results

Table 1 shows baseline characteristics of the 481 CRC cases, with an average age of 60 years, and the 962 controls matched for sex and age. Males accounted for 62.4% of subjects. Apart from a family history of CRC in a first-degree relative, potential confounders showed no clear difference between cases and controls. A family history of CRC was significantly more frequent among CRC cases.

Table 1 Characteristics of cases and controls

Genotype distributions for 8q24 rs6983267 and rs10090154 are shown in Table 2. Among controls, both genotypes were accordant with the HWE. The minor allele frequency for rs6983267 was 0.338 (G-allele). The age- and sex-adjusted in the allelic model showed an OR of 1.22 (1.04-1.44, p = 0.0144) and the confounder-adjusted model an OR of 1.25 (1.06-1.48, p = 0.0071). Genotypic model showed a significant association only with rs6983267 GG genotype (OR = 1.64, 1.15-2.35, p = 0.0063). In contrast, rs10090154 showed no association with CRC risk. Table 3 shows stratified analyses conducted to explore possible interactions between potential confounders although point estimates for ORs were not static; no significant interactions were seen between the factors examined and rs6983267. The lack of association in those with a positive family history was of interest vis a vis the significant association in those without it, albeit that the number of subjects with a family history was limited.

Table 2 Genotypes distribution of 8q24 polymorphisms and odds ratios for the minor alleles and genotypes.
Table 3 Stratified analysis according to potential confounding factors for 8q24 rs6983267 genotype

Discussion

In this study, we found that the G allele in rs6983267 was associated with a significantly increased risk of CRC in a Japanese population. This finding is consistent with those from previous GWASs [6, 9, 11] and a pooled analysis [12], as reviewed in Table 4, which reported the consistency of this association with CRC and colorectal adenoma in populations with European ancestry. The only previous study of rs6983267 in a population with Asian ethnicity (Japanese-American) was that by Haiman et al [6], and to our knowledge the present study is the first indication in Japanese living in Japan. Tenesa et al. reported significant association with rs7014346 in 8q24, which is in high linkage disequilibrium with rs6983267, in Japanese population [19], supporting significant association between the rs6983267 in CRC in Japanese. Recent advances in genetic analysis have enabled a comprehensive approach to identifying disease susceptibility loci. The consistency of findings in this and the previous studies warrants the usefulness of the GWAS approach across ethnicities. We also evaluated potential interactions between common background factors and rs6983267, but found no significant interaction between them. Berndt et al. also reported a lack of interaction between rs6983267 and age, sex, smoking, family history of CRC and cancer site [12]. The consistency of this finding indicates that rs6983267 is associated with CRC risk independently of common risk factors.

Table 4 Review of results of 8q24 rs6983267 for colorectal cancer in allelic model.

Rs6983267 was originally identified using a non-hypothesis-based approach, and evidence has suggested a possible biological mechanism behind this observed association. The rs6983267 polymorphism resides 15 kb upstream of a processed pseudogene (POU5F1P1) of the POU-domain factor gene, POU5F1, which encodes transcription factor OCT4, with 97.5% shared identity [30]. OCT4, a transcript of POU5F1, plays a role in maintaining stem cell pluripotency, self-renewal and chromatin structure in stem cells [31], and promotes tumor growth in a dose-dependent manner [32]. A conserved POU5F1-binding site I at the 5' promoter region of the WNT-signaling gene, FZD5, has been reported [33]. Tomlinson et al. reported the expression of either POU5F1 or POU5F1P1 in cell lines and primary CRCs [9], while Suo et al. similarly reported the expression of these genes in cancer cell lines and cancer tissues [30]. Given that OCT4 pseudogenes in mice are reported to mediate stem cell regulatory function [34], it is possible to hypothesize that OCT4 pseudogenes, including POU5F1P1, might play a role in stem cell proliferation. However, no difference in expression according to rs6983267 status was observed [9]. Berndt discussed the potential contribution of MYC, which is located > 300 KB distant to rs6983267[12]. Recently, Pomerantz et al. reported rs6983267 displays a difference in binding of transcription factor 7-like 2 (TCF7L2) leading to a different physical interaction with MYC [35]; however, Tuupanen et al. failed to find clear association between rs6983267 genotype and MYC expression. There still remains controversy between MYC and rs6983267 requiring further studies. Moreover, Tuupanen et al. reported rs6983267 affects a binding site for the Wnt-regulated transcription factor (TCF4), with the risk allele G showing stronger binding in vivo and in vitro. Overall, these findings indicate that the possible biological mechanism behind the effect of rs6983267 polymorphism on CRC carcinogenesis requires further study.

We did not observe any association with rs10090154 (OR = 0.90) on the contrary to the results from Multi-ethnic cohort study [6]. The point estimate for minor allele in the previous study was 1.41 (95%CI: 1.14-1.75). Following case-control study for Japanese American in Hawaii showed lack of association (OR = 1.07, 95%CI: 0.78-1.48)[6]. Inconsistency across studies might come from the finding in the original GWAS was by chance although threshold in statistical significance was high enough. Or, statistical power in following studies including ours was not good enough. By all means, more evidence is needed to clarify significance of the locus.

Several potential limitations of the present study require consideration. First, use of hospital-based control in this study for potential cause of selection bias. We used non-cancer patients at our hospital as controls, given the likelihood that our cases arose within this population base. Moreover, we previously showed that individuals selected randomly from our control population were similar to the general population in terms of baseline characteristics [22]. Given the similarity in minor allele frequency between our controls and that in the HapMap database for Japanese, it is reasonable to assume the external validity of our study results to the general population. Second, as with other case-control studies, this study may have suffered from information bias: although the questionnaires were completed before the diagnosis in our hospital, some patients referred from other institutions might have known their diagnosis. Lack of interaction needs careful interpretation because confounders assessed in this study showed no association with CRC risk by themselves.

Conclusion

Our present investigation showed that rs6983267 in 8q24 is an independent risk factor of CRC in a Japanese population. Further studies to clarify the biological mechanisms of this association are warranted.