Background

Genome-wide association studies (GWAS) have discovered several new genetic polymorphisms affecting breast cancer risk [1–3]. Even though these individual risk factors each confer quite small increases in risk, a positive association is seen between the number of risk alleles carried and the risk of breast cancer [4, 5].

The phenotypic variables height, body mass index (BMI) and use of hormone replacement therapy (HRT) reflect, to varying degrees, genetic background and environmental exposure. Both height and BMI have previously been shown to associate with breast cancer [6, 7]. Increase in height has been shown to yield a proportional increase in breast cancer risk, and obese women have a greater risk of developing postmenopausal breast cancer. Increased risk is also established for users of HRT [7], which has been speculated to interact with low-risk polymorphisms in the FGFR2 gene [8, 9].

Although there have been investigations on gene-environment interactions in breast cancer [10], this area remains to a large extent unexplored.

The aim of this study was to investigate whether height, BMI and HRT modify the genetic predisposition to breast cancer conferred by reported low-risk polymorphisms. For this purpose we had access to two well-defined Swedish population-based cohorts as well as an Icelandic hospital-based case-control study, altogether 7738 samples (3016 cases and 4722 controls).

Methods

Study populations

The samples originate from two independent Swedish population-based cohorts: the Malmö Diet and Cancer Study (MDCS) from southern Sweden and the Northern Sweden Health and Disease Study (NSHDS), together comprising 2410 incident cases and 3829 controls. The third sample collection was an Icelandic population-based case-control study including 866 cases and 948 controls. Written informed consent was obtained from all women before they donated their samples. All cohorts have been described previously [11] and are briefly presented below.

MDCS

The Malmö Diet and Cancer Study (MDCS) is a prospective cohort study initiated in 1991. In total, it comprises 17035 female residents of Malmö, Sweden, recruited between 1991 and 1996 [12, 13]. By linkage to the national cancer registry until 31st of December 2007, 730 incident cases of invasive breast cancer were identified among MDCS participants. They were matched to 1460 controls from the same cohort according to sex, age (+/− 6 months), and date of sampling at baseline (+/− 2 months). Median age at breast cancer diagnosis was 65 years (range 45–84). Thirty-three cases and 65 controls were ≤50 years of age at time of diagnosis.

The MDCS and the present analyses were approved by the Ethical Committee at Lund University (LU 51–90, Dnr 652/2005 and Dnr 2009/682).

NSHDS

The Northern Sweden Health and Disease Study (NSHDS) includes the Västerbotten Intervention Program (VIP) and the Mammography Screening Program (MSP), initiated in 1985 and 1995 respectively. Participants in the VIP are screened at 40, 50 and 60 years of age, and mammography screening and blood sampling are performed among women between 50 and 69 years of age [14]. Through linkage with the cancer register up to December 31st, 2008, 1680 prospective cases of invasive breast cancer (median age 56 years, range 27–95) were identified. They were matched to 2314 controls by sex, age (+/− 6 months), and date of sampling at baseline (+/− 2 months); 474 cases and 606 controls were ≤50 years of age. Information on HRT use was available for 1420 of these cases.

The NSHDS and the present analyses were approved by the Ethical Committee at Umeå University (Dnr: 2010-147-132 and 07–141).

ICELAND

The Icelandic samples were collected between 1998 and 2006 and represent 45–77% of all Icelandic women with invasive breast cancer diagnosed between 1957 and 2007. The rate of participation varied somewhat depending on the year of diagnosis and was highest between 1999 and 2003 (77%). Unmatched controls were collected between 2000 and 2004, either from women who participated in the population-based cervical or breast cancer screening program and were found free of breast cancer, or from older women in retirement homes who had not been diagnosed with breast cancer, to generally reflect the ages of the cases. By linkage to the Icelandic cancer registry in 2008 we identified cases diagnosed before 31st of December 2007. In total, 866 cases (median age 55 years, range 22–98, 314 ≤ 50 years) and 948 controls (median age 58 years, range 25–102, 256 ≤ 50 years) had DNA available and were eligible for the study.

The use of these samples was approved by the data protection law (200605037), and the Icelandic Science Ethics Committee (VSNb2006050001/03-16 and VSNb2005070008/03-16).

Data collection

Participants in both Swedish cohorts completed a questionnaire providing information about current medication at the time of recruitment. Participants in the NSHDS also self-reported height and weight, whereas for participants in the MDCS height and weight were measured by a trained nurse at the study centre [15].

The Icelandic women answered questions about height, weight and HRT use when they attended the Detection Cancer Clinic (breast cancer mammography or cervical screening) at the Icelandic Cancer Society. The women answered questions at least every tenth year and the most recent answers were used in the study. For the Icelandic cases only data collected prior to breast cancer diagnosis were used. BMI for all participants was calculated as weight in kilograms divided by the square of height in metres (kg/m²).
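As an illustration of the BMI definition above, the following minimal Python sketch computes BMI and maps it to the WHO categories listed under Statistical analysis. The function names are hypothetical and not part of the study. The handling of values that fall exactly on a category boundary, and of values below 18.5, is an assumption, since the text does not specify either.

```python
def bmi(weight_kg, height_cm):
    """Body mass index: weight in kilograms divided by height in metres squared."""
    height_m = height_cm / 100.0
    return weight_kg / (height_m ** 2)


def who_category(bmi_value):
    """Map a BMI value to the categories named in the Statistical analysis section.

    Assumption: a value exactly on a boundary (25 or 30) is placed in the
    higher category; the text does not say how such values were handled.
    """
    if bmi_value < 18.5:
        return None  # below the ranges listed in the text
    if bmi_value < 25:
        return "Normal weight"
    if bmi_value < 30:
        return "Overweight"
    return "Obese"


# Hypothetical participant: 70 kg, 170 cm
value = bmi(70, 170)
print(round(value, 1), who_category(value))
```

Heights are taken in centimetres here only because the height tertiles in the study are reported in centimetres.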

SNP selection

All loci identified by GWAS to be associated with breast cancer and published before June 31st 2007 were initially included in the study [1–3]. Individual SNPs were selected from the publications by Easton et al. and Stacey et al. This primary selection included 10 SNPs, as well as one SNP in CASP8 identified using the candidate gene approach [16]. Two SNPs selected from our own candidate CpG SNP study [11] were also included, making a total of 13 SNPs (Figure 1).

Figure 1

Odds Ratios and Confidence Intervals for all SNPs. All 13 primary polymorphisms and their respective OR and p-value in this sample set. Squares represent OR and brackets represent 95% CI for samples adjusted for age and study population. A subset of previously published data [11].

Assay design and genotyping

Eleven SNPs, combined by the SEQUENOM MassARRAY® Designer software in a single multiplex assay, were analyzed on a MALDI-TOF mass spectrometer (SEQUENOM MassArray) using standard iPLEX reagents and protocol (SEQUENOM) and 10 ng DNA as PCR template. Primer sets were from Metabion (Martinsried, Germany).

SNPs rs2981582 and rs1045485 were analyzed by a separate TaqMan® “assay by design” genotyping assay on a 7900HT instrument, using Master mix No UNG from Applied Biosystems according to the manufacturer’s instructions. Reaction mixtures (6μL) containing 2 ng of DNA template, primers (rs2981582 forward primer 5′-CAG CAC TCA TCG CCA CTT AAT G-3′, reverse primer 5′-GAC ACC ACT CGG ACT GCT-3′, and probes 5′-VIC-TCT CCG CAA ACA GG-MGB-3′ and 5′-FAM-CTC TCC ACA AAC AGG-MGB-3′) (rs1045485 forward primer 5′-ACC ACG ACC TTT GAA GAG CTT -3′, reverse primer 5′-ACT GTG GTC CAT GAG TTG GTA GAT-3′, and probes 5′-VIC-CCC CAC GAT GAC TG-MGB-3′ and 5′-FAM-CCC CAC CAT GAC TG-MGB-3′) were subjected to two minutes at 50°C and ten minutes at 95°C, followed by 50 PCR cycles of 95°C for 15 seconds and 60°C for one minute.

Three percent of the samples from NSHDS and five percent of the Icelandic samples were included as blinded duplicates for quality control purposes.

Statistical analysis

Individual samples producing results in < 80% of the assays were excluded prior to statistical analyses to eliminate samples with low-quality DNA. Genotype data from control samples were tested for consistency with Hardy-Weinberg equilibrium (HWE) using a χ² p-value cutpoint of 0.001. Unconditional logistic regression was used to measure the independent association between each genotype and breast cancer, with odds ratios (OR) and 95% confidence intervals (CI) estimated for each genotype. Per-allele OR (p-trend) was calculated using 0, 1 or 2 copies of the minor allele (a) as a continuous variable. OR and 95% CI were also calculated for the association between each phenotypic variable (height, BMI and HRT) and risk of breast cancer, both with and without adjustment for age.

Data were then stratified into tertiles according to height (<162 cm, 162–166 cm and >166 cm), and into subcategories of BMI according to the WHO guidelines (normal weight: 18.5–25, overweight: 25–30 and obese: >30). For HRT, data were stratified according to reported “non-use” and “current use”. The current users were further divided into users of estrogen only and users of combined hormones.
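The Hardy-Weinberg check described above can be sketched as follows. This is a minimal illustration of a one-degree-of-freedom χ² goodness-of-fit test, not the software used in the study. The function names and the genotype counts are hypothetical; only the cutpoint of 0.001 comes from the text.

```python
import math


def hwe_chi2(n_AA, n_Aa, n_aa):
    """Chi-square test (1 df) of genotype counts against Hardy-Weinberg proportions.

    Returns (chi-square statistic, p-value). Assumes both alleles are
    observed, so that no expected count is zero.
    """
    n = n_AA + n_Aa + n_aa
    p = (2 * n_AA + n_Aa) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele a
    observed = (n_AA, n_Aa, n_aa)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 1 df
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


def consistent_with_hwe(n_AA, n_Aa, n_aa, cutpoint=0.001):
    """True if the p-value is not below the cutpoint stated in the text."""
    _, p_value = hwe_chi2(n_AA, n_Aa, n_aa)
    return p_value >= cutpoint


# Hypothetical control genotype counts
print(hwe_chi2(100, 200, 100))
print(consistent_with_hwe(150, 100, 150))
```

The closed form for the p-value holds only for one degree of freedom, which is the case for a biallelic SNP with the allele frequency estimated from the data.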

A p-value for interaction was estimated for each genotype/phenotype pair, and a value of less than 0.05 was considered statistically significant. To adjust for multiple comparisons, this value was divided by the number of interaction analyses (8 SNPs × 3 phenotypic variables = 24) according to Bonferroni, giving a corrected significance threshold of 0.002. All results were adjusted for age and study population.
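The arithmetic behind the corrected threshold can be checked with a short Python sketch. The variable and function names are illustrative; the two p-values for interaction are those reported in the Results.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests


n_snps = 8
n_phenotypes = 3  # height, BMI and HRT
threshold = bonferroni_threshold(0.05, n_snps * n_phenotypes)

# p-values for interaction reported in the Results
reported = {"rs851987 x height": 0.007, "rs13281615 x HRT": 0.03}
passes = {pair: p < threshold for pair, p in reported.items()}

print(round(threshold, 4))
print(passes)
```

Neither reported interaction falls below the corrected threshold, which is why both are described as tendencies rather than significant findings.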

Results

Of the initial 7738 samples selected for the project, 7392 (95.5%) were successfully retrieved and genotyped for ≥ 80% of the SNPs. All SNPs had a genotyping success rate > 94%, with an average of 98.0%. Results of all 200 analyses performed on duplicate samples were in 100% concordance.

Per-allele OR for each independent SNP is presented in Figure 1. Ten of the SNPs were significant (p < 0.05) in our material, with rs2981582 (FGFR2), rs889312 (MAP3K1) and rs3803662 (TOX3) exhibiting the highest ORs. Two of the SNPs had p-values >0.1 (rs1045485 [CASP8] and rs30099 [5q11]) and were excluded from further analysis.
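For readers unfamiliar with per-allele odds ratios, the sketch below fits a logistic regression with the number of minor alleles (0, 1 or 2) as a single continuous covariate, using Newton-Raphson on grouped genotype counts. It is a simplified illustration under stated assumptions: it is unadjusted (the study adjusted for age and study population), it is not the software used by the authors, and the function name and counts are hypothetical. The example counts are constructed so that the odds of being a case double with each additional allele.

```python
import math


def per_allele_or(case_counts, control_counts, max_iter=50, tol=1e-10):
    """Fit logit P(case) = b0 + b1 * x, with x = copies of the minor allele.

    case_counts / control_counts: numbers of subjects with 0, 1 and 2 copies.
    Returns (per-allele OR, lower 95% limit, upper 95% limit).
    """
    b0 = b1 = 0.0
    for _ in range(max_iter):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x in (0, 1, 2):
            n = case_counts[x] + control_counts[x]
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            resid = case_counts[x] - n * p  # observed minus expected cases
            w = n * p * (1.0 - p)           # binomial variance weight
            g0 += resid
            g1 += x * resid
            h00 += w
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        # Newton step: inverse information matrix times the score vector
        step0 = (h11 * g0 - h01 * g1) / det
        step1 = (h00 * g1 - h01 * g0) / det
        b0 += step0
        b1 += step1
        if abs(step0) < tol and abs(step1) < tol:
            break
    se_b1 = math.sqrt(h00 / det)  # standard error of the slope
    return (math.exp(b1),
            math.exp(b1 - 1.96 * se_b1),
            math.exp(b1 + 1.96 * se_b1))


# Hypothetical genotype counts (0, 1, 2 copies of the minor allele)
odds_ratio, lower, upper = per_allele_or((100, 200, 400), (100, 100, 100))
print(round(odds_ratio, 3), round(lower, 3), round(upper, 3))
```

Treating the allele count as continuous assumes a multiplicative (log-additive) effect per allele, which is what the p-trend in the text tests.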

Three of the SNPs in TOX3 (rs3803662, rs12443621 and rs8051542) exhibited linkage (results not shown), as has previously been reported [1, 4]. Rs12443621 and rs8051542 were therefore excluded from further analysis.

Independent analysis of risk association with each phenotypic variable (height, BMI, HRT) within the entire study population revealed a significantly increased risk of breast cancer for individuals >162 cm compared to shorter women; this association was weakened following age adjustment. No statistically significant association between BMI and risk of breast cancer was found in this population. For current use vs. non-use of HRT, a significantly increased risk was seen for users, OR (95% CI) 1.24 (1.08-1.42), which remained after adjustment for age (Table 1).

Table 1 Environmental risk factors (MDCS, NSHDS and ICELAND)

Stratified analysis and interactions

After stratification by height (as described in Methods), one SNP (rs851987) in ESR1 had a p for interaction of 0.007 with height, with an increasingly protective effect of the major allele in taller women, but it did not pass the threshold for multiple comparisons (p = 0.002) (Table 2).

Table 2 OR adjusted for Age and Study Population, stratified by Height

None of the SNPs showed any tendencies towards significant interactions after stratification according to BMI (Table 3).

Table 3 OR adjusted for Age and Study Population, stratified by BMI

Following stratification of genotypes according to reported current use or non-use of hormone replacement therapy, rs13281615 (8q24) was significant only in non-users of HRT, with a p for interaction of 0.03. This is nominally significant but does not pass the threshold for multiple comparisons (Table 4).

Table 4 OR adjusted for Age and Study Population, stratified by HRT

Discussion

In this study we have explored interactions between reported genetic risk factors for breast cancer and three additional established risk factors (height, BMI and HRT) in 2884 cases and 4508 controls. The strongest tendency for interaction found was that between height and rs851987 in ESR1, although it did not pass the threshold for multiple comparisons. Taller women carrying the T-allele appeared to have reduced breast cancer risk (p for interaction = 0.007) (Table 2). The SNP rs851987 was described by Harlid et al. [11] and is situated at the far end of the extended promoter region of ESR1, about 3.7 kb 5′ of exon F. Exon F and its promoter were originally described by Thompson et al. [17] and have later been shown to affect the level of ESR1 expression in osteoblastic cells [18, 19]. A potential association between ESR1 and height has been described in another study comprising adult males from two Swedish population cohorts [20]. Mutations in ESR1 have been reported to delay fusion of the epiphyseal plates at puberty [21], and one may speculate that rs851987 either participates in this biological effect or is linked to other causal variants.

One SNP (rs13281615 in 8q24), first described by Easton et al. [1], showed a weak tendency for interaction with use of HRT. The minor allele seems to confer increased breast cancer risk in HRT non-users but no excess risk in current users. The association in non-users is strong, with a per-allele OR (95% CI) of 1.20 (1.10-1.31) (p-trend = 6.1 × 10⁻⁵) compared to a per-allele OR (95% CI) of 1.08 (1.01-1.15) (p-trend = 0.03) in all users. The SNP is situated in region 8q24, which contains no known genes but is in close proximity to FAM84B (coding for a breast cancer membrane associated protein) and the proto-oncogene MYC. The 8q24 locus has previously been reported to associate with other types of cancer in addition to breast cancer [22] and to be more strongly associated with ER+ than ER− tumours [23].

Since the first GWAS on breast cancer was published in 2007, several replication and interaction studies of varying sizes have been published [24–27]. In 2010, a large interaction study comprising 7610 breast cancer cases from the Million Women Study in the UK was undertaken, and potential interactions between 12 different SNPs and 10 different variables (including height, BMI and HRT) were tested [10]. This study did not find (contrary to previous suggestions) any significant gene-environment interactions. Our study originally included ten of the same polymorphisms as in the Million Women Study (excluding rs1982073 in TGFB1 and rs1800054 in ATM), but also included one additional SNP from Easton et al. [1] and two additional SNPs from our own candidate CpG study [11] (rs7766585 and rs851987, both in ESR1). Although our material is not as large, our study comprises three well-described study populations, two of which were prospectively followed for breast cancer incidence using the comprehensive, population-based Swedish Cancer Registry [28]. Thus, our complete case ascertainment and ability to select matched controls from the same study base are likely to have resulted in a low risk of selection bias. However, the intervals between data collection, blood sampling, and diagnosis differ substantially between the three study populations, which might be considered a limitation of the study.

Considering demographic traits, participants in the MDCS have a slightly higher socioeconomic status than the general population, but as this selection is the same for the study base from which cases and controls are derived, it should not affect the validity of our study [13]. MDCS participants were recruited at age 45–65 years. The exclusion of prevalent cases removes early breast cancer cases from this population. While the NSHDS participants were primarily included from age 40 and upwards, mammography screening had identified some cases as young as 27 years. In Iceland, prevalent cases of breast cancer were recruited at varying times after diagnosis, resulting in an exclusion of early lethal cases and older women with other causes of death. As the Icelandic controls were collected later and from the same sample population as the cases, there is the possibility of selection bias. Another limitation of our study is that HRT use was reported only once (at recruitment), without information about duration. We also lacked information about risk factors other than age, height, BMI and HRT, and therefore could not adjust our results for other potential confounders.

Conclusions

Our evaluation of genetic predisposition for breast cancer in relation to three different environmental risk factors found no significant gene-environment interactions. We did find tendencies for certain SNPs to exert an effect on breast cancer risk only in women with certain phenotypes. In particular the potential interaction between height and rs851987 in ESR1 in relation to breast cancer risk could merit further investigation. However, independent studies with many more cases would be needed to verify this finding.