Background

Data in the National Statistics of Cancer Registries by Region (1975–2004) indicate that the prevalence of breast cancer in Japan has increased steadily since 1975. More than 60,000 patients had breast cancer in 2008 and the mammary gland is the most common site of a malignant tumor in Japanese women [1]. Additionally, the Vital Statistics Japan database of the Ministry of Health, Labor and Welfare indicates that mortality due to breast cancer in Japan has increased since 1960, with more than 10,000 deaths from breast cancer in 2011 [2].

The relationship of lifestyle factors, including food and nutrition, physical activity, body composition, environmental factors, and reproductive factors, with breast cancer risk have been widely studied, mainly in Europe and the United States, and much evidence linking cancer to these factors has been accumulated. According to the 2007 World Cancer Research Fund/American Institute for Cancer Research (WCRF/AICR) Second Expert Report, the evidence that breastfeeding decreases the breast cancer risk and that alcohol increases this risk is described as “convincing” [3]. In postmenopausal women, evidence that body fat and adult attained height increase breast cancer risk is also stated to be “convincing”. However, the evidence of a relationship of other foods with breast cancer risk remains at the level of “limited-no conclusion”. Thus, it is important to identify risk factors for breast cancer with the goal of prevention through efficient screening and surveillance.

In the United States, a breast cancer risk assessment tool based on a statistical model known as the “Gail model” has been produced by the National Cancer Institute (NCI) [4, 5]. However, this model has been developed from epidemiological data in Caucasians and it may be inappropriate to apply the Gail model in the Japanese population [6]. However, there are few epidemiological studies of breast cancer risk in Japanese women and a breast cancer risk model applicable to Japanese women has yet to be established.

Regarding genetic factors, genome-wide association studies (GWAS) have identified several breast cancer susceptibility single nucleotide polymorphisms (SNPs) [7]. However, most of these studies were also conducted in subjects with European ancestry, with some in populations with Chinese ancestry or in African Americans. There is only one such study in subjects with Japanese ancestry. However, allele frequencies related to breast cancer risk and the extent of linkage disequilibrium differ among races. Thus, the validity of the reported associations of SNPs with breast cancer needs to be tested in a Japanese population.

Current findings suggest that the interactions between breast cancer susceptibility SNPs and breast cancer risk are not as strong as those for BRCA1 or BRCA2 gene mutation. However, carriers of risk SNP alleles are more common compared with carriers of BRCA1 or BRCA2 mutation. Evaluation of the need to incorporate SNPs into a breast cancer risk model requires examination of the influence of these SNPs and established breast cancer risk factors to determine whether these are mutually confounding factors. Moreover, such findings might allow risk allele carriers to reduce their incidence of breast cancer through guidance on lifestyle habits.

The current study was performed to add to the relatively small number of studies that have examined genomic factors such as SNPs in combination with non-genomic factors such as those associated with lifestyle. We first aimed to validate whether reported breast cancer susceptibility SNPs are applicable in the Japanese population. We then examined the possible confounding effects on breast cancer risk of SNPs and lifestyle factors such as food, nutrition, physical activity, body composition, environment factors and reproductive factors.

Methods

Subjects

A multicenter population-based case–control study was conducted between December 2010 and November 2011 in Japan. The subjects were consecutive patients with non-invasive or invasive breast cancer aged over 20 years old who were treated at Okayama University Hospital, Okayama Rousai Hospital and Mizushima Kyodo Hospital in Okayama and at Kagawa Prefecture Central Hospital in Kagawa. The controls were women aged over 20 years old without a history of breast cancer who underwent breast cancer screening at Mizushima Kyodo Hospital and Okayama Saiseikai Hospital in Okayama and at Kagawa Prefectural Cancer Detection Center in Kagawa. All subjects gave written informed consent before enrollment in the study. A blood sample (5 ml) used for SNP analysis was collected from each subject. Subjects were also given questionnaires that they completed at home and mailed back to Okayama University Hospital. The study was approved by the institutional ethics committee on human research at Okayama University.

Survey of lifestyle

A survey of lifestyle was performed using an 11-page self-administered questionnaire that included questions on age, height and body weight (current and at 18 years old), cigarette smoking, alcohol drinking, intake of 15 foods items, intake of 4 beverages, leisure-time exercise (current and at 18 years old), menstruation status, age at first menstruation, age at first birth, parity, breastfeeding, age at menopause, hormone replacement therapy (HRT), history of benign breast disease, familial history of breast cancer, and education. Controls answered the survey based on their current status and patients referred to their prediagnostic lifestyle.

Body mass index (BMI) was calculated as body weight/square of height. Former or current alcohol drinkers were asked to give the frequency per week and type of drink usually consumed (beer, wine, sake, whisky, shochu, or others). The alcoholic content of each drink was taken to be 8.8 g per glass (200 ml) of beer, and 20 g per glass of sake (180 ml), wine (180 ml), shochu (110 ml) and whisky (60 ml) [8]. Alcohol intake per day (g/day) was calculated as follows: (total alcohol content per occasion × frequency of consumption per week)/7. Women who currently engaged in leisure-time exercise were asked to give the intensity of physical activity per occurrence and frequency per week. Metabolic equivalent (MET) values of 10, 7, 4, and 3 METs were assigned for strenuous-, moderate-, low-, and very low intensity activities per occurrence, respectively [9], to allow calculation of the intensity of physical activity in leisure-time exercise per week (METs/week). A family history of breast cancer included mother, sisters and daughters (first-degree family history). History of benign breast disease included the non-cancerous breast. Clinical data on patients were obtained from hospital medical records.

Selection of SNPs

Sixteen breast cancer-associated SNPs were identified from previous GWAS [7] and candidate-gene association studies: ATM/11q22-rs1800054 [10], 8q24-rs1562430 [11], MAP3K1/Chr5-rs889132 [10, 12], 2q-rs4666451 [10], 8q24-rs13281615 [10, 12, 13], TTNT3/11p15-rs909116 [11], 5q-rs30099 [10], IGF1/12q23.2-795399 [10, 14], ESR1/6q25.1-rs2046210 [15, 16], CAPSP8/2q33-34-rs1045485 [10], 2q35-rs13387042 [10], ESR1/6q25.1-rs3757318 [11], TNRC9/16q12-rs3803662 [12, 17], FGFR2/10q26-rs2981282 [10, 12], LSP1/11p15.5-rs381798 [12], and HCN1/5p12- rs98178 [10]. Risk alleles associated with breast cancer were identified with reference to the Japanese Single Nucleotide Polymorphism (JSNP) database [18].

SNP genotyping

Genomic DNA was isolated from whole blood with a Taq-Man® Sample-to-SNP™ kit (Applied Biosystems, Foster City, CA, USA). Samples were analyzed by a TaqMan genotyping assay using the StepOne™ real-time polymerase chain reaction (PCR) system (Applied Biosystems) in a 96-well array plate that included four blank wells as negative controls. The PCR profile consisted of an initial denaturation step at 95°C for 10 min, 40 cycles of 92°C for 15 sec, and 60°C for 1 min. PCR products were analyzed by StepOne™ Software Ver2.01 (Applied Biosystems). To assess the quality of genotyping, we conducted re-genotyping of a randomly selected 5% of samples and obtained 100% agreement.

Statistical analysis

For all analyses, significance was defined as a p-value <0.05. Associations between lifestyle and breast cancer risk were estimated by computing age adjusted odds ratios (OR) and their 95% confidence intervals (CI) from logistic regression analyses. Height was categorized as ≤150, 151–155, 156–160 and >160 according to quartile. Weight was categorized as <50, 50–54.9, 55–59.9 and ≥60 according to quartile. BMI was categorized as ≤20, 20–21.9, 22–23 and ≥24 according to quartile. Alcohol intake per day (g/day) was categorized as 0, <5, 5–10 and ≥10 g/day according to quartile. Food intake, including meat, fish, egg, soy, milk, fruits, green and yellow vegetables and mushrooms, was categorized as ≤1, 2–4 and 5 times/week. Beverage intake including coffee and green tea was categorized as ≤1, 2–3 and ≥3 cups/day. Intensity of physical activity in leisure time was categorized as 0, <6, 6–11.9, 12–23.9 and ≥24 METs/week. Age at menarche was classified as ≤12, 13 and ≥14 years old, parity as 0, 1–2 and ≥3, and age at first childbirth as <25, 25–29 and ≥30 years old. Education level was categorized as high school or less, two-year college, and university or higher.

In analysis of SNPs, accordance with the Hardy-Weinberg equilibrium was checked in controls using a chi-squared test. The associations between genotype and the risk of breast cancer were estimated by computing OR and the 95% CI from logistic regression analyses. Per allele OR was calculated using 0, 1 or 2 copies of the risk allele (a) as a continuous variable. The reported OR and 95% CI denote the risk difference when increasing the number of risk alleles by one. Two models of analyses were performed, with the first model adjusted only for age and the second model adjusted for factors that were significantly associated with breast cancer risk in this study (multivariate adjustment).

For SNPs associated with breast cancer, we classified subjects as risk allele carriers or non-risk allele carriers and examined associations of lifestyle factors with breast cancer risk in these subgroups. Two models were also used in this analysis, with the second model adjusted for factors that were significantly associated with breast cancer risk in the first model.

All statistical analyses were performed with Statistical Analysis System software JMP version 9.0.3 (SAS Institute).

Results

A total of 515 patients and 527 controls agreed to participate in the study and gave written informed consent. Of these women, 476 patients (92.4%) and 464 controls (88.8%) returned self-administered questionnaires. In 2 cases, blood samples could not be obtained because of brittle vessels and in another 2 cases SNP genotyping could not be performed because of poor DNA amplification. Thus, the final data set for analysis included 472 patients and 464 controls with completed questionnaires and SNP genotyping.

Adjusted OR with 95% CIs for lifestyle factors are shown in Table 1. BMI ≥24 (vs. 20–21.9) and current or former smoker (vs. never) were associated with a significantly increased risk for breast cancer. Meat intake ≥2 times/week (vs. ≤once/week), mushroom intake (vs. ≤once/week), yellow and green vegetable intake (vs. ≤once/week), coffee intake 2–3 cups/day (vs. <1 cup/day), green tea intake 2–3 cups/day (vs. <1 cup/day), current leisure-time exercise (vs. none), intensity of physical activity in leisure-time exercise 6–23.9 METS/week (vs. 0 METS/week), and university education (vs. high school or less) were all associated with a significantly decreased risk for breast cancer. Height, alcohol intake, age at first menstruation, parity, age at first birth, and familial history of breast cancer have generally been considered to be associated with breast cancer risk, but did not show a significant association in this study.

Table 1 Adjusted odds ratios and 95% confidence intervals for lifestyle factors in 472 cases and 464 controls (recruitment period: December 2010 to November 2011)

In analysis of SNPs, deviation from the Hardy-Weinberg equilibrium (P <0.05 by chi square test) was found for rs1800054 and rs1045485, and thus these SNPs were excluded from analysis. The minor allele frequencies were <0.05 for rs4666451 and rs104548, and these SNPs were also excluded, leaving 12 SNPs for analysis. Multivariate ORs were adjusted for factors that were found to be significantly associated with breast cancer: BMI, smoking status, meat intake, mushroom intake, yellow and green vegetable intake, coffee intake, green tea intake, leisure-time exercise and education level.

Age adjusted ORs and multivariate ORs with 95% CIs for independent SNPs in all subjects and in subjects stratified by menopausal status are shown in Table 2. In all women, three SNPs were significantly associated with breast cancer risk in multivariate adjustment: rs2046210 (per allele OR = 1.37 [95% CI:1.11-1.70]), rs3757318 (per allele OR = 1.33 [1.05-1.69] and rs3803662 (per allele = 1.28 [1.07-1.55]). rs2046210 and rs3757318, both of which are located on 6q25.1, are not in strong linkage disequilibrium (LD) (D = 0.68, r2 = 0.21) according to Hap-Map JTP [19]. Among pre-menopausal women, s3803662 (per allele OR = 1.58 [95% CI: 1.17-2.16]) and rs2046210 (per allele OR = 1.70 [95% CI: 1.24-2.35]) were significantly associated with breast cancer risk in multivariate adjustment. Among post-menopausal women, there were no SNPs significantly associated with breast cancer risk.

Table 2 Odds ratio with 95% confidence intervals for individual SNPs in all subjects and in subjects stratified by menopausal status

A subgroup analysis was performed for rs2046210 and rs3757318. For rs2046210, leisure time exercise was associated with a significantly decreased breast cancer risk in risk allele carriers (AA + AG), but not in non-risk allele carriers (GG). In contrast, BMI ≥ 24 and current smoking were associated with a significantly increased breast cancer in non-risk allele carriers (GG), but not in risk allele carriers (AA + AG). Intensity of physical activity in leisure exercise of 12.0-23.9 METS/week and university education were associated with breast cancer risk in risk allele and non-risk allele carriers (Table 3). For rs3757318, BMI ≥ 24 was associated with a significantly increased breast cancer risk in risk allele carriers (GG), but not in risk allele carriers (AA + AG). University education and current smoking were associated with breast cancer risk in risk allele and non-risk allele carriers (Table 4).

Table 3 Age-adjusted odds ratio and multivariate adjusted odds ratio with 95% confidence intervals for lifestyle factors in rs2046210
Table 4 Age-adjusted odds ratio and multivariate adjusted odds ratio with 95% confidence intervals for lifestyle factors in rs3757318

Discussion

Associations of breast cancer risk with lifestyle factors and SNPs alone and in combination were examined in a case–control study in 472 patients and 464 controls. Reproductive factors such as early age at first menstruation, late age at menopause, late age at first birth, nulliparity, and no breastfeeding have been associated with an increase in breast cancer risk [20], including in the Japanese population [21]. In our study, parity and breastfeeding showed a tendency for an association with decreased breast cancer risk, but this association was not significant; and age at first menstruation, age at first birth, and age at menopause were not significantly associated with breast cancer risk. In most previous studies, comparisons were made using categories for age at first menstruation of 12–13 and >15 years old [22] and age at first birth of ≤24 and >30 years old [23]. In the current study, the sample sizes for women who were >15 years old at first menstruation and >30 years old at first birth were too small to analyze correctly, which is a limitation in the study.

The associations of food and nutrition with breast cancer risk have been summarized by the WCRF/AICR [3]. The effects of some foods on breast cancer are unclear, but we found that intake of meat, mushrooms, yellow and green vegetables, coffee and green tea was associated with decreased breast cancer risk. The evidence that alcohol is associated with breast cancer was judged to be “convincing” by the WCRF/AICR, but we did not find this association, which is consistent with other Japanese studies. The frequency and amount of food consumption depends on cultures and customs in different countries, and this may cause the factors and threshold level for breast cancer risk to also vary in the respective countries.

Cigarette smoking [24, 25] is also considered to be associated with increased breast cancer risk, while leisure-time exercise [26] is associated with decreased breast cancer risk, including in the Japanese population. The mean BMI of the Asian population, including the Japanese population, is lower than that in non-Asians [27]. However, we found that BMI ≥24 is associated with increased breast cancer risk, as found in other Japanese studies [28].

A high education level has been associated with increased breast cancer risk, but this may be explained by highly educated women having a high rate of nulliparity and being older at first birth. However, in Japan, social advances and college attendance have only become more common for women in recent years, and thus education level may not correlate well with social status and an unwed state. Instead, more highly educated women are more likely to be involved in preventive health behavior such as exercise, non-smoking, no alcohol intake and avoidance of obesity, compared to women with less education, and some studies have associated a higher education level with a decreased breast cancer risk [29, 30].

The current study has several limitations. First, selection bias may have influenced the results because we enrolled women who underwent breast cancer screening as controls. In Japan, the rate of breast cancer screening was no more than about 25% in 2010 [31]. Thus, women who undergo screening may have more interest in trying to maintain their health and may have a family history of cancer, which may have eliminated the significant association of a family history of breast cancer with breast cancer risk in our study. Second, recall bias may have influenced the results because of the use of self-administered questionnaires. In particular, data from patients might lack accuracy because their answers reflected their behavior before diagnosis.

In all subjects, 3 of the 16 SNPs analyzed in the study were significantly associated with breast cancer risk. These included rs2046210 and rs3757318, which are located at 6q25.1, in proximity to the estrogen receptor 1 gene (ESR1). ESR1 encodes an estrogen receptor (ERα), a ligand-activated transcription factor composed of several domains important for hormone binding, DNA binding, and activation of transcription [32]. ERα is mainly expressed in the uterus, ovary, bone, and breast in females [33], ERα is also overexpressed in 60-70% of cases of breast cancer and is involved in the disease pathology. Although these SNPs are located in the same chromosome region, they are not in strong LD based on the HapMap Project. Potential involvement of both SNPs in regulation of ESR1 is unclear [14, 34]. rs2046210 is located 29 kb upstream of the first untranslated exon. The risk allele frequency of rs2046210 is 33.3% in Europeans (HapMap-CEU), 37.8% in Chinese (Hap Map-HCB) and 30.0% in Japanese (HapMap-JTP) [19]. Our result indicated a 27% risk allele frequency, which was about the same as that in HapMap-JTP. Thus, the risk allele frequency of Asians differs little from that of Europeans. Several studies have associated rs2046210 with breast cancer risk [15, 3436]. Guo et al. found a significant association between rs2046210 and breast cancer risk in the overall population (per allele OR 1.14, 95% CI =1.10–1.18) and in Asians (per allele OR 1.27, 95% CI =1.23–1.31) and Europeans (per allele OR 1.09, 95% CI =1.07–1.12), indicating that rs2046210 has a larger effect in Asians [34]. Our results also suggest that rs2046210 is significantly associated with breast cancer risk in Japanese subjects.

Turnbull et al. first reported a significant association of rs3757318 with breast cancer risk [11]. rs3757318 is located 200 kb upstream of ESR1. The risk allele frequency of rs3757318 is 6.6% in Europeans (HapMap-CEU), 33% in Chinese (HapMap-HCB) and 25% in Japanese (HapMap-JTP) [19]. We found a 22% risk allele frequency, consistent with HapMap-JTP. Thus, the risk allele frequency for rs3757318 varies between Europeans and Asians. In an analysis of the association between rs2046210 and rs12662670 as a surrogate for rs3757318 and breast cancer risk, Heins et al. found that that per allele OR for rs3757318 was higher in Asians (1.29, 95% CI 1.19–1.41) than in Europeans (1.12, 95% CI 1.08–1.17) [31]. These results suggest that screening for the rs3757318 genotype may be important in Asian women.

We also found that SNPs associated with breast cancer differed with regard to menses state, with rs2046210 and rs3803662 associated with breast cancer risk in premenopausal women. rs3803662 lies 8 kb upstream of TNRC9 and was found to have a significant association with breast cancer risk by Easton et al. [12]. TNRC9 is located on chromosome 16q12 and consists of seven exons. The protein encoded by this gene is a member of the high mobility group box (HMG-box) family. TNRC9 is expressed in brain and breast tissue, and has a higher expression level in breast cancer compared to that in normal tissue [37]. The risk allele frequency of rs3803662 is 24% in Europeans (HapMap-CEU), 72% in Chinese (HapMap-HCB) and 60% in Japanese (HapMap-JTP) [19]. Thus, Asian populations have a higher risk allele frequency than Europeans. However, Chen et al. found that rs3803662 was significantly associated with breast cancer in Europeans [17], but that this relationship was unclear in Asians [38]. Among the breast cancer-associated SNPs found in the current study, rs2046210 and rs3757318 are located near ESR1 and are related to breast cancer risk in Asians. To examine whether lifestyle factors associated with breast cancer risk vary in risk allele and non-risk allele carriers, we performed a subgroup analysis. Leisure-time exercise were associated with a decreased breast cancer risk in rs2046210 risk allele carriers. Although low-penetrance susceptibility SNPs may confer only a small effect on breast cancer risk alone, the risk for development of breast cancer in a risk allele carrier is about 1.2-1.3 fold higher than that in non-carriers. However, our results suggest that risk allele carriers can reduce their breast cancer risk through exercise, whereas obesity and smoking may increase breast cancer risk in non risk-allele carriers. An understanding of the mechanisms underlying the different lifestyle factors associated with breast cancer in rs2046210 and rs3757318 risk allele and non-risk allele carriers may clarify the effects of these SNPs located near ESR1. Examination of interactions between SNPs and lifestyle factors in a larger Japanese population is needed to confirm the current findings for SNPs, lifestyle factors and breast cancer.

Conclusions

This case–control study showed that rs2046210 and rs3757318 located near the ESR1 gene and rs3808662 located on TNRC9 are associated with breast cancer risk in Japanese women. Our results suggest that leisure-time exercise can reduce the breast cancer risk in rs2046210 risk allele carriers, whereas smoking and obesity may increase the breast cancer risk in non-risk allele carriers. Further studies are required to confirm the validity of the association of these SNPs and lifestyle factors with breast cancer risk in the Japanese population.