Background

Data from recent prospective [1, 2], case-control [3,4,5,6], and case-only studies [7,8,9] suggest that parity increases risk of hormone receptor-negative breast cancer, while breastfeeding decreases risk (reviewed in [10]) or attenuates the detrimental effect of parity [3, 5, 11]. A general limitation of many prior prospective studies on these associations [1, 2, 12,13,14,15,16] is the inclusion of relatively few estrogen receptor (ER)-negative cases, given that only ~ 20% of tumors lack ER expression. Prospective data on the combination of higher parity and history of breastfeeding and ER− breast cancer risk are sparse [1, 11]. The African American Breast Cancer Epidemiology and Risk (AMBER) Consortium [11], including both case-control [4, 5] and cohort studies [1, 17], is the largest study to date (ER+, n = 2446; ER−, n = 1252). Within the consortium, parity was associated with increased risk of ER− breast cancer; this risk was attenuated among women who reported ever breastfeeding. Further, breastfeeding was only inversely associated with ER− disease; no association was observed for ER+ breast cancers.

To our knowledge, there are no prospective data on the combined effects of parity and breastfeeding by molecular phenotype (e.g., luminal subtypes, HER2-enriched, basal-like). Case-control [5, 7, 18] and case-only [9] studies suggest an increased risk of basal-like breast cancer only among parous women who did not breastfeed and an inverse association between breastfeeding and basal-like disease. In an earlier investigation of reproductive factors in the Nurses’ Health Studies (NHS, NHSII), we observed an inverse association between breastfeeding and basal-like breast cancer, but did not investigate the combined effect of parity and breastfeeding [13].

If breastfeeding is confirmed to reduce risk of hormone receptor-negative breast cancer, it represents an accessible and cost-effective risk-reduction strategy for this aggressive subtype. We hypothesized heterogeneous associations between parity and breastfeeding and ER+ and ER− breast cancer risk, with inverse associations hypothesized between parity and ER+ disease risk and between breastfeeding and ER− disease risk, and a positive association between parity and ER− breast cancer subtypes among women who did not breastfeed. Given intriguing, but inconsistent, prior results on the association between parity, breastfeeding, and breast cancer by subtype, and sparse prospective data on the combined effect of higher parity and breastfeeding, we examined these associations in the NHS and NHSII among almost 200,000 women.

Methods

Study population: the Nurses’ Health Study and Nurses’ Health Study II

The NHS was initiated in 1976 when 121,701 registered nurses, ages 30–55, completed and returned a mailed questionnaire [19]. The NHSII began in 1989 with 116,429 female registered nurses ages 25–42, using the same protocols. Participants have been followed biennially to update information on lifestyle and reproductive factors, and ascertain disease diagnoses. Cumulative follow-up rates are ≥ 94% in both cohorts.

Parity and breastfeeding assessment

Participants reported pregnancies and breastfeeding on the biennial questionnaire (pregnancies: NHS 1976–1984, 1996, and NHSII 1989–2009; breastfeeding NHS 1986 and NHSII 1993, 1997). Parity was investigated as nulliparous vs. parous and total number of pregnancies (1, 2, 3, 4+ children); breastfeeding was investigated as nulliparous/never vs. ever and total duration (< 6, 7–11, ≥ 12 months). Further, we cross-classified parity with breastfeeding (nulliparous, parous/ever breastfed, parous/never breastfed; and nulliparous, never breastfed and 1, 2, or ≥ 3 children, ever breastfed and 1, 2, or ≥ 3 children).

Covariate assessment

Age at each questionnaire was calculated using date of birth and questionnaire return date. Age at menarche and height were collected on baseline questionnaires. Other covariates included (date of collection) weight at age 18 (NHS 1980; NHSII 1989), height (NHS 1976; NHSII 1989), weight (biennially), oral contraceptive use (OC; NHS biennially until 1982; NHSII biennially), postmenopausal hormone therapy (HT; biennially), alcohol consumption (every 4 years, beginning in NHS 1980; NHSII 1991), age at first birth (NHS 1976 and biennially until 1982; NHS 1989 and biennially until 2011), menopausal status and age at menopause (biennially), diagnosis of benign breast disease (biennially), and family history of breast cancer (NHS 1982, and every 4 years beginning in 1988; NHSII 1989, and every 4 years beginning in 1997). Body mass index (BMI, kg/m2) at age 18 was calculated using self-reported weight and height.

Breast cancer case ascertainment

Participants reported disease status on the biennial questionnaires. Cases included in this analysis reported no prior cancer diagnosis (except non-melanoma skin) and were diagnosed with invasive breast cancer through June 2014 (NHS) or June 2013 (NHSII). A total of 12,452 (NHS n = 8807; NHSII n = 3645) incident breast cancer cases were identified; 82% had data on tumor ER status (ER+ n = 8235; ER− n = 1978). A study physician confirmed breast cancer cases and extracted invasive vs. in situ, hormone receptor status, and tumor characteristics from the medical record. Vital status was ascertained through June 2015, using the National Death Index. Of the 12,452 breast cancer cases identified, 11,369 (91%) were confirmed through medical record review; remaining cases were participant confirmed. Given the high confirmation rate by medical record for breast cancer in this cohort (99%), both medical record and participant-confirmed cases are included.

Tumor tissue collection

Details of tumor tissue collection have been detailed previously [20]. Tumor tissue samples were available from 4058 breast cancer cases (NHS, n = 3126; NHSII, n = 932). Tissue microarrays (TMAs) were created at the Dana Farber/Harvard Cancer Center (Boston, MA) with three 0.6-mm cores taken from each breast tumor included in the TMA block.

Immunohistochemical (IHC) analyses

IHC staining for ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2), epidermal growth factor receptor (EGFR), and cytokeratin (CK) 5/6 was performed using a Dako Autostainer (Dako Corporation, Carpinteria, CA). IHC methods have been described previously [21]. Immunostained TMA slides were evaluated by a study pathologist. Tumor sections with any nuclear staining for ER or PR were classified as ER+ or PR+; ER− or PR− had complete absence of staining. Tumors were considered HER2+ when more than 10% of cells showed moderate or strong staining. Tumors were classified as CK5/6+ or EGFR+ if there was evidence of cytoplasmic or membranous staining.

Classification of molecular subtypes

ER, PR, HER2, CK5/6, EGFR, and tumor grade were used to classify tumors into molecular subtypes; classification by IHC is a reasonable proxy for gene expression data [22,23,24]. Cases that were ER+ and/or PR+, HER2− and of histologic grade 1 and 2 were classified as luminal A cancers; cases that were either (a) ER+ and/or PR+ and HER2+ or (b) ER+ and/or PR+, HER2−, and of histologic grade 3 were classified as luminal B; cases that were ER−, PR−, and HER2+ were classified as HER2-enriched; and cases that were negative for ER, PR, and HER2 and positive for CK 5/6 and/or EGFR were categorized as basal-like. Cases that lacked expression of all five markers were considered “unclassified.” Of the invasive tumors on tissue microarrays, 3454 could be classified into the luminal A (n = 1903; 55%), luminal B (n = 934; 27%), HER2-enriched (n = 186; 5%), basal-like (n = 341; 10%), or unclassified (n = 90; 3%) subtypes. The few unclassified tumors were excluded from further analyses.

Statistical analysis

After exclusions for prior cancer (n = 4528) or missing data on parity, age at first birth, or breastfeeding (n = 34,088), 199,514 women remained in the analysis. We calculated person-years beginning at date of baseline questionnaire return and ending at the earliest of date of diagnosis of any cancer (except non-melanoma skin cancer), date of death, or end of follow-up (NHS: June 1, 2012; NHSII: June 1, 2013). We used multivariable Cox proportional hazards models to calculate hazard ratios (HR) and 95% confidence intervals (CI) for risk of breast cancer by hormone receptor subtype (ER+, ER−; ER+/PR+, ER+/PR−, ER−/PR−) and by molecular phenotype (luminal A, luminal B, HER2-enriched, basal-like). Models were stratified by age in months, follow-up year, and cohort. Covariates were updated whenever possible throughout follow-up. Covariates in the final models were height, BMI at age 18, weight change since age 18, age at menarche, age at natural menopause, menopausal status, HT use, alcohol consumption, total physical activity, family history of breast cancer, and history of benign breast disease. Models restricted to parous women were additionally adjusted for age at first birth; in sensitivity analyses, we adjusted for years between menarche and first birth and years between last birth and minimum of current age and age at menopause. In analyses restricted to parous women, parity and breastfeeding were mutually adjusted. We further evaluated associations for ER+ and ER− disease in subgroups defined by time between last pregnancy and diagnosis.

Heterogeneity (Phet) was assessed using a likelihood ratio test comparing models assuming the same association between the exposures and breast cancer subtypes to one assuming different associations for disease subtypes [25].

P values were considered statistically significant at < 0.05; all statistical tests were two-sided. Analyses were conducted in SAS 9.3 (Cary, NC).

Results

A total of 93% of NHS participants and 67% of NHSII participants were parous at study baseline; among parous women, the majority reported ever breastfeeding (63% NHS, 79% NHSII). Median ages at recruitment for nulliparous women were 43.1 years in the NHS and 32.6 years in the NHSII; parous women were somewhat older (e.g., 2+ children, ever breastfed, median age 42.8 years in the NHS and 35.5 in the NHSII) (Table 1).

Table 1 Characteristics of the Nurses’ Health Study (NHS) and NHSII participants at baseline (NHS: 1976; NHSII: 1989)

The associations between parity, breastfeeding, and breast cancer risk differed by tumor ER status (Phet = 0.03). Parity was associated with lower risk of ER+ breast cancer (parous vs. nulliparous: HR = 0.82 [95% CI 0.77–0.88]) (Table 2), but not associated with ER− disease (HR = 0.98 [0.84–1.13]). In contrast, ever breastfeeding was only associated with reduced risk of ER− breast cancer (HR = 0.83 [0.75–0.92]), and not ER+ disease (HR = 0.99 [0.94–1.04]), relative to parous women who never breastfed and nulliparous women. Results were similar regardless of duration of breastfeeding (e.g., ≥ 12 months vs. never, ER− HR = 0.83 [0.73–0.94], Ptrend = 0.05; ER+ HR = 1.02 [0.96–1.09], Ptrend = 0.22). The association between parity and ER+ tumors did not differ by breastfeeding history (e.g., parous, never breastfed: HR = 0.83 [0.77–0.90]; parous, ever breastfed: HR = 0.82 [0.76–0.88]), and parity was not associated with ER− breast cancer risk in either breastfeeding history subgroup (parous, never breastfed: HR = 1.11 [0.94–1.31]; parous, ever breastfed: HR = 0.92 [0.79–1.08]).

Table 2 Multivariable-adjusteda hazard ratios (HRs) and 95% confidence intervals for breast cancer overall and by ER status in relation to parity and breastfeeding: Nurses’ Health Study (1976–2012) and Nurses’ Health Study II (1989–2013)

We next restricted the analyses to parous women and evaluated number of children, breastfeeding, and breast cancer risk (Table 2). Relative to women with parity of 1, only parity of 4 or higher was associated with lower ER+ breast cancer risk (≥ 4 children HR = 0.91 [0.82–1.00]). No association was observed in the other parity categories (e.g., 3 children HR = 1.03 [0.94–1.13]), or in any category for ER− breast cancer. The association between breastfeeding and breast cancer risk among parous women was similar to that observed in the analysis among all participants, including nulliparous women (e.g., ever vs. never, ER+ HR = 0.99 [0.94–1.05]; ER− HR = 0.82 [0.74–0.91]). Duration of breastfeeding was associated with a trend toward higher risk of ER+ breast cancer (Ptrend among ever breastfed = 0.02), but no significant associations were observed in categories of duration (e.g., ≥ 12 months vs. never, HR = 1.04 [0.97–1.11]). The associations between breastfeeding and ER− breast cancer risk were similar regardless of breastfeeding duration (e.g., ≥ 12 months vs. never, HR = 0.80 [0.70–0.92]; < 6 months vs. never, HR = 0.84 [0.75–0.95]; Ptrend among ever breastfed = 0.35). Patterns were similar for ER+/PR+ and ER−/PR− breast cancer (Additional file 1: Table S1). Statistical adjustment for years between menarche and age at first birth and years between age at last birth and minimum of current age and age at menopause did not impact the results (data not shown).

We next considered the joint association of parity and breastfeeding and breast cancer risk by molecular phenotypes (Table 3). For luminal A breast cancer, parity of 3 or higher, relative to nulliparity, was associated with significantly lower risk only among women who breastfed (≥ 3 children, ever breastfed: HR = 0.84 [0.71–0.99]; never breastfed: HR = 0.93 [0.77–1.12]). In contrast, for luminal B breast cancer, the association between parity and breast cancer risk was similar regardless of breastfeeding history (e.g., ≥ 3 children, ever breastfed: HR = 0.78 [0.62–0.98]; never breastfed: HR = 0.76 [0.58–1.00]). We observed the suggestion of increased risk of basal-like tumors among parous women who never breastfed (e.g., ≥ 3 children, never breastfed: HR = 1.43 [0.92–2.23]); this increased risk was not observed among those who ever breastfed (HR = 1.05 [0.70–1.57]). Risk of HER2-enriched disease was higher in parous, as compared to nulliparous, women, regardless of breastfeeding (parous/never breastfed, HR = 2.09 [1.02–4.30]; parous/ever breastfed, HR = 1.90 [0.95–3.80]).

Table 3 Multivariable-adjusteda hazard ratios (HRs) and 95% confidence intervals for molecular breast cancer subtypes in relation to parity and breastfeeding: Nurses’ Health Study (1976–2012) and Nurses’ Health Study II (1989–2013)

Overall, results were similar in the individual cohorts (NHS and NHSII). We observed significant heterogeneity between cohorts for ER+ disease (p = 0.02) in analyses considering the cross-classification of parity and breastfeeding. For ER+ disease, parity ≥ 3 among women who never breastfed was more strongly associated with breast cancer risk in the NHS (HR = 0.75 [0.67–0.84]) than in the NHSII (HR = 0.85 [0.67–1.08]); associations were similar in other categories (e.g., ≥3 children ever breastfed, NHS HR = 0.76 [0.69–0.85]; NHSII HR = 0.79 [0.69–0.89]).

Finally, we evaluated parity, breastfeeding, and breast cancer risk in strata of time between last pregnancy and diagnosis (< 10 vs. ≥ 10 years). Inverse associations between parity and ER+ disease and breastfeeding and ER− disease were observed only in women diagnosed 10+ years after last pregnancy (Additional file 1: Table S2). No associations were observed among participants diagnosed within 10 years of last pregnancy.

Discussion

This large, prospective study provides additional evidence of lower risk of ER− breast cancer among women who have breastfed and, to our knowledge, is the first prospective investigation on the joint effect of parity and breastfeeding on breast cancer risk by molecular phenotype. While breastfeeding was inversely associated with ER− breast cancer, no association was observed for the ER+ subtype. In contrast, parity was associated with lower risk of ER+ breast cancer, but not associated with ER− disease. In analyses by molecular phenotype, higher parity was inversely associated with the luminal subtypes. Parity was associated with higher risk of basal-like disease among women who did not breastfeed and HER2-enriched breast cancer risk regardless of breastfeeding history. These results provide confirmatory evidence of heterogeneity in the associations between parity and breastfeeding and breast cancer subtypes and support a risk-reducing role of breastfeeding for ER- breast cancer.

The breast undergoes proliferation and differentiation during pregnancy, in preparation for lactation. Terminal differentiation of the terminal ductal lobular unit (TDLU) occurs in the final trimester; this is hypothesized to be an important mechanism linking full-term pregnancy to long-term reduced risk of breast cancer, shielding the breast tissue from carcinogenic transformation [26]. Recent research has identified parity-related gene expression signatures in normal breast tissue [27,28,29], providing further evidence for the long-term impact of pregnancy on the breast. A “parity signature” including differential expression in inflammatory and immune-related pathways may be retained in ER+, but not ER−, tumors [27] supporting a differential role for parity in the etiology of ER+ vs. ER− breast cancer. However, data are limited and the parity-related gene signatures are not entirely consistent across studies.

Direct data describing underlying physiologic mechanisms between breastfeeding and breast cancer subtypes are limited [30]. Post-pregnancy, and at cessation of lactation, the breast undergoes involution. Involution has features of wound healing [31, 32], which has established parallels to tumorigenic microenvironments, including involvement of cytokines, growth factors, and matrix metalloproteases (MMPs) (reviewed in [33]). Breastfeeding delays the process of involution, and it has been hypothesized that the more gradual involution associated with lactation (i.e., from exclusive breastfeeding through weaning) may reduce longer-term breast cancer risk. Further, the postpartum hormonal milieu differs in breastfeeding vs. non-breastfeeding women (e.g., breastfeeding is associated with lower estradiol and higher prolactin levels for up to 12 weeks postpartum and has been associated with a slower decline in postpartum osteoprotegerin levels) [34, 35]. This postpartum hormonal milieu may differentially impact risk in the context of involution in the immediate postpartum vs. after breastfeeding and may differentially impact development of ER+ vs. ER− disease. Finally, murine models show a lower percentage of ER+ luminal cells in the mammary glands of parous, relative to virgin, mice with no change in the number of basal-like progenitor cells, though these basal-like progenitor cells demonstrated characteristics suggesting reduced carcinogenic potential in parous mice [36]. Nonetheless, involution proximate to delivery (i.e., in the absence of breastfeeding) may potentiate basal-like progenitors toward a carcinogenic pathway.

While some prior research is suggestive of a positive association between parity and ER− breast cancer, data to date are not consistent (reviewed in [37]). The AMBER consortium is the largest study to date [11]. Parity was associated with 33% increased risk of ER− disease (n = 1252; parous vs. nulliparous, RR = 1.33 [1.11–1.59]). Higher risk was observed with higher parity among women who did not breastfeed (e.g., ≥ 4 vs. 1 birth, RR = 1.68 [1.15–2.44]) but not among women reporting ever breastfeeding (RR = 1.33 [0.91–1.95]). Ever parous, or number of children, were not significantly associated with ER− breast cancer in the current study, regardless of breastfeeding. Importantly, the AMBER consortium includes women of African-American descent, whereas the NHS/NHSII study population is predominantly non-Hispanic white. ER−/PR− (and triple negative) breast cancer is more common among African American women, as are higher parity [38] and lower rates of breastfeeding [39]. Consistent with most prior prospective studies, ever breastfeeding was significantly associated with lower risk of ER− disease, and not associated with ER+ disease, in the current study. These results are in line with a recent meta-analysis of prospective studies (ever vs. never, summary RRs: ER−/PR−, 0.84 [0.72–0.97]; ER+/PR+, 1.00 [0.90–1.10]) [10]. In analyses evaluating duration of breastfeeding, similar associations were observed between breastfeeding and ER− breast cancer regardless of duration, suggesting even shorter-term breastfeeding (cumulative < 6 months) may decrease ER− breast cancer risk. In contrast, a trend toward higher risk was observed between breastfeeding duration and ER+ breast cancer, though no significant associations were observed in any of the categories of duration. To our knowledge, this has not previously been reported. Previous prospective studies have not observed significant associations between breastfeeding duration and breast cancer risk by subtype [2, 12, 14,15,16]. Additional studies with detailed data on breastfeeding duration are warranted.

Few prior studies have examined the joint effects of parity and breastfeeding and breast cancer risk by molecular phenotype [5, 7, 9, 18]. Millikan et al., in the Carolina Breast Cancer Study [5], observed higher risk of basal-like breast cancer among women with high parity in the absence of breastfeeding (parity 3+, never breastfed: OR = 1.9 [1.1–3.3]); breastfeeding attenuated the increased risk (OR = 1.3 [0.7–2.3]). A case-only investigation reported similar associations [9]. In our prior analysis by molecular phenotype in the NHS/NHSII, we observed a positive association between parity and HER2-enriched disease, but no association with the other subtypes [13]. Breastfeeding was associated with a reduced risk of basal-like breast cancer (e.g., 7+ months vs. never, among parous women, HR = 0.65 [0.49–0.87]); however, the joint effects of parity and breastfeeding were not considered. In the current investigation, consistent with prior studies, parity was suggestively positively associated with basal-like breast cancer, but only among women with high parity and who never breastfed. The positive association of parity with HER2 disease did not differ by breastfeeding.

We provide the largest prospective study to date on parity, breastfeeding, and breast cancer by subtype, and add to the limited literature on these associations by molecular phenotype. While we expect that parity was accurately reported, there may be misclassification of breastfeeding duration. We would expect any misclassification to be nondifferential. Tumor tissue to define molecular phenotype was available for 33% of the cases, resulting in small case numbers in some subgroups.

Conclusion

Our results support a role for breastfeeding in reducing risk of hormone receptor-negative breast cancer, adding to the established benefits of breastfeeding. While further research is needed to describe the mechanistic pathway resulting in this decreased risk, our investigation provides additional compelling evidence that breastfeeding is a modifiable risk factor for the breast cancer subtypes with the fewest targeted therapies and the least favorable outcomes.