Background

Breast cancer is common in the United States (US), with approximately one in eight women diagnosed during their lifetime [1]. Incidence increases with age, but roughly 18% occur before the age of 50 years [1]. Breast cancer risk factors have been reported to vary across age of onset [2, 3], which suggests that there is etiologic heterogeneity of breast cancer associated with age at onset (young-onset vs. older-onset).

According to a provocative hypothesis proposed by Trichopoulos in 1990, some breast cancers may etiologically originate in utero [4]. He proposed that prenatal exposure to endogenous or exogenous hormones might influence the trajectory of breast cancer development later in life, implying that a critical window for carcinogenesis of the breast might begin during prenatal life. For example, intrauterine exposure to insulin-like growth factors has been proposed to have a role in breast cancer risk [5]. Evaluating the effects of very early life exposures poses methodological challenges, since few investigators have had data to characterize the intrauterine environment and relate that to a subsequent breast cancer diagnosis. However, some epidemiological studies have examined this hypothesis by using surrogate markers for in utero exposures to steroids and growth hormones.

One example of this is preeclampsia, which may be associated with high maternal concentrations of androgens and low concentrations of estrogen [6, 7]. Because preeclampsia is also a complication that may require early termination of the pregnancy [8], daughters born from a preeclamptic pregnancy may, on average, have had hormonally different gestational environments due both to lower levels and to shorter gestations. A meta-analysis [9] assessing the role of maternal preeclampsia on the offspring’s later risk of breast cancer found a significant inverse association (risk ratio (RR) 0.48; 95% CI 0.30–0.78) [9]. However, results from studies [10,11,12,13,14] assessing the impact of maternal preeclampsia on daughters’ risk specifically for young-onset breast cancer have been inconsistent.

Other perinatal factors, such as birth weight and birth length, might reflect in utero exposure to growth hormones [15]. Most of the research studies evaluating the association between birth weight and risk among young women have reported a modest increase in breast cancer risk associated with high birth weight [14, 16,17,18,19,20,21,22,23]. Another perinatal factor that may influence a woman’s subsequent risk of breast cancer is exposure to breast milk [24]. A meta-analysis undertaken to assess the relationship between breastfeeding during infancy and adult breast cancer incidence found no overall relationship (RR 0.94; 95% CI 0.85–1.04) but found evidence for a reduced risk of premenopausal breast cancer (RR 0.88; 95% CI 0.79–0.98) [25]. Other feeding modalities in infancy have been evaluated to gain a better understanding of their impact on breast cancer risk. Soy contains phyto-estrogens, and a population-based case-control study [26] assessed the association between soy formula consumption during infancy and breast cancer and reported that being fed only soy formula during the first 4 months of age was not associated with an increased odds of having developed breast cancer (OR 0.42; 95% CI 0.13–1.40). Other perinatal factors that have been previously studied as potential risk factors for breast cancer risk include twin membership, parental age at delivery, and gestational length [13, 14, 27].

Evaluating the role of rare perinatal factors such as preeclampsia and twin membership on young-onset breast cancer requires a large sample of women. Many of the previous studies of perinatal factors and breast cancer risk have used registry data [11, 12, 14, 27, 28] while others have been hampered by small numbers of breast cancer cases. While using registry data can provide a large sample size and reduces the influence of recall and selection bias, data on some perinatal factors may not be available. In addition, confounder adjustment in the analysis is necessarily limited to variables that were collected in the registry’s database. We conducted a sister-matched case-control analysis of participants in the Two Sister Study to assess the effect of perinatal factors on young-onset (under age 50) breast cancer risk. The sister-matched case-control design provides a unique opportunity to partially control for possible confounding factors not measured in the study that could tend to be shared among sisters, such as their mother’s body mass index (BMI).

Methods

Study population and design

Participants in The Two Sister Study were enrolled between 2008 and 2010. Participants were eligible as cases if they had been diagnosed with breast cancer before 50 years of age and within the previous 4 years and had one or more unaffected sisters who had enrolled in the Sister Study (2003–2009). Sister Study participants had to be 35–74 years of age at the time of enrollment and had to live in the United States or Puerto Rico. Participants in the Sister Study had reported on their sister’s earlier breast cancer diagnosis at the time of their enrollment, and the list of families to target for recruitment into the Two Sister Study was based on that information. The sisters with a previous breast cancer diagnosis who appeared to be eligible were contacted through their participating sister, who was asked to send their affected sister a letter inviting their participation. Case sisters who then contacted the study and were found to be eligible were administered the same computer-assisted telephone interviews used in the Sister Study and another questionnaire about their breast cancer diagnosis. Their consent was requested for retrieval of their breast cancer medical record, which was used to collect pathological data on tumor characteristics (including estrogen receptor (ER) status). When medical records were not provided, we relied on self-report. We were able to obtain medical records or pathology reports to confirm the diagnosis for 88.7% breast cancer cases, and we relied on self-report for the remaining 11.3%. Very high positive predictive values for self-reported breast cancer (> 99.0%) have been reported for incident breast cancer cases in the Sister Study [29].

Because of the sampling design, controls were often older than their case sister. To address this potential source of bias, we restricted our controls to participants who had remained disease-free (breast cancer) up to the age at which their case sister was diagnosed with breast cancer, so that either could have been the case. The sample for the Two Sister Study was augmented by recruiting additional incident young-onset cases that were diagnosed during follow-up of the Sister Study cohort through September 2016, netting an additional 243 cases, many of whom also had an unaffected Sister Study sister who was also being followed. Those that did not have a control sister were nonetheless included in the case-only analyses to study etiologic heterogeneity. In total, 1759 cases and 1672 sister controls (including the total numbers to be included in matched analyses or case-only analysis) were enrolled in the Two Sister Study.

Exposures of interest

Perinatal factors obtained via questionnaire at study enrollment included participant’s birth weight, used to identify those with high birth weight (≥ 8.8 pounds) or low birth weight (≤ 5.5 pounds); participant’s gestational age at birth, used to categorize them as less than 38 weeks or not; and whether their gestation was complicated by diethylstilbestrol exposure, preeclampsia, eclampsia, gestational hypertension, or gestational diabetes. We dichotomized gestational age into < 38 weeks or ≥ 38 weeks because the question women had been asked was as follows: Were you born within 1 week of your mother’s due date, more than 1 week before her due date, or more than 1 week after her due date? If the participant reported that she was born early or late, she was further asked the number of weeks (less than 2 weeks, 2–4 weeks, 1–2 months, more than 2 months, I do not know). We also ascertained some postnatal exposures: having been breastfed as an infant and history of soy formula consumption during infancy.

Data on perinatal and postnatal exposures were from a “Family History” questionnaire mailed to all participants at enrollment. Participants were encouraged to speak to their mothers (or other family members) to improve accuracy and most questions allowed qualified answers (e.g., options of definitely yes, probably yes, probably no, definitely no). In addition, we carried out a validation study (manuscript in preparation) in which mothers of participants who were under 60 at enrollment were asked to complete the same questionnaire. Agreement was good for most variables, with kappas ranging from 0.5 to 0.9, but mother’s data was also retrospectively reported. Examiners retrieved these questionnaires from Sister Study participants when conducting home visits for blood collection. Because most cases were identified retrospectively and had no home visit, they had to mail in their questionnaire. The cases consequently had more missing data than controls.

Covariate data

Data on participants’ demographics (e.g., age, race/ethnicity) and height and weight (used to calculate body mass index) were obtained at enrollment.

Statistical analysis

We provide descriptive statistics for cases and controls separately for each pre- and perinatal factor. Since there was a small fraction of data missing for some of the covariates and exposures of interest, we conducted a complete case analysis [30]. We used conditional logistic regression to calculate odds ratios (OR) and 95% confidence intervals (CI) to explore the association between perinatal factors and young-onset breast cancer. Each pre- and perinatal factor was considered as the primary exposure variable in a separate regression model. Variables of known biological importance in the perinatal factors-breast cancer relationship were evaluated for inclusion as potential effect modifiers (e.g., maternal smoking while pregnant with the study participant). In addition, we used directed acyclic graphs to identify potential confounders. For the analyses of preeclampsia, gestational hypertension, and their composite, i.e., “any” hypertensive disorder, we adjusted for whether or not the participant was their mother’s first baby and whether or not the mother had smoked in that pregnancy. For analyses of low and high birth weight and short gestational length, we adjusted for preeclampsia and for maternal smoking in that pregnancy. In a further sensitivity analysis, we tried to isolate the effect of gestational preeclampsia by restricting the preeclampsia analysis to those participants who had not themselves had a preeclamptic pregnancy. A total of 341 women (155 cases and 186 controls) had a preeclamptic pregnancy themselves and were excluded from this analysis. The reasoning for that approach was that participants from a preeclamptic pregnancy may have inherited a propensity to develop this complication in their own pregnancies, and the experience of having a preeclamptic pregnancy may confer reduced breast cancer risk [31]. We repeated these analyses with restriction to invasive cases and their sisters (N = 2717, 1250 cases and 1467 controls) and stratifying by ER status.

We also conducted case-only analysis to explore etiologic heterogeneity in the relationship between perinatal factors and the ER status of the breast tumor [32]. The case-only analysis used unconditional logistic regression, treating ER− breast cancer as the outcome. We also carried out a sensitivity analysis where we restricted to invasive cases. All the statistical analyses were conducted using STATA version 15.1 (Stata Corp, College Station, TX) and SAS version 9.4 (SAS Institute, Cary, NC).

Results

Table 1 shows selected characteristics of cases and controls. Control sisters were slightly older than their case sisters (46.3 vs. 45.0 years old at the time of the case sister’s diagnosis). The percentages of cases born with a low birth weight (8.4%) or a high birth weight (9.4%) were slightly higher than those for control sisters (6.7% low birth weight and 7.3% high birth weight). More cases than controls were born with a gestation less than 38 weeks (15.2 vs. 11.5%).

Table 1 Baseline characteristics and in-utero exposures by case status (N = 3431)

In general, we observed weak associations between perinatal factors and breast cancer (Table 2). We excluded gestational diabetes from consideration because few women reported it. There was a positive association between breast cancer and high birth weight (OR = 1.59, 95% CI 1.07–2.36). Participants who had a short gestational length (< 38 weeks) (OR = 1.26, 95% CI 0.83–1.92) and those whose mother smoked during their gestation (OR = 1.19, 95% CI 0.81–1.74) had estimated positive associations with breast cancer. After adjusting for possible confounding by whether they were their mother’s first child and maternal smoking during that pregnancy, we observed a positive association between being born to a pregnancy with pre-eclampsia and breast cancer risk (OR = 1.92, 95% CI 0.82–4.51). Further excluding from this analysis participants who themselves had had a preeclamptic pregnancy, led to a slightly increased estimated odds of breast cancer (OR = 2.28, 95% CI 0.89–5.82). Prenatal exposure to DES during pregnancy (OR = 0.82, 95% CI 0.44–1.54) and having been breastfed (OR = 0.76, 95% CI 0.58–1.00) were associated with a slightly reduced odds of breast cancer. Neither having being fed only soy formula during infancy nor having a low birth weight was associated with breast cancer (OR = 1.04, 95% CI 0.66–1.65; OR = 1.05, 95% CI 0.70–1.59, respectively). In a sensitivity analysis that excluded the sibships with in situ breast cancer, similar associations were observed with most perinatal factors. However, the point estimates associated with being born to a pregnancy with pre-eclampsia, high birth weight, and any hypertensive disorder increased (Table 3).

Table 2 Breast cancer odds ratio (ORs) in relation to participant’s perinatal exposures
Table 3 Breast cancer odds ratio (ORs) in relation to participant’s perinatal exposures, excluding in situ cases

Table 4 displays results from case-only analyses. We observed that cases born to a mother with pre-eclampsia in that pregnancy (OR = 2.23, 95% CI 0.94–5.30) were at increased odds for ER-negative tumors. Additionally, cases who had been born with a high birth weight (OR = 0.64, 95% CI: 0.35–1.17) were at somewhat reduced odds of having an ER-negative tumor. We observed a near-null association between ER-negative tumors and prenatal exposure to DES (OR = 0.91, 95% CI: 0.38–2.21) and maternal smoking (OR = 1.08, 95% CI 0.81–1.45). We observed similar associations when restricting to participants with invasive cancer (Table 5). Stratified analysis by ER status (Table 6) showed that the overall association appears to be largely driven by ER-negative cancers. In addition, the overall positive association between breast cancer and high birth weight persisted among ER-positive case subjects (OR = 1.68, 95% CI 1.07–2.62).

Table 4 Case-only odds ratios for estrogen receptor-negative breast cancer in relation to participant’s exposures
Table 5 Case-only odds ratios for estrogen receptor-negative breast cancer (excluding in situ) in relation to participant’s exposures
Table 6 Breast cancer odds ratio (ORs) in relation to participant’s in utero exposures by ER status, with counts including only participants with complete information on exposure of interest

Discussion

In the Two Sister Study, we observed that maternal preeclampsia and high birth weight showed some evidence of a positive association with young-onset breast cancer. We observed a weak inverse association between having been breastfed during infancy and young-onset breast cancer. Additionally, women born from pregnancies with preeclampsia or gestational hypertension who did develop young-onset breast cancer appeared to be at increased odds for the difficult-to-treat ER-negative subtype. Restricting attention to invasive breast cancer strengthened those associations.

Among the studies that have assessed the relationship between birth weight and young-onset breast cancer [12, 14, 16,17,18,19,20,21,22,23, 33], a positive association with birth weight was detected in the majority of them [14, 16,17,18,19,20,21,22,23]. Michels and Xue conducted a systematic review of some studies published in this area and noted a positive association between birth weight and risk of breast cancer [15]. Of the articles included that had evaluated only young-onset breast cancer, they reported that, overall, high birth weight appeared to be associated with increased risk (OR 1.25; 95% CI 1.14–1.38). Three of the studies [14, 16, 17] included in their meta-analysis reported a J-shaped or a U-shaped association between birth weight and breast cancer. The results of our study, which is based on women with breast cancer diagnosed before the age of 50, align with those previous results.

It has been proposed that the biological mechanism that underlies this birth weight association encompasses a complex interplay between the influences of growth factors and steroid hormones on mammary cells and epigenetic changes [15]. Maternal circulating hormones, such as estradiol and estriol, have been positively associated with birth weight in previous studies [34,35,36]. Certain metabolic byproducts of estrogen may be mutagenic [37] and may promote cell proliferation and cell differentiation [38]. Also, the levels of estriol in the umbilical cord have been reported to be positively correlated with birth weight [36]. Thus, the intrauterine environment of high birth weight babies may alter the normal mammary cell growth and impact later breast cancer risk. Other mechanisms that may explain the observed association between high birth weight and breast cancer involve growth factor pathways [39]. Cord blood concentrations of insulin-like growth factor-1 (IGF-1) have been reported to be positively associated with birth weight [40, 41]. In addition, IGF-1 is thought to be an important factor in fetal growth [42] and to be a potent mitogen of the mammary gland [43]. Based on their critical role in fetal growth and in the development of the mammary gland, IGF-1 and estrogen may play a role in the initiation and promotion of breast tumors.

Gestational length has been studied in association with risk of breast cancer [9, 44]. Maternal hormone levels consistently increase with gestational age [35]. Gestational length consequently impacts the fetus’ cumulative exposure in utero to maternal endogenous hormones, which could influence their breast cancer risk. Evidence from two meta-analyses reported no association between gestational length and breast cancer risk [9, 44], while results from the individual studies included in these articles provide heterogeneous evidence of both an increased [27] and decreased [19] risk of breast cancer associated with shorter gestational length. We did not observe a statistically significant association between gestational length and early-onset breast cancer. Our results are thus consistent with the previous meta-analyses [9, 44].

Inconsistent results have been reported in previous studies assessing the association between pre-eclampsia and daughter’s breast cancer risk [9, 12, 14]. Two studies with a small number of exposed participants reported a non-significant positive association, but the limited sample sizes of these studies limited their statistical power [10, 12]. Our study also found a positive association with breast cancer. This result contrasts with results from two record linkage studies [11, 27] from Sweden that reported a reduction in breast cancer risk among women whose mothers had pre-eclampsia or eclampsia during the participant’s gestation. These studies included both women with pre- and post- menopausal breast cancer. Additionally, the populations being analyzed overlapped partially. The discrepant results observed in our study could be due in part to the different age at breast cancer diagnosis in our study compared to both studies conducted by Ebkom and colleagues [11, 27] as young-onset and late-onset breast cancer can have different etiologies [45, 46]. One other consideration is that some women born to a preeclamptic pregnancy may have inherited a tendency to develop that complication in their own pregnancies and that particular reproductive history has been associated with reduced risk of breast cancer [31], reducing any increase that might have been due to the prenatal exposure. When we excluded women who themselves had a preeclamptic pregnancy, the association seen in our data was somewhat strengthened.

The results from our study are subject to some limitations. First, exposures were self-reported, which could have led to misclassification of the exposures. Also, limited numbers with the exposure prevented us from assessing gestational diabetes as a potential exposure.

Our results are strengthened by our study design, and a large number of young-onset breast cancer cases contributing to a case-only analysis. We had sufficient numbers to evaluate the association of perinatal factors with ER status among women affected by young-onset breast cancer. Finally, the sister-matched design helped us control for unmeasured confounding.

Conclusion

In summary, in the present study, we observed small and imprecisely estimated positive associations between two prenatal exposures (high birth weight and being gestated in a preeclamptic pregnancy) and young-onset breast cancer. These results provide indirect support for the hypothesis that prenatal exposures affect the subsequent risk of breast cancer in young women. In addition, our results suggest that being born to a preeclamptic pregnancy may increase risk for young-onset breast cancer, especially for the ER-negative subtype. Future epidemiological research on the connection between maternal hypertensive disorders, birth weight, and young-onset breast cancer is warranted to help us gain a better understanding of the etiology of the disease.