Introduction

The heterogeneous nature of breast cancer highlights the importance of identifying prognostic and predictive markers for clinical implementation of tailored treatment. Recently, there has been increasing interest in the androgen receptor (AR) as a potential prognostic biomarker and treatment target [1, 2]. The prognostic role of AR has consistently been described as favorable for breast cancer in general and within the estrogen receptor-α positive (ER+) subgroup [3]. Some studies have reported that AR is prognostically beneficial irrespective of ER [4], but the results regarding the role of AR expression in the ER negative (ER−) setting diverge [2, 4,5,6,7,8]. Ongoing clinical trials are evaluating anti-androgens and selective androgen receptor modulators across different breast cancer subtypes [2, 9, 10]. It is essential to better define AR’s prognostic role in different breast cancer subtypes.

The most recent meta-analysis reported AR to be prognostically favorable irrespective of ER expression, but only included studies published until early 2015 [11]. In late 2015, we reported a differential role of AR depending on the ER status of the tumor using disease-free survival (DFS) as endpoint in a large population-based observational cohort. Patients with AR-positive (AR+) ER+ tumors had superior prognosis compared to all other AR/ER combinations. Markedly, the patients with AR+ER− tumors had worse prognosis in all adjusted models compared to patients with AR-negative (AR−) ER− tumors, and there was significant interaction between AR and ER expression with respect to DFS [8].

The aim of this study was to validate and elaborate the previous findings of a differential role for AR on the outcome depending on ER status, using breast cancer-related death as endpoint, in an independent cohort with longer follow-up, the population-based Malmö Diet and Cancer Study (MDCS). We hypothesized that AR positivity would be associated with overall favorable tumor characteristics and prognosis and that analyses stratified by tumor ER status would reveal a differential role of AR depending on ER. The primary endpoint was breast cancer mortality (BCM), and the secondary endpoint was all-cause mortality. Another aim was to investigate whether AR could add long-term prognostic information.

Materials and methods

The Malmö Diet and Cancer Study

The population-based prospective MDCS was initiated to examine associations between diet and cancer [12] and included people living in Malmö, Sweden in 1991–1996. Swedish language skills and mental abilities sufficient to understand the questionnaire were required for enrollment. The participation rate was 40% of the source population. A previous report showed lower mortality due to cancer and lifestyle-related causes among participants during recruitment and follow-up compared to non-participants [13]. Baseline data were collected from interviews, questionnaires, and examinations.

The female MDCS cohort consists of 17,035 women (born 1923–1950). Information on incident breast cancer and vital status were annually retrieved from the Swedish Cancer Registry, the South Swedish Regional Tumor Registry, and the Swedish Cause of Death Registry. Ethical permission was obtained from the Ethical Committee at Lund University (Dnr 472/2007). All participants signed a written informed consent form.

Study population and patient characteristics

This is a case-only analysis of incident primary breast cancer within the MDCS, and patients with a previous breast cancer diagnosis at enrollment (n = 576) were therefore excluded (Fig. 1). During follow-up until December 31, 2010, a total of 1016 women were identified with incident breast cancer through record linkage. Patient characteristics at diagnosis were obtained from medical records. To investigate invasive tumor characteristics in relation to survival, in situ only cancers (n = 68) and patients who received neo-adjuvant treatments (n = 4) were excluded. Patients were also excluded if they had distant metastasis at diagnosis (n = 14) or died from breast cancer-related causes ≤0.3 years from diagnosis (n = 2). Patients with bilateral cancers (n = 17) were excluded due to difficulties in evaluating the relation between tumor characteristics and prognosis. Finally, one patient who declined treatment for four years before accepting surgery was excluded. Thus, the final study population consisted of 910 patients. Information on cause of death and vital status was retrieved from the Swedish Causes of Death Registry, with last follow-up December 31st, 2014.

Fig. 1
figure 1

Flow chart of the Malmö Diet and Cancer Study (MDCS) and the study population

Tumor and histopathological analyses

Information on tumor size, grade, and axillary lymph node involvement (ALNI) was retrieved from pathology reports for tumors from 2005 and onwards. From 2008 onwards, information was also obtained for the immunohistochemistry (IHC) based markers ER, progesterone receptor (PR), proliferation index (Ki67), and human epidermal growth factor receptor-2 (HER2) status. From 1991 to 2007, ER, PR, Ki67 and HER2 were assessed using tissue microarrays (TMAs) [14]. A pathological re-evaluation was performed regarding invasiveness and grade for tumors diagnosed prior to 2006 [15]. A cut-off for positivity of >10% positively stained nuclei was applied for ER, PR, and Ki67. Regarding HER2, in situ hybridization (ISH) results were used when available. HER2 IHC was considered positive (HER2+) when annotated as 3+ and negative (HER2−) for 0 or 1+. HER2 IHC scorings of 2+ were categorized as missing if not confirmed to be amplified or normal in ISH analyses.

A TMA was constructed with two 1 mm cores from each tumor (Beecher, WI, USA). Among the 910 patients included in the study population, tumor tissue was available from 718 patients (Fig. 1). For IHC analysis of AR, 4-μm sections were automatically pretreated using the PT Link system and stained (Autostainer Plus, Dako, DK) for monoclonal antibody Ab-1 (clone AR441, dilution 1:200, Thermo Scientific). Microscopy assessments were performed using digital pathology (PathXL, http://www.pathxl.com). Following assessment of tumor invasiveness using hematoxylin and eosin stained slides, a semi-quantitative scoring of AR fractions (0, 1–10, 11–50, 51–75, and 76–100%) of positively stained nuclei, irrespective of nuclear staining intensity, was performed by one observer (KE), Fig. 2. If the cores from one tumor were discordant, a final score was evaluated across both cores. The AR variable was dichotomized using a cut-off of 10% as pre-specified in a statistical plan aimed at elaborating on our previous findings [8], and to avoid an exploratory study design approach. This study adheres to the REMARK criteria [16].

Fig. 2
figure 2

af Histology of invasive breast cancer, hematoxylin eosin stain (left) and immunohistochemistry of AR expression (right): a 0 % positively stained nuclei, b 1-10%, c 11-50%, d 51-75%, e >75 %, f overview, >75%

Statistical analyses

The distribution of patient and tumor characteristics at diagnosis in relation to AR expression was categorized and presented as percentages. Continuous variables are presented as the median and interquartile range (IQR). Distributional differences were assessed by X 2-analyses, logistic regression, or Mann–Whitney U tests as appropriate.

The association between AR and prognosis was examined with BCM as the primary endpoint. BCM was defined as the incidence of breast cancer-related death (when breast cancer was the cause of death or the contributing cause of death). The secondary endpoint was all-cause mortality, i.e., death from any cause. Follow-up was calculated from the date of breast cancer diagnosis to the date of breast cancer-related death, date of death from another cause, date of emigration or the end of follow-up as of December 31, 2014.

Firstly, AR expression in relation to BCM was assessed by comparison of cumulative BCM (CBCM) for patients with AR+ and AR− tumors in the overall population and stratified by ER status. Rather than analyzing cause-specific survival, this method [17] takes competing risks into account, which was considered relevant due to the relatively long follow-up (median 10 years) and high median age at diagnosis of the patients (64.9 years). AR in relation to all-cause mortality was assessed by analyzing cumulative all-cause mortality (CM), both overall and stratified by ER status. Differences in CBCM and CM between subgroups were evaluated by the LogRank test. AR in relation to BCM was further investigated by cause-specific Cox proportional hazards analysis with follow-up censored when a death from another cause occurred. Effects are presented as the hazard ratios (HR) and 95% confidence intervals (CI) for the overall population, stratified by ER status, and for combinations of AR and ER status using the AR+ER+ subgroup as reference.

Adjustments were made in two multivariable models including the following covariates. Model 1 included age at diagnosis (continuous), tumor size >20 mm (yes/no), ALNI (≥1 yes/no), histological grade (III vs. I–II), and ER status (±). Model 2 included the covariates from Model 1 with the addition of planned adjuvant treatments, chemotherapy (yes/no), radiotherapy (yes/no), and endocrine treatment (yes/no). An interaction term between AR and ER was added to the Cox models to evaluate the strength of evidence against the null hypothesis of no interaction between these factors on outcome. Proportional hazards assumptions were evaluated visually by inspecting the log minus log survival curves and formally using Schoenfeld’s test. There was weak evidence for a non-proportional hazards effect on AR ± for the overall follow-up (P = 0.06). The follow-up time was subsequently divided into categories of 0–5 years and >5 years, where the proportional hazards assumption was better met, and survival analyses by AR status were repeated for 0–5 years and >5 years of follow-up.

All tests were two-sided, and P values should be interpreted as the level of evidence against each null hypothesis. Nominal P values without adjustments for multiple testing are presented. The user-contributed program stcompet.ado for the statistics package Stata version 14.1 (StataCorp LP, College Station, TX, USA) was used to estimate cause-specific cumulative mortality. Stata was also used to draw the cumulative mortality graphs and to test proportional hazards assumptions. All other statistical analyses were performed in SPSS 22.0 (IBM).

Results

AR expression, patient, and tumor characteristics

Tumor AR expression was assessable in 671 of 718 cases (93%) where tumor tissue was available (Fig. 1). The distribution of AR expression included 573 AR+ tumors (85%) and 98 AR− tumors (15%). AR+ status was associated with smaller tumor size, lower histological grade, ER/PR co-expression, and low proliferation index (Ki67 ≤ 10%). Within the ER−PR-negative (PR–) subset, 34% (15/44) of HER2– tumors (triple-negative breast cancer, TNBC) were AR+, and among HER2-positive tumors, 86% (12/14) were AR+. More patients with AR− tumors had received adjuvant chemotherapy and died from breast cancer-related causes compared to patients with AR+ tumors (Table 1).

Table 1 Distribution of patient and tumor characteristics

Cumulative breast cancer mortality and all-cause mortality by AR expression

During follow-up, 178 of the patients included in survival analyses died, of whom 93 died from breast cancer-related causes (Table 1). The median follow-up period for patients who were still alive by the end of 2014 was 10 years. The incidence of breast cancer-related death was graphically illustrated by CBCM plots (Fig. 3). CBCM was lower among patients with AR+ tumors compared to patients with AR− tumors (LogRank P = 0.002). When stratified by ER status, the CBCM observed among all patients with AR+ tumors was attributable to patients with AR+ER+ rather than AR+ER− tumors. However, the evidence for an association between AR+ER+ and lower CBCM was weak (LogRank P = 0.20) compared to AR−ER+. During the first 5 years of follow-up, no difference in CBCM by AR+ was observed for patients with ER− tumors. AR+ was weakly associated with higher all-cause mortality (LogRank P = 0.08), but this association was not seen in CM analyses stratified by ER status (Fig. 4). Cox regression analyses were performed to further investigate CBCM in relation to AR status, as well as to ER status and time from diagnosis to breast cancer-related death. No significant prognostic differences depending on diagnostic period before and after the trastuzumab introduction (≤year 2005 vs. 2006+) was revealed.

Fig. 3
figure 3

Cumulative breast cancer mortality according to AR status, among all patients and stratified by ER status

Fig. 4
figure 4

Cumulative all-cause mortality according to AR status, among all patients and stratified by ER status

Breast cancer mortality by AR and ER expression for overall follow-up

For the overall follow-up period, the incidence of BCM for patients with AR+ tumors was half that of patients with AR− tumors in univariable Cox analyses (HR 0.48: 95% CI 0.30–0.77; P = 0.002, Table 2). However, after adjustments for potential confounders, no risk reduction by AR+ remained (Table 2, models 1 and 2). When associations between AR and incidence of breast cancer-related death were assessed in the ER+ and ER− subgroups separately, no prognostic impact by AR was seen in either group. Furthermore, no interaction was observed between AR and ER (P interaction ≥ 0.58) regarding BCM.

Table 2 Breast cancer mortality according to AR status, stratified by ER status and according to combinations of AR and ER status. Crude and adjusted models are presented for the overall follow-up period, for 0–5 years and >5 years of follow-up

In the analyses of AR/ER combinations, patients with AR−ER− as well as AR+ER− tumors had increased risk of CBCM compared to patients with AR+ER+ tumors. After adjustments for age and tumor characteristics (Table 2, model 1), there was slight evidence that patients with AR+ER− tumors had higher incidence of breast cancer-related death compared to patients with AR+ER+ tumors. However, this association did not remain after adjusting for adjuvant treatments.

Breast cancer mortality by AR and ER in short-term versus long-term follow-up

Separate analyses were performed for the initial 5-year period and from 5 years onwards (Table 2). This was done to better meet the assumption of proportional hazards and to differentiate between potential early and late impacts of tumor AR on BCM.

The lower BCM in the AR+ group observed in the overall period seemed driven by a lower BCM relative to AR− during the first 5 years (HR 0.37: 95% CI 0.19–0.71; P = 0.003, 43 events). The corresponding figures for the interval from five years and onwards pointed in a similar direction, although non-significant (HR 0.61: 95% CI 0.31–1.20; P = 0.15, 50 events). In the adjusted models for ER+ versus ER− strata during the short-term follow-up, the estimated hazard ratios for BCM by AR diverged to a higher degree compared to the corresponding estimate in the overall follow-up period. However, no interactions between AR and ER status were observed (P interaction ≥ 0.35). After adjustments for age and tumor characteristics (model 1, 0–5 years), there was moderate evidence that patients with AR+ER− tumors had higher BCM during the short-term follow-up compared to patients with AR+ER+ tumors. In this model, BCM for patients with AR+ER− tumors was more than threefold that of patients with AR+ER+ tumors. Furthermore, patients with AR+ER− tumors had almost double the BCM compared with patients with AR−ER− tumors. When further adjustments for adjuvant treatment were incorporated into the model (model 2), the patients with AR+ER− tumors still had worse prognosis compared to all other AR/ER combinations.

Discussion

In this prospective cohort with long-term follow-up, breast cancer patients with AR+ tumors had lower BCM but not all-cause mortality compared to patients with AR− tumors. AR+ was not an independent prognostic factor, and the hypothesis of an interaction between AR and ER was not confirmed. However, in line with our hypothesis, patients with AR+ER− tumors had the worst prognosis of all AR/ER combinations, as demonstrated in multivariable analyses with short-term follow-up.

The current interest in AR in breast cancer is multilayered. AR is a promising primary target for treatment in ER− breast cancer and especially in TNBC, where treatment options are scarce. Furthermore, specific molecular subgroups with androgen-regulated genetic signatures have emerged [18,19,20]. For example, the Luminal AR (LAR) subtype has been associated with a clearly different clinical course from that of the other TNBC subtypes [21,22,23].

In ER+ disease, AR antagonizes the proliferative effect of ER signaling [24]. AR may improve prediction of endocrine response [25, 26] and also offer an alternative endocrine treatment target when anti-estrogen regimens fail. Studies focused on integrating AR with other transcription factors such as ER are warranted to better understand the associations between AR and survival across breast cancer subtypes [2]. One possible next step would be to address AR as a ratio of ER, as done previously [22]. This may enhance the understanding of AR actions, due to the suggested competitive interactions between these two receptors [27].

We conclude that AR adds information for BCM compared to that of ER alone, but in the present study only to a limited extent. The findings of AR status on BCM did not remain after adjustments for confounders, suggesting that AR did not drive the association. Instead, ER was interpreted as the primary driver since AR and ER were highly co-expressed, and the association with BCM remained for patients with ER+ tumors in short-term adjusted analyses (data not shown). This interpretation was supported by the unadjusted analysis of combined AR/ER. However, in line with our hypothesis, AR added short-term prognostic information after adjustment for confounders. The poorest prognosis was seen among patients with AR+ER− tumors. This was also true after adjustment for treatment, but the adjusted HRs were closer to 1.0, and the evidence for higher BCM compared to the other subgroups was weaker after adjustment. If AR+ER− breast cancer is confirmed to have inferior prognosis, this may impact the choice of AR-targeted treatments and reveal a need for closer surveillance of this patient group.

There are several suggested reasons as to why we could not fully confirm our hypothesis. Our previous results on AR expression considered DFS in a cohort of median follow-up of 5 years. The time from diagnosis to breast cancer-related death may not be comparable with the event of local or regional recurrence, contralateral cancer, or distant metastasis, as evaluated previously [8]. There may also be biological differences in the AR effect in relation to these different endpoints. For the follow-up beyond 5 years, there was a selection of healthier patients with more favorable biology. Furthermore, the patients in the MDCS were diagnosed earlier (from 1991 to 2010) and during a 19-year period, with indications for treatment regimens such as tamoxifen treatment changing over time, where tamoxifen was initially prescribed according to menopausal status rather than ER status [28]. Our initial results [8] were based on the BC Blood Study, which was initiated in 2002 and included more aromatase inhibitor-treated patients.

AR+ was confirmed to be associated with favorable tumor characteristics, as consistently shown previously [8, 29, 30]. In the present study, the proportion of patients with AR+ tumors (85%) was equal to that of the independent cohort BC Blood Study (AR+ 85%), in which the same antibody was used [8]. This provides a measure of external validity to the AR assessments, since similar exclusion criteria were adopted. Both studies involved population-based Swedish cohorts, and similar distributions could thus be expected.

The strengths in this study were the long-term follow-up and a study population from a population-based, well-characterized cohort [13, 31]. Since the median age at diagnosis was greater than 60 years, the findings are mostly applicable to postmenopausal breast cancer. The completeness of the Swedish Cancer Registry is considered to be high with low underreporting of breast cancer [32]. The Swedish Cause of Death Registry has been reported to have complete and valid data from an international perspective, with the highest accuracy for cancer diagnoses [33].

A lack of standardization of AR antibodies and cut-offs used has been common and may explain some discrepancies across studies [4, 11]. In the present study, a validated antibody and a common cut-off were used [4]. The amount of missing AR values among patients for which tumor tissue was available was low and was similar in distribution to the overall population and was not considered a to exert a risk for selection bias. However, patients with missing tumor tissue were older and had smaller tumors, with a lower proliferation index and grade. They also more often had node-negative disease, suggesting under-representation of tumors with favorable characteristics in the material, which constitutes a selection bias. Thus, the true proportion of AR+ may be slightly higher than the value presented in this paper, and the incidence of BCM may be slightly lower in the underlying population from which the study population was sampled.

The tumor marker data were derived from TMAs for patients diagnosed up until 2007. Thereafter, routine clinical pathology reports were used. For the PR variable, this introduced a systematic bias due to higher positive frequency in clinical data as compared to TMA data. Thus, PR was only addressed in the descriptive analyses and should be interpreted with caution. In contrast, ER status did not depend on the diagnostic period. The missing ER was addressed by presenting the associated HRs separately.

This study indicates that cohorts with larger sample sizes are needed to further elucidate the prognostic role of AR in ER− breast cancer. Indeed, there is a need to incorporate several hormonal receptors in future studies of prognosis and treatment prediction [34]. AR may be valuable for short-term prognosis, whereas additional specific markers may be needed for long-term prognosis for ER+ breast cancer, which tends to relapse late [35].

The number of ongoing clinical studies involving AR targeting is steadily expanding and has included patients with both early and advanced breast cancer and of different subtypes [36,37,38,39] (ClinicalTrials.govidentifier: NCT01842321, NCT01889238, NCT02689427, NCT02676986, NCT02457910, NCT02463032, NCT02368691). Identification of subtypes based on gene expression analyses have provided a more comprehensive picture of hormonal regulation, such as the LAR subtype which is heavily enriched in hormonally regulated pathways in spite of being ER− [21].

There is yet no consensus on whether AR antagonists or agonists are preferable as targeted treatments in any breast cancer subtype. The forthcoming reports of ongoing clinical trials—combined with observational studies and mechanistic studies on treatment response/resistance [40,41,42,43,44]—will impact the future direction. Most likely, a wider clinical implementation of AR assessments will be of highest value for patients with TNBC [23] and for patients in the metastatic setting that have suffered from treatment resistance, and may benefit from the addition of AR-targeted treatments such as enzalutamide [22, 44,45,46].

In conclusion, AR+ was associated with lower BCM in the overall study population and for the overall and short-term follow-up intervals, but not for long-term follow-up. The poorest prognosis was seen among patients with AR+ER− tumors after adjustment for confounders in the short-term follow-up. Future studies on the role of AR in breast cancer require larger cohorts, especially in the ER− subset, and the inclusion of gene expression analyses may add valuable information.