Introduction

In recent years, breast cancer has surpassed lung cancer in mortality and has become the most frequently occurring malignancy, accounting for 24.5% of all malignancies, among women globally [1]. In 2020, it accounted for more than 680 000 reported deaths globally, more than half of them in low- or middle-income countries (LMICs) [1]. In high-income countries, breast cancer survival is reported to be more than 90%, but in sub-Saharan African countries, survival has been reported to be about 50% (48%–53%) [2]. In a recently published urban-based, South African study, the 4-year breast cancer overall and disease-free survival proportions were reported to be 53.5% and 55.8%, respectively [3].

HIV infection has been associated with increased mortality among patients diagnosed with cancer [4,5,6,7]. Phakathi et al. reported that the overall and disease-free survival for breast cancer was worse among HIV-positive patients [3]. Moreover, among the HIV-positive patients, survival was better among those who were on anti-retroviral therapy (ART) at the time of breast cancer diagnosis [3]. Whether HIV infection directly affects breast cancer progression is not yet known. However, younger age, more advanced disease at breast cancer diagnosis, and greater difficulty in completing systemic therapy observed among HIV-positive patients, may contribute to their poorer outcomes [4,5,6,7,8].

Treatment decision making for breast cancer patients depends on their estimated risk for disease recurrence. Traditionally, such estimates have been based on clinical-pathological factors. But in the past few decades, several predictive tools or systems have been developed to improve the accuracy and usefulness of such estimates [9]. For example, immunohistochemistry looks at specific proteins expressed on tumour cells; in-situ hybridisation assesses gene amplification; and reverse transcription-polymerase chain reactions examine gene transcription [10, 11]. Immunohistochemistry and the PAM50 gene expression assay identify intrinsic subtypes of breast cancer.

The prediction analysis microarray (PAM50) gene expression assay measures mRNA expression of 50 cancer-related genes; the assay classifies the tumour by breast cancer intrinsic subtype and generates its risk of recurrence (ROR) score [12]. The ability of PAM50 scoring to prognosticate and predict recurrence and metastasis exceeds that of scoring based on the traditional clinico-pathological characteristics of breast cancer [12]. Moreover, the ROR score has been reported to add more prognostic information than the clinical treatment score, recurrence score (Oncotype Dx) and IHC-4 in both node-negative and node-positive, HER-2 negative early breast cancer [9, 12]. It has also achieved analytical validation and level 1 clinical validation and has shown clinical utility and effectiveness in predicting the risk of recurrence in post-menopausal women [12,13,14,15]. In a study in Canada, among patients in the high-risk group, PAM50 was able to distinguish those who would respond well to chemotherapy from those who would not by intrinsic subtype [16]. Similarly, in a Norwegian study, the PAM50 assay identified low-risk patients who could be followed safely by observation and would not derive an additional survival benefit from adjuvant hormonal therapy. This group of patients had a breast-cancer specific survival of 96.3% after 15 years of follow-up [12]. The PAM50 assay also identified some patients in the intermediate risk group who could derive the same survival benefit from adjuvant hormonal therapy as the low-risk group [12]. In a US sample, the PAM50 assay predicted the effectiveness of adjuvant chemotherapy as well as that of neoadjuvant chemotherapy; with an estimated negative predictive value for a complete pathological response of 97% [17]. Moreover, the PAM50 ROR was able to predict which patients with early stage breast cancer,, ER positive/ HER-2 negative, node-positive breast cancer could be safely treated with adjuvant hormonal therapy only as well as those who could benefit from chemotherapy [18]. The assay was also found to be cost-effective when compared to current clinical practice and other molecular assays [19].

However, as far as we can determine, our study is the first to report on breast cancer survival by PAM50 intrinsic subtype & ROR in HIV-negative and HIV -positive patients of South Africa. We hypothesised that HIV-positive women with breast cancer would have a more aggressive tumour phenotype than HIV-negative patients, and therefore poorer survival.

Methodology

Among participants in the South African Breast Cancer and HIV Outcomes (SABCHO) cohort study, we selected formalin-fixed, paraffin-embedded (FFPE) specimens of histologically confirmed invasive carcinoma from age-matched, HIV-positive and -negative patients. We obtained mastectomy/wide local excision specimens from patients who had primary surgery, and core biopsy specimens from patients who had primary chemotherapy. We retrieved the FFPE breast tissue blocks from the archives of the National Health Laboratory Service (NHLS). The pathologist examined a hematoxylin and eosin (H&E) stained slide, marked the area of invasive breast cancer suitable for the test, and sent the slides to the Molecular Laboratory, at the University of Witwatersrand, where the molecular work was undertaken.

We extracted and purified the RNA successfully using the AllPrep® DNA/RNA FFPE kit, Qiagen (Hilden, Germany). We measured the extracted RNA on the Nanostring nCounter Analysis System (Nanostring Technologies, Seattle, WA) and processed the samples using the NanoString nCounter Prep Station and digital analyser. PAM50 analysis was done on 384 samples, 6 (1.65%) failed the QC for the PAM50 assay and one sample was excluded because of a lack of clinical data, thus the total study population included in this analysis was 377. Of these 377, one patient had an unknown HIV status and four were excluded from the survival analyses as they lacked a follow-up period. We used NanoString RUO PAM50 algorithm to determine the molecular subtype and the ROR of each sample. We obtained the data on demographic characteristics, clinical stage at presentation, PAM50 intrinsic subtypes (luminal A, luminal B, HER2-enriched, and basal-like), ROR, HIV status, CD4 count, viral load, and ART use from the electronic breast cancer database. We categorised each patient based on her ROR score as [12]:

  • Low risk: ROR ≤ 40

  • Intermediate risk: ROR 41 – 60, pN0

  • High risk: ROR 41 – 60, pN1 or ROR > 60

We defined overall survival as the interval from the date of breast cancer diagnosis to the date of death from any cause, and disease-free survival as the interval from the date of breast cancer diagnosis to the date of radiologically & histologically confirmed disease recurrence or death from any cause [3]. Patients with metastatic breast cancer at the time of diagnosis were not included from the analysis of the disease-free survival [3]. The date of death was documented as indicated in the medical records or provided by the family member, for patients who have died. Associations between the clinical and demographic characteristics in HIV-positive and HIV-negative participants were evaluated using a chi-squared test. Kaplan–Meier survival curves were generated and p-values were calculated using a log-rank test of equality. Both adjusted and unadjusted (crude) HRs as well as their 95% confidence intervals were estimated using Cox regression models. STATA v14.2 and the stset suite of commands were used for the data analysis, with p < 0.05 considered to be statistically significant [3]. The study’s ethics approval was obtained from the Human Research Ethics Committee (Medical) at the University of Witwatersrand (clearance numbers: M161130 and M150351).

Results

A total of 377 patients (176 HIV-positive and 200 HIV-negative patients; 1 HIV status unknown) were included in the final analysis. (Table 1). The median age of the cohort was 48 years, and the HIV-positive patients were younger than the HIV-negative patients. A total of 213 (56.5%) patients had advanced disease at the time of diagnosis and 81.7% of patients had a high risk for disease recurrence. Luminal B was the most common intrinsic subtype in overall and among the HIV-negative patients.

Table 1 Demographic and clinical characteristics and PAM 50 intrinsic subtypes by HIV status

Overall survival

The study participants’ 5-year overall survival was 48.0%. Patients with the luminal A and luminal B intrinsic subtypes had better survival than those who had HER2-enriched or basal-like intrinsic subtypes (Fig. 1 and Table 2). As expected, the patients with the highest ROR scores had poorer survival than those with intermediate or low ROR scores (Table 2). Regardless of the intrinsic subtype, HIV-positive patients had poorer 5-year survival than HIV-negative patients (34.1% vs 59.6%, p < 0.001). Only among patients with luminal subtypes was HIV status associated with survival (Fig. 2 & 3).

Fig. 1
figure 1

Overall survival by molecular subtype for 373 black South African patients subtyped using the PAM50 assay. The figure shows the number at risk by subtype for each time point. Unadjusted hazard ratios (95% confidence intervals (CIs)) compared to Luminal A: Luminal B 1.24 (0.78 – 1.98), p = 0.365; HER2-enriched 2.38 (1.49 – 3.80), p < 0.001; Basal-like 2.07 (1.29 – 3.33), p = 0.003

Table 2 Unadjusted and adjusted breast cancer mortality hazard ratios
Fig. 2
figure 2

Overall survival by molecular subtype in (A) 175 HIV-positive patients; Unadjusted hazard ratios (95% CIs) compared to Luminal A: Luminal B 1.18 (0.68 – 2.07), p = 0.555; HER2-enriched 1.45 (0.81 – 2.58), p = 0.207; Basal-like 1.23 (0.73 – 2.17), p = 0.462. B 197 HIV-negative patients; Unadjusted hazard ratios (95% CIs) compared to Luminal A: Luminal B 2.39 (0.91 – 6.26), p = 0.077; HER2-enriched 6.09 (2.34 – 15.90), p < 0.001; Basal-like 5.47 (2.07 – 14.48), p = 0.001

Fig. 3
figure 3

Overall survival by HIV status for (A) 195 patients with luminal breast cancer, unadjusted hazard ratio (95% CI) compared to HIV-negative: HIV-positive 3.63 (2.28—5.78), p < 0.001; and (B) 177 patients with non-luminal breast cancer, unadjusted hazard ratio (95% CI) compared to HIV-negative: HIV-positive 1.29 (0.87—1.90), p = 0.202

Patients aged 50 years or more had better survival than younger patients in an unadjusted model (p = 0.024) but not in a model adjusted for stage and HIV status (p = 0.829) (Table 2). Among HIV-positive patients, the duration of HIV infection and ART use had no association with the overall survival.

Disease-free survival

Overall 5-year disease-free survival (DFS) was 45.4%, but it was far worse among HIV-positive than HIV-negative patients (23.9% vs 59.6%, p < 0.001). The HER-2 and basal-like intrinsic subtypes were associated with poorer DFS than Luminal A and luminal B intrinsic subtypes among HIV-negative patients but there was no statistical difference in DFS by molecular subtype in HIV-positive patients (Fig. 4). Patients with high ROR scores had very poor DFS (Table 3). Of 248 patients, 67 (27%) had breast cancer recurrence and their overall median (IQR) time to recurrence was 2 (1 – 3) years. The commonest site of disease recurrence was the lung, accounting for 45% of all sites. HER2-enriched and Basal-like intrinsic subtypes spread predominantly to the lungs while the Luminal B intrinsic subtype spread mainly to the liver. HIV infection had no impact on the site of distant metastasis.

Fig. 4
figure 4

Disease-free survival by molecular subtype in (A) 151 HIV-positive patients; Unadjusted hazard ratios (95% CI) compared to Luminal A: Luminal B 1.18 (0.64 – 2.19), p = 0.601; HER2-enriched 1.06 (0.55 – 2.05), p = 0.864; Basal like 1.40 (0.77 – 2.55), p = 0.269. (B) 179 HIV-negative patients; Unadjusted hazard ratios (95% CI) compared to Luminal A: Luminal B 2.53 (0.86 – 7.42), p = 0.091; HER2-enriched 5.86 (2.00 – 17.12), p = 0.001; Basal like 6.46 (2.19 – 19.08), p = 0.001

Table 3 Unadjusted and adjusted hazard ratios for disease recurrence

Discussion

Our findings of 48% 5-year overall survival and 45.4% disease-free survival are similar to those of other studies showing 50% 3-year survival in LMICs and 90% 5-year survival in HICs [2, 20]. Several features of our cohort may explain its poor survival. Overall in the SABCHO cohort, the median age at breast cancer diagnosis was 54 years, and HIV-positive patients were younger than HIV-negative patients (44 vs 57 years, < 0.001) [3, 7]. Moreover, the overall median age was younger than that in a cohort in the United States [21]. In the current sub-study, HIV- positive and HIV-negative patients were age matched, hence, our cohort’s median age was 48 years. In Norway, women younger than 50 years had a twofold risk for mortality compared to women 50- 59 years of age [22]. However, after adjusting for stage and HIV status, our study participants younger and older than 50 years did not differ in survival.

Breast cancer diagnosis at advanced stage is another known predictor of poor survival [6]. In our cohort, advanced disease at presentation was associated with poor overall and disease-free survival, and 56.6% of our patients had an advanced (Stage III/ IV) disease on presentation; similar to patients in a study in Tanzania (53.2%) [23], while in a study in Rwanda, more than 75% of patients presented with advanced disease [26]. In contrast, in the United States, less than 20% of patients had advanced breast cancer at the time of diagnosis [24]. Several patient-related and healthcare facility-related factors contribute to delayed presentation and late stage at diagnosis of breast cancer [25,26,27,28].

Although the direct effect of HIV infection on breast cancer progression is not yet fully understood, HIV infection among breast cancer patients has been associated with poor survival [3, 5, 6], except among patients with metastatic breast cancer [29]. In our cohort of patients diagnosed in stages I-III, HIV-positive status was associated with poor breast cancer survival.

Another known prognostic factor is the intrinsic breast cancer subtype. Triple-negative and HER2-enriched intrinsic tumours are more aggressive than luminal A and luminal B tumours and are associated with reported 2.5 and threefold risks of mortality [3, 7, 21, 30, 31]. About 20% of breast cancers are HER2-positive; the subtype associated with poorer clinico-pathological outcome features: younger age, larger size, lymph node involvement, increased nuclear grade, and negative hormone receptors [32, 33]. Moreover, it is associated with an increased risk for loco-regional and distant site recurrence, including a > 50% risk of developing central nervous system metastases [34]. About 15% of breast cancers are triple-negative, and that subtype is associated with young age at diagnosis, African descent, and BRCA 1 gene mutations [33]. It is also associated with poor disease-free and overall survival and with metastasis to the lungs and central nervous system [33,34,35,36]. In this cohort, the PAM50-based HER2-enriched and basal-like (triple negative) subtypes were associated with poor survival, even after adjusting for age, stage, and HIV status. Moreover, they accounted for 23.9% and 24.7%, respectively, of all the intrinsic subtypes in our cohort, higher prevalence than previously described (i.e., 20% and 15%, respectively) [32, 33, 37, 38]. In our cohort, HIV infection was not associated with PAM50 intrinsic subtype and these findings were also reported by several other studies, but a Mozambique-based study found that a higher proportion of HIV-positive than HIV-negative patients had triple-negative breast cancers [31, 39, 40]. Regarding survival, in our cohort HIV-negative patients with luminal breast cancer subtypes had significantly better survival than patients with non-luminal breast cancer subtypes. Interestingly, HIV-positive patients, did not differ in overall survival by molecular subtype. It is known that luminal breast cancer subtypes are less aggressive and are associated with more favourable outcomes than non-luminal breast cancer subtypes [3, 30, 33]. How HIV infection adversely affected the survival of patients with luminal breast cancer subtypes in our cohort still needs to be determined. Ayeni et al. recently reported an increased rate of non-compliance to prescribed tamoxifen treatment among HIV-positive patients with luminal breast cancer subtypes [41]. Tamoxifen is a selective estrogen receptor modulator proven to improve the survival of patients with luminal breast cancer subtypes [42].

In our study, the most common site for distant metastasis among patients with HER2-enriched or basal-like tumours was the lung, similar to what has been reported by other studies [33,34,35,36]. The liver and lung were the commonest sites of metastases for luminal B intrinsic subtypes, respectively, while the Luminal A subtypes were evenly spread between sites. HIV status had no impact on the site of distant metastases.

The ROR score is based on the measurement of the 50 genes included in the PAM50 assay, and the size of the tumour itself [9]. Low, intermediate, and high-risk groups by ROR have an estimated 10-year distant recurrence-free survival of: 96.7%, 91.3% and 79.9% respectively [9, 14]. In this study, 82.1% of the patients were in the high-risk category by ROR score and had relatively poor overall and disease-free survival. However, HIV status did not affect the ROR score.

The strength of this study is its duration of follow-up, which yielded 5-year overall and disease-free survival, matching the international standard, unlike most studies of breast cancer survival in LMICs, which have typically reported survival up to 4 years. Moreover, our study is the first, to our knowledge, to use the gene expression assay PAM50 to determine intrinsic subtypes and ROR among HIV-negative and HIV-positive patients. The limitations include small sample size and not being able to determine the breast cancer-specific mortality rate.

Conclusion

In our cohort, HIV-negative status was associated with Luminal B intrinsic subtype. We also found that HIV infection, PAM50 HER2-enriched and basal-like intrinsic subtypes, and high ROR score were associated with poor overall and disease-free survival. Moreover, HIV-positive patients did not differ in the overall survival by molecular subtype, but HIV-negative patients with luminal breast cancer subtypes had better survival than those with other subtypes, (p < 0.001).