Introduction

Cathepsin D is an aspartic endoproteinase that is localized in the lysosomes and extracellular matrix and is involved in the malignant progression of breast cancer [1]. It catalyses proteins into several polypeptide fragments that digest other lysosomal endopeptidases and exopeptidases [2]. Cathepsin D is synthesized on the rough endoplasmic reticulum as preproprotein (412 amino acids, 52 kDa) that is subjected to a series of proteolytic cleavages during biosynthesis to produce the mature enzyme [3].

Importance of Cathepsin D in breast cancer was first emerged by the studies of proteins whose expression is regulated by oestrogen in breast cancer [4]. Cathepsin D has long been recognized as an oestrogen-regulated protein [5]. Oestrogen regulates Cathepsin D through binding to oestrogen response elements (EREs) and modulating the transcription of Cathepsin D [6].

Cathepsin D induces cell proliferation, metastasis, tumour invasion, angiogenesis and apoptosis in cancer and stromal cells [7,8,9,10]. As a consequence, several studies have been undertaken to evaluate its clinical significance in breast cancer [11]. However, Cathepsin D has not been recommended for clinical use by American Society of Clinical Oncology (ASCO) because of insufficient evidence [12].

This study aimed to address the gap in the literature on the prognostic value of Cathepsin D expression in breast cancer patients. We analysed a large and well-characterized cohort of breast cancer cases with complete data on classical prognostic factors and relevant molecular markers, such as oestrogen receptor (ER), progesterone receptor (PgR) and HER2 as well as Cathepsin D’s role in response to tamoxifen therapy. The results described in this paper detailed findings in the whole series as well as selected patient subgroups according to their ER, HER2 and Cathepsin D status.

Materials and methods

Patients and tissue specimens

Tissue microarrays (TMAs) were constructed from 954 cases of primary operable invasive breast carcinoma obtained from Nottingham Tenovus Primary Breast Carcinoma Series. This series is a well-characterized case of primary breast carcinoma of long-term follow-up with clinical and pathological data including tumour type, tumour size, histological grade, vascular invasion and Nottingham Prognostic Index (NPI). Information on locoregional recurrence, distance metastasis, nodal status, survival and therapy was prospectively collected [13, 14]. Data on a wide range of breast cancer biomarkers were also available including ER-PgR and HER2 [15].

Breast cancer-specific survival (BCSS) was defined as the time interval (in months) from the date of the primary surgery to the time of death from breast cancer, and disease-free interval (DFI) was defined as the length of time (in months) from the date of the primary surgical treatment to the first locoregional recurrence or distant metastasis. Patients were divided according to their NPI score into three prognostic groups: good (≤ 3.4), moderate (3.41–5.4) and poor (> 5.4) [16]. The median follow-up time was 147 months (range 1–243). Clinicopathological and patient characteristics are listed in Table 1.

Table 1 Patient characteristics

The patients were managed in a uniform way according to their hormonal status and NPI score; those within the good prognostic group received no adjuvant therapy, those with an NPI score > 3.4 received Tamoxifen if ER positive (± Zoladex if pre-menopausal) or classical cyclophosphamide, methotrexate and 5-fluorouracil if ER negative and suitable for chemotherapy [17].

Ethical approval for this study was granted by Nottingham Research Ethics committee 2 under the title of “Development of a molecular genetic classification of breast cancer”.

Immunohistochemical staining

Immunohistochemical staining was performed using Dako REAL™ EnVision™ Detection System (Invitrogen). A monoclonal antibody for Cathepsin D (Millipore Mouse monoclonal antibody #MAB422) was used for immunohistochemically evaluation. After deparaffinization in xylene and rehydration through graded alcohol, sections were immersed in 1:10 citrate buffer pH 6.0 and microwaved for 20 min in order to retrieve antigenicity. Once the retrieval is complete, slides were rinsed with tris buffered saline (TBS) pH 7.6 and endogenous peroxidase activity was inhibited by applying 100 µl peroxidase block (Dako Real peroxidise blocking solution, S2023) to the TMA sections for 5 min followed by TBS rinse. Sections were then treated with 100 µl Ultra V block (ThermoScientific TA-125-UB) to block non-specific staining by the primary antibody for 5 min at room temperature. The antibody was diluted to 1:100 optimal working dilution and incubated for 30 min at room temperature. After washing with TBS, all sections were then incubated with 100 μl of a secondary antibody (dextran coupled peroxidase molecules and goat anti-mouse/rabbit immunoglobulin; Dako REAL™ EnVision™/HRP, Rabbit/Mouse bottle A, K5007) for 30 min. Sections were washed in TBS and incubated with 100 μl of a freshly prepared solution of 3′3-diaminobenzidine (Dako Envision Kit, Bottle B and C, K5007) for 5 min and repeated once. After rinsing in TBS three times, sections were then counterstained with haematoxylin (Dako Real Automation Haematoxylin, S3301) for 6 min. After washing in tap water, the sections were dehydrated in ethanol, cleared in xylene and mounted with DPX (BDH, Poole, UK). Formalin-fixed paraffin-embedded normal liver tissue was used as a positive control. We employed normal tissue within the same tumour sample as the negative control.

Analysis of immunohistochemical staining

Immunoreactivity of Cathepsin D in the TMA cores was evaluated in a semi-quantitative way by assessing both percentages of cells stained and intensity of staining. Cytoplasmic staining of the TMA cores was measured using the modified Histo-score (H-Score) with a range from 0 to 300 scores. Only staining of the invasive malignant cells within the tissue cores was considered. TMAs were scored using high-resolution digital images (NanoZoomer; Hamamatsu Photonics, Welwyn Garden City, UK), at ×20 magnification, using a web-based interface (Distiller; Slidepath Ltd, Dublin, Ireland). All samples were scored by one observer (IA) and a proportion of these were scored twice on two separate assessments. Cores were only scored if tumour cells represented greater than or equal to 15% of the total core.

Statistical analysis

Statistical analysis was performed using SPSS v24 statistical software. Chi-square (x2) analyses were used to test correlations between antigen expression and clinicopathological parameters, steroid receptors and biomarkers. Possible correlation between antigen expression levels and breast cancer-specific survival (BCSS) and disease-free interval was examined using Kaplan–Meier curves and differences between the curves were analysed using the log-rank test. Cox regression models were used for multivariate analysis to test the effect of antigen expression and clinicopathological parameters on disease-free survival and breast cancer-specific survival (BCSS) as well as its statistical independence. A p-value of < 0.05 was considered statistically significant. Data were dichotomized into two groups according to frequency distributions and Kaplan–Meier curves of the effect on BCSS and cut-off was chosen using X-tile Bioinformatics software [18].

Results

Evaluation of immunohistochemical analysis

Immunohistochemical examination of breast carcinoma microarray sections revealed positive staining of Cathepsin D localized to the cytoplasm of invasive malignant cells. Most tumour cores were homogenously stained with Cathepsin D. A total of 80.4% of tumour cores showed different degrees of cytoplasmic staining to Cathepsin D while 19.6% did not show any cytoplasmic staining (Fig. 1). Negative Cathepsin D expression was defined as cytoplasmic H-score ≤ 30 and positive expression as cytoplasmic stained score > 30. Negative expression rate of Cathepsin D using the cut-off value was 275/954 (28.8%) while positive expression rate of Cathepsin D was 679/954 (71.2%).

Fig.1
figure 1

A Cathepsin D strong positive cytoplasmic staining, B Cathepsin D-negative/weak cytoplasmic staining

Correlation of Cathepsin D expression with clinicopathological parameters

Cathepsin D expression was significantly correlated with histological grade, pleomorphism, NPI, distant metastasis and recurrence (Table 2). Positive Cathepsin D expression was associated with high-grade tumours (p = 0.007), high pleomorphism score (p = 0.002), poor NPI score (p = 0.002), development of distant metastasis (p < 0.0001) and recurrence (p = 0.005). Cathepsin D expression was not associated with age at diagnosis, tumour stage, tumour size, nor vascular invasion. Positive expression of Cathepsin D was associated with invasive ductal, tubular mixed, atypical medullary and lobular mixed histological tumour types (p = 0.009) as shown in Table 3.

Table 2 Correlation of Cathepsin D expression with clinicopathological parameters
Table 3 Association of Cathepsin D expression in breast cancer with histological type

Association of Cathepsin D with hormone receptor and HER2 status

Cathepsin D was highly associated with ER and HER2 status but not PgR. Tumours positive for Cathepsin D were associated with ER-positive (p = 0.006) and HER2-negative (p = 0.006) status. Tumours positive for both ER and PgR were also associated with positive Cathepsin D expression (p = 0.037). Cathepsin D immunoreactivity showed no correlation with triple negative status (p = 0.228). Correlation between Cathepsin D and hormone receptor status is summarized in Table 4.

Table 4 The relationship between Cathepsin D expression and other biomarkers

Association of Cathepsin D expression with patient outcome

Kaplan–Meier survival analysis of breast cancer-specific survival (BCSS) showed significantly poor prognosis for Cathepsin D-expressing tumours compared with those negative for the biomarker (p = 0.001) (Fig. 2). In multivariate Cox-proportional hazards analysis, Cathepsin D expression was a significant prognosticator of BCSS (p = 0.021) independent of tumour grade, tumour size, lymph node stage and ER status (Table 5). However, expression of Cathepsin D was not significantly independent when HER2 was added to the regression model (p = 0.057).

Fig.2
figure 2

Kaplan–Meier analysis for Cathepsin D expression in correlation with patient outcome in the whole series, ER+ cancers and ER+ cancers on tamoxifen therapy for survival (A), local recurrence (B) and distant metastasis (C)

Table 5 Multivariate Cox regression analysis of factors associated with BCSS

Univariate analysis showed that positive expression of Cathepsin D was strongly associated with development of distant metastasis (p < 0.0001) and recurrence (p = 0.001); (Fig. 2). In multivariate Cox-proportional hazards analysis, Cathepsin D expression was a significant prognosticator of distant metastasis and recurrence (p = 0.004 and p = 0.021 ,respectively) independently of tumour grade, size, lymph node stage, HER2 and ER status (Table 6).

Table 6 Multivariate Cox regression analysis of factors associated with DFI

Prognostic significance of Cathepsin D expression as a function of the oestrogen receptor status and tamoxifen therapy

Patients were subdivided into ER-negative and ER-positive subsets and the clinical outcome in terms of BCSS and DFI was analysed according to Cathepsin expression using univariate Kaplan–Meier survival analysis (Fig. 2). In ER-positive tumours, patients with positive Cathepsin D expression showed significantly shorter BCSS than those negative for the marker (p = 0.01). In ER-positive tumours, patients with positive Cathepsin D expression showed significantly shorter disease-free interval (p = 0.005 for recurrence and p < 0.0001 for metastasis) than those negative for the marker. In ER-negative tumours, Cathepsin D expression was not associated with BCSS and DFI.

Patients with cancers expressing Cathepsin D exhibited significantly lower BCSS if they had received tamoxifen (p = 0.001). On the other hand, patients with Cathepsin D-negative tumours, who received tamoxifen therapy, did not have statistically different survival rates as patients who had not received the drug (Fig. 3).

Fig.3
figure 3

Association of Cathepsin D expression with tamoxifen treatment. A Cathepsin D-positive, ER-positive tumours. B Cathepsin D-negative, ER-positive tumours. Cathepsin D predicts the development of BCSS and distant metastasis in patients on tamoxifen

Clinical outcome of patients with Cathepsin D expressing tumours in relation to HER2

BCSS of patients with cancers expressing Cathepsin D was analysed according to HER2 status. Patients with HER2-positive tumours expressing Cathepsin D had significantly lower survival rates than those with negative HER2 (p < 0.0001), Fig. 4. In Cathepsin D-negative tumours, HER2 status did not affect BCSS.

Fig.4
figure 4

BCSS as a function of Cathepsin D status in subgroups of HER2-negative and HER2-positive patients

Discussion

In breast cancer, a large number of studies have investigated Cathepsin D expression using either immunohistochemistry or measurement of cytosolic Cathepsin D content (by radiometric immunoassay (IRMA), ELISA or Western blot). Results of immunohistochemical measurement of Cathepsin D expression were variable among different studies, no standard protocols, no control of confounding caused by treatment and inconsistent correlations with prognosis [12]. For this reason, there is a great need to re-evaluate Cathepsin D significance in breast cancer addressing these issues in particular. Göhring et al. concluded that immunohistochemistry (IHC) is the preferred method for measuring Cathepsin D compared to immunoradiometric assays (IRMA) due to its simpler methodology and minimal tissue requirements [19]. In addition, immunohistochemistry provided more predictive data with respect to prognosis. Despite the substantial research on Cathepsin D expression and its prognostic significance in breast cancer, conflicting results have emerged, particularly from studies using immunohistochemistry (IHC) with small patient cohorts. Our study aimed to address this inconsistency by leveraging a large and well-characterized cohort of primary operable invasive breast carcinomas. This well-defined population, with long-term median follow-up and uniform treatment, has previously proven valuable for studying various biomarkers using tissue microarrays.

Our study found that 71.2% of breast cancer tissues displayed positive Cathepsin D expression using the cut-off of > 30% cytoplasmic staining. These results are consistent with findings of other research on Cathepsin D expression in breast cancer [20]. Two studies showed that 68% (17/25) and 58.71% (91/155), respectively, of malignant breast tissues tested had positively stained cells for Cathepsin D [21, 22]. Several approaches, such as immunohistochemistry and cytosolic immunoassay, have demonstrated that in most breast cancers, Cathepsin D is overexpressed 2- to 50-fold compared to its concentration in normal mammary gland cells [23].

A key finding of our study is that Cathepsin D expression significantly correlates with several adverse clinicopathological parameters. Statistical analysis demonstrated a highly significant association of Cathepsin D with poor prognostic variables including high tumour grade (p < 0.007) and poor NPI (p = 0.002). This was consistent with another study which also found a significant association of Cathepsin D expression with tumour grade (p < 0.001) [24]. Unexpectedly, our study did not find a significant correlation between tumour stage and Cathepsin D expression but it showed a trend (p = 0.084). No correlation was found between Cathepsin D expression and nodal status in several immunohistochemistry studies in breast cancer [25, 26]. In the available literature, there are conflicting results between different studies regarding this relationship.

Cathepsin D expression was also associated with highly increased risk of subsequent metastasis, recurrence and tumour grade [27, 28]. Prior studies have noted that Cathepsin D is involved in degradation of extracellular matrix and basement membrane in cancers due to its proteolytic activity [29]. A study using a vectorized Cathepsin D inhibitor found that anti-proliferative activity was associated with Cathepsin D inhibition, suggesting that intracellular Cathepsin D plays a major role in cancer cell proliferation [30].

Furthermore, our study found that Cathepsin D expression is a powerful and independent prognostic factor for both breast cancer-specific survival (BCSS) and disease-free interval (DFI). This finding remained significant even after accounting for other key prognostic factors such as lymph node status, tumour size, tumour grade and oestrogen receptor status. These results align with previous research [31,32,33,34,35].

The relationship between Cathepsin D expression and steroid receptor status has been studied by many researchers. This study shows a highly significant association of Cathepsin D expression and ER status (p = 0.006) but no association with PgR status. Here, we also show that Cathepsin D is a prognostic marker within ER-positive patients, a group of breast cancer patients with relatively good prognosis. Cathepsin D has subdivided ER+ patients into two prognostic groups. Cathepsin D-negative patients have significantly better BCSS and less risk of either recurrence or metastasis than Cathepsin D-positive patients within this ER+ patient group. Other studies have reported similar findings [11, 31]. While some studies found no association between Cathepsin D expression and ER status, their conclusions may be limited by their small sample sizes and reliance on ELISA for cytosolic Cathepsin D measurement [25, 32].

Our study revealed a highly significant difference in breast cancer-specific survival (BCSS) between patients who received tamoxifen and those who did not, but only in patients with Cathepsin D-positive tumours (p = 0.025). This suggests that tamoxifen may increase Cathepsin D concentration in tumours that already express it, thereby altering the prognosis for this specific patient group. Consequently, evaluating Cathepsin D expression in breast cancer prior to initiation of hormonal therapy may predict anti-oestrogen responsiveness. Interestingly, our study and previous research suggest a potential link between Cathepsin D expression and the response to tamoxifen therapy in ER-positive breast cancer patients. Although the previous study did not reach statistical significance (p = 0.09), it supports our findings and warrants further investigation [32].

The level of expression of Cathepsin D was correlated with oestrogen receptor status in breast cancer [31]. Studies of RNA levels of Cathepsin D showed that tamoxifen increased Cathepsin D RNA level regardless of the ER status of the tumours and that this increase is directly proportional to protein level in the cytosol in ER+ tumours but not in ER− tumours [33].

Although Cathepsin D is regulated by oestrogen, in ER-negative breast cancer what regulates or stimulates Cathepsin D overexpression is unknown and might involve other pathways, e.g. other enhancers stimulated by transcription factors, hypoxia induced expression and thrombin [1, 34, 35].

The relationship between Cathepsin D and HER2 expression has not been studied extensively. In this study, HER2 expression was highly associated with Cathepsin D expression (p = 0.006). Clinical outcome of patients who are Cathepsin D-positive and HER2-positive was compared to HER2-negative patients. Patients with HER2-positive tumours have significantly lower survival rates than those with negative HER2 (p < 0.0001). This further emphasizes the association of Cathepsin D with poor prognostic variables. In addition, from this study, Cathepsin D lost its independent significance as prognostic factor (for BCSS) when HER2 expression is introduced in the multivariate analysis along with tumour grade, size and stage.

While previous research has reported an association between Cathepsin D expression and HER2-neu amplification, contradictory findings exist, demonstrating no such link [36, 37]. Additionally, silencing Cathepsin D has been shown to increase ER expression and decrease HER2 expression, suggesting a complex interplay between these proteins [38]. Further investigation into this relationship, including subgroup analysis based on trastuzumab therapy, is warranted to determine the potential of Cathepsin D as a predictor of treatment response.

This study has some limitations. Tumour size, and grade and NPI score information were missing for 25 and 27, respectively, of the 954 cases. While listwise deletion was employed to handle missing data, this could potentially introduce bias into the results. Future studies could explore alternative approaches such as imputation techniques to address missing data and further strengthen the generalizability of findings.

Conclusion

Cathepsin D expression significantly predicts poor prognosis in breast cancer and is associated with aggressive clinicopathological features and worse outcomes, including shorter breast cancer-specific survival (BCSS), higher risk of recurrence and distant metastasis, suggesting its potential as a prognostic biomarker and therapeutic target.

It is noteworthy that the results suggest a possible interaction between Cathepsin D expression and tamoxifen therapy in ER-positive breast cancer. It could be clinically useful to predict response to tamoxifen treatment and help identify patients at risk of resistance. This observation warrants further investigation to elucidate the underlying mechanisms.

Cathepsin D expression in breast cancer correlates with HER2 status, warranting further investigation in larger, trastuzumab-treated patient cohorts to assess its potential as a predictor of treatment response.

In conclusion, this large-scale study provides compelling evidence for the clinical relevance of Cathepsin D as a prognostic marker in breast cancer. Further research is needed to confirm these findings and to explore the potential therapeutic implications of targeting Cathepsin D in breast cancer treatment.