Introduction

Breast cancer is the leading cause of death among women in developed countries [30]. Despite important advances in therapy, more than half of the affected patients undergoes relapses [17].

Many studies on gene expression carried out by microarrays have identified expression profiles and gene sets that are prognostic and/or predictive of outcome of breast cancer patients. Unfortunately, genes identified by these studies show only a slight overlap, even if they seem to be able to predict the outcome of the same group of patients [7]. The major concern related to the use of microarray platforms is the requirement of high-quality ribonucleic acid (RNA), generally derived from frozen samples. Such tumour sets are often small, and an extensive clinical history is not yet available, whereas formalin-fixed paraffin-embedded (FFPE) materials are abundant and allow the possibility of selecting samples related to patients that have been followed-up for long periods. Although reverse transcriptase–polymerase chain reaction (RT-PCR)-based methods from FFPE samples can have clinical application, genome-wide microarray analysis has proven to be difficult. A recent paper by Penland et al. [19], has demonstrated that only a minority of FFPE blocks could be analysed by microarray.

To investigate the progression of breast carcinoma, we analysed the level of expression of a well-defined panel of genes involved in the maintenance/abrogation of the cell senescence machinery in primary breast cancers. This study represents an extension of a previous paper where we analysed a part of this cohort of patients for cell cycle-related molecules, such as cyclins, cyclin-dependent kinases (CDKs) and the phosphatase cell division cycle 25a (CDC25a) [2].

Recent studies have demonstrated that the first step in creating an in vitro model of human cancer involves the abrogation of the cellular senescence mechanism. In fact, a tumour will develop only if it has bypassed all senescence controls [5]. Two tumour suppressor genes, retinoblastoma and p53, are central factors in replicative and premature senescence. Suppression or down-regulation of the retinoblastoma pathway together with the activation of immortalisation factors, such as telomerase activation and RAS pathway activation, is essential to define breast cancer tumourigenesis and progression [12], also at a clinical level. By using quantitative RT-PCR, we looked at the messenger RNA (mRNA) expression level of 22 genes belonging to the RAS pathway (epidermal growth factor receptor 1 [EGFr1] and 2 [HER2], k-RAS, v-raf-1 murine leukaemia viral oncogene homolog 1 [RAF-1], mitogen-activated protein kinase 1 [ERK1], phosphatidylinositol 3 kinase [PI3K], protein kinase B isoform 3 [AKT3]), cell cycle regulation/senescence (Cyclin A, D1 and E1, CDK2, CDK4, CDK6, CDK inhibitor 2A [p16], retinoblastoma 2 [p130-RB2], retinoblastoma [RB], CDC25a), telomerase complex (mRNA coding for catalytic component and RNA component, hTRT and hTR, respectively) and differentiation/metastasis suppression genes such as non-metastatic cells 1 protein (NM23), sel-1 suppressor of lin-12-like (SEL-1L) and cytokeratin 8 (CK8). Genes were selected according to their known links to malignant cell behaviour in breast cancer and their importance in major cancer hallmarks, such as cell senescence/proliferation and survival. Our main goal was to investigate whether the gene expression profiles of the reported molecular markers in primary tumours are predictive of metastatic potential.

Materials and methods

Eighty women were selected who were residents in the Northeastern Italian province of Trieste and whose first primary cancer was unilateral infiltrating duct breast carcinoma. Cancers were diagnosed at the age of 34 or older, between 1971 and 1989 and treated only with Halsted radical mastectomy and complete axillary dissection, without any additional chemotherapy or radiotherapy. Seventy-three patients belonging to this cohort were also part of a previous study [2]. All the slides from the primary tumour were reviewed by one of the authors (Stanta). Carcinomas were histologically graded according to Elston and Ellis [6] and grouped in stages according to the TNM Classification of Malignant Tumours, 6th edition [10].

The cohort of women was followed through the general population-based Friuli–Venezia Giulia Cancer Registry [1] from diagnosis of breast cancer to death or until 31 December 2006, whichever came first [18]. One patient was lost in 1993, after 4 years of follow-up. During the observation period, 69 women died, and all of them underwent complete autopsy at the Surgical Pathology Department of the University of Trieste.

FFPE specimens from primary breast cancers were microdissected and used to extract and quantify the RNA as described in detail elsewhere [3, 24, 27]. RT-PCR was performed as previously reported [24]. Cyclins A, D1 and E, CDK2, 4 and 6 and CDC25a were previously analysed in 73 patients [2] and seven additional patients. Oligonucleotides and RT-PCR conditions were already described for: cyclins A, D1 and E, CDK2, 4 and 6 and CDC25a [2], p16, RB, k-RAS, EGFr1, HER2 [25] and telomerase component genes [26]. For AKT3, CK8, ERK1, NM23, RAF-1, PI3K and SEL-1L and p130 (RB2) oligonucleotides, PCR and hybridisation conditions are reported in Tables 1 and 2. We used paraffin blocks without tissues as negative controls. The basal level of expression of the genes was quantified by analysing ten normal mammary glands, from biopsies negative for cancer performed in the same range of time as the cases. The baseline expression level was therefore detected for epithelial, myoepithelial and fibrous tissue but not adipose tissue elements of the normal gland. RNA extracted from the breast cancer cell line (BT-549) was used as a positive control. To test the reproducibility of the method, for every gene analysed, we randomly selected ten samples from the case study for re-submission for RT-PCR analyses. Amplification products were quantified by dot hybridisation as previously reported [24]. Standardisation using housekeeping genes was reported in detail elsewhere [2, 3].

Table 1 Oligonucleotide sequences
Table 2 RT-PCR and hybridisation conditions

T test analysis was applied to compare the mean value of expression of each gene in patients with recurrences and in those without metastasis. To estimate the joint effects of the analysed covariates on patients’ survival, the data were analysed by fitting the Cox proportional hazard regression model as previously reported [2]. Cox proportional hazard analysis included age at diagnosis, stage and grade of the tumours and the complete set of the molecular markers. In this analysis, tumours with expression of molecular markers lower or higher than the mean value were classified as low or high status of expression, respectively.

To investigate possible associations among the genes detected by Cox regression analysis, we chose pairs of molecular markers involved in opposite or conflicting functions. We considered groups defined by: high expression status of one marker and low expression status of the other marker as well as the opposite situation. The chosen pairs of genes were: HER2 and CK8 as de-differentiation/differentiation markers, CDK2 and RB as markers of cell cycle progression/senescence, PI3K-RAF1 and AKT3-RAF1 as markers of the main branches of the RAS pathways and PI3K and AKT3 as markers of activation of cell transcription factors. The log-rank test was used to compare the survival between the above mentioned groups.

Statistical analyses were performed with the package Stata/SE 9.0 (Stata, College Station, TX).

Results

The cohort of 80 women affected by breast cancer showed a median age at diagnosis of 61 years (25th–75th percentile = 51–70). All of them were affected by infiltrating duct carcinoma of the breast, the diameter of which was not larger than 3.0 cm at pathological examination. Staging and grading of tumours are shown in Table 3. The axillary lymph nodes were tumour negative in 43 patients, while they were positive in 37 patients at the time of diagnosis of the primary tumour. The median number of positive nodes was 3 (range = 1–15). During the period of observation, 42 patients developed distant metastases.

Table 3 Characteristics of the 80 breast carcinomas by stage, grading and patients’ age at diagnosis

The median duration of follow-up was 6 years (25th–75th percentile = 3–12 years). Of the 69 deceased patients, 42 died of breast cancer metastases, one patient died of ovary carcinoma and 26 of non-neoplastic diseases.

This study confirms our previous findings on CDC25a (p = 0.01), CDK2 (p = 0.01) and CDK4 (p = 0.03) expression in the groups of patients who developed recurrences and the patients that did not [2]. Moreover, the present study also confirms that CDK2 and CDC25a affected patients’ survival in univariate analysis (p = 0.0001 and p = 0.02, respectively) as previously reported [2].

The results of the Cox proportional hazard regression analysis are reported in detail in Table 4. An independent influence on cancer-specific death was detected for AKT3 (p = 0.01), RAF-1 (p = 0.05), CDK2 (p = 0.04), CK8 (p = 0.04), RB (p = 0.03) and HER2 (p = 0.02). A borderline influence on patients’ survival was detected for PI3K (p = 0.06). Conversely, tumour stage and grade, patients’ age at diagnosis and the remaining molecular markers analysed did not show an effect on survival. After dividing the stage into lymph node status (positive or negative for metastasis) and tumour dimension (diameter expressed in centimetres), the results of Cox proportional hazard regression analysis were restated. In this analysis, lymph node status and tumour diameter did not affect patients’ survival (p = 0.1 and p = 0.5, respectively).

Table 4 Results of Cox multivariate analysis

Among the genes related to survival, AKT3, CK8 and RB seem to have a protective effect. Higher expression levels were indeed associated with better prognosis (Figs. 1, 2 and 3, respectively). Conversely, CDK2, HER2, PI3K and RAF-1 showed a worse effect on patient survival (Figs. 4, 5, 6 and 7, respectively). In particular, CDK2 and HER2 high-expression statuses seem to be associated with early relapses, and patients’ survival decreased to 27 and 36% at 5 years, respectively (Figs. 4 and 5).

Fig. 1
figure 1

Survival curves related to Cox proportional hazard regression with respect to AKT3 expression level in the sub-groups of low AKT3 (AKT3 expression lower than the mean value) or high AKT3 (AKT3 expression higher than the mean value)

Fig. 2
figure 2

Survival curves related to Cox proportional hazard regression with respect to cytokeratin 8 expression level in the sub-groups of low cytokeratin 8 (cytokeratin 8 expression lower than the mean value) or high cytokeratin 8 (cytokeratin 8 expression higher than the mean value)

Fig. 3
figure 3

Survival curves related to Cox proportional hazard regression with respect to retinoblastoma expression level in the sub-groups of low retinoblastoma (RB expression lower than the mean value) or high retinoblastoma (RB expression higher than the mean value)

Fig. 4
figure 4

Survival curves related to Cox proportional hazard regression with respect to CDK2 expression level in the sub-groups of low CDK2 (CDK2 expression lower than the mean value) or high CDK2 (CDK2 expression higher than the mean value)

Fig. 5
figure 5

Survival curves related to Cox proportional hazard regression with respect to HER2 expression level in the sub-groups of low HER2 (HER2 expression lower than the mean value) or high HER2 (HER2 expression higher than the mean value)

Fig. 6
figure 6

Survival curves related to Cox proportional hazard regression with respect to PI3K expression level in the sub-groups of low PI3K (PI3K expression lower than the mean value) or high PI3K (PI3K expression higher than the mean value)

Fig. 7
figure 7

Survival curves related to Cox proportional hazard regression with respect to RAF-1 expression level in the sub-groups of low RAF-1 (RAF-1 expression lower than the mean value) or high RAF-1 (RAF-1 expression higher than the mean value)

Moreover, groups of patients with a high-expression status of RB but a low-expression status of CDK2 showed a longer disease-free survival in comparison to the group characterised by low RB status and high CDK2 status (p = 0.001) as described in Fig. 8. A borderline significance (p = 0.07) was detected when pairing CK8 and HER2 statuses (Fig. 9). The associations RAF1–PI3K and PI3K–AKT3 as well as the pairing between RAF1 and AKT3 did not affect patients’ survival (p = 0.9 and p = 0.7, respectively).

Fig. 8
figure 8

Kaplan–Meier survival curves for patients with respect to RB and CDK2 expression level in the sub-groups of high RB and low CDK2 or low RB and high CDK2 status

Fig. 9
figure 9

Kaplan–Meier survival curves for patients with respect to Cytokeratin 8 and HER2 expression level in the sub-groups of high CK8 and low HER2 or low CK8 and high HER2 status

Seventy-three percent of patients belonging to the group with low RB and high CDK2 statuses died during the first 5 years of follow-up. A similar pattern was found for the group described by low CK8 and high HER2, in which 47% of patients died within the fifth year of follow-up. Moreover, the groups described by these two couples did not share any patient. Conversely, breast cancer patients described by high RB and low CDK2 showed a 5- and a 10-year survival probability of 77 and 68%, respectively. Similarly, the survival probability of patients with high CK8 and low HER2 was 81 and 69% at 5 and 10 years, respectively. These two groups overlapped for three patients. The four groups identified by these couples of markers included 45 of the 80 patients (56%).

Discussion

The ability to define the risk of recurrence is useful in medical decision making. Several clinical and pathological factors have been proposed to predict the recurrence of breast cancer. However, at the present time, it is not possible to predict recurrence vs no recurrence at an individual level. It is commonly accepted that the different clinical courses of patients characterised by the same histology and stage are the result of molecular differences among cancers [21]. Different single molecular markers have been assessed in the past 20 years but failed to be considered for clinical use for various reasons [21]. Microarray studies have identified expression profiles and gene sets that are prognostic or predictive or both for patients with breast cancer. Comparison of the lists of genes derived from these studies shows only a slight overlap probably because of differences in the cohorts of patients, microarray platforms and mathematical methods of analysis [7]. Understanding the molecular alterations connected with progression to eventual recurrence is of particular importance, especially for therapeutic decisions.

In the present study, carried out on 80 women with infiltrating duct carcinoma of the breast, quantitative RT-PCR was performed for a set of 22 genes involved in cell senescence, malignant transformation and tumour suppression. This study confirms our previous findings on the expression level of CDK2, CDK4 and CDC25a in breast cancer progression [2] and their correlation with recurrences. As previously reported [2], CDK2 over-expression seems to have an influence on the patient’s survival both by uni- and multi-variate analysis, while data on CDC25a and survival were confirmed only by uni-variate analysis. In our study, Cox proportional hazard analysis did not identify tumour stage as significantly affecting survival in these patients (p = 0.1). The main differences between the previous and the present study is the enrolment of seven women in the cohort of patients and 15 new molecular markers that have been compared with the previous ones.

Patients’ survival was analysed by the Cox proportional hazard regression method. We divided the cases with a high level of expression (over the average value) from those with lower expression (below the average value). In this analysis, only those genes that showed a significant level of survival between the two groups are discussed.

Cox proportional hazard analysis identified different molecules involved in patients’ survival. In particular, our results showed that patients with a higher expression level of HER2 and/or PI3K and a lower expression level of AKT seem to have the worst outcome. Our data agree with previous reports. It is well known that alteration in the PI3K/AKT signalling cascade is common in human cancers and results in hyper-activation of the pathway that leads to tumour progression [8]. In particular, our data associate a worse outcome to patients over-expressing HER2 and/or PI3K but not AKT. The association of higher AKT3 mRNA expression level with better prognosis shown in our study is discordant with a previous paper [29]. A possible explanation of this discrepancy is related to methodological issues, as Totunaka et al. detected AKT in its activated status, i.e. the expression of phosphorylated AKT by immunohistochemistry (IHC). Our primer set was able to identify only isoform 3 of the transcript (AKT3). In fact recently, Maroulakou et al. reported that AKT2 ablation accelerates whilst AKT1 and AKT3 ablation seem to inhibit the development of mammary adenocarcinoma in mice [13] suggesting a complex functional pattern for this group of proteins.

RAF-1 is a protein serine–threonine kinase that plays an important role in cell growth, proliferation and survival. Our results show that patients with high RAF-1 status had a decreased disease survival as detected by Cox multivariate analysis (Fig. 7). These data agree with previous findings by Mewani et al. [15], these authors reported that an over-expression of RAF-1 in mammary epithelial cells induced the expression of genes promoting cell proliferation, invasiveness, angiogenesis, cell survival and also with mitogen-activated protein kinase kinase/extracellular regulated kinase-independent functions [14].

It is well-documented that the RB pathway plays a critical role in cell cycle progression, acting as a crucial negative regulator of cellular proliferation [23]. Our latest results on RB expression level and survival indicate a protective role of this molecule in breast cancer progression; patients with high RB seem to have a longer survival time (Fig. 3). Our findings agree with Derenzini et al. [4], who found that patients with low RB mRNA and a high level of phosphorylated RB labelling index had shorter disease-free intervals and overall survival time. Moreover, our data show that the CDK2 and RB expression levels, when combined together, are well able to predict patients’ survival. The fact that patients characterised by a high status of CDK2 and a low status of RB presented a worse prognosis agrees with the role of these molecules in progression of the cell cycle through the G1/S phase and even indicated overcoming of this control checkpoint.

CK8 is characteristically present in luminal cells of a normal mammary gland. A wide range of cytokeratin expression is often seen in invasive breast cancer, which may be caused by loss of differentiation of the tumour cells [28]. In our study, higher expression levels of CK8 were associated in Cox multivariate analysis with better prognosis, and CK8 has a protective influence on patients’ survival. Our data are concordant with the investigation by Fuchs et al. [9] based on IHC in breast carcinomas and breast cancer cell lines. That study revealed that suppression of CK8 was significantly correlated with short survival. Moreover, in cell lines with increasing invasion potential, epithelial keratins were also lost [9]. In our case study, CK8 (at high-expression status) in association with HER2 (low-expression status) identifies a group of patients with better prognosis. In this group of patients, the oestrogen receptor (ER) status was analysed by RT-PCR as previously reported [16] (data not shown). The patients of the group characterised by a high expression level of CK8 and low expression levels of HER2 were all positive for ER at the mRNA level. On the contrary, the other group, defined by the opposite pattern of the two markers (high HER2 and low CK8 levels), presented very low or no expression levels of the ER. These data agree with the recently reported molecular classification of breast cancer [20]. Patients with negative or weakly positive ER status and higher expression levels of HER2 presented a worse outcome, in accordance with Perreard et al. [20] who found ER-negative tumours more aggressive.

Of the 22 markers analysed in our study, 15 genes did not show influence on patients’ survival. Among them, telomerase components need a comment. This enzyme catalyses the de novo synthesis of telomere repeats, which is necessary for unlimited cellular proliferation. The telomerase reverse transcriptase hTRT was detected at mRNA levels both in normal breast and breast cancer tissues [11]. Recently, Salhab et al. [22] investigated the expression level of the key telomere-related genes in human breast cancer and found that hTRT levels predicted the patients’ overall survival. In our study, we did not find any association among telomerase components (hTR, the RNA template, and hTRT, the catalytic component), pathological parameters and patients’ survival. One possible explanation is the normalisation methodology used by Salhab. Indeed, the authors normalised the expression levels of the telomerase genes against cytokeratin 19 (CK19) and not against one or more reliable housekeeping genes. This could modify the expression level of the telomerase machinery according to the level of CK19 expressed in cancer epithelial cells. From our study, it seems that telomerase expression could be a prerequisite for the early phase of tumorigenesis rather than a mechanism connected with progression. The presence and relative abundance of epithelial cells expressing telomerase within normal breast tissues could suggest that breast tumours arise from these highly expressing telomerase epithelial cells [11].

The molecular tumour progression analysed in the present study shows the natural history of the disease because the enrolled patients were only treated with the same type of surgical therapy. Our findings strongly demonstrate that the combination of two pairs of functionally conflicting markers (RB and CDK2 on one side, CK8 and HER2 on the other) may be divided into sub-groups that cover more than 50% of the patients, and these pairs of markers are able to identify sub-groups of patients with shorter relapse-free and survival times from those with long survival without recurrences. Groups characterised by low RB and high CDK2 (27% of survivors at 5 years and 14% at 10 years) as with low CK8 and high HER2 (53% of survivors at 5 years and 33% at 10 years) have a higher risk of early recurrence and death, suggesting that breast cancer progression in these patients could be related to these markers. Moreover, the recognition of these sub-groups of patients with higher risk of early relapse could have further relevance for the selection of patients to submit to therapy against HER2 or also against CDK2 as possible therapy targets. On the contrary, those with low CDK2 expression and high RB (68% of survival after 15 years) and those with high CK8 and low HER2 (69% of survival after 15 years) present a very good prognosis.

These results are preliminary because the real predictive value of these double-biomarker groups could result only from larger retrospective and prospectively randomised studies to evaluate that the use of these molecular assays yield better clinical outcome than current clinical decision-making systems. Moreover, a possible use of IHC markers as surrogates for the reported genes could represent a further simplification with the prospective of introducing them into routine tumour management.