Introduction

One of the commonest genetic lesions in breast cancer is mutation of the tumor suppressor gene TP53, encoding the p53 protein. p53 is a transcription factor that mediates antiproliferative mechanisms in response to various forms of cellular stresses, in particular DNA damage [1]. Different types of DNA damage activate p53 through different pathways, resulting in different responses including senescence, cell-cycle arrest and apoptosis [2].

Experimental models of breast cancer also show that mutation of p53 may confer an aggressive tumor behavior that is not seen in p53-null models [3]. Most mutant p53 proteins lose their ability to bind wild-type p53 responsive elements and to regulate the expression of p53 transcriptional targets, thus losing tumor suppressor activity. However, cellular preservation of mutated p53 may confer malignant potential such as the capacity to metastasize, through gains of function activities (reviewed in [4] Oren and Rotter, 2010).

TP53 mutation is generally associated with a poor prognosis, predicting poor disease-free survival (DFS) and overall survival (OS) in breast cancer patients [5, 6]. As a predictive biomarker for treatment response, the role of p53 remains a matter of debate. More than a decade ago, p53 emerged as an important factor in the activity of DNA-damaging chemotherapies [7]. Indeed, preclinical studies suggested p53-dependent anthracycline-induced apoptosis and p53-independent taxane activity [7, 8]. Many clinical studies undertaken in the last decade have sought to validate these results. Most studies have retrospectively assessed p53 in subgroups from biologically unselected breast cancer trials [913]. Clinical data remains conflicting and inconclusive, and no robust predictive correlation has surfaced. An important recent study is the neoadjuvant phase III EORTC 10994/BIG 00-01 trial, which is the only study to be prospectively powered to assess the predictive role of p53 [14]. p53 status was assessed using an RNA-based technique, which detects functionally important p53 mutations using a yeast-based assay [15]. The prognostic role of p53 was confirmed, but p53 was not predictive of response or resistance to docetaxel.

The methods used to evaluate TP53 status and the diversity of observed mutations constitute sources of heterogeneity when analyzing the clinical impact of mutations. More than 75% of TP53 mutations are missense mutations that produce mutant proteins, and up to 25% of mutations are small insertions or deletions that produce truncated proteins. Determination of p53 status by immunohistochemistry (IHC) is plagued by high false-positive rates (overexpression of p53 wild-type protein), high false-negative rates (truncating mutations that stain negative), and a poor level of correlation with TP53 gene mutations [9]. IHC has been surpassed by direct DNA sequencing, functional assays in yeast and p53 genetic signatures. Studies that have used gene resequencing to assess TP53 status have produced more consistent results for the prognostic value of mutations [5, 16]. However, results of gene resequencing should be interpreted in terms of downstream p53 protein functions as TP53 gene mutations impact differently on protein functions, as evidenced in functional assays in yeast or human cells [17, 18]. Indeed, assessment of the transactivation capacity of p53 mutant proteins on different p53 target sequences has shown great variability of activities between mutant proteins and target sequences. Whereas hotspot missense mutations in the DNA-binding domain lead to a general loss of specific transactivation capacity on all target sequences, missense mutations outside the DNA-binding domain more often retain transcriptional activity on some promoters. Moreover, some mutant proteins may have dominant-negative effects on wild-type p53 or exert pro-oncogenic effects independently of wild-type p53 [19]. The TP53 Function Database created at the International Agency for Research on Cancer (IARC), Lyon, France, allows the classification of TP53 mutations according to their predicted impact on p53 protein activities [19].

In the late 1990s, the Breast International Group (BIG) 02-98 phase III randomized trial was one of many clinical trials launched to explore the role of adjuvant taxanes in early breast cancer (http://ClinicalTrials.gov Identifier: NCT00174655) [20]. Improved outcomes from the addition of taxanes to anthracycline-based therapy have been reported in many [2025] but not all trials [2628]. A recent meta-analysis reported superior outcomes from the addition of a taxane to anthracycline-based regimens in high-risk disease [29]. The absolute benefit deriving from the addition of taxanes is modest, less than 10% absolute improvement in DFS. A key issue for clinical practice is that despite taxane benefit for clinical trial cohorts, there is no predictive marker for the identification of individuals that are most likely to benefit from taxane.

With this background of inconsistent data for clinical utility of p53 status and absence of predictive markers for taxane use, we undertook a retrospective, exploratory analysis in BIG 02-98 to investigate the prevalence, and prognostic and predictive value of different types of TP53 mutations. Mutation classifications based on the IARC TP53 Function Database were used to estimate the functional impact of TP53 gene variations, rather than their presence per se.

Materials and methods

Study population

The current study is a retrospective study on primary tumor samples from a larger population of patients participating in the clinical trial BIG 02-98. Details of patients and methods in this trial have been described previously by Francis et al. [20]. Briefly, BIG 02-98 was a multicenter, randomized phase III adjuvant trial of 2887 women aged 18 to 70 years with operable, clinical stage T1 to T3 invasive breast adenocarcinoma, with at least one positive axillary lymph node. Between 1998 and 2001, patients were randomly assigned at a ratio of 1:1:2:2 to one of the following four adjuvant chemotherapy regimen arms: Arm A: sequential control = four cycles of doxorubicin (A) 75 mg/m2, followed by three cycles of classical cyclophosphamide, methotrexate and 5-fluorouracil (CMF); Arm AC: concurrent control = four cycles of AC 60/600 mg/m2, followed by three cycles of CMF; Arm A-T: sequential docetaxel = three cycles of A 75 mg/m2 followed by three cycles of docetaxel (T) 100 mg/m2 followed by three cycles of CMF; Arm AT: concurrent docetaxel = four cycles of AT 50/75 mg/m2 followed by three cycles of CMF. The planned cumulative doxorubicin dose was higher in the control arms (A: 300 mg/m2; AC: 240 mg/m2) than in the docetaxel arms (A-T: 225 mg/m2; AT: 200 mg/m2). A-T and AT had different docetaxel dose intensity but the same cumulative dose (300 mg/m2). Endocrine therapy and radiotherapy were administered according to local guidelines. Adjuvant trastuzumab was not available. Institutional Ethics Committees approved the protocol at all participating sites and patients provided written informed consent.

p53 substudy cohort

From BIG 02-98, patients who had formalin-fixed paraffin-embedded (FFPE) primary tissue submitted centrally and for which there was sufficient remaining tumor tissue for TP53 gene analysis were selected for the p53 biomarker study. A total of 666 cases were selected, of whom 520 were successfully analyzed for exons 5 to 8. The substudy population was representative of the entire BIG-02-98 population for all baseline patient and tumor characteristics. [See Additional file 1.] The translational study was approved by the Ethics Committee of the Jules Bordet Institute (IJB) in Brussels, Belgium, which served as the coordinating center for this retrospective study.

Tumor material and immunohistochemistry

The study protocol requested central collection of one FFPE tumor sample for each patient. Slides were prepared by the Pathology Department at IJB and sent to the European Institute of Oncology (EIO), Milan, Italy for central pathology analyses and genomic DNA extraction. Slide review, IHC and fluorescence in situ hybridization (FISH) were performed on whole tissue sections from FFPE samples. Tumor grade was centrally reviewed. Unstained tumor specimens were stained for estrogen receptor (ER), progesterone receptor (PgR), human epidermal growth factor receptor 2 (HER2) and Ki-67 (all specific monoclonal or polyclonal (for HER2) antibodies were purchased from Dako, Glostrup, Denmark). IHC results were reported as the percentage of invasive tumor cells showing definite immunoreactivity. FISH was performed for HER2 according to the manufacturer's instructions (Abbott-Vysis Inc., Downers Grove, IL, USA). Thresholds for positivity were defined as: ER: ≥ 1%; PgR: ≥ 1%, HER2: IHC 3+ (more than 10% invasive tumor cells with intense and circumferential membrane staining) or 2+ and FISH positive (HER2:CEP17 ratio > 2). The Ki-67 threshold used in the distinction of luminal A and luminal B subtypes was defined as ≥ 14%, based on published work by Cheang et al. [30].

Four breast cancer subtypes were defined using central laboratory defined parameters, as follows: (1) highly endocrine responsive (luminal A): ER-positive, PgR-positive, HER2-negative and Ki-67 low; (2) incompletely endocrine responsive (luminal B): ER-positive and PgR-negative, independent of other parameters, or ER-positive, PgR-positive and at least one of grade 3, HER2-positive and/or Ki-67 high; (3) HER2-positive: ER-negative, PgR-negative and HER2-positive; and (4) triple-negative: ER-negative, PgR-negative and HER2-negative. [See Additional file 2.]

DNA extraction

Genomic DNA was extracted from two to three serial sections of FFPE-archived specimens (10 μm thick). Dewaxing was obtained by xylene and ethanol with alternating vortexing and centrifuging. DNA was then extracted with a commercially available kit (DNeasy Blood & Tissue Kit, Qiagen N.V., Venlo, The Netherlands). Tissue was lysed overnight with proteinase K digestion in denaturing condition and DNA bound to spin column silica membrane. The contaminants were washed away and DNA was eluted with 100 μL of sterile distilled water. The final DNA concentration was evaluated by OD260 (GeneQuant II, Pharmacia Biotech, Uppsala, Sweden).

TP53mutation screening and sample classifications

Genomic DNA was screened for TP53 mutations at IARC. Exons 5 to 8 of TP53 were analyzed in all available samples by polymerase chain reaction (PCR). Exon 4 was analyzed in a subset of samples with sufficient available tumor material. Direct sequencing of genomic DNA has been described in the IARC protocol [31]. Mutations were screened on both DNA strands and were confirmed in an independent PCR product.

TP53 gene sequencing reported all variations. These variations were used to classify samples according to p53 protein status, which take into account the predicted impact of genetic variations on p53 proteins. Thus, a variation is considered a mutation when it is predicted to modify p53 protein sequence. Tumors were classified as wild-type p53 or mutated p53. (See Table 1.)

Table 1 p53 mutation status and prevalence.

Mutated p53 protein status was further subcategorized as missense or truncating. (See Table 1.) Missense mutations produce proteins that are changed by one amino acid. Truncating mutations include nonsense, frameshift insertions and deletions, and variations in consensus splice sites. Annotations from the IARC TP53 Database [19] were used to also stratify missense mutations according to their transactivation activities in yeast functional assays, or to their position inside or outside DNA-binding motifs (L1/L2/L3 loops) [17, 32]. Sample subclassification was done when sequencing result was obtained for all exons 5 to 8. Two samples were not subclassified because the sequencing result was missing for one exon.

Statistical analyses

The study was based on the hypotheses that p53 mutated tumors would have the worst clinical outcome and the largest benefit from the addition of docetaxel. Statistics were performed using SAS version 9.1. (SAS Institute Inc., Cary, NC, USA) and Minitab version 13 software (Minitab Inc., State College, PA, USA). The chi-square test was used to compare the distributions of clinicopathological parameters by randomized treatment arm (anthracycline control arms, A and AC, versus taxane arms, A-T and AT), and by p53 status (wild-type p53 versus p53 mutated). Fisher's exact test was used to compare the distributions of clinicopathological parameters by p53 mutation subcategorization (wild-type p53 versus missense mutation; wild-type p53 versus truncating mutation).

DFS was calculated from the date of randomization to the date of disease recurrence, second primary cancer or death from any cause. OS was calculated from the date of randomization to the date of last follow-up or death from any cause. Survival curves were estimated using the method of Kaplan-Meier and curves for different classifications were compared using the log-rank test. Multivariate Cox regression models, with backward selection, were used to test the prognostic effect of p53 status after adjusting for other important prognostic variables. Multivariate DFS analysis included patient characteristics (age, menopausal status, body mass index (BMI)), tumor characteristics (tumor size, number of positive lymph nodes, histopathological type, histological grade, IHC defined subtypes, p53 mutation status) and treatment characteristics (mastectomy, radiotherapy, use of tamoxifen, chemotherapy arm (A+AC versus A-T+AT arms), chemotherapy schedule (sequential or concurrent)).

Results

Results of BIG 02-98

After an 8-year median follow-up, the second efficacy results of BIG 02-98 did not show significant improvement in DFS from the incorporation of docetaxel compared with the doxorubicin-based control (hazard ratio (HR) = 0.91, 95% confidence interval (CI) = 0.80 to 1.05, P = 0.187). However, sequential A-T significantly improved DFS compared with the sequential control arm A (HR = 0.81, 95% CI = 0.67 to 0.99, P = 0.036), and significantly improved both DFS (HR = 0.84, 95% CI = 0.72 to 0.99, P = 0.035) and OS (HR = 0.79, 95% CI = 0.65 to 0.98, P = 0.028) compared with concurrent AT [33].

p53 substudy cohort

From BIG 02-98, 2172 of 2887 (75%) patients had FFPE primary tissue submitted centrally. Of these 2172 patients, 666 patients had sufficient remaining tumor tissue for TP53 gene analysis, of whom 520 (18% of the original trial population) were successfully analyzed for exons 5 to 8. Of the 520 tumors, 116 were also analyzed for exon 4. The remaining 142 samples could not be analyzed due to poor DNA quality.

The substudy population was representative of entire BIG-02-98 population for all baseline patient and tumor characteristics. [See Additional file 1.] DFS was similar for patients included and not included in this substudy. [See Additional file 3.]

The majority of patients in this p53 substudy were less than 50 years old, premenopausal and nonobese. (See Table 2.) Most tumors were infiltrating ductal carcinoma, ≤ 20 mm, grade 2 or 3, ER-positive and HER2-negative. Regarding the IHC-defined breast cancer subtypes, most tumors were classified as luminal B, of which, 59 were positive for HER2 (19%, not shown). Between the control (A + AC) and docetaxel (A-T + AT) arms, characteristics differed for ER/PgR status and IHC-defined subtypes: a significantly higher proportion of ER- and PgR-negative tumors were present in the control arms (P = 0.046). In keeping with this, there was a trend for a higher proportion of the HER2 subtype and triple-negative subtype in the control arms (P = 0.052).

Table 2 Characteristics of patients and samples in the p53 substudy.

p53 mutations and sample characteristics

A total of 96 variations within exon 5 to 8 were found in 90 samples (90 of 520, 17%). [See Additional file 4 for full data.] In the 90 patients with a TP53 gene variation, 85 patients had only one variation, four patients had two variations, and one patient had three variations. Most variations were missense and were located at classical hotspot codons, except for one specific mutation at codon 259 (p.D259N) that was found in six samples. It is of note that no in-frame deletions or insertions were found in these series. In the patient with three gene variations, each of the three variations was located in introns and was predicted to not affect protein sequence (not shown).

TP53 gene status was used to predict p53 protein status and classify patients as wild-type or mutated. Nonsynonymous mutations were further distinguished as missense or truncating. (See Table 1.)

The presence of a mutated p53 protein was associated with older age, postmenopausal status, ductal morphology, higher tumor grades and ER/PgR negativity. (See Table 3.) A strong correlation between IHC-defined subtypes and p53 status was found: the proportion of p53 mutated samples in each IHC subtype showed that the HER2 and triple-negative subtypes had the highest rates of p53 mutations, with 22% (7 of 32) and 36% (24 of 66) of mutated samples respectively, compared to 10% (8 of 84) in the luminal A subtype and 13% (45 of 315) in the luminal B subtype.

Table 3 Associations between p53 protein status and clinicopathological parameters.

Prognostic value of p53 mutations

At an 8-year median follow-up, the number of DFS events in the entire BIG 02-98 population was 916 of 2887 patients (32%). In this p53 substudy cohort, the number of DFS events was 164 of 520 patients (32%). Survival curves (DFS and OS) for patients included in this p53 substudy are shown in Figure 1. There was no statistically significant difference in DFS or OS based on p53 mutated status. However, when subclassifying mutations, truncating mutations but not missense mutations were found to be associated with a significant reduction in DFS and OS (P < 0.001). Further stratification of missense mutations, based on their transactivation capacities or location in the 3D structure of p53 protein, did not reveal classes of mutations with different prognostic value (data not shown). Thus, in univariate analysis only p53 truncating mutations were associated with poor survival.

Figure 1
figure 1

Prognostic value of p53 mutations. Survival in p53 wild-type versus p53 mutated cases, (a) DFS and (b) OS. Survival in p53 wild-type versus missense versus truncated mutations, (c) DFS and (d) OS. DFS, disease-free survival; OS, overall survival.

Multivariate DFS analysis was performed including patient characteristics (age, menopausal status, BMI), tumor characteristics (tumor size, number of positive lymph nodes, histopathological type, histological grade, IHC-defined subtypes, p53 mutation status) and treatment characteristics (mastectomy, radiotherapy, use of tamoxifen, chemotherapy arm (A+AC versus A-T+AT arms), chemotherapy schedule (sequential or concurrent)). The presence of p53 truncating mutations was associated with an increased risk of recurrence (HR = 3.21, 95% CI 1.74 to 5.94, P = 0.0002) compared to the absence of mutation within exons 5 to 8. In this multivariate model, only p53 truncating mutations, number of positive lymph nodes (≥ 4 nodes: HR = 1.99, 95% CI: 1.44 to 2.76, P < 0.0001) and the HER2 subtype (HR = 2.36, 95% CI 1.21 to 4.60, P = 0.0122) had independent prognostic value. (See Table 4.)

Table 4 Multivariate Cox proportional hazards models of disease-free survival.

Similar results were obtained for OS. In a multivariate OS analysis, p53 truncating mutations were associated with poor OS compared to the absence of mutation within exons 5 to 8 (HR = 2.75, 95% CI: 1.50 to 5.04, P = 0.0011).

Predictive value of p53 status

In keeping with efficacy results in the entire BIG 02-98 population, this substudy population showed a favorable trend but not a significant benefit in DFS from the addition of docetaxel (HR = 0.77, 95% CI: 0.56 to 1.06). Analyses of the predictive value of p53 status for prediction of improvement in DFS from addition of docetaxel are shown in Figure 2a (p53 wild-type and mutant) and Figure 2b (p53 wild-type, missense and truncating mutations). None of these predictive analyses were statistically significant.

Figure 2
figure 2

Value of p53 mutations in predicting DFS benefit from adding docetaxel to control anthracycline-based therapy. (a) Wild-type p53 versus mutant p53 protein. (b) p53 wild-type versus missense mutant versus truncated mutant. Treatment comparisons were made between (i) anthracycline control arms (A+AC) and sequential docetaxel (A-T); (ii) anthracycline control arms (A+AC) and concurrent docetaxel (AT); and (iii) anthracycline control arms (A+AC) and combined docetaxel arms (A-T+AT). A, doxorubicin; C, cyclophosphamide; DFS, disease-free survival; T, docetaxel.

Discussion

In this retrospective exploratory study performed on samples collected in the context of the prospective BIG 02-98 randomized phase III clinical trial, we show that p53 truncating mutations have a significant independent prognostic value in node-positive breast cancer patients treated with adjuvant chemotherapy. However, no significant value for p53 status in predicting response to docetaxel therapy was found in this cohort.

TP53 mutations were detected in a minority of patients (16.3%). Most mutations (75%) were missense, while the remaining (22%) were truncating. The prevalence and type of TP53 mutations found are similar with previous studies that have restricted their analysis to exons 5 to 8 and used DNA as starting material (73% of missense mutation in IARC TP53 Database, R15). The presence of mutated p53 was associated with patient characteristics of increased age and postmenopausal status, and tumor characteristics of ductal morphology, higher grades and ER/PR negativity, in agreement with previous studies [5, 34, 35]. The highest rates of mutations were seen in the HER2 and triple-negative subtypes, as reported in other studies [12, 36, 37].

Most mutations were found at classical hotspot codons and were loss of function mutations as assessed in yeast functional assays [17]. The only exception was a specific hotspot at codon 259, which was found in six samples. The resulting mutation, p.D259N, has been infrequently reported in human cancers (only 14 occurrences in the IARC TP53 Database, none in breast cancers [19]) and retains partial activity in yeast functional assays [17]. We ruled out PCR contamination and technical causation. The origin of this mutation remains to be determined.

Truncating mutations had an independent prognostic value against a large range of clinical and molecular variables. These results confirm and extend those obtained in previous studies, including our own performed on a large consecutive series of breast cancers [5]. Interestingly, missense mutations were not associated with poor survival in the current study. Discrimination between missense mutations according to location inside or outside DNA-binding motifs, mutations that retain or do not retain transactivation activities, or have dominant-negative effects did not differentiate missense mutations with poorer survival (data not shown). This is in contrast with our previous study, which showed that missense mutations within DNA-binding motifs (but not those outside these motifs) and non-missense mutations were associated with poor prognosis [5]. There was no obvious difference between the type of missense mutation found in this series and earlier studies to explain these discrepant results. Inconsistency of results regarding the prognostic value of missense mutations across studies may be due to the fact that missense mutations may retain some activity, beyond the specific activity detectable in a particular study. This is in contrast to more consistent results for truncating mutations, which are true loss of function mutations.

The multivariate model for the prognostic value of p53 included many variables, notably it included BMI- and IHC-defined molecular subtypes. BMI was included as obesity was found to have an independent poor prognostic value in this trial as well as in other studies [38, 39]. In this substudy however, BMI had no independent prognostic value. The independent predictors of poor DFS and OS were p53 truncating mutations, high number of positive lymph nodes and the HER2 subtype. It is of note that BIG 02-98 was conducted prior to the use of adjuvant HER2-targeted therapies. The prognostic value of HER2 subtype is thus independent of HER2-targeted treatment in these patients. Interestingly, the triple-negative subtype did not have prognostic value in the univariate analysis and had no independent prognostic value. Many studies in early breast cancer have reported that the triple-negative subtype is particularly aggressive and associated with a high rate of recurrence and early death. A possible explanation for the lack of adverse outcomes in the triple-negative subtype in the current study may be the limited sample size.

In this study, p53 mutations had no significant predictive value. This study adds to the conflicting body of clinical evidence regarding the potential clinical role of p53 as a single biomarker for prediction of chemotherapy response. Other studies assessing TP53 gene sequence have presented data in support of reduced anthracycline activity in TP53 mutated tumors [911]. However, the predictive correlations were not robust enough for TP53 gene status, as a single biomarker, to be considered a clinical tool to guide treatment decisions. Three recent neoadjuvant studies in early breast cancer have reported a predictive role for TP53 mutations. In a phase II Xeloda in Neoadjuvant (XeNa) trial assessing capecitabine and docetaxel, with trastuzumab if HER2-positive, a secondary study endpoint was evaluation of TP53 status and response to chemotherapy [12]. TP53 mutations, detected by AmpliChip p53 (Roche Diagnostics, Pleasanton, CA, USA), correlated significantly with improved pathological complete response rate compared with TP53 wild-type (30% versus 10%, respectively, P = 0.0032). In a study using dose-dense epirubicin-cyclophosphamide, TP53 mutations were detected using a yeast functional assay. Pathological complete responses, which occurred in 19% (15 of 80 patients), were restricted to tumors with a TP53 mutation (28 of 80 patients) [40]. In another small study that included only triple-negative breast cancers patients treated with neoadjuvant cisplatin, nonsense and frameshift TP53 mutations were associated with good response to treatment [41]. The lack of a control arm in these three studies limits interpretation of prognosis versus prediction, and of general versus specific agent chemosensitivity.

An important recent study is the neoadjuvant phase III EORTC 10994/BIG 00-01 trial which is the only study to be prospectively powered to assess the value of p53 status for prediction of docetaxel benefit [14]. Using an RNA-based technique that detects functionally important p53 mutations, the prognostic role of p53 was confirmed, but p53 was not predictive of response or resistance to docetaxel.

The negative results from the prospective EORTC study and our BIG 02-98 substudy, with a background of inconsistent results, suggest that p53 status as an isolated marker may be inadequate to predict treatment response to docetaxel. p53 is not a drug target, but rather a surrogate measure for cellular capacity for apoptosis, and cell proliferation control. The complexity of p53 function, post-transcriptional effects and up- and down-stream signaling pathway cross-talks may limit the power of p53 status as a sole biomarker, in particular for treatments that combine drugs that have different mode of action towards p53.

A strength of the current study is the biological distinction of nonsynonymous mutations as missense (in and out of DNA-binding motifs) and truncating. Grouping all mutations together for assessment of outcomes assumes homogeneity in the impact of different mutations. Use of a dichotomous classification did not identify a prognostic or predictive correlation, whereas the more specific classification system revealed the independent prognostic significance value of the truncating mutations. A limitation of this approach however, is that such a cohort contain few patients, limiting to some degree the confidence with which conclusions may be drawn for the subsets.

The presumption of a homogeneous role of p53 across diverse biological subgroups may also account for inconsistent findings. In distinct breast cancer subgroups, p53 mutations may have variable prevalence, and impact differently on prognosis and treatment sensitivity. As part of the I-SPY neoadjuvant initiative (CALGB 150007), a recent study reported TP53 status determined by gene chip technology and sequencing in women treated with anthracycline- then taxane-based therapy [42]. In the luminal A subtype, 14 of 48 patients has a TP53 mutation, and there was no differential efficacy of either treatment based on TP53 status. In contrast, TP53 mutations had a much higher prevalence in the luminal B, basal-like and HER2 subgroups, in whom, anthracycline efficacy was independent of TP53 status and taxane efficacy was greater in the presence of wild-type TP53. In the current study, mutated p53 was indeed more prevalent in HER2 and triple-negative subtypes. However, small numbers precluded analyses by subgroups.

A critical issue in the clinical translation of p53 is the impact of gene variations on p53 protein function. Herein the classification of p53 mutations was based on estimated downstream impact on protein sequence. A reported alternative for identification of 'functional' loss of wild-type p53 is a p53 transcriptional fingerprint. Several gene expression signatures have been reported that distinguish between wild-type and mutant p53 [36, 43, 44]. Interestingly some TP53 wild-type tumors expressing the mutation-associated 32-gene signature behaved aggressively, while some TP53 mutant tumors lacking the signature had favorable outcome [43]. In another recent report, a 39-gene and a 30-gene p53 signature were developed in ER-positive and -negative breast cancer, respectively [45]. It is of note that there was no overlap in genes across the two signatures. The ER-positive p53 signature in ER-positive disease was predictive of poor prognosis and increased chemosensitivity. The ER-negative p53 signature in ER-negative disease was associated with improved prognosis, but had no prediction of treatment response. Downstream functional assessment of p53 status may also capture the impact of other p53 variables that may influence tumor outcome and treatment sensitivity, such as p53 codon polymorphisms [46] and p53 protein isoforms [47]. Further studies using downstream functional assessment of p53 status are thus needed to better understand the impact of p53 mutations on tumor phenotype.

The current study has limitations. This is a retrospective exploratory analysis, which was not considered in the initial statistical trial design or trial sample size determination. In the trial, all patients received polychemotherapy so it is impossible to determine interactions between single-agent activity and p53 status. Only exons 5 to 8 of TP53 gene were screened for mutation due to the limited amount of DNA available. According to previous studies, up to 20% of mutations may fall outside these exons (IARC TP53 Database, R15). To evaluate the number of misclassified samples due to this partial sequencing, 84 samples were successfully sequenced for exon 4, the next most mutated exon in breast cancers (IARC TP53 Database, R15). One truncating and 13 missense mutations were found. Since all missense mutations retained transactivation activity, only 1 of 84 (1.2%) was considered a true deleterious mutation. As mutations in other exons are expected to be rare (less than 1% of mutations reported in breast cancers fall outside exons 4 to 8), we estimate that less than 2% of samples may have been misclassified as wild-type in our series.

Conclusions

Despite an abundant literature on the prognostic and predictive value of p53 status in breast cancer, a limited number of studies have used gene sequencing to assess p53 status and even fewer studies have been performed in the context of controlled clinical trials. The results confirm that loss of wild-type p53 through protein truncating mutations is associated with poor prognosis. Our results point to differences in the prognostic value of different types of mutations and to the difficulty of assessing the predictive value of a molecular marker in treatment regimens that combine drugs with different modes of action regarding this marker.