Response assessment by imaging methods has always been considered not precise enough and not sufficiently correlated with outcome of patients. In opposite, response assessment by pathological examination was considered to be objective and strongly correlated with a high chance for a cure. Pathologic complete response (pCR) was therefore recommended as the gold standard for response evaluation after neoadjuvant treatment [1]. However, with recent treatment advances, detection of pCR became more frequent and knowledge increased about the potential pitfalls and shortcomings of this surrogate endpoint marker.

Whereas older definitions of pCR referred only to the absence of residual invasive or even gross invasive disease in the breast [2], newer definitions requested also tumor-free axillary nodes or even no residual in situ disease in the breast. This was based on observations in the last decade that patients with no tumor residuals in the breast but involved nodes showed a significant relapse risk [3], and just recently that even patients with in situ breast tumor residuals only showed a higher relapse risk than patients without any residuals in the breast and nodes [4]. Especially the last observation was contradictory to the beliefs that in situ disease is not responsive to systemic treatment at all, and that residual in situ disease cannot be the source of subsequent distant metastasis. However, these beliefs have to be revisited due to evidence reported in this issue of the journal that neoadjuvant chemotherapy given together with trastuzumab can eliminate adjacent ductal carcinoma in situ in about 50 % of cases [5]. As residual in situ disease might simply indicate a higher chance of non-detected residual invasive disease which might explain the higher relapse risk of these patients observed in larger cohorts [4]. Nevertheless, harmonization of pCR definitions within clinical trials or at least elaborate reporting of the components of residual disease is warranted. Aiming to identify a cohort of patients with a minimum risk for relapse, the most conservative definition should be used.

The increasing understanding of the biology of distinct intrinsic breast cancer subtypes raises the question if the chance to achieve a pCR after specific treatments is comparable as well as if pCR correlates similarly with survival in all subtypes. With regard to chemotherapy, hormone-receptor (HR) status has so far been shown to be a reproducible predictor for pCR with higher pCR rates in HR-negative tumors than in HR-positive tumors [6]. It was suspected that pCR rates might be higher in HR-positive tumors if endocrine treatment is added. Indeed, a small trial presented in this issue of the journal found a significantly higher pCR rate when letrozole was given in combination with chemotherapy compared to chemotherapy alone [7]. However, this is in contrast to larger trials reported earlier by our group and as well the ECTO group again in this issue of the journal where neither tamoxifen [8] nor exemestane [9] in combination with chemotherapy achieved sufficient pCR rates. However, we have learned that pCR nicely correlates with survival in HR-negative (triple-negative or HER2-positive) tumors, but much less or not at all in HR-positive tumors [4]. So the observed lower pCR rate in HR-positive tumors does not predict an unfavorable prognosis of these patients. Other surrogate endpoint markers for HR-positive tumors are therefore warranted.

A multitude of markers has been proposed to better predict outcome of neoadjuvant chemotherapy, reports on another 4 of these markers are published in this issue of the journal [1013]. Therefore, some general prerequisites should be described for the development and validation of such new markers: preferably markers should be searched for separately in cohorts with either HR+/HER−, HER2+, or triple-negative tumors; new markers should be immediately validated on independent data sets before their release; and markers should not only be correlated with the occurrence of a pCR but also with survival. Otherwise, we will never understand if this marker will be of clinical importance.

Can we therefore use the neoadjuvant model to test new drugs? Also, could it be used for rapid and early drug approval? Currently, this model can be used only to achieve an early signal of efficacy for new drug combinations if restricted to tumors with triple-negative or HER2-positive disease [14] or to identify potential predictive markers for drugs with new mechanisms of action. Unless a quantitative correlation between increments in pCR and gains in survival in these two most promising subtypes have been confirmed on large data sets, we still have to conduct large-scale adjuvant study to fully assess the effect of systemic treatments.