Breast cancer is a profoundly heterogeneous disease that includes a number of distinct entities with specific pathological features, biological behaviours and different sensitivities to systemic and targeted therapies [14].

The recent development of high-throughput molecular methods offers new opportunities to capture the wide range of this variability. Indeed, the novel molecular techniques of gene expression profiling permit evaluation of the expression of thousands of genes in a large number of tumours, so that a fully comprehensive portrait or profile of the different molecular pathways can be obtained. The reproducibility and the clinical usefulness of assays based on gene expression have been demonstrated with increasing confidence over the past decade, but cost considerations limit the wide availability of these techniques [5]. Immunohistochemical (IHC) determination of oestrogen and progesterone receptors (ER and PgR), the detection of overexpression and/or amplification of human epidermal growth factor receptor 2 (HER2) and the Ki67 labelling index have been defined as a convenient alternative to molecular subtyping and are considered a sufficient to guide therapeutic choices [6].

Several studies have investigated the correlation between these clinical/pathological prognostic and predictive markers and the levels of radiotracer uptake by primary breast tumours, measured as maximum standardized uptake value (SUVmax), during 18F-FDG PET/CT. High SUVmax was found to be significantly correlated with ER negativity, PgR negativity, triple negativity and Ki67 values, while conflicting results were found for the relationship with HER2 status [712].

In the present issue of the European Journal of Nuclear Medicine and Molecular Imaging, García Vicente et al. take a step forward in this approach to evaluation. They analysed the correlation between the glycolytic characteristics of primary breast tumours and their molecular subtypes. The authors prospectively evaluated 168 patients who were submitted to FDG PET/CT before neoadjuvant therapy for locally advanced breast cancer. Tumour subtypes were classified by IHC surrogates as luminal A, luminal B-HER2(−), luminal B-HER2(+), HER2(+) or basal (triple negative), following the recommendations of the 12th St Gallen International Breast Cancer Conference. Statistically significant differences were found in semiquantitative metabolic parameters among the different subtypes, with greater values in HER2(+) and basal tumours [13]. These results are not unexpected, as the relationship between increased glucose metabolism and tumour aggressiveness is well known. This is one of the first studies demonstrating a correlation between molecular and glycolytic phenotypes of breast cancer.

Humbert et al. found that baseline FDG uptake of primary tumours and its early response to neoadjuvant chemotherapy may vary according to the IHC subtype of the breast cancer. Triple-negative tumours showed the highest baseline SUV. After the first course of therapy the decrease in SUV was significantly higher in triple-negative and HER2-positive subtypes. Pathological complete response (pCR) occurred more often in these two subtypes, rather than in luminal tumours, and the relative change in SUV was predictive of pCR only in HER2-positive tumours with an accuracy of 76 % [14]. Zhang et al. evaluated 244 patients with metastatic breast cancer. They found that molecular subtype, visceral metastases and number of organs bearing metastases could be used to predict logarithmic values of SUVmax only in previously untreated metastatic breast cancer [15].

These few studies have led to important suggestions for future PET/CT studies in patients with breast cancer, since this disease cannot be considered as a single entity any longer. First, a clear definition of the different clinical/pathological subtypes is needed in order to allow comparison between studies carried out in different centres and the translation of the results obtained in the clinical management of patients. Moreover, the IHC classification requires the availability of reliable measurements of its individual components, which should be performed in accordance with the published guidelines to minimize errors [1618]. The assessment of hormone receptors and HER2 status in multicentre study, such as that of Garcia Vicente et al. [13], should be performed in a single, and preferably experienced, laboratory, in order not to affect the statistical validity of the results. Indeed, the false-negative and false-positive rates in the assessment of these biological parameters are still unacceptably high, as is the intra- and interlaboratory discordance rate. It has been repeatedly shown that the false-negative rate for ER and PgR may be as high as 20 %, whereas the false-positive rate is 2–4 % for ER but much higher, up to 15 %, for PgR [19].

We should keep in mind, however, that subtypes defined by clinical/pathological criteria are similar but not identical to intrinsic subtypes. For example, basal-like tumours are often referred to as triple-negative breast cancer because most basal-like tumours are negative for ER, PgR and HER2. This is true for about 75 % of these tumours, while the remaining 25 % include all the other molecular subtypes [20]. Moreover, the clinical/pathological criteria used for the definition of subtypes are rapidly changing and might differ in the near future from those used in the present studies. Recent data support a threshold of 20 % or higher as potentially indicative of high Ki67 status. Recent literature data also support the introduction of PgR to optimally categorize luminal A and luminal B tumours. The proposed value of PgR for distinguishing between these two subtypes is more than 20 %, according to a recent study in which the fine tuning was done by comparing the multigene expression-based assays across five independent cohorts and IHC-based definition of luminal A and B tumours [2123].

In the neoadjuvant setting, higher sensitivities and higher rates of pCR to anthracycline-based chemotherapy among basal-like and HER2(+) tumours have been reported. However, these two breast cancer subtypes show a poor prognosis, with high relapse rate in patients who do not achieve pCR [4, 24]. The earlier identification of this subgroup of patients by FDG PET/CT could facilitate the choice of a more effective therapy.

Another point that deserves attention is the use of FDG PET/CT in the initial staging of breast cancer. Garcia Vicente et al. [13] found no statistically significant relationship between molecular subtypes and disease stage, although patients with HER2(+) tumours showed a higher percentage of distant metastases (35 %) compared to patients with the other subtypes (12 %, 18 %, 15 % and 20 % for luminal A, luminal B-HER2(−), luminal B-HER2(+) and basal, respectively). Therefore some subtypes could benefit more than others from a baseline FDG PET/CT evaluation. This point should be clarified by way of a targeted prospective study with an appropriate number of patients.

18F-FDG is not the only radiopharmaceutical available to evaluate patients with breast cancer. Novel PET tracers already tested in humans, such as 18F-fluoroestradiol (FES, that binds to ER), 18F-FFNP (a progesterone analogue) and 68Ga-ABY-002 and 89Zr-trastuzumab (molecular imaging agents with high specificity and affinity for HER2), may provide additional useful information about breast cancer marker expression, tumour heterogeneity and responsiveness to therapy [2528].

The use of FES PET could have the advantage of allowing in vivo evaluation of ER expression and functional status, and bypasses the possibility of an error in the pathological determination of this molecular marker. Linden et al. demonstrated that the degree of FES uptake has the ability to predict response to endocrine therapy: low or absent FES uptake can identify patients unlikely to obtain an objective response and can lead to the exclusion of an ineffective treatment in patient management [29]. We can also hypothesize that the presence of FES uptake in triple-negative breast cancer could in vivo identify non-basal-like subtypes, that can show ER expression, even if decreased, as reported by Bidard et al. [30]. These patients could therefore be treated with targeted endocrine therapies, which would otherwise be denied.

Finally, both FDG and non-FDG tracers could be useful in the evaluation of metastatic breast cancer. Metastases may display characteristics that do not match those of the primary tumours, with discordance rates reaching 40 % for ER status [31]. Clinical management of disease must therefore adapt to the changing and evolving nature of the tumour over time. The reassessment of breast cancer molecular markers has proved useful in the management of relapsed disease, leading to a change in planned therapy in a significant percentage of patients [31, 32]. However, not all metastatic sites can be easily accessible to biopsy, and the results can be inconclusive in a proportion of patients. Histopathological evaluation of a single site of disease is therefore not representative of the global state of marker expression. In contrast, PET/CT is a total-body procedure and provides information about all sites of relapse in a single test.

In conclusion, combined evaluation of a patient by PET/CT scanning with different tracers could permit the outlining of a portrait of breast cancer heterogeneity. Thus, clinicians could have the tools to choose the best treatment option at each stage, to evaluate response to each therapy, to better characterize the prognosis of patients and to monitor the evolution of the disease. A noninvasive, personalized determination of glycolytic activity and molecular marker expression may therefore become the basis for treatment of individual patients with different subtypes of breast cancer.