Introduction

Breast cancer (BC) has already become the most frequently diagnosed tumor and the leading cause of cancer-related death among women worldwide [1, 2]. According to the different expressions of specific molecular receptors, BC can be classified into various subtypes, which are tightly associated with the treatment strategy selection and prognosis [3, 4]. However, even with classic targeted therapies, a certain proportion of patients are insensitive to treatments and develop varying degrees of resistance [5,6,7,8]. Therefore, more attention should be paid to new molecular targets to assist in the diagnosis and treatment of BC.

Most recent studies have investigated the roles of androgen receptor (AR) in BC patients, showing that AR has different subtypes with considerable levels of expression [9, 10]. AR and estrogen receptor (ER) competitively bind to estrogen-dependent signaling pathways, and this crosstalk may contribute to acquired resistance to endocrine therapy [11, 12]. Besides, the expression of AR is highly correlated with the amplification of human epidermal growth factor 2 (HER2), which is coupled with AR activation and enhances the oncogenic function of certain signaling pathways [13, 14]. Triple-negative BC (TNBC) lacks recognized molecular targets, in which the expression of AR is related to ER-regulated gene transcription and a lower rate of pathological complete response (pCR) to neoadjuvant chemotherapy [15, 16]. Consequently, early recognition of AR expression not only helps predict prognosis but also guides the application of AR-targeted therapy, effectively improving patients’ sensitivity to traditional therapies and prolonging their overall survival.

At present, the expressions of molecular receptors are generally detected through immunohistochemistry (IHC). Such an approach is mainly based on biopsy as an invasive procedure, and it is rarely repeatable and unrepresentative due to small tissue samples and intratumor heterogeneity [17]. 18F-FDG PET/CT (Fluorine-18 fluorodeoxyglucose positron emission tomography/computed tomography), which combines functional metabolic quantification and morphological imaging, is commonly used for initial staging, post-treatment assessment and re-staging in BC [18, 19]. Radiomics, which uses medical imaging to noninvasively quantify intratumoral heterogeneity, has emerged as a translational research topic in BC [20]. Recent literature has indicated that radiomic features obtained from 18F-FDG PET/CT can contribute to the prediction of tumor molecular subtypes, while there are still limited studies focusing on the role of AR as a predictive parameter [17].

In the present study, we aimed to develop a machine-learning model for individual prediction of AR expression in BC patients by exploring the correlations between radiomic features, clinicopathological factors, and AR status. In addition, our findings might help identify the patients who could benefit from the AR-targeted treatment.

Materials and methods

Patients

This retrospective study was approved by the institutional review board of the First Affiliated Hospital of Soochow University, and the informed consent was waived because of its retrospective nature. Our current study was conducted in accordance with the Declaration of Helsinki, and the trial registration number of this study was ChiCTR2200062858.

All patients were histologically diagnosed as primary BC and underwent 18F-FDG PET/CT imaging for staging between December 2017 and April 2022. The specific inclusion criteria were as follows: (1) aged beyond 18 years; (2) without any therapy before the standard examination of 18F-FDG PET/CT; (3) with histological data (type and grade) derived from biopsy (generally for large lesions) or surgical specimens (routinely for relatively smaller lesions); (4) with IHC examination of AR performance; and (5) with complete clinical and imaging datasets. Exclusion criteria were as follows: (1) without complete 18F-FDG PET/CT images or good imaging quality; (2) with the too-small size of the primary lesion for segmentation; (3) with multiple lesions of BC; (4) with other types of cancers; and (5) with unclear histological proof.

18F-FDG PET/CT imaging

The patients were injected 18F-FDG (4.07–5.55 MBq/kg) for routine preparation after fasting for at least 6 h to ensure the blood glucose level was below 11.1 mmol/L. Approximately 40–60 min later, an integrated PET/CT scanner (Discovery STE, General Electric Medical Systems, Milwaukee, WI, USA) was adopted to acquire images from the base of the skull to the midthigh. With the acquisition parameters (transaxial field of view of 70 cm, pitch of 1.75, rotation time of 0.8 s, and slice thickness of 3.75 mm), low-dose (140 kV, 120 mA) CT images were acquired for the following attenuation correction and anatomic localization. Immediately, PET image acquisition was performed for 2–3 min per bed position. Finally, the ordered subset expectation maximization algorithm was used for image reconstruction.

Pathological evaluation

Histological sections stained with hematoxylin and eosin (H&E staining) were observed under the microscope to confirm the histologic type and grade by two pathologists independently. The expressions of ER, PR, HER2, Ki-67, and AR were detected by IHC. More than 10% of tumor nuclei with positive AR staining were defined as AR positive (AR+) regardless of staining intensity, otherwise defined as AR negative (AR-). As for ER and PR, both proportional staining (%) and staining intensity (weak, medium, and strong) were considered. The combination of IHC scores (1+, 2+, and 3+) and fluorescence in situ hybridization (FISH) was used to determine HER2 status. Ki-67 was divided into high expression and low expression according to a cutoff value of 30% of positively stained cells.

Extraction and selection of radiomic features

After image acquisition, PET and CT images in the DICOM format were imported into LIFEx freeware (v7.0.0 https://www.lifexsoft.org/), which automatically fused them for quantitative PET/CT analysis [21]. Two experienced nuclear medicine physicians, each with over 15 years of diagnostic experience, manually segmented the three-dimensional volume of interest (VOI) on every slice of primary breast lesions and metastatic lymph nodes. They were blinded to the clinical and pathological outcomes, except for the presence of BC. The VOI was defined with a threshold of 40% of the maximum standardized uptake value (SUVmax), which included MTV and total lesion glycolysis (TLG). Intensity discretization for CT data was performed with the number of gray levels of 400 bins and absolute scale bounds from − 1,000 to 3,000 HU. In contrast, it was segmented into 64 bins and measured with SUVmax ranging from 0 to 20 for PET images. Moreover, the radiomic features of PET and CT were calculated automatically by this software for texture analysis. Intra-class correlation coefficients (ICCs) were calculated to evaluate the repeatability of all the features, of which ICCs > 0.75 were selected for model construction. Subsequently, the least absolute shrinkage and selection operator (LASSO) regression algorithm and t-test were used for the selection of the remaining features. Finally, 10-fold cross-validation was used to ensure the robustness of the optimal features.

Construction and evaluation of the prediction model

Univariate regression analysis was used to screen out clinicopathological risk factors with statistically significant differences from potential predictors, including age, BMI, menopausal status, tumor location, histological types, clinical TNM staging, and molecular types. The above-mentioned risk factors were subsequently combined with radiomic features and analyzed by multivariable logistic regression analysis to develop a prediction model. The receiver operating characteristic (ROC) curve, Hosmer-Lemeshow (H-L) test, and decision curve analysis (DCA) were used to evaluate the discrimination, calibration, and clinical usefulness of the model, respectively.

Statistical analysis

All statistical data were calculated and analyzed with IBM SPSS statistics version 26.0, Python version 3.11 (https://www.python.org), MedCalc software (MedCalc Software, Ostend, Belgium), and R version 4.2.1 (http://www.R-project.org). The normality and homogeneity of variance of continuous data were evaluated by the Kolmogorov-Smirnov test and Levene’s test, respectively. The numeric variables were tested by the independent t-test and Mann-Whitney U test. Instead, the Chi-square test and Fisher’s exact test were applied to analyze categorical variables. P < 0.05 was considered statistically significant. Figure 1 presents the workflow of this study.

Fig. 1
figure 1

The workflow of this study. CT image (A), PET image (B), PET/CT fusion image (D), coronary image (E), sagittal image (F), and transaxial image (G) of 18F-FDG PET/CT display an example of VOI for extracting imaging features of BC. A patient with stage IV of BC underwent PET/CT image showing a metabolically active right breast lesion

Results

Clinicopathologic characteristics

A total of 48 BC patients were included in our study. Table 1 shows the baseline characteristics of these patients. Among the 48 patients, 33 patients (68.75%) were histologically confirmed as AR+, while 15 patients (31.25%) were confirmed as AR-. In our univariate analysis, MTV was tightly relevant to the expression of AR (p = 0.032). The ER and PR status were weakly correlated with AR status (p = 0.127 and 0.061, respectively). However, there was no significant difference found across the other clinical characteristics between the AR + and AR- groups.

Table 1 Characteristics of included BC patients with different AR expression statuses

Feature selection

Table 2 presents 80 radiomic features extracted from CT and PET datasets. Five features were considered valuable for the prediction of the AR expression using both the t-test and LASSO regression model. Figure 2 shows the most suitable Lambda parameter of LASSO regression. For the CT dataset, SHAPE_SphericityCT, GLCM_ContrastCT, and NGLDM_CoarsenessCT (CT Coarseness from neighborhood gray-level difference matrix) were selected. SHAPE_SphericityPET (PET Sphericity from SHAPE) and NGLDM_CoarsenessPET (PET Coarseness from neighborhood gray-level difference matrix) were extracted from the PET dataset. The ICC of the radiomic signatures was all above 0.75. Included MTV, we calculated the Pearson correlation coefficient between these features. Figure 3 displays the Pearson correlation coefficient matrix heatmap.

Table 2 Radiomic parameters extracted from PET/CT images
Fig. 2
figure 2

The Lamda of LASSO regression. The least absolute shrinkage and selection operator (LASSO) was conducted to select the radiomic features of CT and PET. Using 10-fold cross-validation, the suitable value of tuning parameter Lambda (λ) in LASSO regression was selected, and a vertical line was drawn here. MSE: mean squared error

Fig. 3
figure 3

The Pearson correlation coefficient matrix heatmap in the prediction of AR expression. The darker color presents a higher correlation between the two factors

Model construction

In the univariate analysis, the potential meaningful factors were selected for further multivariable regression analysis (p < 0.20) [22]. In order to prevent the omission of possible associated factors, those with P < 0.15 were included in our multivariate analysis. In addition, ER and PR have been confirmed to be correlated with AR expression in other research, and thus they were also included in our multivariable analysis [23]. Based on the above risk factors, we developed a nomogram capable of predicting the AR expression (Fig. 4). According to the cutoff of the ROC curve, continuous variables were transformed into binary variables (Table 3). It was worth mentioning that high MTV might be related to the negative expression of AR (Fig. 5). Moreover, these lesions were similar on PET/CT images but showed significant differences in the histograms of the radiomic features. Based on binary logistic regression, a diagnostic model consisting of significant risk factors was constructed, including MTV, SHAPE_SphericityCT, and GLCM_ContrastCT (Table 4). GLCM_ContrastCT with a significant statistical difference between the AR + and AR- (OR [odds ratio]: 1.420; 95% CI [confidence interval]: 9.000 [1.460–55.478]; P = 0.018) groups was the independent predictive factor by using the Forward stepwise regression method.

Fig. 4
figure 4

The nomogram of risk factors predicting the expression of AR. The importance of each variable that established the diagnostic model was visualized as the points

Table 3 ROC analyses for meaningful risk factors
Fig. 5
figure 5

A typical case of the relationship between MTV and AR expression. Two patients were diagnosed with IDC (invasive ductal breast carcinoma) successively in a year. The former (A: PET/CT fusion image, B: The VOI of the lesion, C: The histogram of VOI) was confirmed as AR-positive (SUVmax: 8.00; MTV: 10.76 cm3). The latter (D: PET/CT fusion image, E: The VOI of the lesion, F: The histogram of VOI) was confirmed as AR negative (SUVmax: 9.12; MTV: 21.32 cm3). These lesions were similar on PET/CT images but showed significant differences in the histograms of the radiomic features

Table 4 The Univariable and multivariate analysis of this prediction model

Model effectiveness evaluation

Figure 6 A shows the ROC curve of the combined prediction model. The AUC of the combined model for predicting the AR + was 0.832 (95% CI: 0.697–0.924). The sensitivity and specificity were 75.76% and 80.0%, respectively. Subsequently, 10-fold cross-validation was used to verify the stability of the AUC. The mean value of AUC was 0.833. After bootstrap resampling, we got the calibration curve (Fig. 6B). The concordance index (c-index) of this calibration curve was 0.885 (95%CI: 0.805–0.965), which was reduced to 0.882 after the bias correction. The p-value of the H-L test was beyond 0.05. These results all provided a higher calibration of this model. Finally, the DCA of this study displayed that more benefits could be added to clinical strategies when this prediction model was used to predict the AR expression (Fig. 6C).

Fig. 6
figure 6

The evaluation of the prediction model of AR expression in internal validation using Bootstrap. The ROC curve of the combined model consisted of radiomic features and clinical characters (A); the calibration curve (green) after correction of bias fluctuates around the ideal curve (red), which revealed the good accuracy between the actual probability and predicted probability (B); decision curve showed that the prediction model led to a higher net benefit than intervention or no-intervention of all patients among a large range of threshold (C)

Discussion

Some studies have focused on the correlation between the status of other receptors and the extensive expression of AR, indicating that it can promote the proliferation of cancer cells [24]. Accumulating preclinical and clinical evidence supports that some patients can benefit from AR inhibitor therapy, which especially improves the prognosis of TNBC patients [25,26,27]. In order to avoid the limitations of IHC, recent studies have demonstrated the feasibility of radiomics combined with clinical data for predicting intrinsic molecular subtypes and the expressions of receptors in BC [28, 29]. However, only very few studies have focused on predicting the expression of AR status. In our present study, MTV, SHAPE_SphericityCT, GLCM_ContrastCT extracted from 18F-FDG PET/CT could predict the AR status in BC patients. We further developed a machine-learning model with a higher AUC score on the ROC curve (0.832, 95% CI: 0.697–0.924).

Previous studies have shown that standard parameters (SUVmax, SUVmean, and MTV) extracted from PET images are associated with ER or PR status [17, 30, 31]. In our univariate analysis, there was no obvious correlation between SUVmax, SUVmean, and AR status. Huang et al. have concluded that ER and PR status have significant differences between AR + and AR- groups [23]. ER and PR status were weakly correlated with the expression of AR (p = 0.061, p = 0.127) in our study. The reason was attributed to the residual confounding factors, and it needs to be further investigated in a larger cohort population [32, 33]. However, high MTV showed a tendency to be associated with AR negativity (p = 0.032). Kaida et al. have also found a similar situation, in which a feeble correlation exists between high MTV and ER-negative status [31]. In further textural analysis, SHAPE_SphericityCT (p = 0.004), GLCM_ContrastCT (p = 0.008), NGLDM_CoarsenessCT (p = 0.008), SHAPE_SphericityPET (p = 0.012), and NGLDM_CoarsenessPET (p = 0.004) selected by LASSO and t-test were correlated with the expression of AR. In multivariable logistic analysis, only SHAPE_SphericityCT and GLCM_ContrastCT were included in the prediction model. Due to the limited density resolution of the PET images, they performed worse than CT images in screening and extracting meaningful radiomics signatures [34]. Currently, GLCM (grey-level co-occurrence matrix) textural features are used to classify the tumor grade of BC [35,36,37,38]. Here, we are the first to report the independent performance of GLCM_ContrastCT in predicting AR molecular markers (p = 0.018).

Based on the forward stepwise logistic regression, we established a combined model, in which the accuracy was usually assessed by discrimination, calibration, and clinical applicability [39, 40]. According to its AUC (AUC = 0.832), this model presented a better efficacy in the discrimination of AR status. Cross-validation and Bootstrap have proved to be used for the predictive performance of a small sample [41]. Furthermore, using the above methods for internal validation, we finally obtained a very stable value of AUC (AUC = 0.833). Some studies have proved that the c-index, with its 95% CI, can provide a more comprehensive assessment of calibration [42, 43]. Combined with the p-value of the H-L test, our model showed a higher capacity for stability [44]. DCA can be used to map the utility of a net benefit assessment model in decision-making within a clinically reasonable risk threshold [45]. The DCA curve of this study visualized that the diagnostic model achieved better net benefit than the all-intervention or no-intervention strategy within a larger threshold range, further proving the excellent clinical applicability of this model.

To the best of our knowledge, we originally established a combined model to predict AR expression based on 18F-FDG PET/CT imaging. There were some limitations in our present study due to the single-center and retrospective design. Next, due to the novelty of the molecular receptor detection of AR, the number of patients included in this study was relatively small. Thirdly, except for internal validation, the resulting model needed to be validated with an external cohort to improve the confidence of clinical applicability. Lastly, a classical supervised classification algorithm should be used to construct a prediction model, and deep learning-based features from PET/CT images need to be verified in further studies.

Conclusions

In conclusion, by combining clinicopathological factors and radiomic features extracted from 18F-FDG PET/CT images, we established a model to predict AR expression in BC patients with a single primary lesion. Our model could serve as a novel strategy to select patients who could benefit from anti-AR treatment and to assist clinicians in making clinical decisions.