Background

Breast cancer is the most common cancer in women around the world [1]. Approximately 10% of patients with breast cancer carry germline pathogenic variants (GPVs) in cancer predisposition genes (CPGs) implicated in the DNA repair pathway [2, 3]. Distinguishing breast cancer patients with GPVs is essential for employing systemic treatment strategies and risk-reducing interventions [4, 5]. However, less than 10% of these carriers are referred for genetic testing in current clinical practice due to the cost and time spent [6, 7].

The probability of carrying GPVs among breast cancer patients has long been evaluated in terms of family cancer history and clinical characteristics, such as the age at diagnosis and tumor pathological information [8, 9]. One of the most commonly used criteria is the National Comprehensive Cancer Network (NCCN) criterion [7, 10,11,12]. However, adhering to the current NCCN criteria would overlook nearly half of breast cancer patients with a clinically actionable GPV [7, 11,12,13]. Nonetheless, routine genetic testing of all or most breast cancer patients would require vastly increased genetic counseling and management, which might not be easily achieved with presently available resources [14]. Furthermore, extending population-based genetic testing to patients with low rates of or non-existent founder mutations might pose a considerable financial burden, ethical concerns, and other barriers [15, 16]. Therefore, an accurate prediction model for GPVs in clinically actionable genes is urgently needed. Recently, deep learning algorithms were demonstrated to improve clinical practice in genomic diagnostics due to their high accuracy and ability to extract information from big data [17]. Recent studies have demonstrated deep learning as a feasible and potentially useful tool for predicting germline BRCA1/2 status for cancer patients using demographic and clinical characteristics, medical images, or pathology images [18,19,20]. It is not known whether deep learning algorithms can be used to improve the precise selection of breast cancer patients to undergo genetic testing.

Here, we evaluated the family history of multiple cancer types and detailed phenotypes in a multi-center cohort of 3041 female Chinese breast cancer patients who underwent multigene genetic testing. Based on the distinct endophenotypes of breast cancer patients with GPVs in genes involved in homologous recombination and other DNA repair pathways, we designed a deep learning-driven model named DrABC (DNA-repair Associated Breast Cancer) to improve the accuracy in identifying carriers for GPVs in CPGs among breast cancer patients.

Methods

Study participants and design

In this multi-center cohort study, we consecutively recruited unselected female patients with breast cancer from October 1, 2017, to August 31, 2019, at the Cancer Hospital of Chinese Academy of Medical Sciences and Peking Union Medical College (CHCAMS, i.e., the discovery cohort) and other six hospitals(i.e., the validation cohort), including (1) Huanxing Cancer Hospital, (2) Guiyang Maternal and Child Healthcare Hospital in Guiyang, (3) the Affiliated Cancer Hospital of Zhengzhou University, (4) the Affiliated Yantai Yuhuangding Hospital of Qingdao University, (5) the Fourth Hospital of Hebei Medical University, and (6) Beijing Tiantan Hospital all in China. The diagnosis of each patient was based on pathological results from resection specimens. This study was reviewed and approved by the ethics committees at each participating hospital. Written informed consent was obtained from each participant. This article follows the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines [21].

As a result, 3041 women with breast cancer were enrolled, while 113 patients without available samples were excluded. The germline genetic test and analysis of 50 CPGs and detailed phenotypic evaluation were conducted in the remaining 2928 patients.

Phenotype data

We collected phenotypic data including the age at diagnosis, family cancer history, personal cancer history, pathological features, molecular subtype, and clinical stage (Additional file 1: Supplementary method). Molecular subtyping was performed based on hormone receptor (HR, including estrogen receptor [ER] and progesterone receptor [PR]) and HER2 status [22]. Staging was determined according to the 8th edition of the classification of breast cancer staging from the American Joint Commission of Cancer [23].

GPV analysis

Genomic DNA was extracted from peripheral blood or saliva. GPVs in patients from each center were analyzed by their local diagnostic laboratory, which generated a clinical genetic test report for each participant. Each laboratory provided results by the enrichment of the coding regions and consensus splice sites of 50 CPGs in the DNA repair pathway using a targeted panel followed by sequencing (Additional file 1: Supplementary method) [24, 25]. Only novel variants or variants with < 0.1% population frequency in the 1000 Genomes (October 2013) and the genome Aggregation Database (gnomAD, http://gnomad.broadinstitute.org/) were collected in this study. The clinical significance of each GPV was evaluated based on a 5-tier classification system of pathogenic/likely pathogenic (P/LP), benign/likely benign (B/LB), and variants of uncertain significance (VUS) according to guidelines of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology and in-house pipeline [25,26,27,28]. The variants in BRCA1/2 were further analyzed according to the ENIGMA expert panel review [29, 30]. For those variants without available expert panel results, the consensus classifications in ClinVar were referred to. Variants classified as P/LP were considered pathogenic in this study (Additional file 1: Supplementary method).

DrABC model development

The DrABC risk prediction model was designed based on a hierarchical neural network that starts with an input layer of 25 neurons corresponding to features of carriers of GPVs in CPGs followed by two hidden layers. A dropout operator is applied to the hidden layers with a 25% chance of disabling a random neuron, which prevents the model from overfitting. In addition, a non-linear activation function, Scaled Exponential Linear Unit [31], is attached to the output of the hidden layers, which helps keep the representation distributions close to Gaussian. Finally, the output layer consists of two neurons with a sigmoid activation function, such that it produces two valid probabilities (i.e., in the range of [0, 1]): P1 and P2.Using P1 and P2, the final prediction is calculated using the following equations:

$${P}_a={P}_1,$$
(1)
$${P}_b={P}_1{P}_2,$$
(2)
$${P}_c={P}_1\left(1-{P}_2\right),$$
(3)

where Pa is the probability of having mutation in any CPGs, Pb is the probability of having BRCA1/2 mutation, and Pc is the probability of having mutations in other CPGs.

With the paired input features and ground truth annotations of [Pa, Pb, Pc] (in the form of one-hot encoding), we trained 101 deep learning models using cross-entropy loss via gradient descent. The final prediction is derived by aggregating results from all deep learning models through the ensemble learning strategy (Additional file 1: Supplementary method) [32, 33]. The cutoff points for each prediction scenario were determined to achieve 90% sensitivity (or the maximum sensitivity).

To evaluate the performance between the DrABC model and other machine learning models, we compared six kinds of commonly used machine learning algorithms, including a fixed grid of Generalized Linear Models (GLMs), a naive Bayes (NB) classifier, five pre-specified Gradient Boosting Machine (GBM) models, three pre-specified and a random grid of eXtreme Gradient Boosting (XGBoost) models, a default Random Forest (RF), a near-default Deep Neural Net (DNN), and a random grid of DNNs. All models were trained on the discovery dataset to predict whether a breast cancer patient carries germline pathogenic variants in any cancer predisposition genes (CPGs) using an inner five-fold cross-validation strategy. For each algorithm family, only the best model was retained to represent the maximum performance of each kind. These common machine learning algorithms were performed using the R package h2o [34].

Statistical analysis

Student’s t-tests were used to analyze age at enrollment and age at diagnosis. The prevalence of personal cancer history, family cancer history, tumor size, histological grade, ER/PR/androgen receptor (AR)/HER2 status, and lymph nodes metastasis were compared using Pearson χ2 or Fisher’s exact tests. The risk of carrying a GPV in BRCA1/2 or CPGs was also estimated using NCCN guidelines (version 1.2020) [12], BRCAPRO (version 2.1-7) [35, 36], Myriad II [37], PENN II [38], and BOADICEA (v3) [39] models in the multi-center validation cohort. Sensitivity, specificity, accuracy, and area under the curve (AUC) with the receiver operating characteristic (ROC) were calculated to evaluate the predictive performance of DrABC, other machine learning, and previous models. The performance of two ROC curves was compared through the “DeLong’s test” [40] using the algorithm of Sun and Xu [41]. Two-sided p < 0.05 was considered statistically significant. Statistical analysis was performed using SPSS version 15.0 (SPSS, USA) and R statistical software, version 3.5.1. The Youden index (J = sensitivity + specificity − 1) was used to evaluate the balance and potential effectiveness of each model with the suggested threshold [42].

Results

Patient characteristics

In total, patients were diagnosed at 42.9 ± 9.1 years of age, with 1168 (39.9%, 1168/2928) having early-onset cancer (age at diagnosis ≤ 40 years [43]). There were 400 (13.7%) patients with a family history of breast cancer, 86 (2.9%) patients with bilateral breast cancer, and 96 (3.3%) patients with an additional primary cancer other than breast cancer.

Prevalence of GPVs

In total, 332 (11.3%, 332/2928) patients harbored 335 GPVs in CPGs (including 334 single nucleotide variants/indels and one deletion of BRCA2 exons 22-24), while 295 VUS were found in 249 (8.5%) patients (Fig. 1, Additional file 2: Fig. S1 and Additional file 3: Table S1) and were excluded from further analysis to avoid potential contamination of datasets. Patients with GPVs (n = 332) were further divided into four subgroups according to the clinical significance of mutated genes: BRCA1 (n = 131); BRCA2 (n = 132) (Fig. 2A and B); other homologous recombinational repair (HRR)-related genes [44] including PALB2 (Fig. 2C), RAD51C, RAD51D, BARD1, and BRIP1 (n = 43); and other CPGs [45] (n = 26).

Fig. 1
figure 1

Patient enrollment and study design. GPV, germline pathogenic variant; CPG, cancer predisposition gene; CHCAMS, Cancer Hospital of Chinese Academy of Medical Sciences

Fig. 2
figure 2

Genotype-phenotype atlas of hereditary breast cancer. A Germline pathogenic variants (GPVs) in BRCA1 were found in 131 (4.5%) patients. Most BRCA1 carriers had triple-negative breast cancer (82/131, 62.6%). B GPVs in BRCA2 were found in 134 (4.6%) patients. Most BRCA2 carriers were hormone receptor (HR)-positive and HER2-negative (90/134, 67.2%). C GPVs in PALB2 were found in 33 (1.1%) patients. Most PALB2 carriers were HR-positive and HER2-negative (21/33, 63.6%)

Association of GPVs with clinical characteristics

Compared with non-carriers, patients with GPVs in any CPGs are associated with younger onset ages (40.15 ± 8.29 in any CPGs vs. 43.43 ± 9.08 in non-carriers, p = 5.7 × 10− 10; Table 1 and Fig. 1). Furthermore, patients with GPVs in BRCA1/2 are associated with even younger-onset ages, personal history of all cancers, previous breast cancer, and ovarian cancer, family history of breast cancer (41.2% in BRCA1 carriers and 32.6% in BRCA2 carriers vs. 10.8% for non-carriers, p = 8.4 × 10− 18 and 9.5 × 10− 11, respectively) and all cancers (64.9% in BRCA1 carriers and 53.8% in BRCA2 carriers vs. 30.9% for non-carriers, p = 1.6 × 10− 14 and 1.5 × 10− 7; Additional file 4: Fig. S2), and bilateral breast cancer (11.45% in BRCA1 and 8.33% in BRCA2 vs. 2.22% in non-carriers, p = 1.0 × 10− 6 and 3.6 × 10− 4, respectively).

Table 1 Comparison of clinical characteristics between patients with and without DNA-repair pathway gene mutation

In particular, patients with GPVs in BRCA1 are associated with a family history of ovarian cancer (14.5% for BRCA1 carriers vs. 0.8% for non-carriers, p = 1.2 × 10− 15), more grade III (64.89% for BRCA1 carriers vs. 25.01% for non-carriers, p = 1.5 × 10− 20), more negative cases in ER, PR, and AR (71.76%, 70.99%, and 38.17% for BRCA1 carriers vs. 26.08%, 26.89%, and 7.46% for non-carriers, p = 4.1 × 10− 25, 2.6 × 10− 23, and 4.6 × 10− 21, respectively). Significantly more breast cancer with ki67 > 30%, EGFR-positive breast cancer, and CK5/6-positive breast cancers were also seen in BRCA1 mutation carriers (Table 1). Meanwhile, patients with GPVs in BRCA2 are associated with a family history of leukemia or male breast cancer, lymph node metastasis (56.82% in BRCA2 carriers vs. 38.01% in non-carriers, p = 4.0 × 10− 5), more positive cases in ER and PR (81.06% and 81.06% for BRCA2 carriers vs. 67.83% and 66.94%for non-carriers, p = 7.8 × 10− 3 and 2.3 × 10− 3, respectively), and more wild-type P53 (41.7% for BRCA2 carriers vs. 26.8% for non-carriers, p = 4.1 × 10− 4). Besides, patients with GPVs in other HRR-related genes are associated with a family history of pancreas cancer (9.3% in other HRR-related genes carriers vs. 1.5% in non-carriers, p = 4.3 × 10− 3) and more wild-type P53, while patients with GPVs in other CPGs are also associated with lymph node metastasis (Table 1).

However, HER2-positive status was less common among patients with HRR-related GPVs but not among those with GPVs in other CPGs (1.5% for BRCA1 carriers, 4.6% for BRCA2 carriers, 0% for other HRR-related gene carriers, 26.9% for other CPG carriers, vs. 21.6% for non-carriers, p = 4.2 × 10− 11, 1.1 × 10− 7, 4.3 × 10− 5, and 0.48, respectively). Triple-negative breast cancer was more common among patients with GPVs in BRCA1 than among non-carriers (62.6% vs. 12.9%, p = 6.8 × 10− 37; Table 1). However, most BRCA2 and other HRR-related gene mutation carriers were HR+/HER2− (66.67% and 67.44%, respectively; Additional file 5: Fig. S3). When combining molecular subtypes with the age at diagnosis and family cancer history, the CPG mutation carriers were further enriched accordingly (Additional file 6: Fig. S4).

Using a deep learning model to predict GPVs in DNA repair genes

To ensure data integrity and cleanness, 249 patients with VUSs and 247 patients without complete clinical information or family cancer history were excluded from model construction [46]. A total of 1701 patients from the CHCAMS constituted the discovery cohort, and 731 patients from six other institutions constituted the independent multi-center validation cohort (Additional file 7: Fig. S5).

We used 25 clinical features associated with GPVs in CPGs to develop the prediction model. These 25 features correspond to an input layer of 25 neurons (Additional file 8: Table S2), followed by two hidden layers of 16 and 8 neurons, respectively (Additional file 9: Fig. S6). As a result, DrABC achieved a superior performance through the inner five-fold cross-validation in the discovery cohort, which was slightly higher than other traditional machine learning models but without significance (p > 0.05 when comparing each model with the DrABC; Additional file 10: Fig. S7).

Performance of DrABC versus previous models

DrABC generates probabilities of whether a breast cancer patient carries GPVs in BRCA1/2, other CPGs except for BRCA1/2, or any CPG. In predicting GPVs in any CPG, the AUCs for DrABC were 0.80 (95% CI, 0.78–0.83) for the discovery cohort and 0.74 (95% CI, 0.69–0.79) for the validation cohort, which were superior to those for previous models (AUC = 0.65 for BRCAPRO [35], AUC = 0.57 for BOADICEA [39], AUC = 0.56 for Myriad [37], and AUC = 0.61 for PENN II [38] in the validation cohort; p < 0.01 when comparing each model with the DrABC; Fig. 3A, Table 2, and Additional file 11: Table S3). Of the 731 patients in the multi-center validation cohort, 513 (70.2%) met NCCN criteria for genetic testing criteria and 218 (29.8%) did not. Patients meeting NCCN criteria were more likely to carry GPVs in any CPG than patients not meeting the criteria (15.2% [78/513] vs. 9.6% [21/218], p = 0.045; OR = 1.7 [95% CI, 1.0–2.8)]. As a result, the NCCN criteria showed a sensitivity of 78.8%, specificity of 31.2%, and accuracy of 37.6% (Table 2). Expansion of NCCN criteria [13] to include all patients diagnosed with breast cancer at ≤ 65 years of age could increase the sensitivity to 100% but reduced specificity to 2.5% and accuracy to 15.7%. When achieving the highest detection rate, DrABC had a sensitivity of 90.8% and specificity of 53.2% for all GPVs in the discovery cohort and a sensitivity of 83.8% and specificity of 51.3% for all GPVs in the multi-center validation cohort (Table 2 and Additional file 12: Table S4).

Fig. 3
figure 3

Performance of risk prediction models for hereditary breast cancer. A DrABC performed better than previous models in predicting germline pathogenic variants (GPVs) in any cancer predisposition genes (CPGs) (AUCs of 0.74 for DrABC, 0.65 for BRCAPRO, 0.57 for BOADICEA, 0.56 for Myriad, and 0.61 for PENNII). B In predicting GPVs in BRCA1/2, the AUC of DrABC was 0.79 (95% CI, 0.74–0.85) for the validation cohort, which was superior to those for previous models (0.70 for BRCAPRO, 0.59 for BOADICEA, 0.59 for Myriad, and 0.63 for PENN II). C, D The probabilities generated by DrABC were distributed differently between non-carriers and patients with GPVs in any CPG (C) or BRCA1/2 (D). **p < 0.01, ****p < 0.0001, when comparing with the DrABC

Table 2 The prediction accuracy of the algorithms and NCCN criteria in multi-center validation cohort

In predicting GPVs in BRCA1/2, the AUCs for DrABC were 0.81 (95% CI, 0.78–0.84) for the discovery cohort and 0.79 (95% CI, 0.74–0.85) for the validation cohort, which were also superior to those for previous models (AUC = 0.70 for BRCAPRO [35], AUC = 0.59 for BOADICEA [39, 47], AUC = 0.59 for Myriad [37], and AUC = 0.63 for PENN II [38] in the validation cohort; p < 0.01 when comparing each model with the DrABC; Fig. 3B, Table 2, and Additional file 11: Table S3). The DrABC had a sensitivity of 85.6% and specificity of 65.5% for GPVs in BRCA1/2 in the discovery cohort and a sensitivity of 82.1% and specificity of 63.1% for GPVs in BRCA1/2 in the validation cohort, when achieving the highest detection rate (Additional file 12: Table S4). Compared to previous models, the DrABC demonstrated the highest Youden index with the corresponding threshold for detecting GPVs in BRCA1/2 or any CPG (Table 2), suggesting DrABC has a more balanced performance compared with previous models.

The probabilities generated by DrABC were distributed differently between non-carriers and patients with GPVs in any CPG (p = 2.0 × 10− 10; Fig. 3C) or BRCA1/2 (p = 7.8 × 10− 16; Fig. 3D and Additional file 13: Fig. S8), suggesting its capability in distinguishing patients with hereditary breast cancer. However, DrABC was less satisfactory in predicting GPVs in other CPGs, with AUCs of 0.72 (95% CI, 0.64–0.79) in the discovery cohort and 0.58 (95% CI, 0.46–0.70) in the validation cohort, which was still higher than other models (AUC range = 0.44-0.53 in the validation cohort; Table 2 and Additional file 14: Fig. S9) but without significant difference (p > 0.05 when comparing each model with the DrABC; Additional file 11: Table S3). There was no significant distribution difference between non-carriers and patients with GPVs in CPGs other than BRCA1/2 (p = 0.39; Additional file 14: Fig. S9).

Contributions of family cancer history and pathological features to DrABC performance

To identify their contributions of features to the deep-learning model, we assessed the performance of DrABC after eliminating family cancer history or pathological feature data in the validation cohort. Eliminating family cancer history data did not reduce the performance of DrABC, with AUCs of 0.72 in predicting GPVs in any CPG, 0.75 in predicting GPVs in BRCA1/2, and 0.52 in predicting GPVs in other CPGs. However, eliminating pathological feature data reduced the performance of DrABC, with AUCs of 0.62 in predicting GPVs in any CPG, 0.66 in predicting GPVs in BRCA1/2, and 0.44 in predicting GPVs in other CPGs (Additional file 15: Fig. S10). Therefore, pathological feature represents an important predictive factor for hereditary breast cancer.

Reconstructing previous prediction models using in-house data in discovery cohort

To investigate the contribution of the Chinese-specific training dataset to the superior performance of the DrABC model to the previous model, we reconstructed the previous prediction models of BRCAPRO, BOADICEA, Myriad, and PENN II using the underlying algorithms (i.e., Bayes’ theorem for BRCAPRO and BOADICEA, Logistic regression for Myriad and PENN II; through the R package h2o [34]) and input variables (Additional file 16: Table S5) [35,36,37,38,39] of each model. These reconstructed models were trained in the Chinese discovery cohort and validated in the multi-center validation cohort. In predicting GPVs in any CPG, DrABC was superior to the reconstructed models of BRCAPRO, Myriad, and PENN II (AUC = 0.74 for DrABC, AUC = 0.64 for BRCAPRO, AUC = 0.63 for Myriad, and AUC = 0.66 for PENN II in the validation cohort; p < 0.01 when comparing each model with the DrABC; Additional file 17: Fig. S11 and Additional file 18: Table S6). Similarly, in predicting GPVs in BRCA1/2, DrABC was superior to these three reconstructed models (AUC = 0.79 for DrABC, AUC = 0.68 for BRCAPRO, AUC = 0.68 for Myriad, and AUC = 0.70 for PENN II in the validation cohort; p < 0.01 when comparing each model with the DrABC; Additional file 17: Fig. S11 and Additional file 18: Table S6). However, there was no significant difference between the AUCs for DrABC and the reconstructed BOADICEA model in both predicting GPVs in any CPG and BRCA1/2 (AUC = 0.75 and 0.78 for BOADICEA, p = 0.48 and 0.32 when comparing each model with the DrABC, respectively; Additional file 17: Fig. S11 and Additional file 18: Table S6).

Online DrABC tool

We implemented a website interface (http://gifts.bio-data.cn/) to accommodate extensions to the DrABC model and make it easily accessible to healthcare providers and researchers (Additional file 19: Fig. S12). The user guide was provided in the Additional file 20: A user guide for the DrABC model.

Discussion

Breast cancer patients with GPVs in BRCA1/2 and other breast cancer-associated genes benefit from particular patterns of systemic treatments and risk-reducing interventions [48]. Although risk prediction models have been developed for combined groups of patients with breast or ovarian cancer as well as healthy individuals with a family history of hereditary breast and ovarian cancer [36, 37, 49, 50], no clinical tool has been specifically developed for patients already diagnosed with breast cancer. Therefore, we developed and validated a reliable prediction model using deep learning algorithms to identify GPV carriers among unselected breast cancer patients with better accuracy than previous models and no trend toward overfitting.

In this study, we have compared and tested the currently available risk prediction models and identified the shortfalls and limitations as follows: (1) the probability of carrying GPVs was derived from data from multi-generation families and computed based on the family history of specific cancers, age at diagnosis, and ancestry [37, 50, 51]. Thus, their performances in small family structures with simple pedigrees would be significantly limited [8]. (2) Evolutionarily recent or de novo mutations may have a more significant influence on disease susceptibility or protection than ancient mutations (Additional file 16: Table S5) [52]. (3) The vast majority of models were developed based on the data driven from European populations, but the performance in Asian populations has not been validated [53]. (4) Most of the existing models were specifically designed to predict the GPV carrier risk in BRCA1/2 genes and thus cannot be readily used to assess the risk for other breast cancer predisposition loci, which are also important for personalized healthcare decisions.

Thus, to identify whether the superior performance of DrABC may also be attributed to its Chinese-specific training dataset, we imitated the previous prediction models of BRCAPRO, BOADICEA, Myriad, and PENN II using the corresponding algorithms and input variables and trained them in the discovery cohort of this study. As a result, the performance of DrABC was superior to those of the reconstructed models of BRCAPRO, Myriad, and PENN II, but similar to the reconstructed BOADICEA model. Notably, only DrABC and the reconstructed BOADICEA model have incorporated pathological features in the algorithm. Collectively, the DrABC model has shown better performance in the Chinese population than all these previous models in their current versions. After training these previous models with the Chinese-specific dataset, the previous models without the inclusion of pathological information still cannot compete with the DrABC model, while the BOADICEA model involving the pathological features demonstrated similar performance to the DrABC model.

In comparison with traditional machine learning models, although DrABC achieved a slightly superior performance than other traditional machine learning models, there was no significant difference among them (Additional file 10: Fig. S7). While the difference between the performance of the machine learning models was also not observed in a previous study of predicting GPVs status in pancreatic cancer patients [20]. However, based on the similar deep learning technique, DNN models had the worst performance with an AUC of 0.75, suggesting that DNNs in particular are difficult to perform well without ingenious design. In addition, as a complex classification task with three categories, we specially designed the prediction model based on a hierarchical neural network, producing two probabilities: P1 and P2, where P1 is the probability of having a mutation in any CPGs, P2 is the probability of having BRCA1/2 mutation when the patient is known to carrier mutation in any CPGs. To sum up, DrABC is a specially designed and well-performed model for this scenario.

As each CPG has distinct endophenotypes in terms of clinical and pathological features, the detailed phenotype of a proband with breast cancer should be incorporated in risk prediction. However, previously incorporating ER/PR/HER2 status into the BOADICEA model did not improve its predictive accuracy [53], inconsistent with the present study. Intriguingly, pathological features contributed more than family cancer history to the ability of DrABC to predict GPVs in BRCA1/2 and any CPGs, which might contribute to the superior performance of DrABC and the reconstructed BOADICEA model than the other previous models.

Asian breast cancer patients exhibit several unique features. Breast cancer is diagnosed at much younger ages in Asian women than in women from Western countries [1, 54]. Moreover, BRCA2 mutations are more common than BRCA1 mutations in Asian women as compared with Caucasian women [55, 56]. However, we found that breast cancer patients with GPVs in BRCA2 have less distinct endophenotypes than those with GPVs in BRCA1. These two features reduce the performance of previous risk prediction models and criteria [53]. To our knowledge, DrABC is the first available GPVs risk prediction model suitable for Asian breast cancer patients, which might contribute to the better performance than previous models based on Western populations.

Therefore, we introduced an applicable pipeline for GPV carrier risk assessment among patients with breast cancer (Additional file 21: Fig. S13). This approach would strike a balance between identifying more GPV carriers and testing fewer breast cancer patients and, in turn, would bolster national guidelines for genetic testing, and reduce healthcare costs. However, we cannot rule out that testing breast cancer patients with a low risk of GPVs would further increase the detection rate [57] but should be undertaken considering local healthcare resources and patient desires.

However, there are some limitations in this study. As our study included few carriers of GPVs in CPGs other than BRCA1/2, their endophenotypes were not well-represented. Although this study employed a multi-center design, only Chinese female patients with breast cancer were investigated. Extending the usage of this model in other ethnicities requires further tuning via training the model with ethnicity-specific dataset, following by validating in larger cohorts in the corresponding population.

Conclusions

Breast cancer patients with GPVs in different CPGs exhibit distinct endophenotypes. Based on these distinct features, we developed and validated a phenotype-driven risk prediction model using a deep learning algorithm to identify GPV carriers among unselected breast cancer patients in a multi-center cohort. The DrABC model better predicted the risk of carrying GPVs in BRCA1/2 or other CPGs in the Chinese population compared to previous risk prediction models which were trained in other populations. This robust germline defect risk stratification tool can be utilized to triage patients at higher risk for genetic testing.