Introduction

Breast cancer incidence remains the highest in women worldwide and has substantially increased in Korea over the past several decades [1, 2]. Early detection is crucial to reducing mortality associated with breast cancer and morbidity during treatment [3]. Current breast cancer screening includes physical examination and mammography, although mammography has shown low accuracy in young women or women with dense breasts [4]. Mammography may also cause pain or discomfort during testing, and harm from irradiation may outweigh its benefits in young women [5, 6].

Blood-based markers can address the drawbacks of the current breast screening tools [7]. We recently reported a new proteomic assay that quantifies blood levels of three proteins (Mastocheck®) using multiple reaction monitoring mass spectrometry (MRM-MS) [8]. The three proteins used in the assay are apolipoprotein C-I (APOC1), carbonic anhydrase I (CA1), and neural cell adhesion molecule L1-like protein (CHL1). APOC1 was identified as a downregulated serum protein marker in an independent study [9]. Studies have suggested that the CA1 protein contributes to microcalcification in breast cancer and atherosclerosis [10, 11], and high levels of blood CA1 are detected in patients with breast cancer [10]. CHL1, a cell adhesion molecule, has been proposed as a tumor suppressor in breast cancer; however, its value as a diagnostic marker is unclear [12].

Three proteins were selected among the 124 proteins discovered in our previous proteomic experiments [13,14,15,16] based on the diagnostic performance of plasma levels of each protein in patients with breast cancer [8]. The three-protein signature was subsequently validated in a large-scale study using samples stored in biorepositories [17]. Additionally, this signature has been shown to increase the diagnostic accuracy of breast ultrasonography when used in combination [18]. Herein, we report the results of a prospective multicenter trial on the diagnostic accuracy of the three-protein signature in women with suspicious breast lesions.

Methods

Study design and participants

This multi-protein assessment using serum to determine breast lesion malignancy (MAST) (KCT0004847) was a prospective multicenter trial conducted in 13 teaching hospitals in Korea. From August 2019 to September 2020, plasma and serum samples were collected from 471 participants with moderate to highly suspicious breast lesions identified by breast mammography or ultrasound, for which a breast biopsy was planned (Fig. 1). Moderate to highly suspicious breast lesions were defined as category 4B, 4C, or 5 of the Breast Imaging Reporting and Data System (BI-RADS) [19]. Women younger than 18 years of age, who had a personal history of breast cancer, or other malignancy within the five years were ineligible. Up to 20 ml of whole blood was drawn before the breast biopsy and was used for the three-protein signature. Demographic and pathological information was collected after the biopsy results were reported. A total of 451 patients were included in the final analysis after excluding patients for various reasons (Fig. 1). Additionally, the blood samples of 191 participants who showed no suspicious breast lesions (BI-RADS 1 or 2) were collected for the three-protein signature assessment. The three-protein signature (Mastocheck®, a regression model based on the relative expression of CHL1, APOC1, and CA1) was approved as an in vitro diagnostic tool using plasma samples by the Korean Ministry of Food and Drug Safety in Jan 2019. In this trial, serum samples were collected to determine the correlation between plasma and serum.

Fig. 1
figure 1

Study flow

This trial was registered at the Clinical Research Information Service of Korea, a member of the World Health Organization (WHO) International Clinical Trials Registry Platform (ICTRP), with the identification number KCT0004847. The original version of the study protocol is provided in Additional file 1. The detail of sample processing and statistical analysis is described in Additional file 2: Additional Methods.

Results

In this trial, 451 women with suspicious breast lesions and 191 with no suspicious lesions were included in the final analysis (Fig. 1). Among the 451 women with suspicious breast lesions, 313 were diagnosed with breast cancer through subsequent breast biopsies. Therefore, the analysis included plasma and serum samples from 313 patients with breast cancer and 329 women with biopsy-proven benign disease or no suspicious lesions in the breast. The clinical and demographic information of all participants is described in Additional file 3: Tables 1 and 2.

The sensitivity, specificity, and overall accuracy values of the three-protein signature for detecting breast malignancies in 642 women were 74.4%, 66.9%, and 70.6%, respectively (Table 1). The concordance index was 0.698 (95% CI 0.656, 0.739). The performance of the three-protein signature was not affected by the various clinical or demographic features of the participants and was not biased by site (p = 0.383) (Table 2). Notably, accuracy was not influenced by the mammographic density of the participants (p = 0.878). The three-protein signature assay was repeated using serum samples to determine the concordance rate with the results from the plasma samples. The results were concordant in 605 (94.2%) patients (Additional file 3: Table 1).

Table 1 Diagnostic accuracy of the three-protein signature
Table 2 Distribution of demographic characteristics by whether the 3-protein signature is accurate

Although the three-protein signature showed trends for higher sensitivity for in situ tumors and the American Joint Committee on Cancer (AJCC) stage I cancers, the sensitivity of the three-protein signature did not show statistically significant differences across AJCC stages (p = 0.859, Fig. 2a). Additionally, breast cancer subtypes, as defined by hormone receptor status and human epidermal growth factor receptor 2 (HER2) expression, did not show statistically significant association with the sensitivity of the three-protein signature (p = 0.902, Fig. 2b). Figure 2c shows the levels of the three protein markers among all participants. The levels of APOC1 and CA1 showed statistically significant differences between 313 patients with breast cancer and 329 participants with BI-RADS C1 or 2 breast imaging or biopsy-proven benign breast lesions. However, the CHL1 did not show such differences in the present study. The levels of APOC1 showed statistically significant differences between participants who had no suspicious lesion, patients who had biopsy-proven benign breast lesions, and patients with malignant breast lesions (p < 0.001, Fig. 2d). While the CA1 levels of patients with breast malignancy were significantly higher when compared to those of participants with biopsy-proven benign lesions or participants with no suspicious lesions (p = 0.019 and p < 0.001, respectively), the levels of CA1 did not show significant difference between the latter two groups (p = 0.482) (Fig. 2d).

Fig. 2
figure 2

Results of the three-protein signature analysis. a, b depict the sensitivity of the three-protein signature in different cancer stages and subtypes, respectively. The sensitivity of the three-protein signature showed no statistical differences across AJCC stages and molecular subtypes (p = 0.859 and p = 0.902, respectively). c The concentration of each protein in all participants. APOC1 and CA1 levels were statistically different between 313 patients with breast cancer (Cancer group) and 329 participants (Non-cancer group) with no suspicious breast lesions (Normal group) or biopsy-proven benign breast lesions (Benign group), whereas CHL1 did not show such a difference. Wilcoxon rank sum test was used to compare differences between two groups because the concentration of each protein was not normally distributed. d Depict the concentrations of the three markers in three participants’ group. APOC1 were significantly different between Normal group, Benign group, and Cancer group. While CA1 levels of Cancer group were higher than those of Normal group and Benign group, CA1 did not show significant difference between Normal group and Benign group. p-values calculated using Dunn’s nonparametric comparison for post hoc Kruskal–Wallis testing. *p < 0.05, **p < 0.01, ***p < 0.001 and ns non-significant

Discussion

To our knowledge, this is the first report of a prospective multicenter trial that addressed the accuracy of blood-based proteomic diagnostic markers for breast cancer. Although many studies have focused on discovering blood-based protein biomarkers for the early detection of breast cancer, the clinical value of these biomarkers has not yet been tested in a prospective clinical trial setting [20]. Our results demonstrated that the sensitivity of the three-protein signature was 74.4%, which is comparable with the results of our previous reports using archived blood samples [17]. Interestingly, the sensitivity of the three-protein signature was not affected by the mammographic density of participants or other clinicopathological factors. As low sensitivity has been a major hurdle for using traditional serum markers of breast cancer in the clinic [21], the present findings suggest that this proteomic-based assay may be a useful tool for breast cancer diagnosis in women with dense breasts. Indeed, our previous study using large-scale biorepository samples suggested that combining the three-protein signature with mammography can improve diagnostic accuracy in women with dense breasts [22].

Our study carries several limitations. First, no follow-up data were available for the 191 participants who showed no suspicious lesions in breast imaging. Second, detailed pathologic information on the 61 (19.5%) participants with breast cancer was not available. While the stage or subtype did not influence the accuracy of the three-protein signature in the remaining 252 patients, it would be important to prospectively address this issue in a larger prospective cohort since the current trial was not designed for determination of the stage- or subtype-specific accuracy of the assay. Third, unlike APOC1 and CA1, which showed significant differences between benign and cancer patients, CHL1 levels failed to show such differences in the current trial. Further efforts to discover additional biomarker candidates are warranted to improve the diagnostic accuracy of the assay. Finally, the demographic features and co-morbidities showed differences between patients with breast malignancy and participants without suspicious lesions or with benign breast lesions (Additional file 3: Table 2). This finding also suggest that it require further data form a large cohort to verify the value of three-protein signature.

Screening tests are subject to high specificity to ensure a low rate of false positives, minimizing unnecessary diagnostic workups. The results of this trial revealed that the three-protein signature showed a moderate degree of specificity (60.9 (52.7–69.0)%) for 451 women with suspicious lesions and all participants (66.9 (61.8–72.0)%). A larger clinical trial with asymptomatic women undergoing breast cancer screening is required to determine the recall rates of the three-protein signature. Considering the benefits of blood-based biomarkers in breast cancer screening [7], our data warrant further efforts to validate the value of this proteomic assay. This prospective study showed that a three-protein signature based on MRM-MS is a sensitive tool for breast cancer diagnosis. A future large clinical trial is warranted to determine its value in breast cancer screening.