Background

Mammographic density (MD) reflects the composition of fibro-glandular tissue of the breast, as visualised on a mammogram. MD is an independent predictor of breast cancer risk, although the strength of its association varies across studies, due in part to the different methods of MD assessment and different partitioning thresholds used to define high and low MD [1,2,3]. Efforts to make measuring MD less reader-dependent and more reproducible have resulted in the development of a number of fully-automated methods for measuring MD [4,5,6], including both volumetric and area-based assessments methods.

In women of European ancestry Volumetric assessments of density have been shown to be a stronger predictor of risk compared to area-based density [7, 8]. Volumetric methods are less influenced by compression force and are more sensitive to breast thickness, and may more accurately estimate the amount of fibroglandular tissue for women with larger breasts [9,10,11]. However Asian women have smaller and denser breasts compared to women of European ancestry, and the performance of area and volume-based densities have hitherto not been compared in the same study.

In this study, we aim to determine and compare the effects of two automated MD measures, namely STRATUS measurements of area densities, and Volpara measurements of volumetric densities, on breast cancer risk in the Asian population, and to explore the potential variation by menopausal status.

Methods

Study participants, data collection and eligibility criteria

Cases comprised of patients who were recruited sequentially into the Malaysian Breast Cancer Genetics (MyBrCa) study from Subang Jaya Medical Centre (SJMC), between 2012 and 2020, and University Malaya Medical Centre (UMMC), between 2003 and 2020. Controls were women between 40 and 74 years old with no prior history of breast cancer that were recruited into the Malaysian Mammography Study (MyMammo) from the same participating hospitals as cases. The study details have been previously published [12]. All participants answered a detailed questionnaire which included information on lifestyle and reproductive risk factors, socio-demographic factors, and family history and provided blood sample for genetic testing.

Bilateral full-field digital mammograms (FFDMs) for cases were retrieved from the medical image storage servers retrospectively starting in June 2018 and for controls were collected at recruitment. The bilateral cranio-caudal (CC) and medio-lateral oblique (MLO) views for both raw and processed images, where possible, were retrieved. Cases were excluded from the research study if: (a) digital mammograms were conducted more than 12 months prior to cancer diagnosis, (b) only mammograms ipsilateral to the breast cancer were available. Controls were excluded from the research study if no mammograms were available for analysis. All participants included in the study were of self-declared Chinese, Malay or Indian ethnicity and had information on age at mammography, body mass index (BMI) and/or menopausal status. In total, 10% of cases and 69% of controls were available and eligible for matching.

Matching

For the case–control analysis of mammographic density and breast cancer risk, as raw and processed images were not available for all women, cases and controls in the full dataset were matched for age (within 5 years) and ethnicity (exact) separately for the analyses of STRATUS, which measures processed images, and Volpara, which measures raw images. Age-matching was performed in each ethnic group using a 1:4 case to control ratio nearest neighbour propensity score matching using the matchit package in R. For the STRATUS study, a total of 488 cases and 1796 controls were included in the matched case–control study, of which 82.2% of cases were matched to four controls, 9% to three controls, 3.5% to two controls and 5.3% to only one control. For the Volpara study, a total of 436 cases and 1623 controls were included, of which 81.4% were matched to four controls, 12.4% to three controls, 3.2% to two controls and 3% to only one control. In total, 531 cases and 2297 controls were included for analysis of which data was available for both STRATUS and Volpara in 393 cases and 1122 controls.

Mammographic density (MD) assessments

Mammography was performed using machines from three different manufacturers; Hologic [Models: Lorad Selenia, Selenia Dimensions and Tomo Selenia Dimensions], General Electric (GE) Senographe Essential, and Siemens Mammomat Novation. Area-based MD was determined using STRATUS, a fully automated machine-learning method for assessing MD based on image features assessed using thresholding methods, by the developers of STRATUS at the Karolinska Institute, Sweden [4]. Volumetric MD was computed using Volpara Data Manager version 1.1.109 [5]. Six MD phenotypes were considered in this study: absolute dense area (DA) and volume (DV), percent dense area (PDA, i.e., absolute dense area/total breast area) and volume (PDV, i.e., absolute dense volume/total breast volume), and non-dense area (NDA) and volume (NDV). We also categorised MD according to the computer-generated BI-RADS scores (cBIRADS) generated by STRATUS, and the clinical classification score (Volpara Density Grades (VDG)).

Image laterality

Pearson’s correlation coefficients and previous studies showed that there were strong correlations between CC and MLO measurements [13, 14]. The Wilcoxon rank sum test was performed to compare the distribution of MD in the left and right mammograms in the control group. For the CC view mammograms, percent dense volume was higher in the right breast (Left median 9.1%; Right 9.5%, P = 0.035), whereas for the MLO view, three measures were higher in the left breast [dense volume (Left 57.6 cm3; Right 56.5 cm3, P = 0.006), non-dense volume (Left 591.8 cm3; Right 561.7 cm3, P = 0.011) and total breast volume (Left 653.4 cm3; Right 628.0 cm3, P = 0.006)]. As there was less variation in MD measurements for the CC view, MD measurements from the CC view mammograms of unaffected breasts of cases were used in all analyses, and matched by laterality in the controls.

Statistical analyses

Box-Cox transformation was used to transform MD phenotypes into approximately normal distribution.

Confounder selection

Covariates that were assessed include socio-demographic factors, known lifestyle and reproductive risk factors of breast cancer, mammogram machine and compressed breast thickness. A covariate was considered confounding if: (a) it was significantly associated with MD in controls at P < 0.05, after accounting for other associated variables; (b) it was significantly associated with breast cancer risk at P < 0.05, after accounting for other associated variables; and (c) it had a magnitude of confounding that was greater than 5%.

Age at first full term pregnancy, total number of live births and breast feeding were only evaluated among parous women. Parous women were defined as those who have had at least one full-term pregnancy. The use of hormone replacement therapy (HRT) was only evaluated among postmenopausal women. Postmenopausal women were defined as women who have not had their periods for at least 12 months prior to their enrolment into the study or if they self-reported that they were postmenopausal at enrolment.

Association of mammographic density (MD) phenotypes and breast cancer risk

We assessed the association between mammographic density phenotypes (treated either as continuous or categorical variables) and breast cancer risk using conditional logistic regression, adjusting for selected confounders. When MD was treated as a continuous variable, odds ratios per-adjusted standard deviations (OPERA [15]) was calculated to allow comparison across MD phenotypes. When MD was treated as a categorical variable, MD phenotypes were categorised into four equal quartiles based on the MD distribution in controls, using the first quartile as the reference group. We also categorised MD according to the computer-generated BI-RADS scores, cBIRADS, generated by STRATUS, and Volpara Density Grades (VDG), which is the classification used to report density, measured by Volpara, in the clinic. Weighted kappa, using quadratic weighting, was calculated to assess the concordance between quantiles of STRATUS and Volpara measurements.

The association between MD phenotypes and breast cancer risk by menopausal status were conducted using unconditional regression. Z-tests were conducted to determine whether the odds ratios for mammographic densities and breast cancer risk were different for premenopausal and postmenopausal women.

All statistical analyses were performed with R version 3.6.1.

Results

Characteristics of study participants

Participant selection and descriptive statistics of cases and controls are presented in Fig. 1 and Table 1. The majority of controls within the STRATUS study (67.7%) were recruited from the private tertiary hospital (SJMC), while approximately half of the controls within the Volpara study were from the government-funded teaching hospital (UMMC). Most of the mammograms were obtained from the Hologic machine.

Fig. 1
figure 1

Flowchart illustrating a the selection of cases and controls for mammographic density (MD) assessment by STRATUS and Volpara, and b participants included in the different analyses performed including the analysis of (1) the association of covariates with MD, (2) the association of covariates with breast cancer risk, and (3) the association of MD and breast cancer risk

Table 1 Characteristics of controls and cases with STRATUS and Volpara measurements available (unmatched dataset)

Confounders

We identified potential confounders as covariates with P value < 0.05 with both MD phenotypes and breast cancer risk in the multivariable models, and these were breastfeeding for absolute dense area and dense volume, alcohol intake for non-dense volume and breast thickness for all MD phenotypes except dense area (Additional file 1: Table S1). Additionally, although not significant in our study, menopausal status and parity were included as potential confounders as these variables have consistently been reported to be associated with both MD and breast cancer risk in the literature. Of the list of potential confounders, only those resulting in > 5% change in the magnitude of MD association with risk were retained in the model for adjustment. The final list of variables included in the association analyses for adjustment can be found Additional file 1: Table S2.

Association of mammographic density (MD) phenotypes and breast cancer risk

All women

When treated as a continuous variable, both dense area and dense volume were significantly associated with breast cancer risk, the odds per adjusted standard deviation (OPERA) and the corresponding 95% CI were 1.19 (1.08–1.32) and 1.14 (1.02–1.28), respectively (Fig. 2). However, when categorised into quartiles, only the highest quartile of dense area was significantly associated with risk (odds ratio (95% CI) was 1.44 (1.03–1.21)). This association was no longer significant in analyses limited to overlapping samples between the STRATUS and Volpara studies (1.26, 95% CI: 0.83-1.91) (Fig. 3).

Fig. 2
figure 2

Associations of a STRATUS area mammographic densities and b Volpara volumetric mammographic densities, with breast cancer risk. *Adjusted for relevant confounding factors. Z-tests comparing estimated regression coefficients between the STRATUS and Volpara studies

Fig. 3
figure 3

Associations of a STRATUS area mammographic densities, b Volpara volumetric mammographic densities, with breast cancer risk in the dataset of 393 cases and 1122 controls included in both STRATUS and Volpara studies. *Adjusted for relevant confounding factors. Z-tests comparing estimated regression coefficients between the STRATUS and Volpara studies

For percent density, OPERA for percent dense area was significant (1.23, 95% CI 1.10–1.37) while the OPERA for percent dense volume was not significant (1.08, 95% CI 0.97–1.20). However, quartiles analyses of percent density showed significant association for both MD measurement methods, with risk estimates increased consistently across quartiles. The OR of highest versus lowest quartile was 2.06 (95% CI 1.42–2.99) for percent dense area and 2.21 (95% CI 1.44–3.39) for percent dense volume (Fig. 2). There was no significant difference between the ORs of percent dense area and percent dense volume (p-value of Z-test < 0.05). Similar results were observed for analyses limited to overlapping samples between the STRATUS and Volpara studies (Fig. 3).

Non-dense area was significantly associated with a lower breast cancer risk (OPERA 0.85, 95% CI 0.76–0.95). Risk estimates decreased consistently, from OR 0.62 (95% CI: 0.45-0.86) to 0.50 (95% CI: 0.35-0.71) and 0.30 (95% CI: 0.19-0.47), comparing the first quartile of non-dense area with the second, third and fourth quartile, respectively. By contrast, the OPERA estimate for non-dense volume was not significant (1.06, 95% CI: 0.95-1.19), although the pattern of association for quartiles was similar to that observed for non-dense area i.e., OR 0.87 (95% CI: 0.61-1.24), OR 0.66 (95% CI: 0.42-1.01) and OR 0.54 (95% CI: 0.31-0.94) for the second, third and fourth quartiles respectively, the corresponding strengths of association were weaker (Figs. 2 and 3).

Where densities were categorised according to area-based cBIRADS and volume-based VDG, women in cBIRADS 3 and cBIRADS 4 were associated with a 2.5-fold (P = 0.003) and 2.9-fold (P < 0.001) greater odds of disease, respectively (Fig. 2a). By contrast, VDG was not associated with breast cancer risk (Fig. 2b). The same pattern was observed for analyses limited to overlapping samples between the STRATUS and Volpara studies (Fig. 3).

There are no appreciable differences between the results generated using the CC and MLO view measurements (Additional file 1: Table S2).

The agreement between the STRATUS and Volpara measurements for classifying women into mammographic density quartiles was fair for absolute density (Weighted Kappa, κw = 0.28) and percent density (0.35), and moderate for non-dense area and volume (0.50). Figure 4 illustrates the magnitude of concordance for the classification of area and volumetric MD quartiles. Although there is some agreement between STRATUS and Volpara, there are instances of discordance where individuals shift to adjacent quartiles or even skip one quartile altogether.

Fig. 4 
figure 4

Concordance between the classification of a absolute dense area/volume, b percent dense area/volume and c non-dense area/volume into quartiles using STRATUS and Volpara measurements. Note: Agreement between the STRATUS and Volpara measurements for classifying women into mammographic density quartiles was calculated using Cohen’s weighted kappa. Weighted Kappa, κw values for dense area/volume, percent dense area/volume, and non-dense area/volume were 0.28, 0.35 and 0.50, respectively

Analyses by menopausal status

Figures 5 and 6 show the association of MD with breast cancer risk for premenopausal and postmenopausal women, respectively. For dense area and dense volume, both OPERAs and quantile analyses were not significantly associated with breast cancer risk in premenopausal women. By contrast, consistent with the all-women analysis, OPERA for both dense area (OR 1.23, 95% CI: 1.07–1.41) and dense volume (OR 1.30, 95% CI: 1.10–1.54) were significant in postmenopausal women, but the corresponding quartile analyses did not show significant associations for dense area and was only significant for the association of the highest dense volume quartile and risk when compared to the lowest dense volume quartile among postmenopausal women (OR 1.65, 95% CI: 1.01-2.71).

Fig. 5
figure 5

Associations of a STRATUS area mammographic densities and b Volpara volumetric mammographic densities, with breast cancer risk in premenopausal women. *Adjusted for relevant confounding factors. Reference category is cBIRADS 1 (< 2%) + cBIRADS 2 (2− < 17%). ‡ Reference category is VDG 1 (< 4.8%) + VDG 2 (4.8—< 8.0%). §Z-tests comparing estimated regression coefficients between the STRATUS and Volpara studies

Fig. 6
figure 6

Associations of a STRATUS area mammographic densities, b Volpara volumetric mammographic densities, with breast cancer risk in postmenopausal women and, c comparison of regression coefficients for premenopausal and postmenopausal women. *Adjusted for relevant confounding factors. Z-tests comparing estimated regression coefficients between the STRATUS and Volpara studies. Z-tests comparing estimated regression coefficients between premenopausal and postmenopausal women

For percent density, OPERAs for percent dense area and percent dense volume were significant in postmenopausal women but not premenopausal women. The OPERAs were 1.23 (95% CI 1.07–1.41) and 1.16 (95% CI 1.01–1.34) for precent dense area and percent dense volume, respectively, in postmenopausal women. The corresponding estimates in premenopausal women were 1.16 (95% CI 0.99–1.35) and 1.01 (95% CI 0.86–1.19). The observed significant association of highest versus lowest quartile in all women analysis was replicated in postmenopausal women (OR 1.89, 95% CI: 1.22-2.96 for percent dense area; OR 1.96, 95% CI: 1.19-3.27 for percent dense volume), but not premenopausal women (OR 1.59, 95% CI: 0.95-2.67 for percent dense area; OR 1.24, 95% CI: 0.70-2.22 for percent dense volume).

For non-dense MD phenotype in both premenopausal and postmenopausal women, OPERAs were not significant in both non-dense area (0.88, 95% CI: 0.75-1.02 and 0.87, 95% CI: 0.75-1.01 for premenopausal and postmenopausal women, respectively) and non-dense volume (1.11, 95% CI: 0.95-1.31 and 1.00, 95% CI: 0.85-1.18, respectively) measurements. However, the quartile analyses for both non-dense area and non-dense volume were significant in premenopausal women, comparing the highest and lowest non-dense area quartiles (OR 0.37, 95% CI: 0.20-0.66) and non-dense volume quartiles (OR 0.31, 95% CI: 0.14-0.67), but not postmenopausal women (OR 0.60, 95% CI: 0.36-1.01 for non-dense area; OR 0.88, 95% CI: 0.51-1.54 for non-dense volume).

Discussion

In this study of women of Asian-ancestry, we found that percent mammographic density is a strong breast cancer risk factor, with similar magnitudes of association for both area and volumetric mammographic density measures. Comparing women in the lowest quartiles, women with percent density in the highest quartiles had approximately two-fold higher odds of breast cancer. The observed association was however significant only in postmenopausal women but not in premenopausal women.

The two-fold risk estimates reported in this study are consistent with those found in a meta-analysis of Japanese, Korean and Singaporean women comprising of one cohort study and five case–control studies, which reported a summary effect size of 2.2 (95% CI 1.5–3.2) [16], as well as with a large meta-analysis of European women using the BI-RADS density four-category classification [3]. The corresponding odds ratio per adjusted standard deviation (OPERA) was similar to a Korean study of 213 cases and 630 controls [17], but lower than those previously reported in women of European ancestry. A study of Australian women reported OPERA of 1.52 (95% CI 1.34–1.73) for percent dense area, compared to 1.23 (95% CI 1.10–1.37) this study, suggesting potential ethnic differences in MD-risk associations [18].

Our findings of lack of MD-risk association in premenopausal women align with similar-sized studies in other Asian populations [19,20,21,22]. For instance, a multicentre Japanese study (530 cases, 1043 controls) found a near three-fold increase in breast cancer odds (OR 2.9, 95% CI 1.1–7.2) among postmenopausal women with extremely dense breast (>75% glandular tissue), while no significant association was observed in premenopausal women [19]. Similarly, another study in Japanese women (146 cases, 659 controls) revealed a four-fold higher odds of breast cancer among postmenopausal women with > 75% percent densities, with no significant association in premenopausal women [21]. However, it is important to note that a recent large prospective Korean study comprising of ~ 65,000 breast cancer cases reported that breast density is associated with breast cancer risk in both premenopausal (OR 2.4, 95% CI 2.2–2.5) and postmenopausal (OR 2.9, 95% CI 2.8–3.0) women, suggesting that larger sample sizes in premenopausal women are required to detect a significant association with breast cancer risk [20].

Our study did not yield conclusive evidence regarding the association of absolute MD measures with breast cancer risk. While the odds ratios for continuous dense area and dense volume were significant at a nominal level (1.19, 95% CI 1.08–1.32 and 1.14, 95% CI 1.02–1.28, respectively), the results from quartile analysis did not support the significant associations. We also observed a stronger inverse association with non-dense area compared to non-dense volume that was significant in our analyses of all women and premenopausal women, but not that of postmenopausal women. This inverse association is consistent with previous studies in women of European ancestry reporting a protective effect of having greater amounts of fat or non-dense tissue in the breast [23].

In summary, our study confirms the significance of MD as a robust breast cancer risk factor in Asian-ancestry women, with percent density showing consistent associations across area and volumetric-based measures. However, the lack of MD-risk association in premenopausal women underscores the need for further investigation in larger datasets. While our findings contribute to the understanding of MD and breast cancer risk, the inconclusive evidence regarding absolute MD measures prompts a critical evaluation of their utility in risk prediction models for this population.

This study had several limitations. First, more than 90% of the cases were recruited from one recruitment centre, making it impossible to match cases and controls based on centre. However, we adjusted our analyses for this factor. Second, some covariates have missingness rates greater than 10%, which may explain some of the unexpected results (e.g. the protective effect observed for HRT among postmenopausal women and alcohol consumption). Third, the healthy controls were women attending an opportunistic screening mammography programme and may be enriched for a family history of breast cancer. This is likely to be the reason family history of breast cancer is not associated with breast cancer risk in this study. Finally, only the mammograms performed at the time of cancer detection (or close to cancer detection) were available for the cases. Given that densities measured from the unaffected contralateral breasts have been shown to be similarly associated with risk of disease [8], densities of the contralateral breasts were used as surrogate measurements.

Conclusions

In conclusion, our study underscores the significance of mammographic density (MD) as a strong predictor of breast cancer risk in women of Asian-ancestry, particularly in postmenopausal individuals. While percent density, for both area- and volume-based measures, consistently demonstrated significant association, absolute MD measures yielded inconclusive results. Future research should aim to elucidate ethnic-specific MD-risk associations and refine risk prediction models to incorporate the most predictive MD measures, thus enabling more targeted preventive strategies for women of Asian ancestry.