Causal relationships between breast cancer risk factors based on mammographic features

Ye, Zhoufeng; Nguyen, Tuong L.; Dite, Gillian S.; MacInnis, Robert J.; Schmidt, Daniel F.; Makalic, Enes; Al-Qershi, Osamah M.; Bui, Minh; Esser, Vivienne F. C.; Dowty, James G.; Trinh, Ho N.; Evans, Christopher F.; Tan, Maxine; Sung, Joohon; Jenkins, Mark A.; Giles, Graham G.; Southey, Melissa C.; Hopper, John L.; Li, Shuai

doi:10.1186/s13058-023-01733-1


Research
Open access
Published: 25 October 2023

Volume 25, article number 127, (2023)

Breast Cancer Research


Zhoufeng Ye¹,
Tuong L. Nguyen¹,
Gillian S. Dite^1,2,
Robert J. MacInnis^1,3,
Daniel F. Schmidt⁴,
Enes Makalic¹,
Osamah M. Al-Qershi¹,
Minh Bui¹,
Vivienne F. C. Esser¹,
James G. Dowty¹,
Ho N. Trinh¹,
Christopher F. Evans¹,
Maxine Tan^5,6,
Joohon Sung⁷,
Mark A. Jenkins¹,
Graham G. Giles^1,3,8,
Melissa C. Southey^3,8,9,
John L. Hopper¹ &
…
Shuai Li^1,8,10,11


Abstract

Background

Mammogram risk scores based on texture and density defined by different brightness thresholds are associated with breast cancer risk differently and could reveal distinct information about breast cancer risk. We aimed to investigate causal relationships between these intercorrelated mammogram risk scores to determine their relevance to breast cancer aetiology.

Methods

We used digitised mammograms for 371 monozygotic twin pairs, aged 40–70 years without a prior diagnosis of breast cancer at the time of mammography, from the Australian Mammographic Density Twins and Sisters Study. We generated normalised, age-adjusted, and standardised risk scores based on textures using the Cirrus algorithm and on three spatially independent dense areas defined by increasing brightness threshold: light areas, bright areas, and brightest areas. Causal inference was made using the Inference about Causation from Examination of FAmilial CONfounding (ICE FALCON) method.

Results

The mammogram risk scores were correlated within twin pairs and with each other (r = 0.22–0.81; all P < 0.005). We estimated that 28–92% of the associations between the risk scores could be attributed to causal relationships between the scores, with the rest attributed to familial confounders shared by the scores. There was consistent evidence for positive causal effects: of Cirrus, light areas, and bright areas on the brightest areas (accounting for 34%, 55%, and 85% of the associations, respectively); and of light areas and bright areas on Cirrus (accounting for 37% and 28%, respectively).

Conclusions

In a mammogram, the lighter (less dense) areas have a causal effect on the brightest (highly dense) areas, including through a causal pathway via textural features. These causal relationships help us gain insight into the relative aetiological importance of different mammographic features in breast cancer. For example, our findings are consistent with the brightest areas being more aetiologically important than lighter areas for screen-detected breast cancer and, conversely, with light areas being more aetiologically important for interval breast cancer. Additionally, specific textural features capture breast cancer risk information that is aetiologically independent of dense areas. These findings highlight the utility of ICE FALCON and family data in decomposing the associations between intercorrelated disease biomarkers into distinct biological pathways.


Introduction

Mammographic density refers to the areas of a two-dimensional mammogram that appear white. It is conventionally defined using a pixel brightness threshold that differentiates light or bright regions from dark regions, and can be measured using the semi-automated CUMULUS software, developed by Yaffe et al. in the 1990s [1]. We call this measure Cumulus. After transformation to normality and adjustment for age and body mass index (BMI), the Cumulus measure is an established risk factor for breast cancer [2].

We found that two additional mammographic density measures, defined by successively higher pixel brightness thresholds and called Altocumulus and Cirrocumulus, respectively (Fig. 1), can better predict risk of screen-detected breast cancer when similarly transformed and adjusted [3]. Moreover, when the measures were fitted together, the breast cancer risk association with the Cumulus measure was attenuated far more than the associations with Altocumulus and Cirrocumulus. Interval breast cancer risk, however, was better predicted by Cumulus [4, 5]. It is important to note that the dense areas captured by these three measures overlap: Cirrocumulus is contained within Altocumulus, which, in turn, is contained within Cumulus (Fig. 1). Therefore, the mammographic image components defined by increasing pixel brightness thresholds have the potential to reveal different information about breast cancer risk. This is also supported by recent findings from the Women's Environment, Cancer, and Radiation Epidemiology (WECARE) study that the dense areas defined by Cirrocumulus outperform two other dense areas, i.e. the areas between the Cirrocumulus and Altocumulus thresholds and between the Altocumulus and Cumulus thresholds, in predicting contralateral breast cancer risk [6]; see the “Discussion” section for details.

As well as mammographic density, the breast parenchyma could possess other mammographic patterns, such as textures, that predict breast cancer risk [7]. For example, we developed an automated textural feature-based mammogram risk score, called Cirrus [8]. We found that Cirrus can improve risk prediction for interval cancers when fitted with Cumulus, as well as improve risk prediction for screen-detected and young-age-at-diagnosis breast cancer when fitted with Cirrocumulus [9, 10]. The associations between Cirrus and these three types of breast cancer remained after fitting the density measures. It is, therefore, possible that Cirrus contains independent and intrinsic information about breast cancer risk, especially given that pixel counting above a certain brightness level was not used as a criterion in its creation.

The different mammogram risk scores above are correlated with one another to varying degrees [4, 5, 10, 11]. One potential explanation is the spatial overlap of the density measures. Other studies have also identified associations between Cumulus and textural features [7, 12, 13] but the reasons for this have yet to be addressed.

We have consistently found that when the different mammogram risk scores are fitted together, their breast cancer risk gradients do not necessarily attenuate towards the null to the same extent as would be expected if their associations with one another were due solely to confounding [9, 10]. In particular, for screen-detected and young-age-at-diagnosis breast cancer, the Cumulus association became almost null after being fitted with Cirrus or Cirrocumulus.

There are also potentially causal relationships between Cumulus and the other risk scores. Whether the associations between risk scores are causal, and if so in which direction, is unknown; they could also be due to non-causal effects such as confounding and conditioning on colliders [14]. To address the issues of causation, we took a novel approach.

Twin and family studies have shown that the density-based risk scores are correlated between relatives; i.e. they are familial [15]. This means that there could be genetic or non-genetic factors shared by relatives such as twin pairs or sisters that determine these risk scores. Irrespective of the source of such familial determinants, their existence means that we can apply a causal inference method based on data from related pairs called Inference about Causation from Examination of FAmiliaL CONfounding (ICE FALCON) [16].

In this study, we applied ICE FALCON to investigate the causal relationships between the texture-based mammogram risk score (Cirrus) and three non-overlapping density-related mammogram risk scores (created from Cumulus, Altocumulus, and Cirrocumulus), in order to provide evidence for which risk scores are more relevant to breast cancer aetiology.

Materials and methods

Study sample

We used data from the Australian Mammographic Density Twins and Sisters Study [17], which included female twin pairs and their sisters aged 40–70 years and without a prior diagnosis of breast cancer at the time of mammography. Information on the participants was collected by questionnaire, and permission to access mammograms was obtained [18]. The current study involved 371 monozygotic twin pairs with complete epidemiological information and the mammographic measurements required for analysis. No individual was identified as being at high risk of breast cancer, either at the time of mammography or after assessment of her mammograms.

Questionnaire

Demographic information, anthropometric measurements, menstrual and reproductive history, lifestyle factors, and personal and family history of breast cancer were collected via telephone-administered questionnaires between 2004 and 2008. Zygosity was determined from genome-wide association data [19]. As there were differences between age at questionnaire survey and age at mammography (on average 1.68 years, with 177 participants having a difference of more than 3 years), menopausal status and BMI were updated to those at age at mammography as follows. For menopausal status, if a participant was postmenopausal at questionnaire survey and her age at menopause was later than her age at mammography, her status was changed to premenopausal. BMI at mammography was predicted from BMI at questionnaire survey using the method of Haby et al. [20]. BMI at questionnaire survey was treated as the dependent variable in a regression model which included birth cohort effects and 5-year group coefficients; the intercept of the regression was then used as the BMI at age of mammography.

Mammogram-based measures

Mammograms were retrieved from BreastScreen Australia services (80%), clinics (5%), and from participants themselves (15%) and digitised using the Lumysis 85 scanner at the Australian Mammographic Density Research Facility. For each woman, only the craniocaudal-view mammogram from the right breast taken closest to the survey was used in this study.

The dense areas were measured using a computer-assisted semi-automated thresholding technique and the CUMULUS software based on a sliding scale ranging from 0 to 4095 pixels. Four observers were trained to measure mammographic density independently, as previously described [4].

A conventional pixel threshold was first used to identify dense areas with grey levels appearing at least mammographically light (the areas of which we call Cumulus). The pixel brightness threshold was then increased to identify the denser areas (the areas of which we call Altocumulus), and increased further to identify the densest areas (the areas of which we call Cirrocumulus). Reproducibility was assessed by conducting the measurements in sets of 100 mammograms, with 10% of the samples in each set measured again. The intraclass correlation coefficients were 0.98, 0.99, and 0.93 for Cumulus, Altocumulus, and Cirrocumulus, respectively. A total of 200 images were measured for Cumulus, Altocumulus, and Cirrocumulus, with the correlations between readers being 0.95, 0.89, and 0.85, respectively. Details of these three density measurements can be found elsewhere [3, 4, 17].

We created two new non-overlapping measures: light areas, obtained by subtracting Altocumulus from Cumulus, and bright areas, obtained by subtracting Cirrocumulus from Altocumulus. Along with a measure of the brightest areas (Cirrocumulus), their relationships, in terms of relative brightness, are shown in Fig. 1.
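The subtraction that yields the three spatially independent areas can be illustrated with a minimal Python sketch. The function name and the example areas are hypothetical and are not taken from the study data; the study itself reports using R.

```python
def non_overlapping_areas(cumulus, altocumulus, cirrocumulus):
    """Split the three nested dense areas into non-overlapping parts.

    Assumes the nesting described in the text:
    Cirrocumulus lies within Altocumulus, which lies within Cumulus.
    """
    if not (cumulus >= altocumulus >= cirrocumulus >= 0):
        raise ValueError("expected Cumulus >= Altocumulus >= Cirrocumulus >= 0")
    return {
        "light": cumulus - altocumulus,        # above the lowest threshold only
        "bright": altocumulus - cirrocumulus,  # between the two higher thresholds
        "brightest": cirrocumulus,             # above the highest threshold
    }


# Hypothetical areas in arbitrary units, for illustration only.
areas = non_overlapping_areas(cumulus=30.0, altocumulus=12.0, cirrocumulus=2.5)
```

By construction the three parts sum back to the Cumulus area, which is why they can be treated as a partition of the conventional dense area.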

Cirrus is an agnostic algorithm developed using deep learning techniques applied to 20 textural features extracted from 46,158 analogue, craniocaudal-view, mammograms [8]. The algorithm was applied to the mammograms of the study sample to produce the Cirrus measures.

In this study, we conducted analyses of Cirrus and the three spatially independent density measures including light, bright, and brightest areas (Cirrocumulus). Table 1 shows the summary characteristics of unadjusted measures.

Table 1 Characteristics of mammographic measures and covariates of the monozygotic twins


Statistical methods

All mammographic measures were first transformed using a Box–Cox power transformation [21] to have an approximately normal distribution. As a result, $(\text{Cirrus} - 2907)^{2}$, $(\text{brightest areas (Cirrocumulus)})^{1/5}$, and the cube root of light areas and of bright areas were used in the analyses.

Given that age at mammography is negatively associated with the mammographic density measures being studied as putative risk factors for breast cancer, and that breast cancer risk increases with age, all the measures were adjusted for age at mammography. This adjustment explained 8–11% of the variances in the studied measures, except for the light areas, for which the proportion of the variance explained was 2%. The variance explained by other breast cancer risk factors combined, including age at menarche, menopausal status, BMI, ever being pregnant, number of live births, benign breast disease history, and breast cancer family history, was between 4 and 7% (Additional file 1: Table S1).

The age-adjusted residuals were all standardised to have mean = 0 and standard deviation (SD) = 1. These standardised residuals are the mammogram risk scores used in the subsequent analyses. Correlations between these risk scores, within twin pair and within a person, respectively, were estimated using Pearson’s correlation coefficient.
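The transform, adjust, and standardise steps can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: age adjustment is assumed to be a simple linear regression on age, the function names are hypothetical, and the ages and areas are made-up values.

```python
import math
import statistics


def box_cox(values, lam):
    """Box-Cox power transform of positive values; lam = 0 gives the log."""
    if lam == 0:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def age_adjusted_score(values, ages):
    """Residuals from a linear regression on age, scaled to mean 0 and SD 1."""
    mean_x = statistics.fmean(ages)
    mean_y = statistics.fmean(values)
    sxx = sum((x - mean_x) ** 2 for x in ages)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(ages, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residuals = [y - (intercept + slope * x) for x, y in zip(ages, values)]
    mean_r = statistics.fmean(residuals)  # zero up to rounding error
    sd_r = statistics.stdev(residuals)
    return [(r - mean_r) / sd_r for r in residuals]


# Made-up data: age at mammography and light areas (arbitrary units).
ages = [42, 48, 55, 61, 67, 70]
light = [35.0, 28.0, 30.0, 18.0, 15.0, 9.0]

# lam = 1/3 is a linear function of the cube root, so the standardised
# score is the same as one built from the cube root itself.
scores = age_adjusted_score(box_cox(light, 1.0 / 3.0), ages)
```

The resulting scores are uncorrelated with age in the sample, so any remaining correlation between two scores cannot be explained by a linear age effect.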

The correlations between the risk scores were decomposed into different sources, including confounding and causal effects originating from various pathways, using the Inference about Causation from Examination of FAmilial CONfounding (ICE FALCON) method [16]. ICE FALCON uses data for pairs of relatives, with the relative’s exposure acting as a proxy instrumental variable for a person’s exposure. This method is analogous to Mendelian randomisation but does not use genetic variants as a presumed instrumental variable and does not rely on strong assumptions. ICE FALCON can make inference about causation even when the exposure and outcome are associated due to familial confounding (i.e. confounders, both known and unknown, that are shared by the exposure and the outcome and by the relatives). The ICE FALCON method has been applied in multiple fields to assess evidence for causality [16, 22–28].

Briefly, one risk score was assigned as the outcome Y variable and another as the predictor variable X, and the Y value of a twin was regressed against the X variable of herself and/or of her co-twin (Additional file 1: Figure S1). To assess the evidence for reverse causation, the assigning of X and Y was reversed, i.e. the aforementioned predictor and outcome swapped their positions in the refitted regression models. This was done for every pair of risk scores.

Given that the Y variables are correlated within twin pairs, regression was conducted using generalised estimating equations. This, in effect, conditioned the Y value of a twin on the Y value of her co-twin. Our model assumed that the risk score of a twin cannot have a causal effect on the same risk score of her co-twin but allowed for causation between the risk scores within a twin.

Three models were fitted to the twin pair data. First, a twin’s outcome variable was regressed on her own predictor variable to estimate the regression coefficient β_self (Model 1). Second, the twin’s outcome variable was regressed on her co-twin’s predictor variable to estimate the regression coefficient β_co-twin (Model 2). Third, the twin’s outcome variable was regressed on both her own and her co-twin’s predictor variables to estimate the conditional regression coefficients β^′_self and β^′_co-twin, respectively (Model 3). The use of the prime on the conditional regression coefficient estimates indicates that the Model 3 regression coefficients can be interpreted as the change in outcome for change in a given predictor while keeping the other predictor constant, which is not the same interpretation for the corresponding unconditional regression coefficients of Models 1 and 2.
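A minimal sketch of the three regressions is given below. Ordinary least squares is used here as a simple stand-in for the generalised estimating equations, so the estimates and standard errors would not in general match a GEE fit; the helper names are hypothetical. The toy data are constructed so that Y depends only on a twin's own X, with no noise and no familial confounding.

```python
def _centre(values):
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def ols_slopes(y, *predictors):
    """Least-squares slopes of y on one or two predictors, with an intercept."""
    yc = _centre(y)
    xs = [_centre(x) for x in predictors]
    if len(xs) == 1:
        return (_dot(xs[0], yc) / _dot(xs[0], xs[0]),)
    x1, x2 = xs
    s11, s22, s12 = _dot(x1, x1), _dot(x2, x2), _dot(x1, x2)
    s1y, s2y = _dot(x1, yc), _dot(x2, yc)
    det = s11 * s22 - s12 * s12
    return ((s22 * s1y - s12 * s2y) / det, (s11 * s2y - s12 * s1y) / det)


def three_models(pairs):
    """pairs: one ((x_a, y_a), (x_b, y_b)) tuple per twin pair."""
    x_self, x_co_twin, y = [], [], []
    for (x_a, y_a), (x_b, y_b) in pairs:
        # Each twin contributes one row: own X, co-twin's X, own Y.
        x_self += [x_a, x_b]
        x_co_twin += [x_b, x_a]
        y += [y_a, y_b]
    (beta_self,) = ols_slopes(y, x_self)                 # Model 1
    (beta_co_twin,) = ols_slopes(y, x_co_twin)           # Model 2
    b_self_p, b_co_p = ols_slopes(y, x_self, x_co_twin)  # Model 3
    return {
        "beta_self": beta_self,
        "beta_co_twin": beta_co_twin,
        "beta_self_prime": b_self_p,
        "beta_co_twin_prime": b_co_p,
    }


# Toy twin pairs with correlated X values and Y = 0.5 * own X exactly.
twin_x = [(1.0, 0.8), (-0.5, -0.2), (0.3, 0.6), (-1.2, -0.9), (0.4, -0.3)]
pairs = [((xa, 0.5 * xa), (xb, 0.5 * xb)) for xa, xb in twin_x]
fit = three_models(pairs)
```

In this purely causal toy setting the co-twin coefficient is non-zero in Model 2 only because the twins' X values are correlated, and it falls to zero in Model 3 while the self coefficient is unchanged.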

If the predictor has a causal effect on the outcome, β_co-twin would be different from zero, β^′_co-twin would be closer to zero than β_co-twin, and β^′_self would not be different from β_self. If there is familial confounding between the predictor and the outcome, β^′_self and β^′_co-twin would both be away from their corresponding coefficients β_self and β_co-twin to a similar extent. If there is a combination of familial confounding and causal effects, the results would be the combinations of the two scenarios. According to Wright’s path tracing rules [29], the proportion of an association which could be attributed to causality is as follows:

$$\Pr = \frac{\left(\text{Change in } \beta_{\text{co-twin}} - \dfrac{\text{Change in } \beta_{\text{self}}}{\beta_{\text{self}}} \times \beta_{\text{co-twin}}\right) / \rho}{\beta_{\text{self}}} \times 100\%$$

where $\text{Change in } \beta_{\text{co-twin}} = \beta_{\text{co-twin}} - \beta_{\text{co-twin}}^{\prime}$, $\text{Change in } \beta_{\text{self}} = \beta_{\text{self}} - \beta_{\text{self}}^{\prime}$, and $\rho$ = the within-twin correlation of the predictor. Note that the parameter estimates were extracted only from the models which suggest that the predictor causes the outcome, not those that suggest the outcome causes the predictor; see [16]. The causal effect = $\beta_{\text{self}} \times \Pr$, so the proportion of an association that could be attributed to familial confounding is 1 − Pr.
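As a worked illustration of the formula, the sketch below evaluates Pr for the coefficients reported in the Results for the light areas predicting the bright areas. The function name is hypothetical, and the value of ρ is an assumption: 0.59 is the upper end of the reported range of within-twin-pair correlations, not a value quoted in the text for this predictor.

```python
def proportion_causal(beta_self, beta_self_prime,
                      beta_co_twin, beta_co_twin_prime, rho):
    """Proportion of an association attributed to causation, as a fraction."""
    change_co_twin = beta_co_twin - beta_co_twin_prime
    change_self = beta_self - beta_self_prime
    numerator = change_co_twin - (change_self / beta_self) * beta_co_twin
    return (numerator / rho) / beta_self


pr = proportion_causal(
    beta_self=0.802, beta_self_prime=0.770,
    beta_co_twin=0.513, beta_co_twin_prime=0.070,
    rho=0.59,  # assumed within-twin-pair correlation of the predictor
)
causal_effect = 0.802 * pr      # beta_self x Pr
familial_confounding = 1 - pr   # remaining share of the association
```

Under this assumed ρ the proportion is roughly 0.89, in line with the 89% reported for this pair of risk scores.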

To investigate causal pathways between two risk scores that are not through other risk scores, we used the standardised residuals of the predictor and the outcome after adjusting for the third risk score in addition to age at mammography. The results from the analyses were used to produce a summary causal diagram. Causal relationship analyses were also conducted by the level of breast density to check whether the causal relationships differ by density levels. The sample was divided into two subgroups according to the median of 30.5% for Cumulus per cent mammographic density, with each group including 140 complete twin pairs. ICE FALCON analyses were conducted within each subgroup; see Supplemental material for more details. All the analyses were conducted using the R package [30]. P < 0.05 was considered to be nominally statistically significant.

Results

Correlations between the mammogram risk scores

Table 2 shows that the mammogram risk scores were substantially correlated with each other and within the twin pairs. The within-twin-pair correlations in the risk scores ranged from 0.22 to 0.59. The within-twin cross-trait correlations ranged from 0.28 to 0.81, and the cross-twin cross-trait correlations ranged from 0.28 to 0.61 (all P < 0.05).

Table 2 The within-twin within-trait correlations, the within-twin cross-trait correlation, and the cross-twin cross-trait correlations between the mammogram risk scores (95% confidence intervals in parentheses)


Causal inference for pairs of mammogram risk scores

Table 3 and Figure S2 (Additional file 1) show the ICE FALCON results and inference about proportions of familial confounding and causation; similar analyses using the Cumulus and Altocumulus measures can be found in Additional file 1: Table S2.

Table 3 The relationships between Cirrus and mammographic density measures analysed by using the ICE FALCON method


The bright areas and light areas

With the light areas as the predictor and the bright areas as the outcome, there was a decrease of 4% (P = 0.02) from β_self = 0.802 (P = 10^–161) in Model 1 to β^′_self = 0.770 (P = 10^–117) in Model 3 and a decrease of 86% (P = 10^–34) from β_co-twin = 0.513 (P = 10^–34) in Model 2 to β^′_co-twin = 0.070 (P = 0.01) in Model 3. These results were consistent with the light areas having a causal effect on the bright areas that accounted for 89% of their association, with marginal evidence for familial confounding.

With the bright areas as the predictor and the light areas as the outcome, there was a decrease of 6% (P = 10^–5) from β_self = 0.770 (P = 10^–210) in Model 1 to β^′_self = 0.727 (P = 10^–174) in Model 3 and a decrease of 64% (P = 10^–33) from β_co-twin = 0.404 (P = 10^–19) in Model 2 to β^′_co-twin = 0.144 (P = 10^–9) in Model 3. These results were consistent with the bright areas having a causal effect on the light areas that accounted for 58% of their association and familial confounding that accounted for 42%.

Therefore, the ICE FALCON results suggest the existence of familial confounding and bidirectional causality between the light areas and bright areas. To avoid confusion arising from the potential bidirectional causation, we conducted analyses separately for the light areas and bright areas.

Cirrus and the light areas

With Cirrus as the outcome and the light areas as the predictor, there was a decrease of 7% (P = 0.03) from β_self = 0.352 (P = 10^–17) in Model 1 to β^′_self = 0.328 (P = 10^–16) after adjusting for co-twin’s light areas in Model 3. There was a decrease of 51% (P = 10^–7) from β_co-twin = 0.176 (P = 10^–5) in Model 2 to β^′_co-twin = 0.087 (P = 0.02) after adjusting for the twin’s light areas in Model 3. These results are consistent with there being a combination of familial confounding that accounted for 63% of the association between the two risk scores and the light areas having a causal effect on Cirrus that accounted for 37% of their association.

We then reversed the predictor and outcome roles by assigning the light areas to be the outcome and Cirrus as the predictor. There was an increase from β_self = 0.285 (P = 10^–15) in Model 1 to β^′_self = 0.303 (P = 10^–20) in Model 3 (P = 0.02). There was also an increase in the co-twin’s coefficient from β_co-twin = 0.091 (P = 0.006) in Model 2 to β^′_co-twin = 0.135 (P = 10^–5) in Model 3 (P = 0.003). These results were consistent with the findings above that the association was due to a combination of familial confounding and the light areas having a causal effect on Cirrus.

Cirrus and the bright areas

With Cirrus as the outcome and the bright areas as the predictor, there was a decrease of 8% (P = 0.002) from β_self = 0.398 (P = 10^–27) in Model 1 to β^′_self = 0.367 (P = 10^–21) in Model 3. There was a decrease of 35% (P = 10^–4) from β_co-twin = 0.215 (P = 10^–8) in Model 2 to β^′_co-twin = 0.139 (P = 10^–4) in Model 3. These results were consistent with there being a combination of familial confounding that accounted for 72% of the association between the two risk scores, and the bright areas having a causal effect on Cirrus that accounted for 28% of their association.

When we reversed the predictor and outcome roles by assigning the bright areas to be the outcome and Cirrus as the predictor, β_self was not significantly different from β^′_self after adjusting for co-twin’s Cirrus (P = 0.5), and a similar statement applies to the lack of a difference between β_co-twin and β^′_co-twin (P = 0.9). These results were not consistent with Cirrus having a causal effect on the bright areas.

Cirrus and the brightest areas (Cirrocumulus)

With brightest areas (Cirrocumulus) as the outcome and Cirrus as the predictor, there was a decrease of 7% (P = 0.004) from β_self = 0.389 (P = 10^–23) in Model 1 to β^′_self = 0.360 (P = 10^–19) in Model 3, while there was a decrease of 40% (P = 10^–5) from β_co-twin = 0.191 (P = 10^–6) in Model 2 to β^′_co-twin = 0.114 (P = 0.002) in Model 3. These results were consistent with Cirrus having a causal effect on the brightest areas (Cirrocumulus), which accounted for 34% of their association, as well as there being familial confounding which accounted for 66%.

When we reversed the predictor and outcome roles by assigning brightest areas (Cirrocumulus) to be the predictor and Cirrus as the outcome, there was no difference between β_self and β^′_self (P = 0.5), while there was an increase from β_co-twin = 0.138 (P = 10^–5) in Model 2 to β^′_co-twin = 0.166 (P = 10^–7) in Model 3 that was not nominally significant (P = 0.08). These results were not consistent with the brightest areas (Cirrocumulus) having a causal effect on Cirrus.

The light areas and brightest areas (Cirrocumulus)

With the light areas as the predictor and the brightest areas (Cirrocumulus) as the outcome, there was a decrease of 10% (P = 0.008) from β_self = 0.470 (P = 10^–38) in Model 1 to β^′_self = 0.424 (P = 10^–25) in Model 3 and a decrease of 64% (P = 10^–13) from β_co-twin = 0.284 (P = 10^–12) in Model 2 to β^′_co-twin = 0.103 (P = 0.005) in Model 3. These results were consistent with the light areas having a causal effect on the brightest areas (Cirrocumulus) that accounted for 55% of their association and familial confounding which accounted for 45%.

With the brightest areas (Cirrocumulus) as the predictor and the light areas as the outcome, there was an increase from β_self = 0.347 (P = 10^–23) in Model 1 to β^′_self = 0.395 (P = 10^–33) in Model 3 (P = 10^–5) and an increase from β_co-twin = 0.113 (P = 10^–4) in Model 2 to β^′_co-twin = 0.221 (P = 10^–13) in Model 3 (P = 10^–9). These results were not consistent with the brightest areas (Cirrocumulus) having a causal effect on the light areas.

The bright areas and brightest areas (Cirrocumulus)

With the bright areas as the predictor and the brightest areas (Cirrocumulus) as the outcome, there was a marginally significant decrease of 4% (P = 0.06) from β_self = 0.680 (P = 10^–159) in Model 1 to β^′_self = 0.653 (P = 10^–109) in Model 3, while there was a decrease of 85% (P = 10^–25) from β_co-twin = 0.378 (P = 10^–22) in Model 2 to β^′_co-twin = 0.058 (P = 0.07) in Model 3. These results were consistent with the bright areas having a causal effect on the brightest areas (Cirrocumulus) that accounted for 85% of their association and familial confounding which accounted for 15%.

With the brightest areas (Cirrocumulus) as the predictor and the bright areas as the outcome, there was no difference between β_self and β^′_self (P = 0.21), while there was an increase from β_co-twin = 0.136 (P = 10^–4) in Model 2 to β^′_co-twin = 0.202 (P = 10^–14) in Model 3 (P = 0.02). These results were not consistent with the brightest areas (Cirrocumulus) having a causal effect on the bright areas.

The above results were consistent with the existence of causal pathways from both the light areas and the bright areas to the brightest areas (Cirrocumulus), and to Cirrus, and from Cirrus to the brightest areas (Cirrocumulus).

Causal inference for the light areas and brightest areas (Cirrocumulus) not through Cirrus

With the light areas adjusted for Cirrus as the predictor and the brightest areas (Cirrocumulus) adjusted for Cirrus as the outcome, there was no difference between β_self in Model 1 and β^′_self in Model 3 (P = 0.2), while there was a decrease of 74% (P = 10^–10) from β_co-twin = 0.198 (P = 10^–6) in Model 2 to β^′_co-twin = 0.051 (P = 0.2) in Model 3. These results were consistent with a causal effect from the light areas to the brightest areas (Cirrocumulus) that was not through Cirrus, which accounted for 64% of their association, and with familial confounding that accounted for 36%.

With brightest areas (Cirrocumulus) adjusted for Cirrus as the predictor and light areas adjusted for Cirrus as the outcome, β_self increased from 0.287 (P = 10^–17) in Model 1 to 0.339 (P = 10^–23) after adjusting for co-twin’s brightest areas (Cirrocumulus) in Model 3 (P = 10^–5), β_co-twin increased from 0.046 (P = 0.2) in Model 2 to β^′_co-twin = 0.165 (P = 10^–7) after adjusting for the twin’s brightest areas (Cirrocumulus) in Model 3 (P = 10^–11). These results were consistent with a combination of a causal effect from the light areas to the brightest areas (Cirrocumulus) not through Cirrus and familial confounding.

Causal inference for the bright areas and brightest areas (Cirrocumulus) not through Cirrus

With the bright areas adjusted for Cirrus as the predictor and the brightest areas (Cirrocumulus) adjusted for Cirrus as the outcome, β_self decreased, but not significantly (P = 0.3), from 0.608 (P = 10^–90) in Model 1 to β^′_self = 0.597 (P = 10^–75) after adjusting for co-twin’s bright areas in Model 3, and β_co-twin decreased by 87% (P = 10^–13) from 0.268 (P = 10^–9) in Model 2 to the null (P = 0.3) after adjusting for the twin’s bright areas in Model 3. These results were consistent with the bright areas having a causal effect on the brightest areas (Cirrocumulus) not through Cirrus, which accounted for 92% of their association, and with familial confounding accounting for 8% of the association.

With the brightest areas (Cirrocumulus) adjusted for Cirrus as the predictor and the bright areas adjusted for Cirrus as the outcome, β_self did not change significantly (P = 0.3) after adjusting for co-twin’s brightest areas (Cirrocumulus), and β_co-twin increased from 0.08 (P = 0.186) in Model 2 to β^′_co-twin = 0.153 (P = 10^–6) after adjusting for the twin’s brightest areas (Cirrocumulus) in Model 3 (P = 0.001). These results are consistent with a combination of a causal effect, from the bright areas to the brightest areas (Cirrocumulus) not through Cirrus, and familial confounding.

In the subgroup analyses by level of per cent mammographic density based on the Cumulus measure, similar evidence was found in both subgroups for the causal relationships between most pairs of mammogram risk scores, suggesting that the causal relationships were unlikely to depend on breast density level (Additional file 1: Tables S3 and S4). However, the reduced sample size in the subgroups might limit the statistical power to detect a difference.

Figure 2 shows the possible causal pathways between the three mammographic density measures and Cirrus, based on the results presented.

Discussion

This study has shown that the amount of lighter (less dense) areas a woman has on her mammogram might cause her to also have more of the brightest (highly dense) areas, currently the strongest density measure associated with breast cancer risk [2,3,4, 7,8,9]. This causal relationship could be direct, but it could also be through the amount of less dense area having a causal effect on specific textural features that are themselves associated with breast cancer risk.

Our findings as encapsulated by Fig. 2 could also explain the observations from a recent publication of the WECARE study that the associations of contralateral breast cancer risk with the light areas and bright areas were attenuated to the null after adjusting for the brightest areas (Cirrocumulus), while the association with the brightest areas (Cirrocumulus) remained unchanged; the risk gradients for the light, bright, and brightest areas (Cirrocumulus) were 1.24, 1.34, and 1.40 when fitted alone, were 0.98, 1.01, and 1.40 when fitted together, respectively [6]. That is the associations of the light and bright areas with contralateral breast cancer risk could be due to their causal associations with the brightest areas (Cirrocumulus); the brightest areas (Cirrocumulus) are more aetiologically important than the light and bright areas in the causal pathway for contralateral breast cancer.

Note that when the light areas risk score was replaced by Cumulus (i.e. the areas including light, bright, and brightest areas (Cirrocumulus)), and the bright areas risk score was replaced by Altocumulus (i.e. the areas including bright and brightest areas (Cirrocumulus)) in the above models, the results were similar [6]. This is reasonable, because Cumulus is dominated by the light areas as well as the bright areas (see Table 1), Altocumulus is dominated by the bright areas, and both the light and bright areas have similar causal relationships with the brightest areas (Cirrocumulus). Similar results were also observed for the associations of Cumulus, the brightest areas (Cirrocumulus), and/or Altocumulus with screen-detected and young-age-at-diagnosis breast cancer, respectively [5, 9, 10]. Conversely, the association of interval breast cancer with the brightest areas (Cirrocumulus) was attenuated to the null when adjusting for Cumulus [10]. Therefore, as was observed in the WECARE study for contralateral breast cancer, the brightest areas (Cirrocumulus) are more aetiologically important than lighter areas for screen-detected and young-age-at-diagnosis breast cancer, while the light areas may have independent causal effects on interval breast cancer, distinct from those of the brightest areas (Cirrocumulus). It is worth noting that the risk associations between interval cancer and the brightest areas (Cirrocumulus) could be due to confounding by the light areas.

Subtracting the causal pathways involving Cirrus, the causal effect of bright areas on brightest areas (Cirrocumulus) remained strong, while the causal effect from light areas on brightest areas (Cirrocumulus) was much weaker (i.e. effect size of 0.3 versus 0.6). This is consistent with the observations that the greater the brightness of the dense region the stronger its association with breast cancer risk [6, 10].
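One way to read "subtracting the causal pathways involving Cirrus" is through the path-tracing rule for linear models, in which a total effect equals the direct path plus the product of the coefficients along each indirect path. The expression below is an illustrative sketch only; the symbols τ, d, and p are notation assumed here and do not appear in the paper.

```latex
% Assumed notation (not from the paper):
%   \tau_{L \to B} : total effect of the light areas (L) on the brightest areas (B)
%   p_{L \to C}, p_{C \to B} : path coefficients from L to Cirrus (C) and from C to B
%   d_{L \to B} : effect of L on B that does not pass through Cirrus
\tau_{L \to B} = d_{L \to B} + p_{L \to C}\, p_{C \to B}
\quad\Longrightarrow\quad
d_{L \to B} = \tau_{L \to B} - p_{L \to C}\, p_{C \to B}
```

The same decomposition would apply with the bright areas in place of the light areas.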

For Cirrus, the effect size of its causal associations with density measures is around 0.1, regardless of the direction and the threshold for defining dense areas. Dense area-based measures all have positive associations with Cirrus. Of these, the bright areas and light areas are both causes of Cirrus, whereas the brightest areas (Cirrocumulus) are causally affected by Cirrus. As a cause of the brightest areas, Cirrus is less aetiologically important than the light areas and bright areas, both of which have total effect sizes larger than 0.2. The weak causal relationships between Cirrus and other mammogram risk scores are reasonable. Cirrus captures textural information (i.e. various patterns identified by considering the spatial relationships between intensity levels), which is different from what can be captured using the threshold-based measures (i.e. pixel counts above or below an intensity level). Cirrus also uses information from the whole breast, rather than the local information measured by threshold-based measures. These results add grounded evidence to previous association-based speculations that texture-based mammographic measures, at least Cirrus, could be an independent and intrinsic risk factor for breast cancer [7, 12].

The findings of this study could inform biological research in trying to understand why conventional density is associated with breast cancer risk. Mammographic density is defined by different levels of pixel brightness thought to represent different types of breast tissue based on their differential X-ray attenuation. Fat tissue appears to be radiologically dark, while fibroglandular tissue appears to be light or bright. As mammograms are two-dimensional representations of tissue composition, each pixel in the image reflects a combination of fibroglandular and fat tissue. Malignant breast tissue, however, is developed from fibroglandular tissue and appears radiologically brighter than normal fibroglandular tissue. Therefore, the brightest areas (Cirrocumulus) contain more pixels and are closer to the radiological appearance of malignant breast tissue than the light areas and bright areas. This might explain in part its typically stronger association with screen-detected, young-age-at-diagnosis, and contralateral breast cancer risk, than other mammographic density measures, as observed in the previous studies [6, 10]. It could also explain why the risk associations of conventional density measures were attenuated once the brightest areas (Cirrocumulus) were included [6, 10].

Additionally, light areas contain a greater amount of less dense tissue (dispersed fibroglandular tissue and possibly fat tissue), compared with bright areas. The difference in tissue composition might explain the observed bidirectional causal relationships between light areas and bright areas, with the causal effect size of 0.7 on bright areas caused by light areas being stronger than the reverse causal effect size of 0.4. Therefore, less dense tissue, including dispersed fibroglandular tissue or fat tissue, could have causal effects on dense fibroglandular tissue.

The study has also demonstrated the value of ICE FALCON in decomposing associations between intercorrelated disease biomarkers into pathways. These causal associations could be identified and quantified even though there was also familial confounding, something that by definition cannot be considered in a Mendelian randomisation analysis that assumes genetic variants for these mammogram risk scores are true instrumental variables. The same approach could be applied to other biomarkers and diseases, such as lipids and cardiovascular diseases.

One strength of our study is that we used monozygotic twin pairs when conducting the ICE FALCON analyses, which can infer causation and its direction even if there is familial confounding between mammogram risk scores. This maximised the within-pair correlations in the risk scores, and they were close to 0.5. This also maximised the potential existence of non-genetic factors shared by twins and hence the amount of familial confounding.

Considering that there could be bidirectional causal relationships between two traits, and the need to make robust inference, we conducted the ICE FALCON analyses twice for each pair of risk scores, switching the roles of predictor and outcome. Causal inference was made only when the evidence from the two rounds of analyses was consistent or not contradictory. We also extensively checked the influence on the mammographic measures included in this study of other common covariates that, according to previous publications, have established or possible associations with breast cancer risk or mammographic density measures.
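The logic of comparing a co-twin's predictor before and after adjusting for a twin's own predictor can be sketched with simulated pairs. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses ordinary least squares on one twin per pair as "self", whereas the published method accounts for the paired structure and assesses the change in coefficients formally. The data-generating values, the within-pair correlation of 0.5 in the predictor, and all names (`simulate_pairs`, `ice_falcon_slopes`) are hypothetical.

```python
import random

def slope(x, y):
    """Least-squares slope of y on a single predictor x (intercept included)."""
    n = len(y)
    mx, my = sum(x) / n, sum(y) / n
    return (sum((a - mx) * (c - my) for a, c in zip(x, y))
            / sum((a - mx) ** 2 for a in x))

def slopes_two(x, z, y):
    """Least-squares slopes of y on x and z jointly, via the 2x2 normal equations."""
    n = len(y)
    mx, mz, my = sum(x) / n, sum(z) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    szz = sum((b - mz) ** 2 for b in z)
    sxz = sum((a - mx) * (b - mz) for a, b in zip(x, z))
    sxy = sum((a - mx) * (c - my) for a, c in zip(x, y))
    szy = sum((b - mz) * (c - my) for b, c in zip(z, y))
    det = sxx * szz - sxz ** 2
    return (szz * sxy - sxz * szy) / det, (sxx * szy - sxz * sxy) / det

def simulate_pairs(n, causal, rng):
    """Return (x_self, x_cotwin, y_self) triples under one of two hypothetical scenarios."""
    pairs = []
    for _ in range(n):
        shared = rng.gauss(0, 1)          # familial factor shared within the pair
        x_self = shared + rng.gauss(0, 1)
        x_co = shared + rng.gauss(0, 1)   # within-pair correlation of x is 0.5
        if causal:
            y_self = 0.5 * x_self + rng.gauss(0, 1)   # x causes y
        else:
            y_self = shared + rng.gauss(0, 1)         # familial confounding only
        pairs.append((x_self, x_co, y_self))
    return pairs

def ice_falcon_slopes(pairs):
    """Fit the three regressions: own predictor, co-twin's predictor, and both together."""
    x_self = [p[0] for p in pairs]
    x_co = [p[1] for p in pairs]
    y_self = [p[2] for p in pairs]
    m3_self, m3_co = slopes_two(x_self, x_co, y_self)
    return {
        "model1_self": slope(x_self, y_self),
        "model2_cotwin": slope(x_co, y_self),
        "model3_self": m3_self,
        "model3_cotwin": m3_co,
    }

rng = random.Random(7)
causal = ice_falcon_slopes(simulate_pairs(20000, True, rng))
confounded = ice_falcon_slopes(simulate_pairs(20000, False, rng))
print("causal:", {k: round(v, 3) for k, v in causal.items()})
print("confounded:", {k: round(v, 3) for k, v in confounded.items()})
```

In the causal scenario the co-twin coefficient is non-zero when fitted alone but attenuates towards zero once the twin's own predictor is included. In the confounding-only scenario it stays clearly non-zero after adjustment. Switching the roles of predictor and outcome, as described above, amounts to rerunning the same three regressions with x and y exchanged.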

One limitation of the study is that the epidemiological data were not collected strictly contemporaneously with the participants’ mammography episodes. However, we have previously shown that mammographic density adjusted for age and/or BMI tracks strongly with time, at least over 8 years [31, 32]. Also, the time difference was small (on average 1.68 years, with 177 participants having a more than 3-year difference), and we updated the menopausal status and BMI based on the date information from the questionnaires and mammograms; we therefore consider that the influence of the time difference on our conclusions should be minimal. Another limitation is that our analyses were based on film mammograms rather than digital mammograms, which have now replaced film mammograms and yield some different image characteristics [33]. We are currently working to replicate the study findings using digital mammograms. Given that our research on different definitions of density based on brightness found similar results for digitised and digital mammograms [3, 4] in terms of breast cancer risk prediction, we expect the associations and relevant causal estimates between the three dense areas to be similar between digitised and digital mammograms. The presence of measurement error poses another limitation, as the assessment of dense areas relies on human measurement. Measurement error in the mammogram risk scores is expected to bias the associations between the risk scores, and the estimates for familial confounding and causal effects, towards the null. However, considering the high reproducibility of the measures in our study (see Methods), the influence of measurement error on our results is likely to be minimal.

To the best of our knowledge, we are the first to investigate causal relationships between mammographic measures that predict breast cancer risk. The previous studies on mammographic density have primarily focused on its risk prediction capabilities for breast cancer [34, 35] with an emphasis on conventional mammographic density, and twin and family studies mainly for investigating familial aggregation and heritability of the density measure [15, 18, 19, 36,37,38]. In contrast, our study takes a novel approach by applying a twin-based design to make causal inference in cross-sectional data. Specifically, we utilise the within- and cross-twin relationships between two variables and apply the analytic method of ICE FALCON. We are also the creators of the mammographic density measures defined by the increasing threshold of pixel brightness.

Conclusions

In a mammogram, the less dense (light and bright) areas could have a positive causal effect on the densest (brightest) areas which eventually become cancers themselves. There could also be another pathway to cancer evident through textural features not related to density per se, i.e. Cirrus, which also has a causal effect on the brightest areas (Cirrocumulus). Given our previous findings, the brightest areas (Cirrocumulus) are more aetiologically important than lighter areas for screen-detected, young-age-at-diagnosis, and contralateral breast cancer, while the light areas have a causal effect on interval breast cancer that is independent of the brightest areas. In addition to the causal effects from density measures, mammographic measures based on specific textural features, like Cirrus, contain aetiologically independent information for breast cancer risk [6, 10].

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

BMI: Body mass index
ICE FALCON: Inference about Causation from Examination of FAmilial CONfounding
SD: Standard deviation

References

1. Yaffe MJ. Mammographic density. Measurement of mammographic density. Breast Cancer Res. 2008;10(3):209.
2. Boyd NF, Guo H, Martin LJ, Sun L, Stone J, Fishell E, et al. Mammographic density and the risk and detection of breast cancer. N Engl J Med. 2007;356(3):227–36.
3. Nguyen TL, Aung YK, Evans CF, Yoon-Ho C, Jenkins MA, Sung J, et al. Mammographic density defined by higher than conventional brightness threshold better predicts breast cancer risk for full-field digital mammograms. Breast Cancer Res. 2015;17:142.
4. Nguyen TL, Aung YK, Evans CF, Dite GS, Stone J, MacInnis RJ, et al. Mammographic density defined by higher than conventional brightness thresholds better predicts breast cancer risk. Int J Epidemiol. 2017;46(2):652–61.
5. Nguyen TL, Aung YK, Li S, Trinh NH, Evans CF, Baglietto L, et al. Predicting interval and screen-detected breast cancers from mammographic density defined by different brightness thresholds. Breast Cancer Res. 2018;20(1):152.
6. Watt GP, Knight JA, Nguyen TL, Reiner AS, Malone KE, John EM, et al. Association of contralateral breast cancer risk with mammographic density defined at higher-than-conventional intensity thresholds. Int J Cancer. 2022;151(8):1304–9.
7. Gastounioti A, Conant EF, Kontos D. Beyond breast density: a review on the advancing role of parenchymal texture analysis in breast cancer risk assessment. Breast Cancer Res. 2016;18(1):91.
8. Schmidt DF, Makalic E, Goudey B, Dite GS, Stone J, Nguyen TL, et al. Cirrus: an automated mammography-based measure of breast cancer risk based on textural features. JNCI Cancer Spectr. 2018;2(4):pky057.
9. Hopper JL, Nguyen TL, Schmidt DF, Makalic E, Song YM, Sung J, et al. Going beyond conventional mammographic density to discover novel mammogram-based predictors of breast cancer risk. J Clin Med. 2020;9(3):627.
10. Nguyen TL, Schmidt DF, Makalic E, Maskarinec G, Li S, Dite GS, et al. Novel mammogram-based measures improve breast cancer risk prediction beyond an established mammographic density measure. Int J Cancer. 2021;148(9):2193–202.
11. Nguyen TL, Choi YH, Aung YK, Evans CF, Trinh NH, Li S, et al. Breast cancer risk associations with digital mammographic density by pixel brightness threshold and mammographic system. Radiology. 2018;286(2):433–42.
12. Warner ET, Rice MS, Zeleznik OA, Fowler EE, Murthy D, Vachon CM, et al. Automated percent mammographic density, mammographic texture variation, and risk of breast cancer: a nested case-control study. NPJ Breast Cancer. 2021;7(1):68.
13. Winkel RR, von Euler-Chelpin M, Nielsen M, Petersen K, Lillholm M, Nielsen MB, et al. Mammographic density and structural features can individually and jointly contribute to breast cancer risk assessment in mammography screening: a case–control study. BMC Cancer. 2016;16(1):1–12.
14. Robins JM. Association, causation, and marginal structural models. Synthese. 1999;121(1/2):151–79.
15. Nguyen TL, Li S, Dowty JG, Dite GS, Ye Z, Nguyen-Dumont T, et al. Familial aspects of mammographic density measures associated with breast cancer risk. Cancers (Basel). 2022;14(6):1483.
16. Li S, Bui M, Hopper JL. Inference about causation from examination of familial confounding (ICE FALCON): a model for assessing causation analogous to Mendelian randomization. Int J Epidemiol. 2020;49(4):1259–69.
17. Odefrey F, Stone J, Gurrin LC, Byrnes GB, Apicella C, Dite GS, et al. Common genetic variants associated with breast cancer and mammographic density measures that predict disease. Cancer Res. 2010;70(4):1449–58.
18. Boyd NF, Dite GS, Stone J, Gunasekara A, English DR, McCredie MR, et al. Heritability of mammographic density, a risk factor for breast cancer. N Engl J Med. 2002;347(12):886–94.
19. Li S, Nguyen TL, Nguyen-Dumont T, Dowty JG, Dite GS, Ye Z, et al. Genetic aspects of mammographic density measures associated with breast cancer risk. Cancers (Basel). 2022;14(11):2767.
20. Haby MM, Markwick A, Peeters A, Shaw J, Vos T. Future predictions of body mass index and overweight prevalence in Australia, 2005–2025. Health Promot Int. 2012;27(2):250–60.
21. Box GEP, Cox DR. An analysis of transformations. J R Stat Soc Ser B (Methodol). 1964;26(2):211–52.
22. Dite GS, Gurrin LC, Byrnes GB, Stone J, Gunasekara A, McCredie MR, et al. Predictors of mammographic density: insights gained from a novel regression analysis of a twin study. Cancer Epidemiol Biomarkers Prev. 2008;17(12):3474–81.
23. Stone J, Dite GS, Giles GG, Cawson J, English DR, Hopper JL. Inference about causation from examination of familial confounding: application to longitudinal twin data on mammographic density measures that predict breast cancer risk. Cancer Epidemiol Biomarkers Prev. 2012;21(7):1149–55.
24. Hopper JL, Bui QM, Erbas B, Matheson MC, Gurrin LC, Burgess JA, et al. Does eczema in infancy cause hay fever, asthma, or both in childhood? Insights from a novel regression model of sibling data. J Allergy Clin Immunol. 2012;130(5):1117-22 e1.
25. Davey CG, Lopez-Sola C, Bui M, Hopper JL, Pantelis C, Fontenelle LF, et al. The effects of stress-tension on depression and anxiety symptoms: evidence from a novel twin modelling analysis. Psychol Med. 2016;46(15):3213–8.
26. Bui M, Bjornerem A, Ghasem-Zadeh A, Dite GS, Hopper JL, Seeman E. Architecture of cortical bone determines in part its remodelling and structural decay. Bone. 2013;55(2):353–8.
27. Li S, Wong EM, Bui M, Nguyen TL, Joo JE, Stone J, et al. Causal effect of smoking on DNA methylation in peripheral blood: a twin and family study. Clin Epigenetics. 2018;10:18.
28. Li S, Wong EM, Bui M, Nguyen TL, Joo J-HE, Stone J, et al. Inference about causation between body mass index and DNA methylation in blood from a twin family study. Int J Obes. 2019;43(2):243–52.
29. Wright S. The method of path coefficients. Ann Math Stat. 1934;5(3):161–215.
30. R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2022.
31. Stone J, Ding J, Warren RML, Duffy SW, Hopper JL. Using mammographic density to predict breast cancer risk: dense area or percentage dense area. Breast Cancer Res. 2010;12(6):1–7.
32. Krishnan K, Baglietto L, Stone J, Simpson JA, Severi G, Evans CF, et al. Longitudinal study of mammographic density measures that predict breast cancer risk. Cancer Epidemiol Biomarkers Prev. 2017;26(4):651–60.
33. Fischmann A, Siegmann KC, Wersebe A, Claussen CD, Müller-Schimpfle M. Comparison of full-field digital mammography and film–screen mammography: image quality and lesion detection. Br J Radiol. 2005;78(928):312–5.
34. Vachon CM, van Gils CH, Sellers TA, Ghosh K, Pruthi S, Brandt KR, et al. Mammographic density, breast cancer risk and risk prediction. Breast Cancer Res. 2007;9(6):217.
35. Pettersson A, Graff RE, Ursin G, Santos Silva ID, McCormack V, Baglietto L, et al. Mammographic density phenotypes and risk of breast cancer: a meta-analysis. J Natl Cancer Inst. 2014;106(5):dju078.
36. Hopper JL, Carlin JB. Familial aggregation of a disease consequent upon correlation between relatives in a risk factor measured on a continuous scale. Am J Epidemiol. 1992;136(9):1138–47.
37. Nguyen TL, Schmidt DF, Makalic E, Dite GS, Stone J, Apicella C, Bui M, et al. Explaining variance in the cumulus mammographic measures that predict breast cancer risk: a twins and sisters study. Cancer Epidemiol Biomarkers Prev. 2013;22(12):2395–403.
38. Holowko N, Eriksson M, Kuja-Halkola R, Azam S, He W, Hall P, et al. Heritability of mammographic breast density, density change, microcalcifications, and masses. Cancer Res. 2020;80(7):1590–600.

Acknowledgements

We wish to thank Twins Research Australia and the twins who participated in this study. The Australian Mammographic Density Twins and Sisters Study was facilitated through access to Twins Research Australia, a national resource supported by a Centre of Research Excellence Grant (Grant No. 1079102) from the NHMRC. Z.Y is supported by China Scholarship Council—University of Melbourne PhD Scholarship. T.L.N. is supported by Cancer Council Victoria (AF7305). S.L. is an NHMRC Emerging Leadership Fellow (GNT2017373). V.F.C.E. is supported by an Australian Government Research Training Program Scholarship. M.C.S. is supported by a NHMRC Fellowship (GNT1155163). J.L.H. is supported by a NHMRC Fellowship (GNT1137349).

Funding

The AMDTSS was supported by National Health and Medical Research Council (NHMRC) (Grant Nos. 1050561 and 1079102) and Cancer Australia and National Breast Cancer Foundation (Grant No. 509307). This research was funded by the Cancer Council Victoria (AF7305), Victoria Cancer Agency (ECRF19020), NHMRC (APP1185980, APP2006899, and GNT2017373), National Breast Cancer Foundation (IIRS-20-054), and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1A2C2101041).

Author information

Authors and Affiliations

Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Parkville, VIC, 3051, Australia
Zhoufeng Ye, Tuong L. Nguyen, Gillian S. Dite, Robert J. MacInnis, Enes Makalic, Osamah M. Al-Qershi, Minh Bui, Vivienne F. C. Esser, James G. Dowty, Ho N. Trinh, Christopher F. Evans, Mark A. Jenkins, Graham G. Giles, John L. Hopper & Shuai Li
Genetic Technologies Limited, Fitzroy, VIC, 3065, Australia
Gillian S. Dite
Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, VIC, 3004, Australia
Robert J. MacInnis, Graham G. Giles & Melissa C. Southey
Department of Data Science and AI, Faculty of IT, Monash University, Melbourne, Australia
Daniel F. Schmidt
Electrical and Computer Systems Engineering Discipline, School of Engineering, Monash University Malaysia, 47500, Sunway City, Malaysia
Maxine Tan
School of Electrical and Computer Engineering, The University of Oklahoma, Norman, OK, 73019, USA
Maxine Tan
Department of Public Health Sciences, Division of Genome and Health Big Data, Graduate School of Public Health, Seoul National University, Seoul, 08826, Korea
Joohon Sung
Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, 3168, Australia
Graham G. Giles, Melissa C. Southey & Shuai Li
Department of Clinical Pathology, The University of Melbourne, Parkville, VIC, 3051, Australia
Melissa C. Southey
Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, CB1 8RN, UK
Shuai Li
Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, VIC, 3051, Australia
Shuai Li

Contributions

ZY, SL, TLN, GSD, RJM, and JLH helped in conceptualisation; SL, DFS, EM, OMA, and JLH helped in methodology; SL, MB, VE, and JLH worked in software; ZY helped in formal analysis; TLN, JGD, HNT, and CFE helped in data curation; ZY contributed to writing—original draft preparation; all contributed to writing—review and editing; SL, TLN, GSD, RJM, and JLH worked in supervision; TLN, MT, MAJ, GGG, MCS, and JLH worked in project administration; and TLN, MT, JS, MAJ, GGG, MCS, and JLH worked in funding acquisition. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Shuai Li.

Ethics declarations

Ethics approval and consent to participate

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by Melbourne School of Population and Global Health Human Ethics Advisory Group (Ethics ID: 2057536.1).

Consent for publication

Not applicable.

Competing interests

GSD is an employee of Genetic Technologies Ltd. The other authors have no conflict of interest to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1. The diagram of ICE FALCON methodology. Figure S2. The diagram of the relationships between risk scores analysed using ICE FALCON. Table S1. Linear mixed-effects models of mammographic measures and covariates. Table S2. The relationships between Cirrus and mammographic density measures analysed by using the ICE FALCON method. Table S3. The relationships between Cirrus and mammographic density measures for percent mammographic density ≤ 30.5% analysed by using the ICE FALCON method. Table S4. The relationships between Cirrus and mammographic density measures for percent mammographic density > 30.5% analysed by using the ICE FALCON method.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Ye, Z., Nguyen, T.L., Dite, G.S. et al. Causal relationships between breast cancer risk factors based on mammographic features. Breast Cancer Res 25, 127 (2023). https://doi.org/10.1186/s13058-023-01733-1

Received: 27 June 2023
Accepted: 17 October 2023
Published: 25 October 2023
DOI: https://doi.org/10.1186/s13058-023-01733-1
