Background

Coronary artery disease (CAD), also called coronary heart disease (CHD), is the leading cause of multiple metabolic diseases and mortality [1,2,3]. To date, CAD is still incurable, and its pathogenesis is not clear [4]. Therefore, CAD prevention becomes particularly important, especially the identification and risk assessment of risk factors, which are the first steps in the prevention of CAD [1]. In addition to some common risk factors, such as age, sex and race, recent studies have more consistently shown that air pollution also leads to cardiovascular disease (CVD) [5,6,7,8].

Moreover, the limited previous studies involving the relationship between air pollution and CAD risk were mainly concentrated on certain subclinical or clinical disorders of CAD, such as coronary vasomotor abnormalities [9], coronary artery calcium [10, 11], acute myocardial infarction [12] and coronary plaques [13]. Other studies mainly considered the risk of near-roadway air pollution (NRAP) on CAD morbidity [14, 15]. However, national prospective population studies assessing the relationship between ambient air pollution exposure and CAD risk remain scarce.

Additionally, it is well accepted that both hereditary susceptibility and air pollution exposure contribute to the risk of CAD, but neglect of their combined or interaction effects made the results biased [1]. Recently, increasing evidence has reported that genetic susceptibility could interact with environmental factors to affect the onset of cardiometabolic diseases [16, 17], including CVDs [18, 19]. However, whether the air pollution exposure alters the association between genetic factors and CAD remains unknown.

Therefore, by means of the polygenic risk score (PRS) [20, 21], we applied data from the UK Biobank with comprehensive information on common baseline characteristics, exposure factors and outcomes to evaluate the separate, combined effect or interaction association between genetic factors and air pollutants, including particulate matter with diameters ranging from ≤ 2.5–≤10 μm (PM2.5, PM2.5−10 and PM10), nitrogen dioxide (NO2), and nitrogen oxides (NOx), and the risk of CAD.

Methods

Study cohort and data access

The current study utilized data from the UK Biobank, a national multicentre prospective cohort study [22]. Briefly, ~ 0.5 million residents aged 40–69 years between 2006 and 2010 were recruited from the UK National Health Service and living < 25 miles from 1 of the 22 study assessment centres across the UK (England, Wales and Scotland). The baseline summary characteristics can be viewed at the website of UK Biobank (https://biobank.ndph.ox.ac.uk/showcase/). The UKB was approved by the North West Multi-Center Research Ethics Committee, and all participants provided informed consent forms.

All eligible researchers could submit the application for data access. The detailed access procedure can be found at UK Biobank’s website (http://www.ukbiobank.ac.uk/register-apply/).

Air pollution exposure assessment

Average exposure to air pollutants, including PM2.5, PM2.5−10, PM10, NO2 and NOx, was assessed in the UK Biobank study under strict measures and internationally recognized standards [23, 24]. Details can be viewed in the Supplementary Materials.

CAD ascertainment

The outcome of this study was CAD, which was defined with the 9th /10th Revision of International Classification of Diseases (ICD). In the UK Biobank, CAD was determined with the Office of Population Censuses and Surveys’ Classification of Interventions and Procedures, self-reported diagnoses and Hospital Episode Statistics data, as previously described [25], and the corresponding UK Biobank codes are provided in Table S1. This definition of CAD includes myocardial infarction and its related sequelae.

Calculation of covariates

We included multiple covariates including baseline characteristics such as age and sex, and health-related outcomes. The detailed contents about variable assessment and data access code are provided in the Supplementary Materials.

PRS contraction

We used the PRS to calculate the accumulative effects of multiple genetic variations, and the detailed definition and formula are presented in the Supplementary Materials. The present PRS utilized data from the largest available CAD genome-wide association meta-analysis without the UK Biobank population [26], with 44 single nucleotide polymorphisms (SNPs). Forty SNPs were finally available in the UK Biobank imputed dataset (see Table S2).

Analytical cohort

Participants were excluded if they met one of the following criteria: (i) preexisting CAD at enrolment, (ii) lack of air pollution exposure information, or (iii) lack of CAD genetic information (Fig. 1). After exclusion for the above reasons, 447,530 subjects were used to investigate the association of particulate matter (PM2.5, PM2.5−10, PM10) and CAD, and 480,298 subjects were used to investigate the association of nitrogen oxides (NO2, NOx) and CAD. Furthermore, when considering the genetic factors, there were 407,470 individuals to investigate the relationship of PM, genetic factors and CAD and 438,736 in the NO group.

Fig. 1
figure 1

Flow diagram for participant inclusion. PM2.5, fine particulate matter with a diameter ≤ 2.5 μm; PM10, particulate matter with diameter ≤ 10 μm; PM2.5−10, particulate matter with diameter between 2.5 and 10 μm; NO2, nitrogen dioxide; NOx, nitrogen oxides. CAD, coronary artery disease; PRS, polygenic risk score; GWAS, genome-wide association study

Statistical analyses

All analyses in the current study were conducted using R software (Version 4.1.1) and Stata (Version 15.1), and two-sided P values < 0.05 were defined as statistically significant. Cox regression models were constructed to investigate the relationship between air pollution and CAD and to calculate hazard ratios (HRs) and 95% confidence intervals (CIs) with adjustment for multiple covariates. We tested the proportional hazards assumption with Schoenfeld residuals methods. We performed a restricted cubic spline (RCS) transformation to explore possible linear/nonlinear correlations of air pollution and CAD and used the Akaike information criterion (AIC) to determine the optimal number of knots of RSC transformation [27].

Because more than 90% of participants are white and race is a common confounding factor, we limited subjects to the White race to explore the combined effects of air pollution exposure and genetic factors on CAD incidence. The additive interaction term was assessed with two indices: the relative excess risk due to the interaction (RERI) and the attributable proportion (AP) due to the interaction [28]. The 95% CIs of the RERI and AP were determined using Excel written by T. Andersson [29], and when 0 was within the CIs of the RERI and AP, it meant that there was no additive interaction. The multiplicative interaction term was assessed by setting variable cross-product terms of gene‒environmental factors in the models to observe whether the corresponding P value was < 0.05.

Multiple imputation was conducted to account for the missing covariate data. The missing categorical variables were imputed with multiple imputation based on latent class (MILC), and the missing continuous variables (e.g., physical activity) were imputed with multivariate imputation by chained equation (MICE) using predictive mean matching. To validate the robustness of our results, we perform several sensitivity analyses. Considering the characteristics of the cohort study, we excluded participants with less than 2 years of follow-up time to avoid false-positive associations.

Results

Table 1 presents the baseline characteristics of eligible participants by incident CAD. In the PM dataset, participants who suffered from CAD were mainly males, older, and had a higher BMI than controls (all P < 0.001). Moreover, previous or current smokers were more likely to develop CAD, while the opposite is true for current alcohol drinkers (all P < 0.001). Additionally, participants with diabetes or CVD at baseline were more likely to develop CAD (all P < 0.001). Participants in the NOx dataset had similar descriptions of baseline characteristics (all P < 0.001). In addition, PM2.5 was highly correlated with nitrogen oxides (r = 0.86 for NO2; r = 0.85 for NOx), while PM2.5−10 was related to PM10 (r = 0.82) (see Figure S1).

Table 1 Baseline characteristics of participants in the UK Biobank study

During a median of 8.8 years (4,187,431 person-years) of follow-up, 22,897 incident CAD cases were recorded. Kaplan‒Meier (K‒M) plots for CAD in the PM2.5, PM10, PM2.5−10, NO2 and NOx, concentration quartiles are presented in Figures S2-S6, respectively. The results showed that the differences in Kaplan‒Meier curves were significant among PM2.5 (P = 4.0 × 10−13), PM10 (P = 0.010), NO2 (P = 9.0 × 10−11) and NOx (P = 6.0 × 10−14) concentration quartiles but not PM2.5−10 (P = 0.200).

Table 2 presents the relationships between air pollution exposure and CAD risk after adjustment for multiple possible confounders. The results showed that PM2.5, PM10, NO2 and NOx each was still related to an elevated risk of CAD in the multivariate-adjusted models (all P < 0.05). In the model 1, the HRs (95% CI) of CAD for 10-µg/m3 increases in PM2.5, PM10, NO2 and NOx concentrations were 2.56 (2.27, 2.90), 1.22 (1.14, 1.31), 1.11 (1.09, 1.13) and 1.05 (1.04, 1.06), respectively. After further adjusting for race, drinking, smoking status, education level, BMI, UK Biobank assessment centre, physical activity, diabetes and CVD at baseline, the HRs (95% CI) of CAD for 10-µg/m3 increases in PM2.5, NO2 and NOx concentrations were 1.25 (1.09, 1.44), 1.03 (1.01, 1.05) and 1.01 (1.00, 1.02), respectively. The relationship between PM2.5−10 and PM10 exposure and CAD risk were not statistically significant (P = 0.865 for PM2.5−10; P = 0.146 for PM10). Certainly, the third quartiles of PM10 concentration presented more significant effect on CAD risk than first quartiles (HRs [95% CI]: 1.05 [1.01, 1.09], P = 0.027). Interestingly, the association between air pollutants and CAD were not statistically significant after further adjusting TDI (All P > 0.05, Table S3). Furthermore, the RCS results also verified the significant association of air pollutants, including PM2.5 (P < 0.0001), PM10 (P = 0.0001), NO2 (P < 0.0001) and NOx (P < 0.0001), and CAD risk (Figures S7). In addition, subgroup analysis showed that PM2.5, NO2 and NOx had more significant effects on CAD risk in female group, overweight/obesity group and previous or current smokers (see Table S4). And NOx and sex have an interaction effect on CAD risk.

Table 2 Adjusted hazard ratio and 95% confidence interval of coronary artery disease by air pollution exposure

Then, we explored the combined effects of genetic factors and air pollution on the risk of CAD and found the statistically significant results (see Fig. 2). Although the multiplicative interactions of the genetic variation and air pollution on CAD risk were not statistically significant (all P-interaction > 0.05), we observed that individuals with high PRS and air pollution exposure simultaneously showed the highest risk of CAD compared with any other group. The HRs (95% CI) of CAD in the PM2.5, PM10, NO2 and NOx high joint exposure groups were 1.56 (1.48, 1.64), 1.55 (1.48, 1.63), 1.57 (1.49, 1.65), and 1.57 (1.49, 1.65), respectively.

Fig. 2
figure 2

The joint association of the included air pollutant exposure and genetic categories with the risk of incident CAD in the UK Biobank. A Joint effects of PM2.5 and genetic variations; (B) Joint effects of PM10 and genetic variations; (C) Joint effects of NO2 and genetic variations; (D) Joint effects of NOx and genetic variations. Adjusted for age (continuous), sex (male/female), alcohol consumption (never, previous, current, missing), smoking status (never, previous, current, missing), body mass index (< 25 kg/m2, 25 to 29.9 kg/m2, ≥ 30 kg/m2, missing), education level (College or University degree, A/AS-level, O-level/GCSE, CSE, NVQ/HND/HNC, other qualifications, none), UK Biobank assessment center, physical activity (continuous, MET-min/week), diabetes at baseline (yes/no), cardiovascular disease at baseline (yes/no), genotyping batch, and the first 4 genetic principal components

In addition, the RERI and AP were statistically significant, which demonstrated the positive additive interactions of genetic factors and air pollutants on CAD risk (see Table 3). Compared with the low genetic risk and low air pollution exposure group, the RERIs (95% CI) of CAD in the PM2.5, PM10, NO2 and NOx high joint exposure groups were 0.12 (0.05, 0.19), 0.17(0.10, 0.24), 0.14 (0.07, 0.21), and 0.17 (0.10, 0.24), respectively, and the APs (95% CI) of CAD in the PM2.5, PM10, NO2 and NOx high joint exposure groups were 0.09 (0.04, 0.14), 0.12(0.07, 0.18), 0.11 (0.06, 0.16), and 0.13 (0.08, 0.18), respectively. Specifically, in the high PM2.5 exposure and high PRS group, participants had an additional 13% risk of CAD compared with those with low PM2.5 exposure and a low PRS due to the synergistic effect of PM2.5 exposure and genetic risk; the gene-PM2.5 interaction was responsible for 10% of the CAD cases in the participants with high PM2.5 exposure and high genetic risk.

Table 3 Additive joint interaction for included air pollutants exposure and genetic categories on the incident coronary artery disease

Sensitivity analyses showed that the additive interactions and combined effects of genetic factors and air pollution remained statistically significant after excluding the individuals with a follow-up time of less than two years (see Tables S5, Tables S6-S9).

Discussion

In this prospective cohort study with a median of 8.8 years of follow-up, we observed that air pollution exposure was significantly related to an increased CAD risk, and the risk could be further strengthened by CAD gene susceptibility. Additionally, there were positive gene‒environmental additive interactions on the onset of CAD.

Air pollutants are complex mixtures containing various different gases, liquids and particulates, whose complexities and biological responses make it difficult to demonstrate the relationship between different pollutants and diseases [30, 31]. Previous population-based studies have reported that PM2.5 exposure was relevant to both cardiovascular diseases and all-cause mortality [32, 33], and our results further verified this finding. Moreover, we found that PM10 exposure was also linked to an increased risk of CAD, but the association of PM2.5−10 and CAD risk was not statistically significant. The possible reason was that the effect of PM10 might rely on PM2.5 and PM10 comprising PM2.5−10 and PM2.5 [34]. To our knowledge, PM2.5 has the lowest particulate matter compared to PM2.5−10 and PM10, subsequently having the strongest potential toxicity, which is also responsible for an extensive proportion of the effects of PM10 [31]. Moreover, we also found significantly positive associations between NO2 and NOx and CAD risk, even though the effects were lower than those of PM. Nitrogen oxides, other components of air pollutant mixtures, were also reported to be significantly related to the risk of CVD, but the results remained inconsistent due to short-term air pollution exposure [35, 36]. In addition, it is well known that older age and male sex are risk factors for CAD [1]. Our results further verified these findings. We also found that PM2.5, NO2 and NOx had interactions with sex on CAD risk. NOx and age have an interaction effect on CAD risk.

Previous animal studies have shown that particulate components may promote the growth of atherosclerotic plaques and add markers of plaque rupture vulnerability [37, 38]. Subsequently, some potential mechanisms linking air pollution and CVD have been demonstrated. Of those mechanisms, oxidative stress and inflammation are the most acceptable mechanisms accounting for the observed associations of air pollution and CAD [31, 39, 40]. Prolonged or repeated oxidative stress and inflammation may cause endothelial dysfunction, asymptomatic atherosclerosis, coronary vasomotor abnormalities, coronary artery calcium, acute myocardial infarction and coronary plaques, which could eventually develop into CAD [41,42,43].

In addition to environmental effects, genetic factors are also an important influencing factor of CAD risk [1, 44]. Previous studies and genome-wide association studies (GWAS) have demonstrated that CVD has a strong genetic component, with heritability estimates ranging between 40 and 60% [26, 45, 46]. Of course, the genetic effects are composed of multiple common single genomes. Indeed, GWAS have shown that most cardio-metabolic diseases, including diabetes, hypertension and CAD, are influenced by many risk alleles [47]. In this study, we found that participants with high genetic risk and high air pollution levels were prone to developing CAD compared to subjects with low genetic risk and low air pollution based on the PRS. Furthermore, air pollution and genetic factors presented clear additive interactions on CAD risk. To the best of our knowledge, this is the first large-scale prospective study to evaluate the combined and interaction effects of air pollution and genetic factors on the incidence of CAD. Until now, limited previous studies only assessed the combination and interactions of the gene‒environment in some common CVDs, such as hypertension [16, 18, 19]. Certainly, a large study [48] with approximately 60 919 CAD cases and 80 243 controls revealed that the protective effect of ADAMTS7 gene polymorphism (rs7178051) on CAD was weakened by environmental factors (5% lower CAD risk in smokers; 12% lower CAD risk in nonsmokers) compared to nonsmokers (12% lower risk), which showed the gene‒environmental interaction effects on CAD risk from another aspect. Considering the reverse causality and confounding bias, when excluding the participants with a follow-up time of less than two years or with baseline CVD, we repeated the analysis procedure, and the results did not change appreciably.

There are some strengths in our study. Our study was the first prospective comprehensive study to evaluate the separate, combined and interaction effects of air pollution and genetic factors on the risk of CAD. This new viewpoint provides clues to the aetiology of CAD and provides a reference for the prevention and treatment of susceptible people. Moreover, our article has strict quality control, including strict assessment of air pollution exposure and outcomes, proper statistical methods (PRS), further guaranteeing the robustness and reliability of our findings.

However, we must acknowledge that our study also has some shortcomings. First, air pollutants are made up of many different components [30, 31], but we only explored the associations between some common pollutants, including PM2.5, PM2.5−10, PM10, NO2 and NOx and CAD risk, ignoring the specific parts of air pollutant chemical constituents. Further studies may concentrate on the associations of certain components or physical, chemical or biological properties of air pollutant components and CAD risk [49, 50]. Second, air pollution exposure may be misclassified because the exposure data for this study were only restricted to residential addresses. Although the coverage area of the monitoring system is up to 400 km, the specific real-time environmental exposure of the participants cannot be detected. Future research should consider adorning high-tech portable devices to detect the participants’ surrounding environment in real time. Third, the data were obtained from UK populations, most of which were white (94.0%). Therefore, the generalization of gene-associated findings should be interpreted with caution. Meanwhile, in the future, studies on the respective and combined effects of air pollution-genetic factors on CAD could be conducted in different ethnic groups. Fourth, air pollutant levels were a time-varying factor, but the air pollution information in the UK Biobank was available for several years (2010 for PM2.5, PM2.5−10, and NOx; 2005–2007 and 2010 for NO2; 2007 and 2010 for PM10). If applicable, further study should dynamically detect the air pollutant concentration to avoid over- or underestimating air pollutant toxicity. Fifth, although education level and the Townsend deprivation index are possibly associated with CAD, we did not include them as covariables because they were not available for us in the UK Biobank dataset. However, we included as many variables as possible that were associated with CAD, such as common metabolic disease, physical activity, smoking and alcohol consumption status. Certainly, it is necessary to consider these influencing factors, including education level and the Townsend deprivation index, in future studies related to CAD.

Conclusions

In summary, air pollution exposure was significantly associated with an increased CAD risk, which could be further strengthened by CAD. Additionally, our findings showed positive additive gene‒environmental interactions on the onset of CAD and highlighted the importance of comprehensively evaluating air pollution and genetic factors in prevention efforts for people who are vulnerable to CAD.