Introduction

Cardiovascular disease (CVD) is the primary cause of morbidity and mortality worldwide [1,2,3]. Coronary artery disease, in particular, is the primary cause of disability-adjusted life years lost globally [3,4,5], and its risk increases with age [6, 7]. Coronary artery calcification (CAC) is a dynamic and complex biological process linked to aging and serves as an index of arteriosclerosis [8,9,10]. Recent studies have shown that CAC is a predictor of CVD and major adverse cardiac events [11,12,13] leading to the endorsement of computed tomography (CT) CAC scoring for risk decisions at the primary prevention level [14, 15•]. Consequently, CAC has emerged as one of the indications for starting statin therapy, given its well-established cardiovascular protective effects. Statins are inhibitors of HMG-CoA reductase and are considered the most effective class of drugs for reducing low-density lipoprotein cholesterol (LDL-c) levels [16, 17]. However, recent studies on the pharmacological mechanisms of statins have suggested that they have the potential to accelerate vascular calcification, implying the possibility of a paradoxical effect [18].

The increase in vascular calcification by statins can be mainly attributed to their pleiotropic, LDL-independent, effects. The concept of statin pleiotropy emerged after fully accounting for statin’s clinical influence on CVD, and it received substantial pharmacological and molecular investigations [19, 20••]. Notably, statins can inhibit the synthesis of vitamin K2, a key cofactor for matrix Gla-protein in blood vessels, crucial for protecting against calcification [20••, 21,22,23]. Additionally, statins can suppress various macrophage phenotypes, promoting calcium deposition [23,24,25]. Interestingly, the burden of calcified atherosclerotic plaques has been found to correlate with the overall coronary plaque burden [18, 26]. However the effect of statins on CAC in the general population remained uncertain.

Several previous human subject studies have been conducted to investigate the association of statin use and CAC, with major variations in design and methodology. A wide range of evidence from observational studies has suggested that statin use increases CAC [27,28,29,30,31,32]. Some others, however, have concluded that statins reduce CAC [33,34,35,36]. Randomized controlled trials (RCTs) have also shown contradictory results [37,38,39,40]. Consequently, the genuine effect remains disputed and inconclusive [20••, 41]. There are some drawbacks that limited the previous systematic literature reviews and meta-analyses on this association [42,43,44,45]. They mostly did not thoroughly consider the profound diversity of included studies in terms of study design, outcome metrics, imaging modality, and quantification methods of CAC. Furthermore, most had search strategy constraints, such as limiting the publishing period to after a certain date, focusing on those reporting CVD events, or limiting the source population to specific comorbidities.

Despite these extensive researches on the relationship between statin use and CAC, the genuine effect of statins on CAC in the general population remains unclear. Therefore, to provide a more comprehensive understanding of prior research and to reduce sources of uncertainty and conflicting results, we aimed to conduct a systematic literature review and meta-analysis. Our objective is to investigate the association between statin use and CAC in the general population while accounting for variations in study design and methods.

Methods

Protocol, Search Strategy, and Selection Criteria

We developed a protocol submitted to PROSPERO on 10 June 2021 and registered with the number CRD42021254187. The selection process and reporting items were based on the preferred reporting items for systematic reviews and meta-analysis (PRISMA) flow diagram and checklist [46].

We formulated a comprehensive search strategy of Embase, Medline All via Ovid, Web of Science Core Collection, and Cochrane Central Register of Controlled Trials for publications without a language or time restriction up to 10 February 2022. A manual search to identify records through reference searches and gray, unpublished literature was conducted. Retrieved records were checked for inclusion and exclusion criteria after duplicates were removed. Observational studies and RCTs were eligible for inclusion if they investigated the association between statin use and CAC using a CT scan, the conventional imaging modality recommended for CAC scoring in clinical practice. The search strategy and inclusion and exclusion criteria were detailed in the supplementary material. In brief, any analytical observational study, comprising cross-sectional and longitudinal, that reported an effect estimate of the association between statin use and CAC was included. RCTs that measured CAC before and after the administration of statins were included. RCTs that only used statins in combination with non-statin medications in the same arm were excluded. The process was performed by two independent reviewers (M. N. S and S. M. J). Discrepancies over eligibility were resolved through consensus with a third reviewer (F. A.).

Data Selection and Extraction

Two study investigators (M. N. S and S. M. J) independently extracted the data from the included records. The studies’ characteristics, including design, participants, exposure, outcome, and summary of statistical analysis were retrieved. From observational studies, effect sizes and corresponding 95% confidence intervals (CI) were collected. Where the 95% CI of the available effect estimate was not reported, we obtained it from the reported p-value [47]. We extracted the adjusted effect size, where both adjusted and unadjusted effects were reported.

The mean and the corresponding standard deviation (SD) or the median and the corresponding interquartile range (IQR) of CAC at baseline and follow-up were extracted from the included RCTs. Extracted median and IQR converted into mean and SD [48] for a unified outcome. Using that, we calculated the mean change from the baseline and its corresponding SD by a correlation coefficient for change from baseline [49] where it was not reported. CAC scores from the first and the last scans were extracted from studies with multiple follow-up scans (> 2 CTs). When RCTs undertook multiple arms, statin and placebo arms were used. If there were no placebo arms and only two arms with different statins or different dosages of statins were given, higher and lower defined daily doses (DDD) were identified according to the values of the World Health Organization; afterward, data of the lower DDD arm was used as the control arm.

Quality and Risk of Bias Assessment

For quality and risk of bias assessment the Newcastle–Ottawa Scale [50] and Cochrane Risk of Bias assessment [51] tools were used.

Statistical Analysis

Data Synthesis and Meta-analysis

Observational studies were classified first by their design and then by their outcome metric as continuous or binary. The effect size metrics and the corresponding 95% CI were unified in each group with odds ratios (OR) for binary outcomes (such as the prevalence of CAC), and beta coefficients for continuous outcomes (representing the CAC score). In RCTs, the mean change and corresponding SD of each study were used to measure the standardized mean difference (SMD), as the effect size. We conducted a meta-analysis when there were more than two studies that utilized the same study design, belonged to the same effect metric category, and, in the case of continuous outcomes, employed the identical CAC quantification method.

To pool the effect estimates, the inverse variance-weighted method was used. Using pre-calculated ORs, all were log-transformed before meta-analysis. I2 statistics were measured to quantify the variability in effect estimate due to between-study heterogeneity rather than chance. Meta-analysis of random effect models was generally applied. Exclusively, a fixed random effect model was applied only in the case of low (I2 < 25%) heterogeneity. Prediction interval (95% PI) was incorporated into random effect models to quantify the distribution of effect estimates and provides a range within which future research effects are anticipated to fall [52,53,54]. Influence diagnostic and leave-one-out analysis was conducted to detect the outliers and influential cases. If any outlier or influential case were detected, the meta-analysis model was once recalculated after removing the detected case. Visual assessment of the symmetry of the funnel plot and Egger’s test was done to identify small-study effect and publication bias. P-values were from 2-tailed tests, and if the p-value was < 0.05, the results were deemed statistically significant. All statistical analyses were performed using R studio for Windows V2021 and {meta}, {metaphor}, and {dmetar} packages [55,56,57].

Subgroup and Meta-regression Analysis

We used subgroup and meta-regression analyses to identify effect modifiers and other sources of heterogeneity. When there were at least eight studies, continuous factors were assessed in meta-regression models. Categorical variables were tested using subgroup analysis when there were at least ten studies in the meta-analysis model [55]. Subgroup analyses were performed based on the quantifying unit of CAC, DDD of the control arm, industry source of funding, follow-up time (long or short), quality assessment results (QAR). Each of the following variables was assessed in a separate meta-regression model: publication year, male proportion, average age, percentage of known CVD, mean baseline CAC score, follow-up time, and (QAR) (when continuous).

Synthesis Without Meta-analysis (SWiM)

Where including a study in a meta-analysis model was not possible, we used SWiM guidelines in systematic reviews [58], which occurred, for instance, when the description of the outcome of the association was too diverse from others to yield a meaningful summary estimate. We used arrows to visually summarize the direction of effect estimates for each study’s results. An indication of study size and statistical significance was used for the arrows, using size and color [58, 59].

Grading of Recommendations Assessment, Development, and Evaluation (GRADE)

The GRADE approach was used as a systematic and transparent judgment tool to assess the quality of the body of evidence for each outcome reported in the systematic review [60], as suggested by the Cochrane Collaboration [49]. The findings from the synthesis, with or without meta-analysis, were graded based on study design and seven other criteria. Downgrading factors included within-study risk of bias, imprecision of effect estimates, inconsistency, and indirectness. Upgrading factors included dose–response gradients, a large enough effect, and no plausible confounding or obvious bias. The GRADE method specifies quality into four levels: high, moderate, low, and very low.

Results

Search Results and Study Details

From 2377 initially identified records by the search, 188 full-texts were assessed for eligibility, leaving 41 original articles for inclusion in this systematic review. The PRISMA flow diagram of the selection process is available in the Supplementary material, Fig. S1. There were 13 cohort studies [27,28,29, 33, 34, 61,62,63,64,65,66,67,68], 16 cross-sectional studies [30, 32, 35, 36, 69,70,71,72,73,74,75,76,77,78,79,80], and 12 RCTs [37,38,39,40, 81,82,83,84,85,86,87,88] in this systematic literature review. This involved 12,520, 7072, and 1791 individuals, respectively. The average follow-up time per person was longer in cohorts compared to RCTs (3.90 years (SD = 1.27) and 1.25 years (SD = 0.25) respectively). The summarized features of included articles such as the study sample size, comorbidities, age, sex, the prevalence of CVD and statin use, follow-up time, and quantification method of CAC are presented in Tables 1, 2, and 3. In the last column of each table, the overall results of each study’s quality assessment are included. Details of the assessment are available in Tables S24.

Table 1 Characteristics of included cohort studies
Table 2 Characteristics of included cross-sectional studies
Table 3 Characteristics of included randomized controlled trials (RCTs)

Meta-analyses

Six meta-analysis models were developed by using 32 studies (Figs. S27). Table 4 provides data on the pooled results before and after removing outliers and influential cases. Across the six models, two showed statistically significant associations. One of which is the model of cohort studies which showed statin use was significantly associated with CAC — quantified as Agatston score — (B-coefficient: 0.11, 95% CI: 0.05; 0.16). The model showed no heterogeneity (I2: 0.0%), and the included cohorts had an average follow-up per person of 3.68 years. The other model is the model of cross-sectional studies. The pooled odds of the presence of CAC in those who used vs did not use statins was 2.11 (95% CI: 1.61; 2.78). This model had high heterogeneity (I2: 69.7%), and a statistically significant prediction interval (95% PI: 1.00; 4.53). The available data were insufficient to standardize the beta coefficients before the meta-analysis. In RCTs, the pooled effect estimates with CAC quantification in Agatston score or volume were not statistically significant (SMD: − 0.06, 95% CI: − 0.19; 0.06, I2 = 0.0% and SMD: 0.26, 95% CI: − 0.66; 1.19, I2 = 77.6%; Table 4).

Table 4 Pooled effect size estimate of the six meta-analysis models before and after removing the outliers and influential cases

Subgroup and Meta-regression Analysis

The subgroup analyses showed a statistically significant difference in the pooled effect size based on whether CAC was quantified as Agatston score or volume (Table 5; 95% CI: [0.02 to 0.62]). Detailed results of other subgroup analyses are presented in Tables S69.

Table 5 Subgroup analysis of randomized controlled trials (RCTs) based on the quantifying unit of CAC as either Agatston score or volume

Meta-regression was conducted in two meta-analysis models, the model of cross-sectional studies with binary effect metrics and the model of RCTs which quantified CAC as Agatston score. Table 6 displays the estimated effect for each variable in meta-regression and the difference in the true effect size explained by each variable. None of the included variables had a statistically significant modification in the estimated effect.

Table 6 Meta-regression of the meta-analysis models

Small-Study Effect and Publication Bias

Visual inspection of the funnel plots and the Eggers’ regression test results indicate small-study effects in the meta-analysis of RCTs quantified CAC in volume (Egger’s intercept: 3.78, 95% CI: 1.41; 6.16). The contour-enhanced funnel plot shows that the available small studies are more likely to have a larger effect size and a more significant result when compared to the larger available studies. The funnel plots and the details of the Egger regression tests are available in Figs. S1420 and Table S10, respectively.

SWiM

Inclusion in the meta-analysis was precluded for nine observational studies. In one cross-sectional study with a binary outcome, statin use was significantly associated with the CAC score > 238 [73]. In the two other studies with a continuous outcome, statin use was significantly associated with rising calcium mass scores and absolute calcified plaque counts [31, 32]. One cohort study with a dichotomous outcome revealed that statin use was significantly associated with a rapid progression of CAC [64]. One out of the four studies with a continuous outcome showed statin use was significantly associated with a decrease in the median CAC progression [33]; the other three identified that statin use was associated with an increase in CAC [27, 67, 68], with only one reaching statistical significance. Tables S1112 provide SWiM-detailed results accompanied by the reasons for exclusion from meta-analysis, according to McKenzie and Brennan [89].

GRADE

The confidence level of the body of evidence in RCT meta-analyses was high, while the confidence levels in observational studies, both with and without meta-analysis, were rated as low or very low. Detailed results of the GRADE approach are available in Table S13.

Discussion

The present study summarized quantitative evidence from previously published observational studies and RCTs on the association between statin use and CAC detected by CT scans in the general population. In observational studies, two meta-analysis models showed significant associations: one between statin use and increasing CAC, as quantified in the Agatston score, and the other with the presence of CAC. These results were inferred from cohort studies with an average follow-up time per person of 3.68 years and cross-sectional studies, respectively. Meta-analyses of RCTs did not reach statistical significance; however, the pooled effects differed significantly depending on whether CAC was quantified as Agatston score or volume.

Our findings from observational studies are supported by the results from in vivo investigations of the effect of statins on vascular calcification, which suggest an increase in calcification. These findings, potentially, contradict the expected cardiovascular protective effects of statins on CAC. Some explanations for this effect propose that statins may stabilize atherosclerotic plaques and prevent plaque rupture by increasing calcification. It is important to note that the beneficial effects of statins on coronary atherosclerotic plaque primarily manifest through their ability to slow down or reverse the progression of soft plaque components. Soft plaque components including lipid-rich core are particularly dangerous because such plaques are unstable and vulnerable to rupture [18, 90,91,92]. However, we would like to highlight that our findings from meta-analyses of observational studies were influenced by the inherent limitations of observational association studies which employed descriptive or etiological approaches. Among the most important ones is confounding and, in the case of our study, confounding by indication, which compromises the reliability of the conclusions regarding the genuine effect of statins. These inherent limitations also resulted in low or very low rates in the GRADE system used to assess the quality of evidence. As a result, it is uncertain how much of the observed effect estimate, suggesting an increase in CAC, would remain after accounting for confounding related to statin indication. Interestingly, the results from the two meta-analysis models of observational studies with statistically significant findings suggest a consistent pattern of association across included studies. This implication is supported by the use of a fixed-effect meta-analysis for cohort studies and the significant prediction intervals obtained from the random-effect meta-analysis for cross-sectional studies, after excluding the outliers. Therefore, there is a high probability that future observational studies employing similar designs and outcome metrics will yield similar effect estimates, showing an increase in CAC, in this level of multivariable-adjusted associations [53, 54]. Future observational studies are needed to address confounding using innovative causal inference methods. These methods can estimate causal effects without the need to measure all confounding factors, allowing for conclusions regarding the drug’s effects in the general population.

Our findings from RCTs in both quantification methods did not reach conventional levels of statistical significance after an average of 1.25-year period. The findings have a high-quality score, as indicated by the GRADE approach. They are also clinically relevant and promissings since an increased amount of CAC over serial scans has been closely associated with a higher risk of future cardiovascular events [11,12,13]. However, we believe that with this quite solid and yet limited available evidence obtained from RCTs, we cannot yet draw a comprehensive conclusion about the full scope of statin’s effect on CAC, since the drug’s effect on the CAC could be dynamic over time. Although the effect modification of follow-up time on the results from RCTs did not reach the statistical significance in both subgroup and meta-regression analysis, both analyses showed a trend toward smaller effect sizes as the follow-up was prolonged (range of 1–2 years) which aligns with our expectations. The pooled effects in cohorts and RCTs were assessed at the point in their follow-up period after the effect was detectable in clinical settings. This is inferred from statins’ in-clinic response time ranging from 4 to 12 weeks [93]. If the effect size were to continue increasing indefinitely, CAC would eventually reach a point where it obstructs blood flow, contradicting the established cardiovascular protective effect of statins. As a result, we expect the estimated effect size of statins on CAC to decline over time. This dilution of effect could possibly be affected in part by the natural aging-related increase in CAC. We think that a more significant effect size may be observed by assessing CAC immediately after and near the in-clinic response time. Considering the limited duration of time and degrees of freedom supplied by our data, which prevents drawing furthur conclusions about the effect in short-term vs long-term follow-up, future studies with tailored follow-up time customized to the pharmacological properties of statins are warranted.

The model of RTCs in volumetric units, one of our six meta-analysis models, was influenced by the small-study effect, which may indicate publication bias. This is despite our efforts to reduce the likelihood of this bias by undertaking the reference search and covering the gray and unpublished literature in our manual search. Since we observed that the small studies were more prone to publication bias and were more likely to have significant results, we think the missing studies are small studies with insignificant results. As a result, outcome reporting bias is probable, provided that RCTs mainly measure multiple outcomes of interest. It is likely that insignificant results in volumetric units, a less common method of quantification in clinics, were dropped.

The RCTs in this meta-analysis allowed us to compare the pooled effect estimates across the two quantification methods. Although the Agatston score and volume remain very closely correlated [94], the significant difference between the pooled effects could be attributed to what each scoring technique measures and the limitations inherent in each technique. The most commonly used method is the Agatston score, which is calculated by multiplying the total area of voxels (mm2) by an arbitrary density index based on the voxel with the highest density, ranging from one to four [94,95,96]. As a result, it is affected simultaneously by calcification area and density, and due to the weighted density index, it grows non-linearly as density increases. CAC volume score provides an alternate method of CAC scoring. It is calculated by multiplying the total number of calcium-containing voxels by the volume of 1 voxel [94, 95]. Volume score is considered more relevant for assessing CAC evolution over time since it allows for a linear augmentation when calcium rises [94, 95, 97]. By contrasting the pooled effect estimates between the two quantification techniques, we have highlighted the distinct potential effect of statin use on the density, area, and volume of CAC.

Recent data suggests that increased density in calcified coronary plaques could be protective against coronary artery diseases and major adverse events, consistent with the concept that it may increase the stability of atherosclerotic plaques and reduce the risk of plaque rupture [41, 98••]. The Agatston score, which was developed without specific histopathological data for correlation, assumes, by definition, that high-density CAC plaque is associated with a higher incidence of coronary artery diseases [94]. Recent evidence using coronary CT angiography and intravascular ultrasound (IVUS) has supported statin’s effect predominantly on increasing dense calcium [98••, 99]. Although the Agatston score is an independent predictor of cardiovascular risk, we hypothesize that by distinctly adding CAC density, area, and volume, the predictive value might be improved since the impact of statins on these measures could be different. Additionally, we believe there may be added value in using the inverse relation between CAC density and volume. This may help quantify what is already suggested as the stability and maturity of calcium within an atherosclerotic plaque.

Limitations and Strengths

We studied drug use as the exposure of the association under investigation; therefore, the duration and intensity of the exposure were not covered by the pooled results. Studying the general population, we designed to investigate the effect of the population’s risk profile on the association, which showed no modification in the effect. However, since few records were included in some meta-analysis models, we could not run subgroup or meta-regression analyses across all the models. We had limited data to provide a standardized measure of the beta coefficients in the analyses. However, it may not change the overall trend of the results considerably, given that the outcome data in most studies underwent the same transformation. Our meta-analysis employed study-level data, not individual patient-level data, which could have assisted in overcoming some of the already-mentioned limitations of this study.

Our study has a number of strengths. Statins have received the most extensive research attention in cardiovascular pharmacology. Our findings, building upon previous results, provided new insight into our understanding of the effect of these medicinal products on CAC in the general population. We assessed our findings’ robustness, inspected heterogeneity patterns, and discussed reasons why the effects differed. These approaches helped us develop informed hypotheses and conclusions from observational studies and RCTs in our systematic review and meta-analysis.

Conclusion

Drawing from our findings, the effects of statins on CAC demonstrate variations across different study designs and effect size metrics. Cohort and cross-sectional studies suggest a significant association between statin use and CAC score progression and CAC presence, respectively. Nevertheless, RCTs did not determine a significant effect, with the effects being different across the quantification unit of CAC, hindering forming a conclusion. Interpretation should consider limitations inherent in included studies, namely confounding by indication in observational studies, variations in CAC quantification method, and limited follow-up time points.

To gain a better understanding of this association, a large long-term RCT is required to consider the effect over customized follow-up times aligned with the pharmacological properties of statins. However, ethical, medical, and logistical aspects may restrict the feasibility of it. Observational studies with large sample sizes drawn from the general population could shed light on this association by taking one step towards addressing confounding by indication and establishing causality. Furthermore, future studies should independently assess the volume and density of the observed effects, thus providing a more detailed characterization of the effect.