Background

Cardiovascular diseases (CVD) in recent times have been reported by the world health organization (WHO) as the leading cause of death worldwide, with high serum low-density lipoprotein cholesterol (LDL-c) a major risk factor of CVD worldwide (Ephraim et al. 2018). It is estimated by WHO that about 17.9 million lives are lost every year due to CVD. Many studies have consistently reported a strong positive correlation between CDV and high serum LDL-c, hence the need for an accurate estimation of LDL-c in clinical practice to monitor and evaluate patients risk of developing CVD (Martin et al. 2013; Ephraim et al. 2018).

Several novel formulas for calculating serum LDL-c have been reported in many research studies all over the world as an ideal formula better with regard to accuracy (Puavilai and Laoragpongse 2004; Puavilai et al. 2009; Martin et al. 2013; Mehta et al. 2018; Alpdemir and Alpdemir 2021) yet there seems to be no consensus on the best method of estimating serum LDL-c with regard to accuracy of results and convenience. The direct estimation method of measuring serum LDL-c is widely accepted as the most accurate method of estimating LDL-c but this method cannot be routinely employed in many clinical settings due to cost and the fact that it is quite complex to use (Puavilai et al. 2009). So due to cost and complexity of direct estimation of serum LDL-c, the indirect method of estimating serum LDL-c using standardized Friedewald’s equation (sFE) is routinely employed in many clinical settings for assessing the coronary risk of patients (Ballantyne et al. 2000). Many other studies compared several other indirect serum LDL-c measurement formulas which were generally referred to as modified Friedewald’s equation (mFE) in many of those research studies and reports from those research studies indicate that most of proposed modified formulas are more accurate in estimating serum LDL-c in instances of hypertriglyceridemia (Puavilai and Laoragpongse 2004; Alpdemir and Alpdemir 2021).

Elevated triglycerides often above 300 mg/dl in serum results in a significant accuracy decline in the indirect estimation of serum LDL-c using standardized Friedewald’s equation according to many research studies, and this is the practical limitation of the standardized Friedewald’s equation (Puavilai and Laoragpongse 2004; Ephraim et al. 2018; Mehta et al. 2018). This leads to a misclassification of patients with high coronary risk index as patients with low or normal coronary risk index (Mehta et al. 2018). Another significant limitation of sFE is that serum LDL-c of a non-fasting sample cannot be accurately measured (Puavilai et al. 2009).

Regardless of the reported limitations of sFE, a research study conducted by Puavilai and Laoragpongse (2004) reported that calculating LDL-c using the sFE produced results that strongly correlates with direct measurement of LDL-c which is used as a reference (gold standard) in many of the research works which seeks to assess the validity or accuracy of sFE results compared to other modified LDL-c calculation method. However, the researchers of this study noticed that the LDL-c level from the direct measurement (reference standard) was higher than calculating one using sFE if the serum triglyceride is higher than 300 mg/dL (Puavilai and Laoragpongse 2004).

Hence, due to the lack of consensus on the best means of calculating an important clinical parameter as serum LDL-c there is a need for a broader or a more holistic approach in determining the best means of estimating LDL-c in serum in terms of accuracy and a systematic review which seeks to deliver a meticulous analysis and summary of all relevant research works about this subject can best achieve that.

Methods

Literature research

This review study systematically identified and critically evaluated and analyzed existing primary literature to make recommendations on the best equation to estimate LDL-c in resource limited areas. In this literature review, the Medline (PubMed), ScienceOpen, Biomedcentral and SpringerLink electronic databases were searched. The search was based on the following keyword terms or their derivatives or synonyms: LDL cholesterol and equation. The search used ‘free terms’ and ‘index terms’ funneled using the Boolean operators (OR, AND), into the search string; (LDL cholesterol OR low-density Lipoprotein cholesterol) AND (equation OR formula) as described in Table 1.

Table 1 Search term strategy used

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines for conducting and reporting systematic review were followed.

Criteria for considering studies for this review

Studies whose report did not present enough information to allow full assessment of eligibility or studies reported only in a non‐English language were excluded. Published studies which are not original or primary research articles such patents, reviews, reports, newsletters, book, book chapters and conference papers were excluded from this study. Primary studies that employed diagnostic method comparison or method validation of calculated LDL-c were considered eligible for this study. Studies that were published earlier than 2001 were considered ineligible for review (Table 2).

Table 2 Inclusion and exclusion criteria for selection of studies

Participants

Studies recruiting or involving populations of healthy and diseased subjects who meet the condition required for testing for serum lipids including being in a fasting state were eligible for inclusion (Table 2).

Index test and reference standards

Studies were required to estimate LDL-c using methods of direct measurement/assay of LDL as reference standard with or without an added test method as reference standard. Studies evaluating, validating or comparing any equation for LDL-c estimation in addition to the Friedewald’s LDL-c equation were considered eligible for inclusion. Studies that evaluated only one LDL-c equation were considered ineligible (Table 2).

Data collection

Search results from the various databases were subjected to automated screening for eligibility using electronic filters available in the electronic databases based on pre-determined eligibility criteria as described in Table 2. Using pro-forma, eligibility of potentially relevant articles was assessed based on full text publications excluding non‐English language reports. Due to the number of studies identified, hand search and search of bibliography of included studies were not sourced. The outcomes of the selection process are described in Fig. 1.

Fig. 1
figure 1

PRISMA flowchart on selection of studies in this review

Appraisal of included studies and risk of bias

Two authors independently assessed the risk of bias of included studies, and disagreements were resolved through discussion. The Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) was employed to evaluate the methodological quality. Four domains of entries were participant’s selection, index test, reference standard and flow and timing. Definitions and judgment criteria for each domain are available in Cochrane Handbook.

Data extraction and management

Data relating to the correlation coefficient ‘r’ (Pearson correlation coefficient, Spearman rank correlation and intraclass correlation coefficient) estimation of each comparison of index test (calculated LDL-c equation) with reference test (direct LDL-c assays) were extracted and used for the meta-analysis. In three studies where coefficient of determination (r2) was reported, correlations coefficient was recalculated using the formulae, and direction of slope (√r2) was reported. In studies where estimates of coefficients were reported in strata or subgroups, the pooled or overall correlation coefficient was estimated by using the Schmidt–Hunter’s method for estimating pooled correlation coefficients. Also, characteristics of each study (such as the year of study publication, method of direct LDL-c used, type of study, location and sample size) were extracted from each study (Table 3). The author’s surname and year of publication of the study were used as identities for each study (Table 3).

Table 3 Characteristics of selected studies

Statistical analysis and data synthesis

For each test type, we plotted estimates of the observed correlation coefficients in forest plots. These plots illustrate the variation in test performance between studies. The Hedges–Olkin method for pooled correlation coefficient was used to calculate overall correlation coefficients. The summary estimates were plotted using forest plots. The random effect model was used as significantly large heterogeneity was observed among the studies. The StatsDirect statistical tool was used for the meta-analysis.

Investigations of heterogeneity

We inspected forest plots to visually assess heterogeneity between study-specific estimates of correlation. A heterogeneity test (I2 heterogeneity test) was also performed to ascertain the variations in the outcomes of interest among the various studies.

Results

Pooled correlation coefficients

See Figs. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 and 14.

Fig. 2
figure 2

The pooled correlation coefficient of Friedewald’s LDL-c equation compared to direct measurement in determining levels of LDL-c from all included studies. There was a positive correlation between Friedewald’s LDL-c equation and direct measurement in determining levels of LDL-c [0.94 (95% CI 0.92–0.96)]

Fig. 3
figure 3

The pooled correlation coefficient of Friedewald’s and Martin’s LDL-c equation compared to direct measurement in determining levels of LDL-c from matching studies. The pooled correlation between Martin’s LDL-c equation and the direct measurement in determining levels of LDL-c as well as the correlation between the Friedewald’s LDL-c equation and the direct measurement in determining levels of LDL-c. This correlation pooled 12 studies in the analysis. There was a positive correlation between Martin’s LDL-c equation and the direct measurement in determining levels of LDL-c [0.96 (95% CI 0.94–0.98)]. Similarly, Friedewald’s LDL-c equation positively correlated with the direct measurement in determining levels of LDL-c [0.94 (95% CI 0.92–0.96)]. However, there was a stronger positive correlation between Martin’s LDL-c equation and the direct measurement in determining levels of LDL-c compared to Friedewald’s equation and the direct method

Fig. 4
figure 4

The correlation coefficient of Friedewald’s and the Anandaraja’s LDL-c equations with direct measurement in determining levels of LDL-c from 9 matching studies. The Anandaraja’s LDL-c equation and the direct method of determining LDL-c gave a positive correlation [0.90 (0.85–0.93)] as well as positive correlation between the Friedewald’s equation and the direct method [0.94 (95%0.89–0.96)]. However, there was a stronger positive correlation between the Friedewald’s equation and the direct method compared to the Anandaraja’s equation

Fig. 5
figure 5

The correlation coefficients of Friedewald’s and the Chen’s LDL-c equations with direct measurement in determining levels of LDL-c from 8 pooled matching studies. The two equations established a positive correlation between the direct method of determining LDL-c [0.95 (95% CI 0.90–0.97)]. There was no statistical difference between the correlation coefficients of Chen’s equation and the direct measurement as well as Friedewald’s equation and the direct method

Fig. 6
figure 6

The correlation coefficients of the Friedewald’s and the de Cordova’s LDL-c equations with the direct measurement in determining levels of LDL-c pooled from 6 matching studies. The de Cordova’s LDL-c equations and the Friedewald’s equation both demonstrated a positive correlation with the direct method of measuring LDL-c [0.87 (95% CI 0.76–0.93) and 0.94 (95% CI 0.88–0.97), respectively]. However, the Friedewald’s equation established a stronger positive correlation with the direct method of determining LDL-c compared with the de Cordova’s LDL-c equation

Fig. 7
figure 7

The correlation coefficients of Friedewald’s and the Puavilai’s LDL-c equations with direct measurement in determining levels of LDL-c from 6 matching studies. Both equations demonstrated a positive correlation with the direct method of measuring LDL-c. However, the Puavilai’s equation established a stronger correlation with the direct method [0.95 (95% CI 0.91–0.97)] than Friedewald’s LDL-c equation [0.93 (95% CI 0.89–0.95)]

Fig. 8
figure 8

The correlation coefficients of Friedewald’s and Ahmadi’s LDL-c equations with direct measurement in determining levels of LDL-c from 5 matching studies. Though both equations established a positive correlation with the direct method of measuring LDL-c, the Friedewald’s LDL-c equation presented with a stronger positive correlation with the direct method [0.95 (95% CI 0.90–0.98)] than the Ahmadi’s LDL-c equation [0.65 (95% CI 0.46–0.78)]

Fig. 9
figure 9

The correlation coefficients of Friedewald’s and the Hattori LDL-c equations with direct measurement in determining levels of LDL-c from 4 matching studies. Both the Friedewald’s and the Hattori LDL-c equations demonstrated a positive correlation with the direct method of measuring LDL-c. There was no statistical difference in their correlation coefficients

Fig. 10
figure 10

The correlation coefficients of Friedewald’s and the Vujovic’s LDL-c equations with direct measurement in determining levels of LDL-c pooled from 4 from matching studies. Both equations established a positive correlation with the direct method. Nevertheless, Vujovic’s LDL-c equation demonstrated a stronger positive correlation with the direct method [0.95 (95% CI 0.93–0.96)] than Friedewald’s equation [0.88 (95% CI 0.84–0.93)]

Fig. 11
figure 11

The correlation coefficients Friedewald’s and the DeLong’s LDL-c equations with direct measurement in determining levels of LDL-c pooled from 3 matching studies. Both equations had a positive correlation with the direct method, but DeLong’s LDL-c equation gave a strong correlation with the direct method [0.96 (95% CI 0.90–0.99)] compared with Friedewald’s equation [0.95 (95% CI 0.82–0.99)]

Fig. 12
figure 12

The correlation coefficients of Friedewald’s and the Rao’s LDL-c equations with direct measurement in determining levels of LDL-c from 3 matching studies. Both equations established a positive correlation with the direct method of measuring LDL-c; however, Friedewald’s equation demonstrated a stronger positive correlation with the direct method [0.95 (95% CI 0.82–0.99)] than the Rao’s LDL-c equation [0.93 (95% CI 0.60–0.99)]

Fig. 13
figure 13

The correlation coefficients of Friedewald’s and the Sampson’s LDL-c equations with direct measurement in determining levels of LDL-c from 3 matching studies. Though both equations had a positive correlation with the direct method of measuring LDL-c, the Sampson’s LDL-c equation showed a stronger positive correlation with the direct method [0.95 (95% CI 0.89–0.98)] as compared with the Friedewald’s equation [0.92 (95% CI 0.92–0.92)]

Fig. 14
figure 14

The proportions of LDL-c equations performing better than Friedewald’s equation in studies where TG > 400 mg/dl were included or excluded. A Triglycerides levels > 400 mg/dl included. About 6 matching studies that included serum triglycerides levels > 400 mg/dl, 100% of these studies established better performance of the Choi, DeLong and Molavi LDL-c equations than the Friedewald’s equation (Fig. 14A). B: Triglycerides levels > 400 mg/dl excluded. However, in about 24 studies, 100% of the studies demonstrated better performance of the Puavilai LDL-c equation compared with Friedewald’s equation. In about 13 studies that excluded serum triglycerides levels > 400 mg/dl, 100% recorded better performance of the Ephraim’s LDL-c equation than the Friedewald’s equation. This was followed by the Martin’s LDL-c equation with 83% out of the 75 studies assessed (Fig. 14B)

Discussion

A causal relationship between LDL-c levels and CV outcomes has been demonstrated in countless randomized controlled clinical trials (Steinberg 2005; Baigent et al. 2010; Levinson 2020; Mach et al. 2020; Piani et al. 2021). Direct measurements of LDL-c are still costly, time-consuming, not fully standardized and not universally available (Nauck et al. 2002; Miller et al. 2010; Langlois et al. 2018; Piani et al. 2021). Generally, LDL-c is alternatively estimated using the empirical Friedewald equation in most clinical environments (Bairaktari et al. 2005; Brown 2020). Nevertheless, many groups have consistently evaluated the accuracy of the Friedewald equation in different ethnicities or various disease entities and proposed alternative formulas for more precise LDL-c estimation until today (Table 4). To date, none of the proposed alternative methods have replaced the Friedewald equation in routine clinical practice and the lack of consensus on the best means of calculating LDL-c necessitated this study. Thus, to determine the best means of estimating LDL-c, this study evaluates the accuracy of the different equations known to calculate LDL-c levels.

Table 4 Types of LDL-c Equations

Indeed, our data showed a positive correlation between Friedewald’s LDL-c equation and direct measurement of LDL-c (D-LDL-c) from all the included studies [0.94 (95% CI 0.92–0.96)] (Fig. 2). Overall, all the twelve formulas showed a good positive correlation with the respective D-LDL-c (Figs. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 and 13) and Martin’s equation (ME) had the highest correlation with D-LDL-c level measurement compared to all other formulas (Fig. 3). This present result can be validated from a study conducted by Rim et al. (2016) which reported that ME provided best agreement in a large Asian cohort as compared to other equations. Similarly, another study conducted on Korean population also showed that ME provided better results compared to other equations (Kang et al. 2017). However, our finding is not in agreement with the work of Cordova et al. (2020) who revealed that Martin’s equation is not accurate for estimating LDL-c in Southern Brazilians unless TG is between 300 and 400 mg/dl as well as Khan et al. (2017) whose work among the Pakistani population showed that the Teerakanchana equation (TE) provided higher correlation with D-LDL-c level followed closely by Martin equation. Also, our present results contradict studies conducted among the Indian population. They validated from their study that TE provided stronger correlation with the D-LDL-c level, followed by Vujovic equation (Dar et al. 2022), while in another study conducted among the Indian population Vujovic equation was preferred over others for calculating LDL-c level (Wadhwa and Krishnaswamy 2016). The differences in the performance (i.e., correlation with D-LDL-c level) of the equations in measuring D-LDL-c levels could be associated with the racial differences.

Again, a study conducted by Piani et al. (2021) on a large, random and blinded Italian population showed that Sampson equation provided higher correlation to measured D-LDL-c level. This outcome is not in keeping with our present data which revealed that ME is highly correlated with measured LDL-c levels. Here, we strongly believe that the difference in the outcomes could be due to the techniques employed in the estimation of the lipid levels. While Piani et al. (2021) estimated D-LDL-c level by employing the Sampson equation after all lipid samples were analyzed using direct homogeneous assay, Martin’s equation estimated D-LDL-c levels after cholesterol concentrations (LDL-c, VLDL-c and HDL-c) and triglycerides were directly measured by density-gradient ultracentrifugation and the ARCHITECT C-8000 system, respectively. These methodologies have different underlying principles and could possibly influence the estimated D-LDL-c level.

Moreover, our study found the lowest correlation between Ahmadi’s equation (AE) and the measurement of D-LDL-c level [0.65 (95% CI 0.46–0.78)] (Fig. 8). Interestingly, our findings are consistent with the results of Karkhaneh et al. (2019) and Molavi et al. (2020), where AE provided lowest correlation for Iranian population likewise in the Pakistani population (Khan et al. 2022). The reason why Ahmadi equation does not perform well could be due to the fact that this equation was derived only from the patients with high cholesterol (> 250 mg/dL, > 6.45 mmol/L) (Rim et al. 2016).

In addition, one interesting finding in our study is that studies that used LDL-c equations (Choi, DeLong, Puavilai and Molavi) established better performance (100%) than the Friedewald equation (FE) at serum triglycerides (TG) levels > 400 mg/dl (Fig. 14A). This is partly in conformity with the report of Wadhwa and Krishnaswamy (2016) who compared LDL cholesterol estimated using various formulae with directly measured LDL cholesterol in Indian population. While our study validates that Choi, DeLong, Puavilai and Molavi LDL-c equations gave better performance (100%) than the Friedewald equation (FE) at TG levels > 400 mg/dl, they reported that for TG range > 400 mg/dL, Vujovic equation correlated best with direct measurement of LDL-c levels. Nonetheless, this is contradictory to the study conducted by Sha et al., in Bangladeshi population who concluded that FE can be used up to serum TG concentration of 700 mg/dL (Sha et al. 2015). Here, the variance in the performance of the equations in estimating D-LDL-c levels at serum triglycerides (TG) levels > 400 mg/dl may be associated with sample size, fasting and non-fasting state of study participants, ethnicity and procedures employed in estimating lipid levels.

Furthermore, Cordova and Cordova suggested a new formula that performed better than Friedewald formula in Brazilian population over a wide TG range (de Cordova and de Cordova 2013). The results of this present study show FE performed better than Cordova equation (CE). Hence, we are in support of the findings of Wadhwa and Krishnaswamy (2016) that Cordova formula does not provide any significant advantage over Friedewald formula in Indian population. This study supports the study done by Gupta and Singh (2012), which concluded that Anandaraja formula does not provide any advantage over Friedewald formula for LDL-c estimation in Indian population (Gupta and Singh 2012). This is contradictory to the study done by Anandaraja et al., which found their formula more accurate than Friedewald formula for TG < 350 mg/dL (Anandaraja et al. 2005). Also, the modified Friedewald equation developed by Puavilai also correlated well with direct measurement and performed better than Friedewald formula at TG range > 200 mg/dL in Thai population (Puavilai and Laoragpongse 2004). It could be observed clearly that changes in TG range have the ability to affect the final value of the D-LDL-c irrespective of the equations that would be used.

Again, Ahmadi equation (AE) has been validated in Iranian subjects at TG < 300 mg/dL (Ahmadi et al. 2008). In our study, it performed better than FE at TG > 400 mg/dL. However, when TG > 400 mg/dL were excluded, it performed poorly than FE (Fig. 14A and B). Hence, it might not be suitable to be used in other populations.

Besides, Hattori equation (HE) developed by Hattori et al. had shown to perform better than FE in Japanese population which does not provide any advantage over FE in Indian population (Hattori et al. 1998). Our study outcome indicates that HE performed better than FE at TG > 400 mg/dL. However, without TG > 400 mg/dl inclusion, it performed poorly than FE (Fig. 14A and B). Again, this supports the claim that some of the D-LDL-c might not be suitable to be used in other populations because of racial differences.

Strength and limitations

The present study has one major strength and some limitations. We have analyzed all the principal LDL-c estimation formulas used in several studies. Hence, we strongly believe that our study is a representative of various racial origins and gives a generalizability to other populations. However, one major limitation in this study is that some studies were not included in the meta-correlational analysis due to insufficient information. In addition, several studies that we used in our study measured direct LDL-c by different assay methods such as Roche analyzer and Sekisui reagent instead of the recommended gold standard method of beta quantification. This could lead to incorrect comparisons of formulas with direct LDL-c data obtained from different methods.

Even though Martin’s equation has proven to be a better assessment tool than the other equations, it is limited by several factors. First, the Martin’s equation requires validation in an independent population and using other laboratory techniques. Second, factors such as race/ethnicity, obesity, diabetes and insulin resistance, which may affect variance in the adjustable factor (TG/VLDL-c ratio), were not available for analysis during the formulation of the Martin’s equation. Finally, the study that formed the Martin’s equation examines 1-time estimation of lipid levels. This is a major limitation because the study did not address the problem of intraindividual variation in lipid levels which has a greater potential to influence the adjustable factor.

Conclusion

Our data suggest that all the twelve formulas showed positive correlation with the respective D-LDL-c measurements. Martin’s equation provided the highest correlation with D-LDL-c measurement than the most used Friedewald equation. Again, at high TG ranges (i.e., TG > 400 mg/dl), Martin’s equation gave a better performance than Friedewald equation. Therefore, Martin’s equation can serve as a more accurate method to estimate LDL-c as compared to Friedewald equation especially is situations of the same ethnicity or race.