Introduction

In recent decades, the prevalence of diabetes mellitus (DM) has risen significantly worldwide, becoming a progressively more serious public health problem1. According to the International Diabetes Federation2, China, India, and the United States are the three countries with the highest rates of diabetes in 2019. This trend is predicted to continue to rise from 2030 to 2045, with China and India facing a huge burden of the disease.

Diabetic retinopathy (DR), one of the most common chronic complications of diabetes, is also the leading cause of irreversible visual impairment in adults, and in severe cases, can even lead to permanent blindness3. Notably, patients with DR may not have obvious symptoms until the disease is advanced. It is estimated that about 30% of the approximately 463 million patients diagnosed with diabetes worldwide also suffer from diabetic retinopathy4, and in our previous study5, we found that the prevalence of DR in the Xinjiang region of China was as high as 42.26%, which is much higher than the global average. In addition, according to relevant guidelines, it is recommended that diabetic patients should undergo a comprehensive eye examination within 5 years of diagnosis or at the time of diagnosis4,6. However, current DR diagnostic methods mainly rely on the detection of clinical symptoms, and although advances in imaging technology have improved diagnostic accuracy, these methods are costly, labor-intensive, and time-consuming when performing large-scale DR screening7. On the other hand, China, as a developing country, suffers from a lack of capacity for early detection of diabetic retinopathy and lacks a national DR screening program8. There are various reasons for this problem, including the lack of effective screening tools, shortage of ophthalmic professionals, and insufficient multidisciplinary strategies from image capture to DR diagnosis9. Further, regional economic disparities and differences in lifestyle make it challenging to replicate a DR management approach in China. This is particularly problematic in settings with limited healthcare resources, especially in regions where DR screening rates are significantly lower than the national average, such as Xinjiang, China. Therefore, the development of a simple, intuitive, and personalized prediction model is essential to stop the progression of the disease and protect patients vision10.

Prior to this study, we have developed a prediction model for DR in Xinjiang5. However, this model is highly dependent on whether the patient also has diabetic peripheral neuropathy (DPN), which occupies an extremely important position in the model, which greatly increases the difficulty of generalizing the model. In another of our studies11, we found that 25 hydroxyvitamin D3 [25(OH)D3] is an important risk factor for DPN. Also, existing studies have shown a strong relationship between 25(OH)D3 and DR12,13,14. In addition, the viewpoint that ApoA1 is more stable compared to lipid indicators has been proposed, but so far, no research has constructed DR risk prediction models using 25(OH)D3 and ApoA1 as risk factors. Therefore, we plan to construct a novel DR risk prediction model by using 25(OH)D3 as a risk factor for the first time based on previous studies. This model is intended to provide a reference for clinicians in the early screening, diagnosis, and treatment of DR, as well as the possibility of prioritizing retinal screening for patients who are at risk for DR but are unable to undergo routine eye examinations.

Methods

Study design and population

This is a retrospective analysis. The study protocol was approved by the Ethics Committees of First Affiliated Hospital of Xinjiang Medical University and Suzhou BenQ Hospital with a waiver for informed consent. The research methods were carried out in accordance with relevant guidelines and regulations.

In the development group, the retrospective study collected data of 2381 patients pathologically diagnosed with T2DM who were first admitted to the First Affiliated Hospital of Xinjiang Medical University in Xinjiang, China, between Jan 1, 2019 and Jun 30, 2022. In the external validation group, 962 patients with T2DM were Suzhou BenQ Hospital in Jiangsu, China, from Jul 1, 2020 to Jun 30, 2022.

The diagnostic criteria for type 2 diabetes mellitus was based on the Guideline for the prevention and treatment of type 2 diabetes mellitus in China (2017 Edition)15. According to these guidelines15, at the end of the routine examination, all patients were injected with 5 ml of sodium fluorescein at a concentration of 100 g/L into the elbow vein for fundus fluorescence angiography. About 10 s after the injection, a picture was taken with a KOWA fundus fluorescence camera. Subsequently, the captured images were analyzed using computer image processing techniques. Finally, two ophthalmologists reviewed and analyzed the results and made a diagnosis of diabetic retinopathy.

DPN: (a) abnormal vibration perception, (b) abnormal temperature perception, (c) ankle reflex disappeared, (d) nylon filament examination, foot sensation decreased or disappeared, and (e) nerve conduction velocity (NCV). If someone has two or more of the above phenomena, he/she is diagnosed as having DPN.

Inclusion criteria were as follows: (a). all research participants were able to communicate independently, and (b). patients at least 18 years old. Exclusion criteria were as follows: (a). incomplete clinical data, (b). Severe hepatic and renal insufficiency, and (c). Combined familial fundus disease.

Data collection

All baseline clinical characteristics, including gender, age, white blood cell, total cholesterol, high-density lipoprotein, low-density lipoprotein, direct bilirubin, indirect bilirubin, total bilirubin, aspartate aminotransferase, alanine aminotransferase, body mass index, systolic blood pressure, diastolic blood pressure, serum creatinine, hemoglobin A1c, glycosylated serum protein, apolipoprotein A1, apolipoprotein B, fasting blood glucose, triglyceride, blood urea nitrogen, 25-hydroxyvitamin D3, cystatin C, 2-h postprandial blood glucose,homocysteine, blood glucose, DR.

Statistical analysis

Statistical analysis was performed using R software (version 4.2.1). The continuous variables with normal distribution and homogeneity of variance are expressed as means ± standard deviations and tested using an independent sample t-test. Categorical variables were reported as percentages and compared using the Chi-square test. Independent risk factors were selected using least absolute shrinkage and selection operator (LASSO) regression and multivariate logistic regression16. A nomogram for identifying DR occurrence was constructed using covariates selected by multivariate regression, and discriminatory ability was assessed by measuring the area under the receiver operating characteristic curve (ROC)17. The Hosmer–Lemeshow goodness of fit test was used to assess the model fit and efficacy of the risk model18. Decision curve analysis and clinical impact curves were used to assess the clinical applicability of the nomogram19. All statistical analyses were two-sided, and statistical significance was set at p < 0.05.

Institutional review board statement

This study was conducted by the Declaration of Helsinki and was approved by the Ethics Committees of First Affiliated Hospital of Xinjiang Medical University (K202105-05) and Suzhou BenQ Hospital (SZMJYY2022102001) with a waiver for informed consent.

Informed consent statement

Waiver for informed consent.

Results

Patient characteristics

A total of 3537 patients with T2DM were screened (2526 in the development group and 1011 in the external validation group, respectively), of which 194 were excluded due to incomplete medical records (145 in the development group and 49 in the external validation group, respectively), and 3343 patients were finally included.

The detailed baseline demographics and clinical characteristics of the patients in the two groups are listed in Table 1. For the development group, the clinical information a total of 2381 patients was obtained from First Affiliated Hospital of Xinjiang Medical University. The patients (1464 males and 917 females) had a mean (SD) age of 57.268 ± 12.128 years of which the prevalence of DR was 44.6%. For the external validation group, the clinical information a total of 962 patients was obtained from Suzhou BenQ Hospital. The patients (627 males and 335 females) had a mean (SD) age of 57.279 ± 12.710 years, of which the prevalence of DR was 36.4%.

Table 1 Baseline characteristics of all patients in the development group and external validation group.

Screening for predictive factors

We use the 2381 patients in the development group to choose features with nonzero coefficients in the LASSO regression model. The variables in the model were gradually lowered when the penalty coefficient was changed. Eventually, the tenfold cross-validation error was chosen as the minimum λ + 1 (lambda. 1se = 0.024) and optimal value of the model; at this point, we have 5 variables (Fig. 1). These potential predictors consisted of Neutrophil, 25(OH)D3, Duration of T2DM, HbA1c, and ApoA1. These five candidate variables were then analyzed using multivariate logistic regression. Ultimately, the five variables of Neutrophil (OR = 0.908, 95% CI (0.870, 0.945), p < 0.001); 25(OH)D3 (OR = 0.877, 95% CI (0.863, 0.891), p = 0.001); Duration of T2DM (OR = 1.167, 95% CI (1.134, 1.210), p < 0.001); HbA1c (OR = 1.172, 95% CI (1.094, 1.213), p < 0.001); and ApoA1 (OR = 1.806, 95% CI (1.311, 2.492), p < 0.001) were statistically significant and hence selected for the development of the prediction model (Table 2).

Figure 1
figure 1

Demographic and clinical feature selection using the LASSO regression model.

Table 2 Multivariate Logistic Regression analysis for risk factors of DR.

Risk prediction nomogram development

Based on the LASSO regression and multivariate logistic regression analysis, and clinical correlation results, five predictors from the development group were included in the DR risk prediction model and presented as a nomogram (Fig. 2). The nomogram illustrated that each predictor corresponded to a specific score ranging from 0 to 100. The total score is determined by adding the scores of each predictor and is located on the “Total Points” axis. The probability of DR corresponds to the bottom “Diagnostic possibility” axis of each patient. A higher score for a related factor in the nomogram indicates a higher risk of developing DR.

Figure 2
figure 2

Nomogram for the perinatal prediction of DR. DR: diabetic retinopathy.

Predictive accuracy and net benefit of the nomogram

For the development group, The area under the curve (AUC) was 0.834(95%CI 0.820–0.849) (Fig. 3A), and the calibration curve was close to the ideal diagonal line (Fig. 4A), Hosmer–Lemeshow test showed the model was in line with observed data (p > 0.05). In addition, 962 patients from Suzhou BenQ Hospital were used for the external validation to test the nomogram. The AUC was 0.851(95%CI 0.829–0.874) (Fig. 3B), reflecting a good accuracy of the nomogram. Meanwhile, the model had good consistency, and the calibration curve of the external validation group was also close to the ideal diagonal line (Fig. 4B), Hosmer–Lemeshow test showed the model was in line with observed data (p > 0.05) as well.

Figure 3
figure 3

ROC curves. (A) Development group. (B) External validation group. ROC: receiver operating characteristic; AUC: area under the ROC curve.

Figure 4
figure 4

Calibration curve for predicting probability of DR. (A) Development group. (B) External validation group. DR: diabetic retinopathy.

The result of decision curve analysis for the nomogram is presented in Fig. 5 (Net benefits for different threshold probabilities were showed in Table S1, Supplementary Material). In the development and external validation groups, the decision curve showed that if the threshold probability of a patient were in the range of 0–0.92 and 0–0.95, respectively, which using the model achieved more net benefits than the "full treatment" or "no treatment" strategy. There was a broad spectrum of alternative threshold probability, suggesting that the model was a good assessment tool.

Figure 5
figure 5

Decision curve analysis in prediction of DR. (A) Development group. (B) External validation group. DR: diabetic retinopathy.

Discussion

Building on our previous study, the present study innovatively developed a DR risk prediction model that integrates five key variables: Neutrophil, 25(OH)D3, Duration of T2DM, HbA1c and ApoA1. The model not only demonstrated excellent predictive ability, but also filled an important gap in previous studies through external validation. During the development and external validation phases of our study, the model achieved AUC values of 0.834 and 0.851, respectively, which is a significant advantage over previous models (0.709 and 704, respectively)20. The calibration curve and Hosmer–Lemeshow (H–L) test results further confirmed the high accuracy of the model in adjusting the actual and predicted probabilities of outcome events. In addition, decision curve analysis (DCA) demonstrated that implementation of the model in clinical practice yields significant net benefits, thus highlighting its practical relevance in the field of medical research.

Our study reveals a significant positive correlation between ApoA1 and the severity of DR. Previous studies addressing the effect of ApoA1 on DR are scarce. However, recent studies have shown a positive correlation between high-density lipoprotein (HDL) and DR. Given that ApoA1 is the major structural protein of HDL, this finding provides some support for the hypothesis that ApoA1 may have an effect on DR. In a pivotal case–control study, Simo et al.21 compared ApoA1 expression in the retinas of diabetic and nondiabetic patients. It was found that overexpression of ApoA1 is an early phenomenon in the retinas of diabetic patients, which may be closely related to the pathophysiologic process of DR. In contrast, the study by Vinodhini et al.22 did not manage to establish a direct link between ApoA1 levels and DR severity. The widespread use of statins in the study and control groups may have contributed to this finding. In another study, Ankit and his team23 found that diabetic patients had higher levels of ApoA1 in vitreous fluid and retinal pigment epithelial cells compared to nondiabetic patients, which may increase the risk of developing diabetes. ApoA1 has an antifibrinolytic role in the atherosclerotic process and the plasma fibrinolytic system, and high levels of ApoA1 may lead to small-vessel occlusion24, which could promote the development of DR, which is consistent with the results of our study.

In addition, we noted differences in the composition of ApoE and HDL. Apolipoproteins are not only structural proteins, but also have enzymatic activity and receptor binding. The cholesterol in HDL, on the other hand, actually refers only to the cholesterol lipid content in it. Past studies have often focused only on HDL and ignored the role of other important lipoproteins, which may have led to less comprehensive and rigorous scientific studies. Furthermore, the metabolic environment of diabetic patients is significantly different from that of healthy individuals. In the setting of diabetes, lipoproteins undergo non-enzymatic glycation, oxidation, and modification of advanced glycation end products, and these changes may have an impact on test results. In addition, the accuracy of lipoprotein measurements may be variable, with sample handling, non-enzymatic glycation and oxidation having varying effects on different assays.

It is worth mentioning that the level of ApoA1 is more stable than the level of blood lipids, especially in the group of diabetes patients, and its level will not be significantly affected by the pre meal state. Therefore, compared with high-density lipoprotein and lipid indicators, apolipoprotein A1 may be a more accurate and reliable biomarker of diabetes retinopathy (DR).

Vitamin D, a fat soluble vitamin, exists in various forms in the human body. Among them, serum 25 hydroxyvitamin D3 [25(OH)D3] is its main circulating form and is widely considered a key indicator for evaluating vitamin D levels. In recent years, the relationship between vitamin D and microvascular complications of type 2 diabetes has gradually received extensive attention from scholars at home and abroad. Among them, diabetes retinopathy (DR), as one of the major microvascular complications of diabetes, is particularly worth studying. In a meta-analysis of 17,664 type 2 diabetes patients12, the study found that the risk of DR was significantly increased in patients with vitamin D deficiency. In a cross-sectional study, Millen et al.25 divided 15,792 subjects into three groups based on their serum 25(OH)D3 levels: vitamin D sufficient, vitamin D insufficient, and vitamin D deficient. The results showed that the incidence of DR was higher in vitamin D deficient subjects, consistent with the results of this study, further demonstrating the correlation between serum vitamin D status and DR risk. This suggests that 25(OH)D3 may serve as an important serum marker for predicting DR, which can aid in early screening of DR patients. In addition, Jamali et al.14 found in vitro experiments on rats that mice carrying vitamin D receptors performed better in promoting pericellular density arrest and endothelial cell survival compared to mice without vitamin D receptors, and were able to inhibit ischemia mediated retinal neovascularization through vitamin D. Lazzara et al.13 also found in vitro experiments on human cells that supplementing vitamin D can rebuild the integrity of the blood retinal internal barrier by increasing the expression of VE cadherin and ZO-1 proteins, protecting human retinal endothelial cells from high glucose induced damage. These in vitro studies provide new insights into the role of vitamin D in retinal protection.

Therefore, in the future health management of DR patients, attention should be paid to monitoring serum 25(OH)D3, and appropriate supplementation of vitamin D should be considered as one of the possible treatment strategies for preventing or improving DR.

Diabetes duration is an important independent risk factor in DR prediction models and is recognized as a major diabetic complication risk factor. Diabetes duration is strongly associated with the development of a variety of diabetes-related macrovascular and microvascular complications. Liu YY et al.26 in a 16-year cohort study of patients with T2DM, found that the prevalence of DR was 25% at 10 years of disease duration, and increased dramatically to 50% at 15 years of disease duration. Joanne et al.27 also found that DR was a significant risk factor in the prediction model of DR, and was recognized as a major risk factor for diabetic complications. A pooled analysis of 35 epidemiologic studies also found that the prevalence of DR increased significantly with disease duration, ranging from 21.1% in patients with a disease duration of less than 10 years to 76.3% when the disease duration was more than 20 years. Al Rubeaan et al.28 showed that diabetes with a disease duration of more than 10 years and older age of the patient were significant risk factors for DR. There was a 50-fold increase in the prevalence of DR between the youngest group with the shortest Duration of T2DM and the oldest group with the longest duration27. A study by Alramadan et al.29 also found that increasing age was significantly associated with the onset of DR. The prevalence of DR in patients ≤ 60, 61–70 and ≥ 71 years of age was 38.5%, 46.8% and 55.4%, respectively, and these patients generally had a long duration of type 2 diabetes. The baseline analysis of this study showed that the Duration of T2DM in DR patients was significantly longer than that in non-DR patients, and the risk of DR increased significantly with longer disease duration, which is consistent with national and international literature. Therefore, the frequency of screening should be increased in patients with longer disease duration in order to prevent or slow down the development of DR at an early stage.

It is speculated that the Duration of T2DM may be an indicator reflecting the overall level of blood glucose control and the exposure period of risk factors. Calderon et al.30 and Solomon SD et al.31 believe that blood glucose indicators are closely related to the severity of diabetes retinopathy (DR). For patients with T2DM, the higher the level of HbA1C, the higher the incidence of DR; On the contrary, when the HbA1C level decreases, the risk of DR occurrence will also decrease accordingly. In a cross-sectional study32 involving about 127,000 diabetes patients in northern Jiangsu, China, high HbA1C levels were confirmed to be an important risk factor for DR.

In this study, the data in Table 1 clearly shows that the levels of HbA1C in the DR patient population are significantly higher than those in the non DR patient population, and there is a statistically significant difference between the two. This further confirms the role of HbA1C as an independent risk factor for DR patients. HbA1C is a product formed by the combination of blood glucose and hemoglobin, usually used to reflect the patient's blood glucose control status over the past 8 to 12 weeks. Although the production process of glycosylated hemoglobin is relatively slow, it is not easy to decompose once it is formed, so it becomes an important indicator to monitor the blood sugar level of diabetes patients. Long term hyperglycemia may trigger irreversible vascular changes, but glycated hemoglobin, as an important indicator of blood glucose levels, is of great importance33. The level of glycosylated hemoglobin not only reflects the blood sugar content, but also is closely related to the complications related to diabetes, such as microvascular and macrovascular diseases34. Recent studies35 have shown that as HbA1c levels increase, it may affect the ability of hemoglobin to carry oxygen, thereby affecting retinal function, while a decrease in HbA1c levels helps to reduce the risk of DR30. Therefore, for patients with type 2 diabetes, strict blood glucose control and reduction of HbA1c level are still the key to diabetes management and prevention or mitigation of DR disease progress.

In the present study, we establish for the first time neutrophils as a protective factor in DR, a finding that has rarely been mentioned in previous studies. As the most abundant leukocyte type in the human body, accounting for up to 65% of all neutrophils, neutrophils, in addition to their well-known phagocytosis, have a unique mechanism of suicidal attack, neutrophil net trapping death (NETosis). This process culminates in the formation of neutrophil extracellular traps (NETs)36, and a large number of recent studies37,38,39 have demonstrated that NETs play an important role in the development and progression of DR. This is mainly because the hyperglycemic environment stimulates neutrophils to produce superoxide and a variety of cytokines that promote NETosis.

In an experimental study40, researchers found that serum levels of tumor necrosis factor-alpha (TNF-α) were elevated in diabetic patients, a change that stimulated the formation of NETs in neutrophils and led to the release of intracellular serine proteases, including neutrophil elastase (NE), which, in turn, led to an increase in the levels of NE and enzyme activity detected in the serum of diabetic patients. In turn, the increase in NE promotes the aggregation of neutrophils toward sites of inflammation and creates a negative feedback regulatory mechanism, which may further exacerbate the development of autoimmune diabetes. Thus, in diabetic patients, a decrease in the number of neutrophils and an increase in NETs are clearly key factors in this complex process. During the formation of NETs, neutrophils release negatively charged free DNA that activates coagulation factor XII, which triggers a series of reactions, including activation of kinin-releasing enzyme and production of bradykinin. Bradykinin binds to G protein-coupled bradykinin B1 and B2 receptors, which are highly expressed in the retina, and thus participates in both acute and chronic inflammatory injury processes in the DR41. Neutrophils, which originate mainly from the bone marrow and are released into the peripheral blood42, play an important role in innate immunity, acting as the first line of cellular defense. Therefore, a decrease in neutrophil numbers may signal an increase in NETs in the body, thereby promoting the onset and progression of DR, which provides an important early warning signal for diabetic patients. Few current predictive models involve neutrophils, making this study innovative in its exploration of this area.

In this study, an innovative predictive model was developed for assessing the risk of DR based on the analysis of risk factors associated with DR. Through external validation, the model showed excellent diagnostic efficacy and goodness of fit, providing an effective tool for early prediction of DR. Considering the large population of diabetic patients and the uneven distribution of healthcare resources in China, especially in remote areas, screening for DR is extremely challenging. The novel prediction model developed in this study, based on existing research, will greatly improve the current screening status quo and provide an efficient and practical tool. By utilizing simple and easily accessible data for the initial assessment of diabetic patients, this model not only provides a scientific assessment basis for primary and general practitioners, but also helps to guide further diagnostic and therapeutic decisions, thus effectively reducing the incidence of DR. In addition, despite the results of this study, as a retrospective study, its conclusions still need to be validated and strengthened by future prospective studies. This not only provides a more reliable clinical basis for the prevention and treatment of DR, but also points the way for future research directions.

To summarize, this predictive model serves as an effective tool for assessing the risk of DR, incorporating five key clinical features. It is not only applicable to areas with uneven distribution of healthcare resources, but can also play an important role in primary care settings where fundus examination is not available. The application of this model will help clinicians to intervene at an early stage, thus effectively reducing the morbidity and mortality of DR in the future, which will have far-reaching implications for improving the long-term health prognosis of diabetic patients.