FormalPara Key Summary Points

Why carry out the study?

TIR is an important metric in CGM for interpreting individuals’ glucose fluctuations. However, it can be difficult to obtain when CGM’s accessibility is limited. A reliable method for estimating TIR is needed to facility the glycaemic management for patients with T2D.

What was learned from the study?

In the absence of CGM, the FPG and PPG have been shown to be effective alternative indicators for estimating TIR, in addition to HbA1c.

The proposed TIR prediction model, which utilizes random forest regression and incorporates FPG, PPG and HbA1c as indicators, demonstrates superior prediction performance compared with previously reported models. This novel approach holds great potential as a reliable and effective tool for accurately estimating TIR levels.

Introduction

Glycaemic variability control is closely associated with the occurrence of micro- and macrovascular diabetic complications [1]. Rapid postprandial glucose elevation and impaired glucose tolerance are risk factors for abnormal vascular function leading to serious cardiovascular diseases. Hypoglycaemia, especially nocturnal hypoglycaemia, is the most common adverse effect of diabetes therapy and is associated with unfavourable health outcomes. Poor control of hypoglycaemia will also lead to cardiovascular morbidity and mortality. It was reported that 2–4% of people with diabetes die from hypoglycaemia. The management of hypoglycaemia in individuals with diabetes has been recognized as a crucial aspect of ensuring safe and effective diabetic therapy [2]. However, patients with well-controlled HbA1c levels (< 6%) were found to have an increased risk of hypoglycaemia. HbA1c has long been the gold standard for assessing long-term glycaemic management. It has been demonstrated to have a significant correlation with the mean glucose levels measured by continuous glucose monitoring (CGM) over a certain period of time [3]. However, HbA1c falls short in reflecting glucose fluctuations, e.g. hyper- and hypoglycaemia, and individuals with the same HbA1c may exhibit significantly different diurnal glucose variations [4]. A previous study found that the risk of cardiovascular complications in patients with T2D is high even in the absence of elevated HbA1c, suggesting that fasting plasma glucose (FPG) and postprandial plasma glucose (PPG) are risk factors for cardiovascular disease [5,6,7]. The VADT study pointed out that the risk of all-cause mortality risk was elevated even with a significant decline of HbA1c level [8]. The results of the VADT study indicated that HbA1c is not a sufficient indicator of prognosis in diabetes, particularly with regard to hypoglycaemia and postprandial hyperglycaemia, and considered the underlying causes of glucose fluctuations [9].

The fasting and 2-h postprandial glycaemic levels commonly assessed through oral glucose tolerance test (OGTT) or mixed-meal glucose tolerance test (MMTT) have been recommended as an alternative diagnostic measurement for T2D alongside HbA1c [10].

The level of FPG has been reported as a marker of nocturnal hypoglycaemia [11]. PPG, on the other hand, was recommended as an effective measurement for evaluating meal-related glucose fluctuations and diabetes treatment [12, 13]. These results indicate that the levels of FPG and PPG from traditional glycaemic measurements might be closely correlated with parameters of glycaemic variability [14].

The monitoring and alarming of hypo- or hyperglycaemic events can be facilitated through the continuous and real-time recording of glucose fluctuations by the CGM system. Time in range (TIR), as the main CGM metric, can capture the percentage of time that an individual’s glucose is between 70 and 180 mg/dl, along with time below range (TBR) and time above range (TAR), and it shows daily fluctuations in glucose and the amount of time spent high or low across several days. It was reported to be closely correlated with the incidence of hypo- and hyperglycaemia, as the rise and decline of blood glucose outside of the target range can be reflected by the TIR level [15, 16]. TIR provides a comprehensive picture of glucose control within a period of days, as it takes into account both the frequency and duration of hyper- and hypoglycaemic readouts. This information can be more useful and actionable than HbA1c in short-term glycaemic variability management, helping individuals with diabetes and clinicians personalize their management strategies [17]. Poor control of TIR has been reported to be associated with many diabetic complications in the Chinese population, including diabetic retinopathy, osteoporosis, cardiovascular disease, and others [18,19,20]. This highlights TIR as a novel perspective for improving clinical outcomes and the cost-effectiveness of diabetes healthcare economics. By incorporating TIR into diabetic care, healthcare providers can optimize patient care by addressing specific glycaemic events and improving overall glucose management.

Despite the potential benefits of using CGM for monitoring glycaemic variations, its adoption in routine clinical practice, particularly in primary care, remains limited [21]. China has the highest number of patients with diabetes globally, with approximately 141 million people suffering from the disease. It is predicted that this number will increase to 174 million by 2045 [10]. At the same time, 42% of the population reside in rural areas where the medical resources and economic conditions are limited. The circumstance is even worse in other developing and undeveloped countries [22]. This has generated a huge demand for CGM in diabetes management, but it also poses very large challenges regarding clinician awareness and cost of reimbursement. Nevertheless, widespread adoption of CGM in large-scale trials still faces challenges due to various factors [23], such as the difference in time spent wearing the device, and CGM data are often difficult to compare between studies [24,25,26]. Until now, CGM data have not been a major indicator for regulators in large-scale trials [26, 27]. However, the glucose tolerance tests are more acceptable than CGM even in undeveloped areas. In China, the Guideline for the Prevention and Treatment of Type 2 Diabetes recommends using an OGTT for the diagnosis of T2D and a mixed-meal glucose tolerance test for disease management. The fasting and postprandial glucose levels obtained from OGTT are considered the primary criteria for diagnosing diabetes, and monitoring postprandial hyperglycaemia is recommended in all stages and types of T2D management [28]. And the cost for tests is often covered by healthcare insurance.

Studies have investigated the relationship of TIR with various clinical biomarkers and attempted to infer the TIR value through alternative glucose profiles when CGM data are not available [5, 29, 30]. The main clinical method to investigate TIR derived from HbA1c is univariate regression analysis, which has been widely adopted in estimating the changes in TIR through changes in HbA1c [30, 31]. Vigersky revealed the linear relationship between HbA1c and CGM-derived or self-monitored blood glucose (SMBG)-derived TIR and suggested a conversion table between HbA1c and TIR [31]. The conversion table is currently the most commonly used method in clinical settings when TIR is not available. However, the applicability and accuracy of the conversion table have yet to be determined.

There is no study that considers the diverse glycaemic variation information represented by TIR in exploring the relationship between HbA1c and TIR or a prediction model for TIR through a combination of FPG, PPG and HbA1c values. The FPG and PPG concentrations measured during a glucose tolerance test have provided a high-quality dataset as well as a more comprehensive understanding of an individual’s glucose fluctuations. They can be informative in the estimation of TIR [5, 32].

The aim of this study was to investigate the relationship between the CGM-derived glycaemic profile and FPG and PPG concentrations measured by the glucose tolerance test and to develop a more comprehensive prediction model with novel machine learning approaches for TIR estimation based on FPG, PPG and HbA1c values.

Methods

Study Population

Subjects were enrolled in this study from the Nanjing First Hospital affiliated with Nanjing Medical University. Patients who met the following criteria were studied: (1) patients who were newly diagnosed with T2D or on stable antidiabetic therapy; (2) patients who consented to an OGTT/MMTT before the treatment period; (3) patients who collected CGM profiles within 14 days of OGTT or MMTT test; and (4) patients with CGM data sufficient to calculate TIR.

A total of 399 subjects were included in this study; 195 (49%) of them were newly diagnosed patients with T2D, and the rest (51%) were patients who were stable on at least 6 months antidiabetic therapy without any change. A total of 154 (39%) subjects underwent an OGTT, and 245 (61%) subjects underwent an MMTT. Age, height, weight and medical history were recorded. Body mass index (BMI) was calculated as weight divided by the square of height (kg/m2).

This study has used only retrospective data, and therefore ethics committee approval was not required. However, data use in our study was approved by the Research Ethics Committee of Nanjing First Hospital (ky20120928-06) and conducted in compliance with the Declaration of Helsinki and current ethical codes; all patients provided informed consent.

Biochemical Investigations

Blood samples were collected after overnight fasting and 2 h after OGTT or MMTT. Blood glucose levels were measured using the glucose oxidase method with an autoanalyzer (Modular E170, Roche, Mannheim, Germany). HbA1c was measured by high-performance liquid chromatography assay (Bio-Rad Laboratories, Inc., CA, USA).

CGM and Data Processing

CGM was performed in an iPro2 CGM system (Medtronic, MiniMed, USA). These CGM devices continuously measure interstitial glucose levels within a range of 40–400 mg/dL every 5 min, up to a maximum of 288 read counts per day. For each patient, CGM data with a minimum of 70% of possible CGM readings over a day and collected within 14 days of MMTT/OGTT were retained. On average, 3 days of CGM data for each subject were kept for analysis. TIR referred to the time spent in an individual’s target glucose range (70–180 mg/dL [3.9–10 mmol/L]). The minimum and maximum glucose concentrations during each nocturnal and meal-intake (breakfast, lunch, dinner) period were extracted to assess the glucose variability and fluctuations (Supplementary Fig. S1). For the hospitalized management, meals were generally served at 7:00, 11:00 and 17:00, so we considered three meal-intake period (breakfast, lunch, dinner) as 7.00–10.59, 11:00–16.59 and 17:00–21:59, respectively. The minimum and maximum glucose for each period referred to the lowest and highest glucose values during this period. And the nocturnal minimum glucose referred to the lowest glucose value from 22:00 to 06:59.

Prediction Model

The dataset was randomly split into a training set (80%) and a test set (20%). This step was repeated 20 times to generate 20 replicates for the training sets and testing sets. Then, the prediction models were built and tested on these datasets.

Univariate linear regression, multivariate linear regression and random forest regression were used to predict TIR. The univariate linear regression model estimated TIR with HbA1c only, whereas the multivariate linear regression models utilized FPG, PPG and HbA1c to predict TIR. In the random forest regression model, TIR was predicted with FPG, PPG and HbA1c in a nonlinear manner. TIR prediction model based on other variables has been evaluated too, and the results are shown in Supplementary Table 1.

The function lm from the R package stats was used to access the linear regression models. The R package randomForest was used to perform the random forest model. The number of trees in the random forest model was set to 400, and other parameters were kept at the default settings.

Model Evaluation

Pearson’s correlation coefficient (R), mean absolute error (MAE), root mean square error (RMSE) and mean square error (MSE) were used to evaluate the prediction performance of the models. The models were first built on the training sets and then applied on the testing sets. As there were 20 training sets and testing sets, R, MAE, RMSE and MSE were calculated for each of the 20 sets. Student’s t test was used to compare the results between different models to measure the prediction performance. MAE is a measure of the absolute average magnitude of the errors in a set of predictions, without considering their direction. It measures the average difference between the actual and predicted values. MAE is calculated as the average of the absolute differences between the actual and predicted values [3]. RMSE is another measure of the average magnitude of the errors. RMSE is the square root of the mean of the squared differences between the actual and predicted values [4]. MSE is the mean of the squared differences between the actual and predicted values. MSE provides information about the overall prediction performance of the model by measuring the average of the squared differences between the actual and predicted values.

Statistical Analyses

All demographic information is presented as frequencies and percentages for categorical variables and mean ± SD for continuous variables in Table 1. Linear regressions were applied to assess the association of continuous variables. Correlation coefficients and p values for each variable were used to determine the relationship between FPG, PPG, HbA1c and maximum/minimum glucose. The R2 and p values were used to evaluate the linear relationship in regression. All analyses were conducted with R, version 4.1.0.

Table 1 Baseline characteristics

Results

Individual Trial Characteristics

A total of 399 Chinese patients with T2D were included in this study; 154 (39%) subjects received OGTT to access the glycaemic profile, and 245 (61%) subjects received MMTT to access the glycaemic profile. The median age was 56 years, and a majority of the participants were male. The mean HbA1c was 8.91% (Table 1).

Association Between HbA1c, FPG, PPG and Minimum/Maximum Glucose Values

FPG, PPG and HbA1c values are strongly correlated with minimum and maximum glucose concentrations during a day as measured through CGM data (Fig. 1, Supplementary Fig. S2). FPG concentrations were positively correlated with minimum glucose concentrations, particularly with minimum glucose concentrations at breakfast (p < 0.001). PPG concentrations, on the other hand, were the most positively correlated with maximum glucose concentrations, especially at breakfast (p < 0.001), compared with the coefficient of minimum and maximum glucose concentrations at lunch and dinner. The significance level and coefficient of HbA1c values are overall less comparable to those of FPG and PPG.

Fig. 1
figure 1

Linear regression analysis results of HbA1c, FPG, PPG as related to minimum (left) and maximum (right) glucose concentrations during the breakfast period. FPG fasting plasma glucose, PPG postprandial plasma glucose

Association Between FPG, PPG, HbA1c and %TIR Values

Linear regression analysis was used for the investigation of association between HbA1c and %TIR values. As shown in Fig. 2, there was a significant correlation between HbA1c and TIR values (R2 = 0.26, p < 0.001). The correlation was strengthened when FPG and PPG concentrations were added to the multivariate regression analysis (R2 = 0.36, p < 0.001). In multivariate linear regression, FPG and PPG concentrations were incorporated with HbA1c values as arguments. All three arguments are significantly correlated with %TIR values. The coefficient of HbA1c values in multivariate linear regression decreased from − 8.17 to − 2.91.

Fig. 2
figure 2

Linear regression analysis to evaluate the correlation of HbA1c with TIR (left) and multiple variables (HbA1c, FPG, PPG) with TIR (right). R2 for correlation of HbA1c with TIR, 0.2586; R2 for correlation of multiple variables with TIR, 0.3592. TIR time in range, FPG fasting plasma glucose, PPG postprandial plasma glucose

TIR Prediction Model in the General T2D Population

A performance matrix of the prediction model to estimate TIR was constructed to evaluate the performance of the multivariate linear and nonlinear models (FPG, PPG, HbA1c) and the univariate linear model (HbA1c). The results demonstrate that the multivariate nonlinear random forest model (R = 0.79, RMSE = 18.16, MAE = 14.77) outperformed the linear model and the univariate model, and the multivariate linear model (R = 0.73, RMSE = 20.17, MAE = 16.67) was superior to the univariate model (R = 0.62, RMSE = 25.97, MAE = 21.29) in the same training and testing sets (Table 2). It is worth mentioning that the univariate regression model was built upon a previously published formula [31]. The predictions based on testing set among the three models are shown in Fig. 3.

Table 2 Performance of the TIR prediction model in general population
Fig. 3
figure 3

Scatter plot for CGM-derived TIR and predicted TIR between univariate linear regression model (A), multivariate linear regression model (B), random forest regression model (C) in randomly generated testing set. X CGM-derived TIR, Y predicted TIR, TIR time in range, FPG fasting plasma glucose, PPG postprandial plasma glucose, R Pearson’s correlation coefficient, MSE mean square error, MAE mean absolute error, RMSE root mean squared error

TIR Prediction Model in the Newly Diagnosed T2D and Existing T2D Population

The subgroup modelling in the newly diagnosed T2D and existing T2D population displayed an improved prediction performance for random forest model as we stratified and simplified the disease history of the subjects. For newly diagnosed T2D population, the correlation coefficients between the true and predicted values for the random forest regression model were 0.84, and 15.89 and 13.15 in RMSE and MAE, respectively; the correlation coefficients for the multivariate linear model were 0.8, and 17.16 and 14.50 in RMSE and MAE, respectively. For the existing T2D population, the correlation coefficients between the true and predicted values for the random forest regression model were 0.84, and 18.48 and 15.26 in RMSE and MAE, respectively (Tables 3, 4).

Table 3 Performance of the TIR prediction model in the newly diagnosed T2D population
Table 4 Performance of the TIR prediction model in the existing T2D population

Discussion

In this study, we investigated the relationship between three clinically widely used glycaemic parameters (FPG, PPG, HbA1c) and HbA1c values, and the CGM-derived minimum and maximum glucose concentrations in patients with T2D. The results reveal that FPG and PPG concentrations measured by the glucose tolerance test could to some extent reflect glycaemic excursion within a day. We observed the highest correlation coefficient between FPG and minimumbreakfast as well as PPG and maximumbreakfast, which is consistent with the clinical approach, as the measurements of FPG and PPG through MMTT or OGTT are typically performed in the morning time by clinicians. HbA1c is not as sensitive as FPG and PPG to the glycaemic excursion represented by the minimum and maximum glucose during a day. The results are further supported by the correlation analysis between the TIR and those three glycaemic parameters. In the linear regression, a significant correlation between HbA1c and TIR values (R2 = 0.26, p < 0.001) was observed, and the correlation with TIR was further improved by adding FPG and PPG to the model (R2 = 0.36, p < 0.001). All three parameters are significantly correlated with the TIR. The results suggest that the FPG and PPG can also be an indicator of glucose fluctuation in a relatively longer period of time and further demonstrate the potential of FPG and PPG in the estimation of TIR values.

In the light of the results above, we built TIR prediction models based on different parameters involved and algorithms. The results of the TIR prediction models in the same training and testing sets showed that the nonlinear model was superior to the linear model of FPG, PPG and HbA1c, and the multivariate linear model outperformed the univariate linear model. These results are similar to those of previous studies [16, 32], although those studies only evaluated each correlation between FPG, PPG, HbA1c and TIR values separately, while the multivariate relationship has never been considered together.

By incorporating FPG and PPG data into the analysis, we confirmed the limitations of relying solely on HbA1c values to describe and predict TIR. The performance of TIR estimation is significantly improved. The results demonstrate that the estimation of TIR requires a comprehensive understanding of glucose fluctuations beyond that provided by HbA1c values. The superiority of the random forest regression model over the linear model highlights the complex relationship between TIR values and glucose profiles, as TIR measures the diabetes control status in terms of glucose variation and the time window of glycaemic management.

The subgroup analysis shows additional comparisons on segments of the study population, and it demonstrates that there were significant differences in the performances of the prediction model among the newly diagnosed T2D, existing T2D and the general population. The prediction performance in multivariate models is improved in the newly diagnosed T2D population. However, for the existing T2D population, prediction performance on multivariate models was not fully outperformed compared with models is the general population, as the individual errors measured by RMSE, MAE and MSE are slightly raised but the overall coefficient was improved. This means that the background diversity in patients with T2D leads to great differences in glucose profiles; thus, the relationship between TIR and glucose values is different. This has been well demonstrated by a previous study, as their CGM data revealed that patients with different HbA1c values exhibited different glycaemic variations, and patients with higher HbA1c values had larger glycaemic variations. For patients with these characteristics, special therapies aimed at reducing glycaemic variations and prolonging time-to-peak hyperglycaemia are recommended [33]. The establishment of personalized models based on population characteristics is helpful to improve the adaptability and accuracy of the models in different populations. And further confirmed that the correlation analysis of all individuals with a single variable linear model is not enough for TIR estimation.

As discussed, previous studies of CGM metrics and glucose profiles have focused on illustrating the relationship between those variables. It provided the association of the change in HbA1c with TIR elevation and was evaluated through regression analysis. The regression model may not be extended to other datasets for the nature of its purpose. In this study, we evaluated three regression models in the same training and testing sets. The univariate linear regression model for TIR and HbA1c was adopted from a published formula [31]. The model performance evaluation is based on R, RMSE, MAE and MSE of the observed value and predicted value. The results suggest the robustness of the random forest regression model in estimating TIR through FPG, PPG and HbA1c over traditional regression methods.

Although HbA1c remains the gold standard biomarker for long-term glycaemic control, a large percentage of patients with desirable HbA1c control still face the risk of T2D complications, especially when diabetes management focuses on only one metric instead of considering a comprehensive glucose profile [5,6,7,8, 32, 34]. Among patients with the same HbA1c levels, there may be significant differences in their blood glucose fluctuations as reflected by CGM [35], resulting in substantial variability in their TIR [16, 36]. As CGM technology has become widely accepted in the clinical practice of T2D management lately, as a generated parameter from CGM data, TIR has been generally applied in continuous interpretation of glucose data [16, 37] and strongly recommended by various guidelines or consensus worldwide [38,39,40,41]. The American Diabetes Association (ADA)’s Standards of Medical Care in Diabetes—2020 [42] recommended a 70% TIR (70–180 mg/dL) target for optimized T2D management. TIR reveals the glycaemic fluctuations and variability that cannot be easily detected by traditional monitoring methods, which is advantageous for achieving glucose homeostasis and may provide additional information to HbA1c [43]. With this comprehensive glucose biomarker, clinicians can accurately comprehend fluctuations in patients’ glucose levels and evaluate the patterns of their patients’ current lifestyles as well as the efficacy and safety of their current therapeutic method, avoiding hypoglycaemic events and adjusting the treatment programs as needed. However, the use of TIR as an alternative to HbA1c has been debated [44]. The adoption of CGM technology in clinical settings faces several challenges, such as additional cost and time for patients and clinical trials, maintenance and interference from different devices and systems, the requirement of proper training in interpreting ambulatory glucose profile (AGP) reports, and limited availability of the technology, especially for undeveloped regions or for specific patient populations [21, 23, 25]. Accurate and effective assessment of TIR using non-CGM and clinically accessible biomarkers is imperative in clinical practice. By considering the feasibility of the TIR estimation, we aim to provide an alternative approach that may be more accessible and feasible, particularly in resource-limited or rural areas. The glucose tolerance test and HbA1c test are well-established clinical tests that can be performed in various healthcare settings, including those with limited resources. They involve a one-time procedure that does not require continuous wear or specialized equipment. Therefore, in areas where CGM availability is limited, the approach can serve as a valuable tool for estimating TIR and providing insights into glycaemic control.

Our TIR prediction models provide a good fit for external validation with collected multiorigin data. It facilitates clinicians’ assessment of the change in TIR in patients with T2D and the adjustment of therapeutic methods even without CGM devices. However, the unrecognized patient-specific covariates and the uncertainty of disease history may limit the applicability of the models among different populations. A more personalized model may further improve the model performance for patients with different characteristics and disease histories. Random forest regression is a machine learning model that has been adopted by many studies for disease or complication risk prediction [45, 46]. Its strength lies in its ability to handle nonlinear relationships between variables and its robustness to outliers, making it a powerful tool for modelling complex relationships. It also has the ability to handle large datasets, which is useful in TIR prediction models where multiple variables such as FPG, PPG and HbA1c are considered. Additionally, the use of multiple decision trees in the model provides greater stability and reduces the risk of overfitting. The results from this study have shown its potential in the prediction of TIR and have shown its robustness compared with other models. One of the limitations of this study is a limited population of Chinese Han individuals, meaning that the results and performance of the TIR prediction model might not be perfectly generalized to all populations. At the same time, the performance differences in subgroup modelling highlight the potential to enhance performance by incorporating additional disease history information into the model. Additionally, FPG and PPG concentrations were assessed through both OGTT and MMTT, which may introduce variability into the dataset. And the TIR was calculated through a 3-day CGM dataset obtained from the inpatient setting with the Medtronic iPro2 CGM system [47], which does not align with the recent consensus recommendation of at least 10 days of data for predominantly personal use CGM devices [42, 47]. Future studies could optimize the model by incorporating diverse CGM systems with longer monitoring durations, which would help reduce systematic errors and improve the prediction performance of TIR. Besides, the relatively small sample size used in the machine learning model may result in overfitting and affect the representativeness of the training and testing sets; however, this limitation has been partially addressed through cross-validation and the inherent nature of the random forest model. Validation with a more diverse and larger sample size is needed in future work.

Conclusion

Our study highlights the strong correlation between fasting and postprandial glycaemic profiles and the minimum and maximum glucose levels measured by CGM. The important characteristics of glycaemic variation associated with TIR are not appropriately described by HbA1c alone. The TIR prediction model that incorporates FPG, PPG and HbA1c yields significantly improved results compared to the univariate model. Our findings further support the presence of a nonlinear relationship between TIR and these variables, as evidenced by the superior performance of the random forest regression model and the subgroup modelling analysis. The results suggest the presence of an applicable model for TIR estimation. The application of machine learning methods in clinical data modelling setting, especially in the area of chronic disease, provides a new insight into future patients’ disease management.