Introduction

The postnatal nutritional status of very preterm infants (VPI), and extrauterine growth retardation (EUGR) in particular is receiving increasing attention. Studies have shown that EUGR affects not only the physical development and immediate complications of very preterm infants, but also their long-term health, especially neurocognitive function, and may increase the risk of cardiovascular system diseases and chronic metabolic syndrome in adulthood1. EUGR is relative to intrauterine growth retardation(IUGR). However, until now there is still much controversy about the timing, population and criteria for the evaluation of EUGR. Data from a 2016 multicenter survey including 25,899 preterm infants with birth weight < 1500 g at 22–32 weeks of gestational age showed that the incidence of EUGR evaluated based on weight was 38%2. A multicenter cohort study in Shandong Province, China, in 2018 showed that the overall EUGR incidence was 60.7% in 1051 preterm infants with gestational age < 32 weeks or weight < 1500 g, compared with 53.7% in non-small for gestation age(SGA) preterm infants3. Figueras Aloy et al.4 followed up 479 very preterm infants less than 32 weeks of gestational age and the incidence of EUGR was 51% when they reached 34–36 weeks of gestational age. El Rafei et al.5 investigated 6792 preterm infants in 19 regions of 11 European countries and the incidence of EUGR remained at 60.3%. This indicates that EUGR is a common problem that very preterm infants need to face after birth. Exploring the risk factors of EUGR and early prediction of the risk of EUGR in VPI is currently a hot topic in neonatal medicine research. Low birth weight is considered the most important risk factor for EUGR by numerous studies6,7, and multiple pregnancies, hypertensive disease complicating pregnancy(HDCP), SGA and prenatal glucocorticoid use also increase the incidence of EUGR5. Many neonatal complications such as severe infections, neonatal respiratory distress syndrome(nRDS), bronchopulmonary dysplasia(BPD) and necrotizing enterocolitis(NEC)8, have also been shown to be risk factors for EUGR. Protein and energy are closely related to weight gain in the early postnatal period and increasing energy intake in the first postnatal week can reduce postnatal weight loss, while insufficient cumulative intake can directly lead to lagging weight gain in preterm infants, thus adversely affecting their growth and development9. The effect of gestational age on EUGR is controversial, with most studies suggesting that increasing gestational age reduces the risk of EUGR10, but some studies have found that increasing gestational age is an independent risk factor for EUGR11. EUGR is strongly associated with birth weight, gestational age, perinatal abnormalities, neonatal diseases, and inadequate nutritional support after birth. In fact, these risk factors have significant multicollinearity with each other, and most of the previous studies performed univariate analysis only at the screening stage of independent variables and analyzed the dominance ratio of risk factors by logistic regression equation, which does not deal well with the interference of various confounding factors. Moreover, when too many independent variables are included in the analysis, there is a high requirement for the sample size of the study cohort, and the results of studies with small sample sizes may suffer from insufficient test efficacy. Therefore, constructing a nomogram based on large-sample clinical data from multiple centers to provide a personalized, evidence-based, and highly accurate risk prediction model can predict the risk of EUGR in VPI at an early stage and facilitate decision-making related to clinical management, which is of great significance for improving the prognosis of very preterm infants and reducing the incidence of EUGR.

Materials and methods

The present prediction model was written based on the TRIPOD list12.

Study population

The study population was derived from a prospective multicenter study on extrauterine growth retardation in very preterm infants and the factors affecting it in 28 hospitals in level III NICUs in 7 regions of China from September 2019 to December 2020. The aim of original study was to investigate the incidence and related factors of extrauterine growth retardation in very premature infants during hospitalization from different regions of China. Therefore, although the original research design has fit this study in depth, strictly speaking, this study presented in the manuscript was a deep mining of the original data.

The study was registered in the Chinese Clinical Trials Registry (http://www.chictr.org.cn; registration number: ChiCTR1900023418). The study protocol was approved by the Ethics Committee of Women and Children's Hospital of Xiamen University/Xiamen Maternal and Child Health Hospital (Batch number: KY-2019-016), and all informed consent was obtained from the families of the children.

Inclusion criteria: 1. gestational age < 32 weeks; 2. hospitalization > 2 w; 3. admission within 24 h after birth. Exclusion criteria: 1. congenital structural malformations; 2. hospitalization less than 2 w; 3. inherited metabolic diseases; 4. death, interruption of treatment or automatic discharge during hospitalization; 5. incomplete data. Discharge criteria: 1. cure of primary disease and stable vital signs; 2. milk intake up to full enteral feeding; 3. weight 1800 ~ 2000 g or more.

Sample size

This study was designed as a cohort study to investigate the prevalence of EUGR, based on the literature13, which reported that the prevalence of EUGR in preterm infants born at gestational age < 32 weeks ranged from 51 to 60%, setting the test level α at 0.05 and the permissible error δ at 2%. Thus, the required sample size was calculated to be 2305 cases, and the estimated missing data rate was 10%, so the total sample size was expected to be 2,500 cases.

Data collection

The Women and Children's Hospital affiliated to Xiamen University was used as the survey center, and EpiData was applied to establish a database with a uniform format of questionnaire. The data of the case report form were entered in two-person duplex, and the data entry was performed strictly according to the requirements of the study protocol.

Observation indicators

General indicators: The perinatal data of mothers were recorded, such as mode of delivery, antenatal corticosteroids (ACS) and gestational diabetes mellitus(GDM), HDCP, etc. The gestational age, birth weight, sex, postnatal glucocorticoid use, mechanical ventilation of preterm infants were also recorded.

Nutrition and developmental indicators: Nutritional and developmental indicators included time of initiation of enteral feeding, main milk feeding during hospitalization, time to reach total enteral feeding (150 ml/kg/d), cumulative days of fasting, cumulative time of parenteral nutrition, time to regain birth weight, growth velocity after regaining birthweight, cumulative calories in the first week after birth, cumulative amino acid and fat milk intake in the first week after birth, total calories up to 110 kcal/kg/d of age and oral calories up to 110 kcal/kg/d of age, etc.

Major complications in preterm infants: The major complications of prematurity include grade III-IV nRDS, moderate to severe BPD, stage II or higher NEC, grade III-IV intraventricular hemorrhage(IVH), periventricular leukomalacia (PVL), hemodynamically significant patent ductus arteriosus (hsPDA), early-onset sepsis (EOS), late-onset sepsis (LOS), retinopathy of prematurity requiring intervention (ROP), feeding intolerance (FI), metabolic bone disease of prematurity (MBPD), parenteral nutrition-associated cholestasis (PNAC), etc.

Criteria for related diseases and treatment protocols

EUGR evaluation criteria: using the 2013 Fenton growth curve14 as a reference, correcting for weight, length and head circumference indicators below the 10th percentile of the growth curve at 36 weeks of gestational age or at hospital discharge.

SGA: refers to newborns whose birth weight is less than the 10th percentile of the mean birth weight for children of the same sex and gestational age.

Growth velocity (GV) (g/kg-d): the formula is GV15 = [1000 × ln(Wn/W1)]/(Dn-D1), where Wn is real-time weight (g), W1 is birth weight (g), Dn is the age in days (d), and D1 is time to regain birth weight (d). The body weight was measured every two days after birth, and growth velcoity was evaluated every week.

FI evaluation criteria: gastric residuals exceeding 50% of the previous feeding with vomiting and/or abdominal distention; feeding plan failure, including reduced, delayed or interrupted enteral feeding, with one of the two being diagnostic16.

Brain injury (BI) defined as the presence of grade III-IV IVH, PVL and hemorrhagic infarction on cranial ultrasound or cranial MRI.

Diagnostic criteria for MBDP: blood alkaline phosphatase > 900 IU/L with blood phosphorus < 1.8 mmol/L17.

Diagnostic criteria for PNAC: persistent PN ≥ 14 d, skin yellowing that cannot be explained by the original disease, hepatosplenomegaly, light-colored stools or white clay-like stools, serum conjugated bilirubin > 2 mg/dl or > 20% of total bilirubin, except for other biliary atresia, infections and cholestasis due to genetic metabolic diseases18.

NEC was defined as Bell stage ≥ 2, wherein the NEC suspected based on signs and symptoms was confirmed by pneumatosis intestinalis on radiology19.

A diagnosis of BPD was indicated by at least 28 days of oxygen therapy above the fraction of inspired oxygen (FiO2) of 0.21. Based on the FiO2 requirement at 36 weeks post-menstruation age or discharge, whichever came first, BPD was characterized as mild (room air), moderate (FiO2 < 0.3), or severe (FiO2 ≥ 0.3 or requiring positive pressure ventilation)20.

ROP that requires intervention: ROP requiring intravitreal drug injection, laser therapy, or surgery.

EOS and LOS include confirmed sepsis with positive blood cultures versus clinically diagnosed sepsis, with diagnostic criteria referring to the Expert Consensus on the Diagnosis and Treatment of Neonatal Sepsis (2019 edition)21.

hsPDA was considered when catheter diameter was > 1.5 mm, left atrial diameter/aortic diameter was ≥ 1.4 mm, and left-to-right shunt (or bi-directional bi-phase shunt) was needed. In addition, the PDA was accompanied by at least one of the following clinical manifestations: heart murmur, tachycardia, increased respiration, increased pulse pressure, hypotension, flushing or cardiac dilation.

nRDS grade III-IV, IVH grade III-IV, and PVL were diagnosed with reference to Practical Neonatology (5th edition)22.

Statistical analysis

The data were processed for statistical analysis using R-software (version 4.1.3; R Foundation for Statistical Computing, Vienna, Austria). We have used the mice function of the mice package to perform multiple chain imputation on the original data based on the random forest rule, then we have got 5 data sets, and we have finally selected the data set obtained by the fourth imputation for statistical analysis. Continuous variables were expressed as median and interquartile spacing, and Kruskal–Wallis nonparametric test was used for comparison between groups. Count data were expressed as number of cases and percentages (%), and the Person chi-square test was used for comparison between groups. Differences were considered to be remarkable when P values below 0.05.

Stratified random sampling of the study cohort based on gestational age sorting across trial centers was performed using the sampling package in R software. The random seed was set at 666 in a ratio of 7:3 and divided into a training set of 1760 cases and an internal validation set of 754 cases.

In this study, the data of cases with missing outcome were directly deleted, and the absence of predictor variables was treated by multiple imputation using the mice package, the number of imputation was 5, including all the predictive factors and outcome variables in the imputation model; after the imputation, the analysis results of the imputation data were combined according to Rubin's rules.

After univariate analysis, LASSO regression with k-fold cross-validation (tenfold validation) was performed using the glmnet package in R software to select the independent variables for the prediction model, and the optimal lambda was selected as lambda.1se. At this time, 14 independent variables were obtained. Next, we eliminated 6 variables again according to the inverse stepwise selection based on the Akaike information criterion, and finally selected 8 optimal independent variables. Based on these 8 most suitable predictors, a prediction model was constructed by binary multi-factor logistic regression using the rms package, and a nomogram was drawn, and the risks of clinical characteristics were expressed by OR values, P values and 95% CIs.

Finally, the accuracy of the prediction model was estimated using the following validation methods in the training set and internal validation set, respectively. The ROC curves were plotted with the pROC package and the area under the curve was calculated to verify the discriminatory ability of the prediction model. Plotting Calibration curves with the rms package to assess the calibration of the prediction model. Plotting DCA benefit curves with the ggDCA package to evaluate the clinical efficacy of the prediction model.

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Women and Children's Hospital affiliated to Xiamen University/Xiamen Maternal and Child Health Hospital (Batch number kY-2019–016). Consent to participate and for publication All participants gave written informed consent. The informed consent was obtained from the families of the children.

Consent to participate and for publication

All participants gave written informed consent.

Results

Characteristic of the study cohort

Our study cohort included 2514 very preterm infants of gestational age 24–31+6 weeks. Based on weight assessment, 1189 preterm infants were combined with EUGR and 1325 non-EUGR preterm infants, with an incidence of EUGR of 47.3%. These preterm infants were randomly sampled in a 7:3 ratio according to gestational age into the training set and the validation set (7 in the training set and 3 in the validation set), with 1760 cases in the training set and 754 cases in the validation set, as shown in Fig. 1. The basic characteristics of the two groups of preterm infants were shown in Table 1. There were not statistically significant differences in the independent variables between the training set and the validation set. So, the training set and the validation set obtained were well representative of the overall population.

Figure 1
figure 1

Flow chart of study design. VPIs very preterm infants, GA gestation age. Select the predictors and fit the model using only the training set.

Table 1 Characteristics of the 2514 preterm infants with gestational age less than 32 weeks.

Independent risk factors in the training set

Univariate analysis revealed differences in 37 of the 49 clinical indicators. Predictor variables were selected from these variables by LASSO regression with k-fold cross-validation, and 14 predictor variables were screened, as shown in Fig. 2. Then, the inverse stepwise selection based on the Akaike information criterion resulted in the inclusion of 8 best-fit predictors, namely birth weight, SGA, HDCP, GDM, multiple births, cumulative duration of fasting, growth velocity and postnatal corticosteroids. The AIC values corresponding to the full model, the model fitted with 14 independent variables and the model fitted with 8 independent variables were 1393.8, 1770.5 and 1776.7, respectively.

Figure 2
figure 2

Variable selection by the LASSO regression analysis with k-fold cross-validation in the training set. (a) Forteen variables with nonzero coeffificients were selected by deriving the optimal lambda. (b) we plotted the AUC curve versus log(lambda) and drew dotted vertical lines based on 1 standard error criteria.

Predictive model construction

Eight of the initial 49 variables were included in the predictive model. Since all eight predictors showed statistically significant differences (shown in Table 2) and were independent of each other(all variance inflation factors were less than 2), they were all introduced into the predictive model and a nomogram of EUGR risk probabilities was constructed (shown in Fig. 3). The endpoints of each variable were vertically upward to the score scale in the first column to obtain the corresponding score, and the scores of the eight variables were summed to obtain the total score, which was used to determine the risk probability of EUGR for patients. For example: a preterm infant with birth weight 1000 g, singleton, SGA, cumulative fasting period of 10 days, so the baby will have total points of 132, and EUGR predictive value of 70%. In addition, when the total score is below 40, the EUGR risk of preterm infants is extremely low, and we believe that EGUR prediction is unnecessary at this time.

Table 2 Logistic regression analysis of the predictors for the risk of EUGR in the training set.
Figure 3
figure 3

Risk factors of birth weight, SGA, HDCP, GDM, multiple births, cumulative duration of fasting, growth velocity and postnatal corticosteroids for nomogram prediction model. EUGR extrauterine growth retardation, SGA small for gestation age, HDCP hypertensive disease complicating pregnancy, GDM gestational diabetes mellitus.

Predictive model validation

The ROC curve was used to evaluate the discriminatory ability of the predictive model. The AUC of this predictive model was 83.1% in the training set and 84.6% in the validation set (shown in Fig. 4a,b), suggesting that the model has good discrimination. The AUC in the validation set was higher, which may be due to the following reasons: The study cohort was divided into 1760 training sets and 754 internal validation sets by stratified random sampling based on gestational age ranking in different trial centers. The baby of the training set and validation set were from the same population, and there was little difference in data between two group; 2. The sample size of the validation set is small; 3. The proportion of low-risk population in the validation set may be higher.

Figure 4
figure 4

Validation of the EUGR risk nomogram. Receiver operating characteristic curve validation of the EUGR risk nomogram in the training set (a) and validation set (b). Calibration curves of the predictive EUGR risk nomogram in the training set (c) and validation set (d). Decision curve analysis for the EUGR risk nomogram in the training set (e) and validation set (f). EUGR extrauterine growth retardation.

The prediction model was calibrated using calibration plots and the Hosmer–lemeshow test. The calibration curves, as well as the Hosmer–lemeshow test, showed that the predicted probabilities were highly consistent with the actual probabilities (P = 0.913 for the training set; P = 0.445 for the validation set) (shown in Fig. 4c,d), and the prediction model showed a good degree of fit in both the training and validation sets. The DCA curves show that the prediction model achieves good net benefit within a wide threshold of 0–95% in both the training and validation sets (shown in Fig. 4e,f).

Discussion

The nomogram is a graphical visualization of the interrelationship between variables for outcome prediction based on the results of multifactorial analysis, integrating multiple predictors and assigning them according to certain proportions. Currently, there are no reports on the application of nomogram line graphs in EUGR management. To our knowledge, this is the first study to construct a nomogram for risk assessment of EUGR in preterm infants born at less than 32 weeks of gestational age.

EUGR, an abnormal disease state commonly found in very preterm infants, which is the result of a combination of influencing factors. In this process, various confounding factors must be considered as variables and the issue of multicollinearity among different variables. In this prospective study, we designed a more comprehensive list of factors and used LASSO regression analysis with k-fold cross-validation to screen variables and then divided the study cohort in a 7:3 ratio. The accuracy and stability of the model were modeled by the training set and validated by the internal validation set, and thus the method we chose could better address the accuracy and stability of the model. The probability of risk of EUGR in preterm infants born at gestational age < 32 weeks was well predicted by our prediction model constructed using eight factors, including birth weight,SGA, HDCP, GDM, multiple births, cumulative duration of fasting, growth velocity and postnatal corticosteroids. Although most of the factors were similar to those reported in previous studies, the statistical methods of this study differed significantly from these previous studies, providing more accurate results while demonstrating the contribution of each predictor to the risk probability of EUGR.

Birth weight is a key determinant of the occurrence of EUGR. The occurrence of many diseases is negatively correlated with birth weight, and as birth weight increases, the incidence of these diseases progressively decreases, and the corresponding medical interventions are reduced23,24. Thus, lower birth weight preterm infants are at higher risk for disease and take longer to achieve catch-up growth, making low birth weight preterm infants more likely to deviate from the growth curve and result in EUGR.

SGA is the adverse outcome of IUGR of the fetus itself. The cumulative retardation of intrauterine development will directly affect the growth and development of the fetus after birth4. The secondary EUGR is the continuation of intrauterine growth retardation. Controlled for other confounding factors in their study, Lima et al.25 found that SGA was the most influential variable for the weight (PR = 4.33) and head circumference (PR = 2.11) of preterm infants.The occurrence of IUGR is related to maternal obstetric diseases and nutritional status during pregnancy, such as HDCP. The basic pathophysiological changes of HDCP are parturient systemic small vessel spasm and increased peripheral vascular resistance, leading to reduced utero-placental blood supply and decreased placental function, which are more obvious when combined with adverse factors such as preeclampsia or decreased umbilical artery blood flow during diastolic period, thus causing fetal intrauterine growth retardation and intrauterine distress26. Even if the fetus was not evaluated as SGA at birth, its birth weight relative to gestational age was still small, which was closer to P10 on Fenton's intrauterine growth curve. Radmacherc et al. found that p-value of birth weight was an important predictor of EUGR, and a lower P-value was associated with an increased risk of EUGR6. In addition, intrauterine fetal gastrointestinal ischemia caused by HDCP increases the risk of postnatal feeding problems and even NEC in preterm infants, and affects their ability to digest and absorb food and establish enteral feeding, which may also increase the risk of EUGR27.

Fetal overgrowth is a common complication of GDM, and the risk of macrosomia in pregnant women with GDM may be about 3 times that in pregnant women with normal glucose tolerance28. Maternal hyperglycemia can lead to fetal hyperinsulinemia and an increase in unused glucose, which promotes adipose tissue and protein synthesis, resulting in fetal overgrowth in utero. The advantage of intrauterine growth and development may make it maintain good growth or evaluation advantage for a long time after birth. The slight deviation of growth curve in very preterm infants may partially compensate for this, thereby reducing the incidence of EUGR. However, birth weight increase is not always a benign event. Pregnant women with GDM are often associated with an increased risk of perinatal adverse outcomes such as prolonged labor, dystocia, fetal malformation, congenital heart disease, neonatal respiratory distress syndrome, and neonatal hypoglycemia29. It also increases the risk of obesity, diabetes and other metabolic diseases in adulthood30.

Multiple pregnancies increase many perinatal complications, including fetal anomalies, pre-eclampsia and gestational diabetes mellitus. Multiple pregnancies are associated with an increased risk of fetal and infant morbidity and mortality due to preterm birth and complications associated with prematurity as a result of multiple pregnancies31,32. This effect is significantly and positively correlated with the number of fetuses. Even when gestational age at birth is taken into account, infants with multiple pregnancies face more preterm complications than singleton infants born at the same gestational age33.

The growth rate of preterm infants during their NICU stay has a significant independent effect on neurological and physical growth outcomes. Several previous studies have concluded that the rate of neonatal weight gain at 24–38 weeks of gestational age is approximately 17 g/kg/d14,34 and that the rate of postnatal weight gain is closely related to nutritional management and depends on the accumulated protein, caloric and fluid intake in the early postnatal period13. Good nutritional management can meet the needs of preterm infants to achieve intrauterine growth rate and thus reduce the risk of EUGR.

Aggressive enteral nutrition is the optimal strategy for nutritional management of preterm infants35, with specific measures including early initiation of enteral nutrition, breastfeeding, application of breast milk fortification36,37, and avoidance of unnecessary milk reduction and fasting. The main causes of repeated multiple fasting in preterm infants are feeding intolerance and NEC, and prolonged fasting can cause gastrointestinal mucosal atrophy and peristaltic dysfunction, which are important influencing factors for the successful establishment of enteral feeding. Feeding intolerance and prolonged fasting inevitably delay the time to achieve total enteral feeding in preterm infants and increase the need for intravenous nutrition, thereby increasing the risk of complications associated with parenteral nutrition.

Glucocorticoids can reduce lung tissue edema by inhibiting lung inflammation, promote the generation of lung antioxidant enzymes and pulmonary surfactant, help shorten the time of ventilator adjuvant therapy, and reduce the incidence of BPD. Low dose, short course of systemic dexamethasone has been widely used in premature infants who rely on mechanical ventilation for a long time38,39. Systemic glucocorticoid therapy can improve the success rate of ventilator withdrawal and reduce the incidence of BPD in preterm infants who are still dependent on mechanical ventilation and have high risk factors for BPD at 7–14 days after birth40. Poets et al.41 also suggested that systemic dexamethasone treatment for ultra-low birth weight infants who were still dependent on mechanical ventilation and oxygen 14 days after birth could help to avoid BPD. The mechanisms of EUGR in children with BPD include insufficient nutritional support, increased respiratory work and resting metabolic rate, intermittent hypoxia, restricted fluid intake, use of diuretics and glucocorticoids, frequent pulmonary infection or sepsis, and feeding difficulties, which aggravate the undernutrition status of preterm infants and increase the risk of EUGR42. The use of postnatal glucocorticoids was positively correlated with the incidence of BPD. Although univariate analysis showed that the use of glucocorticoids was significantly higher in children with EUGR, glucocorticoids still played a positive role in preventing the occurrence of EUGR by reducing the incidence of BPD and other complications.

However, there are several limitations to our study. First, due to regional differences and heterogeneity in clinical practice among centers, multi center effects cannot be completely avoided, so there is a possibility of bias in the study results. Second, the study was conducted during the spread of the CoVID-19 epidemic, and the provision of breast milk was restricted due to epidemic prevention policies, resulting in a low breastfeeding rate in this study cohort.

Conclusion

In conclusion, based on 8 factors including birth weight, SGA, HDCP, GDM, multiple births, cumulative duration of fasting, growth velocity and postnatal corticosteroids, we constructed the first predictive model of EUGR risk in very preterm infants. Internal validation confirmed good accuracy, consistency, and broad net benefit of the model. The visualized predictive model demonstrates the risk weights of different predictors, providing clinicians with a simple, intuitive and practical predictive tool that can provide better individualized predictive risk assessment for choosing the most optimal treatment strategy for preterm infants born after gestational age < 32 w.