Introduction

Bronchopulmonary dysplasia (BPD) is a chronic respiratory disease originating in the neonatal period and a major complication of prematurity. With the development of neonatology and the wide use of pulmonary surfactant (PS), the pathology of BPD has changed greatly. However, BPD still accounts for a significant extent of preterm morbidity and mortality. Additionally, BPD can cause numerous long-term sequelae in many aspects [1, 2]. The prognosis of BPD depends on its severity. Patients with mild severity perform as well as their non-BPD peers in terms of lung function, while patients with severe disease always have more severe airway obstructions, and patients with moderate severity are in between these two patient groups [3]. Moreover, poor short-term as well as long-term neurodevelopmental outcomes occur in patients with BPD, especially in those with moderate and severe BPD [4, 5].

Despite more than 50 years of research, there is a lack of effective treatment options for BPD [6, 7]. According to the National Institute of Child Health and Human Development (NICHD) diagnosis criteria, the determination of BPD severity cannot be decided until postmenstrual age (PMA) 36 weeks. Thus, the early prediction of BPD severity is worth attention, as it can predict the clinical burden and guide treatment and follow-up strategies.

Several researchers have developed prediction tools, with clinical, imaging, genomic, or even more complicated biomarkers as factors, using traditional statistics or machine learning models to identify infants at high risk of BPD [8,9,10,11]. Most of the existing models focus on the early prediction of BPD; few pay attention to BPD severity, although the three levels of severity show different consequences and prognoses. In addition, most of the tools were established in developed countries based on European–American populations, which differ in race and clinical treatment guidelines from Chinese populations. Therefore, current tools may not perform well in China.

With the advantage of being able to detect a very small causal relation among data, machine learning models are promising prediction tools and have been used in many clinical applications over the years [12]. A variety of machine learning algorithms have been applied to develop decision models used to help clinical diagnosis and treatment. In the present study, we aimed to identify the clinical risk factors for BPD severity and develop an effective early prediction model for BPD severity based on machine learning techniques in the Chinese population.

Methods

Patients

This retrospective, single-center study was conducted at a level IV neonatal intensive care unit (NICU) in a tertiary children hospital in Shanghai, China. All premature infants with a gestational age (GA) < 32 weeks who were admitted to the Children’s Hospital of Fudan University from Jan 01 2016 to Dec 31 2020 and diagnosed with BPD during hospitalization were eligible. After excluding patients with incomplete clinical data and patients who were admitted to the hospital after 14 days of life, 471 patients remained for the final analysis. We also involved all 1032 preterm infants with GA < 32 weeks without BPD in the same period, and their detailed data are described in the supplementary file (Supplementary Table 1). This study was approved by the Ethics Committee of the Children’s Hospital of Fudan University (Ethics approval number: 2021–122). Informed consents were obtained from their parents of the participants. The study flowchart is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of the study showing the process of the model development, from the patients’ involvement and the filtration of risk factors to the development and assessment of prediction models based on the machine learning algorithm. BPD bronchopulmonary dysplasia

Potential predictor variables

As the etiology of BPD is complex, all clinical factors during antenatal, perinatal, and early postnatal life were considered. In addition to well-known risk factors, such as GA, birth weight and sex, we also evaluated clinical factors shown to be related to BPD recently, such as C-reactive protein (CRP) [13].

Data on maternal and birth characteristics were collected from the medical history taken at admission. Respiratory support [initial mode as continuous positive airway pressure (CPAP), initial fraction of inspiration O2 (FiO2), invasive ventilation], laboratory examination results (routine blood test, blood-gas test, culture results), complications [patent ductus arteriosus (PDA), necrotizing enterocolitis (NEC)] and treatment (caffeine, corticosteroid treatment, antibiotics exposure without culture, prolonged antibiotics exposure) were obtained from the electronic medical database. Airway specimens included tracheal aspirates and broncho-alveolar lavage fluid. Routine blood and blood gas tests were performed as clinically needed. All the above clinical information was collected within the first 14 days of life. We adopted the maximum CRP level and white blood cell count (WBC), as well as the minimum pH value, sodium concentration, and chloride concentration as variables for statistical analysis.

Gestational diabetes mellitus (GDM) was defined as any degree of glucose intolerance during pregnancy, regardless of the presence of diabetes before pregnancy [14]. Maternal hypertensive disorder was defined as a mean blood pressure (BP) ≥ 140 mmHg and/or diastolic BP ≥ 90 mmHg more than once during pregnancy [15]. Early-onset neonatal sepsis (EOS) and late-onset sepsis (LOS) were defined as bacteremia occurring within 72 hours and after 72 hours, respectively [16]. Small-for-gestational-age was diagnosed by a birthweight below the 10th percentile for the GA at birth, in accordance with Fenton chart [17]. PDA was diagnosed by color-flow Doppler echocardiography. NEC was diagnosed according to the modified Bell’s staging criteria [18]. Positive airway specimen culture was defined as the presence of bacteria other than “normal airway flora” on culturing and was classified by Gram staining. Antibiotic exposure without culture was defined within the first week of life, while prolonged antibiotics were defined as antibiotic exposure without culture started in the first week and lasted for more than five days.

Definition of study outcomes

We referred to the 2001 NICHD criteria for model development [19]. The primary outcome was BPD severity categorized as mild, moderate, or severe based on respiratory support at a postmenstrual age of 36 weeks. Mild BPD was defined as breathing room air, moderate BPD was defined as FiO2 < 0.30, and severe BPD was defined as FiO2 ≥ 0.30 or positive pressure ventilation at a postmenstrual age of 36 weeks.

Data analysis

SPSS version 20.0 (IBM Corporation, Armonk, New York, USA) was used for data analysis. Numerical variables are expressed as the mean ± standard deviation (SD) or as the median with percentiles 25–75 (P25–P75). Normally distributed variables were analyzed using the analysis of variance, while non-normally distributed variables were analyzed using the Kruskal-Wallis H test. For variables with significant P values on the Kruskal-Wallis H test, we applied the non-parametric Mann-Whitney U test for further pairwise comparisons. Categorical data were compared using the chi-square test. We entered factors with a P value < 0.10 into the regression test. All continuous covariates were centered on the mean and scaled to the standard deviation. Ordinal logistic regression was used for univariate and multivariate analyses. Variance inflation factor (VIF) coefficients were reviewed to avoid collinearity. All risk factors with statistical significance in multivariate analysis (P value < 0.05) were used to develop the prediction model. Moreover, all the selected risk factors were compared between preterm infants without BPD and all BPD infants to prove evidence.

Model development

Machine learning modeling was implemented using R 4.4.1 statistical software (R Foundation for Statistical Computing, Vienna, Austria). Patients were randomly split into a training set (80%) and test set (20%). Fivefold cross-validation was used to improve the models. Four types of algorithms were used to determine the optimal strategy and model: gradient boosting decision tree (GBDT), logistic regression (LR), XGBoost (XGB) and random forest (RF). Receiver operating characteristic (ROC) curves were plotted, and the area under the ROC curve (AUC) was used to evaluate the diagnostic value of each model.

Results

Demographics and clinical burden of the patients

A total of 471 infants with BPD were enrolled. The demographics and clinical burden of the patients are shown in Table 1. Among all the patients, 279 were mild (59.2%), 147 moderate (31.2%), and 45 severe (9.6%). The mean GA was 28.0 ± 1.6 (27.0, 29.0) weeks, and the mean birth weight (BW) was 1136 ± 248 (960, 1280) g. BW and GA were significantly lower for severe BPD than for mild and moderate BPD. Severe BPD patients had the longest hospital stay as well as oxygen therapy, while mild BPD patients had the shortest and moderate were in between. All the patients had high medical costs, but severe patients had highest cost, which is about 1.5 times higher than mild patients.

Table 1 Demographic and clinical burden in the patients with mild, moderate and severe bronchopulmonary dysplasia (BPD)

The risk factors during the antenatal and perinatal periods are shown in Table 2. On univariable analysis, patients with severe BPD were more likely to suffer from maternal GDM and less likely to have antenatal corticosteroid treatment than the others. No significant differences were observed among the severity groups in the presence of maternal hypertensive disorder, maternal age, or delivery mode. After birth, the five-minute Apgar score and intubation rates were significantly different among the severity groups. Infants with severe BPD were more likely to undergo intubation in the delivery room (DR) than infants with mild or moderate BPD.

Table 2 Antenatal and birth characteristics in the patients with mild, moderate and severe bronchopulmonary dysplasia (BPD)

The clinical characteristics in early postnatal life are shown in Table 3. Patients with severe BPD needed the highest FiO2 initially, while the mild group needed the lowest FiO2 among the three groups. Patients with severe BPD had the highest WBC and CRP levels as well as the lowest pH value among the three groups. Patients with severe BPD were also more likely to suffer from PDA than those with mild or moderate BPD. Early treatment was different among the different groups. Patients with moderate BPD were more likely to have postnatal corticosteroid treatment than patients with mild BPD. In addition, patients with severe BPD were more likely to be exposed to prolonged antibiotics without a positive blood culture. Regarding infections, patients with severe BPD were more likely to have LOS and positive respiratory specimen culture than patients with mild or moderate BPD.

Table 3 Early postnatal clinical characteristics of the patients of mild, moderate and severe bronchopulmonary dysplasia (BPD)

Multivariable analysis

The results of the ordinal logistic regression analysis are shown in Table 4. Severity was entered as the outcome, and patients with mild, moderate, and severe disease were designated as 1, 2, and 3, respectively. Maternal diabetes mellitus, initial FiO2 value, invasive ventilation, acidosis, hypochloremia, C-reactive protein level, patent ductus arteriosus and Gram-negative respiratory culture were independent risk factors for BPD severity. The calculated VIFs varied between 1.019 and 1.154 (no VIF > 10), which indicated the absence of collinearity between the variables. Moreover, all the factors were significantly different between the BPD group and the non-BPD group (Table 5). We also supplemented the selected risk factors for a preliminary model based on the 2018 NICHD definition [20]. The odds ratios are shown in Supplementary Table 2; ROC of the machine learning model is shown in Supplementary Fig. 1, and detailed indexes are shown in Supplementary Table 3.

Table 4 Results of the ordinal logistic regression model for bronchopulmonary dysplasia (BPD) severity levels
Table 5 Risk factors between the very preterm infants with and without bronchopulmonary dysplasia (BPD)

Model performance

All eight clinical factors with significant differences on multivarible analysis as well as key factors (GA and BW) were entered into the machine learning models. ROC curves were drawn for the test set (Fig. 2). The AUCs of the GBDT, XGB, RF, and LR models were 0.79 ± 0.045 (95% CI 0.70–0.88), 0.85 ± 0.043 (95% CI 0.77–0.94), 0.84 ± 0.038 (95% CI 0.77–0.91) and 0.86 ± 0.036 (95% CI 0.79–0.93), respectively. Thus, the LR, XGB and RF models all had good performance in this database. The detailed indexes for the four models are presented in the supplemental material (Supplementary Table 4).

Fig. 2
figure 2

Performance of different machine learning models showing the comparisons for different learning algorithms on test sets. a–d The ROC curves of the LR, GBDT, XGB and RF models. Class 1 represents mild BPD; class 2 represents moderate BPD, and class 3 represents severe BPD. LR logistic regression, RF random forest, GBDT gradient boost decision tree, ROC receiver operating characteristic, AUC area under the receiver operating characteristic curve, BPD bronchopulmonary dysplasia

Discussion

The differences among various severities of BPD have been intensively investigated in recent years. Patients with severe BPD have worse prognoses and require more intensive care and follow-up. The prediction of severe BPD is of great importance for patients with BPD. Currently, there are dozens of models, few of which can distinguish BPD severity [8,9,10,11]. In the present study, we developed an early prediction model to distinguish the severity of BPD based on a five-year cohort at a level IV NICU. To our knowledge, this is the first prediction model for BPD severity based on a large sample in a Chinese population. The developed model enables an early prediction of BPD severity at 28 days of oxygen treatment.

In the present ordinal logistic regression analysis, we found that GDM, initial FiO2 value, invasive ventilation, acidosis, hypochloremia, CRP level, PDA and positive respiratory culture were independent risk factors for BPD severity.

The present univariable and multivariable analyses suggest that maternal DM may aggravate the clinical severity of BPD. Meta-analysis showed that hyperglycemia during pregnancy may indicate adverse outcomes in preterm infants, such as preterm birth and respiratory distress syndrome [21]. However, in the previous literature, controversy exists regarding whether GDM is related to BPD. A meta-analysis based on studies conducted in developed countries found no difference in the incidence of BPD between infants with and without GDM, and contrasting results have been reported in China [14, 22]. Given that poor DM management may lead to hormonal disorders and unstable glucose levels in neonates, differences in results between these studies may be related to disparities in pregnancy management. As GDM was a significant risk factor for BPD severity in the present study and there is no relevant previous study on BPD severity, further attention should be given to how maternal diabetes influences the infant’s lung development.

Respiratory support plays an important role in BPD development. Non-invasive methods were recommended by randomized controlled trials and meta-analyses to reduce the risk of BPD [21, 23]. Consistent with these studies, we found that intubation in the DR was related to BPD severity, which is probably because of its ventilator-induced lung injury, volutrauma and barotrauma [24]. We also found an initial high FiO2 to be an independent risk factor. Hyperoxia can cause the release of reactive oxygen species, resulting in both cellular and molecular damage to the lung. Thus, attention is needed to avoid intubation in DR. Strategies, such as less-invasive surfactant administration and CPAP, have already been proven to improve respiratory outcomes [24]. In addition, a recent study showed that better lung recruitment maneuvers may reduce the rate of intubation in DR [25].

Acid-based and electrolyte disturbances have been previously identified as risk factors for the development of BPD [26]. Hypochloremia and hyponatremia were related to BPD severity in the present study, which is consistent with previous studies [27]. Several studies have linked electrolyte disturbances to the use of diuretics; however, in the present study, we collected data within the first 14 days of life, when diuretics are seldom used. Thus, electrolyte disturbances in this cohort had a unique pathogenesis, rather than being secondary to therapy. Electrolytes are of paramount importance to both intra- and extracellular fluid balance as well as the normal function of cells. Some researchers suggest that electrolytes disrupt the normal function of cells, while others suggest that they may elevate arginine vasopressin levels, leading to pulmonary edema and, eventually, BPD [26, 28]. Most of the patients with BPD experienced acidosis to some extent in this study. According to the previous literature, acidosis may reflect immature renal function, complex metabolism, or respiratory failure [29,30,31]. Although the mechanism remains unclear, the present study results suggest the importance of close follow-up and the correction of the pH value and serum electrolyte levels in preterm infants.

Inducing systemic inflammatory responses, sepsis has been shown to increase the BPD incidence rate. In the present study, LOS was related to BPD severity, which is consistent with previous studies [32]. However, in the multivariable analysis, the inflammation indicator CRP replaced LOS as an independent risk factor. This is consistent with findings by Yang et al., which found that early CRP levels may be used as an early diagnostic marker for mild-to-severe BPD with an AUC of 0.867 [13]. Although culture remains the gold standard for sepsis, the positive rate is low, especially in early life. With advantages of convenience and quickness, CRP has proven to be a good biomarker for neonatal sepsis. In recent years, new inflammation indicators, such as presepsin, have been proposed to be more promising biomarker of neonatal sepsis [33].

Pulmonary infection also plays an important role in BPD pathology. Both Gram-positive and Gram-negative pathogenic bacteria were associated with an increased risk of greater BPD severity in the present study, while Gram-negative bacteria were more related to the outcome. In the current decade, researchers have focused on the role of respiratory flora in BPD; previous studies have shown a strong association between Gram-negative bacteria and poor respiratory outcomes in preterm infants [34, 35]. Lipopolysaccharide, a major component of the outer membrane of Gram-negative bacteria, interacts with cell-surface receptors, leading to an increased secretion of inflammatory mediators, prompting an inflammatory cascade and causing diffuse lung tissue damage and alveolar simplification [36]. Thus, investigations on methods to reduce Gram-negative bacteria in the airway of chronically high-risk preterm infants are critical.

In summary, the role of inflammation in the pathogenesis of BPD has been firmly established, and both localized and systemic infections were revealed as independent risk factors in the present study. Thus, clinicians should pay more attention to optimal anti-infection treatments, as well as strict nosocomial infection control.

Despite the role of infection in BPD, antibiotics may be a double-edged sword in clinical management. Ting et al. found that among infants without culture-proven sepsis, more antibiotic exposure was associated with adverse neonatal outcomes [37]. Consistent with this, Fajardo et al. found that antibiotic treatment for more than five days in preterm infants with negative cultures was associated with increased BPD risk [38]. In the present study, we found similar results that prolonged antibiotics increased the risk for BPD severity. According to previous studies, this may be because antibiotic use in preterm infants may lead to disruption of the neonatal microbiome development and an increased risk of pulmonary development failure [39, 40]. However, a recent study using stepwise hierarchical analyses suggested the existence of factors confounding the association between early antibiotic exposure and BPD. In other words, antibiotic treatment in early life is more likely a sign of illness severity than a true contributor to BPD.

Factors impeding alveolar maturation, such as BW and GA, are believed to be of prime importance in the pathology of BPD. Contrary to general impressions, the present multivariable analysis indicated no significant differences in GA among BPD severity groups. Since we restricted the cohort to patients with BPD, this result might be partly due to sample homogeneity. Most patients in this cohort were in the saccular stage [41]. Thus, postnatal factors may have displaced GA as the main risk factor.

In previous studies, male preterm infants were more prone to developing BPD than female preterm infants. Estrogen may promote lung maturation and stimulate the expression of PS [42]. In the present study, male neonates accounted for 61% of BPD cases, verifying the susceptibility in boys; however, sex was not related to the clinical severity of BPD according to our data.

Most existing prediction models were developed using traditional statistical methods. However, with computer science widely utilized in clinics, many centers have also attempted to establish BPD prediction models based on artificial intelligence technology in recent years. In 2021, a center in Denmark proposed an artificial intelligence model based on support vector machine learning to predict the occurrence of BPD; by combining postnatal clinical characteristics and exhaled gas nitrogen content, the accuracy reached approximately 90% [8]. Another center established a machine learning model for predicting BPD and severe BPD using clinical data and genomics, with an AUC of 0.872 [9]. Our model performed as well as the previous ones, and we have some advantages. First, the previous study took metabolomic and genomic indicators into account in the models, and the huge cost might render it difficult to popularize the previous models. In the present study, we selected easily accessed clinical factors and validated the findings by ordinal regression analysis, increasing the interpretability of our models. Second, we used different machine learning modeling methods, further supporting the reliability and generalizability of our models. Moreover, our sample size is larger than those of past machine learning studies, which can avoid overfitting risk and thereby biased results.

This study developed an early prediction model using simple selected clinical and laboratory factors. First, all the factors involved are easy to access, which means that the model can be applied in most hospitals even with variability in medical quality. Second, this study also drew attention to some factors related to BPD severity in our own population, which can help improve our clinical management.

The present study has some limitations. First, as a retrospective study, we missed some factors that might be related to BPD, such as chorioamnionitis, early extrauterine growth retardation and early fluid management. We may add the above factors in future prospective studies. Second, as a single-center study, the model still needs external validation in a new group.

In conclusion, in this single-center study, GDM, initial FiO2 value, invasive ventilation, acidosis, hypochloremia, CRP level, PDA and positive respiratory culture were independent risk factors for BPD severity and we developed machine learning models with good performance. With simple factors involved, these models may become a useful tool in clinical practice for the early stratification of BPD, which can help clinicians choose individualized treatment protocols and follow-up strategies in the Chinese populations.