Predicting the need for intubation within 3 h in the neonatal intensive care unit using a multimodal deep neural network

Im, Jueng-Eun; Park, Seung; Kim, Yoo-Jin; Yoon, Shin Ae; Lee, Ji Hyuk

doi:10.1038/s41598-023-33353-2

Article
Open access
Published: 17 April 2023

Volume 13, article number 6213, (2023)

Scientific Reports

Abstract

Respiratory distress is a common chief complaint in neonates admitted to the neonatal intensive care unit. Despite the increasing use of non-invasive ventilation in neonates with respiratory difficulty, some of them require advanced airway support. Delayed intubation is associated with increased morbidity, particularly in urgent unplanned cases. Early and accurate prediction of the need for intubation may provide more time for preparation and increase safety margins by avoiding late intubation in high-risk infants. This study aimed to predict the need for intubation within 3 h in neonates initially managed with non-invasive ventilation for respiratory distress during the first 48 h of life using a multimodal deep neural network. We developed a multimodal deep neural network model to simultaneously analyze four time-series variables collected at 1-h intervals and 19 tabular variables including demographic, physiological and laboratory parameters. Evaluated on a dataset of 128 neonates with respiratory distress who underwent non-invasive ventilation, our model achieved an area under the curve of 0.917, sensitivity of 85.2%, and specificity of 89.2%. These findings demonstrate promising results for the multimodal model in predicting neonatal intubation within 3 h.

Introduction

Respiratory distress is the most common indication for admission to the neonatal intensive care unit (NICU)^1,2. Endotracheal intubation is the end-stage of respiratory support and is a critical procedure in neonates with respiratory difficulties. Recent non-invasive ventilator (NIV) strategies have reduced the incidence of endotracheal intubation and duration of mechanical ventilator support in NICU management^3,4. Although NIV modalities, including high-flow nasal cannula (HFNC), nasal continuous positive airway pressure (NCPAP), bilevel positive airway pressure (BIPAP), and non-invasive positive pressure ventilation (NIPPV), are increasingly used in neonates with respiratory difficulties, a significant proportion of these neonates fail NIV support and require intubation within the first few days after birth, especially preterm infants^5,6,7.

Neonatal respiratory distress syndrome (RDS) is a major source of morbidity in the NICU. Standard treatments include mechanical ventilation and surfactant replacement therapy^8,9. Because the incidence of RDS is lower in late preterm and term neonates, it is difficult to distinguish RDS from other less severe respiratory diseases that do not require endotracheal intubation¹⁰. Although several risk factors for RDS have been established, such as prematurity, cesarean section, perinatal asphyxia, male sex, maternal diabetes mellitus, and multiple births^11,12,13, intubation is often delayed in late preterm and term neonates with respiratory distress receiving NIV support.

Recently, deep neural networks have been widely implemented in neonatal medicine. Examples include a predictive model of mortality during NICU hospitalization¹⁴ and prediction of long-term neurodevelopmental outcomes at the corrected age of 2 years¹⁵ using electronic medical records including demographics, vital signs, and images. Previous studies have proposed predictive models for RDS and NCPAP failure using clinical and laboratory parameters in both adult and neonatal medicine^{11,16,17,18,19}. During the early neonatal period, neonates’ cardiopulmonary status is vulnerable and fluctuates depending on how well they adapt to the extra-uterine environment. Because NIV failure commonly occurs in the first-hour stabilization period^20,21, short-term prediction has practical use in NICU settings. In this study, we designed a multimodal deep neural network (MDNN) model to predict the need for intubation within the next 3 h in neonates with respiratory difficulty who were admitted to the NICU within the first 48 h of life and initially received NIV support. This model is intended to support clinical decisions by providing diagnostic alternatives to physicians and proposing appropriate treatments based on demographic, bedside clinical, and laboratory parameters at the time of NICU admission.

Methods and materials

Ethics statement

Data collection was approved by the Institutional Review Board of the Chungbuk National University Hospital (IRB No. 2021-02-034). The review board waived the requirement for informed consent, owing to the retrospective design of this study. We confirm that all methods were performed in accordance with the relevant guidelines and regulations.

Study population

We retrospectively obtained datasets of all neonates who were admitted to the NICU within the first 48 h of life at Chungbuk National University Hospital between June 1, 2020, and November 30, 2021. We excluded neonates without respiratory problems, those hospitalized after 48 h of life, and those intubated at the time of admission. To improve model performance, we also excluded neonates intubated more than 12 h after admission and those with missing data, defined as more than two missing tabular variables or ≥ 10% missing time-series data.

Datasets

Demographic data, physiological parameters, and laboratory data were collected. The datasets comprised 19 tabular and 4 time-series features. The tabular data in this study are defined as either categorical or numerical variables. We collected data such as gestational age (GA), birth weight, Apgar scores at 1 and 5 min, sex, delivery mode, antenatal steroid use, pregnancy-induced hypertension, gestational diabetes mellitus, premature membrane rupture, birth place, multiple births, initial body temperature, clinical risk index for babies (CRIB-II) score²⁰, and parameters in the initial blood gas analysis, including pH, PO₂, PCO₂, base excess (BE), and lactate as the tabular features. Additionally, we analyzed four time-series features: heart rate (HR), respiratory rate (RR), fraction of inspired oxygen (FiO₂), and pulse oximetry (SpO₂). Time-series data were recorded at 1-h intervals until 12 h after admission. The missing values in the tabular data and time-series data were filled with the average values and the most recent data, respectively.
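The two imputation rules described above can be illustrated with a minimal sketch. The function names and the example values are hypothetical, and the source does not state whether the column averages were computed on the training data only or on the whole dataset; here they are simply computed over whatever rows are passed in.

```python
from statistics import mean


def impute_tabular(rows):
    """Fill missing entries (None) in each column with that column's mean.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    col_means = []
    for j in range(n_cols):
        observed = [row[j] for row in rows if row[j] is not None]
        col_means.append(mean(observed))
    return [
        [col_means[j] if row[j] is None else row[j] for j in range(n_cols)]
        for row in rows
    ]


def forward_fill(series):
    """Replace each missing hourly value with the most recent observed one."""
    filled, last = [], None
    for value in series:
        if value is not None:
            last = value
        filled.append(last)  # a leading gap stays None: nothing to carry forward
    return filled


# Hypothetical example: two tabular columns (pH, lactate) and one hourly HR trace.
tabular = [[7.30, 2.0], [None, 4.0], [7.20, None]]
heart_rate = [140, None, None, 152, None]
print(impute_tabular(tabular))
print(forward_fill(heart_rate))
```

Forward filling keeps the imputed value causal: a gap at hour k is filled only from hours before k, which matters for a model meant to predict ahead of time.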

The time of intubation was defined as the first record of endotracheal intubation or ventilation data. We classified infants intubated within 12 h of NICU admission as intubated patients and all others as non-intubated patients. Because the initial tabular data, such as body temperature and blood gas analysis results (pH, PCO₂, PO₂, BE, and lactate), could gradually recover or worsen over time, they are of limited value for long-term (> 12 h) predictions. Instead, we focused on reducing the model’s complexity and improving its practical use by relying on the initial tabular data, so our model was designed to predict the need for short-term (≤ 12 h) intubation.

Multiple samples were generated from each patient over time. To classify intubation cases 3 h in advance, the samples taken within the cutoff time (t_c = 3 h) were labeled as “1” for intubated patients and “0” for non-intubated patients (Fig. 1). For model training and testing, the patients were first split into training and test sets as shown in Supplementary Table S1; multiple samples were then generated from each patient set over time. Therefore, samples from one patient never appear in both the training and test sets. Extracting multiple samples from a patient, by dividing the entire time sequence into segments of a specific length, was necessary to balance the negative and positive data. For positive data collected prior to intubation attempts, the overall time sequence length varied from 1 to 12, whereas the negative data included all time points during the 12-h period after admission. We considered that this difference in sequence length could bias model training. In addition, most intubations occurred within 3 h (29/36), so we cut the entire time sequence into segments with the same sequence length of 3.
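The windowing and labeling described above might look roughly like the following sketch. Several details are assumptions that the text does not settle: windows slide with a stride of 1 h, a window from an intubated patient counts as positive when it ends no more than t_c hours before intubation, earlier windows from intubated patients are left out, and patients with fewer than three records before intubation yield no sample. The function name and arguments are hypothetical.

```python
def make_samples(hourly, intubation_hour=None, length=3, cutoff=3, stride=1):
    """Cut one patient's hourly records into fixed-length labeled windows.

    hourly: list of per-hour feature tuples (HR, RR, FiO2, SpO2); index 0 is
        the first hour after admission.
    intubation_hour: hour index of intubation, or None if never intubated.
    Returns a list of (window, label) pairs.
    """
    # For an intubated patient, only records before intubation are usable.
    usable = hourly if intubation_hour is None else hourly[:intubation_hour]
    samples = []
    for start in range(0, len(usable) - length + 1, stride):
        end = start + length  # the window covers hours [start, end)
        window = usable[start:end]
        if intubation_hour is None:
            samples.append((window, 0))
        elif intubation_hour - end <= cutoff:
            samples.append((window, 1))
        # Assumption: windows ending more than `cutoff` hours before
        # intubation are skipped rather than labeled.
    return samples


# Hypothetical 12-hour record for one patient.
hours = [(140 + h, 50, 30, 95) for h in range(12)]
print(len(make_samples(hours)))                     # non-intubated patient
print(len(make_samples(hours, intubation_hour=8)))  # intubated at hour 8
```

Because the split into training and test sets happens at the patient level before this step, all windows of a given patient end up on the same side of the split.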

Since the aim of this study was to provide decision support for clinicians while assessing the need for intubation upon NICU admission, we limited our prediction time window to within the first 12 h of NICU hospitalization.

Models

We designed an MDNN using three subnetworks to jointly analyze the tabular data \(x_n \in \mathbb{R}^{a}\) and the time-series data \(x_t \in \mathbb{R}^{b \times l}\), as shown in Fig. 2, where a and b denote the numbers of tabular and time-series features, respectively, and l is the length of the time sequence. First, x_t is flattened; the flattened x_t and x_n are then fed into separate multilayer perceptron (MLP) blocks, each consisting of a single fully connected layer with d (= 32) nodes, batch normalization, and rectified linear units, followed by dropout to alleviate overfitting. The vectors from the MLP blocks are concatenated, and the concatenated vector \(x_{\mathrm{cat}} \in \mathbb{R}^{2d}\) is analyzed by the last MLP block. Finally, the resulting vector is used to calculate the intubation probability (0–1) by a fully connected layer with sigmoid activation. The proposed MDNN was implemented using TensorFlow 2.4 (https://www.tensorflow.org/).
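The forward pass described above can be written out as equations. The weight and bias symbols (W, b, w), the hidden vectors h, and the operator names BN (batch normalization) and Drop (dropout) are notation introduced here for illustration; the width of the last MLP block is assumed to be d as well, which the text does not state explicitly.

```latex
\[
\begin{aligned}
h_t &= \mathrm{Drop}\big(\mathrm{ReLU}(\mathrm{BN}(W_t\,\mathrm{vec}(x_t) + b_t))\big), & W_t &\in \mathbb{R}^{d \times bl},\\
h_n &= \mathrm{Drop}\big(\mathrm{ReLU}(\mathrm{BN}(W_n x_n + b_n))\big), & W_n &\in \mathbb{R}^{d \times a},\\
x_{\mathrm{cat}} &= [\,h_t ; h_n\,] \in \mathbb{R}^{2d},\\
h_c &= \mathrm{Drop}\big(\mathrm{ReLU}(\mathrm{BN}(W_c\, x_{\mathrm{cat}} + b_c))\big), & W_c &\in \mathbb{R}^{d \times 2d},\\
\hat{y} &= \sigma\big(w^{\top} h_c + b_o\big) \in (0, 1).
\end{aligned}
\]
```

With the dimensions used in this study (a = 19, b = 4, l = 3, d = 32), the flattened time-series input has 12 entries and the concatenated vector has 64.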

To compare the MDNN with widely used machine learning (ML) methods, we further implemented linear regression (LR), support vector machine (SVM)²¹, and an extreme gradient boosting decision tree (XGBoost) regressor²². The SVM is a supervised machine learning algorithm based on kernel functions, and we employed the Gaussian radial basis function kernel for model predictions. The XGBoost model is also a supervised technique that ensembles decision trees using the gradient boosting framework. LR, SVM, and XGBoost are available in open-source libraries, and we implemented them in Python using the scikit-learn 1.0.2 (https://scikit-learn.org/) and XGBoost 1.7.2 (https://xgboost.ai/) libraries.

Statistical analysis

Continuous variables were compared using the Student’s t-test or the Mann–Whitney U test and are presented as the mean (95% confidence interval). Categorical variables were compared using the chi-square test or Fisher’s exact test and are presented as percentages and frequencies. SPSS version 25 (SPSS Inc., Chicago, IL, USA) was used for all statistical analyses, and P < 0.05 was considered statistically significant.

Model evaluation

The models were internally validated using fourfold cross-validation to assess performance and minimize overfitting (Supplementary Table S1). The total datasets were split into training (75%) and test (25%) sets by maintaining the overall positive/negative ratio. The proportion of positive patients in the training and test datasets was set to approximately 30%, the same as in the overall dataset (36 positive patients out of 128 patients).
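A patient-level stratified fourfold split of this kind can be sketched as follows. This is an illustration only: the function name, the random seed, and the round-robin assignment are assumptions, and the actual fold assignment is the one listed in Supplementary Table S1.

```python
import random


def stratified_patient_folds(labels, k=4, seed=0):
    """Assign patient ids to k folds with a similar positive/negative ratio.

    labels: dict mapping patient id -> 1 (intubated) or 0 (non-intubated).
    Returns a list of k lists of patient ids.
    """
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in (1, 0):
        ids = sorted(pid for pid, y in labels.items() if y == cls)
        rng.shuffle(ids)
        # Deal the shuffled ids of this class round-robin across the folds.
        for i, pid in enumerate(ids):
            folds[i % k].append(pid)
    return folds


# Cohort sizes from the study: 36 intubated and 92 non-intubated patients.
labels = {pid: (1 if pid < 36 else 0) for pid in range(128)}
folds = stratified_patient_folds(labels)
for i, test_ids in enumerate(folds):
    train_ids = [pid for j, fold in enumerate(folds) if j != i for pid in fold]
    n_pos = sum(labels[pid] for pid in test_ids)
    print(i, len(train_ids), len(test_ids), n_pos)
```

With 36 positive and 92 negative patients, each test fold holds 9 positive and 23 negative patients (25% of the cohort), and the remaining 96 patients form the training set.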

For each fold, we evaluated five quantitative measures, including the area under the receiver operating characteristic curve (AUROC), F1-score, sensitivity, specificity, and accuracy. The AUROC and F1-score were given the highest priority because metrics that are robust to an imbalanced dataset were needed. The F1-score is the harmonic mean of precision and recall, and the AUROC is calculated from the ROC curve, which visualizes the tradeoff between the true positive rate and the false positive rate. Statistical calculations were performed using the Scikit-learn library (https://scikit-learn.org/)²³.
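For reference, the five measures can be computed from predicted probabilities as in the sketch below. It is written in plain Python rather than with the Scikit-learn functions used in the study, the decision threshold of 0.5 is an assumption (the text does not state one), and the example labels and scores are made up.

```python
def threshold_metrics(y_true, y_prob, threshold=0.5):
    """Sensitivity, specificity, F1-score, and accuracy at a fixed threshold.

    Assumes both classes are present and at least one positive is predicted.
    """
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # recall is the same quantity as sensitivity
    return {
        "sensitivity": recall,
        "specificity": tn / (tn + fp),
        "f1": 2 * precision * recall / (precision + recall),  # harmonic mean
        "accuracy": (tp + tn) / len(y_true),
    }


def auroc(y_true, y_score):
    """AUROC as the probability that a random positive outranks a random
    negative, counting ties as one half."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up labels and scores for illustration.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_prob = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2, 0.1, 0.05]
print(threshold_metrics(y_true, y_prob))
print(auroc(y_true, y_prob))
```

The AUROC does not depend on the threshold, which is one reason it is a convenient headline metric when the classes are imbalanced.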

Results

Baseline demographics

Of the 577 neonates admitted to the NICU during the study period, we excluded 449 who did not meet the inclusion criteria and 30 with missing data. The datasets included 128 eligible neonates with 36 intubated (positive) and 92 non-intubated patients (negative) (Fig. 3). The mean GA and birth weight were 35.8 ± 2.8 (30–42) weeks and 2.6 ± 0.8 (0.9–4.9) kg, respectively. Table 1 shows the clinical characteristics of the intubation and non-intubation groups.

Table 1 Baseline characteristics and outcomes of cohort.

In the initial blood analysis results of the intubated group, average pH was considerably lower (P = 0.004) and PCO₂ was higher (P = 0.001) than those of the non-intubated group. Of the 128 neonates who initially received NIV support, 22 of 101 (22%) infants primarily supported by HFNC and 14 of 27 (52%) infants initially treated with NCPAP or BIPAP were intubated (P = 0.003). The average time to intubation was 124 (15–510) minutes in the intubated group. The mean time to intubation in neonates with HFNC was 159 ± 131 min and in neonates with NCPAP or BIPAP was 70 ± 78 min (P = 0.016).

Model evaluation

Figure 4 shows the confusion matrices of the entire dataset for each model. The MDNN and conventional ML (LR, SVM, and XGBoost) models were evaluated regarding the mean AUROCs and confusion matrices from fourfold validation (Fig. 5). The average AUROCs for these models were 0.917 for MDNN, 0.890 for SVM, 0.886 for LR, and 0.853 for XGBoost. In addition, the MDNN outperformed the ML models with respect to F1-score, sensitivity, and accuracy, while its specificity was marginally lower than that of the SVM. Specifically, the MDNN achieved an F1-score of 0.884, sensitivity of 85.2%, specificity of 89.2%, and accuracy of 88.2%, followed by the SVM model with an F1-score of 0.882, sensitivity of 82.7%, specificity of 89.7%, and accuracy of 88.0% (Table 2).

Table 2 Comparison of model performances between the proposed model and the conventional machine learning models.

Model interpretation

To interpret the proposed model prediction, we used Shapley Additive Explanations (SHAP)²⁴ and sensitivity analysis representing the contribution of each feature to the model outcome. A positive SHAP value indicates that the corresponding feature contributes to a higher probability of needing intubation, whereas a negative value suggests that the corresponding feature leads to a lower probability of requiring intubation. The magnitude of the SHAP value represents the contribution of a feature to prediction performance.
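The sign convention above can be made concrete with a toy example that computes exact Shapley values by enumerating all feature orderings. This is not the SHAP library and not the explainer used in the study (which the text does not name); the risk function, its coefficients, the feature values, and the baseline are all hypothetical, and "absent" features are simply set to a baseline value.

```python
from itertools import permutations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values: average each feature's marginal contribution
    over all orderings, with absent features held at their baseline value."""
    n = len(x)
    phi = [0.0] * n
    for order in permutations(range(n)):
        current = list(baseline)
        previous = model(current)
        for j in order:
            current[j] = x[j]          # feature j joins the coalition
            output = model(current)
            phi[j] += output - previous
            previous = output
    return [total / factorial(n) for total in phi]


def toy_risk(v):
    """Hypothetical linear risk score over (GA in weeks, FiO2 in %, SpO2 in %)."""
    ga, fio2, spo2 = v
    return 0.5 - 0.03 * (ga - 36) + 0.01 * (fio2 - 30) - 0.02 * (spo2 - 95)


baseline = [36, 30, 95]   # hypothetical cohort averages
patient = [32, 40, 90]    # lower GA, higher FiO2, lower SpO2 than baseline
phi = shapley_values(toy_risk, patient, baseline)
print(phi)
print(sum(phi), toy_risk(patient) - toy_risk(baseline))
```

All three values come out positive, meaning each feature pushes this hypothetical patient toward a higher predicted risk, and the values sum to the difference between the patient's output and the baseline output. Enumeration scales factorially with the number of features, which is why practical tools rely on approximations.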

Figure 6 shows a summary plot of the SHAP values used to visualize model interpretation. These results showed that GA, FiO₂, SpO₂, birth place, and HR were the key features of the MDNN model. In addition, we performed SHAP analysis for the three machine learning models. In the LR model, the top five factors associated with intubation risk were GA, FiO₂, birth place, BE in the initial blood gas analysis, and SpO₂, and those of SVM were FiO₂, SpO₂, HR, and GA (Supplementary Fig. S1). For both models, four of the important factors were shared with the proposed model, whose key features were GA, FiO₂, SpO₂, birth place, and HR. XGBoost identified pH and BE in the initial blood gas analysis, birth place, SpO₂, and GA as the important factors; it differed most from the proposed model and had the worst AUROC and F1-score (Table 2).

The sensitivity analysis was performed as follows. We held all the attributes at their mean values while varying just one input at a time to evaluate how each input parameter affects the output of the proposed model. Five representative values (minimum, mean-to-minimum median, mean, mean-to-maximum median, and maximum) for each feature were used in this analysis. The baseline output was first derived from the mean values of all features, and the changes in intubation risk (%) from the baseline output were then calculated. The absolute values of the cumulative changes from the 23 features are plotted in Fig. 7. Figure 7 shows that GA causes the greatest change in the intubation risk (%), followed by SpO₂, FiO₂, HR, and birth place.
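A one-at-a-time sensitivity analysis of this kind can be sketched as below. Two readings are assumptions: the "mean-to-minimum median" and "mean-to-maximum median" are taken as the midpoints between the mean and the respective extreme, and the cumulative change is taken as the sum of absolute changes over the five probe values. The toy risk function and the feature values are hypothetical.

```python
def representative_values(values):
    """Minimum, min-to-mean midpoint, mean, mean-to-max midpoint, maximum."""
    lo, hi = min(values), max(values)
    mu = sum(values) / len(values)
    return [lo, (lo + mu) / 2, mu, (mu + hi) / 2, hi]


def one_at_a_time_sensitivity(model, columns):
    """Vary one feature at a time while the others stay at their means.

    columns: dict mapping feature name -> list of observed values.
    Returns a dict mapping feature name -> cumulative absolute change in the
    model output, in percentage points relative to the all-means baseline.
    """
    means = {name: sum(vals) / len(vals) for name, vals in columns.items()}
    baseline = model(means)
    result = {}
    for name, vals in columns.items():
        total = 0.0
        for value in representative_values(vals):
            probe = dict(means)
            probe[name] = value  # only this feature departs from its mean
            total += abs(model(probe) - baseline) * 100
        result[name] = total
    return result


def toy_risk(features):
    """Hypothetical linear risk score over GA (weeks) and FiO2 (%)."""
    return 0.5 - 0.03 * (features["ga"] - 36) + 0.01 * (features["fio2"] - 30)


columns = {"ga": [32, 36, 40], "fio2": [20, 30, 40]}
print(one_at_a_time_sensitivity(toy_risk, columns))
```

Ranking features by the returned totals gives the kind of ordering reported in Fig. 7; unlike Shapley values, this procedure ignores interactions, because all other inputs stay fixed at their means.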

The sensitivity analysis and the SHAP analysis identified the same top five factors, with slight differences in order. Both analyses indicate that GA causes the greatest change in the intubation risk (%), and that the other key factors (SpO₂, FiO₂, HR, and birth place) also contributed significantly to the model prediction.

Discussion

We collected datasets of 128 neonates with respiratory difficulties who underwent initial NIV therapy and developed an MDNN model to identify neonates requiring endotracheal intubation and mechanical ventilation within the following 3 h. The proposed model should provide useful information to alert medical staff of a need for intubation occurring within a short time (< 3 h) and reduce persistent monitoring efforts.

Several studies have addressed intubation prediction in adults^17,25,26. Varzaneh et al.²⁵ predicted the intubation risk of hospitalized coronavirus disease 2019 (COVID-19) patients using a decision tree-based model and reported an accuracy of 93%. Siu et al.¹⁷ also predicted intubation in adults using a random forest model with an open dataset (Medical Information Mart for Intensive Care, MIMIC) and achieved an AUROC of 0.87. Among studies targeting NICU patients, Clark et al.²⁷ studied very low birth weight infants (birth weight < 1500 g). Vital sign and electrocardiogram data collected at 2-s intervals were used to predict intubation after 24 h using a logistic model, which had an AUROC of 0.84.

The MDNN, based on a multimodal approach, has the advantage of accessing multivariate information simultaneously, resulting in the highest predictive performance with an AUROC of 0.917, sensitivity of 85.2%, and specificity of 89.2%. An ablation study was performed to compare it with a non-time-series model (tabular data model). We input only the 19 tabular variables to a deep neural network (DNN), which was constructed by removing the first MLP block that analyzes time-series data. The DNN achieved an average AUROC of 0.680, which was 0.237 lower than that of the MDNN model. The other performance metrics of the DNN model were as follows: F1-score, 0.718; sensitivity, 69.5%; specificity, 70.5%; accuracy, 68.0%. These results showed that multimodal analysis was essential for improving performance.

In addition, we computed SHAP values to characterize the clinical factors potentially contributing to intubation in the MDNN model. SHAP values have been widely used to explain and clinically validate model outcomes^{28,29,30,31,32}. The features with the highest SHAP values for the proposed model were GA, birth place, HR, FiO₂, and SpO₂. Lower GA and high fractional oxygen requirements have been considered clinically significant factors for RDS in previous studies^{12,17,19,33,34,35,36}. For the machine learning models, the four key factors of SVM (GA, FiO₂, SpO₂, and birth place) and LR models (GA, FiO₂, SpO₂, and HR) were consistent with those of the proposed model. In addition, XGBoost, which showed the worst performance in AUROC and F1-score, shared only three important factors (GA, SpO₂, and birth place) with the proposed model. From these results, it can be inferred that the five key factors of the MDNN model (GA, FiO₂, SpO₂, birth place, and HR) contribute strongly to model performance, and these factors were largely consistent across the models. The findings of this study suggest that by monitoring significant factors with the highest SHAP values, including time-series data like HR and SpO₂, we would be able to identify neonates who require prompt endotracheal intubation.

Our study was designed to predict intubation within 3 h using initial tabular data and time-series data collected over 1–3 time points recorded at 1-h intervals. Longer time-series data with denser intervals can help stabilize and improve model performance. However, among the 36 neonates who underwent endotracheal intubation within 12 h, 29 required endotracheal intubation within 3 h of admission. Therefore, prompt decisions must be made using short-term records. Furthermore, our dataset targeted neonates who initially received NIV support, and NIV failure commonly occurred in the first-hour stabilization period; therefore, short-term prediction (≤ 3 h) is the most practical for use in the NICU. In addition, the interval between recordings of time-series data varies from minutes to hours across institutes; therefore, this model, which relies on hourly records, can be practically applied in other settings. We used 23 clinical variables to predict which infants would require endotracheal intubation and mechanical ventilation. Diverse data, such as radiologic images, could improve model performance. However, radiologic image findings require data labeling and are difficult to incorporate; therefore, we selected a minimal set of variables easily obtained at NICU admission as input variables. Furthermore, neonatal patients requiring intubation after 12 h were excluded. The tabular data included critical information such as body temperature and blood gas analysis results (pH, PCO₂, PO₂, BE, and lactate). Although these values could gradually recover or worsen over time, only the initial data were collected, to reduce the model’s complexity and improve its practical use. For long-term (> 12 h) predictions, these data could be input every hour.

RDS is the most common cause of respiratory distress in neonates who require endotracheal intubation within 48 h of birth^37,38,39. Of the 36 neonates in the intubated group, 30 neonates with RDS and 2 with meconium aspiration syndrome received surfactant replacement therapy. In practice, the differential diagnosis of respiratory morbidities should be made using a combination of FiO₂ to maintain normal SpO₂, RR, degree of respiratory distress, and aeration of the lung on radiologic images^36,40. Less severe respiratory diseases such as transient tachypnea of the newborn and mild RDS spontaneously resolve with NIV support within 48 h of life^37,38,41. Delayed diagnosis of moderate to severe RDS leads to several complications including air leaks and intraventricular hemorrhage^9,42. Timely diagnosis of RDS and earlier surfactant replacement therapy will improve neonatal outcomes. Several observational studies have attempted to predict RDS and NCPAP failure using perinatal variables such as prenatal ultrasound measures⁴³, biomarkers in gastric aspirate^44,45, and a combination of maternal and neonatal data at birth^46,47. In this study, we developed a model to predict the need for endotracheal intubation within the next 3 h using 19 tabular data variables at birth and 4 time-series variables representing varying patient conditions before endotracheal intubation.

Since 2011, the insurance system in South Korea has covered early prophylactic surfactant administration in preterm neonates with a GA < 30 weeks or a birth weight < 1250 g within 2 h of birth⁸. In this study, 73 neonates who had already been intubated for prophylactic surfactant administration or for medical conditions requiring resuscitation within the delivery room were excluded. Therefore, most of the enrolled patients were late preterm or term neonates. During the study period, we implemented the conventional surfactant administration method (endotracheal intubation, bolus instillation with subsequent intermittent positive pressure ventilation for distribution, followed by mechanical ventilator support) instead of an intubation-surfactant-extubation (INSURE) strategy⁴⁸ or a less invasive surfactant administration (LISA) technique⁴⁹. Among the enrolled neonates, GA was still one of the significant factors predicting intubation within the following 3 h, although GA did not differ between the intubated and non-intubated groups. The intubated group showed significantly more severe respiratory acidosis than the non-intubated group. De Bernardo et al. have reported that low umbilical cord blood arterial pH (< 7.12) was correlated with RDS in full-term neonates⁵⁰. Blood gas analysis is a useful tool for evaluating patients with respiratory failure, and severe respiratory acidosis is a definite indication for endotracheal intubation. Because endotracheal intubation is one of the most invasive procedures in the NICU, clinicians are reluctant to perform intubation in unclear cases. However, pH and PCO₂ were not among the top five factors in either the SHAP or the sensitivity analysis and were thus not included as key factors in our study. FiO₂ and SpO₂ were among the five key factors and have already been associated with RDS and CPAP failure in previous studies, even in the era of the LISA technique^51,52,53,54. HR variability is known as a key factor for predicting mortality and sepsis in adult^55,56 and neonatal medicine^14,57,58.

The number of neonates assigned to NCPAP or BIPAP support was significantly higher in the intubated group than in the non-intubated group. The NIV failure rate in neonates receiving NCPAP or BIPAP was significantly higher than that in those receiving HFNC support. Additionally, the time to intubation was significantly shorter in the NCPAP and BIPAP groups than in the HFNC group. This could reflect a potential bias, in that physicians may have assigned neonates with more severe respiratory distress to NCPAP or BIPAP^3,59. Artificial intelligence has recently expanded its clinical scope in modern medicine, especially for critically ill patients. With the aid of this artificial intelligence model, physicians’ performance can be more accurate and refined.

Limitations

Despite the benefits of the artificial intelligence techniques applied here, our study had some limitations. First, the time-series variables were collected at 1-h intervals; therefore, the data may be insufficient to capture all relevant clinical changes. Second, the model was evaluated using single-center data. Since we have no documented protocol defining when to intubate infants with respiratory difficulty, decision-making occurred on a case-by-case basis when NIV support was deemed insufficient. Third, open datasets for neonatal patients who experienced respiratory distress are limited. The specific protocol for data collection makes it more difficult to find an equivalent dataset for comparison, and we could not perform model validation due to the insufficient number of positive cases (intubation) during training and testing. Therefore, our model requires external validation in other cohorts. Additionally, this study was designed to predict the need for short-term (≤ 12 h) endotracheal intubation in neonatal patients. For mid- to long-term predictions, the model architecture could be improved and the time interval for clinical data after NICU admission reduced. Furthermore, this model could be expanded and applied to subacute and chronic hospitalization in the NICU. To our knowledge, our model is the first MDNN model to predict the need for intubation within the next 3 h in neonates with respiratory distress.

The study period coincided with the COVID-19 pandemic; fortunately, there were no COVID-19-related effects in this study. All pregnant women and newborns admitted to our institute underwent severe acute respiratory syndrome coronavirus 2 reverse transcription-polymerase chain reaction testing. None of the enrolled infants had COVID-19 infection.

Conclusions

The integration of clinical data and time-series variables using MDNN predicted the need for intubation within the next 3 h in neonates with respiratory distress with 88.2% accuracy. The proposed model could help in the decision-making for neonates with respiratory distress who require endotracheal intubation. Further investigation is required to apply continuous time-series variables to the model and integrate the software into clinical practice.

Data availability

The data that support the findings of this study are available from the corresponding author (dalen@hanmail.net, dalen@chungbuk.ac.kr) upon reasonable request.

References

Escobar, G. J., Clark, R. H. & Greene, J. D. Short-term outcomes of infants born at 35 and 36 weeks gestation: We need to ask more questions. Semin. Perinatol. (Elsevier) 30, 28–33 (2006).
Niknafs, P., Faghani, A., Afjeh, S.-A., Moradinazer, M. & Bahman-Bijari, B. Management of neonatal respiratory distress syndrome employing ACoRN respiratory sequence protocol versus early nasal continuous positive airway pressure protocol. Iran. J. Pediatr. 24, 57 (2014).
de Winter, J. P., De Vries, M. & Zimmermann, L. Clinical practice: Noninvasive respiratory support in newborns. Eur. J. Pediatr. 169, 777–782 (2010).
Wheeler, C. R. & Smallwood, C. D. 2019 year in review: Neonatal respiratory support. Respir. Care 65, 693–704 (2020).
James, C. S., Hallewell, C. P., James, D. P., Wade, A. & Mok, Q. Q. Predicting the success of non-invasive ventilation in preventing intubation and re-intubation in the paediatric intensive care unit. Intensive Care Med. 37, 1994–2001 (2011).
Fernandez-Gonzalez, S. M., Sucasas Alonso, A., Ogando Martinez, A. & Avila-Alvarez, A. Incidence, predictors and outcomes of noninvasive ventilation failure in very preterm infants. Children 9, 426 (2022).
Wheeler, C. R. & Smallwood, C. D. Neonatal respiratory support: 2019 year in review. Respir. Care 65, 5 (2020).
Shin, J. E. et al. Pulmonary surfactant replacement therapy for respiratory distress syndrome in neonates: A nationwide epidemiological study in Korea. J. Korean Med. Sci. 35, 253 (2020).
Ng, E. H. & Shah, V. Guidelines for surfactant replacement therapy in neonates. Paediatr. Child Health 26, 35–41 (2021).
Philip, A. G. Bronchopulmonary dysplasia: Then and now. Neonatology 102, 1–8 (2012).
Gulczyńska, E., Szczapa, T., Hożejowski, R., Borszewska-Kornacka, M. K. & Rutkowska, M. Fraction of inspired oxygen as a predictor of CPAP failure in preterm infants with respiratory distress syndrome: A prospective multicenter study. Neonatology 116, 171–178 (2019).
Jing, L., Na, Y. & Ying, L. High-risk factors of respiratory distress syndrome in term neonates: A retrospective case–control study. Balkan Med. J. 2014, 64–68 (2014).
Condò, V. et al. Neonatal respiratory distress syndrome: Are risk factors the same in preterm and term infants?. J. Matern. Fetal Neonatal Med. 30, 1267–1272 (2017).
Feng, J., Lee, J., Vesoulis, Z. A. & Li, F. Predicting mortality risk for preterm infants using deep learning models with time-series vital sign data. NPJ Digit. Med. 4, 1–8 (2021).
Article Google Scholar
Villarroel, M. et al. Non-contact physiological monitoring of preterm infants in the neonatal intensive care unit. NPJ Digit. Med. 2, 1–18 (2019).
Article Google Scholar
King, A., Blank, D., Bhatia, R., Marzbanrad, F. & Malhotra, A. Tools to assess lung aeration in neonates with respiratory distress syndrome. Acta Paediatr. 109, 667–678 (2020).
Article PubMed Google Scholar
Siu, B. M. K., Kwak, G. H., Ling, L. & Hui, P. Predicting the need for intubation in the first 24 h after critical care admission using machine learning approaches. Sci. Rep. 10, 1–8 (2020).
Article Google Scholar
Rozencwajg, S., Pilcher, D., Combes, A. & Schmidt, M. Outcomes and survival prediction models for severe adult acute respiratory distress syndrome treated with extracorporeal membrane oxygenation. Crit. Care 20, 1–10 (2016).
Article Google Scholar
Tagliaferro, T., Bateman, D., Ruzal-Shapiro, C. & Polin, R. Early radiologic evidence of severe respiratory distress syndrome as a predictor of nasal continuous positive airway pressure failure in extremely low birth weight newborns. J. Perinatol. 35, 99–103 (2015).
Article CAS PubMed Google Scholar
Ezz-Eldin, Z. M., Hamid, T. A. A., Youssef, M. R. L. & Nabil, H.E.-D. Clinical risk index for babies (CRIB II) scoring system in prediction of mortality in premature babies. J. Clin. Diagn. Res. JCDR 9, SC08 (2015).
PubMed Google Scholar
Vapnik, V., Golowich, S. & Smola, A. Support vector method for function approximation, regression estimation and signal processing. Adv. Neural Inf. Process. Syst. 3, 9 (1996).
Google Scholar
Chen, T. & Guestrin, C. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 785–794.
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 30 (2017).
Varzaneh, Z. A., Orooji, A., Erfannia, L. & Shanbehzadeh, M. A new COVID-19 intubation prediction strategy using an intelligent feature selection and K-NN method. Inform. Med. Unlocked 28, 100825 (2022).
Article PubMed Google Scholar
Afrash, M. R., Kazemi-Arpanahi, H., Nopour, R., Tabatabaei, E. S. & Shanbehzadeh, M. Proposing an intelligent monitoring system for early prediction of need for intubation among COVID-19 hospitalized patients. J. Environ. Health Sustain. Dev. 7, 1698–1707 (2022).
Google Scholar
Clark, M. T. et al. Predictive monitoring for respiratory decompensation leading to urgent unplanned intubation in the neonatal intensive care unit. Pediatr. Res. 73, 104–110 (2013).
Article ADS PubMed Google Scholar
Xue, B. et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw. Open 4, e212240–e212240 (2021).
Article PubMed PubMed Central Google Scholar
Farzaneh, N., Williamson, C. A., Gryak, J. & Najarian, K. A hierarchical expert-guided machine learning framework for clinical decision support systems: An application to traumatic brain injury prognostication. NPJ Digit. Med. 4, 1–9 (2021).
Article Google Scholar
Duckworth, C. et al. Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19. Sci. Rep. 11, 1–10 (2021).
Article Google Scholar
van den Bosch, T. et al. Predictors of 30-day mortality among Dutch patients undergoing colorectal cancer surgery, 2011–2016. JAMA Netw. Open 4, e217737–e217737 (2021).
Article PubMed PubMed Central Google Scholar
Ziobrowski, H. N. et al. Development and validation of a model to predict posttraumatic stress disorder and major depression after a motor vehicle collision. JAMA Psychiatr. 78, 1228–1237 (2021).
Article Google Scholar
Brix, N., Sellmer, A., Jensen, M. S., Pedersen, L. V. & Henriksen, T. B. Predictors for an unsuccessful INtubation-SURfactant-Extubation procedure: A cohort study. BMC Pediatr. 14, 1–8 (2014).
Article Google Scholar
Group H.S. Randomized study of high-frequency oscillatory ventilation in infants with severe respiratory distress syndrome. J. Pediatr. 122, 609–619 (1993).
Article Google Scholar
Fang, J. L., Mara, K. C., Weaver, A. L., Clark, R. H. & Carey, W. A. Outcomes of outborn extremely preterm neonates admitted to a NICU with respiratory distress. Arch. Dis. Child Fetal Neonatal Ed. 105, 33–40 (2020).
Article PubMed Google Scholar
Greiner, E., Wittwer, A., Albuisson, E. & Hascoët, J.-M. Outcome of very premature newborn receiving an early second dose of surfactant for persistent respiratory distress syndrome. Front. Pediatr. 9, 663697 (2021).
Article PubMed PubMed Central Google Scholar
Sweet, D. G. et al. European consensus guidelines on the management of respiratory distress syndrome—2019 update. Neonatology 115, 432–450 (2019).
Article PubMed Google Scholar
Jung, Y. J. Causes of transfer of neonates (born after≥ 34 weeks of gestation) to the neonatal intensive care unit owing to respiratory distress and their clinical features. Neonatal Med. 25, 66–71 (2018).
Article Google Scholar
Salvo, V. et al. Comparison of three non-invasive ventilation strategies (NSIPPV/BiPAP/NCPAP) for RDS in VLBW infants. J. Matern. Fetal Neonatal Med. 31, 2832–2838 (2018).
Article PubMed Google Scholar
Troshani, A. & Vevecka, E. Respiratory morbidity in term infants delivered by elective caesarean section: Cohort study. Original Sci. Pap. 23, 238–243 (2018).
Google Scholar
Kim, H. A., Yang, G. E. & Kim, M. J. Early neonatal respiratory morbidities in term neonates. Neonatal Med. 22, 8–13 (2015).
Article Google Scholar
Bahadue, F. L. & Soll, R. Early versus delayed selective surfactant treatment for neonatal respiratory distress syndrome. Cochrane Database Syst. Rev. 11, CD001456 (2012).
PubMed Google Scholar
Laban, M., Mansour, G. M., Elsafty, M. S., Hassanin, A. S. & EzzElarab, S. S. Prediction of neonatal respiratory distress syndrome in term pregnancies by assessment of fetal lung volume and pulmonary artery resistance index. Int. J. Gynecol. Obstet. 128, 246–250 (2015).
Article Google Scholar
Heiring, C. et al. Predicting respiratory distress syndrome at birth using a fast test based on spectroscopy of gastric aspirates: 2. Clinical part. Acta Paediatr. 109, 285–290 (2020).
Article CAS PubMed Google Scholar
Raschetti, R. et al. Estimation of early life endogenous surfactant pool and CPAP failure in preterm neonates with RDS. Respir. Res. 20, 1–8 (2019).
Article Google Scholar
Betts, K. S., Kisely, S. & Alati, R. Predicting neonatal respiratory distress syndrome and hypoglycaemia prior to discharge: Leveraging health administrative data and machine learning. J. Biomed. Inform. 114, 103651 (2021).
Article PubMed Google Scholar
Kakkilaya, V. et al. Early predictors of continuous positive airway pressure failure in preterm neonates. J. Perinatol. 39, 1081–1088 (2019).
Article PubMed Google Scholar
Dani, C. et al. The INSURE method in preterm infants of less than 30 weeks’ gestation. J. Matern. Fetal Neonatal Med. 23, 1024–1029 (2010).
Article PubMed Google Scholar
Göpel, W. et al. Avoidance of mechanical ventilation by surfactant treatment of spontaneously breathing preterm infants (AMV): An open-label, randomised, controlled trial. Lancet 378, 1627–1634 (2011).
Article PubMed Google Scholar
De Bernardo, G. et al. Predict respiratory distress syndrome by umbilical cord blood gas analysis in newborns with reassuring Apgar score. Ital. J. Pediatr. 46, 1–6 (2020).
Article Google Scholar
Dell’Orto, V. et al. Early nasal continuous positive airway pressure failure prediction in preterm infants less than 32 weeks gestational age suffering from respiratory distress syndrome. Pediatr. Pulmonol. 56, 3879–3886 (2021).
Article PubMed Google Scholar
Murki, S., Kandraju, H., Oleti, T. & Gaddam, P. Predictors of CPAP failure—10 years’ data of multiple trials from a single center: A retrospective observational study. Indian J. Pediatr. 87, 891–896 (2020).
Article PubMed Google Scholar
Radicioni, M. et al. How to improve CPAP failure prediction in preterm infants with RDS: A pilot study. Eur. J. Pediatr. 180, 709–716 (2021).
Article CAS PubMed Google Scholar
Kruczek, P., Krajewski, P., Hożejowski, R. & Szczapa, T. FiO₂ before surfactant, but not time to surfactant, affects outcomes in infants with respiratory distress syndrome. Front. Pediatr. 9, 1042 (2021).
Article Google Scholar
Hou, N. et al. Predicting 30-days mortality for MIMIC-III patients with sepsis-3: A machine learning approach using XGboost. J. Transl. Med. 18, 1–14 (2020).
Article Google Scholar
Sandfort, V., Johnson, A. E., Kunz, L. M., Vargas, J. D. & Rosing, D. R. Prolonged elevated heart rate and 90-day survival in acutely ill patients: Data from the MIMIC-III database. J. Intensive Care Med. 34, 622–629 (2019).
Article PubMed Google Scholar
Cabrera-Quiros, L. et al. Prediction of late-onset sepsis in preterm infants using monitoring signals and machine learning. Crit. Care Explor. 3, e0302 (2021).
Article PubMed PubMed Central Google Scholar
Song, W. et al. A predictive model based on machine learning for the early detection of late-onset neonatal sepsis: development and observational study. JMIR Med. Inform. 8, e15965 (2020).
Article PubMed PubMed Central Google Scholar
Anne, R. P. & Murki, S. Noninvasive respiratory support in neonates: A review of current evidence and practices. Indian J. Pediatr. 88, 670–678 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Funding

This research was supported by a grant from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number HI21C1074070021). The funder had no role in the study design, data collection and analysis, decision to publish, or manuscript preparation.

Author information

These authors contributed equally: Jueng-Eun Im and Seung Park.

Authors and Affiliations

Biomedical Engineering, Chungbuk National University Hospital, Cheongju, Republic of Korea
Jueng-Eun Im & Seung Park
Department of Pediatrics, Chungbuk National University Hospital, Chungbuk National University College of Medicine, Chungdae-ro 1, Seowon-gu, Cheongju, 28644, Republic of Korea
Yoo-Jin Kim, Shin Ae Yoon & Ji Hyuk Lee

Authors

Jueng-Eun Im
Seung Park
Yoo-Jin Kim
Shin Ae Yoon
Ji Hyuk Lee

Contributions

S.A.Y. designed the study and collected the datasets. J.I. developed a multimodal deep neural network model and conducted performance evaluation. S.P. and S.A.Y. interpreted the experimental results. J.I. drafted the manuscript. Y.J.K. and J.H.L. collected the data and reviewed the manuscript. S.P. and S.A.Y. reviewed and revised the manuscript. S.P. and S.A.Y. coordinated and supervised data collection and critically reviewed the manuscript. All authors have approved the final manuscript as submitted.

Corresponding author

Correspondence to Shin Ae Yoon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Im, JE., Park, S., Kim, YJ. et al. Predicting the need for intubation within 3 h in the neonatal intensive care unit using a multimodal deep neural network. Sci Rep 13, 6213 (2023). https://doi.org/10.1038/s41598-023-33353-2

Received: 19 July 2022
Accepted: 12 April 2023
Published: 17 April 2023
DOI: https://doi.org/10.1038/s41598-023-33353-2
Springer Nature Limited

Introduction