Introduction

Premature infants are a very vulnerable patient group, and prematurity is the leading cause of neonatal death worldwide. The mortality rate of extremely preterm infants (less than 27 weeks gestation) and extremely low birth weight (ELBW) infants (< 1000 g) is 30–50% [1]. Survivors are also at risk of BPD, necrotizing enterocolitis (NEC), intraventricular hemorrhage (IVH), retinopathy of prematurity (ROP), and septicemia [1].

The “golden hour” is a term initially used in adult trauma emergency care; however, it has increasingly been applied to other medical fields, including neonatology, with a particular focus on extremely low gestational age (GA) preterm infants [2, 3]. For preterm infants, the “golden hour” is the first hour of life after delivery and includes the time during resuscitation in the delivery room (DR), transport and admission to the neonatal intensive care unit (NICU). It involves a series of interdependent tasks and procedures of DR resuscitation and maintenance of stability during NICU admission [3].

The importance of the initial treatment after birth of preterm infants has been demonstrated in previous studies [4, 5]. Studies have found that hypothermia is prevalent in very low birth weight (VLBW) infants and is associated with IVH and mortality [6, 7]. Several studies demonstrated that quality improvement (QI) improved admission hypothermia in preterm infants [8,9,10,11]. Other studies on the golden hour interventions for premature infants have focused on outcomes including BPD, IVH, or hypotension [12,13,14,15,16]. Despite the complexity of the early care, golden hour QI protocols including evidence-based practices and standardized implementation process to the first few minutes after birth have been used to improve the quality of care for extremely premature and extremely low birth weight (EP-ELBW) infants [17,18,19,20,21,22].

A synthesis of literature can provide valuable information for clinical decision-making and future research. However, to the best of our knowledge, no study has systematically summarized the existing literature on the implementation of golden hour QI protocol and evaluated its impact on the outcomes of premature infants. Therefore, we sought to conduct a scoping review to summarize these studies. We chose the scoping review methodology, because it can be conducted to meet various objectives, allowed the authors to include multiple study outcomes and explore a broad clinical question [23].

Methods

This scoping review followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) [23].

Search strategy

A comprehensive literature search was conducted in PubMed, Embase, the Cochrane Library, and SinoMed databases from inception to April 03, 2023. Relevant studies were identified and search accuracy was maximized using the following terms: golden hour, QI, preterm infants, and newborn. There were no search limits or restrictions. Moreover, reference lists of included studies and relevant systematic reviews were further searched to identify additional studies. The detailed search strategy in PubMed is shown in Supplementary material 1 in the addendum.

Eligibility criteria

Articles were eligible for inclusion if: (1) the population was preterm infants, very-low-birth-weight (VLBW) infants, or extremely-low-birth-weight (ELBW) infants; (2) the intervention was improvement initiatives focused on the golden hour after birth and presented outcomes over time; (3) eligible study designs were randomized controlled trials (RCTs), non-randomized interventional studies, pre-post comparative studies, or interrupted time series study (ITS). Articles were excluded if they investigated only a single intervention. Conference abstracts and review articles were also excluded. There was no limitation on the publication language.

Outcome measures

The primary outcomes included admission temperatures, serum glucose concentration, time to completion of stabilization. Hypoglycemia was defined as glucose < 45 mg/dL. Time to completion of stabilization was defined as time to close of incubator top in minutes. Completion of admission stabilization included DR resuscitation, initiation of intravenous fluids and antibiotics (when indicated), the establishment of central access, initiation of humidity, and provision of decreased environmental stimulation to approximate the intrauterine environment, and closure of the top of the isolette.

Process outcomes included time to intubation, time to surfactant, time to NICU admission, time to admission temperature, time to umbilical line placement(confirmation), time to initiation of IV fluids, time to initiation of antibiotics. All times were defined as the time from birth to measurement in minutes.

Balancing outcomes included hyperthermia, insertion-related catheter-associated bloodstream infections, irrational use of antibiotics (defined as antibiotics initiated without a physician order), mechanical ventilation duration, and postmenstrual age (PMA) on the day of discharge home.

The main long-term outcomes included: severe IVH (Grade III or IV hemorrhages), BPD (defined as oxygen or positive pressure support required at 36 weeks PMA), and mortality.

Study selection and data extraction

Two researchers (LJS and GCZ) independently screened the titles, abstracts, and full texts of all studies for eligibility. Any discrepancies were resolved through a consensus discussion with a third author. Two researchers (LJS and RRX) independently extracted data using a standardized data collection form. The following details were extracted from each study: author(s), publication year, location, setting, study duration, sample size, target population, study design, QI bundled elements, primary outcomes, secondary outcomes, process outcomes and balancing outcomes. Any differences in the abstracted data were resolved by consensus with a third researcher (ZBY).

Critical appraisal

The Quality Improvement Minimum Quality Criteria Set (QI-MQCS) checklist was used for the critical appraisal of the methodological quality of the included studies [23]. It includes the following 16 content domains: Organizational Motivation, Intervention Rationale, Intervention Description, Organizational Characteristics, Implementation, Study Design, Comparator, Data Source, Timing, Adherence/Fidelity, Health Outcomes, Organizational Readiness, Penetration/Reach, Sustainability, Spread, and Limitations. Each study was evaluated based on the 16 content domains, with each domain recording 1 point if it met the minimum criteria, and 0 points if it did not. Based on the criteria, the included studies were rated as high, medium, and low quality, where > 10 indicated high quality, 7–10 indicated medium quality, and < 7 indicated low quality. Finally, two authors (LJS. and GCZ) independently assessed the studies, and discrepancies were resolved by group discussion.

Data analysis

Golden hour QI bundled elements were summarized as frequencies and percentages. The histogram of the detailed interventions of the Golden Hour bundled items was constructed in Microsoft Excel 2009. The meta-analysis was conducted using Review Manager 5.4 software. Due to the absence of heterogeneity between studies (I2 ≤ 50%), a fixed-effects model approach was used to analysis. We reported RR and 95% CI using the Mantel–Haenszel (MH) method for dichotomous outcomes to summarize the effect on long-term outcomes. P < 0.05 was considered significant.

Results

A total of 218 articles were initially identified, and 53 duplicates were excluded. Of the remaining 165 articles, 146 completely unrelated studies were excluded after screening titles and abstracts. After a full-text review of the remaining 19 studies, nine studies were excluded for the following reasons: three studies were excluded because they did not include a complete intervention in the first hour after birth [12, 24, 25], one study was excluded due to a mismatch in primary outcome [13], four studies were conference abstracts [26,27,28,29], and one study was not QI research [30]. Ten studies were finally included in the scoping review [17,18,19,20,21, 31,32,33,34,35]. The study selection flow chart is shown in Fig. 1.

Fig. 1
figure 1

Study selection flow chart

Study characteristics

Characteristics of the included studies are detailed in Table 1. Nine studies were conducted in the USA and one study was conducted in Israel. All studies were from single centers. Seven reports were pre-post comparative studies (70%) and three were observational studies. The studies were published between 1999 and 2017. The sample sizes ranged from 24 to 1439 (median = 190). Six studies used GA alone as an inclusion criterion and three studies used GA and BW as an inclusion criterion; GA ranged from < 27 weeks to < 33 weeks and BW ranged from < 1000 g to < 1500 g. One report used BW (< 1500 g) alone as an inclusion criterion.

Table 1 Characteristic of included studies

Critical appraisal

QI-MQCS was used to assess QI publications, and all included studies had scores between 7 and 14 points (Table 2). Eight studies were medium-quality while two were high-quality. All studies met the minimum quality criteria for five of the 16 domains, including Organizational Motivation, Intervention Rationale, Intervention Description, Implementation and Organizational readiness. Most of the studies, however, failed to meet the minimum standards in various areas, including the lack of information about comparator care processes (Comparator, 6/10), data sources and outcome definition (Data source, 5/10), compliance with the intervention for the duration of the study, fidelity data on intervention use, or described mechanisms that ensure compliance (Adherence/Fidelity, 7/10), the number of units or sites participating in the intervention compared to the available/eligible units (Penetration/Reach, 10/10), the sustainability or the potential for sustainability (Sustainability, 9/10), and the potential for spread, existing tools for spread, or spread attempts/large-scale rollout (Spread, 9/10).

Table 2 Critical Appraisal of studies using QI-MQCS [23]

QI bundled elements

QI bundle elements included 11 interventions (Table 3). The most common elements were antenatal counseling and team briefing (10/10), prevention of hypothermia (10/10), respiratory system support (10/10), and detailed record keeping (10/10). Other relatively common elements included: rational use of oxygen (7/10), cardiovascular system support (6/10), early nutritional care (8/10), prevention of infection (6/10), laboratory investigation (8/10), and communication with family (6/10). Only one study reported delayed cord clamping (DCC) (1/10).

Table 3 Interventions included in the golden hour QI

Detailed interventions for each element were shown in Fig. 2. Most studies reported two interventions to prevent hypothermia: use of plastic wrap or bag, plastic caps, radiant warmer, thermal mattress, pre-warmed incubators, warm humidified gases (10/10) and maintaining DR temperature at 26–28℃ (9/10). The most common respiratory support included DR continuous positive airway pressure (CPAP) (8/10), T-piece resuscitation (4/10), and early rescue pulmonary surfactant (PS) (9/10). The rational use of oxygen included starting resuscitation with low-oxygen concentrations of 21–30% in preterm neonates (3/10) and targeted saturation (5/10). The most common cardiovascular support was the assessment of heart rate, maintenance of normal perfusion, and blood pressure (6/10). Most studies reported interventions in early nutritional care, including insertion of umbilical lines or cannula (8/10), total parenteral nutrition and enteral nutrition (6/10), starting intravenous fluids infusion, and prevention of hypoglycemia (7/10). The most commonly reported infection control was the use of antibiotics as a first dose if indicated (6/10). Laboratory investigation included two aspects: complete blood count, blood culture, glucose, and arterial blood gas analysis from the central line (5/10) and chest X-ray confirmed the location of umbilical arterial and venous catheters or endotracheal tube (7/10). A detailed record keeping included complete resuscitation record, birth weight, the axillary temperature at admission to nursery, time of surfactant instillation, time of umbilical catheterization, and position of the endotracheal tube, umbilical catheters, and feeding tube. No study reported skin-to-skin contact, detection of shock in compensated phase, and the use of strict asepsis methods.

Fig. 2
figure 2

Detailed interventions for each QI bundle element. “n/N” represents the number of included studies/ total number of included studies. DR: delivery room; CPAP: continuous positive airway pressure; PS: pulmonary surfactant; HR: heart rate, BP: blood pressure; TPN: total parenteral nutrition; UAC/UVC: umbilical arterial and venous catheters; ETT: endotracheal tube

Primary outcomes

This review included three primary outcomes (Table 1). The effect on outcomes were summarized in Table S1. The most common primary outcome was admission temperatures (9/10). Five studies reported improvements in admission temperature after QI (n = 2308), including improvements in hypothermia (59% vs. 26% vs. 38% for pre-protocol, Phase I, and II, respectively; p = 0.001), percentage of reaching target temperature (40% vs. 57%; p = 0.001, and 28.3% vs. 49.6%; p = 0.002), and other two mean admission temperatures (35.26 vs. 36.26℃; p < 0.001 and 36.04 ± 0.81 vs. 36.56 ± 0.82℃ vs. 36.68 ± 0.65 for pre-, initial, and revised QI, respectively; p = 0.0001) [17, 18, 20, 32, 35]. One study reported an increasing trend in the percentage of infants admitted with temperature in the normothermic range post-intervention (54.8% vs. 71.3%; p = 0.056) [21]. Four studies reported glucose after improvement (n = 2052) [17, 18, 20, 32], of which one study reported improvement of hypoglycemia (18% vs. 7% vs. 4% for pre-protocol, phase I, and II, respectively; p = 0.012 ) [32], and one reported proportion of glucose > 50 mg/dL increased (72.3% vs. 55.7%; p = 0.012) [20], while another study found an increased incidence of hypoglycemia (25.8% vs. 35.1%; p = 0.047) [17]; however, no difference was found in subgroups with gestational age ≤ 28 weeks.

Process outcomes and balancing outcomes

One study did not report any major process outcomes [17] (Table 4). Five studies reported initial infusion time [18, 32,33,34, 20], and two study found a statistical difference after QI (78.9 ± 43.3 vs. 27.4 ± 12.7 min; p < 0.01 and median [IQR], 55 [26] vs. 106 [40]([26]and [40]were [IQR], just a number, not references) minutes; p < 0.001) [18, 20]. Three studies reported the time to initiation of antibiotics [32,33,34] and only one study reported a descending trend in the use of ampicillin and gentamicin [33]. Six studies reported time to surfactant [18, 19, 31, 32, 34, 35], of which one study showed decreased in time (79.8 ± 56.6 vs. 30.8 ± 21.8; p < 0.01) [18]. Only one study reported improvement in the stabilization time (median [IQR]: 110 [89–138] vs. 111 [94–135] vs. 92 [74–129] minutes for pre-protocol, Phase I, and II respectively; p = 0.004]) [32]. One study reported time to intubation [19] but found no effect. One study reported a decline in time to umbilical line placement [19]. Three studies found no effect in time to NICU admission [18, 19, 21]. Six studies reported major balancing outcomes (Table 4) [17,18,19, 21, 32, 34]. Two studies found no difference in mechanical ventilation duration [17, 19], and one study reported a decline in PMA on the day of discharge home [21].

Table 4 Process outcomes and balancing outcomes included in the golden hour QI

Long-term outcomes

Long-term outcomes reported in the included studies mostly were comorbidities of prematurity. Three studies did not report any long-term outcomes [20, 33, 34] (Table 1). Five studies reported severe IVH (n = 1201) [17,18,19, 21, 32]. The pooled estimate showed no statistically significant difference after QI: 55/570 (9.6%) vs. 59/530(11.1%) [OR = 0.79, 95% CI 0.53–1.18, p = 0.25]. Four studies involving 907 preterm infants showed the incidence of BPD was lower after QI to the controls: 106/408 (26.0%) vs. 122/424 (29.5%) [OR = 0.68, 95% CI 0.48–0.97, p = 0.04] [17,18,19, 21]. Five studies reported mortality (n = 1201) [17,18,19, 21, 32]. The pooled estimate did not show a statistically difference in mortality after QI: 93/580 (16.0%) vs. 68/541(12.6%) [OR = 1.32, 95% CI 0.92–1.87, p = 0.13] (Figs. 3, 4 and 5).

Fig. 3
figure 3

The incidence of severe IVH before and after QI

Fig. 4
figure 4

The incidence of BPD before and after QI

Fig. 5
figure 5

The mortality before and after QI

One study reported a decline in ROP requiring treatment [18]. One study reported decreased incidence of late-onset sepsis (LOS) [17]. Three studies reported NEC incidence [17,18,19] while another study reported patent ductus arteriosus (PDA) incidence [19]; however, no statistical difference was found in all studies. Only one study demonstrated improvement in the assessment of clear role definitions in the DR and the organization and efficiency both in the DR and during the NICU admission (p < 0.05) [21].

Discussion

This scoping review including ten studies summarized golden hour QI practices for extremely preterm neonates (GA ≤ 32 weeks and/or BW < 1500 g). The included studies had substantial heterogeneity and deficiencies. Most studies focused on primary and long-term outcomes but lacked detailed definitions of process and balancing measures. Most studies showed improvement in hypothermia, but the range of target temperature varied. Several studies showed improvement in hypoglycemia. However, the QI bundle had no consistent effect on other primary outcomes. A single study found improvement in time to surfactant and time to completion of stabilization respectively. The pooled estimate showed the incidence of BPD decreased after QI, but no statistically significant difference in severe IVH and mortality.

The QI bundle is often complex, multi-component, and customized for specific settings and has also evolved. Valid and reliable critical appraisal tools advance QI intervention impacts by helping stakeholders to identify higher-quality studies [36]. We selected the QI-MQCS checklist to guide the critical appraisal of included studies. Most included studies were of medium quality (80%) and lacked information on the comparator, adherence/fidelity, penetration/reach, sustainability, and spread. To conduct high-quality QI research, attention should be paid to the neglected content domains in addition to the common ones. Other widely used QI assessment methods include Plan-Do-Study-Act (PDSA) cycle, which was first published in 2009 [37], key driver diagrams, Pareto charts, and time-series data analysis [38]. Three included studies used PDSA to evaluate the effectiveness of interventions [18, 32, 34] and one study used the Lean Six Sigma methodology [32]. Four studies used statistical process control charts to analyze process performance [17, 18, 32, 34]. This review found that the application of these methods was published between 2016 and 2020.

There was variability within the bundled elements. The most common elements included antenatal counseling and team briefing, prevention of hypothermia, respiratory system support, and detailed record keeping. All the included studies indicated significant staff education, training, and multidisciplinary teamwork in the QI implementation. Studies have demonstrated the importance of continuous education and training process, team briefing, communication and multidisciplinary collaboration in the success of QI initiatives [39,40,41]. However, common element such as providing skin-to-skin contact was not reported. A Cochrane review reported that early skin-to-skin contact initiated promote breast-feeding and successful newborn transition to the outside world [42]. Another study found that a dose-dependent relationship appears to exist between skin-to-skin contact and breastfeeding, with longer durations of first contact [43]. Critically, women who have early skin-to-skin contact with their newborns have a lowered risk of postpartum hemorrhage and faster expulsion of the placenta than women who do not experience skin-to-skin contact [42]. Therefore, it is significant to emphasize early skin-to-skin contact in future Golden Hour QI studies. DCC was only reported in one study. However, a survey of Australian and New Zealand Neonatal Network units’ golden hour practices for initial stabilization of very preterm infants showed that DCC was practiced ‘always’ or ‘often’ by 21 units (88%) [44]. A meta-analysis involving 48 studies showed that DCC may reduce the risk of death before discharge (average risk ratio (aRR) 0.73, 95% confidence interval (CI) 0.54–0.98, moderate certainty) compared with early cord clamping [45]. Future studies should focus on the neglected interventions and identify the most important components of individual elements.

Process outcomes that directly contributed to thermoregulation, glycemic control, and time to completion of stabilization were tracked. In our study process measures varied substantially. The measures that closely associated with primary outcomes, such as time to NICU admission, intubation time, central line placement time, could be effective drivers of QI initiatives. Due to a changing airway management in the DR over time, non-invasive support is preferred today. A study about respiratory management protocol in DR, including DCC, in combination with optimized nCPAP with high PEEP levels and less invasive surfactant administration (LISA), showed a lower rate of intubation in DR and were less likely to need mechanical ventilation on day 3 and during the hospital stay after QI, and did not differ in terms of mortality or neonatal morbidity between the two groups [46]. The establishment of a standardized process measurement could help to better implement QI in preterm care and assess the association with outcomes.

Balancing measures are necessary to track in the QI project, which help us spot unintended consequences (good or bad) or issues that can occur as a result of process changes [47]. However, most of the included studies did not clearly define balancing outcomes. It is noteworthy that some balancing measures were ignored, such as hyperthermia, only two studies reported [32, 34], and the latter also reported insertion-related catheter-associated bloodstream infections and antibiotics initiated without an order. On the “golden hour” checklist, “time of initiation of antibiotics” can easily be misinterpreted as all infants need antibiotics regardless of risk factors or physician order. Tracking the duration of mechanical ventilation was to determine if changes in the QI practices affected ventilation time and length of stay. Thus, future studies should comprehensively evaluate the benefits and adverse consequences of QI.

The commonly reported long-term outcomes were BPD (70%), severe IVH (50%) and mortality (50%) in this review. A systemic review involving 22 reports showed that respiratory QI interventions successfully reduced BPD or other key respiratory measures, particularly for infants with BW over 1000 g [32, 34]. Another QI of golden hour management of respiratory distress syndrome in preterm newborns showed that the overall BPD rate decreased from 33.5 to 16.5% [13]. Other long-term outcomes were rarely reported, including ROP, LOS, NEC, PDA, and early blood transfusion, 13 times in total.

The current study summarized the experiences of the implementation and impact of the golden hour QI protocol for extremely preterm infants, which may help inform future studies of higher-quality golden hour QI. To our knowledge, this is the first scoping review to evaluate the effect of the golden hour QI bundle on the outcomes of preterm infants.

However, this systemic review has several limitations. First, quantitative synthesis of primary outcomes was not carried out due to significant heterogeneity across studies, including demographic characteristics, the composition of interventions, implementation methods, definition (e.g. target temperature), interested outcomes, etc. Second, because some of the included trials had small sample sizes and low event rates, selective reporting and publication biases were inevitable. However, current published studies are informative and provide the basis for further studies to improve the golden hour care of preterm infants.

Conclusion

This scoping review first summarized previous literature on the implementation of the golden hour QI protocol and evaluated its impact on the outcomes of extremely preterm infants (GA ≤ 32 weeks and/or BW < 1500 g). Our study showed that the golden hour QI bundle can improve the short-term and long-term outcomes for extremely preterm infants in the first hour after birth. Most studies showed that the interventions improved hypothermia and hypoglycemia, but the impact on other outcomes was inconsistent. The pooled estimate showed the incidence of BPD decreased after QI, but no statistically significant difference in severe IVH and mortality. The key to the success and maintenance of the Golden Hour QI initiative is a continuous education and training process and the ongoing collaboration of multidisciplinary team members. There was considerable heterogeneity and deficiencies in the included studies, and the variation in impact on outcomes suggests the need to use standardized and validated measures. Future studies are necessary to develop locally appropriate, high-quality, and replicable QI projects.