Introduction

Preterm infants in the neonatal intensive care unit (NICU) frequently undergo painful procedures such as venipuncture, heel-stick, and endotracheal suctioning, as well as orogastric tube insertion [1]. The pain and stress from frequent procedures can have both transient and enduring impacts on the behavior, physiology, and neurodevelopment of preterm infants [2]. Research indicated that at 7 years of age, preterm children who underwent more invasive neonatal procedures had higher salivary cortisol levels and internalizing behavior scores greater than full-term children [3]. Another research found that cumulative pain and stress were associated with neurobehavioral outcomes such as stress/abstinence and habituation responses in preterm infants [4]. Reports from South Korea, Canada and Kenya indicated that many preterm infants continued to receive highly invasive procedures without adequate analgesia, highlighting an ongoing need to improve pain management practices in this vulnerable population [5,6,7,8].

Given the suboptimal pain management practices and risk of adverse outcomes demonstrated in preterm infants, there is growing interest in identifying and evaluating effective analgesic interventions for this population. However, a review reported that commonly used anesthetic and sedative agents may have both acute and long-term detrimental neurological impacts in preterm infants [9]. The American Academy of Pediatrics also stated that the long-term effects and safety of pharmacologic analgesia are yet to be studied [10]. Clearly, there is a need to explore alternative, neuroprotective pain management strategies in this vulnerable population. In recent years, non-pharmacological interventions such as skin-to-skin contact, non-nutritive sucking, facilitated tucking position, breastfeeding, oral sucrose, olfactory stimulation, and music therapy have emerged as effective methods for pain management in preterm infants [11, 12]. Evidence has already confirmed their efficacy and safety in pain management and some other pain-related indicators such as oxygen saturation, respiratory rate, and heart rate [13,14,15,16,17,18].

Previous systematic reviews have primarily examined the effectiveness of individual or combined non-pharmacologic interventions for the treatment of pain in preterm infants [13,14,15, 17, 18]. While there have been several recent systematic reviews assessing the effectiveness of different non-pharmacological interventions, it is important to note that these reviews have not encompassed the entire spectrum of interventions, and the evidence has not been consolidated [11, 16, 19, 20]. As such, the objective of this network meta-analysis is to integrate various non-pharmacological interventions and evaluate their efficacy in managing pain in preterm infants, providing high-quality clinical evidence for improving pain care.

Methods

Search strategy

The review was conducted and reported following the PRISMA guidelines [21]. The protocol for this meta-analysis has been registered in the PROSPERO database (CRD42023412200). We searched PubMed, Web of Science, Embase, and the Cochrane Library for randomized controlled trials (RCTs) from 2000 to the present (updated March 2023) using the targeted search strategy provided in the Data Supplement (Additional file 1: Appendix Table 1). The search was restricted to English articles. The search strategy used both medical subject heading terms and keywords for pain, infant, preterm, neonatal intensive care unit and so on.

Study selection

Two authors (Yuwei Weng and Jie Zhang) independently evaluated the articles to determine their eligibility for inclusion, and differences were addressed by agreement. The eligible full texts were reviewed after they were screened for titles and abstracts (Fig. 1). The criteria for inclusion were as follows: (1) The participants were preterm infants in the NICU (gestational age < 37 weeks). (2) Studies were RCTs. (3) The experimental group implemented tactile relief (Kangaroo mother care, massage, etc.), auditory intervention (mothers’ voice, white noise, lullaby, etc.), olfactory stimulation (maternal breast milk odor, vanilla odor, amniotic fluid odor, etc.), combined oral sucrose and non-nutritive sucking (OS + NNS), facilitated tucking, or mixed intervention. (4) The control group received routine nursing care, including placebo, pacifier, and incubator. The criteria for exclusion were as follows: (1) Full-term infants or other non-preterm infants. (2) Results of pain score, oxygen saturation and heart rate were ambiguous or missing. (3) Non-RCTs, non-English literature, non-human studies, repeated publications, reviews, and meta-analyses were excluded.

Fig. 1
figure 1

Flow diagram of study inclusion and exclusion

Data extraction

Two authors (Yuwei Weng and Jie Zhang) independently reviewed the article and extracted relevant data and parameters. The EndNote X9 software was used to import all the retrieved articles, and duplicates were removed. After a preliminary selection of titles and abstracts, the remaining eligible articles were checked for full text according to the prespecified inclusion and exclusion criteria. When the eligible articles were reviewed, the following parameters were extracted: first author, publication year, RCT design, participants, intervention and control groups (sample size), gestational age and birth weight, painful procedures, and outcomes. Any discrepancies were resolved with the assistance of the third author (Zhifang Chen) throughout the entire process of study search, article review, and data extraction.

Quality assessment

To assess the risk of bias in the included RCTs, two authors (Yuwei Weng and Jie Zhang) independently used the Cochrane Risk of Bias Assessment Tool [22]. This tool evaluated six aspects of the studies, namely selection bias, performance bias, detection bias, attrition bias, reporting bias, and other bias. Each aspect was evaluated through one or more items and was classified as low, high, or unclear risk. Due to the large number of interventions and articles involved in this meta-analysis, if there was any disagreement in the evaluation process, consensus would be reached through discussion. In addition, the certainty of evidence was assessed using the Confidence in Network Meta-Analysis (CINEMA) [23] framework, which comprises six domains: within-study bias, reporting bias, indirectness, imprecision, heterogeneity, and incoherence. The certainty of the results was graded as high, moderate, low, or very low.

Outcomes

The main outcome was pain score. The extraction of pain data was based on the last time node. Data were expressed as continuous variables (SMD). The results of the pain score were evaluated using the Premature Infant Pain Profifile (PIPP) [24]. The scale is a tool designed to assess pain in preterm infants who are between 28 to 36 weeks of gestation. It consists of seven items, which are further categorized into three behavioral items, two physiological items, and two contextual items. The scale was revised and promoted in 2014 to improve its accuracy and sensitivity in consideration of psychometric properties for extremely low gestational age (ELGA) infants and feedback from clinical medical staff on the percentage calculation problem [25]. The scale measures pain on a range of 0 to 21, with higher scores indicating more significant pain. The secondary outcomes measured were oxygen saturation and heart rate. The form of data presentation and the time nodes extracted were consistent with the pain score.

Statistical analysis

Standardized mean differences were initially chosen based on the expression of continuous variables. Following this, the Cochran’s Q statistic and the I2 statistic were used to explore the heterogeneity among studies. The random-effects model was chosen when there was heterogeneity (I2 ≥ 50%). Otherwise, the fixed-effects model was chosen [26, 27]. Afterwords, the efficacy of various interventions was assessed using network meta-analysis, with the consistency of direct and indirect comparisons being evaluated using the loop inconsistency test, and the efficacy ranking of the interventions was observed. Subsequent to the main analysis, sensitivity analyses were carried out and the possibility of publication bias was evaluated through the use of funnel plots. Ultimately, all statistical assessments were conducted with the aid of Stata version 17.0 and the gemte package in R software [28].

Results

Study selection

A comprehensive search was conducted on PubMed, Web of Science, Embase, and the Cochrane Library, resulting in the identification of 14,456 publications. After removing duplicates, 9,996 publications were reviewed. Through preliminary screening of titles and abstracts, 130 studies were identified that focused on non-pharmacological interventions for pain in preterm infants. Of these, 1 was excluded because the full text was unavailable. After reviewing the remaining 129 publications, 13 were excluded due to the fact that the study subjects were not preterm infants. Additionally, 10 publications were excluded because the study sites were not NICUs. 26 publications were excluded due to incomplete or missing outcome data, while 15 were excluded because the control group received non-routine care. Therefore, 35 RCTs of 2134 preterm infants included at the end [29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63]. The PRISMA flow diagram of the included studies is presented in Fig. 1.

Table 1 presents the categorization of the studies based on their characteristics. The studies included in this analysis were published between 2000 to the present (updated March 2023), and the sample size ranged from 20 [35] to 200 [57]. The mean gestational age of preterm infants varied between 26 [45, 60] to 37 [31, 38, 58, 63] weeks, while the mean birth weight ranged from 932.3 [62] to 2,299.03 [30] grams. In addition, the design of RCTs, painful procedures, details of grouping, and outcomes for all studies were also summarized in Table 1.

Table 1 Characteristics of included studies

Risk of bias in included studies

Figure 2 reported the results of the bias risk assessment for the included studies. Of the allocation concealment methods, 3 studies [53, 56, 58] were marked as high risk for not applying, 17 [29,30,31, 36, 39,40,41,42, 44, 46, 48, 49, 51, 52, 54, 55, 62] were marked as unclear risk for not being mentioned in the study, and the remaining were marked as low risk. In the blinding of participants and personnel, 2 studies [58, 59] took a high risk since they couldn’t apply the blinding, and 28 [29,30,31,32, 35, 36, 38,39,40,41,42, 44,45,46,47,48,49,50,51,52,53,54,55,56,57, 60, 62, 63] were marked as unclear risk. In the blinding of outcome assessment, 8 studies [29, 30, 33, 35, 47, 48, 51, 52] were found to be marked as high risk for not applying the blinding and 10 [31, 36,37,38,39, 41, 42, 58, 61, 62] were marked as unclear risk. Among other bias, 1 study [56] was marked as high risk as the study personnel and outcome assessment was the same person. In addition, all studies explained the use of randomization methods and were accordingly marked as low risk. Incomplete outcome data and selective reporting were also not found in the studies.

Fig. 2
figure 2

Risk of bias graph

PIPP scores

A total of 29 RCTs [29,30,31,32,33,34,35,36,37,38, 40,41,42, 44,45,46, 48, 49, 51,52,53,54,55, 57,58,59,60,61,62] were included in this meta-analysis by PIPP score, involving 6 interventions. Olfactory stimulation (8 RCTs), OS + NNS (3 RCTs), facilitated tucking (8 RCTs), auditory intervention (5 RCTs), tactile relief (7 RCTs), and mixed intervention (3 RCTs) were included. A total of 7 nodes were included in this meta-analysis, with each node representing an intervention or control (Fig. 3). The nodes with more significant interactions were control (34 interactions), olfactory stimulation (11 interactions), auditory intervention (10 interactions), facilitated tucking (9 interactions), and tactile relief (8 interactions). The results of the consistency analysis was shown in Additional file 1: Appendix Table 2. The results of the heterogeneity test indicated a high degree of heterogeneity with an I2 value of 97.1%.

Fig. 3
figure 3

Network evidence diagram (PIPP score)

Based on moderate-quality evidence (Additional file 1: Appendix Table 5), OS + NNS had the greatest SUCRA score, followed by facilitated tucking, auditory intervention, olfactory stimulation, tactile relief, mixed intervention, and control group (Fig. 4 and Additional file 1: Appendix Table 3). Compared to the control group, OS + NNS was 3.92 (95% CI: 1.72,6.15, SUCRA score: 0.73) lower, facilitated tucking was 2.51 (95% CI: 1.15,3.90, SUCRA score: 0.29) lower, auditory intervention was 2.48 (95% CI: 0.91. 4.10, SUCRA score: 0.27) lower, olfactory stimulation was 1.80 (95% CI:0.51,3.14, SUCRA score: 0.25) lower, and mixed intervention was 2.26 (95% CI:0.10,4.38, SUCRA score: 0.14) lower (Table 2). This study found that the comparison between tactile relief and the control group wasn’t statistically significant (OR: 1.38, 95% CI: -0.11,2.87, SUCRA score: 0.26). Therefore, the complete probability ranking was OS + NNS (73%) > facilitated tucking (29%) > auditory intervention (27%) > olfactory stimulation (25%) > mixed intervention (14%) > control group (94%) (Additional file 1: Appendix Table 3). The stability and credibility of the results were demonstrated in Additional file 1: Appendix Table 4 through the sensitivity analysis.

Fig. 4
figure 4

Probability ranking diagram of best interventions (PIPP score)

Table 2 The results of network meta-analysis (consistency model, PIPP score)

Oxygen saturation

A total of 14 RCTs [32, 33, 38, 39, 41, 43, 47,48,49,50, 55, 56, 60, 63] were included in this meta-analysis by oxygen saturation, involving 5 interventions. Olfactory stimulation (3 RCTs), OS + NNS (2 RCTs), facilitated tucking (2 RCTs), auditory intervention (6 RCTs), and tactile relief (3 RCTs) were included. A total of 6 nodes were included in this meta-analysis, with each node representing an intervention or control (Fig. 5). The nodes with more significant interactions were control (16 interactions), auditory intervention (8 interactions), olfactory stimulation (5 interactions), and facilitated tucking (5 interactions). The results of the consistency analysis was shown in Additional file 1: Appendix Table 6. The results of the heterogeneity test indicated a high degree of heterogeneity with an I2 value of 81.5%.

Fig. 5
figure 5

Network evidence diagram (oxygen saturation)

Based on moderate-quality evidence (Additional file 1: Appendix Table 9), facilitated tucking had the greatest SUCRA score, followed by tactile relief, auditory intervention, OS + NNS, olfactory stimulation, and control group (Fig. 6 and Additional file 1: Appendix Table 7). Compared to the control group, facilitated tucking was 1.94 (95% CI: 0.66,3.35, SUCRA score: 0.64) higher, the auditory intervention was 1.04 (95% CI: 0.22,2.04, SUCRA score: 0.36) higher, and the remaining comparisons were not statistically significant (Table 3). Therefore, the complete probability ranking was facilitated tucking (64%) > auditory intervention (36%) > control (63%) (Additional file 1: Appendix Table 7). The stability and credibility of the results were demonstrated in Additional file 1: Appendix Table 8 through the sensitivity analysis.

Fig. 6
figure 6

Probability ranking diagram of best interventions (oxygen saturation)

Table 3 The results of network meta-analysis (consistency model, oxygen saturation)

Heart rate

A total of 14 RCTs [32, 33, 38, 39, 41, 43, 47,48,49,50, 55, 56, 60, 63] were included in this meta-analysis by heart rate, involving 5 interventions. Olfactory stimulation (3 RCTs), OS + NNS (2 RCTs), facilitated tucking (2 RCTs), auditory intervention (6 RCTs), and tactile relief (3 RCTs) were included. A total of 6 nodes were included in this meta-analysis, with each node representing an intervention or control (Fig. 7). The nodes with more significant interactions were control (16 interactions), auditory intervention (8 interactions), olfactory stimulation (5 interactions), and facilitated tucking (5 interactions). The results of the consistency analysis was shown in Additional file 1: Appendix Table 10. The results of the heterogeneity test indicated a high degree of heterogeneity with an I2 value of 99.3%.

Fig. 7
figure 7

Network evidence diagram (heart rate)

Based on moderate-quality evidence (Additional file 1: Appendix Table 13), OS + NNS had the greatest SUCRA score, followed by control group, olfactory stimulation, auditory intervention, facilitated tucking, and tactile relief (Fig. 8 and Additional file 1: Appendix Table 11). However, this study found that there were no statistical differences between the interventions based on the data in Table 4. Therefore, the probability ranking analysis shown in Additional file 1: Appendix Table 11 was invalid. The stability and credibility of the results were demonstrated in Additional file 1: Appendix Table 12 through the sensitivity analysis.

Fig. 8
figure 8

Probability ranking diagram of best interventions (heart rate)

Table 4 The results of network meta-analysis (consistency model, heart rate)

Discussion

This study conducted a network meta-analysis of 35 RCTs consisting of 2134 preterm infants to compare the efficacy of different interventions for pain relief. The results showed that in addition to tactile relief, interventions such as OS + NNS, facilitated tucking, auditory intervention, olfactory stimulation, and mixed intervention were significantly more effective in reducing pain compared to the control group. Among these interventions, OS + NNS was relatively more effective, while the mixed intervention was relatively less effective. In addition to analyzing the effects of the interventions on pain scores, the study also investigated their effects on oxygen saturation and heart rate. The results showed that only facilitated tucking and auditory intervention had a statistically significant improvement in oxygen saturation compared to the control group. However, none of the interventions exerted a significant effect on heart rate.

Research studies have indicated that multiple acute pain procedures in NICUs can result in “a chronically painful state” for preterm infants [64, 65], which is detrimental to pain recovery, neurodevelopment, and psychosocial health [66]. Non-pharmacological interventions for managing pain have gained significant attention in recent years. However, the effectiveness of these interventions has not yet been clear and remains an area of ongoing research [66, 67]. Based on this, this meta-analysis compared their efficacy and found that OS + NNS, facilitated tucking and auditory intervention were relatively more effective. According to the results of network meta-analysis, this study evaluated the combined intervention of OS + NNS. However, the role of OS + NNS in reducing pain among preterm infants did not reach a consensus. A systematic review [68] investigating the impact of non-pharmacological analgesic interventions during heel prick showed that OS + NNS did not significantly reduce pain scores, oxygen saturation, and heart rate. The difference in results may be due to variations in concentration and dose of sucrose solution used, as has been observed in other studies [69,70,71,72]. Subsequent studies can compare varying concentrations and dosages of sucrose solutions to more comprehensively observe their impact on pain levels among preterm infants.

Overall, the role of facilitated tucking in reducing pain scores and improving oxygen saturation was more pronounced. A systematic review [73] of the effects of different body positions on procedural pain in NICUs indicated that facilitating tucking by parents for at least 30 min was optimal for alleviating pain and stabilizing physiological, hormonal, and behavioral responses of the newborns. Notably, this meta-analysis only focused on facilitated tucking and did not include other postural interventions. A recent review [74] found that 7 different modified positions have positive effects on sleep, flexion maintenance, and pain management in preterm infants. This indicates that facilitated tucking is not necessarily the best position to reduce pain, and the efficacy and safety of different positions should be analyzed according to their different physiological conditions.

The third ranked non-pharmacological intervention was auditory intervention, which involves exposing preterm infants to sound stimulation through audio playback of white noise, mother's voice, lullabies, and maternal heart sounds [29, 38, 47, 49, 50, 60, 63]. Recent studies [18, 19] have demonstrated that this intervention has a positive impact on reducing pain levels, increasing comfort, improving physiological indicators, and promoting feeding outcomes among preterm infants. However, a systematic review [75] of 39 studies further differentiated auditory intervention, categorizing music-based interventions into music medicine and music therapy. Music medicine involves using recorded audio to stimulate preterm infants, while music therapy involves the use of live music interventions that are clinically and evidence-based and guided by a therapist. The results indicated that music medicine interventions were linked to pain relief, while music therapy had a beneficial impact on cardiac and respiratory function, as well as weight and eating behaviors. This indicates that the effects of the two auditory interventions are different, and subsequent studies can be discussed in this respect.

While traditional meta-analyses focus on comparing individual or the same category of interventions, this meta-analysis underwent a detailed search to systematically integrate published articles on pain management in preterm infants in English, and used a network meta-analysis to compare the relationship and efficacy between six non-pharmacological interventions. This study is closely relevant to clinical practice and may help medical professionals to adopt more effective interventions to reduce the pain and stress suffered from preterm infants during repeated painful procedures. In addition, this study recommends that clinical practitioners adopt a systematic, specialized, and multidisciplinary model for managing pain in preterm infants, paying attention to the combined effects of non-pharmacological interventions and their possible shortcomings in implementation.

Although this meta-analysis has provided some insights, it is crucial to recognize the limitations of this study. One of the major limitations was the high level of heterogeneity, which may be related to the differences in non-pharmacological interventions, the setting in which preterm infants were treated, variations in painful procedures, and the sample sizes used in the studies. Another was the publication bias suggested by funnel plots (Additional file 1: Appendix Figs. 1, 2 and 3), which may be relevant to the inclusion of several studies with poor study design or small scale in this meta-analysis. To increase the validity of these findings, it is necessary to conduct more high-quality studies.

Conclusion

In conclusion, our study found that non-pharmacological interventions had different levels of efficacy in reducing pain scores and improving oxygen saturation, but no impact on heart rate was observed. Specifically, OS + NNS was found to be the relatively more effective intervention in reducing pain for preterm infants, while facilitated tucking was the relatively more effective in improving oxygen saturation. In addition, we hope for the development of more non-pharmacological interventions to ease the pain experienced by preterm infants during painful procedures, and we also aspire that our study can offer some support to clinical practices.