Introduction

Pain has been designated by the World Health Organization(WHO) as the fifth vital sign [1]. Neonates exhibit central sensitization to repeated painful stimuli, which can adversely affect brain microstructure, lead to the production of stress hormones, impair cognitive and motor development, and contribute to behavioral issues, metabolic disorders, and cardiovascular stress [2]. Advances in neonatal care have improved the survival rates of premature and ill neonates. However, this progress has led to an increased number of invasive procedures that may induce pain in these vulnerable infants [3, 4]. Neonates exhibit a lower pain threshold (30 to 50% lower than that of adults) and reduced pain tolerance compared to older children, their experience of pain is often more intense and enduring [5, 6]. Optimal neonatal pain management requires valid non-verbal pain assessment, yet clinical methodology to assess neonatal pain is rudimentary [7]. Due to limitations in neonatal language expression, it fully depends on nurses to assess their pain through observation currently. The challenge for consistency increases when pain measures are subjective in nature [8].

Currently, there is no universally accepted “gold standard” for assessing neonatal pain, most clinical assessments rely on neonatal pain assessment scales. Over 40 such scales have been published [9], with distinct variations among which factors are included in the assessment. There are more than 100 evaluation indicators with varying descriptors utilized in the scales, lacking consensus and standardized metrics. General limitations with these scales have meant that clinicians and researchers have adapted and tweaked them without the necessary validation steps to make them more appropriate for use in their specific settings. This inconsistency makes it difficult to combine and synthesize evidence about how much pain infants are experiencing and whether interventions to prevent or alleviate pain are efficacious [10]. Simultaneously, it is decreasing the rate of neonatal pain assessment, posing challenges to ensure its accuracy. Globally, the rate of pain assessment in neonatal units ranges from a mere 6 to 50.3% [11,12,13]. Accurate pain assessment is crucial to ensuring the effectiveness and safety of pain management therapies in neonates during their stay in the Neonatal Intensive Care Unit (NICU). However, accurately measuring neonatal pain remains the greatest challenge in neonatal pain management [10, 14].

Most scales are too complicated and require live measurements and calculations (e.g., the Premature Infant Pain Profile scale requires the operator to calculate the percentage of decrease in SaO2 and of increase in heart rate in the first 30 s after the pain onset) that cannot be done in real-time and in particular by a single operator who is simultaneously performing the potentially painful procedure [15]. The clinical environment is complex, the passive state of newborns during hospitalization because of treatment methods or constraints may limit the sensitivity of certain behavioral or physiological indicators. Single-dimensional indicators (such as facial expressions, body movements, etc.) lack sufficient specificity to enable accurate pain measurement by nurses. Practicality and clinical utility are also significant when choosing an assessment tool for research or clinical use [8]. Moreover, pain responses exhibit individual differences, with demographic contextual factors contributing to explaining it. For example, preterm infants may lack the motor skills to express pain compared to full-term infants [16], and there are significant negative correlations between infants’ birth weight and parameters of pain severity [17]. But most of the scales ignore it.

In light of these challenges, this study conducted a systematic literature review to gather not only neonatal pain response influencing factors but also integrated multiple assessment items from various pain scales. Through panel meetings and the Delphi study, we selected the most representative and clinically practical indicators to develop a multidimensional neonatal pain response variable set, which aligns with the clinical characteristics of neonatal pain and ensures ease of pain observation and assessment. It is anticipated that this will facilitate future research on standardized tools and procedures for the clinical assessment of neonatal pain.

Method

This study conducted a three-stage mixed-methods study to develop a set of neonatal pain response variables. Commencing with a literature review to extract influencing factors and assessment indicators of neonatal pain response. Subsequently, a panel meeting was conducted to screen and modify the gathered information, resulting in the initial draft of the neonatal pain response variable set. Finally, two rounds of the Delphi study were carried out to achieve consensus on the neonatal pain response variables.

Literature review

A comprehensive systematic search up to December 2021 was conducted across WANFANG, CNKI, CBM, PubMed, Embase, Web of Science, and Cochrane. The search utilized combinations of terms/keywords such as “infant,” “newborn,” “child,” “neonate,” “baby,” “pain response,” “pain reaction,” and “factor,” considering both original research articles, guidelines, and consensus as study types. All records and data were managed using EndNote X9. Duplicate literature was eliminated, followed by screening of titles and abstracts followed by a full-text screening for eligibility was performed by 2 researchers independently, with any disagreements resolved by a third researcher or through group discussion. I dentifying relevant influencing factors and pain assessment indicators in neonatal pain research, ensuring that variables can best capture essential pain-related information.

Panel meeting

To ensure both the clinical applicability and scientific validity of each variable assessment item, and to align with the clinical characteristics of neonatal pain responses, healthcare professionals specializing in neonatology, pediatric thoracic surgery, and pediatric internal medicine were recruited for an in-person panel meeting. Pertinent information regarding the meeting’s agenda was sent one week in advance. During the meeting, panelists provided opinions and feedback on the neonatal pain response variables and descriptors extracted from the literature review. It is advisable to have a panel of experts ranging from 10 to 15 members to ensure the quality of the meeting [18]. Post-meeting, the initial draft of the neonatal pain response variable set was generated for the subsequent Delphi study.

Delphi study

Invited experts specializing in the field of pediatrics to participate in the consultation to gather and synthesize expert opinions and achieve consensus on the neonatal pain response variables, determine the final draft of neonatal pain response variables, and establish a variable set. Guided by Grime et al.’s suggestion of involving 5–20 participants [19], we invited 16 experts with a minimum of 10 years of experience in the field of pediatrics. All hold bachelor’s degrees or higher and possess senior academic titles of associate professor or above, ensuring broad professional representation.

The survey was conducted either face-to-face or through the online platform www.wjx.cn, with responses collected within one week of questionnaire distribution. The first round of questionnaires was designed based on the initial draft of neonatal pain response variables. The questionnaire comprised four main sections: (1) Introduction to the research content and significance of the topic; (2) General information about the experts; (3) Evaluation of pain response variables; and (4) Experts’ self-assessment of familiarity and judgment criteria for the consultation content. Expert panelists used a 5-point Likert scale (1 = strong disagreement, 3 = neither agreement nor disagreement, 5 = strong agreement) to rate the importance of each variable. An expert opinion column was provided for each variable, along with a blank column at the end of the consultation content for experts to offer additional opinions. Variables with an importance rating > 3.5 and a coefficient of variation ≤ 0.25 were considered for inclusion, following the criteria outlined by Hasson et al. [20]. The second round of questionnaires was created after modifying variable content based on expert opinions. Subsequent consultation rounds were conducted until a consensus was reached.

Data analysis was carried out using Excel 2016 and SPSS 29.0: (1) General information about experts: Descriptive statistics included mean and standard deviation for measurement data and frequency and percentage for count data. (2) Expert active coefficients: effective survey response rate was used. (3) Expert authority coefficient: expressed through the authority coefficient (Cr), derived as the average of the judgment coefficient (Ca) and the familiarity coefficient (Cs). Ca represented the basis for expert judgments on consultation content, while Cs reflected experts’ self-assessment of familiarity with the content. (4) Expert opinion consensus level: assessed using the coefficient of variation (CV) and Kendall's W coefficient.

Result

Literature review

The literature search yielded a total of 706 results, 10 articles and 18 guidelines met the inclusion criteria and underwent data extraction. Figure 1 illustrates the systematic flow of the literature search. From the 10 identified articles, 10 influencing factors of neonatal pain response were included. Among the 18 included guidelines, a total of 131 pain assessment tools were recommended. After removing duplicates, tools not applicable to neonates, and those not validated in neonatal populations, we ultimately selected 21 tools, from which we extracted a total of 123 assessment indicators. Through discussions and synthesis of similar indicators, three types of variables were classified: contextual, physiological, and behavioral, totaling 26 indicators. The extracted results of variables are presented in Table 1. The included articles, guidelines, and the 21 neonatal pain assessment scales recommended by the guidelines are provided in Appendix 1.

Fig. 1
figure 1

Flow diagram of the systematic literature search

Table 1 The extracted results of influencing factors and assessment indicators

Panel meeting

A total of 12 experts were included in this expert panel meeting. The experts' clinical areas of expertise encompassed neonatology, pediatric general thoracic surgery, and pediatric internal medicine, comprising both nurses and physicians. Among them, 8 held associate professor titles, and 4 held full professor titles. In terms of educational background, there was 1 expert with a bachelor’s degree, 9 with master’s degrees, and 2 with doctoral degrees. Their years of clinical experience ranged from 10 to 31 years.

In influencing factors, the factors “Sleep/Wake state” and “Gestational age (GA)” were eliminated due to repeated assessment indicators. The experts emphasized that assessment indicators and their descriptors should align with clinical reality, ensuring their broad applicability. Therefore, in assessment indicators, “Blood pressure changes”, “Alertness”, “Muscle tone” and “Body posture” were recommended for deletion. Similarly, some descriptors that posed challenges in a clinical setting were either removed or modified. Furthermore, experts recommended comparing “changes” to baseline values and using specific value change ranges for a more intuitive assessment. For instance, the description of “heart rate changes” was revised to indicate the frequency of increases or decreases compared to the baseline. Following summarizing the experts’ opinions, we further discussed and revised the relevant content, developing the first draft of the neonatal pain response variable set. The details of experts’ opinions and the first draft of the neonatal pain response variable set are presented in Appendix 2.

Delphi study

Sixteen experts, consisting of nurses and physicians, from two general hospitals in Guizhou and Guangdong provinces of China, were invited from neonatology, neonatal intensive care units (NICUs), pediatric internal medicine, and pediatric general thoracic surgery departments. All these experts had over 10 years of clinical experience in pediatric-related fields. Among them, 3 had undergraduate degrees, 10 had master’s degrees, and 3 had doctoral degrees. Participants in both rounds of the survey remained consistent, and the effective survey response rate exceeded 70%, indicating high expert interest and attention. The authority coefficient (Cr), calculated as (Ca + Cs)/2, was 0.875 in the first round and 0.885 in the second round, both exceeding 0.7, signifying high expert authority and reliable consultation results.

In the first round, variables were screened using the criteria of importance assignment mean > 3.5 and coefficient of variation ≤ 0.25. Thirteen variables, including “nurse’s perception”, “requires O2 for Sat > 95”, and “SaO2 changes”, were deleted. “Duration of crying” was recommended to be retained with modified descriptors, specifying the duration as the time elapsed from the onset of crying to its cessation following pain stimulation. The details of experts’ opinions are presented in Appendix 2.

In the second round, responses, modifications, and explanations based on first-round comments were presented. Experts reached a consensus on all variables except “duration of crying”, which was suggested to be deleted due to potential interference factors affecting newborns’ crying times during procedures. The results of the two rounds of the Delphi study are shown in Tables 2 and 3.

Table 2 The survey results in round 1
Table 3 The survey results in round 2

Following two rounds of the Delphi study and extensive expert discussions, a consensus was reached, resulting in the establishment of a neonatal pain response variable set. This set included 9 contextual variables, 2 physiological variables, and 5 behavioral variables, as presented in Table 4.

Table 4 The neonatal pain response variable set

Discussion

In this study, we focus on the characteristics of neonatal pain response, through a comprehensive data collection and meticulous screening process, involving a literature review, panel meetings, and a Delphi study, we aim to develop a neonatal pain response variable set, which serves as a crucial theoretical foundation for future research in the field of neonatal pain.

In its revised definition of pain in 2020, the International Association for the Study of Pain (IASP) emphasized the significance of respecting a person’s report of pain as a valid experience, the inability to communicate does not negate the possibility that a human or a non-human animal experiences pain [21]. Due to limited verbal communication, the primary mode of assessing neonatal pain responses is through observation. Nevertheless, the absence of a “gold standard” results in not all assessment indicators from existing scales being suitable for clinical practice, and the influence of situational factors, such as demographic contextual factors, on neonatal pain responses is frequently disregarded.

From a neurological standpoint, no individual behavioral or physiological component is adequate in alerting to the presence of pain responses that occur on a cortical level [22]. Our constructed variable set aligns with recommendations from the previous studies and guidelines [23, 24], emphasizing the importance of a multidimensional assessment approach that considers both behavioral and physiological responses. It aims to overcome the limitations of single-dimensional assessments, which often fail to adapt to the complexity of clinical environments and capture the sometimes-subtle pain behaviors exhibited by neonates. Furthermore, our variable set also incorporates contextual variables to address individual differences in neonatal pain, providing a comprehensive assessment framework composed of three key dimensions: behavioral, physiological, and contextual.

To enhance discrimination, we selected descriptors with improved discriminatory capabilities. Upon analyzing and comparing each variable’s descriptors, we considered the appropriateness of their scope and the clarity of their grade divisions. In the behavioral variables, the assessment of facial expressions expanded to include five dimensions: “Facial muscles fully relaxed, relaxed open mouth”, “Normal facial tension”, “Intermittent eye squeeze and brow furrow”, “Continuous eye squeeze and brow furrow”, and “Facial muscles contorted and grimacing”. These descriptions offer greater clarity and understanding compared to existing assessment scales, with well-defined boundaries closely aligning with neonatal pain-related facial expressions. For descriptors related to “crying” and “breathing pattern”, we used the contents of the NIPS scale, incorporating dimensions like “no cry”, “whimper”, and “vigorous crying” for crying assessment, and “relaxed” and “change in breathing” for breathing patterns. While some scales previously mentioned assessing “extremity movement” via finger and toe movements, we recognized the impracticality of these observations due to medical interventions or protective measures. Therefore, we opted to assess the frequency of arm and/or leg movements, categorizing them as “no movement”, “up to three arm and/or leg movements”, and “more than three arm and/or leg movements”, which enhances assessment applicability, efficiency, and accuracy.

Concerning physiological variables, we defined “SpO2” in terms of its deviation from baseline values, categorizing it as a decrease in Oxygen Saturation: “0 ~ 2”, “3 ~ 5”, “6 ~ 8”, or “ > 8”, or indicating an “increased oxygen requirement”. This definition deviates from the BPSN scale’s “decrease ≤ 1.9%”, “decrease 2 ~ 2.9%”, “decrease 3 ~ 4.9%”, and “decrease ≥ 5%”. Experts noted that a broader fluctuation range better reflects actual SpO2 changes in clinical newborns. Furthermore, our clinical observations revealed that SpO2 levels do not necessarily decrease during painful episodes, thus our descriptors extend beyond mere increases or decreases. For assessing “heart rate”, we employed a straightforward definition based on specific numerical changes compared to baseline values, categorizing them as “0 ~ 5 times”, “6 ~ 20 times”, or “ > 20 times”. It offers greater intuitiveness and practicality compared to percentage-based calculations used in some assessment tools.

The limitations of this study encompass the included literature’s language restriction to 2 languages, did not explicitly consider the potential influence of regional factors on neonatal pain response, and the articles primarily focused on hospitalized newborns, potentially overlooking research on healthy newborns, which could affect the comprehensiveness of included literature and the factors related to neonatal pain extracted in this study. Our next objective is to construct a neonatal pain response model, that serves as a valuable tool for healthcare practitioners. This model aims to empower healthcare professionals in assessing the significance of each pain-related variable.

Conclusion

In this study, we developed a neonatal pain response variable set through a literature review, panel meeting, and Delphi study. This set includes three dimensions: contextual variables, physiological variables, and behavioral variables, totaling sixteen variables, establishing a structured framework for neonatal pain assessment and management.