Background

As an increasing number of deaths occur in the intensive care unit (ICU), studies have sought to describe, understand, and improve end-of-life care in this setting [1,2,3,4,5]. Most of these studies are centered on the patient’s and/or the relatives’ experience and have permitted to highlight elements of care that need improvement. Fewer studies have evaluated the healthcare professionals’ experience and perception of dying and death. Whether in the hospital in general [6] or in the ICU in particular [4, 7,8,9,10], these studies show a discrepancy between physicians’ and nurses’ experience and perception of end-of-life care—the latter generally expressing less satisfaction concerning pain control and quality of discussions and decision-making.

Among end-of-life research projects, tools have been developed to evaluate the quality of dying and death in the ICU, such as the Quality Of Dying and Death (QODD) instrument [11]. Quality of dying and death is defined as “the degree to which a person’s preferences for dying and the moment of death agree with observations of how the person actually died as reported by others” [12]. The quality of dying and death is subjectively determined with multiple factors that influence its evaluation, including culture, type and stage of disease, social and, importantly, professional role in the dying experience. A study comparing QODD ratings by relatives and by healthcare professionals [9] shows that relatives and attending physicians give the most favorable ratings of death, while nurses and residents provide less favorable ratings. Significant differences between these groups are notable on items related to patient autonomy. However, this tool was designed and validated in the USA, where hospital and end-of-life culture, and physicians’, nurses’, relatives’, and patients’ roles are different than in Europe [4, 13].

Thus, the French CAESAR project aimed to develop an instrument to assess the experience of relatives of patients who died in the ICU as well as the experience of healthcare professionals, physicians, and nurses. The relatives’ instrument was validated in a previous publication and presents valuable psychometric properties [5]. This 15-item questionnaire covered the patient’s preferences and values, interactions with/around the patient, and family satisfaction. Relatives’ lower scores were associated with greater risks of anxiety and depression at 3 months; post-traumatic stress-related symptoms at 3, 6, and 12 months; and complicated grief at 6 and 12 months.

Here, our objective was to develop and validate two tools specifically designed to assess the overall experience of physicians and nurses caring for patients who died in the ICU. We used a mixed methodology to create the instrument that was then validated in a multicenter prospective study in 41 French ICUs.

Participants and methods

The study was approved by the institutional review board of the Paris-North Hospitals (IRB00006477, approval#11019), Paris 7 University. Informed consent was obtained from each relative before study inclusion.

Physicians and nurses were invited to participate and to complete the CAESAR tool on a voluntary basis. They received information about the study, and those who did not wish to participate could decline. No written consent was required.

Before inclusion of patients, in each participating center, the local investigator, together with the head nurse, completed a description of the ICU.

Development and validation of the CAESAR-P and CAESAR-N instruments

Development of the CAESAR instrument was described in a previous publication [5]. A 33-item questionnaire for physicians and nurses was validated in a multicenter prospective study conducted from July 2011 to July 2013 in 41 French ICUs belonging to the FAMIREA network. The 33 items fell into three domains: the patient (preparation for death, whole person concerns, symptoms, personal care, and treatment preferences), interactions with and around the patient (quality of communication between ICU team and the patient and between the ICU team and the relatives, particularly whether conflicts arose), and family needs and satisfaction.

In each ICU, the intensivist included consecutive adults who died after at least 48 h in the ICU and for whom the relatives had visited at least once. For each patient, the physician and the nurse in charge of the patient were asked to complete the instrument within 24 h after the death. At the end of the instrument, participants were also asked to complete information such as age, sex, years of experience in the ICU, and religious beliefs. As described elsewhere, relatives completed the tool 21 days after the patient’s death [5].

After full data completion, 7 investigators (4 physicians, 1 psychologist, 1 sociologist, and 1 biostatistician) for the physicians’ instrument (CAESAR-P), and 6 investigators (2 nurses, 1 physician, 1 psychologist, 1 sociologist and 1 biostatistician) for the nurses’ instrument (CAESAR-N), worked to reduce the instrument from 33 to 15 items, based on clinical relevance and item distribution, discrimination, and redundancy [5]. The remaining 15 items (Table 1) were analyzed to explore the psychometric characteristics of the instrument, among physicians and among nurses separately. The psychometric validation was conducted using two sets of instruments: those completed by healthcare professionals for patients included in the CAESAR study [5] as the learning cohort (derivation set), and those completed by healthcare professionals for patients who had the same inclusion criteria, but who were not included in the CAESAR study, as the reliability cohort (validation set). These data were recorded at the beginning of the study period before inclusion started in the CAESAR study. Briefly, factorial validity was assessed in the derivation set and validation set separately by determining the dimensional structure of the final 15-item CAESAR questionnaire. To this end, we used maximum likelihood factor analysis with varimax rotation. The number of factors was determined from the observation of the scree plots and value of simulations [14,15,16]. Internal consistency was deemed acceptable when Cronbach’s alpha [17] was in the 0.70–0.95 range [18]. For the final 15-item tool, the item scores (range 1–5) were summed to obtain a global score (15–75). Each item included a written description and a score on a 5-point scale: 1, traumatic; 2, painful; 3, difficult; 4, acceptable; and 5, comforting. As a result, if a clinician rates all questions at 1, his/her total score will be 15, reflecting an overall traumatic experience. If a clinician rates all questions at 5, his/her total score will be 75, reflecting a very positive experience. Based on these calculations, scores between 60 and 75 reflect an overall positive experience; scores lower than 45 reflect an overall negative experience; scores between 45 and 60 are considered intermediary, reflecting that some elements of the end-of-life process were experienced negatively.

Table 1 Physician and nurse CAESAR scores

We also collected ICU, healthcare professionals’, patients’, and end-of-life characteristics.

Job strain evaluation

Physicians’ and nurses’ job strain was evaluated using an instrument that explores three domains: job demand (3 questions), control (5 questions), and social support (4 questions) [19,20,21]. High job demand, low decision control, and poor social support were graded as high for a score at 2–3, at 3–5, and at 3–4, respectively. Global job strain score was considered high (i.e., job strain positive) if there were 2 or 3 positive items among high job demand, poor decision control, and poor social support.

Statistical analysis

Statistical analyses were performed using the R 3.1 package with the “psy” package [22].

Quantitative data were described as median (25e–75e percentiles, i.e., interquartile range); comparisons between groups were tested using ANOVA test. Binary and categorical data were described as number and percentages. No imputation for missing data was performed. To assess the number of dimension identified in the survey, a principal component analysis and the corresponding scree plot were used. The cutoff was determined using a set of simulations that illustrated the amount of variance that may be expected by chance alone [14]. The number of factors over the simulations may be of interest. The internal consistency was assessed with the Cronbach alpha (95% bootstrap confidence interval), split-half, and composite reliability. The association between job strain results and CAESAR-P and CAESAR-N scores was assessed using ANOVA test, according to the 3 dimensions. The center effect on CAESAR-N and CAESAR-P scores was assessed using a mixed regression model. All statistical tests were two-sided, and p values of 0.05 or less were considered significant.

Results

Study population

Patients’, ICUs’, and management of end-of-life characteristics are summarized in Supplemental Table 1. Physician and nurses’ characteristics are summarized in Supplemental Table 2. Among the 4607 patients admitted to the 41 participating ICUs during the study period, 875 (19%) died, including 228 who met exclusion criteria, 104 for whom the opportunity for inclusion was missed, and 68 whose relatives refused participation [5]. For each of the remaining 475 (54%) patients, one relative was included. Within 24 h of the death, the physician and the nurse in charge of the patient completed the 33-item instrument as well as the job strain (completed only once during the study period).

CAESAR scores

Among the 475 patients included, 398 nurse scores and 417 physician scores were analyzed.

The response rates were 441/475 (92.8%) for nurses and 446/475 (93.9%) for physicians. Among these, respectively 398/441 (90.2%) and 417/446 (93.5%) were fully completed, allowing score calculation and analysis.

The median global CAESAR score was 62/75 (59; 66) for nurses and 64/75 (61; 68) for physicians; the proportion of surveys with scores higher than 60/75 were 248/398 (62.3%) for nurses and 313/417 (75.1%) for physicians. There was no center effect for nurses (p 0.28), but a center effect was detected for physicians’ scores (p < 0.001).

Supplemental Tables 3 and 4 depict the distribution of individual item scores for physicians and nurses, respectively. Figure 1 a and b depict the distribution of global scores for physicians and nurses, respectively. The factorial analysis of the main sample, assessed by scree plots, was consistent with a single dimension for both questionnaires, in the learning cohort and in the reliability cohort (Fig. 2a, b). Internal consistency was acceptable for both scales (Supplemental Tables 5 and 6).

Fig. 1
figure 1

Distribution of the global CAESAR-P and CAESAR-N scores. Histograms: dark gray, score < 45; gray, score 45–60; light gray, score > 60

Fig. 2
figure 2

Psychometric validation of the CAESAR-P and CAESAR-N. The factorial analyses and their corresponding scree plots are presented in a and b. In both cohorts and in both questionnaires, the first factor on its own explained a substantial amount of the item variance and no other factor appears of comparable importance, compared to the value of simulations. The scale is thus fairly homogeneous, if not strictly unidimensional

Factors associated with the CAESAR-P and the CAESAR-N scores

Physicians’ CAESAR-P score (Table 2) was higher when there was a room dedicated to family information, when the information was delivered together by nurse and physician, when families were systematically informed of the EOL decision rather than occasionally, when implementation of the EOL decision involved the nurse and physician rather than the physician only, and when the ICU was open 24 h/24 h. The physicians’ score was also higher when the patient had a McCabe score [23] at 1 rather than higher (i.e., no fatal expectancy); was not immunocompromised; had no dementia, liver failure, nor hematological disease; and did not require psychotropic medication. The score was higher when a decision to withdraw or withhold treatment was made, when no cardiopulmonary resuscitation was performed, and when the death was disclosed to the family at the bedside rather than by phone or upon arrival at the ICU.

Table 2 Factors associated with the nurses’ and physicians’ CAESAR scores

The nurse CAESAR-N score (Table 2) was significantly higher in the absence of conflict with physicians, when pain control was handled by physicians rather than by nurses alone, when the death was disclosed to the family at the bedside rather than by phone or upon arrival at the ICU, and when invasive care such as surgery, chest tube, or bronchoscopy was not performed.

Job strain

Job strain was evaluated among 231 physicians and 379 nurses. Ten percent of physicians had a positive job strain. Among the 3 domains (job demand, decision control, and social support), the most frequent complaint for physicians was a high job demand (38.2%), but there was no association between the CAESAR score and job strain for physicians. Twenty-seven percent of nurses had a positive job strain. The most frequent complaint for nurses was a low decision control (54.2%) and a high job demand (44.5%). Low decision control was associated with lower CAESAR score for nurses (61 (58; 65) versus 63 (60; 67), p = 0.002).

Discussion

This multicenter study allowed us to develop and validate two 15-item CAESAR questionnaires (CAESAR-P and CAESAR-N) designed to measure self-reported experience of caregivers about patients’ end-of-life in the ICU, as we did previously for relatives [5].

Both tools (for nurses and for physicians) showed good internal consistency and a single dimension. The global score was high for both nurses [62/75 (59; 66)] and physicians [64/75 (61; 68)], reflecting global satisfaction regarding dying and death in the ICU, in keeping with previous studies [4, 24]. The nurses’ and the physicians’ results cannot however be strictly compared, as 2 questions differ between the two instruments.

Previous studies aiming at evaluating quality of dying and death [4, 9, 25] used the QODD, which was designed and validated in the USA, where hospital and end-of-life culture, and physicians’, nurses’, relatives’, and patients’ roles are different than in Europe. However, this tool has recently been used in Europe: in a study by Gerritsen et al. [25], the single-item QODD summary score was significantly higher for nurses in the Netherlands than in the USA, probably due to organizational (presence of a physician in the ICU and more often at the bedside in the Netherlands) and cultural differences.

In France, nurses’ perception of dying and death was evaluated over 10 years ago in hospitals [6] and in ICUs [7], showing poor ratings of quality of death and of end-of-life decisions. However, French legislation changed in 2005 and 2016, allowing withdrawing and withholding treatment, as well as palliative sedation, which has positively modified end-of-life culture in ICUs: our results are consistent with these improvements. In a more recent multicenter French study [24], nurses rated the end-of-life of their patients under mechanical ventilation at 8 on a scale from 1 (worst) to 10 (best), concordant with our findings. Research thus shows high rating of dying in the ICU, whether in Europe or the USA.

In our current study, nurses’ experience of the patient’s end-of-life in the ICU was worse in case of conflict with physicians. In a previous European study about ICU conflicts [20], end-of-life care was one of the main reported sources of conflict. In these situations, the principal sources of conflict were lack of psychological support, absence of unit-level meetings, and problems with the decision-making process. Two factors were associated with less conflict, i.e., symptom control performed jointly by physicians and nurses, and routine unit-level meetings. In our study, these two factors were also associated with a better experience of end-of-life for nurses (pain control handled by nurses rather than by physicians only, relatives’ information delivered together by nurse and physician) and for physicians (implementation of the end-of-life decision involving the nurse rather than the physician alone). Indeed high-quality end-of-life care requires good inter-professional collaboration and communication [26]. The job strain evaluation shows that for nurses, low decision control was associated with lower CAESAR score, highlighting the importance of valuing nurses’ involvement in decision-making processes. Physicians’ job strain was not associated with CAESAR score.

For both nurses and physicians, a better experience of end-of-life was associated with an absence of invasive care (considered aggressive in this setting): for nurses, absence of invasive care such as surgery, chest tube, or bronchoscopy; for physicians, decision to withhold or withdraw treatment, and absence of cardiopulmonary resuscitation—factors concordant with other study results [8, 9]. Absence of overaggressive treatment at the end-of-life can also be interpreted as a sign of good inter-professional communication as well as good communication with patients and family members [26, 27]. Early integration of palliative care that focuses on reducing suffering among patients with serious illness and their family members is recommended: in a study from the USA, nurse-assessed quality of dying was significantly improved with an intervention to integrate palliative care in the ICU [28].

Finally, good communication with family members and their presence in the ICU were associated with a better experience of patients’ end-of-life for nurses and physicians (communication at the bedside rather by phone, presence of a room dedicated to family information, systematic information of decision rather than occasional, open visiting hours) and are key elements in end-of-life care, as shown in previous studies [8, 25, 29]. Families are no longer simple visitors in the ICU: they play important roles and should now be considered by the ICU team as active partners, including in end-of-life situations [30].

This study has several limitations. First, all participating ICUs were in France and the findings may not be pertinent in different cultural settings. However, the large number of participating ICUs and clinicians, the very high response rate, and the validation of the results in a reliability cohort support the robustness of the data. Second, authors are aware of the potential issues with using an older dataset, specifically the potential changes of EOL practices across the world. However, use of the tool will permit to compare and describe changes across time and countries. Third, the global score was high in both groups of clinicians and the score differences were low and may not be meaningful to clinicians. However, this is true for other end-of-life tools used in Europe, such as the QODD [25]. Lastly, the description of the score on a 5-point scale (1, traumatic; 2, painful; 3, difficult; 4, acceptable; 5, comforting) raises a challenge in terms of measuring subjective responses. While pilot testing did not yield any difficulties between scoring on the scale, this is something to consider in future translations of this tool.

Conclusion

In summary, we described and validated two new instruments for assessing nurses’ and physicians’ experience of end-of-life in the ICU. Our study shows factors associated with a better experience that include quality communication, both with family members and inter-professional communication and collaboration; family presence in the ICU; and avoidance of aggressive care. These results will help design future interventional studies aimed at improving end-of-life care in the ICU.