Background

Haematopoietic stem cell transplantation (HSCT) is a standard treatment for malignant and non-malignant conditions that lead to immune system failure, such as haematologic malignancies, solid tumours and autoimmune diseases [1]. Chemotherapy and/or radiotherapy-based conditioning regimens are administered prior to stem cell infusion to eradicate the underlying disease, to create space for engraftment and to provide immunosuppression [2]. Patients undergoing autologous HSCT receive their own stem cells, while those undergoing allogeneic HSCT receive stem cells from related or unrelated donors. Stem cells may be harvested from bone marrow, peripheral blood or cord blood [3].

Oral mucositis (OM), a common toxic effect of the regimen drugs used, occurs in 70–99% of HSCT patients [4, 5], and the incidence of its severe forms range from 20 to over 75%, depending on the intensity of the conditioning regimen and on the patient’s predisposing factors [6, 7]. OM results from the inflammation of the oral mucosal barrier and is accompanied by various signs and symptoms, such as erythema, ulcers, difficulty eating and/or drinking and pain [8]. Severe cases of OM are associated with extreme discomfort and may affect patients’ quality of life (QoL) and transplant-related outcomes, including infection risk and procedure-related costs [9,10,11].

There is limited evidence in the literature on OM prevention, and treatment is frequently symptomatic and largely based on anecdotal evidence [12, 13]. Recent evidence is available regarding the topical application of natural products, including honey, aloe vera (AV), bovine colostrum (BC) and others [14, 15]. These composite agents contain a wide variety of biologically active substances such as lactoferrin, lactoperoxidase, growth factors, immunoglobulins, cytokines, iron, folic acid, electrolytes and vitamins, which may interfere with the pathobiological pathway underlying OM [16,17,18]. The beneficial effects of these agents on mucosal healing have already been described as mainly attributable to their immune-modulatory, anti-inflammatory and antibacterial activity [14, 15, 19]. In addition, their topical formulas provide emollient, moisturising and hydrating effects.

The impact of OM on clinical outcomes is generally underestimated, although patients often cite OM as one of the worst side effects of their treatment [20]. For this reason, collecting patient-reported outcome measures (PROMs) as part of the patient’s clinical assessment is recommended in daily practice [21]. Increased consideration of patients’ perspectives during anticancer treatments has allowed healthcare professionals (HCPs) to improve the quality of supportive care in oncology, and PROMs are commonly assessed in clinical trials [22]. It has been shown that patient-reported assessment tools make it possible to detect symptom onset, peak and resolution earlier than HCPs’ objective evaluation [23, 24].

Knowledge of the impact of OM on patient QoL during HSCT is still limited due to the tendency to consider QoL as a secondary outcome in clinical trials which are designed to explore the toxicity of cancer treatments. Furthermore, QoL is affected by various factors during HSCT, and the assessment of its relationship with OM can be very complex [25]. The increase in the degree of OM has been associated with worse patient QoL; the adoption of strategies to prevent OM, such as low-level laser therapy (LLLT) and professional oral care, has led to improvements in QoL [11, 26].

A recent phase II study conducted by our group demonstrated that a BC–AV-based oral care protocol effectively and safely reduced the incidence of severe OM (sOM) in patients undergoing HSCT [27]. In that study, both PROMs and healthcare professional-reported outcomes (HCP-ROs) were collected to give a true picture of toxicity and its effect on QoL.

This paper reports a secondary analysis on patient-reported data with the aim to describe the variations of QoL perceived by HSCT patients during their hospital stay and to assess the severity of patient-reported symptoms and functional impairments. Significant differences between patient- and HCP-reported assessment strategies were evaluated.

Methods

Adult recipients undergoing autologous or allogeneic HSCT were recruited in a single-arm monocentric phase II study assessing the efficacy and safety of a BC–AV-based oral care protocol in addition to standard practice to prevent and treat conditioning regimen-related sOM. Standard practice provides oral and dental hygiene followed by bland saline solution rinses (normal saline or sodium bicarbonate) 3 times per day, increased to 5 per day after OM onset. Two commercially available products, both containing BC and AV, were added to standard practice at each time point. Remargin Colostrum OS® was administered as mouthwash, and Remargin Colostrum Gel® was administered orally to prevent and treat oesophageal mucositis.

The study was designed following the optimal two-stage design by Richard Simon [28]. A first stage of 19 participants was scheduled to evaluate safety, followed by a second stage of 40 patients to complete the planned sample The cutoff for study interruption after the first step was at least 5 participants with sOM.

The World Health Organization (WHO) scale and the Oral Mucositis Daily Questionnaire (OMDQ) [24, 29] were used daily to assess OM from conditioning initiation to day 21 post-transplant or discharge. The OMDQ provided data on mouth and throat soreness (MTS) using a 0–4 Likert scale, where 0 corresponds to “no soreness” and 4 to “extreme soreness”. MTS-activity limitation (MTS-AL) scores regarding swallowing, drinking, eating, talking and sleeping were collected using a 0–4 Likert scale, where 0 is “not limited” and 4 is “unable to do”; the overall-MTS score was assessed on a 0–10 scale, where 0 is “no soreness” and 10 is “worst possible soreness”. The OMDQ made it possible to also collect data on diarrhoea (not reported here). Data on swallowing and talking limitations were also collected by HCPs using the specific parts of the Tardieu scale [30] included in the hospital charts. Clinical and demographic details and data on time of OM onset and duration were collected. Pain was assessed daily by HCPs using 0–10 numerical rating scale.

The Euro Quality of life, 5 dimensions, 3-level tool (EQ-5D-3L) instrument [31] was used to assess QoL at admission (baseline), the day of transplant (T01) and at days + 7 (T07), + 14 (T14), + 21 (T21) and + 28 (T28) post-transplant.

The study protocol was approved by the local ethics committee (n. 2016/0030535, December 28, 2016), and the study was conducted in agreement with the Helsinki Declaration of 1975 and the Guidelines for Good Clinical Practice. All participants gave written informed consent before any study-related procedure took place.

Data are presented as numbers and percentages for categorical variables, and continuous data are expressed as the mean ± standard deviation (SD), unless otherwise specified.

Chi-squared test and Fisher’s exact test were performed to evaluate significant differences in proportions or percentages between the two groups. In particular, Fisher’s exact test was used where the chi-squared test was not appropriate. The test for normal distribution was performed by the Shapiro–Wilk test. The t test was used to test differences between two means for paired data. Alternative non-parametric tests, such as Wilcoxon test, were used when the data distribution was not normal. Finally, all tests with p value (p) < 0.05 were considered significant. Statistical analyses were performed using Matlab statistical toolbox version 2008 (MathWorks, Natick, MA, USA) for 32-bit Windows. The EuroQol Research Foundation user guide was followed to present EQ-5D-3L findings [32]. Three different analyses of the EQ-5D-3L data were done at each time point: the cohort’s health profile description was reported as frequencies and percentages of the 3 levels of perceived problems (no, mild, severe) on mobility, self-care, usual activity, pain/discomfort, anxiety/depression; central tendency measures were used to describe the participants’ overall self-rated health status assessed with the Euro quality of life visual analogue scale (EQ-VAS) on a 0–100 scale; the EQ-5D-3L index was calculated through a time trade-off method and is represented as mean and median values [33].

Results

Fifty-nine adult HSCT patients (32 [54.2%] males, 27 [45.8%] females) were recruited in the study between the end of 2017 and September 2019; the mean age was 52.4 years (SD ± 12.0; range 18–71). Forty-four of the patients were of working age (74.6%), six were smokers (10.2%), and none of them had alcohol abuse problems. The patients were affected by plasma-cellular disorders (29; 49.2%), acute leukaemia (9; 15.3%), lymphoma (19; 32.2%) or bone marrow failure (2; 3.4%). Forty-four (74.6%) were treated with autologous HSCT and 15 (25.4%) received allograft. Further details on patients’ clinical characteristics are shown in Table 1.

Table 1 Patient characteristics

Healthcare professional-reported outcomes (HCP-ROs)

As reported in our previous work [34], OM was observed in 46 (78.0%) participants; 40 (67.8%) developed mild OM and 6 (10.2%) sOM. Time of OM onset was on an average of 9.1 (SD ± 3.5) days after initiating conditioning regimen, while severe forms occurred on a mean of 11.2 (SD ± 2.9) days after initiation. Duration of OM was on average 10.4 (SD ± 4.3) days, while the duration of sOM was on average 5.2 (SD ± 3.5) days. The average maximum pain score (MPS) was 3.7 (SD ± 2.7).

Data recorded in the hospital charts showed that the ability to swallow was altered in 39 (66.1%) participants starting on an average of 11.4 (SD ± 3.8) days after starting chemotherapy; the mean duration of this problem was 6.2 (SD ± 3.2) days. Difficulty talking was recorded in the hospital chart of 27 (45.8%) participants; the mean time of onset was 12.1 (SD ± 3.8) days later, and duration was an average of 5.0 (SD ± 3.4) days.

Patient-reported outcome measures (PROMs)

OMDQ

Fifty-one recipients (86.4%) reported mouth and throat soreness (MTS): 22 (37.3%) patients reported mild MTS (score 1–2), while 29 (49.1%) reported severe MTS (score 3–4). Oral symptoms occurred on average 7.9 (SD ± 4.0) days after the initiation of conditioning, while severe MTS began on average 11.9 (SD ± 3.8) days after. Symptom duration was an average of 10.5 (SD ± 4.6) days, while severe MTS duration was an average of 4.4 (SD ± 2.5) days. Swallowing was altered in 50 (84.7%) patients; drinking limitations were present in 45 (76.3%) patients, and 46 (78.0%) reported eating problems. Oral condition-related difficulty talking and sleep disturbances were reported by 32 (54.2%) and 21 (35.6%) participants, respectively. Maximum overall-MTS was on average 6.1 (SD ± 2.3).

Participants reported that swallowing issues began on average 8.8 (SD ± 4.3) days from the start of conditioning, and its mean duration was 9.3 (SD ± 4.4) days. Difficulty talking started an average of 9.0 (SD ± 4.0) days after the initiation of conditioning and lasted an average of 7.4 (SD ± 6.3) days.

Comparisons

Comparing the patient-reported outcomes with those of the HCPs (Table 2), no significant difference was found in the number of patients developing oral problems (p = 0.23); OMDQ allowed detection of oral symptom onset sooner compared to the WHO scale (p = 0.025), although no difference was found for the duration of oral problems. The overall highest patient-related MTS score was significantly higher than that of the HCPs using the NRS scale (p < 0.0001). A significant difference was found between the MTS and the WHO severity grading (p < 0.0001).

Table 2 Comparison of PROMs and HCP-ROs

Quality of life

Descriptive health profiles of the sample are shown in Table 3, where the 3 levels of perceived alterations (no problem, mild problems, severe problems) at each time point are reported. Excluding mobility, percentages of the activities that were worse on day + 7 post-transplant compared to baseline values were observed, in particular regarding pain (69.5% vs 39.0% of patients referring pain).

Table 3 EQ-5D-3L: cohort’s health profile at each time point

The analysis of the overall self-rated health status assessed using the EQ-VAS score showed that general health status worsened, with a significant difference between baseline mean value and day + 14 post-transplant (p < 0.0001), with the nadir on day + 7 (mean 56.1; SD ± 18.8) followed by a slight improvement on day + 14. As shown in Table 4, many comparisons of the mean rank value at various time points proved to be significant, in particular those including day + 7 values. The same trend was observed for the EQ-5D-3L index; a significant difference between baseline and day + 14 (p = 0.0006) was found, as was the worsening of patient-reported general condition between baseline and day + 7 post-transplant (mean 0.714 ± 0.288). Figure 1 a and b) report both EQ-VAS mean score and EQ-5D-3L index mean score variations. The inferential analysis did not include T21 and T28 due to patient discharge from hospital.

Table 4 EQ-5D-3L visual analogue scale and index
Fig. 1
figure 1

EQ-VAS mean score and EQ-5D-3L index mean score variations

Discussion

This paper reports the secondary findings of a phase II study on the efficacy of an experimental BC–AV-based oral care protocol to prevent and treat OM in patients undergoing HSCT (autologous or allogeneic grafts). A first work compared this experimental approach with a historical control group [27]. The study collected OM-related PROMs in addition to HCP-ROs. In this second study, PROMs were described and compared with those collected and managed by HCPs. In addition, the QoL findings presented here highlight the results regarding the participants’ health profile description, their perceived general health status and EQ-5D-3L index variations among repeated measures.

PROMs allow patients to evaluate their health without any external HCP influence [34]. When used in clinical practice, these tools can be coupled with other clinical assessment instruments, which together provide a more accurate representation of reality [35]. The importance of patients’ perception of measures to understanding the impact of health on daily living and to modulating healthcare actions has been well described [36, 37], and PROMs have been recommended in clinical trials evaluating new strategies to OM prevention and treatment [38]. Diagnosis, treatment and various aspects of the patient-HCP relationship, such as communication issues, patient compliance and clinical outcomes, benefit from the use of PROMs [39, 40]. OM and its related pain, including physical, emotional, functional and psychological impairments, may have a dramatic impact on various aspects of HSCT patients’ QoL [41].

As previously demonstrated by Stiff and colleagues [24], our findings show that the use of OMDQ made it possible to detect oral problems at least 1 day sooner compared to the clinical assessment, while the duration of oral symptoms was similar. As reported by other authors [21], the implementation of OMDQ in clinical practice may foster a proactive approach by HCPs managing OM. In our study, OMDQ allowed us to identify some patients with mild MTS who were undetected by the WHO scale. Significant differences between the two assessment methods were found on OM severity grading distribution and on OM-related pain scores. These differences could be associated with various patient-related factors, including anxiety, depression, individual pain tolerance and difficulty coping with the emerging changes. They could also be due to HCP-related issues, such as the tendency to underestimate referred symptoms or difficulty maintaining competence in the use of the assessment scales. All HCPs who assessed OM and its related symptoms in this study regularly received training on the use of the WHO scale and other assessment tools. In addition, they assessed OM as they routinely do in their clinical practice and were not informed of the study objectives. However, a study effect cannot be excluded. For these reasons, a double-blind placebo-controlled randomised trial is being set up at our research hospital.

The trend of OM severity during HSCT has been well described in the literature. In the majority of affected patients, it increases rapidly after the initiation of conditioning, reaching the highest score on day + 7 post-transplant, then tends to decrease more slowly until day + 28 [42]. For this reason, and considering the pathobiological definition of OM [43], the collected OM data refer to the first day of conditioning and not to the day of transplantation, as in most studies. An interesting finding of our study was the description of both EQ-VAS and EQ-5D-3L Index score variations during the HSCT pathway, which highlighted a clear correlation with the trend of OM severity reported by Spielberg and colleagues [44] (Fig. 1 a and b), with the nadir of QoL scores at day + 7 post-transplant. Although OM is considered one of the worst toxic effects of the conditioning regimens [20], few studies have provided any information about the link between QoL and OM in HSCT patients [44], and only one study has attempted to understand the patients’ experience when affected by OM [45]. Further quantitative and qualitative studies are necessary to explore patients’ perception and to enhance its role in daily practice.

Conclusions

This work provides some findings on the relationship between PROMs and HCP-ROs in the setting of a phase II study exploring the efficacy and safety of a BC–AV-based oral care protocol to prevent and treat OM in HSCT patients. Compared to the HCPs’ assessments using the WHO and NRS scales, the OMDQ detected OM earlier, and its severity and related pain were significantly greater. The variations in the QoL measured using EQ-5D-3L instrument (EQ-VAS and EQ-5D-3L Index) were clearly correlated with the severity of OM during the hospital stay. A randomised controlled trial is being set up at our institution to confirm the efficacy of the experimental oral care protocol and to shed light on what has been reported here.