Introduction

The wish to hasten death (WTHD) is a reaction to a suffering which can occur in patients with life-threatening conditions [1, 2]. In the last two decades, several instruments have been designed to measure the WTHD and to identify factors related with it [36]. The most widely used instrument is the schedule of attitudes toward hastened death (SAHD). Developed by Rosenfeld et al. [7] the SAHD is a self-report questionnaire with 20 items that respondents must rate as true or false. The psychometric properties of the SAHD were originally studied in North American samples involving patients with AIDS [7] and palliative care (PC) cancer patients [4]. The SAHD has been successfully adapted into several languages [811], including Spanish [12].

Although the SAHD has become a well-established instrument for research purposes [7] some of its characteristics make it less suitable for use with patients who are physically or emotionally fragile. For example, the task of responding to 20 items may be exhausting for some patients [10, 12, 13]. Furthermore, while a clinically defined cut-off score for “high” desire to die has been proposed previously [13], a score for predicting the risk of a WTHD has yet to be established.

This study therefore had two aims: to develop a short form of the Spanish SAHD and to define, for this short form, a risk score for the WTHD in PC patients, using a stochastic model based on scores from the SAHD.

Materials and methods

We conducted an item reduction study of the Spanish version (SV) of the SAHD (SAHD-20 SV). Data came from a previous cross-sectional study [12] in which a convenience sample of 101 patients (62 male; M age 61.7 ± 11 years, range 33–84) from a PC oncology unit in Barcelona (Spain) completed the SAHD-20 SV. The sample is described in detail elsewhere [12].

Data analysis

We used a Rasch model to reduce the number of SAHD-20 SV items and to analyse whether the new set of items would retain the information needed to detect a risk for the WTHD in PC patients. The procedure involved three steps, in each of which a Rasch analysis was carried out iteratively: (1) after fitting data to the dichotomous Rasch model we selected items with high discrimination values (point-biserials ≥0.70). Point-biserials are Pearson’s product-moment correlations between each item location and the mean Rasch item estimation, excluding the item being analysed [14]. Higher values would correspond to a stronger WTHD construct. (2) Items with fit statistics (Infit and Outfit) outside the range 0.7–1.4 were examined and eventually removed. These goodness-of-fit statistics indicate how accurately or predictably the data fit the model [15]. (3) The person separation index—analogous to Cronbach’s alpha (reliability)—and the unidimensionality assumption of the final set of items were examined. A paired t test for the difference between two sets of items identified by a residual-based principal components analysis (PCAR) was made at the person level; the percentage of these tests outside the range of the 95 % confidence interval was not expected to exceed 5 %. If the Rasch quality criteria were satisfied, we examined the test probability function of the reduced set of items to find a score that would indicate patients with a ≥50 % risk of having a WTHD according to the new short version. In addition, the strength of the relationship between the reduced SAHD and the SAHD-20 SV was examined using Pearson’s correlations. In order to assess correlations with factors associated with the WTHD, we also examined the correlations of both the reduced (SAHD-5 SV) and 20-item versions of the SAHD with other instruments (all Spanish versions): Karnofsky Performance Status (KPS) [16], where higher scores [0–100] indicate a healthier status; the Barthel Index (BI) [17], to assess the degree of independence in activities of daily living (range 0–100, with higher scores indicating greater independence); the Eastern Cooperative Oncology Group Performance Status (ECOG-PS) [18] (range 0–4, with higher scores indicating poorer function); the Palliative care Outcome Scale (POS) [19], to assess patients’ palliative needs (range 0–44, with higher scores indicating greater needs); and the Hospital Anxiety and Depression Scale (HADS) [20], to assess psychological distress (range 0–21 for both anxiety (HADS-A) and depression (HADS-D), with higher scores indicating levels of anxiety or depression requiring specialist assessment).

Results

The first step of the reduction process showed that only seven items had point-biserial correlations >0.70. Table 1 shows the correlations between item measures and the mean Rasch item measures. In the second step (goodness-of-fit analysis), two of these items (20 and 16) were removed as their Infit and Outfit values were outside the established range; inspection of these two items suggested that the most likely cause of the misfit was the use of a double-barrelled statement (item 20) and a threatening wording that might drive participants to respond in a contrary way (item 16). The five remaining items had adequate goodness-of-fit values. Table 2 shows the item location values of the final five items and the goodness-of-fit statistics, as well as the two items removed in each of the analyses. Reliability of the final five-item scale was adequate (item separation index = 0.7). The percentage of individual t tests outside the range ±1.96 between items with positive and negative loadings from the PCAR was optimal (<5 %), thus confirming the hypothesis that the five items of the reduced scale (SAHD-5 SV) could be aligned on the same latent trait continuum. Participants’ mean score on the SAHD-5 SV was −1.72 logits (SE = 0.21). Figure 1 shows a plot of the test characteristic curve, or the sum of the item information functions, representing the Rasch model prediction for each measure relative to the WTHD when using the new SAHD-5 SV. The dashed lines indicate that the risk (≥50 %) of suffering a WTHD corresponds to patients who scored >2 on the SAHD-5 SV. Using this score and the SAHD-5 SV, 21 (20.8 %) participants had a risk of having a WTHD.

Table 1 Correlations between item measures of the SAHD and the mean Rasch item measure estimation
Table 2 Item location values of the final five items and the corresponding goodness-of-fit statistics (items are True/False in the schedule of attitudes toward hastened deatha), plus the two items (20 and 16) removed in each of the analyses

Correlation between the Rasch measures of the SAHD-5 SV and scores on the SAHD-20 SV was strong (r = 0.944, p < 0.01), and both versions of the SAHD (5 and 20 items) produced similar correlations with the other measures (Table 3).

Table 3 Pearson coefficients for SAHD-5 SV and SAHD-20 SV [12] in relation to the measures of performance status: Barthel Index [17], Karnofsky Performance Status [16], and Eastern Cooperative Oncology Group (ECOG) [18], Palliative Outcome Scale (POS) [19], and Hospital Anxiety and Depression Scale (HADS) [20]

Discussion

The results suggest that the proposed 5-item scale could be a valid and reliable instrument for assessing and detecting the risk of a WTHD. If so, the item reduction would confer an important advantage when assessing the WTHD in the most fragile patients.

The reduction process (similar to that used by Rosenfeld et al. [21] in the recently published English short form) drew attention to the possible influence of double-barrelled wording on patients’ understanding of some SAHD-20 SV items, the scores on which might therefore reflect two issues rather than just one as intended. This aspect, not highlighted in previous research [12], is likely due to the fact that Rasch analysis is the most restrictive model within IRT; other IRT models–such as 2PL or 3PL—might be less restrictive and would fit these items. It should also be noted that the psychometric properties of all other adaptations of the SAHD were originally examined using methods derived from classical test theory, which does not stress the study of individual items, unlike the approach taken here.

Clearly, the item reduction entails a certain loss of information. However, correlations with other instruments used to assess patients’ physical and psychological status were comparable for the Spanish SAHD-5 and SAHD-20 (see Table 3). Moreover, the SAHD-5 SV contains a feasible number of items for obtaining the information needed to assess the risk of a WTHD in clinical practice.

Assessment of the WTHD by means of the SAHD-5 SV identified 20.8 % of patients as being at risk of a WTHD (score >2), whereas the SAHD-20 SV [12] detected 16.8 % of patients with a score considered “high” WTHD in the literature. The short form thus appears to show increased sensitivity for detecting the WTHD. Because Rasch analysis is a probabilistic model it allows us to estimate the “amount” of WTHD among participants. The usual criterion in this context is that the probability or risk of presenting a given construct is high if the respondent scores ≥50 % on the measure used to assess that construct. This approach contrasts with previous research in which labels of ‘high’, ‘moderate’ or ‘low’ WTHD were established clinically, without using a measurement model or subsequent verification. Our results therefore suggest that the SAHD-5 SV might be useful for detecting the risk of a WTHD in PC patients. Further research is, however, necessary to evaluate the effectiveness of the instrument in similar patient populations, to clarify criteria for labelling a patient as experiencing a WTHD or not, and, thereafter, to assess the sensitivity and specificity of the new instrument using suitable methods (e.g. ROC curves).

This study has several limitations. One derives from the nature and purpose of the SAHD-20 SV, which meant that data were obtained from a relatively small convenience sample consisting solely of cancer patients. The lack of a gold standard for assessing the specificity and sensitivity of the proposed short form is a further limitation. However, data goodness of fit to the Rasch model was optimal. Furthermore the use of the Rasch model allowed us to convert discrete units into intervals, and thus the SAHD-5 SV could be used to study the sensitivity and specificity of other instruments. Another limitation concerns the weak to moderate correlation obtained for some of the scales that assess physical status or daily function. Other scales that examine psychological aspects such as meaning in life or hopelessness would likely have been more appropriate here.

By analysing the SAHD-20 SV at the item level, we were able not only to produce a short form in Spanish but also to identify some potential shortcomings in the original SAHD. We therefore believe that our approach and results will be of interest to researchers from other countries who are seeking to adapt this instrument or to interpret their own results, since the functioning of the SAHD is likely to be influenced by cultural factors.

Conclusion

The SAHD-5 SV may constitute a valid and reliable instrument for assessing and detecting the risk of the WTHD. As such, it could contribute to the development of more effective care plans for PC patients and be used as an outcome measure of intervention studies.