Patient-reported outcome measures for cancer caregivers: a systematic review

Shilling, Valerie; Matthews, Lucy; Jenkins, Valerie; Fallowfield, Lesley

doi:10.1007/s11136-016-1239-0

Review
Open access
Published: 12 February 2016

Volume 25, pages 1859–1876, (2016)

Quality of Life Research

Abstract

Purpose

Informal caregivers provide invaluable help and support to people with cancer. As treatments extend survival and the potential burdens on carers increase, there is a need to assess the impact of the role. This systematic review identified instruments that measure the impact of caregiving, evaluated their psychometric performance specifically in cancer and appraised the content.

Methods

A two-stage search strategy was employed to: (1) identify instruments that measure the impact of caregiving, and (2) run individual searches on each measure to identify publications evaluating psychometric performance in the target population. Searches were conducted in MEDLINE, EMBASE, CINAHL and PsycINFO and restricted to English-language instruments and English-language articles. Psychometric performance was evaluated for content and construct validity, internal consistency, test–retest reliability, precision, responsiveness and acceptability. Individual scale items were extracted and systematically categorised into conceptual domains.

Results

Ten papers were included reporting on the psychometric properties of eight measures. Although construct validity and internal consistency were most frequently evaluated, no study comprehensively evaluated all relevant properties. Few studies met our inclusion criteria so it was not possible to consider the psychometric performance of the measures across a group of studies. Content analysis resulted in 16 domains with 5 overarching themes: lifestyle disruption; well-being; health of the caregiver; managing the situation and relationships.

Conclusions

Few measures of caregiver impact have been subject to psychometric evaluation in cancer caregivers. Those that have been evaluated do not adequately capture changes in roles and responsibilities within the family and career, indicating the need for a new instrument.

Background

Informal caregivers, whether they are spouse, family member or friend, often provide a significant amount of help and support for people with cancer. Informal caregiving is pivotal to the overall outcome of a patient’s treatment, and thus, maintaining the health and satisfaction of caregivers is essential to maximise the well-being of both parties [1, 2]. Caregiving can undoubtedly place a strain on the caregiver [3], but the role can also provide a source of happiness and boost self-efficacy and a sense of worth [4].

An increasing number of patients are living a longer life with cancer. As such, there is a growing recognition that broader aspects of their lives and those of the family are affected across the disease trajectory [5]. Informal caregivers could be viewed as “second-order patients in their own right” [6]; consequently, a well-validated measure to assess the impact of disease and treatment on their lives and overall well-being is crucial.

There is a raft of measures designed to assess caregiver impact so the choice for researchers may be unclear. The instruments currently used focus on three areas: caregiver burden, caregiver need and quality of life. Some measures are not well validated, and many have been developed for use with caregivers in very different circumstances, for example the elderly with cognitive impairment [7]. In order to better inform researchers on the content and evaluation of commonly used instruments, we identified and evaluated the psychometric performance of measures used in the cancer caregiver population and appraised their content, what is and what is not captured, with particular regard to broader areas of life experience such as the impact on career and family.

Methods

The review involved a two-stage search: (1) to identify generic and cancer-specific self-report instruments used to measure the impact of caregiving on informal caregivers, and (2) to identify evidence about psychometric properties and performance of these instruments in the specific context of cancer.

Search stage 1: Identifying candidate instruments

A combination of controlled syntax (MeSH) and free-text terms was used. Four groups of terms were generated: (1) generic names for measures; (2) impact on caregiver; (3) describing the population; and (4) psychometric performance. OvidSP was used for MEDLINE [MEDLINE(R) In-Process & Other Non-Indexed Citations and MEDLINE(R) 1946 to Present] and EMBASE (1947–current) searches. EBSCOhost was used for CINAHL (1937–present) and PsycINFO (from 1800s to present) searches. Terms were modified as appropriate for each database and limited to English language only. Searches were run on 20 November 2014 (see “Appendix 1” for the search strategy used for MEDLINE, adapted for other databases).

Study selection criteria

Inclusion criteria for stage 1 were self-report instruments of the impact of caring for patients with cancer or any other condition on the caregiver. Searches were not limited by study design or date, but were restricted to articles in the English language.

Instruments were excluded if they were developed: (1) to address a broad population not specifically for caregivers; (2) to focus on caregivers of children or children who are themselves caregivers; (3) to be administered only by an interviewer or clinician; (4) to measure unmet needs or objective aspects of caregiving, e.g. the amount of time or nature of tasks fulfilled; (5) to evaluate caregivers’ assessment or beliefs about their caregiving skills or performance; (6) for use in a non-English-speaking population and for which an English version was not available; (7) for use by patients rather than caregivers (e.g. patient estimates of the impact of their illness on the caregiver); and/or (8) to measure caregiver bereavement.

Titles and abstracts were screened independently by two reviewers (VS/LM) for names of instruments that met the inclusion criteria, resulting in a list of eligible candidate instruments (Fig. 1).

Search stage 2: Identifying evidence of the psychometric properties of candidate instruments in the cancer caregiver population

Separate searches were conducted for each of the candidate instruments for studies designed to evaluate their psychometric performance in caregivers of cancer patients. Search terms are grouped as follows: (1) names and acronyms of the candidate instruments identified in stage 1; (2) target population; (3) psychometric terms; and (4) cancer terms (see “Appendix 2” for the search strategy used for MEDLINE and adapted for other databases). Searches were run on 16 January 2015 (CINAHL and PsycINFO) and 5 February 2015 (MEDLINE and EMBASE).

Study selection criteria

Studies that reported the reliability, validity, responsiveness, precision and/or acceptability of the caregiver impact measure and met the inclusion and exclusion criteria listed above were selected for this review. Cross-cultural studies were included only if referencing an English language version of the instrument. Searches were not limited by study design or date, but were limited to articles and instrument use in the English language.

In addition to the exclusion criteria from stage 1, papers were excluded if: (1) the instrument was used as a “gold standard” to test other measures; (2) psychometric evidence was reported incidentally in a study not designed to evaluate those properties; (3) the study addressed preference weighting or scaling issues for preference-based measures; or (4) the paper was an editorial, opinion, letter or meeting abstract. Titles and abstracts were screened independently by two reviewers (VS/LM, Fig. 2).

Citation chasing

Backwards citation chasing (one generation) using reference lists of all studies included in this stage of the review and forwards citation chasing (one generation) using Science Citation Index Expanded and Social Science Citation Index Expanded via Web of Science identified no additional eligible studies.

Data extraction

For each included measure, we extracted: name of measure and acronym, key reference/development paper, purpose of measurement, number of items, completion time, response options, recall period, population originally developed with and types of domains/dimensions assessed.

For each included paper, the following descriptive data were extracted: instrument version, first author name, publication year, study aim, study population, number of participants and setting/country where the study was conducted. Any data on evidence of the psychometric properties or performance of instruments were extracted including content validity (theoretical framework and/or qualitative research), construct validity (structural validity and hypothesis testing), internal consistency, test–retest reliability, precision, responsiveness and acceptability. Data were extracted by one reviewer (LM/VS) and checked by a second reviewer (LM/VS).

Evidence for psychometric performance

Evidence of psychometric performance was compared to reference criteria for (1) content validity (qualitative research with potential respondents and involvement in development stage and item generation, clear conceptual framework); (2) construct validity assessed through convergent and divergent validity demonstrated by the ability to differentiate known groups, and/or a pattern of correlation between the scale and other measures; (3) structural validity from factor analysis; (4) criterion validity (concurrent validity assessed through correlation with a gold standard and/or predictive validity where the predicted strength and direction of correlations/direction of group differences should be identified a priori); (5) reproducibility/test–retest reliability (intraclass correlation coefficient >0.7 adequate, >0.9 excellent); (6) internal consistency (Cronbach’s alpha coefficient 0.7 ≤ α ≤ 0.9, item total correlations >0.2); (7) responsiveness (change pre–post intervention statistically significant and/or difference of expected magnitude); (8) precision (assessment of measurement error, floor or ceiling effects <15 %; evidence from Rasch analysis); and (9) acceptability (non-response/non-completion of questionnaires, proportion of missing data) [8].
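As an illustration of the internal consistency criterion above, the following minimal Python sketch computes Cronbach's alpha and item-total correlations and checks them against the stated thresholds (alpha between 0.7 and 0.9, item-total correlations above 0.2). It is not taken from any of the reviewed studies. The function names and the example responses are hypothetical, and the use of corrected item-total correlations (each item against the sum of the remaining items) is an assumption, since the criterion does not specify the variant.

```python
from statistics import mean, variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    items = list(zip(*scores))  # one tuple of responses per item
    sum_item_var = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum_item_var / total_var)


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ssx = sum((a - mx) ** 2 for a in x)
    ssy = sum((b - my) ** 2 for b in y)
    return cov / (ssx * ssy) ** 0.5


def item_total_correlations(scores):
    """Assumed variant: each item correlated with the sum of the other items."""
    items = list(zip(*scores))
    totals = [sum(row) for row in scores]
    return [pearson(col, [t - v for t, v in zip(totals, col)]) for col in items]


def meets_internal_consistency(scores):
    """Apply the review's thresholds: 0.7 <= alpha <= 0.9, item-total r > 0.2."""
    alpha = cronbach_alpha(scores)
    return 0.7 <= alpha <= 0.9 and all(
        r > 0.2 for r in item_total_correlations(scores)
    )


# Hypothetical responses from 5 caregivers to a 3-item subscale
example = [[3, 4, 3], [2, 2, 3], [5, 4, 4], [1, 2, 1], [4, 5, 5]]
print(round(cronbach_alpha(example), 2), meets_internal_consistency(example))
```

The upper bound matters as much as the lower one: an alpha above 0.9 is treated here as failing the criterion, which is why values of 0.91 are described in the Results as slightly exceeding the limit.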

For each property, the paper was given a rating of 0 if it did not evaluate or report the property, ~ if the property was evaluated and met the criteria partially (e.g. not for all domains), + if the property was evaluated and met the criteria and – if the finding went against the prediction. Judgements on whether criteria were met were made by two reviewers (VS/LM) with disagreement resolved in discussion with another reviewer (LJF/VJ) where necessary. Content validity is only appraised for papers reporting measure development.
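The rating scheme described above can be written as a small lookup, sketched here in Python. The function name and the outcome labels ("met", "partial", "against") are hypothetical and chosen only for illustration; the four symbols are those defined in the text, with an ASCII hyphen standing in for the minus sign.

```python
def rate_property(evaluated, outcome=None):
    """Return the appraisal symbol for one psychometric property in one paper.

    evaluated: False if the paper did not evaluate or report the property.
    outcome:   'met', 'partial' or 'against' (hypothetical labels).
    """
    if not evaluated:
        return "0"
    ratings = {
        "met": "+",      # evaluated and met the criteria
        "partial": "~",  # met the criteria partially, e.g. not for all domains
        "against": "-",  # finding went against the prediction
    }
    return ratings[outcome]


print(rate_property(True, "partial"))
```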

Examination of instrument content and categorisation into related domains

Individual scale items from all included measures were systematically categorised by the authors into conceptual domains. Initial domains were identified from the literature, and additional domains were defined until all individual items had been mapped. The content of each was then reviewed by the team to ensure that the concepts were consistently applied and had face validity [9].

Results

The purpose of stage 1 was to generate a list of eligible candidate instruments. Thirty-two were identified (Fig. 1), and in stage 2, individual searches were conducted for each. Ten measures were excluded as no candidate papers were returned. The combined searches for each of the remaining 22 individual measures resulted in 365 records. After deduplication, 186 unique records were screened. One hundred and seventy-six were excluded because they did not meet inclusion criteria, resulting in the inclusion of 10 papers that reported on the psychometric properties of 8 eligible measures in the cancer caregiver population (see Fig. 2 for full details of reasons for exclusion). Table 1 details the general characteristics of the 8 included measures, including full name and acronym. We refer to measures by the acronym. Of the 8 included measures, 5 were initially developed for cancer caregivers, 2 of which were specifically developed for use in the palliative setting [10, 11]. Three measures were developed relatively recently [10–12] and five between 1980 and 1999 [13–17]. Two [10, 13] measure caregiver appraisal specifically, with a theoretical underpinning from the stress and coping model of Lazarus and Folkman [18]; two were designed to measure subjective burden ± distress [12, 17] (the Zarit Burden Interview was later revised [19]); three were multidimensional quality of life measures [11, 14, 16] and one a multidimensional measure of caregivers’ reactions to caring for a family member [15].
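The record flow reported above can be checked with simple arithmetic. The short Python sketch below uses only counts stated in this review; the variable names are illustrative.

```python
# Stage 1: candidate instruments
candidate_instruments = 32
no_papers_returned = 10
measures_searched = candidate_instruments - no_papers_returned  # 22

# Stage 2: records for the remaining measures
records_retrieved = 365
unique_records = 186
duplicates_removed = records_retrieved - unique_records  # 179
records_excluded = 176
papers_included = unique_records - records_excluded  # 10

# Measures with and without psychometric evidence in cancer caregivers
measures_included = 8
measures_without_evidence = candidate_instruments - measures_included  # 24

print(measures_searched, duplicates_removed, papers_included,
      measures_without_evidence)
```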

Table 1 General characteristics of identified measures

Table 2 describes the ten studies reporting on the psychometric properties of the measures in the cancer caregiver population in terms of the instrument and version, study aim, population, setting country and number of participants.

Table 2 Studies evaluating psychometric performance of measures in the cancer caregiver population

Psychometric performance

Appraisal of the psychometric performance reported in each paper is given in Table 3.

Table 3 Appraisal of measure performance and characteristics in the cancer caregiver population

Content validity

In the six studies describing measure development, content validity was generally well described and acceptable. Four (BASC, CQOLC, CRA, QOLLTI-F) describe qualitative work with potential respondents for item development and reduction [11, 12, 14, 15]; two (ACS, FACQ-PC) describe a clear underpinning conceptual framework but no involvement of potential respondents [10, 13].

Criterion validity

Concurrent validity: the 6 short forms of the ZBI were validated against the 22-item version as gold standard. Spearman rank order correlations ranged from 0.63 for the one-item version to 0.95 for the 12-item scale [20]. Concurrent validity of the ACS Benefit subscale only was assessed against the Benefit Finding Scale as the gold standard (r = 0.56) [21].

Predictive validity: predictive validity of the ACS was assessed against hopelessness and depression scores at time 2 [21]. Although overall a significant amount of variance in hopelessness (33.3 %) and depression (27.8 %) was explained by ACS scores at time 1, only half of the predictive validity hypotheses were supported. Criterion validity of the QOLLTI-F was assessed using a 2-item measure of global quality of life. The QOLLTI-F predicted between 43 and 55 % of the variance depending on whether individual items (55 %), subscale scores (53 %) or total score (43 %) were regressed.

Structural validity

Structural validity using factor analysis was described in five of the studies. For the CRA [15], exploratory factor analysis supported the five-subscale solution accounting for 65.1 % of variance. Confirmatory factor analysis demonstrated factorial invariance across disease (cancer vs dementia), caregiver type (spouse vs non-spouse) and over time. For the QOLLTI-F [11], the authors describe an acceptable seven-factor solution with exploratory factor analysis (although the total amount of variance explained is not reported) with factor loadings from 0.39 to 0.88. For the FACQ-PC [10], principal axis factor analysis supports a four-factor solution with factor loadings ranging from 0.33 to 0.92. Although all items load highest on the predicted factor, two items cross load (>0.3). Lambert and colleagues [21] report a three-factor solution for the ACS which supports the original subscales, had minimal cross-loadings and factor loadings ranging from 0.405 to 0.726. Glajchen et al. [12] report a five-factor solution for the BASC while noting that one item cross loads. The authors do not report their methods or the factor loadings.

Construct validity: hypothesis testing

Six studies assessed construct validity through convergent and divergent validity. For the ACS, only 5/12 correlations between subscales of the ACS and other measures exceeded the authors’ criterion of ±0.3 to demonstrate construct validity [21]. Both papers assessing the CQOLC report moderate-to-high correlations with measures completed at the same time [14, 22]. Only the initial validation study assessed divergent validity using dissimilar measures and found that these gave low correlations with CQOLC scores as expected [14]. Construct validity of the BASC was supported by an appropriate pattern of moderate–strong correlations with similar measures [12].

Strong correlations were found between subscales of the FACQ-PC [10] and measures used to test convergent validity; however, positive caregiving appraisals were only weak–moderately associated with positive affect (r = 0.3). Similarly, to demonstrate divergent validity, four correlations were calculated between subscales and other measures which should yield low, negative correlations. While all were negative, two correlations were moderate in magnitude (r = −0.4 and r = −0.38).

Construct validity for the CRA [15] was assessed by correlating subscale scores with caregiver depression and patient dependencies in activities of daily living (ADL). The five subscales were, as predicted, weakly correlated with patient dependencies in ADL. Correlations with depression were in the appropriate direction and ranged from −0.23 to 0.57.

Three studies conducted hypothesis testing by assessing “known-group” differences. Group differences analysis for the ACS was only partially supportive of construct validity with only 3/9 hypotheses significant [21]. The BASC was able to discriminate between male and female caregivers and between different relationships between caregivers and patients. The negative personal impact subscale, but not the total score, differentiated between caregivers with and without mental health conditions. There were weak correlations overall with depression, high blood pressure and gastrointestinal complaints. All short forms of the ZBI were shown to have good discriminative validity to correctly classify participants as those with and without burden (compared with the classification on the 22-item version as gold standard).

Two studies [13, 23] report comparisons between groups (e.g. male/female; spouse/non-spouse), but these were not established a priori as known-group differences for hypothesis testing. One did not examine construct validity with convergent, divergent or known-group analysis [11].

Internal consistency

Internal consistency was assessed in all papers. All subscales of the ACS had α > 0.7 in both papers [13, 21] with the exception of the challenge subscale, which was subsequently dropped from the measure [13]; the threat subscale slightly exceeded the upper limit of α at 0.91 in one paper [13]. Overall α for the BASC was just acceptable (0.7); the negative personal impact factor, which can be used as an independent subscale, was 0.8. For the CQOLC, α approaches and slightly exceeds the upper limit (α = 0.87 and 0.91, respectively) [14, 22]. The five subscales of the CRA range from α = 0.8 to 0.9 [15] and the four subscales of the FACQ-PC from α = 0.73 to 0.86. Item total correlations were all in excess of 0.2, the strongest 0.78 [10]. Overall α for the QOLLTI-F was 0.86. The individual subscales were generally weaker ranging from α = 0.48 to 0.81 which may reflect the small number of items in some subscales. The measure also includes two single-item subscales [11]. Internal consistency for the QOL-FV was α = 0.89. Finally, internal consistency for the ZBI 22-item version as gold standard was α = 0.88 and ranged from 0.69 for the 4-item short version to 0.85 for the 12-item version.

Test–retest reliability

Three of the five papers with at least two time points did not attempt to assess test–retest reliability [15, 21, 23]. Test–retest reliability of the CQOLC was found to be excellent (0.95) [14]. For the total QOLLTI-F, test–retest reliability was found to be acceptable between T1 and T2 (0.77) and T2 and T3 (0.80). Intraclass correlations for individual subscales were below an acceptable level in 10 out of 14 cases, which may reflect the small number of items in the subscales [11].

Responsiveness

Responsiveness of the QOLLTI-F [11] was assessed by contrasting subscale scores on days that participants considered to be bad, average and good. These differences were statistically significant in all comparisons with the exception of the financial concerns subscale between average and good days. All differences between good and bad days exceeded 0.5 s.d. for minimal important difference. Only 3/8 comparisons did so between good and average and 4/8 between average and bad days.
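The 0.5 s.d. threshold for a minimal important difference can be illustrated with a short Python sketch. This is a hedged example rather than the method used in the QOLLTI-F study: the choice of a pooled standard deviation as the denominator is an assumption, the function names are hypothetical, and the scores are invented.

```python
from statistics import mean, variance


def standardized_difference(scores_a, scores_b):
    """Absolute mean difference divided by the pooled s.d. (assumed denominator)."""
    n_a, n_b = len(scores_a), len(scores_b)
    pooled_var = (
        (n_a - 1) * variance(scores_a) + (n_b - 1) * variance(scores_b)
    ) / (n_a + n_b - 2)
    return abs(mean(scores_a) - mean(scores_b)) / pooled_var ** 0.5


def exceeds_mid(scores_a, scores_b, threshold=0.5):
    """True if the difference is larger than `threshold` standard deviations."""
    return standardized_difference(scores_a, scores_b) > threshold


# Hypothetical subscale scores on "bad" and "good" days
bad_days = [4.0, 5.0, 6.0]
good_days = [6.0, 7.0, 8.0]
print(exceeds_mid(bad_days, good_days))
```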

The “potential to be responsive to change” of the CQOLC was assessed by using CQOLC to predict patient performance status at a single time point rather than measure responsiveness to change over time. The studies report contradictory findings: in one [14], the predicted significant negative correlation between CQOLC scores and patient performance status is reported as significant (r = −0.46, p < 0.0001), but in the other [22], this correlation approaches zero (r = 0.09).

Precision

None of the included studies conducted Rasch analysis or an assessment of measurement error. Floor and ceiling effects were not formally reported in any paper although two subscales of the QOLLTI-F [11] were described as having a lack of variance due to ceiling effects which made them less predictive of global quality of life. The subscales, quality of care and relationships, comprised two items each.

Acceptability

The acceptability of measures was not consistently reported and was difficult to assess using missing data and participation rates, as the measure is often given as part of a pack and information is not assessed separately. No information pertaining to acceptability was provided by four studies [10, 20, 21, 23]. In five studies, acceptability was appraised as only partially evidenced due to high dropout or incomplete data [11, 12, 14, 15, 22], surprising for the QOLLTI-F which had thoroughly tested acceptability in the development phase [11]. For the ACS [13], overall response rate was 74 % (including postal responses) and only 3/50 participants were eliminated due to missing data, suggesting the questionnaire was acceptable.

Examination of instrument content and categorisation into related domains

The 8 included instruments yielded 194 individual items. These were categorised into 16 conceptual domains under 5 overarching themes of approximately equal size: lifestyle disruption (22 % of items); well-being (22 %); health of the caregiver (21 %); managing the situation (18 %) and relationships (18 %). The most dominant domains were “confidence, self-esteem and self-efficacy” (24 items across 7 measures) and “psychological health of the caregiver” (22 items across 6 measures). The least represented were “impact on other family members” (2 items across 2 measures) and “impact on paid employment” (2 items across 2 measures). The distribution and total number of items across the different domains along with example items are given in Table 4.

Table 4 Domains assessed by each measure

Discussion

This systematic review was conducted to investigate instruments commonly used to measure caregiver impact in cancer. Specifically, we sought to identify (1) what caregivers were being asked about, and (2) whether the measures performed well in psychometric evaluation. Psychometric appraisal is critical to establish the quality and standards of a measure in a given context. With so many instruments available to researchers, this review is intended as a resource to enable researchers to judge for themselves whether the content and quality of the instruments described match their requirements.

For 24 of the 32 identified measures, we found no evidence of psychometric performance using English language versions with cancer caregivers (see electronic supplementary material for a list of these measures). This is not to say that the measures have not been evaluated, but that we found no evidence in cancer. When assessing the performance of an instrument, the context is critical as it may perform differently in other populations. Without evaluation in cancer, researchers cannot be sure that instruments are reliably measuring the intended constructs. For 6 of the remaining 8 questionnaires, evidence of psychometric performance was identified in only a single study. The small number of studies identified meant that the evidence for psychometric performance was appraised for each individual paper, rather than allowing the appraisal to consider performance across a group of studies. In particular, the lack of studies beyond the initial validation of some measures is of concern. Most studies collected data at a single time point, and so, responsiveness to change, test–retest reliability and measurement error were not assessed. For the most part, content validity and internal consistency were reported and were adequate. Structural validity was assessed using factor analysis in five studies; four met these criteria, one only partially. Construct validity was assessed using different approaches to hypothesis testing in seven papers, fully meeting the criteria in only three. In the limited number of papers included, strongest support for psychometric performance was reported for the CRA and CQOLC.

We set out to examine not only the psychometric performance of these measures in a cancer population, but also to understand what concepts and domains were being assessed. Only one of the 16 conceptual domains, time for self, social life and leisure, was represented in all eight instruments. There was considerable overlap in the domains measured, however, with 8/16 domains being assessed in at least 6/8 measures. We have identified several areas which are not well captured by the instruments included in this review.

Paid employment

Impact on paid employment was assessed with a single question on two measures, neither of which addressed impact on career aspiration and planning or career progression, simply whether paid employment had been affected (FACQ-PC [10], QOL-F [16]).

Sexual activity

Only two questionnaires ask about relationships in terms of sexual activity (CQOLC [14], QOL-F [16]).

Family members

We also found that impact on the family as a unit was not well covered in the current measures. Impact on other family members was only addressed with a single question on two questionnaires (again CQOLC [14] and QOL-F [16]).

Current scales do not adequately capture role changes and responsibilities in the household and family routines, for example the impact on other caregiving responsibility such as to children or parents. Taking on new roles and responsibilities can raise issues around feelings of competence in the role, role strain and conflict and family cohesion. Such changes may be fluid as the cared-for member of the family moves through different phases of their cancer and treatment.

Five of the eight measures included in this review were developed between 1980 and 1999. There has been considerable societal change in the intervening years; families, roles and responsibilities are structured differently. It is not clear how appropriate some of the older questionnaires are for the present day. The activities associated with informal caregiving incorporate a range of tasks affecting different aspects of the life of the caregiver and the whole family [25, 26]. The impact of caregiving will vary depending not only on the patient’s situation but also on family make-up, in terms of other caregiving responsibilities, financial and occupation role responsibilities and time of life [6, 27, 28]. Impact is also likely affected by the number of other social roles, such as employment and other caregiving responsibilities that the caregiver has [3]. There is limited research about how the effect on variables such as employment and role strain might change over time, as caring responsibilities likely vary in line with different lines of treatment or transition to palliative care [27, 29, 30]. Future measures should attempt to capture the changing nature of caregiver impact.

We have identified a number of areas which are currently not well captured by measures that have been evaluated in cancer. These gaps may exist for several reasons. First, some measures were not initially developed for this population, and so, constructs important to cancer caregivers may not have the same salience. Second, some measures are old and may not reflect what is important in current society. Third, advances in cancer treatments mean that many more people are living a long life with cancer. For some, initial therapy is just the start of a journey that will involve repeated lines of treatment over time. The patient and the whole family have to continually adjust to a fluid situation and will be impacted variably at different times while trying to maintain a sense of normality in other aspects of their lives. Better treatments mean a longer life with cancer is a possibility for patients; however, we need ways to measure the longer-term impacts of cancer and cancer treatment for them and their informal caregivers. At the very least, we would suggest caregiver input into updating some of the older content if not the development of a new measure to capture the broader impacts we have described.

Limitations

The search strategy may have limited the number of papers identified in two ways: (1) searching for measures by name and acronym. The precise wording of the measure name and even the acronym sometimes varied; (2) reporting standards have changed; some older papers have poor use of keywords and do not always include psychometric terms or the names of measures in title/abstract/keywords. The impact of both of these limitations is mitigated by thorough backwards and forwards citation chasing.

We intentionally restricted the review to studies that reported on the psychometric properties of the English version of measures. This decision was taken as we felt we could not assume cultural equivalence for the caregiving role or the salience of different aspects of burden and impact in diverse populations. We took the decision to exclude all non-English versions of the measures rather than make subjective decisions as to whether one culture was sufficiently similar, while another was not. We are aware, however, that there are a number of studies reporting on the psychometric properties of other language versions of measures included in this review, e.g. [31–38]. We acknowledge there may be cultural differences between and within different countries where English is commonly spoken and where measures developed in English have been used. While this is an extremely important area of research, it is beyond the remit of the current review and it is not an aim of this study to investigate these potential differences. In this review, 7/10 included studies were conducted in the USA and 1 study each in the UK, Australia and Canada.

We also recognise that the pool of individual items identified is restricted by our stringent inclusion criteria for measures. Measures developed in other contexts (e.g. for family function in a paediatric setting [39] or for economic evaluation [40]), domain-specific measures [41] and multidimensional measures that have not been subject to psychometric evaluation in cancer caregivers in the English language [6, 42, 43] may include concepts and items that are pertinent, but these would need to be evaluated in appropriate studies.

Conclusions

A large number of measures purport to assess caregiver impact, but most have not been subject to psychometric evaluation in cancer populations. Few studies met our inclusion criteria so it was not possible to consider psychometric performance of the measures across a group of studies. Our content analysis identified several areas which are currently not well captured. These include changes to career aspiration and planning, changes in roles and responsibilities within the family and the way the family functions as a unit. We also note that some of the measures were developed up to 35 years ago, and their relevance to the current day may need to be reviewed. Strategies to overcome some of these limitations could include caregiver input into revising existing measures or using two or more measures to cover a broader range of outcome domains. However, our review suggests there is a need for a new measure capturing the impacts on broader areas of life for the caregiver and the family unit.

References

1. Glajchen, M. (2004). The emerging role and needs of family caregivers in cancer care. The Journal of Supportive Oncology, 2(2), 145–155.
2. Northouse, L. L., et al. (2010). Interventions with family caregivers of cancer patients: Meta-analysis of randomized trials. CA: A Cancer Journal for Clinicians, 60(5), 317–339.
3. Kim, Y., et al. (2006). Psychological adjustment of cancer caregivers with multiple roles. Psychooncology, 15(9), 795–804.
4. Brouwer, W. B. (2006). Too important to ignore: Informal caregivers and other significant others. Pharmacoeconomics, 24(1), 39–41.
5. Haylock, P. J. (2010). Advanced cancer: Emergence of a new survivor population. Seminars in Oncology Nursing, 26(3), 144–150.
6. Cameron, J. I., et al. (2002). Lifestyle interference and emotional distress in family caregivers of advanced cancer patients. Cancer, 94(2), 521–527.
7. Deeken, J. F., et al. (2003). Care for the caregivers: A review of self-report instruments developed to measure the burden, needs, and quality of life of informal caregivers. Journal of Pain and Symptom Management, 26(4), 922–953.
8. Fitzpatrick, R., et al. (1998). Evaluating patient-based outcome measures for use in clinical trials. Health Technology Assessment, 2(14), 1–74.
9. Macefield, R. C., et al. (2014). Developing core outcomes sets: Methods for identifying and including patient-reported outcomes (PROs). Trials, 15, 49.
10. Cooper, B., Kinsella, G. J., & Picton, C. (2006). Development and initial validation of a family appraisal of caregiving questionnaire for palliative care. Psychooncology, 15(7), 613–622.
11. Cohen, R., et al. (2006). QOLLTI-F: Measuring family carer quality of life. Palliative Medicine, 20(8), 755–767.
12. Glajchen, M., et al. (2005). Development of a brief assessment scale for caregivers of the medically ill. Journal of Pain and Symptom Management, 29(3), 245–254.
13. Oberst, M. T., et al. (1989). Caregiving demands and appraisal of stress among family caregivers. Cancer Nursing, 12(4), 209–215.
14. Weitzner, M. A., et al. (1999). The Caregiver Quality of Life Index-Cancer (CQOLC) scale: Development and validation of an instrument to measure quality of life of the family caregiver of patients with cancer. Quality of Life Research, 8(1–2), 55–63.
15. Given, C. W., et al. (1992). The caregiver reaction assessment (CRA) for caregivers to persons with chronic physical and mental impairments. Research in Nursing & Health, 15(4), 271–283.
16. Ferrell, B. R., et al. (1999). Family caregiving in cancer pain management. Journal of Palliative Medicine, 2(2), 185–195.
17. Zarit, S. H., Reever, K. E., & Bach-Peterson, J. (1980). Relatives of the impaired elderly: Correlates of feelings of burden. Gerontologist, 20(6), 649–655.
18. Lazarus, R., & Folkman, S. (1984). Stress, appraisal and coping. New York: Springer.
19. Zarit, S. H., Orr, N. K., & Zarit, J. M. (1985). The hidden victims of Alzheimer’s disease: Families under stress. New York: New York University Press.
20. Higginson, I. J., et al. (2010). Short-form Zarit Caregiver Burden Interviews were valid in advanced conditions. Journal of Clinical Epidemiology, 63(5), 535–542.
21. Lambert, S. D., et al. (2015). Measuring appraisal during advanced cancer: Psychometric testing of the appraisal of caregiving scale. Patient Education and Counseling, 98(5), 633–639.
22. Weitzner, M. A., & McMillan, S. C. (1999). The Caregiver Quality of Life Index-Cancer (CQOLC) Scale: Revalidation in a home hospice setting. Journal of Palliative Care, 15(2), 13–20.
23. Sherman, D. W., et al. (2006). Quality of life of patients with advanced cancer and acquired immune deficiency syndrome and their family caregivers. Journal of Palliative Medicine, 9(4), 948–963.
24. Stommel, M., et al. (1992). Confirmatory factor analysis (CFA) as a method to assess measurement equivalence. Research in Nursing and Health, 15(5), 399–405.
25. Girgis, A., & Lambert, S. D. (2009). Caregivers of cancer survivors: The state of the field. Cancer Forum, 33, 167–171.
26. Girgis, A., et al. (2013). Physical, psychosocial, relationship, and economic burden of caring for people with cancer: A review. Journal of Oncology Practice, 9(4), 197–202.
27. Fletcher, B. S., et al. (2012). The cancer family caregiving experience: An updated and expanded conceptual model. European Journal of Oncology Nursing, 16(4), 387–398.
28. Weitzner, M. A., Haley, W. E., & Chen, H. (2000). The family caregiver of the older cancer patient. Hematology/Oncology Clinics of North America, 14(1), 269–281.
29. Stenberg, U., Ruland, C. M., & Miaskowski, C. (2010). Review of the literature on the effects of caring for a patient with cancer. Psychooncology, 19(10), 1013–1025.
30. Given, B. A., Sherwood, P., & Given, C. W. (2011). Support for caregivers of cancer patients: Transition after active treatment. Cancer Epidemiology Biomarkers & Prevention, 20(10), 2015–2021.
31. Bachner, Y. G. (2013). Preliminary assessment of the psychometric properties of the abridged Arabic version of the Zarit Burden Interview among caregivers of cancer patients. European Journal of Oncology Nursing, 17(5), 657–660.
32. Can, G., et al. (2011). A psychometric validation study of the Quality of Life and FAMCARE scales in Turkish cancer family caregivers. Quality of Life Research, 20(8), 1319–1329.
33. Grov, E. K., et al. (2006). The caregiver reaction assessment: Psychometrics, and temporal stability in primary caregivers of Norwegian cancer patients in late palliative phase. Psychooncology, 15(6), 517–527.
34. Khanjari, S., Oskouie, F., & Langius-Eklof, A. (2012). Psychometric testing of the Caregiver Quality of Life Index-Cancer scale in an Iranian sample of family caregivers to newly diagnosed breast cancer women. Journal of Clinical Nursing, 21(3–4), 573–584.
35. Lafaye, A., et al. (2013). The Caregivers Quality of Life Cancer index scale (CQoLC): An exploratory factor analysis for validation in French cancer patients’ spouses. Quality of Life Research, 22(1), 119–122.
36. Nijboer, C., et al. (1999). Measuring both negative and positive reactions to giving care to cancer patients: Psychometric qualities of the Caregiver Reaction Assessment (CRA). Social Science and Medicine, 48(9), 1259–1269.
37. Rhee, Y. S., et al. (2005). Korean version of the caregiver quality of life index-cancer (CQOLC-K). Quality of Life Research, 14(3), 899–904.
38. Schur, S., et al. (2014). Validation of the “Quality of Life in Life-Threatening Illness-Family Carer Version” (QOLLTI-F) in German-speaking carers of advanced cancer patients. Supportive Care in Cancer, 22(10), 2783–2791.
39. Stein, R. E., & Riessman, C. K. (1980). The development of an impact-on-family scale: Preliminary findings. Medical Care, 18(4), 465–472.
40. Brouwer, W. B., et al. (2006). The CarerQol instrument: A new instrument to measure care-related quality of life of informal caregivers for use in economic evaluations. Quality of Life Research, 15(6), 1005–1021.
41. Giovannetti, E. R., et al. (2009). Construct validity of the Work Productivity and Activity Impairment questionnaire across informal caregivers of chronically ill older patients. Value Health, 12(6), 1011–1017.
42. Minaya, P., et al. (2012). The CareGiver Oncology Quality of Life questionnaire (CarGOQoL): Development and validation of an instrument to measure the quality of life of the caregivers of patients with cancer. European Journal of Cancer, 48(6), 904–911.
43. McKee, K. J., et al. (2003). The COPE index—a first stage assessment of negative impact, positive value and quality of support of caregiving in informal carers of older people. Aging and Mental Health, 7(1), 39–52.

Acknowledgments

We would like to thank Tamylin Bakas, Robin Cohen, Bryan Cooper, Charles Given, Dorothy Luckie, Laurel Northouse, Madeleine Oberst and Steve Zarit for their helpful responses to requests for additional information about measures. This review is part of an investigator-initiated study funded by Bristol-Myers Squibb.

Author information

Authors and Affiliations

Sussex Health Outcomes Research and Education in Cancer, Brighton and Sussex Medical School, University of Sussex, Falmer, Brighton, BN1 9QG, UK
Valerie Shilling, Lucy Matthews, Valerie Jenkins & Lesley Fallowfield

Corresponding author

Correspondence to Valerie Shilling.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 14 kb)

Appendices

Appendix 1: Search strategy phase 1 (used in MEDLINE and adjusted for other databases)

caregivers MedLine final 20th Nov

1. exp “outcome assessment (Health Care)”/
2. tool.ti,ab.
3. instrument.ti,ab.
4. questionnaire.ti,ab.
5. index.ti,ab.
6. indices.ti,ab.
7. scale.ti,ab.
8. survey.ti,ab.
9. interview.ti,ab.
10. inventory.ti,ab.
11. outcome assessment.ti,ab.
12. outcome measure.ti,ab.
13. (measur* adj4 (quality or health or impact or burden or well being or wellbeing or lifestyle or family function or experience)).ti,ab.
14. (assess* adj4 (quality or health or impact or burden or well being or wellbeing or lifestyle or family function or experience)).ti,ab.
15. or/1-14
16. exp “quality of life”/
17. quality of life.ti,ab.
18. health outcome*.ti,ab.
19. health status.ti,ab.
20. (well being or wellbeing).ti,ab.
21. ((caring or caregiving or caregiver* or carer*) adj2 impact).ti,ab.
22. ((caring or caregiving or caregiver* or carer*) adj2 burden).ti,ab.
23. ((caring or caregiving or caregiver* or carer*) adj2 experience).ti,ab.
24. ((caring or caregiving or caregiver* or carer*) adj2 stress).ti,ab.
25. ((caring or caregiving or caregiver* or carer*) adj2 strain).ti,ab.
26. health utility.ti,ab.
27. lifestyle interference.ti,ab.
28. family function*.ti,ab.
29. or/16-28
30. exp “caregivers”/
31. (carer or caregiver).ti,ab.
32. ((family or spouse or husband or wife or partner or friend) adj5 caring).ti,ab.
33. ((child* or son or daughter or parent or relative or relation) adj5 caring).ti,ab.
34. or/30-33
35. exp “reproducibility of results”/
36. exp “psychometrics”/
37. reliab*.ti,ab.
38. valid*.ti,ab.
39. psychometric.ti,ab.
40. or/35-39
41. 15 and 29 and 34 and 40
42. limit 41 to english language
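
The Boolean structure of this strategy can be illustrated with a minimal Python sketch. It is not part of the original search and does not query any database. It treats the hits of each search line as a set of record identifiers, so that the "or/" lines (15, 29, 34 and 40) become set unions within a concept block and line 41 becomes the intersection of the four blocks. The record identifiers and the helper names `or_block` and `and_blocks` are hypothetical and chosen only for illustration.

```python
from functools import reduce


def or_block(*hit_sets):
    """Union of several hit sets, mirroring an 'or/1-14' style line."""
    return set().union(*hit_sets)


def and_blocks(*blocks):
    """Intersection of concept blocks, mirroring '15 and 29 and 34 and 40'."""
    return reduce(set.intersection, blocks)


# Hypothetical record IDs returned by a few lines in each concept block.
measurement = or_block({1, 2, 3}, {3, 4})   # stands in for line 15
outcome = or_block({2, 3}, {4, 5})          # stands in for line 29
caregiver = or_block({3, 4, 6})             # stands in for line 34
psychometric = or_block({3}, {4, 7})        # stands in for line 40

# Only records that appear in all four blocks survive (line 41).
final = and_blocks(measurement, outcome, caregiver, psychometric)
print(sorted(final))
```

A record is retained only if it matches at least one term in every block, which is why broadening the terms within a block increases sensitivity while adding a further block reduces the number of hits.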

Appendix 2: Search Strategy to identify evidence of psychometric performance of candidate instruments when used with caregivers to cancer patients (used in MEDLINE and adjusted for other databases)

Database: Ovid MEDLINE(R) In-Process & Other Non-Indexed Citations and Ovid MEDLINE(R) <1946 to Present>
Search strategy:
1. caregivers/
2. (carer* or caregiver*).ti,ab.
3. ((family or spouse or husband or wife or partner or friend) adj5 caring).ti,ab.
4. ((child* or son or daughter or parent or relative or relation) adj5 caring).ti,ab.
5. or/1-4
6. reliab*.ti,ab,kw.
7. valid*.ti,ab,kw.
8. evaluat*.ti,ab,kw.
9. repeatab*.ti,ab,kw.
10. acceptab*.ti,ab,kw.
11. responsiv*.ti,ab,kw.
12. feasib*.ti,ab,kw.
13. psychometr*.ti,ab,kw.
14. or/6-13
15. neoplasm/
16. cancer.ti,ab.
17. oncology.ti,ab.
18. 15 or 16 or 17
19. 5 and 14 and 18
For each candidate instrument: 19 AND [Name of measure, including variants & acronyms]
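
As an illustration of the final per-instrument step, the following Python sketch builds the two extra search lines for one candidate measure. It is a sketch under stated assumptions, not the authors' actual search strings: the helper names `name_clause` and `per_instrument_lines` are hypothetical, and the choice to quote multi-word names and to search the title, abstract and keyword fields (`.ti,ab,kw.`) is assumed rather than reported in the appendix. The example name and acronym are taken from the reference list.

```python
def name_clause(variants):
    """OR together the name variants of one measure (assumed field suffix)."""
    # Multi-word names are quoted so they are searched as phrases (assumption).
    quoted = [f'"{v}"' if " " in v else v for v in variants]
    return "(" + " or ".join(quoted) + ").ti,ab,kw."


def per_instrument_lines(variants, base_line=19):
    """Return the name line and the line combining it with the base search."""
    name_line = base_line + 1
    return [
        f"{name_line}. {name_clause(variants)}",
        f"{name_line + 1}. {base_line} and {name_line}",
    ]


lines = per_instrument_lines(["caregiver reaction assessment", "CRA"])
for line in lines:
    print(line)
```

Listing every known variant of a name and acronym in the first line matters here, because a record that uses an unlisted variant would be missed by the combined line.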

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Shilling, V., Matthews, L., Jenkins, V. et al. Patient-reported outcome measures for cancer caregivers: a systematic review. Qual Life Res 25, 1859–1876 (2016). https://doi.org/10.1007/s11136-016-1239-0

Accepted: 27 January 2016
Published: 12 February 2016
Issue Date: August 2016
DOI: https://doi.org/10.1007/s11136-016-1239-0
