Identification, description and appraisal of generic PROMs for primary care: a systematic review

Murphy, Mairead; Hollinghurst, Sandra; Salisbury, Chris

doi:10.1186/s12875-018-0722-9

Identification, description and appraisal of generic PROMs for primary care: a systematic review

Research article
Open access
Published: 15 March 2018

Volume 19, article number 41, (2018)
Cite this article

Download PDF

You have full access to this open access article

BMC Family Practice Aims and scope Submit manuscript

Identification, description and appraisal of generic PROMs for primary care: a systematic review

Download PDF

4564 Accesses
21 Citations
12 Altmetric
Explore all metrics

Abstract

Background

Patients attend primary care with many types of problems and to achieve a range of possible outcomes. There is currently a lack of patient-reported outcome measures (PROMs) designed to capture these diverse outcomes. The objective of this systematic review was to identify, describe and appraise generic PROMs suitable for measuring outcomes from primary care.

Methods

We carried out a systematic Medline search, supplemented by other online and hand-searches. All potentially relevant PROMs were itemised in a long-list. Each PROM in the long-list which met inclusion criteria was included in a short-list. Short-listed PROMs were then described in terms of their measurement properties and construct, based on a previously published description of primary care outcome as three constructs: health status, health empowerment and health perceptions. PROMs were appraised in terms of extent of psychometric testing (extensive, moderate, low) and level of responsiveness (high, medium, low, unknown).

Results

More than 5000 abstracts were identified and screened to identify PROMs potentially suitable for measuring outcomes from primary care. 321 PROMs were long-listed, and twenty PROMs were catalogued in detail. There were five PROMs which measured change directly, without need for a baseline. Although these had less strong psychometric properties, they may be more responsive to change than PROMs which capture status at a point in time. No instruments provided coverage of all three constructs. Of the health status questionnaires, the most extensively tested was the SF-36. Of the health empowerment instruments, the PEI, PAM and heiQ provided the best combination of responsiveness and psychometric testing. The health perceptions instruments were all less responsive to change, and may measure a form of health perception which is difficult to shift in primary care.

Conclusions

This systematic review is the first of its kind to identify papers describing the development and validation of generic PROMs suitable for measuring outcomes from primary care. It identified that: 1) to date, there is no instrument which comprehensively covers the outcomes commonly sought in primary care, and 2) there are different benefits both to PROMs which measure status at a point in time, and PROMs which measure change directly.

View this article's peer review reports

Development of a standard set of PROs and generic PROMs for Dutch medical specialist care

Article Open access 09 February 2023

A philosophical perspective on the development and application of patient-reported outcomes measures (PROMs)

Article 17 October 2021

General practitioners’ views on use of patient reported outcome measures in primary care: a cross-sectional survey and qualitative study

Article Open access 24 January 2020

Background

Introduction

Patient-reported outcome measures (PROMs) are self-report questionnaires designed to capture information on patients’ health. An ‘outcome’ is change in health status, knowledge or behaviour which is attributable to preceding healthcare [1], and PROMs provide important evidence about this change as it is experienced by the patient [2]. There are thousands of PROMs in existence, with new PROMs being developed every day [3]. Experts in the field have called for harmonisation in this area [4], including research into existing instruments before development of new ones [5].

PROMs were originally developed to aid in evaluating and comparing the effectiveness of healthcare interventions [3]. By comparing patients’ PROM scores before and after an intervention, the outcome of the intervention can be assessed. Numerous primary care interventions have been developed in recent years to meet changing population and service needs (including an aging population and increasing numbers of people with multi-morbidity [6, 7]). While there are a number of disease and problem-specific PROMs which can be used in primary care, many primary care interventions are targeted at people with a range of conditions or problems. Examples include electronic consultations [8], health coaching and behavioural change therapies [9, 10], and new approaches to address the needs of frequent attenders in general practice [11, 12]. Assessing the effectiveness of such interventions from a patient perspective requires a generic PROM, which can be administered across a population, regardless of presenting problem. Such a PROM should cover multi-layered outcomes encompassing aspects of enablement, resilience, symptoms and function, and health perceptions.

We conjectured that there was no such suitable PROM for primary care and undertook to investigate this through a review of the literature. We identified existing structured reviews of PROMs on related topics: for example PROMs for mental health [13], empowerment [14, 15], integrative medicine [16], patient experience [17] patient safety in primary care [18] and generic health status in older people [19]. However, an initial search of the literature found there was no structured review for generic PROMs specific to the measurement of outcome across all primary care patients.

We firstly carried out a qualitative study to delineate the domains which should be captured by a Primary Care PROM [20]. We then conducted a systematic review of PROMs suitable for primary care, which captured these domains.

Prior qualitative study

In our prior Qualitative study, we identified and categorised inter-related outcomes into ten groups occupying three domains:

1.
Health Status: This involves both symptoms and medication side-effects and the impact of symptoms on patients’ lives.
2.
Health Empowerment: These are the internal and external resources which enable patients to improve their health. The internal resources include an understanding of health conditions, and an ability to self-care, stay healthy and follow a clinician-patient agreed plan. The external resources include patients’ confidence in seeking healthcare, and ability to access suitable health-related supports. Although these external aspects are closely related to the patient experience of the consultation, they are the enduring impacts of the consultation that have a direct influence on patient’s overall health status and are qualitatively different from measures of patient experience [20].
3.
Health Perceptions: This involves health concerns and satisfaction, and confidence in their health for the future.

This study reports on a systematic review of PROMs suitable for use in primary care to measure these outcomes.

Methods

Search strategy

We designed a customised search strategy for Medline Ovid SP, following PRISMA guidelines where appropriate [21]. This was peer-reviewed by a University of Bristol librarian and included indexed papers from 1950 to 8th March 2014. The PICO framework normally used in systematic reviews (population, intervention, comparator and outcomes) [22] was adapted in order to identify primary care PROMs; the framework used was: Population, Aim, Subject and Construct (PASC). Four filters were developed using these PASC categories combined with an AND operator. The population filter was designed to retrieve papers relevant to primary care; the aim and subject filters combined were designed to retrieve papers describing development and validation of PROMs; and the construct filter was aimed at the domains of interest (health status, health empowerment, and health perceptions). Search terms for each category were developed through an iterative process of adjusting filters and performing test searches. A full description of the four filters is shown in Additional file 1.

We recognised that limiting the search to the Medline database meant some relevant papers may have been missed. We therefore followed-up all PROMs referred to in screened abstracts, contacted eighteen experts in the field, hand-searched three compilations of PROMs (McDowell [23], Bowling [24], and PROQOLID [25]) and screened all abstracts on the Oxford University PROMs group database using the keywords “individualised”, “generic”, “utility” or “primary care”. (This a database of papers relating to patient-reported outcome measures, which contains references to more than 14,000 papers, last updated in 2005) [26]. A backwards reference search was carried out on all twenty original papers included in the final review, and a forward reference search for sixteen of the twenty original papers. (The four exceptions were those which had been cited more than 600 times).

Selection of PROMs

All abstracts identified in the electronic searches were screened. During this process, any PROMs named in the abstract were listed, apart from those which did not meet the inclusion criteria.

For each of the PROMs in the long list, a copy of the instrument was obtained from either a PROMs compilation [23, 24, 26, 27] or the initial development paper. Selection was based on inclusion and exclusion criteria on the PROMs, as opposed to the papers (see Table 1). To ensure decisions were made consistently, reasons for exclusion were documented against each PROM. (See Additional file 2).

Table 1 Inclusion and exclusion criteria for long-listed PROMs

Full size table

Abstract screening and data extraction was done by a single reviewer (MM). The other two reviewers (CS/SH) independently checked the extracted data (shown in Figs. 1, 2 and 3), and reviewed the long-list of PROMs excluded from the review. (Additional file 2).

Data extraction

The selected PROMs were described in a tabular format (see Figs. 1 to 3). Data was extracted on their measurement properties, construct and psychometric properties.

The measurement properties extracted were adapted from an existing PROM classification framework [25], and included the number of items, the nature of the scale, the recall period, the level of PROM adaptability and the dimensionality.

The construct categories extracted were based on the prior qualitative study [20]. Where a construct was explicitly covered, this was block-highlighted in the tabular description. Where it was implicit (for example, an individualised questionnaire which asks about symptoms covers pain, but not explicitly) it was shaded.

The review of psychometric properties was limited to the extent of psychometric testing, and the level of responsiveness. PROMs were categorised as having an extensive, moderate or low extent of psychometric testing, depending on the number of validation studies published, whether the authorship of these papers extended beyond the original authors, and on number of times the original development paper has been cited. (see additional file 3 for details). With the exception of responsiveness, we chose not to provide categorical ratings for individual psychometric properties (for example, validity, reliability, interpretability) because these are actually properties of PROMs as administered in a particular population, not of property of a PROM in and of itself [28]. Although some structured reviews of PROMs have provided such categorical ratings[16], leading textbook compilations appraise these properties descriptively [23, 27] and we also took this approach. We made an exception for responsiveness, because the current study was based on the hypothesis that there is no suitably responsive PROM for testing interventions in primary care, so it was necessary to test this hypothesis. We categorised responsiveness as: unknown (responsiveness not known or tested); low (responsiveness shown in at least one study); medium (repeated evidence for responsiveness, including in primary care); high (responsiveness shown in primary care studies where other leading PROMs are not responsive).

PROM compilations [23, 24, 26, 27] were reviewed to extract psychometric information relating to the most common PROMs (e.g. the SF-36). Where the PROM was not listed in a complication, a forward reference search on the original PROM development paper was used.

A fuller description of the data extraction sheet is shown in Additional file 3.

Results

Search and selection of PROMs

Figure 4 shows the number of papers screened and PROMs identified. Many PROMs were excluded at short-listing stage because of the construct or the population. For example, the Sickness Impact Profile was excluded because it is most suitable for very ill populations [27]. Although several preference-based instruments were long-listed, only the EQ-5D and SF-36/SF-12 were included in the final review. ICECAP [29] and AQoL-4D [30] were excluded because of their focus on general quality of life, which was not part of the construct under consideration. The Health Utilities Index was excluded because of its deliberately narrow focus on specific aspects of function [27].

The twenty selected PROMs are described in the following three sections by domain: Health Status, Health Empowerment and Health Perceptions. Where a PROM covered more than one domain, it is presented under the domain with which it has most overlap. A referenced list of the twenty PROMs can be found in Additional file 4.

Health status PROMs

Construct and measurement properties

Ten instruments which measure some form of health status were included. As shown in Fig. 1, nine of the ten instruments contain a standard list of questions. One of the instruments (MYMOP) is individualised, such that patients define the outcomes of interest in their own words [31]. Three of the PROMs contain transitional items: the CMP, PPQ [32] and ORIDL [33].

Five of the instruments result in a profile of scores. The most widely used and well-validated of these is the SF-36, which measures physical and emotional function [34, 35]. The second profile PROM (the SF-12) was designed as a short version of the SF-36 [36] and was validated based on assessing how well the twelve-item scale scores predicted the 36-item scale scores [36]. The third most commonly cited profile instrument listed is the Dartmouth COOP charts. These were designed as a rapid way to assess functional health routinely in clinical practice [37] and consist of a set of nine charts, each with a title, a question, and a five-point pictorial response scale. The fourth profile reviewed is the transitional ORIDL. This consists of two scores designed to be comparable across different people and diseases [33]. The last profile is the Complementary and Integrative Medicine Outcome Scales (CIMOS) [38]. This instrument was developed to measure the outcomes typically experienced by people receiving complementary and alternative medicine. Four of the instruments generate index (but not utility) scores. These are the CMP, howRU, MYMOP and PPQ. Three instruments generate utility scores: the, SF-36, SF-12 and the EQ-5D [39,40,41] (which is recommended by NICE as the tool of choice for economic evaluation [42]).

Psychometric properties

The SF-36, SF-12 and the EQ-5D have been extensively used and tested, with the original papers describing each cited approximately 21,000, 9000 and 3000 times respectively. The COOP Charts, the CMP and MYMOP have undergone moderate levels of psychometric testing. The original PPQ French version has also had moderate testing, but testing of the English version has been more limited. The remaining three instruments have had limited testing.

The most responsive to change are the individualised instrument (MYMOP), and the three transitional instruments (PPQ, CMP and ORIDL). MYMOP shows change when the SF-36 does not [31, 43, 44]. Various formats of the CMP have been used in primary care trials [45,46,47]. However, as a single-item, this has lower reliability than multi-item instruments [28]. ORIDL is also transitional. Although limited testing has been done of ORIDL, in the initial validation study, it showed good correlation with MYMOP and PEI [33]. One study showed that ORIDL continued to show change on repeated follow-up when MYMOP did not [48].

The SF-36, SF-12, EQ-5D and the COOP charts all have a medium level of responsiveness. However, there is considerable variability within this. The most responsive of these instruments is the SF-36 profile scores [27]. The COOP charts show good reliability and validity in primary care populations. Unsurprisingly, given that each profile score is based on a single item, they are less reliable than the SF-36, have a ceiling effect, and are less responsive to change over time [27]. In terms of preference-based values, testing has showed that the recent SF-12 value set is just as responsive to change and generates similar estimates to the SF-36 preference-based index [49]. A study of patients with depression also showed the SF-12 utility scores to be more responsive than the EQ-5D-3 L which suffers from ceiling effects in general [27].

HowRU [50], has not been tested for responsiveness. However, it has shown less of a ceiling effect than EQ-5D-3 L, despite being even shorter [51]. This may be because of wording e.g. “low or worried” (howRU) rather than “anxious or depressed” (EQ-5D). Responsiveness is also unknown in CIMOS, which has undergone very little psychometric testing, demonstrating acceptable levels of reliability and validity in only one study [52].

Summary

The SF-36 is, by far, the most validated of the health status instruments and the most responsive of the moderately and extensively tested PROMs. MYMOP and ORIDL both represent good attempts to increase responsiveness, but the ORIDL detailed nine-point scale has not been widely validated, and, although it has been used in trials [47, 53], MYMOP is not recommended for self-completion, which is necessary for routine or trial use [31]. EQ-5D and SF-12 have the benefits of brevity, and the ability to generate a preference-based score. HowRU shows that it is possible to have a valid instrument that is very short. Haddad’s transitional scale provides another good option for increasing responsiveness while maintaining a standard list of items.

Health empowerment

Construct and measurement properties

Figure 2 shows the seven instruments which cover both internal and external aspects of the Health Empowerment construct.

The constructs measured by these seven instruments all include internal aspects of empowerment, with explicit items on understanding of health problems, and the ability to self-care, or stay healthy. External aspects of empowerment are less extensively covered, perhaps because these are traditionally seen as measures of patient experience, not outcome [54], and because we did not include measures which exclusively capture patient experience in our review. None of the instruments directly address symptoms. The three most widely used are PAM-13, PEI and the heiQ. PAM-13 is based on a single construct of activation: which is being engaged in managing one’s own health. Patients are measured on a four-stage Guttman scale of activation: from belief that an active role is important, to taking action and staying the course under stress [55]. The heiQ was developed to assess the impact of patient education programs across a broad range of chronic conditions [56]. It has a wider construct than the PAM-13 and contains eight independent dimensions: positive and active engagement in life, health directed behaviour; constructive attitudes and approaches; self-monitoring and insight; health service navigation; social integration and support and emotional wellbeing [56, 57]. These domains overlap with both internal and external empowerment, and also with the other two domains. For example, “positive and active engagement in life” overlaps with Health Status. The instrument also includes aspects of Health Perceptions, including “satisfaction with health”, and “health concerns”. PEI was developed specifically for primary care, and asks the patient to retrospectively rate change in enablement, resulting from a single consultation. As well as understanding and self-care, it addresses concerns, and indirectly addresses the impact of symptoms (through questions on coping with illness, and coping with life) [58].

The four remaining instruments are less widely used. EC-17 was developed to measure the skills and attributes of an effective consumer, for use in self-management interventions [59, 60]. PE-LTCs was developed to measure empowerment in long-term conditions [61]. The Barriers instrument does not purport to assess empowerment, rather barriers to self-management in long-term conditions [62]. However, the construct of “barriers” is related to empowerment, in that reducing barriers increases empowerment. CAM-3 measures the quality of the therapeutic relationship as: 1) patient-centred care, 2) perceived provider support 3) empowerment. While this is described as an experience measure, it focuses on the consequences of a positive experience, for example, trust in the therapist and belief that the root causes are being identified and treated. In measuring patient-centred care and perceived provider support as well as empowerment, it includes some external aspects of empowerment in addition to internal [63].

All seven instruments contain a standard list of questions, asking about today, or a person’s perception of their current self. Two of the instruments are, at least partly, transitional. The status instruments are based on a list of belief statements with a Likert (bipolar) response scale (e.g. strongly disagree to strongly agree) apart from the EC-17, which has behavioural statements with an adjectival scale (never to always). Four of the instruments provide a single index score, and the remaining three give a profile of scores. All instruments are scored using a summative method, apart from PAM-13, which uses a Rasch scoring algorithm [55].

Psychometric properties

The first three of the instruments (PAM-13, PEI and heiQ) have undergone moderate levels of testing. PEI has been used widely in UK general practice and has shown acceptable psychometric properties. As a transitional questionnaire, it measures change directly, and is thus responsive.

The properties of the heiQ were investigated using item response theory and structural equation modelling. It has demonstrated good construct validity, including, most recently, testing for measurement invariance [64]. Some of the heiQ sub-scores have shown responsiveness to change in randomised controlled trials [65,66,67].

PAM-13 has strong psychometric properties, and association with a number of other health outcomes [68]. Recent studies in the US found patient activation was influenced by community interventions [69, 70], suggesting it may be appropriate as an outcome measure in primary care.

The EQ-17 has shown some preliminary evidence for responsiveness to change in arthritis patients [60, 71], although psychometric testing has been more limited. However, the authors of the instrument acknowledge that, while some skills of an effective consumer can be learned, others are a “part of personality” and not amenable to change [59]. (pg. 1932) When compared directly to PAM-13 it was less responsive (standardised response mean 0.25/ 0.41).

Summary

All the health empowerment instruments reviewed could, in theory, be used to measure empowerment outcomes in primary care. However, as all except PEI and CAM-3 were developed with long-term conditions in mind, they are less suitable for people without long-term conditions. This is most problematic with EC-17, PE-LTCs and the Barriers questionnaire, which all refer to a “disease”. The first three instruments (PAM-13, PEI and heiQ), which are the most robust and responsive to change, make minimal reference to “your illness” or else refer to “health problems” in general. Of these three, only the PEI was developed specifically for primary care. The main weakness of this is that it only works at a single consultation level, through the words “as a result of your consultation today”. A format of the PEI which asks patients to rate a longer episode of care has been tested for acupuncture. This adjusted the wording to “as a result of visiting the acupuncturist over the last few weeks or months.” However, patients had difficulty attributing change directly to the intervention [72]. PAM-13, is more robust, but the construct is relatively narrow: its emphasis is on the internal, and it contains elements about control and responsibility which are not present in the construct described in Chapter 3. The heiQ has the widest construct. The main weakness of the heiQ for use in primary care is its length, and the fact that it does not explicitly address symptoms, which, for some patients, may be the primary reason for attendance.

Health perceptions

Construct and measurement properties

Three instruments which predominantly cover a Health Perceptions construct were identified (Fig. 3). The single self-rated health status item is based on empirical evidence that people possess insights into their own health, and that this can be captured through a single rating of how they perceive their health at a point in time [27]. Self-rated health status items ask for a general impression of health, rather than symptoms, function or health problems, and thus capture a Health Perceptions construct, specifically the “satisfaction with health” outcome. The HPQ is an important extension of single items, which covers six domains: prior health, current health, health outlook, resistance/susceptibility to illness, health worry/concern and sickness orientation. The developers of the HPQ contended that this subjective concept has as much to do with a person’s feelings and beliefs as their actual health status [73]. The IPQ is based on a model of the cognitive representation of illness measured by eight domains: Consequences, Timeline, Personal control, Treatment control, Identity, Concern, Understanding, Emotional response [74].

All three instruments contain a standard list of questions, asking about a person’s perception of their current self. All are status questions, although the self-rated health status item can also be asked as a transitional question. The HPQ can be scored as a profile, comprised of six sub-scores, or an overall index can be created from 22 of the 33 items. The IPQ can be reported as a profile (the responses to the nine questions) or a summative index.

Psychometric properties

The self-rated health status item is quick to administer. It consistently predicts long-term outcomes such as mortality. This suggests that it reflects health trajectory, and not merely current health status [27], which may make it less suitable for use over the short to medium term. A body of research has shown that general questions, self-rated health included, are answered less reliably than specific questions [75]. Single items are, by necessity, less precise than multiple items, and less responsive to change [76]. The HPQ has shown good reliability and validity. The high stability over time suggests that it may be more useful as a personality indicator than an outcome measure that is responsive to change [27]. The IPQ is most suitable for relatively ill populations [74]. The IPQ has shown good reliability and validity in populations with illness, but has generally been used as a variable which is associated with various outcome measures, rather than a measure in its own right [77]. Responsiveness to change in primary care has never been tested. One paper has shown the measure to be responsive to change in secondary care [78] and the authors suggest that the role of medical interventions in shifting illness perceptions is an important and under-researched area [77].

Summary

The three health perceptions instruments are among the least responsive in this review. The constructs captured by them are similar to the Health Perceptions construct which arose in the prior qualitative study but also has some important differences, in that the items capture more general perceptions about health, which are less likely to be shifted by intervention.

Discussion

Key findings

As far as we are aware, this is the first systematic review of its kind for generic PROMs for primary care. This review identified PROMs that are potentially suitable for measuring a wide-range of outcomes in any adult primary care patient, and twenty of these were reviewed in detail. The two key findings of the review were that: to date, there is no instrument which comprehensively covers the outcomes commonly sought in primary care; and there are different benefits both to PROMs which measure status at a point in time, and PROMs which measure change directly. These findings will inform researchers selecting or developing PROMs to test interventions in primary care, and should have a subsequent effect on policy and patient care, as the conclusions of clinical research studies which inform healthcare policy depend on the PROM selected as an endpoint. Confirming that a gap exists by reviewing existing PROMs is a necessary first step in any PROM development [5]. Our findings provided sufficient grounds for proceeding with development of a PROM for primary care; and, following completion of this review, we developed and tested the Primary Care Outcome Questionnaire and made it publicly available [79].

Strengths and limitations

Strengths of this review include a reproducible search strategy, developed in collaboration with a librarian, a set of clear inclusion and exclusion criteria, and publication of the longlist of excluded PROMs for transparency.

The search strategy successfully identified the twelve papers used in the iterative process of developing and testing. However, all systematic reviews have the potential of omitting relevant articles, including unpublished material. Our exclusion criteria omitted PROMs which captured a narrow construct, such as pain, fatigue, or anxiety. This limited the scope of the review to generic measures, and meant that modular measures, such as PROMIS [80], were excluded. The electronic search was limited to the Medline database, and one of the filters relied on keywords assigned by authors. Some papers describing the development and validation of PROMs for primary care may not have been picked up by this strategy. However, although the use of keywords in a filter is not usually recommended in systematic reviews [81], it can be justified when the unit of analysis is the PROM, not the paper, because all PROMs alluded to in abstracts were followed up, which meant that even if the original development paper of a PROM was not identified by the search strategy, it could be identified through other means. Moreover, the search was supplemented by forward and backward reference searching, review of the Oxford PROMs Bibliography, and consultation with experts. Lastly, the decision on which PROMs to include and exclude at abstract screening stage, and any data extraction was carried out by a single researcher. Independent screening and extraction would have helped to identify and minimise error. We have attempted to be as transparent as possible by describing inclusion and exclusion criteria and by publishing a longlist of all potential PROMs identified in Additional file 2.

Comparison with the literature

Systematic reviews of health status measurement instruments are often poorly conducted with many studies having a poorly described search strategy, using only a single database and failing to report whether data extraction was done by two independent reviewers [82].

We followed PRIMSA and COSMIN guidelines in this review, diverging from these only where there were clear reasons. For example, we did not use the existing search strategy for measurement properties published by the COSMIN group [83], because it was highly sensitive (therefore over-inclusive) and did not fit the purposes of this review. The approach we took had much in common with other systematic reviews of PROMs on related topics (such as empowerment [14, 15], integrative medicine [16], and patient experience) [17]. For example, in their review of patient experience measures, Hudon et al. similarly relied on keywords for one filter, and took a similar approach to mapping the constructs captured by the instruments reviewed onto their defined construct of “patient-centred care”, excluding instruments which measured only a narrow part of this construct [17].

This review identified various benefits and downsides to standardised, transitional and individualised PROMs respectively. Standardised PROMs are most successful in terms of their psychometric properties. Item wording and selection of a scale for standardised strongly affect interpretation. The review contained five PROMs with transitional questions. As anticipated [84], these were more responsive than the status measures. Only one individualised PROM, MYMOP was included in the review. As with other individualised PROMs [85,86,87,88] this is recommended for completion only through interview.

None of the instruments reviewed provided coverage of all outcome groups identified in the prior qualitative study. Of the Health Status questionnaires, the most extensively tested is the SF-36. This is also the most responsive of the standardised status instruments reviewed. Individualised and transitional questionnaires show greater responsiveness to change. Of the Health Empowerment instruments, the PEI, PAM and heiQ provide the best combination of responsiveness to change and psychometric testing. The Health Perceptions instruments reviewed are all less responsive to change, and may measure a form of health perception which is difficult to shift in primary care.

Conclusions

This systematic review is the first of its kind to identify papers describing the development and validation of generic PROMs suitable for measuring outcomes from primary care. It identified that, to date, there is no instrument which comprehensively covers the outcomes primary care patients seeks and primary care clinicians seek to deliver, and thus provided grounds for proceeding with development of a PROM for primary care. It also provides a reusable search strategy for the identification of papers describing primary care PROMs. Finally, it presents information on a range of instruments which measure health status, health empowerment and health perceptions, and a critique of their strengths and limitations for use in primary care, which will be of benefit to other researchers in this field.

Abbreviations

CAM-3:: Three scales for Complementary and Alternative Medicine
CIMOS:: Complementary and Integrative Medical Outcomes Scale
CMP:: Change in Main Problem
EC-17:: Effective Consumer Scale – 17 items
EQ-5D:: European Quality of Life-5 Dimensions
heiQ:: Health Education Impact Questionnaire
HPQ:: Health Perceptions Questionnaire
ICECAP:: ICEpop CAPability measure
IPQ:: Illness Perceptions Questionnaire
MYMOP:: Measure Yourself Medical Outcomes Profile
NICE:: National Institute for Health and Care Excellence
ORIDL:: Outcome in Relation to Impact on Daily Living
PAM:: Patient Activation Measure
PEI:: Patient Enablement Instrument
PE-LTCs:: Patient Empowerment in Long-Term Conditions
PPQ:: Patient Perception of Quality
PRISMA:: Preferred Reporting Items for Systematic Reviews and Meta-Analyses
PROM:: Patient-reported outcome measure
SF-12:: Short Form 12
SF-36:: Short Form 36
SRHS:: Self-Rated Health Score

References

Donabedian A. The quality of care. How can it be assessed? J Am Med Assoc. 1988;260(12):1743–8.
Article CAS Google Scholar
Fitzpatrick R. Patient-reported outcomes and performance measurement. In: Performance Measurement for Health System Improvement: Experiences, Challenges and Prospect, smith P, et al. Cambridge: Cambridge University Press; 2009. p. 63–86.
Google Scholar
Black N, et al. Patient-reported outcomes: pathways to better health, better services, and better societies. Qual Life Res. 2016;25(5):1103–12.
Article CAS PubMed Google Scholar
Center for Drug Evaluation and Research (CDER), Guidance for Industry Patient-Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims, US Food and Drug Administration. 2009: http://www.fda.gov/downloads/Drugs/.../Guidances/UCM193282.pdf.
Google Scholar
Garratt A, et al. Quality of life measurement: bibliographic study of patient assessed health outcome measures. Br Med J. 2002;324(7351):1417.
Article Google Scholar
Barnett K, et al. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study. Lancet. 2012;380(9836):37–43.
Article PubMed Google Scholar
Salisbury C, et al. Epidemiology and impact of multimorbidity in primary care: a retrospective cohort study. Br J Gen Pract. 2011;61(582):e12–21.
Article PubMed Google Scholar
Olayiwola JN, et al. Electronic consultations to improve the primary care-specialty care Interface for cardiology in the medically underserved: a cluster-randomized controlled trial. Ann Fam Med. 2016;14(2):133–40.
Article PubMed PubMed Central Google Scholar
Thom DH, et al. A qualitative study of how health coaches support patients in making health-related decisions and behavioral changes. The Annals of Family Medicine. 2016;14(6):509–16.
Article PubMed Google Scholar
Sharma AE, et al. What happens after health coaching? Observational study 1 year following a randomized controlled trial. Ann Fam Med. 2016;14(3):200–7.
Article PubMed PubMed Central Google Scholar
Hudon C, et al. Case Management in Primary Care for frequent users of health care services with chronic diseases: a qualitative study of patient and family experience. Ann Fam Med. 2015;13(6):523–8.
Article PubMed PubMed Central Google Scholar
Barnes R. ISRCTN registry: Footprints in Primary Care. study registered 2015. https://doi.org/10.1186/ISRCTN62939408, http://www.isrctn.com/ISRCTN62939408.
Fitzpatrick R, Garratt A, Schmidt L. Instruments for mental health: a review., report from the patient-reported health instruments group to the Department of Health. 2000.
Google Scholar
Herbert RJ, et al. A systematic review of questionnaires measuring health-related empowerment. Research & Theory for Nursing Practice. 2009;23(2):107–32.
Article Google Scholar
Hudon C, et al. Assessing enablement in clinical practice: a systematic review of available instruments. J Eval Clin Pract. 2010;16(6):1301–8.
Article PubMed PubMed Central Google Scholar
Hunter J, Leeder S. Patient questionnaires for use in the integrative medicine primary care setting—a systematic literature review. European Journal of Integrative Medicine. 2013;5(3):194–216.
Article Google Scholar
Hudon C, et al. Measuring patients’ perceptions of patient-centered care: a systematic review of tools for family medicine. Ann Fam Med. 2011;9(2):155–64.
Article PubMed PubMed Central Google Scholar
Ricci-Cabello I, et al. Measuring experiences and outcomes of patient safety in primary care: a systematic review of available instruments. Fam Pract. 2015;32(1):106–19.
Article PubMed Google Scholar
Haywood KL, Garratt AM, Fitzpatrick R. Quality of life in older people: a structured review of self-assessed health instruments. Expert Review of Pharmacoeconomics & Outcomes Research. 2006;6(2):181–94.
Article Google Scholar
Murphy M, et al. Patient and practitioners’ views on the most important outcomes arising from primary care consultations: a qualitative study. BMC Fam Pract. 2015;16:108.
Article PubMed PubMed Central Google Scholar
Liberati A, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. Br Med J. 2009;339:b2700.
Article Google Scholar
Centre for Reviews and Dissemination, University of York. York: CRD's guide for undertaking reviews in health care; 2009.
McDowell I. Measuring Health. 2 ed. 2006, New York: Oxford University Press.
Bowling A. Measuring Health: A review of quality of life measurement scales. 3 ed. Vol. 1. Maidenhead: Open University Press; 2004.
Google Scholar
MAPI Research Trust. PROQOLID. 2014 [cited 2014 17/10/2014]; Available from: https://eprovide.mapi-trust.org/about/about-proqolid.
Valderas JM, Alonso J. Patient reported outcome measures: a model-based classification system for research and clinical practice. Qual Life Res. 2008;17(9):1125–35.
Article PubMed Google Scholar
University of Oxford. Oxford PROMs bibliography. 2005 [cited 2014 17/10/2014]; Available from: http://phi.uhce.ox.ac.uk/instruments.php.
Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use. New York: Oxford University Press; 2008.
Book Google Scholar
Al-Janabi H, Flynn TN, Coast J. Development of a self-report measure of capability wellbeing for adults: the ICECAP-A. Qual Life Res. 2012;21(1):167–76.
Article PubMed Google Scholar
Richardson JR, et al. Construction of the descriptive system for the assessment of quality of life AQoL-6D utility instrument. Health Qual Life Outcomes. 2012;10:38.
Article PubMed PubMed Central Google Scholar
Paterson C. University of Bristol website, PHC section, MYMOP. 2012 [cited 2014 25/04/2014]; Available from: http://www.bristol.ac.uk/primaryhealthcare/resources/mymop/strengthsandweaknesses/.
Haddad S, et al. Patient perception of quality following a visit to a doctor in a primary care unit. Fam Pract. 2000;17(1):21–9.
Article CAS PubMed Google Scholar
Reilly D, et al. Outcome related to impact on daily living: preliminary validation of the ORIDL instrument. BMC Health Serv Res. 2007;7:139.
Article PubMed PubMed Central Google Scholar
Ware, J.E., Jr. And C.D. Sherbourne, The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care, 1992. 30(6): p. 473–483.
Article PubMed Google Scholar
Ware J, et al. In: Metric Q, editor. Chapter 1, in User's Manual for the SF-36v2 Health Survey. Lincoln: Quality Metric; 2007.
Ware J Jr, Kosinski M, Keller SD. A 12-Item Short-Form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care. 1996;34(3):220–33.
Article PubMed Google Scholar
Nelson EC, et al. The functional status of patients. How can it be measured in physicians’ offices? Med Care. 1990;28(12):1111–26.
Article CAS PubMed Google Scholar
Eton DT, et al. Developing a self-report outcome measure for complementary and alternative medicine. Explore (NY). 2005;1(3):177–85.
Article Google Scholar
Brooks R. EuroQol: the current state of play. Health Policy. 1996;37(1):53–72.
Article CAS PubMed Google Scholar
Herdman M, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20(10):1727–36.
Article CAS PubMed PubMed Central Google Scholar
Devlin NJ, Parkin D, Browne J. Patient-reported outcome measures in the NHS: new methods for analysing and reporting EQ-5D data. Health Econ. 2010;19(8):886–905.
Article PubMed Google Scholar
National Institute of Clinical Excellence Guide to the methods of technology appraisal 2013. 2013.
Google Scholar
Paterson C. Measuring outcomes in primary care: a patient generated measure, MYMOP, compared with the SF-36 health survey. Br Med J. 1996;312(7037):1016–20.
Article CAS Google Scholar
Mirza S, et al. Comparing sensitivity to change of two patient-reported outcome measures in a randomised trial of patients referred for physiotherapy services. Trials. 2013;14(Suppl 1):O50. https://doi.org/10.1186/1745-6215-14-S1-O50.
Campbell JL, et al. Telephone triage for management of same-day consultation requests in general practice (the ESTEEM trial): a cluster-randomised controlled trial and cost-consequence analysis. Lancet. 2014;384(9957):1859–68.
Article PubMed Google Scholar
Kamper SJ, Maher CG, Mackay G. Global rating of change scales: a review of strengths and weaknesses and considerations for design. The Journal of Manual & Manipulative Therapy. 2009;17(3):163–70.
Article Google Scholar
Salisbury C, et al. Effectiveness of PhysioDirect telephone assessment and advice services for patients with musculoskeletal problems: pragmatic randomised controlled trial. Br Med J. 2013;346(jan29 3):f43.
Article Google Scholar
Higgins M, et al. Evaluation report of wellness enhancement Learning,Piloted for people with CFS/ME. 2009, TheWEL Programme: www.thewel.org.
Brazier JE, Roberts J. The estimation of a preference-based measure of health from the SF-12. Med Care. 2004;42(9):851–9.
Article PubMed Google Scholar
Benson T, et al. Evaluation of a new short generic measure of health status: howRu. Informatics in Primary Care. 2011;18:89–101.
Google Scholar
Benson T, et al. Comparison of howRU and EQ-5D measures of health-related quality of life in an outpatient clinic. Informatics in Primary Care. 2013;21(1):12–7.
Article PubMed Google Scholar
Eton DT, Temple LM, Koffler K. Pilot validation of a self-report outcome measure of complementary and alternative medicine. Explore: The Journal of Science & Healing. 2007;3(6):592–9.
Article Google Scholar
Flower A, Lewith GT, Little P. A feasibility study exploring the role of Chinese herbal medicine in the treatment of endometriosis. J Altern Complement Med. 2011;17(8):691–9.
Article PubMed Google Scholar
Ahmed S, et al. The use of patient-reported outcomes (PRO) within comparative effectiveness research: implications for clinical practice and health care policy. Med Care. 2012;50(12):1060–70.
Article PubMed Google Scholar
Hibbard JH, et al. Development and testing of a short form of the patient activation measure. Health Serv Res. 2005;40(6 Pt 1):1918–30.
Article PubMed PubMed Central Google Scholar
Osborne RH, Elsworth GR, Whitfield K. The health education impact questionnaire (heiQ): an outcomes and evaluation measure for patient education and self-management interventions for people with chronic conditions. Patient Education & Counseling. 2007;66(2):192–201.
Article Google Scholar
Elsworth GR, Nolte S, Osborne RH. Factor structure and measurement invariance of the health education impact questionnaire: does the subjectivity of the response perspective threaten the contextual validity of inferences? SAGE Open Medicine. 2015;3:1–13.
Article Google Scholar
Howie JG, et al. A comparison of a patient enablement instrument (PEI) against two established satisfaction scales as an outcome measure of primary care consultations. Fam Pract. 1998;15(2):165–71.
Article CAS PubMed Google Scholar
Kristjansson E, et al. Development of the effective musculoskeletal consumer scale. J Rheumatol. 2007;34(6):1392–400.
PubMed Google Scholar
Santesso N, et al. Responsiveness of the effective consumer scale (EC-17). J Rheumatol. 2009;36(9):2087–91.
Article PubMed Google Scholar
Small N, et al. Patient empowerment in long-term conditions: development and preliminary testing of a new measure. BMC Health Serv Res. 2013;13:263.
Article PubMed PubMed Central Google Scholar
Bayliss EA, Ellis JL, Steiner JF. Barriers to self-management and quality-of-life outcomes in seniors with multimorbidities. Ann Fam Med. 2007;5(5):395–402.
Article PubMed PubMed Central Google Scholar
Bann CM, Sirois FM, Walsh EG. Provider support in complementary and alternative medicine: exploring the role of patient empowerment. Journal of Alternative & Complementary Medicine. 2010;16(7):745–52.
Article Google Scholar
Nolte S, et al. Tests of measurement invariance failed to support the application of the “then-test”. J Clin Epidemiol. 2009;62(11):1173–80.
Article PubMed Google Scholar
Cadilhac DA, et al. A phase II multicentered, single-blind, randomized, controlled trial of the stroke self-management program. Stroke. 2011;42(6):1673–9.
Article PubMed Google Scholar
Francis KL, et al. Effectiveness of a community-based osteoporosis education and self-management course: a wait list controlled trial. Osteoporos Int. 2009;20(9):1563–70.
Article CAS PubMed Google Scholar
Stone GR, Packer TL. Evaluation of a rural chronic disease self-management program. Rural Remote Health. 2010;10(1):1203.
PubMed Google Scholar
Mosen DM, et al. Is patient activation associated with outcomes of care for adults with chronic conditions? Journal of Ambulatory Care Management. 2007;30(1):21–9.
Article PubMed Google Scholar
McDonald EM, et al. Improvements in health behaviors and health status among newly insured members of an innovative health access plan. J Community Health. 2013;38(2):301–9.
Article PubMed Google Scholar
Deen D, et al. Asking questions: the effect of a brief intervention in community health centers on patient activation. Patient Education and Counselling. 2011;84(2):257–60.
Article Google Scholar
Kennedy CA, et al. A prospective comparison of telemedicine versus in-person delivery of an interprofessional education program for adults with inflammatory arthritis. J Telemed Telecare. 2016;3:1–10.
Google Scholar
Paterson C. Measuring changes in self-concept: a qualitative evaluation of outcome questionnaires in people having acupuncture for their chronic health problems. BMC Complement Altern Med. 2006;6:7.
Article PubMed PubMed Central Google Scholar
Ware J. Scales for measuring general health perceptions. Health Serv Res. 1976;11:396–415.
PubMed PubMed Central Google Scholar
Broadbent E, et al. The brief illness perception questionnaire. J Psychosom Res. 2006;60(6):631–7.
Article PubMed Google Scholar
Herrmann D. Reporting current, past, and changed health status. What we know about distortion. Med Care. 1995;33(4 Suppl):AS89–94.
CAS PubMed Google Scholar
Bowling A. Just one question: if one question works, why ask several? J Epidemiol Community Health. 2005;59(5):342–5.
Article PubMed PubMed Central Google Scholar
Petrie KJ, Jago LA, Devcich DA. The role of illness perceptions in patients with medical conditions. Current Opinion in Psychiatry. 2007;20(2):163–7.
Article PubMed Google Scholar
Petrie KJ, et al. Changing illness perceptions after myocardial infarction: an early intervention randomized controlled trial. Psychosom Med. 2002;64(4):580–6.
Article PubMed Google Scholar
Murphy, M., S. Hollinghurst, and C. Salisbury. The primary care outcomes questionnaire. 2017 [cited 2017 22/05/2017].
National Institutes of Health. Patient reported outcomes measurement information system (PROMIS). 2016; Available from: http://www.healthmeasures.net/explore-measurement-systems/promis#2.
Google Scholar
Higgins, J. and S. Green, Cochrane handbook for systematic reviews of interventions version 5.1.0 The Cochrane Collaboration, Editor. updated March 2011.
Mokkink LB, et al. Evaluation of the methodological quality of systematic reviews of health status measurement instruments. Qual Life Res. 2009;18(3):313–33.
Article PubMed Google Scholar
Terwee CB, et al. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Qual Life Res. 2009;18(8):1115–23.
Article PubMed PubMed Central Google Scholar
Lloyd H, et al. Patient reports of the outcomes of treatment: a structured review of approaches. Health Qual Life Outcomes. 2014;12:5.
Article PubMed PubMed Central Google Scholar
O'Boyle CA, et al. Individual quality of life in patients undergoing hip replacement. Lancet. 1992;339(8801):1088–91.
Article PubMed Google Scholar
Ruta DA, et al. A new approach to the measurement of quality of life. The patient-generated index. Med Care. 1994;32(11):1109–26.
Article CAS PubMed Google Scholar
Patel KK, Veenstra DL, Patrick DL. A review of selected patient-generated outcome measures and their application in clinical trials. Value Health. 2003;6(5):595–603.
Article PubMed Google Scholar
MacDuff C, Russell EM. The problem of measuring change in individual health-related quality of life by postal questionnaire: use of the patient-generated index in a disabled population. Qual Life Res. 1998;7(8):761–9.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank Sarah Dawson for advice on developing systematic reviews for PROMs and Cath Borwick for reviewing the search strategy.

Funding

This study was funded by a capacity building grant from the NIHR School for Primary Care Research (SPCR). The Avon Primary Care Research Collaborative funded time to write the paper. Neither funding body had any role in the study design, data collection, analysis, or interpretation or in writing the manuscript.

The NIHR SPCR is a partnership between the Universities of Bristol, Cambridge, Keele, Manchester, Newcastle, Nottingham, Oxford, Southampton and University College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Availability of data and materials

The datasets generated during this study are reproducible from the search strategy provided as additional file 2. The Primary Care Outcomes Questionnaire, developed based on the constructs described in this study, is licenced by the University of Bristol, and available free for non-commercial purposes from: http://www.bris.ac.uk/primaryhealthcare/resources/pcoq/.

Author information

Authors and Affiliations

Population Health Sciences, Bristol Medical School, University of Bristol, Canynge Hall, 39 Whatley Road, Bristol, BS8 2PS, UK
Mairead Murphy, Sandra Hollinghurst & Chris Salisbury

Authors

Mairead Murphy
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Hollinghurst
View author publications
You can also search for this author in PubMed Google Scholar
Chris Salisbury
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MM, SH and CS designed the study. MM collected the data. MM carried out the data analysis, and CS/SH reviewed and validated it. MM drafted the manuscript. CS/SH reviewed and revised the manuscript. All authors approved the final manuscript.

Corresponding author

Correspondence to Mairead Murphy.

Ethics declarations

Ethical approval and consent to participate

Not applicable

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Medline Search Strategy. Reproducible Medline search strategy, including the contents of all four filters. (DOCX 21 kb)

Additional file 2:

Long-list of PROMs identified. Long-list of all PROMs identified through the first review of abstracts, as potentially meeting the inclusion criteria. The reasons for excluding these after the PROMs was reviewed are also documented. (DOCX 44 kb)

Additional file 3:

Data extraction sheet column headings description. Description of the column headings and possible categories for the data extraction sheet, used to create Figs. 1–3. (DOCX 16 kb)

Additional file 4:

List of instruments reviewed. Referenced list of the twenty instruments reviewed with a short description of each instrument. (DOCX 42 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Murphy, M., Hollinghurst, S. & Salisbury, C. Identification, description and appraisal of generic PROMs for primary care: a systematic review. BMC Fam Pract 19, 41 (2018). https://doi.org/10.1186/s12875-018-0722-9

Download citation

Received: 05 June 2017
Accepted: 23 February 2018
Published: 15 March 2018
DOI: https://doi.org/10.1186/s12875-018-0722-9

Identification, description and appraisal of generic PROMs for primary care: a systematic review

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Development of a standard set of PROs and generic PROMs for Dutch medical specialist care

A philosophical perspective on the development and application of patient-reported outcomes measures (PROMs)

General practitioners’ views on use of patient reported outcome measures in primary care: a cross-sectional survey and qualitative study

Background

Introduction

Prior qualitative study

Methods

Search strategy

Selection of PROMs

Data extraction

Results

Search and selection of PROMs

Health status PROMs

Construct and measurement properties

Psychometric properties

Summary

Health empowerment

Construct and measurement properties

Psychometric properties

Summary

Health perceptions

Construct and measurement properties

Psychometric properties

Summary

Discussion

Key findings

Strengths and limitations

Comparison with the literature

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1:

Additional file 2:

Additional file 3:

Additional file 4:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation