Inconsistent reporting of outcomes in clinical trials of patients with non-specific low back pain (NSLBP) hinders comparison of findings and the reliability of systematic reviews. A core outcome set (COS) can address this issue as it defines a minimum set of outcomes that should be reported in all clinical trials. In 1998, Deyo et al. recommended a standardized set of outcomes for LBP clinical research. The aim of this study was to update these recommendations by determining which outcome domains should be included in a COS for clinical trials in NSLBP.
An International Steering Committee established the methodology to develop this COS. The OMERACT Filter 2.0 framework was used to draw a list of potential core domains that were presented in a Delphi study. Researchers, care providers and patients were invited to participate in three Delphi rounds and were asked to judge which domains were core. A priori criteria for consensus were established before each round and were analysed together with arguments provided by panellists on importance, overlap, aggregation and/or addition of potential core domains. The Steering Committee discussed the final results and made final decisions.
A set of 280 experts was invited to participate in the Delphi; response rates in the three rounds were 52, 50 and 45 %. Of 41 potential core domains presented in the first round, 13 had sufficient support to be presented for rating in the third round. Overall consensus was reached for the inclusion of three domains in this COS: ‘physical functioning’, ‘pain intensity’ and ‘health-related quality of life’. Consensus on ‘physical functioning’ and ‘pain intensity’ was consistent across all stakeholders, ‘health-related quality of life’ was not supported by the patients, and all the other domains were not supported by two or more groups of stakeholders. Weighting all possible argumentations, the Steering Committee decided to include in the COS the three domains that reached overall consensus and the domain ‘number of deaths’.
The following outcome domains were included in this updated COS: ‘physical functioning’, ‘pain intensity’, ‘health-related quality of life’ and ‘number of deaths’. The next step for the development of this COS will be to determine which measurement instruments best measure these domains.
The Global Burden of Disease study has highlighted that low back pain (LBP) is the leading global contributor to years lived with disability and the sixth global contributor to disability-adjusted life years [1, 2]. The global prevalence of activity-limiting LBP was recently estimated to be approximately 39 % for lifetime prevalence and 18 % for point-prevalence . Only a small proportion of people experiencing LBP seek health care but these account for high costs that represent an important burden to society [4, 5]. The large majority of patients with LBP are labelled as having non-specific LBP (NSLBP) because no underlying pathology or cause can be found [6–8]. A wide range of health interventions exists for patients with NSLBP and related clinical trials are often summarized in systematic reviews [9, 10]. However, authors of these reviews report that outcomes are inconsistently measured and reported across trials [11–13]. This inconsistency may limit the comparison of findings among trials and hinder statistical pooling . In addition, inconsistent reporting can be due to selective reporting bias (e.g. reporting only favourable outcomes in a publication), which may strongly affect the conclusions of systematic reviews .
The development and use of core outcome sets (COS) for specific health conditions has been suggested to reduce inconsistency in outcomes measured and reported across clinical trials . A COS represents an agreed set of outcomes that should be measured and reported, as a minimum, in all clinical trials for specific health conditions . Such a set does not restrict measurement or the choice of the primary outcome, but mandates collection and reporting of the COS alongside the outcomes of interest . A COS thus creates a minimum standard of outcomes reported, reducing the risk of selective reporting bias and increasing the validity and statistical power of meta-analyses .
The recently launched Core Outcome Measures in Effectiveness Trials (COMET) initiative fosters methodological research and provides methodological guidance on the development of a COS . The expertise accumulated by the Outcome Measures in Rheumatology (OMERACT) initiative is also a fundamental guidance in COS development . A stepwise approach is suggested by both initiatives: first, the core outcome domains should be selected (i.e. ‘what’ to measure), and then the measurement instruments for each domain (i.e. ‘how’ to measure) [16, 19].
In the field of LBP, recommendations for standardized reporting of outcome measurement instruments in clinical studies were formulated at an expert panel discussion held at the 1997 International Forum on LBP in Primary Care (The Hague, The Netherlands) . Specific recommendations were made for five outcome domains (i.e. ‘pain symptoms’, ‘back-related function’, ‘generic well-being’, ‘disability social role’ and ‘satisfaction with care’) [20, 21]. A workshop discussion among LBP researchers during the 2012 International LBP Forum (Odense, Denmark) agreed on the need of updating the existing recommendations . This was motivated by recent advances in understanding of construct development and measurement properties that stress the need to explore whether relevant domains are missing and to critically appraise recommended instruments . Deyo et al.  proposed also a parsimonious set of six questions covering the five domains suggested for measurement in LBP clinical research. These questions were extracted from existing questionnaires and were proposed as the minimum to be used in a wide variety of settings, including routine clinical care . This brief set was labelled as ‘Core Outcome Measures Index’ (COMI) by other investigators who assessed its measurement properties and feasibility of implementation [23, 24]. However, it is out of the scope of this study to update the set of questions included in the COMI for LBP.
The aim of this study is to update the existing standardized set of outcome domains and measurement instruments recommended for LBP [20, 21], through the development of a COS. This COS is intended for the measurement of efficacy or effectiveness of health interventions assessed in all clinical trials for patients with NSLBP. We defined NSLBP as “low back pain not attributable to a recognizable, known specific pathology (e.g. infection, tumour, fracture, axial spondyloarthritis)” . The first step in the development of this COS and focus of this manuscript was to perform a Delphi study to reach international consensus on core outcome domains.
A detailed description of the methods of this Delphi study is presented elsewhere . An International Steering Committee with members from four continents, including researchers, care providers and patients’ representatives, worked on the development of this COS. The day-to-day conduction of the study was performed by a project team of four people (AC, CT, MB, RO) working at the same institution (VU University/VU Medical Center, Amsterdam) who designed and addressed key aspects of the study. The other members of the Committee were regularly consulted by e-mail regarding critical decisions.
The Steering Committee decided to involve four groups of stakeholders in the Delphi study: health care researchers, health care providers, professionals working both as researchers and providers, and patients with NSLBP. Professionals from many fields of clinical research relevant for NSLBP (e.g. orthopaedics, physiotherapy, epidemiology, psychology, rheumatology, rehabilitation medicine) were involved. Patients are judged to be essential in developing COSs as they can bring the perspective of those living with a health condition [16, 18]. Previous COS efforts involving patients or the public identified core outcome domains that were not previously identified by other stakeholders [27–29].
The main advantages of a Delphi method include the involvement of informed individuals, anonymity of responses that reduces influence of prominent personalities, and the possibility for Delphi panellists to reconsider their views based on feedback reports of previous rounds [30, 31]. As this project did not involve experiments with patients or study subjects, according to the Dutch Medical Research in Human Subjects Act (WMO), it was exempt from ethical approval. All patients involved were asked for their consent prior to participation and all procedures were conducted according to the Declaration of Helsinki.
Selection of panellists
A list of health care researchers who had extensively published on LBP over the last 10 years (2003–2013) was made by one reviewer (AC) through a structured search in Web of Science (accessed October 7, 2013) and PubMed . Other researchers and health care providers were added to this list through convenience sampling. Patients were recruited through the Steering Committee, seeking people who sought care for a present or past episode of NSLBP and had a fluent understanding of written English. When patients willing to participate were identified, they were contacted by email, given further information on the study and asked for consent to participate. Patients agreeing to participate were sent an information document giving simplified explanations of the terminology used in the study. Members of the Committee were also selected to participate in the Delphi so that they could express their vote on core domains. The final list of potential panellists was managed by the project team and names in the list remained blinded to all those selected for participation.
Generation of a list of potential core domains
The Steering Committee took responsibility for drawing a list of potential core domains that was used in the Delphi study. This list resulted from a search of outcome domains measured in clinical trials included in five recent systematic reviews [12, 13, 32, 33] (one of which not published yet) with addition of the (sub) domains included in the comprehensive International Classification of Functioning (ICF) core set for LBP , and in a conceptual model developed to characterize the burden of LBP . This conceptual model and the ICF core set were adopted to account for the patients’ perspective in this early phase. The model on the burden of LBP was developed by asking different stakeholders (including patients) which aspects of health were the most relevant to them ; the comprehensive ICF core set was shown to cover all health issues identified by patients with LBP . The OMERACT Filter 2.0 framework was used to structure the list of potential core domains, subdividing it into four core areas that encompass the complete content of what is potentially measurable in a clinical trial (“Appendix I”) . To determine wording and definitions of the potential core domains, terminology used in existing health frameworks or COSs were consulted: ICF , Patient Reported Outcomes Measurement Information System (PROMIS) , Wilson and Cleary Model  and IMMPACT [40, 41].
Three Delphi rounds, including open- and close-ended questions, were used to reach consensus on core outcome domains. Individuals not participating in one round, and who did not explicitly express their desire to opt-out, were invited to each subsequent round. The Delphi study was conducted using SurveyMonkey software and invitations to participate were sent by email.
In the first round, panellists were asked to judge whether each potential core domain was important enough to be included in this COS with possible answers ‘yes’, ‘no’ and ‘unsure/not my expertise’. Panellists were given the opportunity to propose changes of wording and definitions of domains, to indicate if some domains had major conceptual overlap or had to be aggregated, and to suggest the inclusion of missing potential core domains. A question was asked about the ideal number of domains for this COS and another about reporting of adverse events (AEs). Panellists were always encouraged to provide a rationale for their answers. A priori cut-off criteria were established for excluding domains that were rejected by more than 60 % and favoured by less than 20 % of respondents.
In the second round, a proposal was made for exclusion of domains that did not have at least 67 % of the first round respondents answering ‘yes’ or ‘unsure/not my expertise’. Other proposals were made for excluding or retaining domains suggested as having large conceptual overlap. Consensus for the second round was a priori set at 67 % of respondents agreeing with a proposal. Panellists were also asked to judge whether the potential core domains suggested as missing were important enough to be included in the COS, as done for the other domains in the first round.
The remaining potential core domains were presented in the third round to ask the panellists if each was indeed core. A priori consensus was set at 67 % of the panel agreeing that a domain is core. In each round, descriptive statistics were used to summarize all the questions. All rationales provided by panellists were checked against the quantitative results to evaluate whether substantial inconsistencies emerged. Responses of the patients’ group were always analysed separately to assess whether discrepancies were emerging with the rest of the panel. In the third round, frequencies of responses for each domain were calculated for the whole panel and separately for each of the stakeholder groups.
The project team made some proposals to the Steering Committee regarding the interpretation of the final results of the Delphi. Committee members expressed their opinion on each proposal and the opinion supported by more than 50 % of members was followed. Some proposals concerned the inclusion of a ‘death’ and a ‘pathophysiological manifestations’ domain in the COS (as recommended by the OMERACT initiative for all COSs ), and what would be an appropriate approach for the reporting of adverse events (AEs).
We selected a sample of 280 experts to participate: 139 researchers, 108 care providers, 15 patients, and 18 members of the Steering Committee. A flowchart of the response rate in each round is presented in Fig. 1; 79 of the selected panellists (29 %) participated in all three rounds. People from five continents participated, with the United States, The Netherlands, Australia and the United Kingdom being the most represented countries (Table 1). Socio-demographic characteristics, panellists’ disciplines of expertise and experience with NSLBP clinical research were not substantially different between rounds (Table 1). Fourteen patients (seven men and seven women) participated in the first round: three had current and past episodes of NSLBP, six had only a current episode, and three had NSLBP only in the past. Among the nine with current NSLBP: seven sought care for their back problem, three were off-work due to their LBP, two had acute NSLBP (i.e. pain for less than a month), three chronic NSLBP from three months to a year, four chronic NSLBP for more than a year. None of the patients underwent a surgical operation for current and/or past episodes of LBP. In total, forty-six panellists of the first round (32 %) sought care for a present or past episode of LBP but only those specifically invited as patients were considered part of this stakeholder group.
List of potential core domains
The list of potential core domains generated by the Steering Committee included 41 outcome domains, subdivided as follows: 1 in the core area ‘death’, 21 in ‘life impact’, 6 in ‘resource use/economical impact’ and 13 in ‘pathophysiological manifestations’. The list with all definitions used in the Delphi study is presented in Table 2.
Delphi round 1
The first round ran from February 18 to March 24, 2014. The results on inclusion of the 41 domains are presented in Fig. 2. Six domains met a priori criteria for exclusion: ‘legal services’, ‘body structures’, ‘muscle tone’, ‘structural stability’, ‘proprioception’ and ‘urination’. For 12 of the other domains, at least 67 % of respondents indicated that they should be included in the COS or were unsure about it (Fig. 2). The remaining 23 domains did not reach this threshold and their exclusion was proposed in the second round. No clear discrepancies between the patients’ perspective and overall panel responses were identified.
One hundred and thirty-one panellists answered the question on the ideal number of domains and 106 (81 %) indicated a specific number; the suggested median number of domains was 7 (interquartile range 5–10) and the majority of the comments were in favour of a small COS. The majority of respondents to the question on AEs (72 %) agreed that only AEs occurring outside of core outcome domains should be reported as AEs.
Several panellists emphasized the overlap of ‘health-related quality of life’ with other more specific domains (e.g. ‘physical functioning’, ‘psychological functioning’) (see “Appendix II”). To address this, a proposal was formulated for the second round to exclude ‘health-related quality of life’ from the list. Panellists also remarked that ‘work ability’ and ‘work productivity’ should not be included in all trials because they are not applicable to non-working populations, and because they overlap (“Appendix II”). These comments had to be balanced against favourable comments for inclusion and prompted a proposal for the second round to retain these two domains in the list with an adapted definition that includes also non-paid workers (e.g. students, housewives). Several panellists commented about the overlap of ‘pain interference’ with other domains (“Appendix II”) and these comments were addressed in a proposal to retain it in the list despite the overlap. Despite disagreements on inclusion of ‘non-health care services’ (Fig. 2), substantial arguments were put forward in its favour. Two patients emphasized that these services (e.g. alternative health care) can be very important, others highlighted that what constitutes ‘non-health care services’ can differ between countries and that they can be relevant cost-drivers (“Appendix II”). Based on these comments, a proposal for the second round was made to incorporate the content of this domain into ‘health care services’. In total, 16 new potential core domains were suggested by panellists for inclusion in the list. Appropriate definitions were searched for these domains and they were presented in the second round for rating (“Appendix III”).
Delphi round 2
The second round ran from April 27 to May 26, 2014. Consensus was reached for the exclusion of all but one domain (i.e. ‘social functioning’, 64 % consensus) that did not have at least 67 % support from the first round. No substantial arguments favoured the retention of these domains.
Consensus was not obtained for excluding the domain ‘health-related quality of life’ (55 % of the panel recommended its exclusion). Some substantial arguments (e.g. “Construct overlap can only be answered empirically. It is just as likely that the entire question set loads on a single factor, or that there are a few higher order factors. Pain, pain interference, physical functioning, QOL, work, sleep, self-rated health have all been showing to share variance in previous studies. […].”) explained the lack of consensus. Consensus was obtained (i.e. 85 %) for incorporating “non-health care services” into “health care services”, for retaining ‘work ability’ and ‘work productivity’ as independent domains (72 %), and for retaining ‘pain interference’ in the list (68 %).
However, relevant arguments were made against including ‘health care services’ and ‘work productivity’ in the list of potential core domains. These arguments outlined that, given the scope of the COS, it might not be appropriate to include these domains in efficacy trials (e.g. “[…] Often in trials patients are requested not to undertake/receive any other treatments during the intervention period, which means differences in use depend on things other than the patient’s health state […]”). Several panellists also questioned whether there are valid and reliable methods to assess these domains in all clinical trials (e.g. “[…] during follow-up the acquisition of accurate and reliable health care services data is questionable”, or “[…] both are difficult to assess, may be influenced by factors other than the presence of LBP, and I am not sure of the reliability of the assessment methods”). These domains were kept in the list but these arguments were highlighted in the third round.
None of the new potential core domains suggested in the first round reached consensus for inclusion. Votes for inclusion ranged from 60 % for ‘satisfaction with the outcome of treatment’ to 13 % for ‘travel and transportation’. No substantial differences between patients’ responses and the rest of the panel emerged. A total of 13 domains were retained in the list of potential core domains and presented in the last round.
Delphi round 3
The third round ran from June 23 to July 17, 2014. Three domains exceeded the a priori threshold for inclusion in the COS: ‘physical functioning’ (96 % of respondents indicating it as core), ‘pain intensity’ (90 %) and ‘health-related quality of life’ (73 %) (Fig. 3). These ratings were consistent across stakeholder subgroups with the only exception that the patients’ group that did not reach agreement (55 %) on ‘health-related quality of life’ (Fig. 3). ‘Work ability’ was rated as a core domain by 76 % of health care providers but only by 64 % of the whole panel and 36 % of the patients (Fig. 3). ‘Psychological functioning’ was considered a core domain by 76 % of care providers and 91 % of patients but not by the whole panel (Fig. 3). While providers and patients provided ten comments in favour of its inclusion, half of these supported its inclusion as a confounder or moderator, being these not appropriate arguments to support inclusion as an outcome domain. The other eight potential core domains did not reach consensus for inclusion in the COS for any stakeholder group, except 82 % of the patients that rated ‘self-rated health’ as a core domain (Fig. 3).
Based on the Delphi results, the majority of the Steering Committee members agreed on including ‘physical functioning’, ‘pain intensity’ and ‘health-related quality of life’ in this COS. The Steering Committee considered the inclusion of ‘health-related quality of life’ because there were strong arguments in its favour: overall consensus was reached, three groups of stakeholders were in favour, and its definition (Table 2) incorporated the excluded domains ‘psychological functioning’ and ‘self-rated health’ that were rated as core by some groups of stakeholders (Fig. 3). The Steering Committee also agreed on the exclusion of ‘work ability’ as overall agreement for inclusion was not reached, as several arguments for inclusion were weak and as it was not considered core by three groups of stakeholders (Fig. 3). These decisions were also taken with the intention of keeping this COS as short as possible to facilitate its implementation.
The majority of Steering Committee members agreed on including the domain ‘number of deaths’ in the COS as this emphasizes the need to report on the occurrence of deaths in every clinical trial. The Steering Committee acknowledges that death is a rare event for NSLBP clinical trials but a short statement, such as “no deaths occurred in this clinical trial”, would suffice to cover this outcome domain. The Steering Committee did not agree with the inclusion of a generic pathophysiological manifestation domain in this COS, as recommended by OMERACT . The main rationale for this decision was that not all interventions for NSLBP are targeting a pathophysiological manifestation, as this disorder is characterized by the absence of a known pathophysiology [6–8, 25]. Furthermore, its inclusion could create unnecessary increases in research costs and impact upon the brevity of the COS. This recommendation does not imply that measuring pathophysiological manifestations is unimportant in relevant NSLBP clinical trials and researchers are encouraged to include them when appropriate for their individual studies.
In the first round of the Delphi, consensus was reached on the reporting of AEs only for those domains not already included in the COS. This approach ensures that, where appropriate, AEs that occur within a core outcome domain (e.g. an increase in ‘pain intensity’ or a decrease in ‘health related quality of life’) are included in the statistical analysis. However, taking into account some comments by Delphi panellists, the Steering Committee decided to adopt a flexible approach to the reporting of AEs. This approach leaves the option open to trialists to also report, as separate AEs, those negative outcomes occurring within core domains.
Using the methodological guidance of initiatives like COMET and OMERACT [16, 19], we performed a Delphi study to provide an international, multidisciplinary and multistakeholder consensus-based update of an earlier standardized set of outcome domains for LBP research [20, 21]. Sufficient agreement was reached on core outcome domains that are part of a COS intended for clinical trials assessing efficacy or effectiveness of health interventions in patients with NSLBP. The domains included in this COS are ‘physical functioning’, ‘pain intensity’, ‘health-related quality of life’ and ‘number of deaths’ (see definitions in Table 2).
The domain ‘physical functioning’ reached the highest level of consensus in this study and the definition focuses on ability to engage in daily physical activities (Table 2). Our definition of ‘physical functioning’ will be fundamental to determine which measurement instrument would best measure this domain. IMMPACT recommendations for chronic pain clinical trials also suggest measuring ‘physical functioning’ as a core outcome domain [40, 41], and this convergence strengthens its inclusion.
‘Pain intensity’ also reached a very high level of consensus for inclusion in this COS. The inclusion of a pain domain is in line with the original core set [20, 21] and IMMPACT recommendations [40, 41]. ‘Pain intensity’ for this COS refers to the magnitude of the pain experience, whereas other pain (sub)domains were suggested for consideration by the previous core set and/or IMMPACT (e.g. ‘bothersomeness of pain’, ‘pain quality’, ‘temporal aspects of domains’, ‘pain medications’) [20, 21, 40, 41]. Some of those pain domains and others (i.e. ‘Pain behaviour’, ‘pain interference’) were presented as potential core domains in this Delphi but not sufficient agreement was reached to consider them as core (Figs. 2, 3).
‘Health-related quality of life’ included in this COS could be considered as the ‘successor’ of ‘general well-being’ included in the previous set [20, 21]. However, a definition of ‘general well-being’ was not given for the previous set and this makes a clear comparison of the two constructs challenging. Taking into account the widely accepted bio-psycho-social model for LBP [42, 43], it may be appropriate to have a domain like ‘health-related quality of life’ in this COS as its definition includes all components of the model (Table 2). The inclusion of all components of the bio-psycho-social model is also in line with the domains included in a conceptual framework developed to characterize the burden of LBP  and with the results of a review that attempted to summarize qualitative research conducted on the impact of LBP on people’s lives . However, it will be clear only when choosing measurement instruments for this COS if the different components of ‘health-related quality of life’ can be treated as separate domains or as one multidimensional domain. The choice of instruments will also be guided by the intention of minimizing redundancy of measurement, to avoid large overlap of instruments and promote brevity of the COS.
Another key aspect in the development of a COS is the definition of contextual factors (i.e. potential confounders and effect modifiers) that should be measured alongside core outcome domains . However, it was beyond the scope of this study to address contextual factors and for the measurement of these factors a reference is made to the prominent work of the National Institutes of Health (NIH) Task Force . This Task Force recently published a report on minimum baseline standards that should be collected in clinical studies for chronic LBP, to standardize their assessment .
This COS includes refined versions of three domains included in the previous standardized set but does not incorporate the other two: ‘disability social role’ and ‘satisfaction with care’ [20, 21]. ‘Disability social role’ referred to work absenteeism and could be replaced by the domain ‘work productivity’ used in this study, while ‘satisfaction with care’ was formulated as ‘satisfaction with treatment services’ in this study, but neither was supported by the Delphi panel (Figs. 2, 3). ‘Work productivity’ refers to indirect non-medical costs that are the first cost drivers for LBP  and it is an undoubtedly important outcome for clinical trials with economic evaluations alongside. However, this domain poses the challenge of its measurement in clinical trials aimed at assessing efficacy of interventions, in which an economic evaluation might be out of the scope of the trial. To support the exclusion of ‘satisfaction with treatment services’ several panellists underlined that it could be highly influenced by factors unrelated to an intervention (e.g. waiting list, amiability of providers, unfriendly receptionist, parking difficulty) and, consequently, that it could say relatively little about efficacy or effectiveness of that intervention.
This is the first Delphi study conducted to explore international, multistakeholder, and multidisciplinary consensus on core outcome domains to be reported in NSLBP clinical trials. This study highlighted diverging opinions on the importance of some domains and reinforced the wisdom of a comprehensive exercise to determine which outcome domains are felt by the majority to be core. The strengths of this study include methods that followed guidance of initiatives like COMET and OMERACT [16, 19], having a large expert panel of varied stakeholders representing various disciplines and countries, giving the opportunity to Delphi panellists to provide comments for each choice, allowing panellists to reconsider their views after considering other panellists’ reasoning, attempting to address strong arguments emerging from the Delphi panel, and rigorous reporting of methods  and results. One limitation of this study could be the relatively small number of patients involved in the Delphi rounds, which could have led to under or overestimation of the importance of certain domains from their perspective. However, the goal of this study was not to develop a comprehensive range of outcome domains important to all stakeholders, but rather a core set for inclusion in all clinical trials. Patients can also be involved in trial management teams where they can shape the range of outcomes measures collected in individual trials and this should represent good practice. Finally, the definition of COSs places emphasis on the concept of a minimum set [16, 19] and the four domains included in this COS seem to fit perfectly within this definition. The existence of a small COS for NSLBP should facilitate its inclusion in clinical trials, alongside trial-specific outcomes.
The development of a COS is a stepwise approach [16, 19] and this study determined core outcome domains for clinical trials on NSLBP. The next step will be to reach consensus on which measurement instruments should be used to measure these outcome domains. The selection of instruments will be focused on those that have demonstrated adequate measurement properties for these domains with the least participant burden. Recently published methodological guidance on this topic [46, 47] will help to conduct the next step for this COS in NSLBP.
A consensus-based COS for NSLBP was developed and included the domains ‘physical functioning’, ‘pain intensity’, ‘health-related quality of life’ and ‘number of deaths’. This COS represents the update of the standardized set proposed by Deyo et al. in 1998 [20, 21]. The brevity of this COS should facilitate its implementation in clinical trials assessing efficacy or effectiveness of health interventions for NSLBP. Future research should establish which measurement instruments are the most appropriate to measure these core outcome domains.
Murray CJ, Vos T, Lozano R, Naghavi M, Flaxman AD, Michaud C, Ezzati M, Shibuya K, Salomon JA, Abdalla S et al (2013) Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380(9859):2197–2223
Vos T, Flaxman AD, Naghavi M, Lozano R, Michaud C, Ezzati M, Shibuya K, Salomon JA, Abdalla S, Aboyans V et al (2013) Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380(9859):2163–2196
Hoy D, Bain C, Williams G, March L, Brooks P, Blyth F, Woolf A, Vos T, Buchbinder R (2012) A systematic review of the global prevalence of low back pain. Arthritis Rheum 64(6):2028–2037
Dagenais S, Caro J, Haldeman S (2008) A systematic review of low back pain cost of illness studies in the United States and internationally. Spine J 8(1):8–20
Lambeek LC, van Tulder MW, Swinkels IC, Koppes LL, Anema JR, van Mechelen W (2011) The trend in total cost of back pain in The Netherlands in the period 2002–2007. Spine 36(13):1050–1058
Cohen SP, Argoff CE, Carragee EJ (2008) Management of low back pain. BMJ 337:a2718
Deyo RA, Weinstein JN (2001) Low back pain. N Engl J Med 344(5):363–370
Koes B, Van Tulder M, Thomas S (2006) Diagnosis and treatment of low back pain. BMJ 332(7555):1430
Artus M, van der Windt DA, Jordan KP, Hay EM (2010) Low back pain symptoms show a similar pattern of improvement following a wide range of primary care treatments: a systematic review of randomized clinical trials. Rheumatology 49(12):2346–2356
Furlan AD, Pennick V, Bombardier C, van Tulder M (2009) 2009 updated method guidelines for systematic reviews in the Cochrane Back Review Group. Spine 34(18):1929–1941
Hayden J, Van Tulder MW, Malmivaara A, Koes BW (2005) Exercise therapy for treatment of non-specific low back pain. Cochrane Database Syst Rev 3:CD000335
Kamper SJ, Apeldoorn AT, Chiarotto A, Smeets RJ, Ostelo RW, Guzman J, van Tulder MW (2014) Multidisciplinary biopsychosocial rehabilitation for chronic low back pain. Cochrane Database Syst Rev 9:CD000963
Rubinstein SM, Terwee CB, Assendelft W, de Boer MR, van Tulder MW (2012) Spinal manipulative therapy for acute low-back pain. Cochrane Database Syst Rev 9:CD008880
Clarke M (2007) Standardising outcomes for clinical trials and systematic reviews. Trials 8(1):39
Kirkham JJ, Dwan KM, Altman DG, Gamble C, Dodd S, Smyth R, Williamson PR (2010) The impact of outcome reporting bias in randomised controlled trials on a cohort of systematic reviews. BMJ 340:c365
Williamson PR, Altman DG, Blazeby JM, Clarke M, Devane D, Gargon E, Tugwell P (2012) Developing core outcome sets for clinical trials: issues to consider. Trials 13(1):132
Kirkham JJ, Gargon E, Clarke M, Williamson PR (2013) Can a core outcome set improve the quality of systematic reviews?–a survey of the Co-ordinating Editors of Cochrane review groups. Trials 14(1):21
Tugwell P, Boers M, Brooks P, Simon L, Strand V, Idzerda L (2007) OMERACT: an international initiative to improve outcome measurement in rheumatology. Trials 8(1):38
Boers M, Kirwan JR, Wells G, Beaton D, Gossec L, d’Agostino M-A, Conaghan PG, Bingham CO III, Brooks P, Landewé R, March L, Simon LS, Singh JA, Strand V, Tugwell P (2014) Developing core outcome measurement sets for clinical trials: OMERACT filter 2.0. J Clin Epidemiol 67(7):745–753
Deyo RA, Battie M, Beurskens A, Bombardier C, Croft P, Koes B, Malmivaara A, Roland M, Von Korff M, Waddell G (1998) Outcome measures for low back pain research: a proposal for standardized use. Spine 23(18):2003–2013
Bombardier C (2000) Outcome assessments in the evaluation of treatment of spinal disorders: summary and general recommendations. Spine 25(24):3100–3103
Lin C-WC, Ostelo RW, Maher CG, Grotle M, Koes BW Core outcomes for clinical research in low back pain: revisited. In: Abstract Book, XII International Low Back Pain Forum, Odense (Denmark), 16–19 October 2012
Ferrer M, Pellisé F, Escudero O, Alvarez L, Pont A, Alonso J, Deyo R (2006) Validation of a minimum outcome core set in the evaluation of patients with back pain. Spine 31(12):1372–1379
Mannion AF, Porchet F, Kleinstück F, Lattig F, Jeszenszky D, Bartanusz V, Dvorak J, Grob D (2009) The quality of spine surgery from the patient’s perspective. Part 1: the Core Outcome Measures Index in clinical practice. Eur Spine J 18(3):367–373
Balagué F, Mannion AF, Pellisé F, Cedraschi C (2012) Non-specific low back pain. Lancet 379(9814):482–491
Chiarotto A, Terwee CB, Deyo RA, Boers M, Lin CW, Buchbinder R, Corbin TP, Costa LO, Foster NE, Grotle M, Koes BW, Kovacs FM, Maher CG, Pearson AM, Peul WC, Schoene ML, Turk DC, van Tulder MW, Ostelo RW (2014) A core outcome set for clinical trials on non-specific low back pain: study protocol for the development of a core domain set. Trials 15(1):511
Kirwan J, Heiberg T, Hewlett S, Hughes R, Kvien T, Ahlmèn M, Boers M, Minnock P, Saag K, Shea B, Suarez Almazor M, Taal E (2003) Outcomes from the patient perspective workshop at OMERACT 6. J Rheumatol 30(4):868–872
Kirwan JR, Hewlett SE, Heiberg T, Hughes RA, Carr M, Hehir M, Kvien TK, Minnock P, Newman SP, Quest EM, Taal E, Wale J (2005) Incorporating the patient perspective into outcome assessment in rheumatoid arthritis–progress at OMERACT 7. J Rheumatol 32(11):2250–2256
Sanderson T, Morris M, Calnan M, Richards P, Hewlett S (2010) What outcomes from pharmacologic treatments are important to people with rheumatoid arthritis? Creating the basis of a patient core set. Arthritis Care Res 62(5):640–646
Binkley J, Finch E, Hall J, Black T, Gowland C (1993) Diagnostic classification of patients with low back pain: report on a survey of physical therapy experts. Phys Ther 73(3):138–150
Sinha IP, Smyth RL, Williamson PR (2011) Using the Delphi technique to determine which outcomes to measure in clinical trials: recommendations for the future based on a systematic review of existing studies. PLoS Med 8(1):e1000393
Henschke N, Ostelo R, van Tulder MW, Vlaeyen J, Morley S, Assendelft W, Main CJ (2010) Behavioural treatment for chronic low-back pain. Cochrane Database Syst Rev 7:CD002014
Oosterhuis T, Costa LO, Maher CG, de Vet HC, van Tulder MW, Ostelo R (2014) Rehabilitation after lumbar disc surgery. Cochrane Database Syst Rev 3:CD003007
Cieza A, Stucki G, Weigl M, Disler P, Jackel W, van der Linden S, Kostanjsek N, de Bie R (2004) ICF Core Sets for low back pain. J Rehabil Med 44 Suppl:69–74
Buchbinder R, Batterham R, Elsworth G, Dionne CE, Irvin E, Osborne RH (2011) A validity-driven approach to the understanding of the personal and societal burden of low back pain: development of a conceptual and measurement model. Arthritis Res Ther 13(5):R152
Mullis R, Barber J, Lewis M, Hay E (2007) ICF core sets for low back pain: do they include what matters to patients? J Rehabil Med 39(5):353–357
ICF Classification Browser. http://www.apps.who.int/classifications/icfbrowser/. Accessed 26 September 2014
PROMIS Domain Framework. http://www.nihpromis.org/measures/domainframework/. Accessed 26 September 2014
Wilson IB, Cleary PD (1995) Linking clinical variables with health-related quality of life: a conceptual model of patient outcomes. JAMA 273(1):59–65
Dworkin RH, Turk DC, Farrar JT, Haythornthwaite JA, Jensen MP, Katz NP, Kerns RD, Stucki G, Allen RR, Bellamy N et al (2005) Core outcome measures for chronic pain clinical trials: IMMPACT recommendations. Pain 113(1–2):9–19
Turk DC, Dworkin RH, Allen RR, Bellamy N, Brandenburg N, Carr DB, Cleeland C, Dionne R, Farrar JT, Galer BS et al (2003) Core outcome domains for chronic pain clinical trials: IMMPACT recommendations. Pain 106(3):337–345
Pincus T, Kent P, Bronfort G, Loisel P, Pransky G, Hartvigsen J (2013) Twenty-five years with the biopsychosocial model of low back pain—is it time to celebrate? A report from the twelfth international forum for primary care research on low back pain. Spine 38(24):2118–2123
Waddell G (1987) 1987 Volvo award in clinical sciences: a new clinical model for the treatment of low-back pain. Spine 12(7):632–644
Froud R, Patterson S, Eldridge S, Seale C, Pincus T, Rajendran D, Fossum C, Underwood M (2014) A systematic review and meta-synthesis of the impact of low back pain on people’s lives. BMC Musculoskelet Disord 15:50
Deyo RA, Dworkin SF, Amtmann D, Andersson G, Borenstein D, Carragee E, Carrino J, Chou R, Cook K, DeLitto A (2014) Report of the NIH Task Force on research standards for chronic low back pain. J Pain 15(6):569–585
Macefield RC, Jacobs M, Korfage IJ, Nicklin J, Whistance RN, Brookes ST, Sprangers MA, Blazeby JM (2014) Developing core outcomes sets: methods for identifying and including patient-reported outcomes (PROs). Trials 15(1):49
Prinsen CA, Vohra S, Rose MR, King-Jones S, Ishaque S, Bhaloo Z, Adams D, Terwee CB (2014) Core Outcome Measures in Effectiveness Trials (COMET) initiative: protocol for an international Delphi study to achieve consensus on how to select outcome measurement instruments for outcomes included in a ‘core outcome set’. Trials 15(1):247
We would like to acknowledge all the people who participated in at least one round of the Delphi study. These people (excluding members of the Steering Committee) are listed here in alphabetical order: William A. Abdu, Luc Ailliet, Marcelo Anderson Bracht, Gunnar Andersson, Adri T. Apeldoorn, Majid Artus, Julie Ashworth, Steven J. Atlas, Roxy Azoory, Marco Barbero, Heinz Dieter Basler, David Baxter, Ramsin Benyamin, Mark D. Bishop, Paul Bishop, David Borenstein, Lex Bouter, Hilary Bradbury, Alan Breen, Jens Ivar Brox, Elaine Buchanan, Alex Burdorf, Eugene J. Carragee, John David Cassidy, Roger Chou, Aldo Ciuro, Kris Clark, Steven P. Cohen, Pierre Côté, Peter Croft, Vinicius Cunha Oliveira, Wim Dankaerts, Gavin Davis, Ric Day, Rob de Bie, Henrica C. W. de Vet, Clermont E. Dionne, Wendy T. Enthoven, Hege R. Eriksen, Felipe Fagundes, Carmen Fernandez, Silvano Ferrari, Manuela Ferreira, Paulo H. Ferreira, Timothy W. Flynn, Victoria Franzinetti, Robert Froud, Andrea Furlan, Diego Galace, Robert J. Gatchel, Steven George, Sergio Gimenez Basalotte, Hedley Griffiths, Lars Grovle, Andrew John Haig, Murray Hames, Mark Hancock, Ian Harris, Jan Hartvigsen, Anne Julsrud Haugen, Elaine Hay, Rowland G. Hazard, Standiford Helm, Rob Herbert, Jan Hildebrandt, Deirdre A. Hurley, Eric L. Hurwitz, Julia Hush, Frank Huygen, Wilco C. Jacobs, Matthew Jennings, Johan Juch, Steven J. Kamper, Jaro Karppinen, Peter Kent, Suraj Kumar, Charlotte Leboeuf-Yde, Myeong So Lee, Martyn Lewis, Patrick Loisel, Pim A. J. Luijsterburg, Jon D. Lurie, Luciana Macedo, Luciana Machado, Laxmaiah Manchikanti, Anne F. Mannion, Lynn March, Norman Marcus, Teresa Marin, James McAuley, Alison McGregor, Luciola Menezes Costa, Jan Mens, Stephan Milosavljevic, Shail K. Mirza, Marco Monticone, Lorimer Moseley, Paulo Nascimento, Stefano Negrini, Colin Nelson, Jo Nijs, Oystein Nygaard, John O’Dowd, Teddy Oosterhuis, Richard Osborne, Peter O’Sullivan, Adriano Pezolato, Michael Pfingsten, Serge Poiraudeau, Jan Pool, Pina Porzio, Kristen Radcliff, James Rainville, Francois Rannou, Lisa Roberts, Michael E. Robinson, Myron Rogers, Martin Roland, Ana Royuela, Tamer Sabet, Petry Saeys, Marcus Schiltenwolf, Gay Schoene, Jesus Seco Calvo, Ruth Sephton, William S. Shaw, Karen J. Sherman, Rob Smeets, Anne J. Smith, Matthew Smuck, Bart Staal, Kjersti Storheim, Liv Inger Strand, Michael Sullivan, Simo Taimela, Kazuhisa Takahashi, Judith A. Turner, Martin Underwood, Alexander Vaccaro, Allard van der Beek, Bob van der Meiracker, Danielle van der Windt, Hans van Helvoirt, Willem van Mechelen, Rodrigo Vasconcelos, Arianne Verhagen, Johan W. S. Vlaeyen, Michael von Korff, Debra K. Weiner, Harriet Wittink, Ian Wright, Gustavo Zanoli.
Conflict of interest
None of the authors has any potential conflict of interest.
See Table 3.
See Table 4.
Missing potential core domains suggested by panellists in the first round
Decision quality = shared decision making that focuses on two areas: (1) the extent to which patients are informed about treatment options, and (2) the extent to which treatments match what is most important to patients.
Satisfaction with the outcome of treatment = extent to which the patient is satisfied with the results of treatment.
Abuse/misuse of drugs = misuse or abuse of prescribed or not prescribed drugs.
Pain self-efficacy = degree of confidence a patient has in performing regular activities despite the presence of pain.
Pain catastrophizing = tendency to misinterpret and/or exaggerate actual or anticipated pain experiences.
Bodily extent of pain = number of painful body areas besides the pain experienced in the back.
Size of painful area = dimension of the body area in which the patient experiences pain.
Compliance with treatment = degree to which a patient follows treatment in all its components.
Coping = purposeful use of cognitive and behavioral techniques to manage demands that are perceived as stressful or taxing the resources of the individual.
Patient nominated goals = subjective judgment about the achievement of pre-set individual goals.
Participation = involvement in a life situation.
Recovery = an individual’s determination of his/her recovery which involves cognitive appraisal of the impact of symptoms on his/her life, the capacity to perform relevant daily activities, and achievement of an acceptable quality of life through readjustment and other adaptive strategies.
Acceptance of pain = willingness to have uncomfortable experiences when the actions that bring about those experiences serve important purposes for the individual.
Kinesiophobia = excessive, irrational, and debilitating fear of physical movement and activity resulting from a feeling of vulnerability to painful injury or reinjury.
Patient impression of change = patient’s assessment of the change related to low back pain since beginning of treatment.
Travel and transportation = patient’s difficulty with travel and transportation.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
About this article
Cite this article
Chiarotto, A., Deyo, R.A., Terwee, C.B. et al. Core outcome domains for clinical trials in non-specific low back pain. Eur Spine J 24, 1127–1142 (2015). https://doi.org/10.1007/s00586-015-3892-3
- Core outcome set
- Clinical trials
- Non-specific low back pain