Background

Despite expansion of services, there are indications that the prevalence of back-related disability is much higher than that reported 40 years ago [1]. Large scale surveys of workers in 31 countries, including 27 Member States of the European Union [2], have shown that 25% of workers across a range of occupations suffer from back pain. The Health and Safety Executive report a 12 month prevalence of 47% in computer users [3], and up to 12% of the population were found to consult their GP or practice nurse for back pain at least once in the year ending 31 March 2009 [4].

The direct health care costs of back pain have been estimated to be $90,600 million in the United States and £1,632 million in the UK with the largest proportion of direct medical costs spent on physical therapy [5]. The economic burden is not limited to health care but has implications for the individual and society in general with production losses and informal care reported to be more than ten times the direct cost [5]. A small subset of the population, resistant to rehabilitation and with a poor prognosis for recovery, are disproportionately heavy users of health resources.

The identification of prognostic indicators is a pre-requisite to improving the targeting of services, particularly in light of controversial results of recent studies on treatment efficacy [6-9]. Demographic factors have been consistently linked to outcome [10-13] as have psychological factors [14-16], psychosocial factors [11, 12, 17], clinical history [13, 18] and work factors [10, 12, 19]. However, many studies are reported to be methodologically weak, often failing to recruit a relevant or sufficient cohort, often with fewer than 200 participants [20]. The large range of prognostic indicators presented in previous studies also results from the failure to consider simultaneously the major domains or other prognostic variables with which those identified might be correlated [21]. This study based in primary care addresses these issues using a multiple regression model.

Our objective was therefore to determine whether individual variables or domains were linked to recovery at six months when all identified prognostic variables were taken into account in a large population of unselected back pain patients.

Methods

The Clinic

The lack of direct access physiotherapy for low back pain prompted the establishment by the Central London Multifund and the Westminster Primary Care Trust of a multidisciplinary community based back pain clinic. The service was to provide a complex package of care, based on published guidelines [22] largely consistent with the NICE guidelines published in May 2009 [23]. The clinic employed physiotherapists, osteopaths and clinical psychologists, and patients had access to physicians, providing a treatment package that could be tailored to the needs of the individual. The commissioning included a parallel service evaluation to examine the contribution of demographic, psychosocial, clinical and work factors to the change in functioning of patients referred to the clinic.

Participants

All 687 consecutive patients with simple low back pain or nerve root pain referred to the clinic by local general practitioners during a twelve-month period to August 2000 were eligible for inclusion in the study. Patients were excluded in line with red flag symptoms for possible serious spinal pathology. These included cauda equina symptoms, sphincter disturbance or saddle anaesthesia, non-mechanical pain, thoracic pain, a history of weight loss, widespread neurological deficit or structural deformity [22]. Patients were excluded at the point of diagnostic triage by their General Practitioner, but a further two patients with organic disease (spinal abscess and kidney disease) were excluded following attendance at the clinic.

A minimal dataset was collected for 48 patients who failed to attend their first appointment. A further 46 failed to complete the baseline questionnaire. 593 consecutive patients completed a self-administered baseline questionnaire, 55 (9%) with the assistance of an interpreter. All participants gave written informed consent and the baseline questionnaire took 30 minutes to complete. Ethical approval was granted by St Mary's Hospital Regional Ethics Committee. The research carried out was in compliance with the Declaration of Helsinki.

Baseline patient questionnaire

Factors identified in one or more published reports as predicting functional outcome can be considered to fall into four domains: demographic factors, psychosocial factors, work characteristics and clinical history.

The selection of instruments to be included in the questionnaire (Additional file 1) was based on three considerations: firstly, the areas identified in previous studies to have prognostic value; secondly, questionnaires with demonstrated validity and reliability and with established use in these areas; and thirdly, a set of instruments which covered the main aspects of each domain and complemented each other without significant redundancy or overlap. The choice of instruments was informed by an international group of back pain researchers who recommended a standard battery of outcome measures to represent the multiple dimensions of outcome in the field of back pain [24]. The domains described included pain symptoms, back related function, generic well-being and disability. These authors anticipated that the instruments would evolve with time and that the core instrument would be sufficiently brief to allow investigators to add other measures to the battery dependent on their research interest. In this study, this core data set was expanded to provide greater breadth and depth, which included adding measures of somatisation and depression.

The study followed defined criteria for methodological quality for studies of prognosis, which included participants, selected as consecutive cases, with at least one prognostic outcome available from at least 80% of study population at three month follow up or later, and with appropriate statistical adjustment [20, 21].

Demographic factors

The baseline questionnaire included questions on age, sex, self-reported height and weight, smoking history [24] and information on usual levels of physical activity before the onset of the current episode. Participants were asked how frequently (three times a week or more, once or twice a week, one to three times a month, never or hardly ever) they took part in sports or activities which were mildly energetic (eg walking, woodwork, weeding, hoeing, bicycle repair, playing darts, general housework), moderately energetic (eg scrubbing, polishing car, chopping, dancing, golf, cycling, decorating, lawn mowing, leisurely swimming) or vigorous (eg running, hard swimming, tennis, squash, digging, cycle racing). The responses were then coded to reflect both intensity of activity and frequency of participation, an approach used in the Whitehall II study which looked at the causes of back pain in 10,308 participants [25]. A similar approach and level of coding has been recommended in a recent proposal for core outcome measures in back pain [14]. The categories used in the Office for National Statistics 1991 census classification system were used to define ethnicity.

Employment characteristics

The core elements relating to work as a risk factor for back pain were determined. Participants in paid employment (n = 217) were additionally asked about control over various aspects of their work content and environment using a 21-point scale [25, 26]. Although some studies have reported low job satisfaction as a risk factor for sickness absence due to low back pain [27], previous work in this area [26, 28, 29], including that of members of the steering committee, has suggested that control over the work environment (level of decision making about own workload/flexibility/work colleagues/speed of work/environment) was an important prognostic indicator with greater importance than job satisfaction and other commonly defined measures. However, it was also recognised that physical characteristics of the task were significant, including postural and mechanical demands, so questions on the key physical elements, including length of time sitting and standing, typical lifting demands and the frequency of lifting tasks, were included within the questionnaire.

Clinical history and presentation

A series of questions sought to characterise back pain history, including time since first onset, length of current and usual episode and the frequency of episodes within the previous twelve-month period. Data on clinical presentation including neurological signs and altered sensation, impaired reflexes and pain radiation pattern were recorded by the clinician at the first appointment. The Von Korff scale [30] was used to measure the severity of pain, comprising seven questions which combine to provide a measure of pain related disability, persistence and affective distress.

Psychosocial and psychological factors

Housing tenure and age on leaving full time education were recorded as an index of socio-economic status [31] and information on marital and work status was also requested. The core elements in the assessment of psychological risk factors for back pain were identified. Distress/depression and somatisation are reported as having an important role in the longevity of back pain [13, 16]. The Modified Zung Depression Inventory and the Modified Somatic Perception Questionnaire (MSPQ), as a measure of somatic anxiety, make up the Distress and Risk Assessment Method (DRAM) [32]. These are commonly used outcome measures [33-35] and were included in the questionnaire. The Modified Zung Depression Inventory is a 23-item patient-completed scale measuring depressive symptoms in back pain patients. Scores range from 0 to 69 with higher scores indicating greater depression. The MSPQ consists of 13 questions, each scored 0 to 3, with a total possible range of 0 to 39 and higher scores indicating greater somatic awareness. This approach to the measurement of psychological state was also used in the UK BEAM trial [13] and additional components were covered by other instruments included in the battery of questionnaires.

Back pain functional outcomes

The 24-item Roland Morris Disability Questionnaire [36] was pre-specified as the primary outcome. It is among the most widely used measures of back-related function [7, 13, 14, 17] and has been proposed as part of an international instrument for standardised use [24]. To enable a more global comparison, the physical functioning scale of the SF-36, which comprises 10 items on activities of daily living, was also recorded.

Six month postal follow-up

Follow-up questionnaires were sent to participants six months after completing the baseline questionnaire. For non-responders a reminder and second questionnaire were sent two weeks later. Persistent non-responders were invited to complete a short version of the questionnaire over the telephone. The six-month follow-up questionnaire took 10 minutes to complete.

Statistical Analysis

Chi square, Kruskal-Wallis and Mann Whitney analyses were used for the comparison of baseline characteristics between responders and non-responders. Differences between baseline and follow-up outcome scores were analysed using repeated measures analysis of variance. Logistic regression was used to determine odds ratios of recovery, initially for every variable independently and then in a multiple regression model.

Recovery was defined as a Roland Morris disability score of zero at six-month follow-up. An odds ratio of greater than 1 indicated that recovery was more likely to occur.
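As a minimal sketch of the unadjusted step, the odds ratio for a single binary predictor can be computed directly from a 2x2 table of recovery by exposure. For one binary predictor this cross-product ratio coincides with the estimate from an unadjusted logistic regression. The function names and the counts in the example are hypothetical illustrations, not study data, and the confidence interval uses the standard log-scale (Woolf) approximation, which is an assumption about the method rather than a description of the SPSS output.

```python
import math


def is_recovered(rmdq_score):
    """Recovery as defined in the text: a Roland Morris score of zero."""
    return rmdq_score == 0


def odds_ratio_ci(rec_exposed, not_rec_exposed, rec_unexposed, not_rec_unexposed, z=1.96):
    """Odds ratio of recovery with an approximate 95% confidence interval.

    The interval is computed on the log scale (Woolf method) and all four
    cell counts must be greater than zero.
    """
    a, b, c, d = rec_exposed, not_rec_exposed, rec_unexposed, not_rec_unexposed
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: 30 of 100 exposed and 15 of 100 unexposed recovered.
estimate, lower, upper = odds_ratio_ci(30, 70, 15, 85)
print(f"OR {estimate:.2f} ({lower:.2f} to {upper:.2f})")
```

An estimate above 1 with an interval that excludes 1 would be read as recovery being more likely in the exposed group.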

Because baseline score may be a determinant of the amount of change, the individual variables were adjusted for initial score. Both adjusted and unadjusted data are reported to avoid the biases discussed by Altman [21]. Categorical independent variables were entered directly into the model. Data over the full range of each scale were collected; however, for the purposes of the regression analyses, dummy coding was used to dichotomise the independent continuous variables by their median values. This has the advantage of allowing a clear interpretation of the odds ratios and avoids the restrictive assumption of a linear relationship between variables. Missing data for the MSPQ and Zung indices were handled by mean imputation where at least half the items were present. All analyses were conducted in SPSS (SPSS Inc, Chicago, Illinois) and overseen by an independent statistician.
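The two data preparation steps described above can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually run in SPSS: the text does not say whether missing items were replaced by the participant's own item mean or by a sample mean (the sketch assumes the former), nor how values exactly equal to the median were coded (the sketch places them in the lower group). The function names are hypothetical.

```python
from statistics import median


def impute_scale_total(items):
    """Total scale score with missing items replaced by the person's item mean.

    `items` holds one score per question, with None for a missing answer.
    Returns None when fewer than half of the items were answered.
    Assumption: imputation uses the participant's own mean item score.
    """
    answered = [score for score in items if score is not None]
    if not answered or len(answered) * 2 < len(items):
        return None
    item_mean = sum(answered) / len(answered)
    # Answered items plus the mean for each missing item equals mean * item count.
    return item_mean * len(items)


def dichotomise_at_median(values):
    """Dummy-code a continuous variable: 1 if above the sample median, else 0.

    Assumption: values equal to the median are coded 0; None stays None.
    """
    cut = median(v for v in values if v is not None)
    return [None if v is None else int(v > cut) for v in values]


# Hypothetical four-item scale with one missing answer, then a median split.
print(impute_scale_total([3, None, 1, 2]))
print(dichotomise_at_median([1, 2, 3, 4, None]))
```

For the 13-item MSPQ this rule would require at least 7 answered items before a total is imputed.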

Results

At six months, 484 participants completed a follow-up questionnaire, a response rate of 82%. There were 112 persistent non-responders, despite a strict follow-up protocol of postal reminders and phone calls.

Respondents and non-respondents were similar with respect to all baseline variables, with the exception that non-respondents were more likely to be male and to be living in either private rental or rent-free accommodation. Participants attended an average of 6 (SD 3.7) treatment sessions.

Primary Outcomes

The mean Roland Morris disability score improved by 3.8 index points (95% confidence interval 3.23 to 4.32, p < 0.001) over the six month follow-up, from 11.6 at baseline assessment to 7.8 index points at six months. The distribution of change scores is illustrated in Figure 1. The SF-36 physical functioning scale improved by 10.7 scale points (95% confidence interval 8.4 to 13.0, p < 0.001) from 49.2 at baseline assessment to 59.8 points at six months.

Figure 1 Change in Roland Morris disability questionnaire score from baseline to six-month follow-up.

Demographic factors

Persistence of symptoms at six months was predominantly associated with ethnic grouping. Participants who categorised themselves as non-white had a reduced odds ratio for recovery of 0.39 (0.20 to 0.74, p = 0.004). Those recording ethnic group as North African or Middle Eastern showed a change of less than one index point on the Roland Morris questionnaire and 2.7 points on the SF-36. There was some evidence to suggest that participants who recorded a higher frequency of exercise participation were more likely to have recovered at six months than those who rarely undertook exercise (Table 1).

Table 1 Demographic factors

Ideally, where individual prognostic variables are found to be predictive of outcome, efficient clinical cut-off scores could be used to make decision rules about the need for treatment. This has theoretical and clinical relevance for binary variables like gender or previous surgery. However, in the case of BMI the main analysis used a grouping of data above and below the median. To give greater clinical relevance a further analysis was undertaken. The BMI is usually graded as underweight, optimal, overweight, obese and morbidly obese, rather than dichotomised as required for the main analysis. The mean change scores (with 95% confidence intervals) in the RMDQ for these categories were as follows: BMI less than 25 (underweight/optimal) 4.3 (3.4 to 5.2); BMI 25.01 to 30 (overweight) 4.0 (2.7 to 5.3); and BMI 30 and over (obese/morbidly obese) 4.3 (2.4 to 6.3). A correlation between the continuous variable BMI against the change score in the Roland Morris yielded a non-significant Pearson Product Moment coefficient of 0.05 supporting the results of the main analysis. The extreme categories (BMI less than 18.5 and more than 40.0) had too few participants to provide meaningful independent analysis. Prognostic indicators in these groups may warrant further study.

Employment characteristics

Although we found some evidence to suggest that those in paid employment and the self employed had a greater chance of recovery (odds ratio 2.07, 1.21 to 3.54, p = 0.008), factors including control over work, the work environment and the physical characteristics of the tasks involved were not linked to recovery (Table 2). The questions addressing the physical characteristics of work were condensed to two dimensions. The question on time spent sitting was the converse of time spent walking so it was logical to reduce this to one variable. Similarly, few participants recorded lifting 50 kg and those that did also lifted 25 kg, so this was also reduced to one variable. The data provide an indication of the nature of work undertaken, whether largely sedentary or involving heavy lifting.

Table 2 Employment characteristics

Of the 45 participants who were in paid employment and reported being absent from work as a direct result of their back pain, only 3 (6%) reported that they were still absent at six month follow-up.

Clinical history and Presentation

The odds ratio for recovery increased in participants who had experienced fewer than twelve short episodes in the past twelve months compared to those who described the nature of their episodes as continuous. Participants categorised as Grade IV on the Von Korff pain scale, indicating high levels of disability and severely limiting pain, had a reduced chance of recovery (odds ratio 0.07, 0.02 to 0.23, p < 0.001) compared to those classified as Grade I, indicating low disability and low intensity (Table 3).

Table 3 Clinical history and Presentation

Since completing treatment, 69% of participants reported experiencing a further spell of back pain, although only 21% felt it severe enough to see either their GP or other health practitioner. These included physiotherapists, osteopaths, chiropractors, acupuncturists, orthopaedic surgeons or rheumatologists.

Psychosocial and psychological factors

Low scores on the Zung depression inventory (odds ratio 3.43, 1.90 to 6.19, p < 0.001) and the index of somatic anxiety (odds ratio 2.36, 1.36 to 4.09, p < 0.001) were found to be linked to improvement in Roland Morris disability and SF-36 physical functioning scores, with sizable effects on both scales. However, once the individual variables were adjusted for baseline Roland Morris scores, their effect was reduced (Table 4).

Table 4 Psychosocial and psychological factors

The scores for the MSPQ and the Zung Depression Inventory (Table 4) are comparable with other studies of similar cohorts (mean MSPQ: 5.6 [37], 9.7 [38], 6.7 [39]; mean Modified Zung Depression Index: 24.9 [37], 29.7 [38], 23.7 [39]). However, there are many ways of analysing and reporting data for psychological problems. Using the decision rules for the Distress and Risk Assessment Method (DRAM) defined by Main [32] and used in the UK BEAM trial [13] and other studies [33, 40], patients can be classified into clusters depending on their scores on the MSPQ and the Zung Depression Inventory (Table 5).

Table 5 DRAM classification of participants who responded at 6 months and provided both Zung and MSPQ scores (N = 471)

The psychological profile of participants in this study, categorised using the DRAM, is comparable to similar cohorts of people with back pain (N 37%, R 42%, DD 13% and DS 9% [33]; N 24%, R 42%, DD 24% and DS 10% [40]). This suggests that the greatest proportion of participants were in the normal or at risk categories rather than in the distressed (somatic or depressive) categories.

Multiple Regression Model

Because baseline score is a determinant of the magnitude of change, and there is likely to be co-dependency in the data, those factors found to be predictive of outcome, defined as variables with an unadjusted p-value of less than 0.1, were entered into a multiple regression (binary logistic) analysis, controlling for all other variables in the model (Table 6). Adjustment was also made for age and sex.
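The screening rule described above can be sketched as a simple filter. The variable names and p-values below are hypothetical placeholders rather than study results, and the function name is an assumption introduced for illustration. Age and sex are always carried into the model, as stated in the text.

```python
def select_for_multiple_regression(unadjusted_p, threshold=0.1, forced=("age", "sex")):
    """Pick variables for the multiple logistic regression.

    Keeps every variable whose unadjusted p-value is strictly below the
    threshold, then appends the variables that are adjusted for regardless
    of their own p-value.
    """
    selected = [name for name, p in unadjusted_p.items() if p < threshold]
    for name in forced:
        if name not in selected:
            selected.append(name)
    return selected


# Hypothetical unadjusted p-values, for illustration only.
example_p = {
    "predictor_a": 0.004,
    "predictor_b": 0.09,
    "predictor_c": 0.35,
    "age": 0.60,
    "sex": 0.02,
}
print(select_for_multiple_regression(example_p))
```

A threshold of 0.1 is deliberately more permissive than 0.05, so that variables whose effect is masked in the unadjusted analysis still have a chance to enter the adjusted model.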

Table 6 Multiple regression analysis of predictive variables

In the adjusted model, self-classification as 'non-white' as opposed to 'white' was associated with a reduced chance of recovery (odds ratio 0.41, 0.18 to 0.96, p = 0.039). The pattern of back pain over the previous twelve months also had an impact on recovery: the odds of recovery were higher in those who reported episodic rather than continuous pain (odds ratio 2.64, 1.25 to 5.60, p = 0.005), with greatest improvement in those with fewer, brief episodes of back pain. Change in Roland Morris disability scores for each sub-classification of the two variables with predictive value in the multiple regression model is shown in Table 7.

Table 7 Mean change in Roland Morris for ethnic classification and episodic history (n = 472)

Discussion

In this large study of prognostic indicators for recovery only ethnic grouping and periodicity of the participant's back pain were linked to recovery at six months. The results do not support the value of commonly identified determinants of outcome within demographic, psychosocial, employment and clinical domains. The results illustrate the importance of controlling confounding variables and the adjusted analysis provides an estimate of the independent effect of each variable, providing a measure of whether it contains additional prognostic information [21].

To our knowledge, this is one of the largest studies of prognostic indicators for recovery from back pain, with the highest completion to follow-up, investigating the relative contribution of predictive factors from domains often viewed in isolation.

Although data were generated from a single centre, comparison with data from other primary care studies suggests that the participants in our study are representative of the patients with back pain in primary care [7, 41]. The measurement of the impact of cultural differences on change scores is limited by the relative size of the ethnic groups who demonstrated little benefit, predominantly those who classified themselves as being from North African, Chinese or Middle Eastern countries. Although interpreters accompanied many participants, it is possible that language barriers and cultural differences in the experience and report of pain may have had some influence. The intermittent nature of back pain may also have had a bearing on the results and would depend on the number of participants experiencing an episode at the point of follow-up. The treatment package offered to all patients at the clinic comprised the same basic components and the clinician completed a record of each treatment session. The pragmatic nature of this observational study meant that individual treatments varied to some degree, but they were comparable to the treatment options specified in the recently published NICE guidelines on the management of persistent, non-specific low back pain [23].

Whilst work injury and compensation status have been thought to influence the course of back pain, a recent systematic review found insufficient evidence to establish the importance of compensation on aspects of recovery [11]. Only a small proportion of the sample in this study were in paid employment and off work as a result of their back pain and it was therefore not considered one of the core predictors for this sample. However, this may need to be considered in a demographically different sample.

The Fear-Avoidance Beliefs Questionnaire [42] was not selected as a core instrument, although many of the items of this 16-item questionnaire were similar to those covered. 'Despite the prevalent focus on fear', a recent systematic review found little evidence to link fear-avoidance with poor prognosis; however, the authors did report a growing consensus that distress/depression plays an important role [16]. We were mindful that the length of the questionnaire, which was already substantial, could become prohibitive. However, future studies may benefit from including these aspects in greater depth in their battery of questionnaires. It is appreciated that there may be gaps in data collected, although these are not anticipated to be substantial [15].

The improvement in Roland Morris (3.8 index points) and SF-36 scales (10.7 points) suggests significant clinical recovery. The Medical Research Council funded UK BEAM trial specified a 2.5-point change on the Roland Morris disability score as a clinically significant change [43], smaller than the mean change seen in this study. The results differ from previously reported research which found links between demographic factors including age, sex and height, or pattern of activity [10-13] and from those linking outcome to psychological [14] or psychosocial factors [11, 17]. Chronic pain-related disability results in learned behaviours which can become apparent within the first few weeks of onset. Whether psychosocial changes only become apparent as a history of back pain develops has yet to be demonstrated.

Back pain history has been recognised as a strong predictor of future episodes [18]. However, in the adjusted model, the only clinical history variable linked to recovery was the continuous or intermittent nature of the participant's pain. In this study, no evidence was found to suggest that recovery was affected by physical exposure or by the degree of control experienced within the working environment, contrasting with previously cited indicators [10, 19, 44], but in agreement with one systematic review [45]. Future research should test this predictive model on a new dataset to determine its prognostic strength.

Conclusions

The results suggest that it is possible to identify patients at presentation who are at high risk of persistent disabling symptoms and those who are likely to recover, information essential to the successful targeting of services. It is important to determine whether those patients shown to have a reduced likelihood of recovery should be targeted for more intensive intervention or managed by alternative methods, whilst valuable resources may be better employed on others with a greater chance of recovery. Although an analysis of changes in Roland Morris disability scores suggests that a number of prognostic variables are linked to outcome, once a model is used which adjusts for the confounding effects of all significant variables, including treatment variables, only two contained additional, independent prognostic information. Participants improved more if their episodes of pain during the previous year were short-lived, while those with Middle Eastern and Chinese ethnicity demonstrated minimal improvement. The reasons for this require further investigation. In this report, both adjusted and unadjusted data are reported for clarity, but it is also important to remember that the baseline Roland Morris score itself may be a reasonable determinant of outcome at six months. The study did not support previous evidence that a wide range of factors could predict outcome.