Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck–shoulder pain

Björklund, Martin; Wiitavaara, Birgitta; Heiden, Marina

doi:10.1007/s11136-016-1373-8

Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck–shoulder pain

Open access
Published: 09 August 2016

Volume 26, pages 161–170, (2017)
Cite this article

Download PDF

You have full access to this open access article

Quality of Life Research Aims and scope Submit manuscript

Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck–shoulder pain

Download PDF

Martin Björklund^1,2,
Birgitta Wiitavaara¹ &
Marina Heiden¹

2179 Accesses
9 Citations
2 Altmetric
Explore all metrics

Abstract

Purpose

The aim was to determine the responsiveness and minimal important change (MIC) of the questionnaire ProFitMap-neck that measures symptoms and functional limitations in women with neck pain. The same measurement properties were determined for Neck Disability Index (NDI) for comparison purposes.

Methods

Longitudinal data were derived from two randomized controlled trials, including 103 and 120 women with non-specific neck pain, with questionnaire measurements performed before and after interventions. Sensitivity and specificity to discriminate between improved and not or little changed participants, based on categorization of a global rating of change scale (GRCS), were determined for the ProFitMap-neck indices and NDI by using area under receiver operating characteristic curves (AUC). Correlations between the GRCS anchor and change scores of the questionnaires were also used to assess responsiveness. The change score that showed the highest combination of sensitivity and specificity was set for MIC.

Results

The ProFitMap-neck indices showed similar responsiveness as NDI with AUC exceeding 0.70 (Range: ProFitMap-neck, 0.74–0.83; NDI, 0.75–0.86). The MIC in the two samples ranged between 6.6 and 13.6 % for ProFitMap-neck indices and 5.2 and 6.3 % for NDI. Both questionnaires had significant correlations with GRCS (Spearman’s rho 0.47–0.72).

Conclusions

Validity of change scores was endorsed for the ProFitMap-neck indices and NDI with adequate ability to discriminate between improved and not or little changed participants. Values of minimal important change were presented.

Responsiveness and minimal important changes for the Neck Disability Index and the Neck Pain Disability Scale in Italian subjects with chronic neck pain

Article 07 February 2015

Marco Monticone, Emilia Ambrosini, … Simona Ferrante

Responsiveness and minimal important change of the NeckPix© in subjects with chronic neck pain undergoing rehabilitation

Article 20 October 2017

Marco Monticone, Luca Frigau, … Francesco Mola

Reliability, responsiveness and interpretability of the neck disability index-Dutch version in primary care

Article 17 May 2014

Luc Ailliet, S. M. Rubinstein, … C. B. Terwee

Introduction

Neck pain is highly prevalent with a reported 1-year prevalence estimated to be 30 to 50 % in the general population [1]. Neck pain also contributes to activity limitations in 11 to 14 % of workers [2]. In the largest group of neck pain patients, the underlying cause of the pain is uncertain [3, 4]; hence, the designation is non-specific neck pain. The alleviation of symptoms and restoration of functional limitations are particularly important for neck pain sufferers without a clear pathophysiology. To evaluate and establish effective treatment and rehabilitation strategies, access to reliable and valid patient-reported outcome measures, i.e., standardized questionnaires measuring specific constructs of interest, is a necessity. There are a number of questionnaires available to measure pain and disability in people with neck pain. However, weaknesses in measurement properties of several questionnaires were recently recognised, and important methodological aspects to improve were, for example, content validity regarding the relevance and comprehensiveness of items and the use of better statistical methods in responsiveness studies [5, 6]. Also, Wiitavaara and co-workers [7] found a low correspondence between neck–shoulder pain questionnaires and the symptoms experienced by the sufferers, implying a questionable content validity of the questionnaires. One potential explanation for this may be that the neck pain sufferers’ experiences are seldom taken into account in the developmental process of the neck–shoulder pain questionnaires [7], even though it is recommended in the literature [6, 8–11].

The Profile Fitness Mapping neck questionnaire (ProFitMap-neck) is a questionnaire developed in collaboration with neck pain patients, designed to assess symptoms and functional limitations in people with neck pain [12]. It consists of a functional limitation scale and a symptom scale of which the latter is subdivided in separate indices for the intensity and frequency of symptoms. The two scales can also be combined in a compound total score. The content of ProFitMap-neck symptom scale had the best correspondence with experienced symptoms among subjects with chronic neck pain, compared with 9 other neck-specific questionnaires [7]. The function scale of ProFitMap-neck has not been compared in the same way, but items of this scale have shown associations with sensorimotor function tests in different groups of people with neck pain [13–16]. The overall validity and reliability of the questionnaire has been tested on patients with chronic whiplash-associated disorders, as well as chronic non-traumatic non-specific neck pain [12]. However, the validation study of Björklund and co-workers [12] had a cross-sectional design that assessed validity of single scores. To evaluate the ability of an instrument to detect change over time in the construct to be measured, a measurement property referred to as responsiveness [17], longitudinal study designs are necessary.

An issue related to responsiveness concerns the interpretation of a change score, i.e., the change of a score from baseline to a follow-up. It is important to know if a change score of an instrument reflects a change in the patient’s status that he/she would consider important. The cut-off score with the best discriminative ability between patients that have improved and not improved is often referred to as the minimal important change (MIC) of the instrument, defined as the smallest measured change score that patients perceive to be important [17, 18]. The knowledge of a questionnaire’s responsiveness and MIC is crucial for its use in the evaluation of treatment and rehabilitation. In clinical practice, it can be used to judge whether a patient has reached a change of importance, and in research, the measurement properties are useful for the analysis and interpretation of study results. The primary aim of the present study was to determine the responsiveness and MIC of the ProFitMap-neck and the Neck Disability Index (NDI) [19] in women with chronic non-specific neck–shoulder pain. A secondary aim was to compare the responsiveness between ProFitMap-neck and NDI. We chose to compare with NDI since it is the most frequently used and evaluated neck-specific questionnaire [5, 20, 21].

Materials and methods

Data for the current study were derived from two randomized controlled trials (ISRCTN trial registration numbers ISRCTN92199001 [22]) and ISRCTN49348025 [23]. Both trials had an observer-blinded three-arm parallel group design with baseline measures and follow-ups 1 week, 6 months and 12 months after an 11-week intervention. For the purpose of the current study, only the measurements at baseline and 1 week after intervention were used. Both trials were approved by the Ethical Review Board in Uppsala, Sweden, and informed consent was obtained from all individual participants included in the study. The two trials with their adherent samples will from here on be called trial I—sample I [22] and trial II—sample II [23], respectively.

Trial I

The purpose of trial I was to evaluate the effects of neck coordination exercise, compared to either strength training for the neck and shoulder regions or massage treatment, in 108 women with non-specific neck–shoulder pain [22]. The inclusion criteria for the study were women, age 25–65 years, with more than 3 months of non-specific neck pain with the neck region indicated as the dominant pain area on a pain drawing [24] and disability with limitations in performing everyday activities involving the neck, shoulders and arms according to DASH [25]. Excluded were those that had trauma-related neck pain, diagnosis of a psychiatric, rheumatic, neurological, inflammatory, endocrine or connective tissue disease, fibromyalgia, cancer, stroke, cardiac infarction or diabetes type I, surgery or fracture to the back, neck, or shoulder in the last 3 years, shoulder luxation in the last year or reported strenuous exercise >3 times/week during the last 6 months. All interventions comprised of 22 individually supervised treatment sessions. The neck coordination exercise was performed with a training device that participants wore on their head [26]. The exercise task was to control, through visual feedback via mirrors, the movement of a metal ball placed on the device with the aim to improve the fine movement control of the cervical spine. The strength training intervention consisted of isometric and dynamic exercises for the neck- and shoulder muscles, inspired by the training programme of Ylinen and co-workers [27]. The massage treatment consisted of classical massage for the back, neck and shoulders.

Trial II

In trial II, the purpose was to evaluate individualized treatment compared to non-individualized treatment or treatment as usual (participants received no treatment from the study and no restriction to what they were allowed to do) in 120 women with non-specific neck–shoulder pain [23]. The inclusion and exclusion criteria were the same as in trial I with the following exceptions: The age span in trial II was 20–65 years, pain duration was minimum 6 weeks, and participants were required to have between mild and severe disability according to NDI [19] (participants did not answer DASH in trial II) and impaired capacity to work due to neck problems [28]. Also, in trial II, strenuous exercise was not an exclusion criteria, but concurrent low back pain was. Participants of the two intervention groups received treatments two to three times per week for a period of 11 weeks. The individualized treatment was tailored to the individuals’ functional limitations and symptoms, as decided from a decision model comprising the five categories cervical mobility, neck–shoulder strength and motor control, eye–head–neck control, trapezius myalgia and cervicogenic headache. The non-individualized treatment included the same available treatment components but applied quasi-randomly [23].

Measurements

In both trial I and II, the participants answered a comprehensive set of questionnaires at each test occasion. This set included ProFitMap-neck [12] NDI [29] and a global rating of change scale (GRCS, only administered after intervention). In the present study, the GRCS is used as a comparator instrument and external anchor of change in relation to ProFitMap-neck and NDI.

Profile Fitness Mapping neck questionnaire

The two original scales of ProFitMap-neck, the functional limitation scale (function index) and the symptom scale (intensity index and frequency index), consist of 20 and 27 items. After a recent validation study [12], revisions of the scales were suggested by reducing items of the scales to 18 and 26, respectively. In the present study, the revised scales are used. Each item has six response alternatives with the following ranges: Function index (how do you manage to) from “very good, no problem, very satisfying, very likely” to “very bad, very difficult/impossible, very dissatisfying, very unlikely”; Symptom scale, intensity index (how much) from “nothing/none at all” to “almost unbearable/unbearable, all/maximally”; Symptom scale, frequency index (how often) from “never/very seldom” to “very often/always”. The index scores are normalized 0–100 with higher scores reflecting better function/better health (function index) and less symptoms/better health (symptom indices intensity index and frequency index). In addition, a total score is calculated as the average of the three indices. For a detailed description of items and method of index score calculation, see appendix in [12]. The ProFitMap-neck indices have shown good internal consistency in three different neck pain samples, with Cronbach’s α ranging between 0.88 and 0.96, and ICC test–retest reliability ranging between 0.80 and 0.91 [12].

Neck Disability Index

The NDI measures symptoms and disability related to neck pain [19]. It contains 10 items about pain intensity, concentration, headache and activities of daily living. The items have six response alternatives ranging from no disability (0) to total disability (5), thus the sum score ranges from 0 to 50. In the present study, the NDI index was normalized 0–100 with higher scores reflecting higher levels of disability. A recent review of psychometric properties of neck-specific questionnaires [5] concluded that the NDI is the most frequently validated neck questionnaire and that it has limited positive content validity, correlates with questionnaires measuring pain/physical functioning (r = 0.53–0.70), and moderate evidence for responsiveness. However, the reliability of NDI may not be sufficient [30], and the estimation of MIC seems uncertain with widely differing estimates between studies (for references, see [5]). Hence, the use of NDI in the current study might also contribute with more knowledge about the MIC of NDI.

Global rating of change scale

The global rating of change scale (GRCS) used in trial I and II was a single question, asking for the participant’s change after treatment, with responses on a balanced 7-point Likert scale: 1. Very much worse; 2. Much worse; 3. Minimally worse; 4. No change; 5. Minimally improved; 6. Much improved; 7. Very much improved. The Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT) recommends this 7-point scale (referring to it as the Patient Global Impression of Change Scale) to be a core outcome measure of global improvement in chronic pain clinical trials [31]. There are examples in the literature of GRCS with various numbers of response alternatives, usually ranging from 3 to 15 [32], but GRCS with 7 to 11 points seems to be most appropriate when taking reliability, discriminative ability and patient preferences into account [33].

The wording of the GRCS at evaluation one week after intervention was “Compared to before the treatment of the study started, my overall status is now” (trial I), and “Compared to before the treatment of the study started, my status regarding my neck–shoulder problems is now” (trial II). For the purpose of the present study, the GRCS was used as the external criterion of improved (participants rating 6 and 7) and no or little change (participants rating 3, 4 and 5) for the determination of responsiveness and MIC [34, 35]. Participants with GRCS rating 1 and 2 were excluded from the analysis [35].

Statistical analysis

As described previously, all questionnaire indices were expressed as a percentage of the maximum possible score, where a higher percentage reflects better health/function/less symptoms in ProFitMap-neck indices and more disability in NDI. If an item was omitted by a respondent, the maximum possible score of the index was adjusted by subtracting the maximum score for the item from the maximum possible score of the index before calculating the percentage. If the sum of maximum scores for the omitted items exceeded 50 % of the maximum possible score for the index, or more than half of the items were omitted, the form was considered non-valid.

In the text and tables, data are presented as number and proportion or mean and standard deviation. Responsiveness was determined using anchor-based methods [30, 36, 37]. Sensitivity and specificity to discriminate between improved and not or little changed participants, based on the GRCS categorization, were determined for the ProFitMap-neck indices and NDI. To this end, receiver operating characteristic (ROC) curves were fitted for sample I and II separately to illustrate the discriminating ability of the indices [34]. From each ROC curve, the area under the curve (AUC) and its 95 % confidence interval was calculated and used as the primary measure of responsiveness. The NDI scale was inverted in this calculation to simplify the comparison. An area value of 0.5 indicates discrimination by chance, and a value of 1 indicates perfect discrimination [38]. For the second measure of responsiveness, we calculated the correlation (Spearman’s rho) between the GRCS anchor and change scores (index score after treatment—index score before treatment). Based on the ROC analyses, the minimal important change (MIC) was determined as the change score that showed the highest combination of sensitivity and specificity [39, 40]. All statistical analyses were performed in IBM SPSS Statistics 22.0 for Windows (IBM Corp, Armonk, NY).

Results

The number of participants that completed the intervention was 89 in trial I and 104 in trial II. Four participants were excluded from the analysis because they rated <3 on GRCS (one participant from sample I and three from sample II). Of the remaining 88 participants in sample I, 47 rated an improvement in health after the intervention (i.e., 6 or 7 on the GRCS), and 41 were categorized as no or little change (i.e., rated 3, 4, or 5 on the GRCS). Of the remaining 101 participants in sample II, 54 rated an improvement and 47 did not do so. The characteristics and baseline measurements of the samples are shown in Table 1. The maximum possible score was reached at follow-up for five and six participants for the ProFitMap-neck function index and NDI, respectively. No participant reached the maximum possible score in any of the indices at baseline. Table 2 presents the change scores for each category in the two samples, including the proportion of missing items in the questionnaires.

Table 1 Characteristics and baseline measurements on all participants (n = 223)

Full size table

Table 2 Change scores for sample I and II, including the proportion of missing items in the questionnaires

Full size table

The AUC with 95 % confidence interval for the two samples is shown in Table 3. Overall, the ProFitMap-neck performed similarly to NDI, and the AUCs tended to be larger for sample II compared to sample I but the confidence intervals showed substantial overlap. Among the ProFitMap-neck indices, the function index had slightly lower AUC than the symptom indices.

Table 3 Area under the receiver operating characteristic curve (AUC) with 95% confidence interval for sample I and II

Full size table

In Table 4, the MIC and its corresponding sensitivity and specificity are shown for all indices in both samples. NDI had the lowest MIC in both samples. For sample I, this NDI-MIC value had the lowest sensitivity and specificity, but in sample II its sensitivity was higher. The highest combination of sensitivity and specificity was observed for the ProFitMap-neck symptom-intensity index in sample II. The highest MIC in both samples was obtained for the ProFitMap-neck symptom-frequency index. Overall, the MIC tended to be lower in sample II for all indices.

Table 4 Minimal important change (MIC) and its corresponding sensitivity and specificity for sample I and II

Full size table

For sample I, Spearman’s rho between GRCS and the change scores of ProFitMap-neck and NDI ranged between 0.47 (ProFitMap-neck function index) and 0.59 (ProFitMap-neck symptom-frequency index). For sample II, the correlation ranged between 0.56 (ProFitMap-neck function index) and 0.72 (NDI). All correlations were significant (p < 0.05).

Discussion

In the present study, we aimed to investigate the ProFitMap-neck performance by assessing its responsiveness, and compare that to NDI, in two samples of women with non-specific neck–shoulder pain. The results suggest that both measures possess similar ability to detect change in self-rated perceived health with AUC exceeding 0.7 which is a cut-off value that has been used to delineate adequate responsiveness [40–43]. While this was the first examination of responsiveness for ProFitMap-neck, several previous studies exist on this measurement property for NDI [30, 34, 36, 40–42, 44–47]. Most of these show results in concordance with the present study, except for two studies that found lower AUC for NDI (0.57 [36] and 0.59 [44]). In a review of measurement properties of eight neck-specific pain and disability questionnaires, where NDI but not ProFitMap-neck was included, it was concluded that NDI was one of two questionnaires that had better than limited evidence of responsiveness [5].

Correlation analyses between change scores and GRCSs showed significant associations for both ProFitMap-neck indices and NDI, which indicates that the GRCSs were valid anchors for our study [37, 48]. In contrast to the more general GRCS used in trial I, the GRCS in trial II explicitly expressed neck–shoulder problems and may therefore have better construct validity as an external anchor [32, 49]. This could have affected our results; however, correlations were only slightly higher in trial II, and earlier findings of similar reliability for questions on general perceived recovery compared to perceived change in neck pain [50] indicate that both types of questions could be used. Global rating of change scales of general perceived recovery seem to be the most common external anchors (see e.g. [30, 36, 40, 41, 46, 47]).

Minimal important change of normalized values in the two samples examined ranged between 6.6 and 13.6 % for the ProFitMap-neck indices and was 6.3 and 5.2 % for the NDI. The symptom-frequency index had the highest MIC in both samples. This may reflect the often existing temporal variation of symptoms in neck pain individuals [7, 51]. The symptom-frequency index had also the highest measurement error in the previous validation study of ProFitMap-neck [12]. However, pain frequency may still be important to measure in chronic pain clinical trials since temporal aspects of pain have shown to be a valid dimension discerned from pain intensity, therefore recommended as an outcome [31]. The MICs obtained for NDI are rather low compared with previous studies in chronic neck pain, showing a range of 5–19 % [30, 34, 36, 40–42, 44, 47, 52].One explanation for this may be the low mean NDI baseline scores of 28 and 23 NDI% in sample I and II, respectively. Association between NDI baseline scores and MIC was recently demonstrated, showing larger MIC for those above (i.e., with higher disability) compared to those below (i.e., with lower disability) median baseline score [42, 44, 52]. The same effect of baseline values on MIC in neck pain patients was also shown for pain intensity numerical rating scale [53], but not for Neck Pain Disability Scale [42]. In the comparison of MIC values of NDI and the ProFitMap-neck indices, the latter were slightly higher. However, the combination of sensitivity and specificity for the MICs was higher in all ProFitMap-neck indices in sample I and in the majority of the ProFitMap-neck indices in sample II. The comparison of the MIC of ProFitMap-neck with MIC of other neck-specific questionnaires beside NDI is hampered by the low number of studies and differing methodology to determine MIC. For comparable studies, Neck Pain and Disability Scale [41, 42] and Neck Bournemouth Questionnaire [54] had MIC of similar magnitude as ProFitMap-neck, whereas MIC reported for the Core Outcome Measure Index summary score was higher (20 and 27 %) [55, 56].

Methods to determine MIC can be sorted into anchor-based or distribution-based approaches. Distribution-based methods are conceptually different in being based on statistical characteristics of the sample distribution. These methods rather deal with minimal detectable change than any indication of the importance for the patient of the observed change, which is the ground for anchor-based methods [48, 57, 58]. In the current study, we used anchor-based methods for determining responsiveness and MIC, thereby considering patient perception as a key factor for the MIC [59] in accordance with its conceptual definition [17].

However, the reliance of anchor-based methods poses several challenges. The first concerns the validity of the external anchor. In line with many other studies [30, 34, 36, 40–42, 44–47, 53, 60], we used GRCS as the external anchor to discern improved versus no or little change. This method has been criticized, one reason being recall bias [32]. The COSMIN (Consensus-based Standards for the selection of health status Measurement Instruments) checklist points out that GRCS should not be regarded as a gold standard, and suggests that no gold standard exists for patient-reported outcomes except for longer versions of the same outcome as the one under test [17]. However, the same checklist recommends using a GRCS of the same construct as the instrument under study as a useful comparator with high face validity, and evidence supports the use of GRCS with 7–11 response alternatives [32]. Also, in a review on methodological quality of neck questionnaire studies, GRCS was deemed appropriate and the best criterion available [6]. A second challenge of anchor-based methods, brought up by de Vet and co-workers [57], is that they do not include any aspect of measurement precision, thereby leaving out information whether the MIC lies within measurement error, i.e., is smaller than minimal detectable change, of the tested scale or not. The MIC of the ProFitMap-neck indices established in the present study was smaller than the smallest detectable change earlier determined from test–retest of 45 subjects with non-specific neck pain [12]. The same situation applies to our result on the MIC for NDI, i.e., they were smaller than minimal detectable change observed in most other studies. As a matter of fact, MIC was always smaller than minimal detectable change in NDI (see compilation, Table 1 in [52]), meaning that MIC may be confounded with measurement error [58]. Thus, using minimal detectable change instead of MIC as cut-off in NDI and ProFitMap-neck increases the certainty of that measurement error will be exceeded and should therefore be the choice when a high rate false positive (low specificity) should be avoided. The MIC, expressed as the optimal point on the ROC curve for high sensitivity and specificity equally weighted, may be used as an alternative cut-off in situations where a low rate of false negative (high sensitivity) is equally important. Finally, the use of anchor-based methods to determine responsiveness is not suitable if the proportion of improved versus not improved are severely skewed with only few individuals in one category [61]. This was, however, not the case in either sample (Table 2).

Limitations of the study include the long time period of 12 weeks between measurements which may increase recall bias for the GRCS questions. Another aspect to consider is the generalizability of the results to other women with subacute and chronic non-specific neck pain. The recruitment procedure in both trials was partly done by advertising [22, 23], and samples should therefore be considered as convenience samples which constituted of women with relatively mild pain and disability. This may reduce the generalizability of results. Also, findings cannot be generalized to men with neck pain. A further limitation is that the interventions given could potentially have influenced the MIC differently, but separate analyses of each intervention group were not possible due to small group sample sizes. Finally, the small differences between trial I and II in respect of the inclusion criteria and wordings of the external anchors, and the differences in characteristics and baseline measurements, made us unwilling to pool the data into one sample. This could be seen as a drawback due to reduced sample size, but the number of participants in each sample was most likely adequate for our purpose [62]. With that in mind, the separate samples used could be regarded a strength of the study since confirmation of responsiveness across samples is recommended [37].

Conclusions

This study extends the knowledge of measurement properties of the ProFitMap-neck questionnaire by endorsing its validity for change scores in two groups of women with non-specific neck–shoulder pain. In both groups, adequate ability to discriminate between improved and not or little changed participants was demonstrated and values of important change presented. The responsiveness of the ProFitMap-neck was similar to that of NDI which, in turn, was similar to earlier findings corroborating NDI and ProFitMap-neck as responsive measures. Continuing future validation of the ProFitMap-neck is warranted and should include other neck pain conditions as well as men.

References

Hogg-Johnson, S., van der Velde, G., Carroll, L. J., Holm, L. W., Cassidy, J. D., Guzman, J., et al. (2008). The burden and determinants of neck pain in the general population: results of the Bone and Joint Decade 2000–2010 Task Force on Neck Pain and Its Associated Disorders. Spine (Phila Pa 1976), 33(4Suppl), S39–S51, doi:10.1097/BRS.0b013e31816454c8.
Cote, P., van der Velde, G., Cassidy, J. D., Carroll, L. J., Hogg-Johnson, S., Holm, L. W., et al. (2008). The burden and determinants of neck pain in workers: results of the Bone and Joint Decade 2000-2010 Task Force on Neck Pain and Its Associated Disorders. Spine (Phila Pa 1976), 33(4 Suppl), S60–S74, doi:10.1097/BRS.0b013e3181643ee4.
Borghouts, J. A. J., Koes, B. W., & Bouter, L. M. (1998). The clinical course and prognostic factors of non-specific neck pain: a systematic review. Pain, 77(1), 1–13. doi:10.1016/s0304-3959(98)00058-x.
Article CAS PubMed Google Scholar
Visser, B., & van Dieen, J. H. (2006). Pathophysiology of upper extremity muscle disorders. Journal of Electromyography and Kinesiology, 16(1), 1–16.
Article PubMed Google Scholar
Schellingerhout, J. M., Verhagen, A. P., Heymans, M. W., Koes, B. W., de Vet, H. C., & Terwee, C. B. (2012). Measurement properties of disease-specific questionnaires in patients with neck pain: a systematic review. Quality of Life Research, 21(4), 659–670. doi:10.1007/s11136-011-9965-9.
Article PubMed Google Scholar
Terwee, C. B., Schellingerhout, J. M., Verhagen, A. P., Koes, B. W., & de Vet, H. C. W. (2011). Methodological quality of studies on the measurement properties of neck pain and disability questionnaires: A systematic review. Journal of Manipulative and Physiological Therapeutics, 34(4), 261–272. doi:10.1016/j.jmpt.2011.04.003.
Article PubMed Google Scholar
Wiitavaara, B., Björklund, M., Brulin, C., & Djupsjöbacka, M. (2009). How well do questionnaires on symptoms in neck-shoulder disorders capture the experiences of those who suffer from neck-shoulder disorders? A content analysis of questionnaires and interviews. BMC Musculoskeletal Disorders, 10, 30. doi:10.1186/1471-2474-10-30.
Article PubMed PubMed Central Google Scholar
Bombardier, C., & Tugwell, P. (1987). Methodological considerations in functional assessment. Journal of Rheumatology, 14, 6–10.
PubMed Google Scholar
Guyatt, G. H., Feeny, D. H., & Patrick, D. L. (1993). Measuring health-related quality-of-life. Annals of Internal Medicine, 118(8), 622–629.
Article CAS PubMed Google Scholar
Hoving, J. L., O’Leary, E. F., Niere, K. R., Green, S., & Buchbinder, R. (2003). Validity of the neck disability index, Northwick Park neck pain questionnaire, and problem elicitation technique for measuring disability associated with whiplash-associated disorders. Pain, 102(3), 273–281.
Article PubMed Google Scholar
Streiner, D. L., & Norman, D. R. (1990). Health measurement scales: A practical guide to their development and use. Oxford: Oxford University Press.
Google Scholar
Björklund, M., Hamberg, J., Heiden, M., & Barnekow-Bergkvist, M. (2012). The ProFitMap-neck—reliability and validity of a questionnaire for measuring symptoms and functional limitations in neck pain. Disability and Rehabilitation, 34(13), 1096–1107. doi:10.3109/09638288.2011.635747.
Article PubMed Google Scholar
Rudolfsson, T., Björklund, M., & Djupsjöbacka, M. (2012). Range of motion in the upper and lower cervical spine in people with chronic neck pain. Manual Therapy, 17(1), 53–59. doi:10.1016/j.math.2011.08.007.
Article PubMed Google Scholar
Röijezon, U., Björklund, M., & Djupsjöbacka, M. (2011). The slow and fast components of postural sway in chronic neck pain. Manual Therapy, 16(3), 273–278. doi:10.1016/j.math.2010.11.008.
Article PubMed Google Scholar
Röijezon, U., Djupsjöbacka, M., Björklund, M., Häger-Ross, C., Grip, H., & Liebermann, D. (2010). Kinematics of fast cervical rotations in persons with chronic neck pain: a cross-sectional and reliability study. BMC Musculoskeletal Disorders, 11, 222. doi:10.1186/1471-2474-11-222.
Article PubMed PubMed Central Google Scholar
Sandlund, J., Roijezon, U., Björklund, M., & Djupsjöbacka, M. (2008). Acuity of goal-directed arm movements to visible targets in chronic neck pain. Journal of Rehabilitation Medicine, 40(5), 366–374.
Article PubMed Google Scholar
Mokkink, L. B., Terwee, C. B., Knol, D. L., Stratford, P. W., Alonso, J., Patrick, D. L., et al. (2010). The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: A clarification of its content. BMC Medical Research Methodology, 10, 22. doi:10.1186/1471-2288-10-22.
Article PubMed PubMed Central Google Scholar
van Kampen, D. A., Willems, W. J., van Beers, L. W., Castelein, R. M., Scholtes, V. A., & Terwee, C. B. (2013). Determination and comparison of the smallest detectable change (SDC) and the minimal important change (MIC) of four-shoulder patient-reported outcome measures (PROMs). Journal of Orthopaedic Surgery and Research, 8, 40. doi:10.1186/1749-799x-8-40.
Article PubMed PubMed Central Google Scholar
Vernon, H., & Mior, S. (1991). The Neck Disability Index: A study of reliability and validity. Journal of Manipulative and Physiological Therapeutics, 14(7), 409–415.
CAS PubMed Google Scholar
MacDermid, J. C., Walton, D. M., Avery, S., Blanchard, A., Etruw, E., McAlpine, C., et al. (2009). Measurement properties of the neck disability index: a systematic review. Journal of Orthopaedic and Sports Physical Therapy, 39(5), 400–417. doi:10.2519/jospt.2009.2930.
Article PubMed Google Scholar
Vernon, H. (2008). The Neck Disability Index: State-of-the-art, 1991–2008. Journal of Manipulative and Physiological Therapeutics, 31(7), 491–502. doi:10.1016/j.jmpt.2008.08.006.
Article PubMed Google Scholar
Rudolfsson, T., Djupsjöbacka, M., Häger, C., & Björklund, M. (2014). Effects of neck coordination exercise on sensorimotor function in chronic neck pain: A randomized controlled trial. Journal of Rehabilitation Medicine, 46(9), 908–914. doi:10.2340/16501977-1869.
Article PubMed Google Scholar
Björklund, M., Djupsjöbacka, M., Svedmark, A., & Häger, C. (2012). Effects of tailored neck-shoulder pain treatment based on a decision model guided by clinical assessments and standardized functional tests. A study protocol of a randomized controlled trial. BMC Musculoskeletal Disorders, 13, 75. doi:10.1186/1471-2474-13-75.
Article PubMed PubMed Central Google Scholar
Margolis, R. B., Tait, R. C., & Krause, S. J. (1986). A rating system for use with patient pain drawings. Pain, 24(1), 57–65.
Article CAS PubMed Google Scholar
Atroshi, I., Gummesson, C., Andersson, B., Dahlgren, E., & Johansson, A. (2000). The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: reliability and validity of the Swedish version evaluated in 176 patients. Acta Orthopaedica Scandinavica, 71(6), 613–618.
Article CAS PubMed Google Scholar
Röijezon, U., Björklund, M., Bergenheim, M., & Djupsjöbacka, M. (2008). A novel method for neck coordination exercise - a pilot study on persons with chronic non-specific neck pain. Journal of Neuroengineering and Rehabilitation, 5, 36. doi:10.1186/1743-0003-5-36.
Article PubMed PubMed Central Google Scholar
Ylinen, J. J., Hakkinen, A. H., Takala, E. P., Nykanen, M. J., Kautiainen, H. J., Malkia, E. A., et al. (2006). Effects of neck muscle training in women with chronic neck pain: One-year follow-up study. Journal of Strength and Conditioning Research, 20(1), 6–13.
PubMed Google Scholar
Martimo, K. P., Shiri, R., Miranda, H., Ketola, R., Varonen, H., & Viikari-Juntura, E. (2009). Self-reported productivity loss among workers with upper extremity disorders. Scandinavian Journal of Work, Environment & Health, 35(4), 301–308.
Article Google Scholar
Ackelman, B. H., & Lindgren, U. (2002). Validity and reliability of a modified version of the Neck Disability Index. Journal of Rehabilitation Medicine, 34(6), 284–287. doi:10.1080/165019702760390383.
Article PubMed Google Scholar
Cleland, J. A., Childs, J. D., & Whitman, J. M. (2008). Psychometric properties of the Neck Disability Index and Numeric Pain Rating Scale in patients with mechanical neck pain. Archives of Physical Medicine and Rehabilitation, 89(1), 69–74. doi:10.1016/j.apmr.2007.08.126.
Article PubMed Google Scholar
Dworkin, R. H., Turk, D. C., Farrar, J. T., Haythornthwaite, J. A., Jensen, M. P., Katz, N. P., et al. (2005). Core outcome measures for chronic pain clinical trials: IMMPACT recommendations. Pain, 113(1–2), 9–19.
Article PubMed Google Scholar
Kamper, S. J., Maher, C. G., & Mackay, G. (2009). Global rating of change scales: a review of strengths and weaknesses and considerations for design. Journal of Manual and Manipulative Therapy, 17(3), 163–170.
Article PubMed PubMed Central Google Scholar
Preston, C. C., & Colman, A. M. (2000). Optimal number of response categories in rating scales: reliability, validity, discriminating power, and respondent preferences. Acta Psychologica, 104(1), 1–15.
Article CAS PubMed Google Scholar
Pool, J. J. M., Ostelo, R., Hoving, J. L., Bouter, L. M., & de Vet, H. C. W. (2007). Minimal clinically important change of the neck disability index and the numerical rating scale for patients with neck pain. Spine, (Phila Pa 1976) 32(26), 3047–3051.
Soer, R., Reneman, M. F., Vroomen, P., Stegeman, P., & Coppes, M. H. (2012). Responsiveness and minimal clinically important change of the pain disability index in patients with chronic back pain. Spine, (Phila Pa 1976) 37(8), 711–715, doi:10.1097/BRS.0b013e31822c8a7a.
Cleland, J. A., Fritz, J. M., Whitman, J. M., & Palmer, J. A. (2006). The reliability and construct validity of the Neck Disability Index and patient specific functional scale in patients with cervical radiculopathy. Spine, (Phila Pa 1976) 31(5), 598–602, doi:10.1097/01.brs.0000201241.90914.22.
Revicki, D., Hays, R. D., Cella, D., & Sloan, J. (2008). Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. Journal of Clinical Epidemiology, 61(2), 102–109. doi:10.1016/j.jclinepi.2007.03.012.
Article PubMed Google Scholar
Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36.
Article CAS PubMed Google Scholar
Newell, D., & Bolton, J. E. (2010). Responsiveness of the Bournemouth questionnaire in determining minimal clinically important change in subgroups of low back pain patients. Spine (Phila Pa 1976), 35(19), 1801–1806, doi:10.1097/BRS.0b013e3181cc006b.
Young, B. A., Walker, M. J., Strunce, J. B., Boyles, R. E., Whitman, J. M., & Childs, J. D. (2009). Responsiveness of the Neck Disability Index in patients with mechanical neck disorders. Spine Journal, 9(10), 802–808. doi:10.1016/j.spinee.2009.06.002.
Article PubMed Google Scholar
Jorritsma, W., Dijkstra, P. U., de Vries, G. E., Geertzen, J. H. B., & Reneman, M. F. (2012). Detecting relevant changes and responsiveness of neck pain and disability scale and neck disability index. European Spine Journal, 21(12), 2550–2557. doi:10.1007/s00586-012-2407-8.
Article PubMed PubMed Central Google Scholar
Monticone, M., Ambrosini, E., Vernon, H., Brunati, R., Rocca, B., Foti, C., et al. (2015). Responsiveness and minimal important changes for the Neck Disability Index and the Neck Pain Disability Scale in Italian subjects with chronic neck pain. European Spine Journal, 24(12), 2821–2827. doi:10.1007/s00586-015-3785-5.
Article PubMed Google Scholar
Terwee, C. B., Bot, S. D. M., de Boer, M. R., van der Windt, D., Knol, D. L., Dekker, J., et al. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. Journal of Clinical Epidemiology, 60(1), 34–42. doi:10.1016/j.jclinepi.2006.03.012.
Article PubMed Google Scholar
Pereira, M., Cruz, E. B., Domingues, L., Duarte, S., Carnide, F., & Fernandes, R. (2015). Responsiveness and Interpretability of the Portuguese Version of the Neck Disability Index in Patients With Chronic Neck Pain Undergoing Physiotherapy. Spine (Phila Pa 1976), 40(22), E1180–1186, doi:10.1097/brs.0000000000001034.
Stewart, M., Maher, C. G., Refshauge, K. M., Bogduk, N., & Nicholas, M. (2007). Responsiveness of pain and disability measures for chronic whiplash. Spine, 32(5), 580–585. doi:10.1097/01.brs.0000256380.71056.6d.
Article PubMed Google Scholar
Vos, C. J., Verhagen, A. P., & Koes, B. W. (2006). Reliability and responsiveness of the Dutch version of the Neck Disability Index in patients with acute neck pain in general practice. European Spine Journal, 15(11), 1729–1736. doi:10.1007/s00586-006-0119-7.
Article PubMed Google Scholar
Young, I. A., Cleland, J. A., Michener, L. A., & Brown, C. (2010). Reliability, Construct Validity, and Responsiveness of the Neck Disability Index, Patient-Specific Functional Scale, and Numeric Pain Rating Scale in Patients with Cervical Radiculopathy. American Journal of Physical Medicine and Rehabilitation, 89(10), 831–839. doi:10.1097/PHM.0b013e3181ec98e6.
Article PubMed Google Scholar
de Vet, H. C. W., Ostelo, R. W. J. G., Terwee, C. B., van der Roer, N., Knol, D. L., Beckerman, H., et al. (2007). Minimally important change determined by a visual method integrating an anchor-based and a distribution-based approach. Quality of Life Research, 16(1), 131–142. doi:10.1007/s11136-006-9109-9.
Article PubMed Google Scholar
Mokkink, L. B., Terwee, C. B., Patrick, D. L., Alonso, J., Stratford, P. W., Knol, D. L., et al. (2010). The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. Journal of Clinical Epidemiology, 63(7), 737–745. doi:10.1016/j.jclinepi.2010.02.006.
Article PubMed Google Scholar
Ngo, T., Stupar, M., Cote, P., Boyle, E., & Shearer, H. (2010). A study of the test-retest reliability of the self-perceived general recovery and self-perceived change in neck pain questions in patients with recent whiplash-associated disorders. European Spine Journal, 19(6), 957–962. doi:10.1007/s00586-010-1289-x.
Article PubMed PubMed Central Google Scholar
Aublet-Cuvelier, A., Aptel, M., & Weber, H. (2006). The dynamic course of musculoskeletal disorders in an assembly line factory. International Archives of Occupational and Environmental Health, 79(7), 578–584. doi:10.1007/s00420-006-0092-9.
Article PubMed Google Scholar
Schuller, W., Ostelo, R. W. J. G., Janssen, R., & de Vet, H. C. W. (2014). The influence of study population and definition of improvement on the smallest detectable change and the minimal important change of the neck disability index. Health and quality of life outcomes, 12, 53. doi:10.1186/1477-7525-12-53.
Article PubMed PubMed Central Google Scholar
Kovacs, F. M., Abraira, V., Royuela, A., Corcoll, J., Alegre, L., Tomas, M., et al. (2008). Minimum detectable and minimal clinically important changes for pain in patients with nonspecific neck pain. BMC Musculoskeletal Disorders, 9, 43. doi:10.1186/1471-2474-9-43.
Article PubMed PubMed Central Google Scholar
Geri, T., Signori, A., Gianola, S., Rossettini, G., Grenat, G., Checchia, G., et al. (2015). Cross-cultural adaptation and validation of the Neck Bournemouth Questionnaire in the Italian population. Quality of Life Research, 24(3), 735–745. doi:10.1007/s11136-014-0806-5.
Article PubMed Google Scholar
Fankhauser, C. D., Mutter, U., Aghayev, E., & Mannion, A. F. (2012). Validity and responsiveness of the Core Outcome Measures Index (COMI) for the neck. European Spine Journal, 21(1), 101–114. doi:10.1007/s00586-011-1921-4.
Article CAS PubMed Google Scholar
Monticone, M., Ferrante, S., Maggioni, S., Grenat, G., Checchia, G. A., Testa, M., et al. (2014). Reliability, validity and responsiveness of the cross-culturally adapted Italian version of the core outcome measures index (COMI) for the neck. European Spine Journal, 23(4), 863–872. doi:10.1007/s00586-013-3092-y.
Article PubMed Google Scholar
de Vet, H. C., Terwee, C. B., Ostelo, R. W., Beckerman, H., Knol, D. L., & Bouter, L. M. (2006). Minimal changes in health status questionnaires: distinction between minimally detectable change and minimally important change. Health and Quality of Life Outcomes, 4, 54. doi:10.1186/1477-7525-4-54.
Article PubMed PubMed Central Google Scholar
de Vet, H. C. W., & Terwee, C. B. (2010). The minimal detectable change should not replace the minimal important difference. Journal of Clinical Epidemiology, 63(7), 804–805. doi:10.1016/j.jclinepi.2009.12.015.
Article PubMed Google Scholar
Cook, C. E. (2008). Clinimetrics Corner: The Minimal Clinically Important Change Score (MCID): A Necessary Pretense. Journal of Manual and Manipulative Therapy, 16(4), E82–E83.
Article PubMed PubMed Central Google Scholar
Farrar, J. T., Young, J. P., LaMoreaux, L., Werth, J. L., & Poole, R. M. (2001). Clinical importance of changes in chronic pain intensity measured on an 11-point numerical pain rating scale. Pain, 94(2), 149–158. doi:10.1016/s0304-3959(01)00349-9.
Article CAS PubMed Google Scholar
Leopold, S. S. (2013). Editor’s spotlight/take 5: Comparative responsiveness and minimal clinically important differences for idiopathic ulnar impaction syndrome (DOI 10.1007/s11999-013-2843-8). Clinical Orthopaedics and Related Research, 471(5), 1403–1405. doi:10.1007/s11999-013-2886-x.
Article PubMed PubMed Central Google Scholar
Terwee, C. B., Mokkink, L. B., Knol, D. L., Ostelo, R. W., Bouter, L. M., & de Vet, H. C. (2012). Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Quality of Life Research, 21(4), 651–657. doi:10.1007/s11136-011-9960-1.
Article PubMed Google Scholar

Download references

Acknowledgments

The study was funded by the Swedish Research Council for Health, Working Life and Welfare, Grant No: 2009-1403.

Funding

This study was funded by the Swedish Research Council for Health, Working Life and Welfare (Grant No: 2009-1403).

Author information

Authors and Affiliations

Centre for Musculoskeletal Research, Department of Occupational and Public Health Sciences, University of Gävle, SE-801 76, Gävle, Sweden
Martin Björklund, Birgitta Wiitavaara & Marina Heiden
Department of Community Medicine and Rehabilitation, Physiotherapy, Umeå University, SE-901 87, Umeå, Sweden
Martin Björklund

Authors

Martin Björklund
View author publications
You can also search for this author in PubMed Google Scholar
Birgitta Wiitavaara
View author publications
You can also search for this author in PubMed Google Scholar
Marina Heiden
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martin Björklund.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. This article does not contain any studies with animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Björklund, M., Wiitavaara, B. & Heiden, M. Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck–shoulder pain. Qual Life Res 26, 161–170 (2017). https://doi.org/10.1007/s11136-016-1373-8

Download citation

Accepted: 18 July 2016
Published: 09 August 2016
Issue Date: January 2017
DOI: https://doi.org/10.1007/s11136-016-1373-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck–shoulder pain