The foot function index is more sensitive to change than the Leeds Foot Impact Scale for evaluating rheumatoid arthritis patients after forefoot or hindfoot reconstruction

Muradin, Imraan; van der Heide, Huub J. L.

doi:10.1007/s00264-016-3113-7

The foot function index is more sensitive to change than the Leeds Foot Impact Scale for evaluating rheumatoid arthritis patients after forefoot or hindfoot reconstruction

Original Paper
Open access
Published: 03 February 2016

Volume 40, pages 745–749, (2016)
Cite this article

Download PDF

You have full access to this open access article

International Orthopaedics Aims and scope Submit manuscript

The foot function index is more sensitive to change than the Leeds Foot Impact Scale for evaluating rheumatoid arthritis patients after forefoot or hindfoot reconstruction

Download PDF

Imraan Muradin¹ &
Huub J. L. van der Heide¹

2503 Accesses
8 Citations
Explore all metrics

Abstract

Purpose

This study examines the responsiveness of the Foot Functional Index (FFI) and Leeds Foot Impact Scale for Rheumatoid Arthritis (LFIS-RA) in rheumatoid arthritis (RA) patients receiving a forefoot or hindfoot reconstruction.

Methods

This was a prospective cohort study including 30 rheumatoid arthritis patients with severe rheumatoid foot deformities in need for surgical correction. Responsiveness was measured using distribution-based methods (standardized effect size, standardized response mean and Guyatt responsiveness ratio) and anchor-based methods (receiver operating characteristics curves and correlation analyses) by making use of an anchor question. To examine the depth of the questionnaires we measured the floor and ceiling effects.

Results

The study population consisted of three males and 27 females, with a mean age of 62 years. The mean follow-up time was 38 months. Twenty-two feet received a forefoot reconstruction and eight feet a triple arthrodesis. For the FFI the SES was −0.80, SRM was −0.85 and the GRR was −1.25. For the LFIS-RA the SES was 0.58, SRM was 0.58 and the GRR was 0.88. The AUC was 0.741 and 0.645 for FFI and LFIS, respectively. Contrary to the LFIS-RA, the FFI showed a significant correlation between change score and the anchor question. Both questionnaires did not show a significant floor or ceiling effect.

Conclusion

The FFI showed a large responsiveness and the LFIS- RA showed moderate responsiveness in rheumatoid arthritis patients receiving forefoot or hindfoot surgery, without floor or ceiling effects in both questionnaires.

“Psychometric properties of the Norwegian foot function index revised short form”

Article Open access 03 May 2022

Many-faceted rasch calibration of the foot function index-revised short form

Article Open access 17 October 2022

Evaluation of the Dutch version of the Foot and Ankle Outcome Score (FAOS): responsiveness and Minimally Important Change

Article 24 December 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Despite recent advances in pharmacological management of rheumatoid arthritis, forefoot deformity, with its symptoms, remains a common problem, often requiring operative treatment [1]. The foot is most commonly affected early with a prevalence of up to 90 % for the metatarsophalangeal (MTP) joints and in 15 % the forefoot is the first manifestation of the disease [2]. Deformities are mostly found in the forefoot with symptoms such as atrophy, claw toes, dislocation of the plantar fascia and metatarsophalangeal subluxations [3]. Hindfoot involvement in RA is similarly common, with reported prevalence numbers as high as 50 % through 90 % [4]. The treatment of severe forefoot deformities consists of operative correction through repositioning of the metatarsophalangeal joints of the lesser rays in combination with an arthrodesis of the first metatarsophalangeal joint [3].

In regards to the hindfoot triple arthrodesis is the preferred technique [5].

Concerning the feet reconstruction in RA patients there is a need for structured evaluation of the subjective aspects of the patient’s symptoms [6]. Patient-reported outcome measurements (PROMs) are used to evaluate the outcome of operative treatments [2]. To be truly useful PROMs must exhibit good psychometric properties such as reliability, validity and responsiveness [7]. Responsiveness is defined as the ability of a measuring instrument to detect change when it has occurred [8]. The present literature describes 36 different PROMs used to measure foot problems in RA patients; solely five instruments were measured for responsiveness [9]. A worldwide (USA, Europe, Australia and Asia) used PROM to measure the impact of foot pathology on function in terms of pain, difficulty and activity restriction is the Foot Function Index [10, 11] as described by Budiman et al. Nelson et al. reported moderate to large responsiveness of the FFI in RA patients receiving foot or ankle surgery; however, they did not specify the operative procedure [12]. The responsiveness of the FFI in end stage ankle arthritis patients, who received either a total ankle replacement or an ankle arthrodesis, showed a high level of responsiveness (both the standardized effect size and standardized response mean above the threshold of 0.8 indicating a high level of responsiveness) [13].

The Leeds Foot Impact Scale for Rheumatoid Arthritis (LFIS-RA) described by Helliwell et al. is a PROM used to measure the outcome (impairment and activity limitations) in RA patients with foot deformities, in mostly eastern Europe countries, including England, Germany, Hungary and the Netherlands [14]. Initial results showed good psychometric properties [15]. The Dutch-translated version of the LFIS-RA showed excellent psychometric properties, but hasn’t been tested for responsiveness [16]. This study was conducted to determine the responsiveness of the FFI and LFIS-RA in RA patients with severe feet deformities and to compare them.

Methods

Participants

This prospective cohort study included 30 RA patients with typical RA deformities in need for operative treatment. Data was collected between 2009 and 2013. Regarding the FFI and LFIS, the patients were approached during the regular out-patient clinic appointments and agreed to participate in this study. An assistant who was not involved in the treatment approached the patients to obtain the anchor question. All patients were examined and operated by one orthopaedic surgeon.

Outcome assessment

The outcome was measured using the FFI and LFIS-RA pre-operative and post- operative. Post-operative an anchor question was added, using a seven point Likert-scale; ‘How did the situation change regarding your foot since the surgery? (1: very much deteriorated, 2: much deteriorated, 3: somewhat deteriorated, 4: about the same, 5: somewhat improved, 6: much improved, 7: very much improved). The FFI consists of 23 items (score range: 0–115) grouped into three subscales; pain, disability and activity limitation related to foot pathology [10]. Each question provides six alternatives. Important to mention is that a higher score indicates a worse outcome. The LFIS-RA test consist of 51 dichotomous items (score range: 51–102) grouped into four subscales; impairment, activities, participation, and footwear [15]. In contrary to the FFI, a higher LFIS-RA score means a better outcome.

Responsiveness

A distinction could be made between internal and external responsiveness [8]. Internal responsiveness is the ability of a measure to change over a particular time frame [17]. External responsiveness is the ability of a measure to change over a particular time frame related to an external measure [8]. The internal responsiveness could be measured using the standardized effect size (SES), standardized response mean (SRM) and Guyatt responsiveness ratio (GRR) [8]. The SES was calculated by dividing the mean change score by the standard deviation of the mean baseline score. The SRM was calculated by dividing the mean change score by the standard deviation of the mean change score. To determine the GRR the study population was divided into two groups based on the anchor question: patients that improved after the procedure and patient that showed little to no change after the procedure. The “improved” group consisted of people who answered “much improved” and “very much improved” to the anchor question; the “not convincible changed” group consisted of patients that answered “somewhat deteriorated” or “somewhat improved”. To calculate the GRR the mean change score in the “improved” group was divided by the standard deviation of the mean change score in the “not convincible changed” group [7]. For the above mentioned approaches a value of 0.5 or less represents an inadequate responsiveness, values between 0.50 and 0.80 a moderate internal responsiveness, and values of 0.80 or greater a large internal responsiveness [18].

External responsiveness

External responsiveness was measured using the area under the curve (AUC) within the ROC curve [19]. An AUC below 0.5 reflects random distribution, AUC between 0.5 and 0.7 is considered to have limited discrimination accuracy, AUC between 0.7 through 0.8 acceptable accuracy, AUC between 0.8 through 0.9 excellent accuracy and an AUC above 0.9 was considered to have outstanding accuracy [19]. In order to perform this analysis it was necessary to dichotomize the anchor question. The “much improved” and “very much improved” were categorized as the “improved” group, and the “somewhat improved”, “about the same”, “somewhat deteriorated”, “much deteriorated” and “very much deteriorated” as the “not improved” group. A practical cut-off point with optimal balance of sensitivity and specificity was chosen. A correlation analysis (Spearman correlation coefficient) provided an examination of the relationship between the questionnaire and anchor question [20].

Floor and ceiling effects

Floor and ceiling effects in the questionnaires could make it difficult to measure changes after interventions such as a surgery due to distortion of the score distribution. Floor or ceiling effects are considered to be present if more than 15 % of respondents achieved the lowest or highest possible score, respectively [20].

Results

Patient characteristics

The study population consisted of three males (10 %) and 27 females (90 %). The mean age of the group was 62 years (range 44–76; SD 8.7). All patients were reviewed after a mean follow-up of 38 months (range 5–61; SD 16). On 22 feet (73 %) a forefoot reconstruction was performed and on eight feet (27 %) a triple arthrodesis. Approximately 70 % of the patients reported they improved after the surgery (Table 1). During the first measurement, 25 patients had a positive RA-factor (Table 2). Roughly half of the patients used DMARDs during the first measuring point (Table 3).

Table 1 Frequency table for the outcome of the anchor question

Full size table

Table 2 Diagnosis at first measuring moment

Full size table

Table 3 Medication use at first measuring moment

Full size table

Internal responsiveness

Pre-operatively, the mean FFI total score was 55.5 (range 28–82; SD 14), and post-operatively it was 44.2 (range 24–66; SD 12). The mean change was a decrease of 11.3 points (range −42 to 13; SD 13). For the FFI the SES was −0.80, SRM was −0.85 and GRR was −1.25 (Table 4). As mentioned before, negative values indicate improvement. Pre-operatively, the mean LFIS-RA score was 76.8 (range 62–98; SD 8), and post-operative it was 81.2 (range 65–102; SD 10). The mean change was an increase of 4.4 points (range −12 to 23; SD 8). For the LFIS-RA the SES was 0.58, SRM was 0.58 and the GRR was 0.90 (Table 4).

Table 4 Descriptive statistics and internal responsiveness characteristics for the FFI and LFIS-RA

Full size table

External responsiveness

For the FFI questionnaire the AUC was 0.741 (CI 95 %: 0.558–0.924, SE 0.094, P = 0.025) (Fig. 1). For the LFIS-RA questionnaire we obtained an AUC of 0.645 (CI 95 %: 0.440–0.850, SE 0.104, P = 0.177) (Fig. 1). The optimal cut-off point for the FFI was 6 points, with a sensitivity of 81 % and a specificity of 57 %. For the LFIS-RA the cut-off point was 1.5 points, with a sensitivity of 75 % and a specificity of 57 %. There was a significant negative correlation (Spearman correlation coefficient, 0.396; P = 0.030) between the FFI change scores and anchor questions. No significant correlation between the LFIS-RA changes scores and anchor questions was found (Spearman correlation coefficient = 0.210; P = 0.266).

Floor and ceiling effect

Neither the LFIS or the FFI showed a significant floor or ceiling effect pre-operative and post-operative (Table 5), due to the fact that less than 15 % of the respondents achieved the highest or lowest possible score.

Table 5 Range of values and for the FFI and LFIS-RA

Full size table

Discussion

Multiple instruments are developed to measure foot function, foot pain and foot related disability in RA patients. A recent study reviewed the measurement properties of 36 different instruments, and concluded that solely five instruments were measured for responsiveness [9]. Our study is the first to evaluate the responsiveness and floor/ceiling effects of the FFI and LFIS questionnaire in RA patients who received forefoot and hindfoot correction, and boosts robustness by applying both anchor-based as well as distribution-based approaches to measure responsiveness.

We defined responsiveness as a parameter for measurement instruments to measure change over time. We defined internal responsiveness as the ability of a measure to change over time, and external responsiveness as a change over time corresponding with an external measure. We found a large internal responsiveness of the FFI, for the LFIS we found a moderate internal responsiveness. The use of GRR to measure responsiveness is a controversial topic. Although the use of the GRR is seen by some as the superior measurement for responsiveness by some researchers [8, 21], others claim that the GRR does not reflect the validity of the changed score [22, 23]. This difference arises due to the difference in the definition of responsiveness [7].

In regards to the external responsiveness the FFI had stronger discriminative abilities than the LFIS-RA in the present study population. The LFIS had shown below acceptable discriminative abilities. To demonstrate the above-stated, a 60-year-old patient who underwent forefoot surgery answered the anchored question with “much improved”. The FFI showed a change of −20 points (17 %) and the LFIS-RA a change of three points (2 %). Another patient, a 68-year-old woman, who underwent the same forefoot surgery, answered “very much improved” at the anchored question, showed a FFI change of −24 points (20 %) and an LFIS-RA change of three points (2 %). If we would rely solely on the LFIS-RA questionnaire, we wouldn’t be able to perceive the true change after the surgery.

External responsiveness was measured using the AUC within the ROC curve. A disadvantage of the ROC method is that the AUC has little meaning, and is primarily useful for ranking competing scales [21]. Another disadvantage is that the external criterion for change must be dichotomized; by merging the groups, valuable data is lost [19]. In choosing the cut-off points we strived for a high sensitivity and specificity. The FFI reached a higher sensitivity than the LFIS-RA, with an equal specificity in both questionnaires. Correlation analysis showed a significant correlation between the FFI change score and anchor question. In both the FFI and LFIS we did not observe any floor or ceiling effects.

A limitation of this study is the absence of a gold standard in measuring and expressing the responsiveness [22]. Furthermore, the use of an anchor question is prone to bias [24]. It is very difficult for people to remember their past state. They deduce their prior status from their present state and invoke an implicit theory of change to construct their prior state before the surgery [25], thereby creating a high correlation between measure of change and present state, but a low correlation between measure of change and prior state. Following on from the anchor question, it was remarkable that none of the patients reported “no change”. As Guyatt and Deyo remarked, the answer on the anchor question will never solitarily rely on operative outcome, but subsequently measures satisfaction with the program, rehabilitation process, or desire to show gratitude to those who have spent time and effort trying to help the patient [7]. The FFI questionnaire is an attractive measurement for physicians who wish to have a sensitive, reliable and responsive questionnaire for routine clinical practice [26].

This study shows that the FFI reaches better properties regarding the responsiveness, compared to the LFIS-RA questionnaire. We could conclude that the use of the FFI is preferred for RA patients with the above-mentioned deformities. A possible explanation for the difference could be that the LFIS was validated in a study population lacking RA patients with severe deformities [15].

Conclusion

The FFI showed a large responsiveness and the LFIS- RA showed moderate responsiveness in RA patients receiving forefoot or hindfoot surgery, without floor or ceiling effects in both questionnaires.

References

Louwerens JWK, Schrier JCM (2013) Rheumatoid forefoot deformity: pathophysiology, evaluation and operative treatment options. Int Orthop 37(9):1719–1729. doi:10.1007/s00264-013-2014-2
Article PubMed PubMed Central Google Scholar
Trieb K et al (2013) The Weil osteotomy for correction of the severe rheumatoid forefoot. Int Orthop 37(9):1795–1798. doi:10.1007/s00264-013-2011-5
Article PubMed PubMed Central Google Scholar
Van der Heide HJ, Louwerens JW (2010) Reconstructing the rheumatoid forefoot. Foot Ankle Surg 16(3):117–121. doi:10.1016/j.fas.2009.07.001
Article PubMed Google Scholar
Ohly NE, Breusch SJ (2013) Additive hindfoot arthrodesis for rheumatoid hindfoot disease: a clinical study of patient outcomes and satisfaction. Clin Rheumatol 32(12):1777–1782. doi:10.1007/s10067-013-2343-y
Article PubMed Google Scholar
Child BJ, Hix J, Catanzariti AR et al (2009) The effect of hindfoot realignment in triple arthrodesis. J Foot Ankle Surg 48(3):285–293. doi:10.1053/j.jfas.2009.02.006
Article PubMed Google Scholar
Graves SC, Mann RA, Graves KO (1993) Triple arthrodesis in older adults. results after long-term follow-up. JBJS J Bone Joint Surg 75(3):355–362
CAS Google Scholar
Guyatt GH, Deyo RA, Charlson M et al (1989) Responsiveness and validity in health status measurement: a clarification. J Clin Epidemiol 42(5):403–408. doi:10.1016/0895-4356(89)90128-5
Article CAS PubMed Google Scholar
Husted JA, Cook RJ, Farewell VT et al (2000) Methods for assessing responsiveness: a critical review and recommendations. J Clin Epidemiol 53(5):459–468. doi:10.1016/S0895-4356(99)00206-1
Article CAS PubMed Google Scholar
Van der Leeden M, Steultjens MP, Terwee CB et al (2008) A systematic review of instruments measuring foot function, foot pain, and foot-related disability in patients with rheumatoid arthritis. Arthrit Rheuma 59(9):1257–1269. doi:10.1002/art.24016
Article Google Scholar
Budiman Mak E, Conrad KJ, Roach KE (1991) The foot function index: A measure of foot pain and disability. J Clin Epidemiol 44(6):561–570. doi:10.1016/0895-4356(91)90220-4
Article CAS PubMed Google Scholar
Budiman Mak E, Conrad KJ, Mazza J et al (2013) A review of the foot function index and the foot function index—revised. J Foot Ankle Res 6(1):5. doi:10.1186/1757-1146-6-5
Article PubMed PubMed Central Google Scholar
SooHoo NF, Vyas R, Samimi D (2006) Responsiveness of the foot function index, aofas clinical rating systems, and sf-36 after foot and ankle surgery. Foot Ankle Int 27(11):930–934. doi:10.3113/fai.2012.0057
PubMed Google Scholar
Madeley NJ, Wing KJ, Topliss C et al (2012) Responsiveness and validity of the sf-36, ankle osteoarthritis scale, AOFAS ankle hindfoot score, and foot function index in end stage ankle arthritis. Foot Ankle Int 33(1):57–63. doi:10.3113/fai.2012.0057
Article PubMed Google Scholar
Woodburn J et al (2012) Adaptation and crosscultural validation of the foot impact scale for rheumatoid arthritis using Rasch analysis. Arthrit Care Res 64(7):986–992
Google Scholar
Helliwell P, Reay N, Gilworth G et al (2005) Development of a foot impact scale for rheumatoid arthritis. Arthrit Rheuma 53(3):418–422. doi:10.1002/art.21176
Article Google Scholar
Woodburn J, Vliet Vlieland TP, van der Leeden M et al (2011) Rasch analysis of dutch-translated version of the foot impact scale for rheumatoid arthritis. Rheumatology 50(7):1315–1319. doi:10.1093/rheumatology/ker003
Article PubMed Google Scholar
De Vet HC, Terwee CB, Bouter LM (2003) Current challenges in clinimetrics. J Clin Epidemiol 56(12):1137–1144. doi:10.1016/j.jclinepi.2003.08.012, doi: 0.1017/CBO9780511996214.001
Article PubMed Google Scholar
McGough J, Faraone SV (2009) Estimating the size of treatment effects: moving beyond p values. Psychiatry 6(10):21–29
PubMed PubMed Central Google Scholar
Deyo RA, Centor RM (1986) Assessing the responsiveness of functional scales to clinical change: an analogy to diagnostic test performance. J Chron Dis 39(11):897–906. doi:10.1016/0021-9681(86)90038-X
Article CAS PubMed Google Scholar
Terwee CB, Bot SD, de Boer MR et al (2007) Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol 60(1):34–42. doi:10.1016/j.jclinepi.2006.03.012
Article PubMed Google Scholar
Deyo RA, Diehr P, Patrick DL (1991) Reproducibility and responsiveness of health status measures. statistics and strategies for evaluation. Control Clin Trials 12(4 Suppl):142–158S. doi:10.1016/S0197-2456(05)80019-4
Article Google Scholar
De Vet HC, Terwee CB, Mokkink LB, Knol DJ (2011) Measurement in medicine. a practical guide. Cambridge University Press, Cambridge. doi:10.1007/s11136-012-0123-9
Book Google Scholar
Hays RD, Hadorn D (1992) Responsiveness to change: an aspect of validity, not a separate dimension. Qual Life Res 1(1):73–75. doi:10.1007/BF00435438
Article CAS PubMed Google Scholar
Schwartz CE, Andresen EM, Nosek MA et al (2007) Response shift theory: important implications for measuring quality of life in people with disability. Arch Phys Med Rehabil 88(4):529–536. doi:10.1016/j.apmr.2006.12.032
Article PubMed Google Scholar
Ross M, Ross (1989) Relation of implicit theories to the construction of personal histories. Psychol Rev 96(2):341–357. doi:10.1037/0033-295x.96.2.341
Article Google Scholar
Walmsley S, Williams AE, Ravey M, et al (2010) The rheumatoid foot: A systematic literature review of patient-reported outcome measures. J Foot Ankle Res 3:12

Download references

Author information

Authors and Affiliations

Department of Orthopaedics, Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands
Imraan Muradin & Huub J. L. van der Heide

Authors

Imraan Muradin
View author publications
You can also search for this author in PubMed Google Scholar
Huub J. L. van der Heide
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Huub J. L. van der Heide.

Ethics declarations

Conflict of interest

There are no conflicts of interest.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Muradin, I., van der Heide, H.J.L. The foot function index is more sensitive to change than the Leeds Foot Impact Scale for evaluating rheumatoid arthritis patients after forefoot or hindfoot reconstruction. International Orthopaedics (SICOT) 40, 745–749 (2016). https://doi.org/10.1007/s00264-016-3113-7

Download citation

Received: 16 July 2015
Accepted: 31 December 2015
Published: 03 February 2016
Issue Date: April 2016
DOI: https://doi.org/10.1007/s00264-016-3113-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The foot function index is more sensitive to change than the Leeds Foot Impact Scale for evaluating rheumatoid arthritis patients after forefoot or hindfoot reconstruction