Background

Minimal clinically important change (MCIC) represents the patient-perceived improvement in an outcome after treatment [1,2,3,4], in an individual or within a group over time [5,6,7]. Impaired knee flexion range of motion (flexion) is a characteristic clinical feature in people with knee OA [8] which impacts on function, mobility, quality of life and independence [9,10,11]. Moreover, restricted knee movement is one of the ten key signs recommended for diagnosing knee OA by the European Alliance of Associations for Rheumatology [12]. Knee flexion is a valid and convenient measure used in clinical and research settings. Therefore, estimates of MCIC of knee flexion are necessary to interpret treatment efficacy and calculate the sample size for research studies [13, 14].

The anchor method is the most frequently used method to calculate MCIC, where values are estimated using an external scale (termed ‘anchor’) [15, 16]. The global rating of change scale is commonly used as the ‘anchor’ in studies. For example, patients are asked a transition question after treatment: ‘overall, how are your hip or knee problems now, compared to before treatment?’ with response categories ‘much better’, ‘a little better’, ‘about the same’, ‘a little worse’ and ‘much worse’ [5, 6]. The MCIC is the average pre-post change score of the sub-group of patients who responded ‘a little better’. Some studies use multiple anchors to improve the validity of results [17, 18]; for example, using cut points for the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), 36-Item Short-Form Health Survey and the Back Depression scale [19]. However, when selecting an anchor, the anchor should be relevant to the disease condition, clinicians should be familiar with it (clinical acceptance) and there should be a relationship between the anchor and the outcome measure being evaluated for MCIC [20, 21].

Knee flexion limitation is associated with joint pain and limited function [8, 9, 22]. Non-surgical interventions (for example physiotherapy, manual therapy and exercise) could improve knee flexion, relieve pain and improve function in people with knee OA [23,24,25]. Pain relief has been associated with knee flexion improvement in people with knee OA awaiting knee replacement [26]. Increased flexion has been related to improved function in people with knee OA after total knee replacement [27]. Therefore, change in (Δ, mean change between baseline and immediately after the intervention) knee flexion after treatment may be associated with Δ pain and Δ function.

The MCIC of knee flexion in people with knee OA has not been previously reported. MCIC estimates for pain and function in knee OA after non-surgical interventions have been established for several patient-reported outcome measures. These include the WOMAC function subscale (function-WOMAC) and the Visual Analogue Scale for pain (pain-VAS) [28, 29]. We hypothesised that the relationship between Δ flexion and Δ pain and/or Δ function measured by tools with known MCIC can be used to estimate the MCIC of knee flexion.

Therefore, this study aimed to use meta-analysis to infer the MCIC of knee flexion, using relationships between Δ knee flexion and Δ pain and/or Δ function measured by tools with known MCIC in people with knee OA after non-surgical interventions.

Methods

We estimated the MCIC of knee flexion using a meta-analytical approach based on anchor method concepts. Data were obtained from published studies through a rapid systematic review, which was designed and reported according to the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) statement [30] and Cochrane rapid review guideline [31]. The protocol was registered on PROSPERO (registration number: CRD42022323927).

Literature search

We searched in databases MEDLINE (EBSCO), Cochrane Central Register of Controlled Trials, Web of Science and Cumulative Index to Nursing and Statistics in practice Allied Health Literature (CINAHL) from inception up to September 2023. A comprehensive search strategy was developed (Additional file 1) to capture all relevant articles, and database-specific MESH terms were used. We limited our search to studies published in English, and grey literature was not searched. The records were exported to EndNote version X9.2 for reference management.

Study screening

Covidence software (Covidence systematic review software, Veritas Health Innovation, Melbourne, Australia (www.covidence.org)) was used to manage the selection process. Records identified in the search were uploaded, and duplicates were removed. Two independent reviewers screened titles, abstracts and then full texts (D. S., T. S., J. C.). Disagreements were resolved by discussion with a third reviewer.

Study selection criteria

We included randomised clinical trials of people with knee OA (diagnosed clinically or radiographically). Studies were excluded if the study included participants with knee OA together with participants with other disorders or OA in other joints that were not reported separately (Table 1). Studies included non-surgical interventions with a treatment duration of ≤ 3 months. We limited studies of treatment duration ≤ 3 months because estimates of MCIC are sensitive to treatment duration and follow-up time [13]. Studies were excluded if they reported on surgical interventions.

Table 1 Eligibility criteria of included studies

We included studies that reported Δ knee flexion as well as Δ pain or Δ function outcomes measured using tools with established MCICs [32]. The tools included intermittent and constant osteoarthritis pain, Knee injury and Osteoarthritis Outcome Score, Lower Extremity Functional Scale, Numeric Pain Rating Scale, Patient-Reported Outcome Measurement Information System, and 36-Item Short-Form Health Survey, VAS and WOMAC. We included studies that measured flexion using a universal goniometer, electrogoniometer or inclinometer in a static position as flexion assessed using the above tools is considered a valid and accessible outcome measure in knee OA [33,34,35]. Studies that assessed knee flexion during functional activities (for example gait, squatting, stair climbing) were excluded as they do not consider the full flexion range. Studies which reported Δ knee flexion as well as Δ pain or Δ function outcomes were included. However, if studies did not report mean change but reported mean values of pre-intervention (baseline) and post-interventions, they also were included. Studies that only reported effect sizes, median measures and only figures (did not report variance) were excluded.

The risk-of-bias assessment

The risk of bias in the included studies was assessed using Version-2 of the Cochrane risk-of-bias tool for randomised trials (RoB 2) [36]. This instrument comprises five domains: bias arising from the randomisation process, bias due to deviations from intended interventions, bias due to missing outcome data, bias in the measurement of the outcome and bias in the selection of the reported result. Judgements for these domains and the overall risk of bias were estimated as ‘low, ‘high’ risk of bias or ‘unclear’. The review team considered the aim of the risk-of-bias assessment to assess the effects of assignment to intervention (the ‘intention-to-treat’ effect). When the assessment method of knee flexion is not reported in the primary article, the reference article or study protocol was checked to obtain that information. The risk of bias in all included studies was assessed by the principal investigator (D. S.), and a random sample of 35% of studies had a second review (T. S., J. C.) to improve the accuracy [37].

Data extraction

Data extraction included the following: study design, sample size, characteristics of participants and details of the interventions and outcomes. We extracted mean change before and immediately after the intervention (Δ) and standard error (SE) of flexion, pain and function outcomes. If studies did not report mean change (inferential statistics), we extracted mean values of pre-intervention (baseline) and post-intervention (descriptive statistics). Knee flexion change was extracted for the index knee (the most affected knee). If the index knee could not be identified (for example in bilateral knee OA), both right and left knee flexion data were extracted. The principal investigator (D. S.) performed data extraction.

Data analysis

First, we standardised Δ knee flexion of the index knee, Δ pain and Δ function. Where studies reported flexion data in both knees, average flexion was calculated. If a study did not report Δ flexion, it was calculated by subtracting pre-mean and post-mean according to Chapter 6.5.2.8 in the Cochrane handbook [38]. While this is likely to overstate the data (assuming those values are independent of each other as pre-mean is not predictive of post-mean), this is the only possible way to calculate the mean change using the given data of most studies. If the SE of mean change was not reported, it was calculated from standard deviation (SD) or confidence intervals (CI) using standard methods [38].

If studies reported pain-VAS in different scales, for example 0 to 10 mm or 0 to 100 mm, all the pain-VAS scores were standardised into a 0 to 100 mm scale where 0 = no pain at all and 100 = worst pain. All function-WOMAC scores were standardised to a 0 to 100 scale (0 = best function and 100 = worst function) where reported as 0 to 17 or 0 to 65.

Second, meta-analyses were performed to estimate the relationships between Δ flexion and Δ pain and/or Δ function. When a study had two or more interventions, each intervention arm was included as a separate cohort because each cohort was independent. However, study arms without interventions, for example waiting for treatment, were excluded. When studies reported both active and passive flexion, we included active flexion data only to avoid overestimating the same participants [38]. For this study, pain was examined in three categories: rest, during activity and general. Where pain was reported as ‘pain’, ‘pain intensity’ or ‘pain level’ without qualifiers, they were categorised to pain-general.

Only non-surgical intervention studies were included, but they comprised pharmacological, non-pharmacological or mixed interventions (type of interventions). Knee flexion can be measured in supine or prone (position) and actively or passively (mode). However, some studies did not provide this level in detail (missing data). Therefore, we examined the sensitivity of the data to these factors using data visualisation techniques and determined whether all data could be pooled or whether it should be analysed separately. Where there was no apparent effect due to these factors, we pooled knee flexion data regardless of the type of intervention, position or mode. In this way, the papers with missing data with regard to position or mode could be included in the pooled analysis.

To estimate relationships between Δ flexion and Δ pain and/or Δ function, meta-analytical models were developed using ‘R’ software [39] using the ‘brms’ package [40]. Separate analyses were conducted for each relationship, e.g. Δ flexion and Δ pain at rest and Δ flexion and Δ pain during activity. The Bayesian hierarchical models were used to determine relationships between variables: Δ flexion (Y, response) with Δ pain or Δ function (X, predictor). Errors-in-variables models were developed because both the predictor and the response variables have measurement errors [41]. We considered flexion as the response variable and pain or function as predictors for the convenience of implementing MCIC of knee flexion.

Model in general form

$$\Delta flexion\sim {N({\beta }_{0},{\tau }_{study})+(\beta }_{pain}\cdot \Delta pain)+N(0,{\tau }_{arm}),{\Delta pain}_{j}\sim N({{\mu }_{pain}}_{j},{{\sigma }_{pain}}_{j})$$

In this model, \(\Delta flexion\) and \(\Delta pain\) have uncertainty components, and \({\beta }_{0}\) is the population intercept. \({\tau }_{study}\) represents the heterogeneity between-study variation, and \({\tau }_{arm}\) represents the heterogeneity between intervention arms.

\({\Delta pain}_{j}\sim N({{\mu }_{pain}}_{j},{{\sigma }_{pain}}_{j})\): Latent variable representing the true value of the j (any given study) is being estimated based on the mean (μ) and standard error (σ). The above model is expressed as the following code.

Example model

$$(\mathrm{bf}\ (\mathrm{flexion}.\mathrm{change}\ |\mathrm{se}\ (\mathrm{SE}\_\mathrm{flexion}.\mathrm{change})\sim 1 +\mathrm{ me}\ (\mathrm{function}.\mathrm{change},\mathrm{ SE}\_\mathrm{function}.\mathrm{change}) + (1|\mathrm{Study}) + (1|\mathrm{Study}:\mathrm{Groups}))$$

In this model syntax (Bayesian regression model), ‘(1|Study) + (1|Study:Groups)’ indicates intervention arms (included hierarchical or ‘random’ effects), which are correlated across responses (univariate-normal distribution of intercepts).

Priors were intended to be weakly informative. The coefficient prior β was \(\mathrm{N}(\mathrm{0,1})\); on the response scale, this suggests a maximal effect of a two-unit change in flexion for a one-unit change in the predictor. The maximum physiologic flexion range of the knee joint is about 135°. We presumed that the maximal effect of 60° in flexion 60/100 change in the pain and 60/100 function could be possible. Posterior mean and credible intervals (CrI) were implemented and visualised for all analyses using the packages ‘ggdist’ [42] and ‘ggplot2’ [43]. Heterogeneity had a normal prior distribution with mean and standard deviation (\(\tau\), standard deviation of the between-study variability and between intervention arms). \(\tau\) had a half-Cauchy hyperprior, HC (0, 1), intended to be weakly informative [44]. Finally, the goodness of fit of the model was assessed by a posterior predictor check. Relationships are presented as the slope (β), intercepts and heterogeneity between studies and intervention arms with 90% CrI.

Third, the MCIC of flexion was estimated if only the uncertainty of the above relationships was sufficiently low. To estimate the MCIC of knee flexion, we post-processed the above models with the established MCIC estimates for pain and function. For example, MCIC of pain-VAS = − 19.9 (− 21.6 to − 17.9) in a 0–100-mm scale [45], MCIC of function-WOMAC (out of 100) = − 9.1 (− 10.5 to − 7.5) [45], − 17.13 (− 20.07 to 14.19) [29] and − 17.02 (− 20.15 to − 13.9) [29].

MCIC estimates for knee flexion using only reported-supine knee flexion data

However, as the supine position is frequently considered the appropriate knee flexion measurement position [46], a separate analysis was performed, including only reported-supine knee flexion data. First, relationships between Δ supine-active flexion with Δ pain and Δ function were established, and MCIC estimates of supine-active flexion were estimated as above. Second, relationships of pooled-supine flexion (mode = supine and position = active, passive or position not reported) with Δ pain and Δ function were also established. Then, MCIC estimates of pooled-supine flexion were estimated as above.

Results

Study selection

The search yielded 7452 records; 4860 records were screened after removing duplicates. Title and abstract screening yielded 428 records. After screening full texts, 72 studies (k = 72, sample n = 5174) (Fig. 1) were identified that reported Δ flexion with Δ pain or Δ function. A list of references for included studies is available in Additional file 2.

Fig. 1
figure 1

PRISMA flow diagram

There was a sufficient number of studies [37] to determine the relationships between Δ flexion with Δ pain using VAS (pain-VAS) and Δ function using WOMAC function subscale (function-WOMAC) only. Therefore, meta-analyses included 61 studies with 140 study arms (k = 61, n = 4516) that reported Δ flexion and Δ pain-VAS or Δ function-WOMAC. Eleven studies (Table 2) reported Δ pain or Δ function using other tools, for example Numeric Pain Rating Scale and Knee Injury and Osteoarthritis Outcome Score.

Table 2 Study characteristics

The risk of bias in included studies

The overall risk of bias was low in 47.2% of the studies, while 45.8% had some concerns, and five studies (7%) were of high risk of bias (Fig. 2, Additional file 3). The risk of bias arising from randomisation was low in 61.1% of the studies, while 36.1% had some concerns, and 2.8% of studies were of high risk. The risk of bias due to deviations from intended interventions was low for 55.6%, while 44.4% had some concerns. The risk of bias due to missing outcome data was low in 59.7% of studies, and others (40.3%) had some concerns. The risk bias in measuring the outcomes (63.9%) and selecting the reported results was low (87.5%) in most studies. Therefore, we included all the studies in the data analysis.

Fig. 2
figure 2

Risk of bias in included studies using RoB 2 tool [35]

Study characteristics of included studies

All the studies were individually randomised parallel group trials. The mean age ranged from 49 to 71 years. The interventions used in the included studies were exercise, kinesio taping, laser therapy, electrotherapy, shock wave therapy and nonsteroidal anti-inflammatory drugs. The treatment duration ranged from one session to 3 months.

Changes in knee flexion, pain and function after interventions

The mean Δ knee flexion ranged between − 6.4 and 59.8°. Δ pain-VAS (/100) at rest, during activity and general ranged from 8.0 to − 39.0, − 7.8 to − 58.8 and 12.8 to − 59.5, respectively. The Δ function-WOMAC ranged from 8.8 to − 41.5/100 (Additional file 4A, B).

Relationship between change in flexion with changes in pain and function

After examining the sensitivity of knee flexion data to type of interventions, position and mode using data visualisation techniques, we identified there was no apparent effect due to these factors (Fig. 3A, B, C). Therefore, we pooled knee flexion data regardless of the type of intervention, position or mode.

Fig. 3
figure 3

A Data distribution of mean change in knee flexion between different nonsurgical interventions. Pharm, pharmacological interventions; Non_pharm, nonpharmacological interventions; mixed, both pharmacological and non-pharmacological interventions; flexion.change, mean change between baseline and immediately after the intervention. B Data distribution of mean change in knee flexion between different knee flexion measurement positions. NR, measurement position not reported; prone, prone position; supine, supine position; flexion.change, mean change between baseline and immediately after the intervention. C Data distribution of mean change in knee flexion between different methods of flexion assessment (mode). Active, measurement mode-active flexion; NR, measurement mode not reported; passive, measurement mode-passive flexion; flexion.change, mean change between baseline and immediately after the intervention

Meta-analyses included 140 intervention arms of 61 studies (n = 4516) that reported Δ flexion with Δ pain-VAS or Δ function-WOMAC. Bayesian meta-analytic models found linear relationships between Δ flexion with Δ pain and Δ function (Table 3).

Table 3 Parameter estimates for relationships between change in flexion with changes in pain and function

Linear relationships between AQ6Δ pain at rest (0-100mm VAS) with Δ flexion was -0.29, (-0.44; -0.15) (β (CrI)). Relationships between Δ pain during activity VAS and Δ flexion were − 0.29 (− 0.41, − 0.18), and Δ general pain-VAS and Δ flexion were − 0.33 (− 0.42, − 0.23). The relationship between Δ function-WOMAC out of 100) and Δ flexion was − 0.15 (− 0.25, − 0.07).

Increased Δ flexion was associated with decreased Δ pain-VAS and increased Δ function-WOMAC. Since the uncertainty of the above relationships was sufficiently low MCIC of knee flexion, using all the above relationships was estimated (Table 4).

Table 4 Parameter estimates for MCIC knee flexion

Point estimates of MCIC of knee flexion ranged from 3.8 to 6.4° with a total range of 90% CrI 1.6 to 9.2° (Figs. 4A, B, C and 5A, B, C).

Fig. 4
figure 4

Relationships between changes in A pain at rest, B during activity, C pain-general with change in flexion. X-axis represents the change in pain (mean change-pain ± standard error (SE)), and Y-axis represents the change in flexion (mean flexion change ± SE). Points represent the observations (estimates from the underlying studies), with horizontal and vertical bars representing the approximate standard error for the predictor and response variable, respectively. The solid line represents the posterior mean of the relationship between the predictor and the response for a hypothetical average study, and the grey field is 90% credible interval (uncertainty), from the meta-regression errors-in-variables model. The vertical dotted lines represent the minimal clinically important change (MCIC) from reference study and its SE (19.9 mm ± 0.94) [45], demonstrating the inference of the corresponding MCIC for flexion

Fig. 5
figure 5

Relationship between change in function with change in flexion. X-axis represents the change in function (mean change function ± standard error (SE)), and Y-axis represents the change in flexion (mean flexion change ± SE). Points represent the observations (estimates from the underlying studies), with horizontal and vertical bars representing the approximate standard error for the predictor and response variable, respectively. The solid line represents the posterior mean of the relationship between the predictor and the response for a hypothetical average study, and the grey field is 90% credible interval (uncertainty), from the meta-regression errors-in-variables model. The vertical dotted lines represent the minimal clinically important change (MCIC) from reference study _and its SE: A − 9.1 + 0.77 [45], B − 17.13 + _1.5 [29], C − 17.02 + _1.59 [29] demonstrating the inference of the corresponding MCIC for flexion

Relationships for knee flexion with changes in pain and function using knee flexion measured in supine

Due to missing data, a few studies only reported that they measured knee flexion in supine. There were data on supine-active flexion and mode = not reported, and there were no mode = passive data. First, relationships between Δ supine-active flexion with Δ pain-VAS and Δ function-WOMAC were estimated. Second, relationships between Δ pooled-supine flexion (position = supine, mode = regardless, active plus not reported) with Δ pain-VAS and Δ function-WOMAC were estimated (Table 5).

Table 5 Parameter estimates for relationships considering supine flexion

Considering Δ supine-active flexion data, linear relationships between Δ supine-active flexion with Δ pain-general VAS were − 0.89 (− 1.09, − 0.55) (β (CrI)). Increased Δ supine-active flexion is associated with decreased Δ pain-general VAS. The uncertainty of the above estimate was only sufficiently low; MCIC considering supine-active flexion was only estimated based on this relationship (Table 5).

Considering Δ pooled-supine flexion data, linear relationships between Δ pooled-supine flexion and Δ pain-during activity were − 0.24 (− 0.37, − 0.12) (β (CrI)), and Δ pooled-supine flexion and Δ pain-general were − 0.61 (− 0.93, − 0.30). Δ pooled-supine flexion were associated with decreased Δ pain-during activity, and Δ pain-general and the uncertainty of these relationships were only sufficiently low. Therefore, MCIC estimates considering pooled-supine flexion were estimated based on these two relationships (Table 6).

Table 6 Parameter estimates for MCIC of knee flexion using only reported supine knee flexion

Estimated MCIC for knee flexion measured in supine

Point estimates of MCIC of knee flexion considering supine-active flexion are 12.8° (− 0.0°, 26.3°) (β (CrI)).

Point estimates of MCIC of knee flexion considering pooled-supine flexion ranged from 1.7 to 11.3° with a total range of 90% CrI − 0.5 to 20.1° (Figs. 6, 7). However, there were still no sufficient studies to draw strong conclusions due to the missing data.

Fig. 6
figure 6

Relationship between change in pain-general with change in supine-active flexion. X-axis represents the change in pain (mean change-pain ± standard error (SE)), and Y-axis represents the change in flexion (mean flexion change ± SE). Points represent the observations (estimates from the underlying studies), with horizontal and vertical bars representing the approximate standard error for the predictor and response variable, respectively. The solid line represents the posterior mean of the relationship between the predictor and the response for a hypothetical average study, and the grey field is 90% credible interval (uncertainty), from the meta-regression errors-in-variables model. The vertical dotted lines represent the minimal clinically important change (MCIC) from reference study and its SE (19.9 mm ± 0.94) [45], demonstrating the inference of the corresponding MCIC for flexion

Fig. 7
figure 7

Relationships between changes in pain. A During activity. B Pain-general with change in pooled-supine flexion. X-axis represents the change in pain (mean change-pain ± standard error (SE)), and Y-axis represents the change in flexion (mean flexion change ± SE). Points represent the observations (estimates from the underlying studies), with horizontal and vertical bars representing the approximate standard error for the predictor and response variable, respectively. The solid line represents the posterior mean of the relationship between the predictor and the response for a hypothetical average study, and the grey field is 90% credible interval (uncertainty), from the meta-regression errors-in-variables model. The vertical dotted lines represent the minimal clinically important change (MCIC) from reference study and its SE (19.9 mm ± 0.94) [45], demonstrating the inference of the corresponding MCIC for flexion

Discussion

This study provided estimates for MCIC of knee flexion using a meta-analytical approach. We used an innovative method to estimate the MCIC of knee flexion using relationships between Δ flexion with Δ pain and Δ function. We found that the point estimates of knee flexion MCIC ranged from 3.8 to 6.4°. Our MCIC estimates are specific to knee flexion in people with knee OA after non-surgical intervention with an intervention duration of ≤ 3 months. To our knowledge, this is the first study that has implemented relationships between Δ flexion with Δ pain and Δ function and estimated MCIC of knee flexion in people with knee OA. We assumed our methodology was robust, as MCIC was estimated only after the strength of the relationships were assured.

Our estimate for knee flexion MCIC for people with knee OA (3.8 to 6.4°) equates to approximately 3 to 5% of full knee flexion (considering 135° full flexion) [117]. A discussion paper on MCIC stated that MCIC of an outcome measure generally ranged between 6 and 10% of the total score regardless of the outcome measure [118]. Therefore, our estimate is lower than might be expected. However, the estimates are for an angle rather than a survey-based measure, and so the conditions governing these measurements may be different.

While MCIC estimates are specific to the disease condition and outcome measure [4, 119], no previous studies have provided MCIC for knee flexion. One previous study [120] attempted to determine the MCIC of maximum knee flexion during walking using a 3D motion analysis system in people with knee OA following knee arthroplasty; however, they were unable to establish estimates as no association was found between flexion and the anchor questions. This may be because walking requires less than 90° of flexion [117], and so range limitation is less likely to affect walking.

The relationships we found between Δ flexion with Δ pain and Δ function agree with previous studies that reported that flexion improvement was associated with pain relief and functional gain [26]. However, while some studies have found that flexion improvement is related to pain relief [8] and functional improvement [9, 10], others concluded that there is no strong relationship between flexion with pain and function [121]. They suggested that the reason might be that if the patients have already achieved a functional range of motion (more than 110°), flexion improvement may be less important [121].

There are several strengths of this study. Our estimates of MCIC are based on several relationships of Δ flexion with Δ pain and Δ function rather than using just one domain. We combined patient-reported outcome measures (pain and function) with physical outcomes (flexion data), thereby increasing the robustness of our estimates. This accords with recommendations for a holistic approach in disability measures with a combination of physical and patient-reported outcomes [35, 122]. This method for estimating an MCIC may be useful where it is difficult to formulate sensitive anchor questions or where recall bias is an issue [16]. For example, kinematic parameters may be difficult to interpret in an anchor question, and recall bias may be an issue for longer-term outcome measurement. However, any bias in the reference MCIC (MCIC of pain and function in this study) will carry forward into the study estimates (propagation of error) [123], and careful selection of reference estimates should be made.

The results of this study should be interpreted considering its limitations. In contrast to the anchor method, we did not directly acknowledge the patients’ perceptions. Since this is a meta-analytical approach study, these results are based on the results of included studies in this review. Therefore, publication bias and missing data may affect the results. We attempted to estimate MCIC considering only supine flexion data but no adequate studies to draw strong conclusions due to the missing data. We included studies published in English, and grey literature was not searched. Though MCIC can be affected by baseline knee flexion [124], we did not address specific baseline knee flexion, which might have affected the results. Even though we included non-surgical interventions with a treatment duration of ≤ 3 months and addressed the comparability of data before calculating the relationships, clinical variability and confounding caused by between-study factors still may affect the results of this study. Since we used the population level data, this may falsely infer individual data. Furthermore, substantial care has been invested in the internal validity of the analyses leading to MCIC estimates, with respect to data extraction and collation, model and prior specification and model goodness-of-fit evaluation. Much of this information is available to the reader, and we believe that particularly the graphical presentation conveys clearly and transparently the information supporting the results. However, we must note that the external validity of these MCIC estimates is simply unknown. The specific MCIC estimates obtained here have not been externally validated by reference to a repeated study of independent data. Further, we are not aware that meta-analytic modelling of correlated responses has been previously applied for MCIC determination, i.e. the whole approach is novel, so we simply cannot know how well MCIC estimated in this manner would generalise. Though we have carefully considered and evaluated the soundness of the statistical modelling, and from a ground-up perspective the methodology makes sense, replication for similar questions would be necessary to make any judgment about its performance. This is necessarily true for studies using exploratory, rather than established, methods.

Instrument factors should be considered when applying these estimates in clinical and research settings. Flexion range of motion can be measured using a range of goniometers. The most commonly used ‘universal goniometer’ has the minimum detectable change (MDC) ranging from 5 to 10° [125, 126]. However, the digital goniometer is more precise where the MDC is about 2° [127]. If MCIC is less than the MDC of the instrument, the minimal effect cannot be precisely measured using that instrument though the impact is meaningful to the patient [7]. Therefore, it is recommended to use a more precise instrument to measure knee flexion in knee OA studies.

In conclusion, estimated knee flexion MCIC values can be used in clinical and research studies to evaluate treatment efficacy. Though our results are specific to people with knee OA after non-surgical interventions, it is recommended that future studies be undertaken to estimate MCIC as a function of the severity of the knee OA. Our novel meta-analytical approach may be useful for estimating MCIC for other outcome measures where anchor questions are problematic.