Minimal important change (MIC): a conceptual clarification and systematic review of MIC estimates of PROMIS measures

Terwee, Caroline B.; Peipert, John Devin; Chapman, Robert; Lai, Jin-Shei; Terluin, Berend; Cella, David; Griffiths, Pip; Mokkink, Lidwine B.

doi:10.1007/s11136-021-02925-y


Review
Open access
Published: 10 July 2021

Volume 30, pages 2729–2754, (2021)

Quality of Life Research


Caroline B. Terwee ORCID: orcid.org/0000-0003-4570-2826¹,
John Devin Peipert²,
Robert Chapman²,
Jin-Shei Lai²,
Berend Terluin³,
David Cella²,
Pip Griffiths⁴ &
…
Lidwine B. Mokkink¹


Abstract

We define the minimal important change (MIC) as a threshold for a minimal within-person change over time above which patients perceive themselves importantly changed. There is a lot of confusion about the concept of MIC, particularly about the concepts of minimal important change and minimal detectable change, which questions the validity of published MIC values. The aims of this study were: (1) to clarify the concept of MIC and how to use it; (2) to provide practical guidance for estimating methodologically sound MIC values; and (3) to improve the applicability of PROMIS by summarizing the available evidence on plausible PROMIS MIC values. We discuss the concept of MIC and how to use it and provide practical guidance for estimating MIC values. In addition, we performed a systematic review in PubMed on MIC values of any PROMIS measure from studies using recommended approaches. A total of 50 studies estimated the MIC of a PROMIS measure, of which 19 studies used less appropriate methods. MIC values of the remaining 31 studies ranged from 0.1 to 12.7 T-score points. We recommend to use the predictive modeling method, possibly supplemented with the vignette-based method, in future MIC studies. We consider a MIC value of 2–6 T-score points for PROMIS measures reasonable to assume at this point. For surgical interventions a higher MIC value might be appropriate. We recommend more high-quality studies estimating MIC values for PROMIS.


Introduction

There are several ways to interpret change scores arising from patient-reported outcome measures (PROMs). One possible threshold is the minimal important change (MIC) estimate, which refers to the smallest change in score that patients consider important. The MIC is the lower bound of a distribution of thresholds for important change. There is a lot of confusion about the concept of MIC, which questions the validity of published MIC values [1, 2]. First, there is inconsistency in terminology used (e.g., minimal important change, minimal important difference, minimal clinically important difference, meaningful change threshold, to name a few). Similar terms may refer to different concepts and vice versa. Second, there is particular confusion about the concepts of minimal important change and minimal detectable change, which refer to different concepts [3, 4]. Third, there are differences in methods used for estimating the MIC, some more and some less methodologically sound [5]. This confusion hampers and may even bias the interpretation of PROM change scores in research and clinical practice.

An increasingly used, innovative set of PROMs is the Patient-Reported Outcomes Measurement Information System (PROMIS®). It covers domains of health-related quality of life (HRQOL), such as pain, fatigue, physical function, anxiety, depression, and the ability to participate in social roles and activities, that are commonly important for adults and children with and without (chronic) medical conditions [6, 7]. Most PROMIS measures are rooted in item response theory (IRT)-based item banks (i.e., large sets of calibrated questions measuring the same domain (construct)), which enables efficient measurement through fixed-length short forms and/or computerized adaptive testing (CAT) [8,9,10]. A number of studies have estimated MIC values for PROMIS measures. However, in light of its increasing use across the world [11,12,13,14,15,16,17,18,19], and the aforementioned confusion in the interpretation literature, additional guidance is needed on interpreting PROMIS change scores.

The aims of this study were: (1) to clarify the concept of MIC and how to use it; (2) to provide practical guidance for estimating methodologically sound MIC values; and (3) to improve the applicability of PROMIS by summarizing the available evidence on plausible PROMIS MIC values.

Part 1: the concept of MIC and how to use it

We define the MIC as a threshold for a minimal within-person change over time above which patients perceive themselves importantly changed. Assuming that all patients have their individual threshold of what they consider a minimal important change, the MIC can be conceptualized as the mean of these individual thresholds [20, 21]. This definition of MIC is made up of three important elements: first, it refers to a threshold for a minimal change above which patients perceive themselves as changed (improved or deteriorated). Second, it refers to a change that is considered important to patients. And third, it refers to a within-patient change over time.

These three elements not only define what the MIC is but also clarify what it is not. The MIC does not refer to thresholds for changes that are considered more than minimal (e.g., a mean change in patients who reported to be “much better” is not a MIC). There are other relevant concepts that reflect meaningful change thresholds that are larger than minimal, such as Clinically Significant Change [22], Sufficiently Important Difference [23] or Smallest Worthwhile Effect [24]. These concepts are outside the scope of this paper.

Next, the MIC is not a minimal detectable change (MDC, also referred to as smallest detectable change (SDC)). The MDC is the smallest change in score that can be detected statistically with some degree of certainty (e.g., 95 or 90%), based on the standard error of measurement (SEM) or limits of agreement from a test–retest reliability design. The MDC does not relate to the importance of change to the patients under investigation [4, 25,26,27]. The MDC is also an important benchmark for interpreting PROM change scores, but it is outside the scope of this paper.
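The distinction can be made concrete with a short calculation. The sketch below assumes the commonly used formulas SEM = SD × sqrt(1 − reliability) and MDC95 = 1.96 × sqrt(2) × SEM; the function names and the example reliability of 0.90 are illustrative assumptions, not values from this paper. Nothing in the calculation involves patients' judgment of importance, which is why an MDC cannot stand in for a MIC.

```python
import math

def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """SEM from the score SD and a test-retest reliability coefficient."""
    return sd * math.sqrt(1.0 - reliability)

def minimal_detectable_change(sem: float, z: float = 1.96) -> float:
    """MDC for a change score: z * sqrt(2) * SEM (z = 1.96 for 95% certainty)."""
    return z * math.sqrt(2.0) * sem

# Illustrative values only: a T-score SD of 10 and an assumed reliability of 0.90.
sem = standard_error_of_measurement(sd=10.0, reliability=0.90)
mdc95 = minimal_detectable_change(sem)
print(round(sem, 2), round(mdc95, 2))
```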

Finally, the MIC is not a difference between (groups of) patients. For example, a difference between patients who reported to be “a little better” and those who reported to be “about the same” refers to a minimal important difference (MID), not a minimal important within-person change (MIC). The MID is another relevant benchmark for interpreting PROM scores but is also outside the scope of this paper.

The MIC, as defined above, can be used for different purposes. In research, some use the MIC value as a threshold to determine the number of responders in clinical trials or other studies (i.e., patients who have a change at least as large as the MIC value) [28, 29]. This responder definition adds a meaningful interpretation to study results from the patients’ perspective. In clinical practice, the MIC value can also be used to determine the number of responders in groups of patients who receive certain treatments to inform future patients about the expected effects of treatments. For example, a patient can be told that about 70% of patients experience a minimal important change after a given treatment. This may facilitate shared decision-making. However, it is necessary to acknowledge that the estimated MIC value is derived from a wider sample of patients, and the threshold may not apply to the individual patient in the clinical trial or in the consultation room. If a responder is defined as an individual whose PROM change score exceeds the MIC, then on a group level the percentage of responders will probably be correct. However, this does not mean that all patients have been classified correctly, based on their individual PROM change score being smaller or greater than their individual MIC. This is because all patients have their own individual threshold of what they consider a minimal important change [20]. In addition, measurement error in the PROM change score contributes further to misclassification of individuals.
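As a minimal sketch of the responder definition described above, the following computes the proportion of patients whose change score is at least as large as a chosen MIC value. The change scores are hypothetical, the function name is an assumption, and the sketch assumes that positive change scores represent improvement. The result is a group-level percentage; it does not say whether any individual patient was classified correctly.

```python
def responder_rate(change_scores, mic):
    """Proportion of patients whose change score is at least the MIC value."""
    if not change_scores:
        raise ValueError("change_scores must not be empty")
    responders = sum(1 for c in change_scores if c >= mic)
    return responders / len(change_scores)

# Hypothetical T-score change scores for ten patients (positive = improvement).
changes = [0.5, 1.0, 2.5, 3.0, 3.5, 4.0, 5.5, 6.0, 7.5, -1.0]

# Reporting the rate for several candidate thresholds shows how sensitive
# the responder percentage is to the assumed MIC value.
for threshold in (2.0, 4.0, 6.0):
    print(threshold, responder_rate(changes, threshold))
```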

In addition to being used as a threshold for responder definitions, the MIC value can be used as a probabilistic value, rather than a deterministic cut-point, by clinicians to interpret change scores in light of the probability that an individual patient has experienced a meaningful change. For example, if the estimated MIC value of a PROM is 10 points and an individual patient has changed more than 10 points, it is more likely that the patient has importantly improved than that the patient has not importantly improved. This might help the clinician start a conversation with the patient.

Part 2: guidance for estimating MIC values

A variety of methods have been used in the literature to estimate MIC values [1, 30, 31]. Many methods, however, do not refer to the concept of MIC as described above. MIC methods are often categorized into distribution-based and anchor-based methods. Distribution-based methods use statistical parameters, such as a standard deviation (SD) or standard error of measurement (SEM) for estimating the MIC value. These parameters refer to measurement error (minimal detectable change) but do not relate to the importance of the change to the patients under investigation and, while they add useful context to interpreting MIC values, they do not capture the spirit of the MIC [3, 4, 27].

Anchor-based methods are generally more appropriate because they relate change scores on the instrument of interest to an external criterion of important change. Often, a single question at follow-up is used as the external criterion (the anchor), asking patients how much they have changed, for example on a global 5- or 7-point rating scale ranging from “much worse” to “much better”. The simplest and most prevalent method used to estimate the MIC value is the mean change method, where the MIC value (further referred to as MIC_mean) is defined as the mean change score on the measure of interest in the subgroup of patients that reported to be “a little better” (minimal important improvement) or “a little worse” (minimal important deterioration) on the anchor question [32]. Studies have shown that a MIC for improvement may not be the same as a MIC for deterioration [33,34,35]. The mean change method has some important drawbacks. First, the subgroup of patients who reported to be “a little better” is often small, which results in imprecise MIC_mean estimates. More importantly, the MIC_mean value does not reflect a threshold for minimal improvement because it is defined as the mean of the entire group of patients who reported to be “a little better”. As all patients in this group reported to be minimal importantly changed on the anchor, the mean change in score on the PROMs of interest in this group of patients is higher than the threshold for minimal important change. Finally, it has been shown that if the anchor is not completely accurate, MIC_mean estimates are more severely biased than other anchor-based methods and will always be biased downwards [36].
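The mean change method can be sketched in a few lines, which also makes its first drawback visible: only the “a little better” subgroup contributes to the estimate. The data, the anchor labels, and the function name below are hypothetical.

```python
from statistics import mean

def mic_mean(change_scores, anchor_ratings, target="a little better"):
    """Mean change score in the subgroup with the target anchor rating.

    Returns the estimate together with the subgroup size, because a small
    subgroup makes the estimate imprecise.
    """
    subgroup = [c for c, a in zip(change_scores, anchor_ratings) if a == target]
    if not subgroup:
        raise ValueError("no patients with anchor rating: " + target)
    return mean(subgroup), len(subgroup)

# Hypothetical change scores and anchor ratings for eight patients.
changes = [0.0, 1.0, 2.0, 3.0, 7.0, 8.0, 9.0, -2.0]
anchors = ["about the same", "about the same", "a little better",
           "a little better", "a little better", "much better",
           "much better", "a little worse"]

estimate, n_subgroup = mic_mean(changes, anchors)
print(estimate, n_subgroup)
```

In this toy sample only three of eight patients inform the estimate, and the result is the subgroup average rather than a boundary between “not changed” and “a little better”.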

Two additional, more appropriate, anchor-based MIC methods are the ROC method and the MIC predictive modeling method, which are described in more detail below and in Online supplement 2. In addition, a relatively new qualitative method, based on comparing vignettes (descriptions of health status of hypothetical patients), is also described below.

ROC method

The Receiver Operating Characteristic (ROC) curve method is based on the ability of a measure to distinguish patients who reported to be improved from patients who reported to be not improved (i.e., stayed the same or worsened) on the anchor. The MIC value (further referred to as MIC_ROC) is most often defined as the value for which the sum of the proportions of misclassifications ([1-sensitivity] + [1-specificity]) is smallest [32]. An advantage of this method is that it uses the entire study sample, leading to more reliable estimates than the MIC_mean. Moreover, it estimates the threshold between ‘not changed’ and ‘a little better’ (minimal important improvement) or ‘a little worse’ (minimal important deterioration). A disadvantage is that the MIC_ROC will be biased if the percentage of improved patients is not 50% [20].
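A minimal sketch of this cut-point search follows. It assumes that higher change scores indicate improvement, that a patient is classified as improved when the change score is at least the cut-point, and that only observed change scores are tried as candidate cut-points; these conventions, the data, and the function name are assumptions for illustration and are not taken from Online supplement 2.

```python
def mic_roc(change_scores, improved):
    """Cut-point minimising (1 - sensitivity) + (1 - specificity).

    `improved` holds one boolean per patient, derived from the anchor.
    Ties are resolved in favour of the lowest cut-point.
    """
    n_improved = sum(1 for flag in improved if flag)
    n_not_improved = len(improved) - n_improved
    if n_improved == 0 or n_not_improved == 0:
        raise ValueError("both anchor groups must contain patients")
    best_cut, best_error = None, float("inf")
    for cut in sorted(set(change_scores)):
        true_pos = sum(1 for c, f in zip(change_scores, improved) if f and c >= cut)
        true_neg = sum(1 for c, f in zip(change_scores, improved) if not f and c < cut)
        error = (1 - true_pos / n_improved) + (1 - true_neg / n_not_improved)
        if error < best_error:
            best_cut, best_error = cut, error
    return best_cut, best_error

# Hypothetical sample in which half of the patients improved on the anchor.
changes = [-1.0, 0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
improved = [False, False, False, True, True, False, True, True]

cut, error = mic_roc(changes, improved)
print(cut, error)
```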

Predictive modeling method

The predictive modeling approach is based on the predicted probability that a patient belongs to the improved group (based on the anchor) given the observed change score [21]. This method uses logistic regression analysis with the group variable (improved versus not improved [stayed the same or worsened] on an anchor) as the dependent variable and the change score on the instrument of interest as the independent variable. The MIC value (further referred to as MIC_predict) is defined as the change score associated with a likelihood ratio of 1, which is the change score where the posttest probability of belonging to the improved group (i.e., after knowing the patient’s PROM change score) equals the pretest probability of belonging to the improved group (before knowing the patient’s PROM change score, the pretest probability is the percentage of improved patients in the sample) [20, 21]. The MIC_predict is more precise than the MIC_ROC and a formula has been published to correct the MIC_predict for bias if the percentage of improved patients is not 50% [20]. It is therefore considered a better option than the MIC_ROC. In Online supplement 2 we provide additional details and SPSS and R code (see also [37]) for how MIC_ROC and MIC_predict can be calculated.
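The definition above translates directly into a calculation: fit the logistic model, then solve for the change score at which the predicted log-odds equal the pretest log-odds. The sketch below is an illustration in Python, not the SPSS or R code of Online supplement 2; the hand-written Newton-Raphson fit, the function names, and the toy data are assumptions, and the published bias adjustment for samples in which the percentage improved is not 50% [20] is not reproduced here.

```python
import math

def fit_logistic(x, y, iterations=50, tol=1e-10):
    """Newton-Raphson fit of P(y = 1) = 1 / (1 + exp(-(b0 + b1 * x)))."""
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p            # gradient with respect to the intercept
            g1 += (yi - p) * xi     # gradient with respect to the slope
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        step0 = (h11 * g0 - h01 * g1) / det
        step1 = (h00 * g1 - h01 * g0) / det
        b0 += step0
        b1 += step1
        if abs(step0) < tol and abs(step1) < tol:
            break
    return b0, b1

def mic_predict(change_scores, improved):
    """Change score at which the predicted probability of being improved
    equals the proportion improved in the sample (likelihood ratio of 1)."""
    y = [1 if flag else 0 for flag in improved]
    prevalence = sum(y) / len(y)
    if prevalence in (0.0, 1.0):
        raise ValueError("both anchor groups must contain patients")
    b0, b1 = fit_logistic(change_scores, y)
    pretest_log_odds = math.log(prevalence / (1.0 - prevalence))
    return (pretest_log_odds - b0) / b1

# Hypothetical sample with overlapping groups and 50% improved.
changes = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
improved = [False, False, True, False, True, True]
print(mic_predict(changes, improved))
```

Unlike the ROC cut-point, the estimate is not restricted to observed change scores, because it is read from the fitted curve.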

Vignette-based method

The anchor-based MIC methods described above depend on the reliability and validity of the anchor question, which has been criticized [30, 38, 39]. An alternative method for instruments with IRT-based scores is a vignette-based method, often referred to as bookmarking or standard setting. With this method, patients are asked to compare vignettes (descriptions of health status of hypothetical patients) in focus groups or in a survey [40,41,42]. Each vignette represents a health status with an associated score on the underlying IRT metric. Patients are asked to indicate whether a hypothetical change in health status from one vignette to another would be considered an important change. The MIC (further referred to as MIC_vignette) has been defined as the mean difference in scores between pairs of vignettes that represent a minimal important change. If the mean difference is used to estimate the MIC_vignette, this method may suffer a similar issue to MIC_mean in that it represents a value higher than the minimal threshold. Alternatively, it would also be possible to ask patients to rate the change between two (or more) vignettes on an anchor question and then use the predictive modeling method to estimate the MIC_predict.

In Box 1 we provide a summary of general recommendations for the design and analysis of MIC studies.

Box 1: Recommendations for conducting and reporting MIC studies

1. The predictive modeling or ROC method should be used over the mean change method because MIC_predict and MIC_ROC provide a threshold between improved and not improved patients [21], while the MIC_mean does not reflect a threshold for minimal improvement, but rather a mean in a subgroup of patients who considered themselves as minimally improved. The MIC_predict is more precise than the MIC_ROC and can be corrected for bias if the percentage of improved patients is not 50%, and is therefore recommended as the best option. Vignette-based methods can also be considered or used in addition to the MIC_predict or MIC_ROC because they do not require a longitudinal study. The MIC_vignette is typically determined in a qualitative or survey study [40, 97].
2. The MIC_predict or MIC_ROC should be determined in a longitudinal study, where patients complete the instrument of interest at baseline and again after a relevant time period (e.g., after an intervention). The most efficient design is one in which about half of the patients are expected to change (at least to a minimal important degree) on the domain of interest (e.g., physical function) and about half of the patients are expected not to change. If an intervention is applied between baseline and follow-up measurement, this intervention should be clearly described.
3. An anchor question should be completed by the patients at follow-up. The anchor question should measure the same construct as the instrument of interest. For example, for estimating the MIC of a fatigue instrument, the anchor question should state “how much has your fatigue changed since …”. The anchor question should refer to a change since the previous measurement (e.g., since before treatment). The anchor question can have 3–7 response options, ranging from “much worse” to “much better”. Patients who report to be “a little better” or more will be included in the improved group, while the rest of the patients will be included in the not improved group.
4. The sample size of the MIC study should be at least 100 patients [2]. Ideally, the percentage of patients in the improved group should be 50%. The percentage of improved patients should be reported. If the percentage of improved patients is not about 50%, the adjusted MIC_predict should be used (see Online supplement 2) [20].
5. We recommend plotting the distributions of change scores in the improved group and in the not improved group (see Online supplement 2) to visualize how well the instrument of interest can distinguish between the improved and not improved patients [32].
6. The correlation between the change score on the instrument of interest and the anchor question should be at least 0.30 to assume validity of the anchor [2]. This correlation should be reported. If the correlation is too low, the data are not suitable for estimating a MIC value.
7. A 95% confidence interval around the MIC value should also be calculated and reported (see Online supplement 2) [98].
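Several of these recommendations (sample size of at least 100, percentage improved, anchor correlation of at least 0.30) can be checked before any MIC is estimated. The sketch below is one possible way to do so; the integer coding of the anchor (-2 = “much worse” to 2 = “much better”), the choice of the Spearman correlation, the function names, and the data are assumptions for illustration.

```python
import math

def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def spearman(x, y):
    """Spearman correlation as the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def check_mic_data(change_scores, anchor_codes, improved_from=1):
    """Summarise the Box 1 checks; anchor codes >= improved_from count as improved."""
    n = len(change_scores)
    n_improved = sum(1 for a in anchor_codes if a >= improved_from)
    rho = spearman(change_scores, anchor_codes)
    return {
        "n": n,
        "n_at_least_100": n >= 100,
        "percent_improved": 100.0 * n_improved / n,
        "anchor_correlation": rho,
        "correlation_ok": rho >= 0.30,
    }

# Hypothetical data: far too small for a real MIC study, which the check reports.
changes = [-2.0, 0.0, 1.0, 2.0, 3.0, 4.0, 6.0, 8.0]
anchor_codes = [-1, 0, 0, 1, 0, 1, 2, 2]
print(check_mic_data(changes, anchor_codes))
```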

Part 3: evidence on plausible MIC values of PROMIS measures

To summarize the available evidence on plausible MIC values of PROMIS measures we performed a search in PubMed from inception up to May 31, 2021 to identify all studies that estimated the MIC of one or more PROMIS measures.

Methods

We extracted relevant search terms from the COSMIN PubMed filter for finding studies on measurement properties [43]. The full search strategy is presented in Online supplement 3. One author (CBT) screened the abstracts.

We included studies that determined a MIC value for any PROMIS measure (adults and pediatric, any domain, any language, any version (e.g., v1.0, v2.0), full bank, short form or CAT) in any population. We extracted the following information: PROMIS measure(s) used (including domain, version number, administration type, language, age version) and country in which data were collected, study population, intervention(s), length of follow-up, sample size on which the MIC value(s) was/were based, MIC methods used, correlation between PROMIS change scores and the anchor (Spearman correlation if presented, otherwise Pearson correlation), percentage of patients improved based on the anchor (only for studies estimating MIC_ROC or MIC_predict), and MIC values.

We only extracted MIC values based on anchor-based methods or vignette-based methods. We did not extract distribution-based MIC values. We only extracted MIC values based on longitudinal anchors, referring to within-person change over time. We did not extract values based on cross-sectional anchors, referring to minimal important differences between groups of patients (e.g., difference between patients who reported to be “slightly improved” and patients who reported to be “not changed” [44] or differences between patients with different levels of disease [45]) because these values refer to a minimal important difference (MID) rather than a minimal important change (MIC). When MIC values of other instruments were used as an anchor, we checked whether these MIC values were based on anchor-based methods. Furthermore, we did not extract MIC values that referred to more than a minimal important change (for example, MIC_mean values based on mean changes in patients who reported to be “much better” were not included). We extracted MIC values for minimal important improvement and for minimal important deterioration separately. MIC values determined in groups of less than 10 patients were not extracted. Data extraction was initially performed by one author (either JDP, RC, PG, or CBT) for each paper, and extracted data were checked by another author (CBT or LBM). Missing information (for example, regarding the version numbers of PROMIS measures used) was requested by email (by CBT) to the primary authors of the papers.

All PROMIS measures are scored on a T-score metric, in which 50 is the mean of a relevant reference population (often a general population) with a standard deviation (SD) of 10. Higher scores mean more of the concept being measured (e.g., worse fatigue, better physical function).
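Because the T-score metric has an SD of 10 in the reference population, T-score points convert directly into reference-population SD units. The sketch below assumes the usual linear relation T = 50 + 10 × theta, where theta is the IRT score standardized to the reference population; the function names are illustrative.

```python
def theta_to_t_score(theta: float) -> float:
    """T-score for an IRT theta standardized to the reference population."""
    return 50.0 + 10.0 * theta

def t_points_to_sd_units(t_points: float) -> float:
    """Express a change in T-score points in reference-population SD units."""
    return t_points / 10.0

print(theta_to_t_score(0.0))      # the reference-population mean
print(theta_to_t_score(-1.0))     # one SD below the reference mean
print(t_points_to_sd_units(2.0), t_points_to_sd_units(6.0))
```

A change of 2 to 6 T-score points therefore corresponds to 0.2 to 0.6 SD of the reference population.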

Results

The search yielded 911 abstracts, including 50 studies that estimated a MIC value of a PROMIS measure [41, 44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92]. All studies used self-reported PROMIS data; no studies on proxy-reported data were found. Of these 50 studies, 10 studies used only distribution-based methods [49, 50, 52, 55, 58, 66, 68, 74, 75, 77]; five studies estimated a minimal important difference (MID) rather than minimal important change (MIC) [44, 62, 63, 72, 73]; one study averaged estimates based on cross-sectional and longitudinal anchors as well as distribution-based estimates [84]; one study estimated a MIC value that referred to more than a minimal important change [92]; and two studies intended to calculate an anchor-based MIC but reported only a distribution-based MIC because the area under the ROC curve was considered too low [82, 83]. Data from these 19 studies were not extracted.

MIC values from the remaining 31 studies were extracted and are presented in Tables 1, 2, and 3 (see also Tables S1 through S11 in Online supplement 1) [41, 45,46,47,48, 51, 53, 54, 56, 57, 59,60,61, 64, 65, 67, 69,70,71, 76, 78,79,80,81, 85,86,87,88,89,90,91]. Twenty-eight of these 31 studies used anchor-based methods. Anchor-based MIC values from these studies were extracted. Distribution-based MIC values that were also presented in 17 of these studies were not extracted [45,46,47,48, 53, 54, 59, 64, 69,70,71, 78, 81, 85,86,87, 91], three MIC values based on cross-sectional anchors were not extracted [45, 46, 48], and one MIC value based on patients who experienced a “meaningful change” (more than minimal) was also not extracted [53]. Out of the 28 anchor-based studies, 24 used (a variation of) a mean change method [45,46,47,48, 51, 53, 54, 55, 59, 60, 64, 65, 69,70,71, 76, 78, 79, 85,86,87,88,89, 91], five studies used an ROC method [53, 54, 56, 67, 81], of which two studies used both methods [53, 54], and one study used the predictive modeling method [90]. In addition to the 28 studies that used anchor-based methods, the MIC values of three studies that used a vignette method to estimate MIC values were also extracted [41, 61, 80].
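The study counts reported in the Results can be tallied as a quick consistency check. All numbers below are taken from the text; the variable names and category labels are illustrative.

```python
total_studies = 50

# Studies whose data were not extracted, by reason for exclusion.
excluded = {
    "distribution-based methods only": 10,
    "MID rather than MIC": 5,
    "averaged cross-sectional, longitudinal and distribution-based estimates": 1,
    "more than a minimal important change": 1,
    "anchor-based intended, distribution-based reported": 2,
}
n_excluded = sum(excluded.values())
n_included = total_studies - n_excluded

# Methods used in the included studies; two studies used both the
# mean change and the ROC method, so they are subtracted once.
mean_change, roc, both_methods, predictive, vignette = 24, 5, 2, 1, 3
n_anchor_based = mean_change + roc - both_methods + predictive

print(n_excluded, n_included, n_anchor_based, n_anchor_based + vignette)
```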

Table 1 Minimal important change values for adult PROMIS pain interference


Table 2 Minimal important change values for adult PROMIS physical function


Table 3 Minimal important change values for adult PROMIS fatigue


Out of the 28 studies that used anchor-based methods 12 studies reported the correlation between the PROMIS change scores and the anchor. These correlations ranged from 0.02 to 0.76.

In several studies MIC values were presented for more than one PROMIS item bank. Regarding the adult PROMIS item banks, most MIC estimates were found for Pain Interference [17 studies, including 19 patient samples, MIC values for improvement ranged from 0.7 to 12.4 (Table 1)] and Physical Function [18 studies, MIC values for improvement ranged from 0.1 to 12.0 (Table 2)]. Multiple studies were found for Fatigue [7 studies, MIC values for improvement ranged from 1.3 to 5 (Table 3)], Anxiety [5 studies, MIC values for improvement ranged from 2.3 to 3.5 (Table S1 in Online supplement 1)], Depression [4 studies, MIC values for improvement ranged from 1.5 to 3.7 (Table S2 in Online supplement 1)], Upper Extremity [4 studies, MIC values for improvement ranged from 3.0 to 10.3 (Table S3 in Online supplement 1)], Sleep Disturbance [3 studies, MIC values for improvement ranged from 0.9 to 2.4 (Table S4 in Online supplement 1)], Ability to Participate in Social Roles and Activities [3 studies, MIC values for improvement ranged from 0.4 to 2.2 (Table S5 in Online supplement 1)], and Pain Intensity [2 studies, MIC values for improvement ranged from 1.2 to 4.0 (Table S7 in Online supplement 1)]. For the domains Satisfaction with Social Roles and Activities, Gastrointestinal Symptoms, Itch, and Global Health, only one study was found (Tables S6, S8, S9, S10, and S11 in Online supplement 1).

Only two studies estimated MIC values for five different PROMIS pediatric item banks (Mobility, Upper Extremity, Pain Interference, Fatigue, and Depressive Symptoms, Table S11), with MIC values ranging from 0.1 to 12.7 [41, 61].

Discussion

We defined the minimal important change (MIC) as a threshold for a minimal within-person change over time above which patients perceive themselves importantly changed. Assuming that all patients have their individual threshold of what they consider a minimal important change, the MIC can be conceptualized as the mean of these individual thresholds. The MIC can be used to determine the number of responders in a group of patients to interpret study results or to inform patients about expected treatment results, or to help clinicians to estimate the probability that an individual patient has experienced a meaningful change, facilitating a conversation with the patient.

There is no perfect MIC method. Distribution-based methods are not appropriate because they do not relate to the importance of the change to patients. We consider the predictive modeling method the most appropriate anchor-based method, because, unlike the mean change method, it refers to a threshold for minimal important change. Moreover, the MIC_predict is more precise than the MIC_ROC and a formula has been published to correct the MIC_predict for bias if the percentage of improved patients is not 50% [20]. A disadvantage of all anchor-based MIC methods is the concern about the reliability and validity of the anchor question. The relatively new vignette-based method does not depend upon an anchor question, but the MIC_vignette may represent a value higher than a minimal threshold if based on mean differences between vignettes. We recommend the predictive modeling method, possibly supplemented with the vignette-based method if the time and knowledge to design vignettes and recruit patients for that kind of study are available.

Our systematic review showed that published MIC estimates for PROMIS measures vary widely (more widely than the range of MIC estimates currently published on the HealthMeasures website [93]) and were often generated by less appropriate methods. The lower end of the observed range of MIC values (0.1 T-score points) is, in our opinion, implausible as a MIC threshold. The highest MIC values (7 T-score points or higher) were almost all found in adult patients undergoing surgery. It has been suggested before that an invasive procedure like surgery might require a higher change to be considered an important improvement, but results in the literature have been inconsistent [94, 95]. For non-surgical interventions, we consider a MIC value of 2–6 points (covering about two thirds of the published MIC values) reasonable to assume at this point. There is not enough evidence yet to make more specific domain-specific or population-specific recommendations. Further studies are needed to examine whether MIC values differ across domains or between adults and children.

We particularly noticed several methodological concerns which might result in such a wide range of MIC estimates. First, most of these studies used the mean change method, which may represent a value higher than a minimal threshold. We did not exclude these results because this method is currently the most widely used method in the field (despite the critiques raised here) and only five studies used the ROC method, one study used the predictive modeling method [90], and three studies used a vignette-based method. In theory, it is likely that MIC_mean values represent an overestimation of the MIC (Fig. 1); however, many reported MIC values were rather low. Second, sample sizes on which the MIC estimates were based were often small. Third, some studies used the MIC of another instrument as an anchor. These MIC values were sometimes untraceable, based on the MIC value of yet another instrument, based on instruments that may not measure a sufficiently similar construct or that lack evidence for responsiveness, or based on distribution-based methods. Fourth, only 12 out of 28 anchor-based studies presented the correlation between the PROMIS change score and the anchor question and about one third of the correlations were lower than 0.30 (excluding these values would not change our conclusions). Fifth, in some studies it was not clear whether the MIC estimate was based on patients who improved minimally. Sixth, in some studies the lower bound of recommended MIC values was increased to the SEM. However, the SEM represents the amount of measurement error and does not reflect changes that patients consider important. For this reason, setting a MIC lower bound to be in the detectable range may eliminate changes that patients find important. More broadly, researchers should be mindful of instruments with large measurement error and attempt to reduce the measurement error (e.g., using CAT), instead of adjusting the MIC value [3]. Finally, in some studies improved and deteriorated patients were combined, while the MIC for improvement might be different from the MIC for deterioration, making inferences about the estimated MIC difficult [31, 33,34,35].

Another problem is that important details of the MIC studies were often not reported, such as version numbers (while different versions of PROMIS measures may have a different metric), percentage of patients improved, correlation between the PROMIS change score and the anchor, and the sample size on which the MIC value was based. Recently, a reporting guideline for all publications using PROMIS and other HealthMeasures instruments was published [96]. We strongly recommend that PROMIS users follow these reporting recommendations. A reporting guideline for MIC studies is being developed by an international group led by researchers from McMaster University, Canada (personal communication).

To gain more insight into the meaning of PROMIS change scores, more high-quality MIC studies are needed. To increase the understanding of the concept of MIC and improve the field, we need to agree on a clear definition of the MIC and report MIC values that are based on this definition. We recommend against publishing MIC values based on data where the correlation between the change score and the anchor is too low. In that case, we recommend reporting the anchor correlations and stating that the low correlation prevents MIC estimation, rather than publishing MIC values based on distribution-based methods. We offer recommendations for conducting MIC studies (Box 1) that may help prevent the situation where the correlation between the change score and the anchor is too low. Alternatively, we recommend using vignette-based methods. The recommendations in Box 1 can also be used to re-analyze existing data. More data are also needed to examine whether the MIC value differs across the PROMIS metric and across settings (e.g., duration of disease, kind of intervention, length of follow-up) [26]. If researchers need to analyze a study (e.g., responders in a clinical trial) and no credible anchor-based MIC value is available, they could decide to use a distribution-based value, such as 0.5 × SD, or use a range of different values in a sensitivity analysis. We argue, however, that these values should not be called MIC values, because distribution-based values refer to the concept of measurement error and are not based on the concept of MIC. Moreover, as stated in part 1, researchers should keep in mind that the estimated MIC value is derived from a wider sample of patients, and the MIC threshold or responder classification may not apply to the individual patient in the clinical trial or in the consultation room.
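The sensitivity analysis mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions, not a recommended analysis plan: the function name `responder_proportions`, the T-scores, and the assumption that higher scores mean better health are hypothetical. The thresholds of 2 to 6 T-score points and the 0.5 × SD value come from the text, and the latter is labeled as a distribution-based value rather than a MIC.

```python
from statistics import stdev

def responder_proportions(baseline, follow_up, thresholds):
    """Proportion of patients whose improvement meets or exceeds each threshold."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    n = len(changes)
    return {t: sum(change >= t for change in changes) / n for t in thresholds}

# Hypothetical T-scores (higher = better, e.g., physical function); not real data.
baseline = [38.0, 42.5, 35.0, 47.0, 40.0, 44.5, 39.0, 41.0]
follow_up = [43.0, 44.0, 41.5, 47.5, 46.5, 45.0, 42.0, 49.0]

# Distribution-based value: reflects score variability, not what patients find important.
half_sd = 0.5 * stdev(baseline)

# Range of plausible thresholds in T-score points, used as a sensitivity analysis.
thresholds = [2.0, 3.0, 4.0, 5.0, 6.0]
proportions = responder_proportions(baseline, follow_up, thresholds)

print(f"0.5 x SD of baseline scores (not a MIC): {half_sd:.1f}")
for threshold, proportion in proportions.items():
    print(f"threshold {threshold:.0f}: {proportion:.1%} classified as responders")
```

Reporting the responder proportion at each threshold, instead of at a single value, shows how strongly the conclusion depends on the chosen threshold.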

This study has some limitations. First, we only searched PubMed and the abstracts were screened by one author only, so we may have missed some MIC studies. Second, we based our review on one definition of minimal important change and excluded studies and MIC estimates that were not in line with this definition. Others may have different opinions, and the excluded studies and estimates may nevertheless provide relevant information about the interpretation of PROMIS (change) scores. Strong points of the study were that data extraction was checked by a second author and missing information was requested by email from the corresponding authors of the papers.

In conclusion, 50 studies estimated the MIC of a PROMIS measure, of which 19 studies used less appropriate methods. MIC values of the remaining 31 studies ranged from 0.1 to 12.7 T-score points. We consider a MIC value of 2–6 T-score points for PROMIS measures reasonable to assume at this point. For surgical interventions a higher MIC value might be appropriate. We recommend more high-quality studies estimating MIC values for PROMIS. This paper provides recommendations for designing and analyzing future MIC studies.

References

King, M. T. (2011). A point of minimal important difference (MID): A critique of terminology and methods. Expert Review of Pharmacoeconomics & Outcomes Research, 11(2), 171–184.
Devji, T., Carrasco-Labra, A., Qasim, A., Phillips, M., Johnston, B. C., Devasenapathy, N., Zeraatkar, D., Bhatt, M., Jin, X., Brignardello-Petersen, R., Urquhart, O., Foroutan, F., Schandelmaier, S., Pardo-Hernandez, H., Vernooij, R. W., Huang, H., Rizwan, Y., Siemieniuk, R., Lytvyn, L., … Guyatt, G. H. (2020). Evaluating the credibility of anchor based estimates of minimal important differences for patient reported outcomes: instrument development and reliability study. BMJ, 369, m1714.
de Vet, H. C., & Terwee, C. B. (2010). The minimal detectable change should not replace the minimal important difference. Journal of Clinical Epidemiology, 63(7), 804–805.
de Vet, H. C., Terwee, C. B., Ostelo, R. W., Beckerman, H., Knol, D. L., & Bouter, L. M. (2006). Minimal changes in health status questionnaires: Distinction between minimally detectable change and minimally important change. Health and Quality of Life Outcomes, 4, 54.
Terwee, C. B. (2019). Estimating minimal clinically important differences and minimal detectable change. Journal of Hand Surgery, 44(12), e1.
Cella, D., Riley, W., Stone, A., Rothrock, N., Reeve, B., Yount, S., Amtmann, D., Bode, R., Buysse, D., Choi, S., Cook, K., Devellis, R., DeWalt, D., Fries, J. F., Gershon, R., Hahn, E. A., Lai, J. S., Pilkonis, P., Revicki, D., … Hays, R. (2010). The patient-reported outcomes measurement information system (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. Journal of Clinical Epidemiology, 63(11), 1179–1194.
Cella, D., Yount, S., Rothrock, N., Gershon, R., Cook, K., Reeve, B., Ader, D., Fries, J. F., Bruce, B., & Rose, M. (2007). The patient-reported outcomes measurement information system (PROMIS): Progress of an NIH Roadmap cooperative group during its first two years. Medical Care, 45(5 Suppl 1), S3–S11.
Bjorner, J. B., Chang, C. H., Thissen, D., & Reeve, B. B. (2007). Developing tailored instruments: Item banking and computerized adaptive assessment. Quality of Life Research, 16(Suppl 1), 95–108.
Cook, K. F., O’Malley, K. J., & Roddey, T. S. (2005). Dynamic assessment of health outcomes: Time to let the CAT out of the bag? Health Services Research, 40(5 Pt 2), 1694–1711.
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. New York: Psychology Press.
Alonso, J., Bartlett, S. J., Rose, M., Aaronson, N. K., Chaplin, J. E., Efficace, F., Leplege, A., Lu, A., Tulsky, D. S., Raat, H., Ravens-Sieberer, U., Revicki, D., Terwee, C. B., Valderas, J. M., Cella, D., & Forrest, C. B. (2013). The case for an international patient-reported outcomes measurement information system (PROMIS(R)) initiative. Health and Quality of Life Outcomes, 11, 210.
Liegl, G., Rose, M., Correia, H., Fischer, H. F., Kanlidere, S., Mierke, A., Obbarius, A., & Nolte, S. (2018). An initial psychometric evaluation of the German PROMIS v1.2 physical function item bank in patients with a wide range of health conditions. Clinical Rehabilitation, 32(1), 84–93.
Terwee, C. B., Roorda, L. D., de Vet, H. C., Dekker, J., Westhovens, R., Cella, D., Correia, H., Arnold, B., Perez, B., & Boers, M. (2014). Dutch-Flemish translation of 17 item banks from the patient-reported outcomes measurement information system (PROMIS). Quality of Life Research, 23(6), 1733–1741.
Evans, J. P., Smith, A., Gibbons, C., Alonso, J., & Valderas, J. M. (2018). The national institutes of health patient-reported outcomes measurement information system (PROMIS): A view from the UK. Patient Relat Outcome Meas, 9, 345–352.
Vilagut, G., Forero, C. G., Adroher, N. D., Olariu, E., Cella, D., & Alonso, J. (2015). Testing the PROMIS(R) Depression measures for monitoring depression in a clinical sample outside the US. Journal of Psychiatric Research, 68, 140–150.
Bartlett, S. J., Witter, J., Cella, D., & Ahmed, S. (2017). Montreal accord on patient-reported outcomes (PROs) use series—paper 6: Creating national initiatives to support development and use-the PROMIS example. Journal of Clinical Epidemiology, 89, 148–153.
Katarzyna, K., & Glinkowski, W. M. (2019). Patient-reported outcomes of carpal tunnel syndrome surgery in a non-industrial area. Annals of Agricultural and Environmental Medicine, 26(2), 350–354.
Liu, Y., Hinds, P. S., Wang, J., Correia, H., Du, S., Ding, J., Gao, W. J., & Yuan, C. (2013). Translation and linguistic validation of the pediatric patient-reported outcomes measurement information system measures into simplified Chinese using cognitive interviewing methodology. Cancer Nursing, 36(5), 368–376.
Schnohr, C. W., Rasmussen, C. L., Langberg, H., & Bjorner, J. B. (2017). Danish translation of a physical function item bank from the patient-reported outcome measurement information system (PROMIS). Pilot Feasibility Stud, 3, 29.
Terluin, B., Eekhout, I., & Terwee, C. B. (2017). The anchor-based minimal important change, based on receiver operating characteristic analysis or predictive modeling, may need to be adjusted for the proportion of improved patients. Journal of Clinical Epidemiology, 83, 90–100.
Terluin, B., Eekhout, I., Terwee, C. B., & de Vet, H. C. (2015). Minimal important change (MIC) based on a predictive modeling approach was more precise than MIC based on ROC analysis. Journal of Clinical Epidemiology, 68(12), 1388–1396.
Jacobson, N. S., & Truax, P. (1991). Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology, 59(1), 12–19.
Barrett, B., Brown, D., Mundt, M., & Brown, R. (2005). Sufficiently important difference: Expanding the framework of clinical significance. Medical Decision Making, 25(3), 250–261.
Ferreira, M. L., Herbert, R. D., Ferreira, P. H., Latimer, J., Ostelo, R. W., Nascimento, D. P., & Smeets, R. J. (2012). A critical review of methods used to determine the smallest worthwhile effect of interventions for low back pain. Journal of Clinical Epidemiology, 65(3), 253–261.
Hays, R. D., Farivar, S. S., & Liu, H. (2005). Approaches and recommendations for estimating minimally important differences for health-related quality of life measures. COPD, 2(1), 63–67.
Revicki, D., Hays, R. D., Cella, D., & Sloan, J. (2008). Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. Journal of Clinical Epidemiology, 61(2), 102–109.
Turner, D., Schünemann, H. J., Griffith, L. E., Beaton, D. E., Griffiths, A. M., Critch, J. N., & Guyatt, G. H. (2010). The minimal detectable change cannot reliably replace the minimal important difference. Journal of Clinical Epidemiology, 63(1), 28–36.
Schünemann, H. J., Akl, E. A., & Guyatt, G. H. (2006). Interpreting the results of patient reported outcome measures in clinical trials: The clinician’s perspective. Health and Quality of Life Outcomes, 4, 62.
Brozek, J. L., Guyatt, G. H., & Schünemann, H. J. (2006). How a well-grounded minimal important difference can enhance transparency of labelling claims and improve interpretation of a patient reported outcome measure. Health and Quality of Life Outcomes, 4, 69.
Coon, C. D., & Cook, K. F. (2018). Moving from significance to real-world meaning: Methods for interpreting change in clinical outcome assessment scores. Quality of Life Research, 27(1), 33–40.
Crosby, R. D., Kolotkin, R. L., & Williams, G. R. (2003). Defining clinically meaningful change in health-related quality of life. Journal of Clinical Epidemiology, 56(5), 395–407.
de Vet, H. C. W., Terwee, C. B., Mokkink, L. B., & Knol, D. L. (2011). Measurement in medicine. Cambridge: Cambridge University Press.
Cella, D., Hahn, E. A., & Dineen, K. (2002). Meaningful change in cancer-specific quality of life scores: Differences between improvement and worsening. Quality of Life Research, 11(3), 207–221.
Conijn, A. P., Jonkers, W., Rouwet, E. V., Vahl, A. C., Reekers, J. A., & Koelemay, M. J. (2015). Introducing the concept of the minimally important difference to determine a clinically relevant change on patient-reported outcome measures in patients with intermittent claudication. Cardiovascular and Interventional Radiology, 38(5), 1112–1118.
Hendrikx, J., Fransen, J., Kievit, W., & van Riel, P. L. (2015). Individual patient monitoring in daily clinical practice: A critical evaluation of minimal important change. Quality of Life Research, 24(3), 607–616.
Griffiths, P., Williams, A., Brohan, E., & Cocks, K. (2019). Understanding the role of anchor correlations in the calculation of meaningful change thresholds for health-related quality of life research. Value Health, 22, S826.
Terluin, B., Eekhout, I., Terwee, C. B., & De Vet, H. C. W. (2015). from https://www.jclinepi.com/article/S0895-4356(15)00160-2/fulltext
Guyatt, G. H., Norman, G. R., Juniper, E. F., & Griffith, L. E. (2002). A critical look at transition ratings. Journal of Clinical Epidemiology, 55(9), 900–908.
Carragee, E. J. (2010). The rise and fall of the minimum clinically important difference. The Spine Journal, 10(4), 283–284.
Cook, K. F., Kallen, M. A., Coon, C. D., Victorson, D., & Miller, D. M. (2017). Idio Scale Judgment: Evaluation of a new method for estimating responder thresholds. Quality of Life Research, 26(11), 2961–2971.
Thissen, D., Liu, Y., Magnus, B., Quinn, H., Gipson, D. S., Dampier, C., Huang, I. C., Hinds, P. S., Selewski, D. T., Reeve, B. B., Gross, H. E., & DeWalt, D. A. (2016). Estimating minimally important difference (MID) in PROMIS pediatric measures using the scale-judgment method. Quality of Life Research, 25(1), 13–23.
Staunton, H., Willgoss, T., Nelsen, L., Burbridge, C., Sully, K., Rofail, D., & Arbuckle, R. (2019). An overview of using qualitative techniques to explore and define estimates of clinically important change on clinical outcome assessments. Journal of Patient Reported Outcomes, 3(1), 16.
Terwee, C. B., Jansma, E. P., Riphagen, I. I., & de Vet, H. C. (2009). Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Quality of Life Research, 18(8), 1115–1123.
Kazmers, N. H., Qiu, Y., Yoo, M., Stephens, A. R., Tyser, A. R., & Zhang, Y. (2020). The minimal clinically important difference of the PROMIS and QuickDASH instruments in a nonshoulder hand and upper extremity patient population. Journal of Hand Surgery, 45(5), 399-407.e396.
Yost, K. J., Eton, D. T., Garcia, S. F., & Cella, D. (2011). Minimally important differences were estimated for six patient-reported outcomes measurement information system-cancer scales in advanced-stage cancer patients. Journal of Clinical Epidemiology, 64(5), 507–516.
Amtmann, D., Kim, J., Chung, H., Askew, R. L., Park, R., & Cook, K. F. (2016). Minimally important differences for patient reported outcomes measurement information system pain interference for individuals with back pain. Journal of Pain Research, 9, 251–255.
Bernstein, D. N., Houck, J. R., Mahmood, B., & Hammert, W. C. (2019). Minimal clinically important differences for PROMIS physical function, upper extremity, and pain interference in carpal tunnel release using region- and condition-specific PROM tools. Journal of Hand Surgery, 44(8), 635–640.
Chen, C. X., Kroenke, K., Stump, T. E., Kean, J., Carpenter, J. S., Krebs, E. E., Bair, M. J., Damush, T. M., & Monahan, P. O. (2018). Estimating minimally important differences for the PROMIS pain interference scales: Results from 3 randomized clinical trials. Pain, 159(4), 775–782.
Gausden, E. B., Levack, A., Nwachukwu, B. U., Sin, D., Wellman, D. S., & Lorich, D. G. (2018). Computerized adaptive testing for patient reported outcomes in ankle fracture surgery. Foot and Ankle International, 39(10), 1192–1198.
Halperin, D. M., Huynh, L., Beaumont, J. L., Cai, B., Bhak, R. H., Narkhede, S., Totev, T., Duh, M. S., Neary, M. P., & Cella, D. (2019). Assessment of change in quality of life, carcinoid syndrome symptoms and healthcare resource utilization in patients with carcinoid syndrome. BMC Cancer, 19(1), 274.
Hays, R. D., Spritzer, K. L., Fries, J. F., & Krishnan, E. (2015). Responsiveness and minimally important difference for the patient-reported outcomes measurement information system (PROMIS) 20-item physical functioning short form in a prospective observational study of rheumatoid arthritis. Annals of the Rheumatic Diseases, 74(1), 104–107.
Hays, R. D., Spritzer, K. L., Sherbourne, C. D., Ryan, G. W., & Coulter, I. D. (2019). Group and individual-level change on health-related quality of life in chiropractic patients with chronic low back or neck pain. Spine, 44(9), 647–651.
Hung, M., Baumhauer, J. F., Licari, F. W., Voss, M. W., Bounsanga, J., & Saltzman, C. L. (2019). PROMIS and FAAM minimal clinically important differences in foot and ankle orthopedics. Foot and Ankle International, 40(1), 65–73.
Hung, M., Bounsanga, J., Voss, M. W., & Saltzman, C. L. (2018). Establishing minimum clinically important difference values for the patient-reported outcomes measurement information system physical function, hip disability and osteoarthritis outcome score for joint reconstruction, and knee injury and osteoarthritis outcome score for joint reconstruction in orthopaedics. World Journal of Orthopedics, 9(3), 41–49.
Kazmers, N. H., Hung, M., Bounsanga, J., Voss, M. W., Howenstein, A., & Tyser, A. R. (2019). Minimal clinically important difference after carpal tunnel release using the PROMIS platform. Journal of Hand Surgery, 44(11), 947–953.
Kenney, R. J., Houck, J., Giordano, B. D., Baumhauer, J. F., Herbert, M., & Maloney, M. D. (2019). Do patient reported outcome measurement information system (PROMIS) scales demonstrate responsiveness as well as disease-specific scales in patients undergoing knee arthroscopy? American Journal of Sports Medicine, 47(6), 1396–1403.
Khanna, D., Hays, R. D., Shreiner, A. B., Melmed, G. Y., Chang, L., Khanna, P. P., Bolus, R., Whitman, C., Paz, S. H., Hays, T., Reise, S. P., & Spiegel, B. (2017). Responsiveness to change and minimally important differences of the patient-reported outcomes measurement information system gastrointestinal symptoms scales. Digestive Diseases and Sciences, 62(5), 1186–1192.
Kroenke, K., Yu, Z., Wu, J., Kean, J., & Monahan, P. O. (2014). Operating characteristics of PROMIS four-item depression and anxiety scales in primary care patients with chronic pain. Pain Medicine, 15(11), 1892–1901.
Lapin, B., Thompson, N. R., Schuster, A., & Katzan, I. L. (2019). Clinical utility of patient-reported outcome measurement information system domain scales. Circulation Cardiovascular Quality and Outcomes, 12(1), e004753.
Lee, A. C., Driban, J. B., Price, L. L., Harvey, W. F., Rodday, A. M., & Wang, C. (2017). Responsiveness and minimally important differences for 4 patient-reported outcomes measurement information system short forms: Physical function, pain interference, depression, and anxiety in knee osteoarthritis. The Journal of Pain, 18(9), 1096–1110.
Morgan, E. M., Mara, C. A., Huang, B., Barnett, K., Carle, A. C., Farrell, J. E., & Cook, K. F. (2017). Establishing clinical meaning and defining important differences for patient-reported outcomes measurement information system (PROMIS®) measures in juvenile idiopathic arthritis using standard setting with patients, parents, and providers. Quality of Life Research, 26(3), 565–586.
Purvis, T. E., Andreou, E., Neuman, B. J., Riley, L. H., 3rd., & Skolasky, R. L. (2017). Concurrent validity and responsiveness of PROMIS health domains among patients presenting for anterior cervical spine surgery. Spine, 42(23), E1357–E1365.
Purvis, T. E., Neuman, B. J., Riley, L. H., 3rd., & Skolasky, R. L. (2018). Discriminant ability, concurrent validity, and responsiveness of PROMIS health domains among patients with lumbar degenerative disease undergoing decompression with or without arthrodesis. Spine, 43(21), 1512–1520.
Sandvall, B., Okoroafor, U. C., Gerull, W., Guattery, J., & Calfee, R. P. (2019). Minimal clinically important difference for PROMIS physical function in patients with distal radius fractures. Journal of Hand Surgery, 44(6), 454-459.e451.
Schwartz, C. E., Zhang, J., Rapkin, B. D., & Finkelstein, J. A. (2019). Reconsidering the minimally important difference: Evidence of instability over time and across groups. The Spine Journal, 19(4), 726–734.
Shahgholi, L., Yost, K. J., & Kallmes, D. F. (2012). Correlation of the National Institutes of Health patient reported outcomes measurement information system scales and standard pain and functional outcomes in spine augmentation. AJNR American Journal of Neuroradiology, 33(11), 2186–2190.
Stephan, A., Mainzer, J., Kümmel, D., & Impellizzeri, F. M. (2019). Measurement properties of PROMIS short forms for pain and function in orthopedic foot and ankle surgery patients. Quality of Life Research, 28(10), 2821–2829.
Donovan, L. M., Yu, L., Bertisch, S. M., Buysse, D. J., Rueschman, M., & Patel, S. R. (2020). Responsiveness of patient-reported outcomes to treatment among patients with type 2 diabetes mellitus and OSA. Chest, 157(3), 665–672.
Katz, P., Kannowski, C. L., Sun, L., & Michaud, K. (2020). Estimation of minimally important differences and patient acceptable symptom state scores for the patient-reported outcomes measurement information system pain interference short form in rheumatoid arthritis. ACR Open Rheumatology, 2(6), 320–329.
Katz, P., Pedro, S., Alemao, E., Yazdany, J., Dall’Era, M., Trupin, L., Rush, S., & Michaud, K. (2020). Estimates of responsiveness, minimally important differences, and patient acceptable symptom state in five patient-reported outcomes measurement information system short forms in systemic lupus erythematosus. ACR Open Rheumatology, 2(1), 53–60.
Khalil, L. S., Darrith, B., Franovic, S., Davis, J. J., Weir, R. M., & Banka, T. R. (2020). Patient-reported outcomes measurement information system (PROMIS) global health short forms demonstrate responsiveness in patients undergoing knee arthroplasty. Journal of Arthroplasty, 35(6), 1540–1544.
Kroenke, K., Baye, F., & Lourens, S. G. (2019). Comparative responsiveness and minimally important difference of common anxiety measures. Medical Care, 57(11), 890–897.
Kroenke, K., Stump, T. E., Chen, C. X., Kean, J., Bair, M. J., Damush, T. M., Krebs, E. E., & Monahan, P. O. (2020). Minimally important differences and severity thresholds are estimated for the PROMIS depression scales from three randomized clinical trials. Journal of Affective Disorders, 266, 100–108.
Kroenke, K., Stump, T. E., Kean, J., Talib, T. L., Haggstrom, D. A., & Monahan, P. O. (2019). PROMIS 4-item measures and numeric rating scales efficiently assess SPADE symptoms compared with legacy measures. Journal of Clinical Epidemiology, 115, 116–124.
Lawrie, C. M., Abu-Amer, W., Barrack, R. L., & Clohisy, J. C. (2020). Is the patient-reported outcome measurement information system feasible in bundled payment for care improvement in total hip arthroplasty patients? Journal of Arthroplasty, 35(5), 1179–1185.
Lee, D. J., & Calfee, R. P. (2019). The minimal clinically important difference for PROMIS physical function in patients with thumb carpometacarpal arthritis. Hand (N Y), Oct 18 [Online, ahead of print].
Steinhaus, M. E., Iyer, S., Lovecchio, F., Khechen, B., Stein, D., Ross, T., Yang, J., Singh, K., Albert, T. J., Lebl, D., Huang, R., Sandhu, H., Rawlins, B., Schwab, F., Lafage, V., & Kim, H. J. (2019). Minimal clinically important difference and substantial clinical benefit using PROMIS CAT in cervical spine surgery. Clinical Spine Surgery, 32(9), 392–397.
Bartlett, S. J., Gutierrez, A. K., Andersen, K. M., Bykerk, V. P., Curtis, J. R., Haque, U. J., Orbai, A. M., Jones, M. R., & Bingham, C. O., 3rd. (2020). Identifying minimal and meaningful change in PROMIS(®) for rheumatoid arthritis: Use of multiple methods and perspectives. Arthritis Care and Research. https://doi.org/10.1002/acr.24501
Beaumont, J. L., Davis, E. S., Fries, J. F., Curtis, J. R., Cella, D., & Yun, H. (2021). Meaningful change thresholds for patient-reported outcomes measurement information system (PROMIS) fatigue and pain interference scores in patients with rheumatoid arthritis. Journal of Rheumatology. https://doi.org/10.3899/jrheum.200990
Bingham, C. O., Butanis, A. L., Orbai, A. M., Jones, M., Ruffing, V., Lyddiatt, A., Schrandt, M. S., Bykerk, V. P., Cook, K. F., & Bartlett, S. J. (2021). Patients and clinicians define symptom levels and meaningful change for PROMIS pain interference and fatigue in RA using bookmarking. Rheumatology. https://doi.org/10.1093/rheumatology/keab014
Forlenza, E. M., Lu, Y., Cohn, M. R., Baker, J., Lavoie-Gagne, O., Yanke, A. B., Cole, B. J., Verma, N. N., & Forsythe, B. (2021). Establishing clinically significant outcomes for patient-reported outcomes measurement information system after biceps tenodesis. Arthroscopy, 37(6), 1731–1739.
Haunschild, E. D., Condron, N. B., Gilat, R., Fu, M. C., Wolfson, T., Garrigues, G. E., Nicholson, G., Forsythe, B., Verma, N., & Cole, B. J. (2021). Establishing clinically significant outcomes of the PROMIS upper extremity questionnaire after primary reverse total shoulder arthroplasty. Journal of Shoulder and Elbow Surgery, S1058–2746(21), 00355–00364.
Haunschild, E. D., Gilat, R., Fu, M. C., Tauro, T., Huddleston, H. P., Yanke, A. B., Forsythe, B., Verma, N. N., & Cole, B. J. (2020). Establishing the minimal clinically important difference, patient acceptable symptomatic state, and substantial clinical benefit of the PROMIS upper extremity questionnaire after rotator cuff repair. American Journal of Sports Medicine, 48(14), 3439–3446.
Ibaseta, A., Rahman, R., Andrade, N. S., Skolasky, R. L., Kebaish, K. M., Sciubba, D. M., & Neuman, B. J. (2021). Determining validity, discriminant ability, responsiveness, and minimal clinically important differences for PROMIS in adult spinal deformity. Journal of Neurosurgery Spine. https://doi.org/10.3171/2020.8.SPINE191551
Kazmers, N. H., Qiu, Y., Ou, Z., Presson, A. P., Tyser, A. R., & Zhang, Y. (2021). Minimal clinically important difference of the PROMIS upper-extremity computer adaptive test and QuickDASH for ligament reconstruction tendon interposition patients. Journal of Hand Surgery, 46(6), 516–516.
Kazmers, N. H., Qiu, Y., Yoo, M., Stephens, A. R., Zeidan, M., & Zhang, Y. (2021). Establishing the minimal clinically important difference for the PROMIS upper extremity computer adaptive test version 2.0 in a nonshoulder hand and upper extremity population. Journal of Hand Surgery, S0363–5023(21), 00093.
Khutok, K., Janwantanakul, P., Jensen, M. P., & Kanlayanaphotporn, R. (2021). Responsiveness of the PROMIS-29 scales in individuals with chronic low back pain. Spine, 46(2), 107–113.
Kuhns, B. D., Reuter, J., Lawton, D., Kenney, R. J., Baumhauer, J. F., & Giordano, B. D. (2020). Threshold values for success after hip arthroscopy using the patient-reported outcomes measurement information system assessment: Determining the minimum clinically important difference and patient acceptable symptomatic state. American Journal of Sports Medicine, 48(13), 3280–3287.
Silverberg, J. I., Lai, J. S., & Cella, D. (2021). Reliability and meaningful change of the patient-reported outcomes measurement information system(®) itch questionnaire (PIQ) item banks in adults with atopic dermatitis. British Journal of Dermatology. https://doi.org/10.1111/bjd.20066
Smit, E. B., Bouwstra, H., Roorda, L. D., van der Wouden, J. H. C., Wattel, E. L. M., Hertogh, C., & Terwee, C. B. (2021). A patient-reported outcomes measurement information system short form for measuring physical function during geriatric rehabilitation: Test-retest reliability, construct validity, responsiveness, and interpretability. Journal of the American Medical Directors Association, S1525–8610(21), 00141–00149.
Speck, R. M., Ye, X., Bernthal, N. M., & Gelhorn, H. L. (2020). Psychometric properties of a custom patient-reported outcomes measurement information system (PROMIS) physical function short form and worst stiffness numeric rating scale in tenosynovial giant cell tumors. Journal of Patient Reported Outcomes, 4(1), 61.
Bongers, M. E. R., Groot, O. Q., Thio, Q., Bramer, J. A. M., Verlaan, J. J., Newman, E. T., Raskin, K. A., Lozano-Calderon, S. A., & Schwab, J. H. (2021). Prospective study for establishing minimal clinically important differences in patients with surgery for lower extremity metastases. Acta Oncologica. https://doi.org/10.1080/0284186X.2021.1890333
http://www.healthmeasures.net/score-and-interpret/interpret-scores/promis
Terwee, C. B., Roorda, L. D., Dekker, J., Bierma-Zeinstra, S. M., Peat, G., Jordan, K. P., Croft, P., & de Vet, H. C. (2010). Mind the MIC: Large variation among populations and methods. Journal of Clinical Epidemiology, 63(5), 524–534.
Hao, Q., Devji, T., Zeraatkar, D., Wang, Y., Qasim, A., Siemieniuk, R. A. C., Vandvik, P. O., Lähdeoja, T., Carrasco-Labra, A., Agoritsas, T., & Guyatt, G. (2019). Minimal important differences for improvement in shoulder condition patient-reported outcomes: A systematic review to inform a BMJ rapid recommendation. British Medical Journal Open, 9(2), 028777.
Hanmer, J., Jensen, R. E., & Rothrock, N. (2020). A reporting checklist for HealthMeasures’ patient-reported outcomes: ASCQ-Me, Neuro-QoL, NIH Toolbox, and PROMIS. Journal of Patient Reported Outcomes, 4(1), 21.
Cook, K. F., Cella, D., & Reeve, B. B. (2019). PRO-bookmarking to estimate clinical thresholds for patient-reported symptoms and function. Medical Care, 57(Suppl 5), S13–S17.
de Vet, H. C., Terluin, B., Knol, D. L., Roorda, L. D., Mokkink, L. B., Ostelo, R. W., Hendriks, E. J., Bouter, L. M., & Terwee, C. B. (2010). Three ways to quantify uncertainty in individually applied minimally important change values. Journal of Clinical Epidemiology, 63(1), 37–45.
Stephan, A., Mainzer, J., Kummel, D., & Impellizzeri, F. M. (2019). Measurement properties of PROMIS short forms for pain and function in orthopedic foot and ankle surgery patients. Quality of Life Research, 28(10), 2821–2829.

Funding

No funding was received for conducting this study.

Author information

Authors and Affiliations

Department of Epidemiology and Data Science, Amsterdam Public Health Research Institute, Amsterdam UMC, Vrije Universiteit Amsterdam, P.O. Box 7057, 1007 MB Amsterdam, The Netherlands
Caroline B. Terwee & Lidwine B. Mokkink
Department of Medical Social Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
John Devin Peipert, Robert Chapman, Jin-Shei Lai & David Cella
Department of General Practice, Amsterdam Public Health Research Institute, Amsterdam UMC, Amsterdam, The Netherlands
Berend Terluin
Patient Centered Endpoints, IQVIA, Reading, UK
Pip Griffiths

Corresponding author

Correspondence to Caroline B. Terwee.

Ethics declarations

Conflict of interest

D. Cella was a co-author of one of the included PROMIS MIC papers [44] and C. B. Terwee was a co-author of another included PROMIS MIC paper [89], but neither was involved in the data extraction of these papers. C. B. Terwee and D. Cella are board members of the PROMIS Health Organization. The other authors have no conflicts of interest to declare that are relevant to the content of this article.

Research involving human participants and/or animals

This study does not include human participants or animals.

Informed consent

Because the study does not include human participants, informed consent is not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 132 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Terwee, C.B., Peipert, J.D., Chapman, R. et al. Minimal important change (MIC): a conceptual clarification and systematic review of MIC estimates of PROMIS measures. Qual Life Res 30, 2729–2754 (2021). https://doi.org/10.1007/s11136-021-02925-y

Accepted: 21 June 2021
Published: 10 July 2021
Issue Date: October 2021
DOI: https://doi.org/10.1007/s11136-021-02925-y
