Introduction

Psoriatic arthritis (PsA) is an inflammatory arthropathy that is typically preceded by psoriasis and affects up to 30% of patients with psoriasis [1, 2]. Physical and psychosocial aspects of the disease yield a major disease burden that impacts both quality of life and work productivity [2]. Patients with PsA have high levels of unemployment and impaired work productivity (absenteeism and presenteeism) [3]. The impact of PsA on work productivity has economic consequences, especially considering that an individual’s ability to work and be productive impacts not only themselves, but also their employers and society [4]. For example, Kawalec and Malinowski reported that the cost of work disability due to PsA was $10,754 per patient per year based upon 2013 prices [5]. Importantly, such costs, whether direct or indirect, have been demonstrated to be associated with disease activity and physical function [6].

The Work Productivity and Activity Impairment: Specific Health Problem Questionnaire (WPAI:SHP) is a version of WPAI [7] that can be modified for use with a specific disease such as PsA [8]. Patients are asked to quantify how their PsA specifically impacts their ability to work, and the questionnaire has been used in randomized controlled trials [9, 10] and observational studies [11]. Unfortunately, studies reporting changes or improvements in the domains of WPAI:SHP by patients with PsA have a limited threshold of meaning due to the absence of published minimal clinically important differences (MCIDs). MCIDs quantify the minimal improvement in an outcome that is perceived by patients as being beneficial, thereby translating patient-reported outcomes into clinically actionable results [12]. Thus, our objective was to determine the MCIDs for improvement in WPAI:SHP as reported by patients with active PsA who participated in two clinical trials [9, 10].

Methods

Study Populations and Design

MCIDs for the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP were derived separately using the intent-to-treat populations of SPIRIT-P1 and SPIRIT-P2. These studies and their patient populations were previously described by Mease and colleagues [13] and Nash and colleagues [14]. SPIRIT-P1 and SPIRIT-P2 were multicenter, randomized, double-blind, placebo-controlled phase 3 trials. SPIRIT-P1 enrolled biologic-naïve patients. SPIRIT-P2 enrolled tumor necrosis factor inhibitor (TNFi)-experienced patients who were either inadequate responders or intolerant to TNFi therapies. Patients participating in the trials were 18 years or older, diagnosed with active PsA at least 6 months prior, met the classification criteria for psoriatic arthritis (CASPAR) criteria (at least 3/68 tender and 3/66 swollen joints), and had either a documented history of plaque psoriasis or active psoriatic skin lesions. Patients participating in SPIRIT-P1 were randomized 1:1:1:1 to receive placebo, 80 mg ixekizumab every 4 weeks (IXEQ4W), 80 mg ixekizumab every 2 weeks (IXEQ2W), or 40 mg adalimumab every 2 weeks. Patients participating in SPIRIT-P2 were randomized 1:1:1 to receive placebo, IXEQ4W, or IXEQ2W. Patients receiving ixekizumab received a 160-mg starting dose in both trials. Adalimumab was included as the active-reference arm in SPIRIT-P1. Both SPIRIT-P1 and SPIRIT-P2 were conducted in accordance with the ethical principles of the Declaration of Helsinki and in compliance with local laws and regulations. All participants provided informed consent. SPIRIT-P1 and SPIRIT-P2 protocols and consent forms were approved by each site’s institutional review board or ethics committee, including the Western Institutional Review Board for SPIRIT-P1 and the Bellberry Human Research Ethics Committee for SPIRIT-P2. The individual SPIRIT-P1 and SPIRIT-P2 sites are listed in the primary manuscript supplements [13, 14]. Both studies were registered on ClinicalTrials.gov (SPIRIT-P1: NCT01695239, SPIRIT-P2: NCT02349295).

WPAI:SHP

WPAI:SHP [8] (V2.0) was administered during the double-blind treatment period (weeks 0–24) at prespecified time points, and continued in the extension periods. In the questionnaire template, “PROBLEM” was replaced with “psoriatic arthritis.” The questionnaire consisted of six questions assessing employment status, hours missed from work due to PsA, hours missed from work due to other reasons, actual hours worked, the impact of PsA on work productivity, and the impact of PsA on activities outside of work. The responses to these questions were used to derive four scores: percentage of absenteeism, percentage of presenteeism, work productivity loss (incorporates absenteeism and presenteeism), and the percentage of activity impairment outside of work. Higher scores indicate a higher degree of impairment.

Statistical Analyses

MCIDs for the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP through week 24 were derived using the anchor-based method supplemented by the distribution-based method in accordance with FDA guidance [15] and Copay et al. [16]. The patient populations of SPIRIT-P1 and SPIRIT-P2 were individually pooled and kept separate for all analyses. Anchor variables included the American College of Rheumatology (ACR) 20/50/70 responder indices, the minimal disease activity (MDA) [17], and the Health Assessment Questionnaire Disability Index (HAQ-DI) MCID (improvement ≥ 0.35) [13]. Efficacy results for ACR20 [13, 14], ACR50 [13, 14], ACR70 [13, 14], MDA [14, 18], and HAQ-DI MCID [13, 14] were previously published. Per FDA guidance [15], a valid anchor is expected to be understandable, interpretable, and adequately associated with the WPAI:SHP instrument. ACR20/50/70, MDA, and HAQ-DI were considered to be understood and interpretable by rheumatologists for the purposes of these analyses. To validate our anchor choices in preparation for anchor-based analyses, we evaluated associations between WPAI:SHP domain scores and anchors using biserial correlation, logistic modeling, and analysis of covariance (ANCOVA). A threshold of 0.371 signified a large effect for the biserial correlation analysis [19]. The concordance index (ranging from 0.5 to 1.0) from a logistic model is another metric that was used to quantify the association between anchor and WPAI:SHP domains. The larger the concordance index, the stronger the association. The ANCOVA models were employed to demonstrate a significantly greater improvement in the WPAI:SHP domains among patients who met the anchor than among those who did not meet the anchor. For the anchor-based method, we utilized the receiver operating characteristic (ROC) method to identify cutoffs for each WPAI:SHP domain that best differentiated each anchor. When a cutoff is set too low, we would expect to observe a high sensitivity and a negative predictive value. On the other hand, when a cutoff is set too high, both the sensitivity and the negative prediction value decrease while the specificity and positive predictive value increase. An ideal cutoff should balance the tradeoff while being greater than the half the standard deviation or SEM recommended by the distribution-based method. The distribution method used measures of instrument variability, including the standard error of measurement with an assessment of the reliability coefficient with a lower bound of 0.7, as reported by Crawford and colleagues [20], and multiplying by half of the standard deviation [21]. The results of the anchor- and distribution-based method were used to triangulate MCIDs for the domains of WPAI:SHP [22]. Analyses were performed using SAS (version 9.4).

Results

WPAI:SHP estimates were derived using previously published results [9, 10]. The intent-to-treat populations of SPIRIT-P1 and SPIRIT-P2 included 417 biologic-naïve and 363 TNFi-experienced patients with PsA, respectively. Baseline demographics and clinical characteristics were previously published for these patients [13, 14], and are provided in Table 1.

Table 1 Baseline demographics and clinical characteristics of the biologic-naïve and TNFi-experienced (inadequate responders to TNFi or TNFi-intolerant patients) patient populations

It was hypothesized that ACR20, ACR50, ACR70, MDA, and HAQ-DI MCID were valid anchors to use to estimate the MCIDs of WPAI:SHP using the anchor-based method. These were selected as candidate anchors because they are well understood and easy to interpret. To determine anchor validity, we tested the anchors by correlation (Fig. 1a–c) and logistic regression (Fig. 2a–c) modeling against the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP in the biologic-naïve and TNFi-experienced populations. The hypothesized anchors were found to be valid by both correlation and logistic regression analyses for both patient populations.

Fig. 1
figure 1

Anchor evaluation by biserial correlation analyses of ACR20, ACR50, ACR70, MDA, and HAQ-DI MCID (improvement ≥ 0.35) for the presenteeism (a), work productivity loss (b), and activity impairment (c) domains of WPAI:SHP in patients with PsA who were biologic naïve or TNFi experienced (inadequate responders to TNFi or TNFi-intolerant patients). The dotted line corresponds to a correlation coefficient threshold of 0.371. HAQ-DI Health Assessment Questionnaire and Disability Index, MCID minimal clinically important difference, MDA minimal disease activity, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment:Specific Health Problem Questionnaire

Fig. 2
figure 2

Anchor evaluation by logistic modeling of ACR20, ACR50, ACR70, MDA, and HAQ-DI MCID for the presenteeism (a), work productivity loss (b), and activity impairment (c) domains of WPAI:SHP in patients with PsA who were biologic naïve or TNFi experienced (inadequate responders to TNFi or TNFi-intolerant patients). HAQ-DI Health Assessment Questionnaire and Disability Index, MDA minimal disease activity, MCID minimal clinically important difference, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment: Specific Health Problem Questionnaire

Anchors were then investigated further using an ANCOVA model to determine if there were statistically significant differences between patients who met anchor clinical thresholds and those who did not for the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP (Fig. 3a–c). Statistically significant differences were observed for all investigated anchors across the WPAI:SHP domains in the biologic-naïve and TNFi-experienced populations.

Fig. 3
figure 3

Anchor evaluation by analysis of covariance (ANCOVA) modeling for the presenteeism (a), work productivity loss (b), and activity impairment (c) domains of WPAI:SHP in biologic-naïve and TNFi-experienced patients (inadequate responders to TNFi or TNFi-intolerant patients). Domain change from baseline at week 24 was stratified by anchor achievement status, adjusting for baseline WPAI:SHP domain score. All comparisons between those who met and those who did not meet the anchors within a population group had P < 0.001. HAQ-DI Health Assessment Questionnaire and Disability Index, MCID minimal clinically important difference, MDA minimal disease activity, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment: Specific Health Problem Questionnaire

Using multiple validated anchors, ROC analyses were performed to identify a meaningful improvement in a WPAI:SHP domain that was best associated with the anchor status (Figs. 4, 5, 6). When ACR20, MDA, and HAQ-DI MCID were incorporated into ROC analyses, the suggested MCID for presenteeism was 20%, that for work productivity loss was 15%, and that for activity impairment was 20% for any of the tested anchors in the biologic-naïve and TNFi-experienced populations. The observed consistency across anchors highlights the robustness of these MCID estimates. In addition, these results are supported by the distribution-based method, since both half standard deviation and standard error of measurement estimates for the work productivity loss and activity impairment domains in the two populations indicate that improvements of greater than approximately 12–15% would be considered statistically significant (Table 2).

Fig. 4
figure 4

Responder definitions of the WPAI:SHP presenteeism domain as determined by the receiver operating characteristic (ROC) method using WPAI:SHP domain changes from baseline at week 24 for ACR20 (a), MDA (b), and HAQ-DI MCID (c) in biologic-naïve patients and ACR20 (d), MDA (e), and HAQ-DI MCID (f) in TNFi-experienced patients (inadequate responders to TNFi or TNFi-intolerant patients). MCIDs were derived from the region reflecting a balance between specificity and sensitivity, with the lower limit defined by half the standard deviation of the distribution-based method. HAQ-DI Health Assessment Questionnaire and Disability Index, MCID minimal clinically important difference, MDA minimal disease activity, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment: Specific Health Problem Questionnaire

Fig. 5
figure 5

Responder definitions of the WPAI:SHP work productivity loss domain as determined by the receiver operating characteristic (ROC) method using WPAI:SHP domain changes from baseline at week 24 for ACR20 (a), MDA (b), and HAQ-DI MCID (c) in biologic-naïve patients and ACR20 (d), MDA (e), and HAQ-DI MCID (f) in TNFi-experienced patients (inadequate responders to TNFi or TNFi-intolerant patients). MCIDs were derived from the region reflecting a balance between specificity and sensitivity, with the lower limit defined by half the standard deviation of the distribution-based method. HAQ-DI Health Assessment Questionnaire and Disability Index, MCID minimal clinically important difference, MDA minimal disease activity, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment: Specific Health Problem Questionnaire

Fig. 6
figure 6

Responder definitions of the WPAI:SHP activity impairment domain as determined by the receiver operating characteristic (ROC) method using WPAI:SHP domain changes from baseline at week 24 for ACR20 (a), MDA (b), and HAQ-DI MCID (c) in biologic-naïve patients and ACR20 (d), MDA (e), and HAQ-DI MCID (f) in TNFi-experienced patients (inadequate responders to TNFi or TNFi-intolerant patients). MCIDs were derived from the region reflecting a balance between specificity and sensitivity, with the lower limit defined by half the standard deviation of the distribution-based method. HAQ-DI Health Assessment Questionnaire and Disability Index, MCID minimal clinically important difference, MDA minimal disease activity, TNFi tumor necrosis factor inhibitor, WPAI:SHP Work Productivity and Activity Impairment: Specific Health Problem Questionnaire

Table 2 Distribution-based method results for the work productivity loss and activity impairment domains of WPAI:SHP in biologic-naïve or TNFi-experienced patients (inadequate responders to TNFi or TNFi-intolerant patients) with PsA

Discussion

Using data from randomized controlled trials, MCIDs for the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP in patients with PsA were estimated using the anchor-based method with support from the distribution-based method. A 15% improvement was estimated to be the MCID for work productivity loss in either the biologic-naïve or TNFi-experienced population. A 15% improvement in work productivity loss translates into a gain of $7593 per patient per year in indirect costs, assuming a patient earns the 2017 mean US annual salary ($50,620) and that the treatment effect is maintained for 1 year [23, 24]. A 20% improvement was estimated to be the MCID for presenteeism and activity impairment in either the biologic-naïve or TNFi-experienced population. This estimate for activity impairment translates into a 20% improvement in non-work-related activity productivity. Importantly, we highlight that these estimates were consistent in the two patient populations.

The availability of these estimates will provide clinicians and payers investigating the impact of treatments on presenteeism, work productivity loss, and activity impairment with a greater threshold of meaning. This is important given that trials of novel therapies for PsA have used WPAI:SHP to measure the effect of treatment on presenteeism, work productivity, and activity impairment, and have reported changes for presenteeism, work productivity loss, and activity impairment but, to date, have not had a PsA-specific MCID to benchmark [9, 10]. The use of MCIDs is paramount since MCIDs provide clinicians and payers with thresholds that identify minimum changes due to a treatment or intervention that are viewed by patients as being beneficial [16]. In addition, these MCIDs will address recent FDA guidance recommending the use of MCIDs or “responder definitions” for the interpretation of patient-reported outcome results from clinical trials [15].

Recently Wu and colleagues reported MCIDs for the work productivity loss and activity impairment domains of WPAI for psoriasis (WPAI:PsO) in patients with psoriasis [25]. The MCID estimates for work productivity loss and activity impairment in patients with psoriasis were both 20% [25]. The MCID estimate for activity impairment was identical to our estimate for patients with PsA, while the estimate for work productivity loss was 5% less. The differences observed in the estimates for work productivity loss were not unexpected given that patients with PsA have major joint symptoms that may or may not be accompanied by skin symptoms, while patients with psoriasis have major skin symptoms that may or may not be accompanied by joint symptoms. Similarly, the alignment in the estimates for activity impairment reflects the common aspects of the diseases, such as physical and psychosocial burdens or the general impact on health-related quality of life [26].

In this study, we reported estimates for the presenteeism, work productivity, and activity impairment domains of WPAI:SHP in patients with PsA using results from two clinical trials, but we acknowledge that our study has both limitations and strengths. The patient populations of the two trials were selective and consisted predominantly of patients with the polyarthritis phenotype, but this limitation is countered by the numerous strengths associated with these clinical trials, such as well-characterized patient populations, robust data collection, and the inclusion of two distinct populations of biologic-naïve and TNFi-experienced patients. In addition, this study does not report thresholds for deterioration, which may have a different magnitude of change than reported for the improvement in MCIDs. Importantly, the reported MCIDs must be applied to patient-level data, and cannot be compared to population-level data.

Conclusion

In conclusion, using clinical trial data, we estimated MCIDs for the presenteeism, work productivity loss, and activity impairment domains of WPAI:SHP in biologic-naïve and TNFi-experienced patients with PsA. The MCID estimates for biologic-naïve and TNFi-experienced patients were consistent for the two patient populations. The MCID for presenteeism was 20%, the MCID for work productivity loss improvement was 15%, and the MCID for activity impairment improvement was 20%. When applied to patient-level data, these estimates should allow clinicians to more effectively evaluate the effect of treatments on work productivity in biologic-naïve or TNFi-experienced patients with PsA.