FormalPara Key Summary Points

Why carry out this study?

Psoriasis Area and Severity Index (PASI) and Physician’s Global Assessment (PGA) are the most widely used outcome measures in clinical trials of biologics to treat psoriasis; however, these outcome measures vary in both their reliability and validity. As newer biologics approach complete clearance of psoriasis, it becomes important to have standardized, reproducible forms of measure to accurately compare treatment efficacy

How well does psoriasis clearance as measured by PASI 100 correlate with clearance measured by PGA 0 in past clinical trials, and what are the reasons for any variation?

What was learned from the study?

In this systematic review, the average PASI-PGA variation was found to be 20%, with only 2 of 45 trials demonstrating 0 variation between scores. The two scales showed a moderately positive correlation but corresponded less tightly at lower efficacy cutoffs

The PASI and PGA assessment tools show high variability when measuring the same results; this highlights the redundancy of their concurrent use and emphasizes the need for a more consistent and valid outcome measure

Introduction

A wide range of outcome measures is used to assess psoriasis severity in clinical trials. The Psoriasis Area and Severity Index (PASI) score is based on disease severity and coverage of body surface area (BSA), with a higher score corresponding to more serious conditions. PASI 100 is defined as “complete clearance” of disease, indicating 100% clearance of plaques from baseline. Similar measures that take BSA into account include the Self-Administered PASI (SAPASI) and Salford Psoriasis Index (SPI). Another widely used severity measure, the Physician’s Global Assessment (PGA), is a 6-point scale (0–5) rating overall severity of psoriasis without taking BSA into account. This system has led to many derivative scales, including the static PGA (sPGA) and the Lattice System PGA (LS-PGA). A PGA 0 is a more subjective indication of clearance. Generally, PASI is considered the gold standard of assessments; however, almost every clinical trial for psoriasis uses both PASI and PGA as outcome measures.

Previously PASI and PGA correlated tightly [1]; however, it was noted that the definition of clear/almost clear differed from study to study. As more psoriasis patients achieve PASI 100 and PGA 0/1 with newer biologics, more clinical trials are achieving clear/almost clear. Therefore, we conducted a literature survey to examine how well PASI and PGA correlate to determine if “clear” as measured by PASI 100 is “clear” measured by PGA 0.

Methods

A PubMed and clinicaltrials.gov search was conducted using search terms “Psoriasis Clinical Trial,” “Psoriasis Area and Severity Index,” “Physician’s Global Assessment,” “infliximab,” “etanercept,” “adalimumab,” “certolizumab pegol,” “ustekinumab,” “secukinumab,” “ixekizumab,” “brodalumab,” “guselkumab,” “tildrakizumab,” and “risankizumab.” Inclusion criteria were: phase 2/3 trials and use of PASI 90/100 and sPGA or PGA 0/1 as primary end points. For studies that included both PASI 100 and PASI 90, or both sPGA/PGA 0 and sPGA/PGA 0/1, the measures with lower cutoffs (such as PASI 75) were excluded from the statistical analyses. Descriptive statistics, measures of central tendency, and Pearson correlation coefficient were calculated using Microsoft Excel Datapack. Results were mostly limited to studies published within the last 15 years. A total of 33 studies were pooled and investigated, yielding 45 clinical trials (Table 1). This article is based on previously conducted studies and does not contain any studies with human participants or animals performed by any of the authors.

Table 1 PASI/PGA differences in clinical trials of biologics

Results

Eight (17.8%) clinical trials had a PASI-PGA variance of < 5%, 4 (8.9%) trials had a variance of 5–10%, and 33 (73.3%) trials had a variance of > 10% (Fig. 1). The average PASI-PGA variation for all studies was 20% (standard deviation: 15%). Only two trials, IMMvent and AMAGINE, had 0 variation between the PASI and PGA scores. In fact, 80% clinical trials on adalimumab had a variation of < 5% between the PASI and PGA scores. Other biologics that had clinical trials with < 5% PASI-PGA variation were: etanercept (0.6% and 1.3%) and guselkumab (3.1%). The greatest differences were in the IXORA-S trial and the CHAMPION trial, with a variability of 61.9% and 56.4% between PASI and PGA, respectively.

Fig. 1
figure 1

Differences in PASI 100 and PGA 0 reported in examined trials. Only 18% of studies have < 5% difference between the reported PASI 100 and PGA 0, while 74% of studies examined in this report had > 10% difference

Trials for the same biologic also had PASI-PGA variations. For example, although IXORA-S showed a high variability for ustekinumab between the two outcome measures, other trials such as the PEARL, UltMMa-1, UltMMa-2, and CLEAR trial showed much lower variances (Table 1). Due to this variation, many patients who achieved clearance under one assessment did not reach it by the other method.

Further analysis revealed a significant, moderately positive correlation between PASI and PGA scores (Pearson coefficient = 0.64, r2 = 0.41, p < 0.001). The two scales tended to correspond fairly well except at the extremes of therapeutic efficacy (Fig. 2), especially at the lower cutoffs.

Fig. 2
figure 2

Correlation between PASI 90/10 and PGA 0/1 for clinical trials evaluating psoriasis severity. A weakly positive correlation exists between PASI and PGA scores, with the strongest correlation occurring within therapeutic range and falling off towards the extremes

Discussion

The wide range of variability between PASI and PGA scores between trials highlights their lack of standardization. A reliable outcomes measure is needed to better interpret disease severity and improvement, especially as better biologics are developed. The lack of a valid and consistent outcome measure makes it difficult to compare drugs and perform cross-analysis of different clinical trials.

Both PASI and PGA have shortcomings that increase variability, contributing to their apparent differences in clinical trials. Some of the score differences may stem from the PASI 90/100 being a relative parameter (indicating improvement) while the PGA is a measure of absolute severity. However, several other inherent qualities of the PASI and PGA scores further explain the observed discrepancies. The PASI score has difficulty in assessing the percentage of affected skin, inability to separate milder cases, lack of linearity, and lack of sensitivity [2]. There is also high variability in calculations for BSA for what is considered “high” or “full clearance.” Common BSA estimation methods, such as the “rule of nines,” may overestimate BSA affected by psoriasis, leading to score variability among inexperienced scorers [3]. The PGA also has shortcomings. Though easier to perform, PGA lacks clear definitions, leading to higher intra- and inter-grader variation and low score reproducibility [4]. Furthermore, both PASI and PGA lack patient perspective, ignoring common symptoms such as pain and pruritus. Patients who respond to treatment without complete clearance may still experience substantial symptoms of psoriasis that interfere with quality of life [5]. Due to these inherent qualities, it becomes almost inevitable that usage of PASI and PGA scores would contribute variation to a study when used as measures of clearance.

Though both the PASI and PGA are not perfect assessments, an objective measure of psoriasis severity is necessary. These measures act as an end goal for both patients and physicians, where achieving full clearance (a PASI 100 or PGA 0) is a goal to work towards. However, they should be regarded as a standard measure for physicians to rate visible signs of disease rather than a standard measure of overall disease severity. As the PASI is almost used as a universal measure, it can continue to be utilized as long as clinicians can objectively measure disease severity with similar techniques. Techniques such as plaque tracing and point counting grids may aid in making these calculations more accurate and reduce score variation [3].

A systematic review of six severity scores (PASI, BSA, PGA, LS-PGA, SPI, and SAPASI) concluded that none were ideal for evaluating psoriasis, but the PASI score was the most thoroughly studied and validated [6]. The use of both PASI and PGA together has been deemed redundant. Our findings support this conclusion by demonstrating that the PASI and PGA show great variability when measuring the same results and their concurrent use only makes analyses more complicated. We also corroborated previous findings that PASI and PGA scores correlate fairly well across the range of what is considered in therapeutic efficacy [1]. Outside of this range, we hypothesize the variation from subjective score calculations such as percentage clearance and BSA affected, increased as differences became more difficult to elucidate. These fluctuations, combined with fewer data points at the therapeutic extremes, would be reflected differently in each score and contributed to the decrease in correlation.

Dermatologists often reference and cite the clearance rates from clinical trials to patients when deciding on which biologic to use. If estimations of complete clearance continue to have high variability, it would be difficult to compare differences in outcomes between treatments. Concrete steps to address this include clarifying the definition of end points, such as what constitutes “clear” versus what is “almost clear.” It may also be useful to account for patient considerations. Patients often aim at achieving “almost cleared” more than “complete clearance” and prioritize having certain body areas cleared more than others [7]. Improving the accuracy of BSA calculations can also reduce variability within current measures. Finally, it is important to develop and test new methods of severity assessment. While multiple new scales have been created, none are widely used.

These results suggest that further research is needed to determine the most appropriate and sensitive parameters for measuring biologic therapeutic efficacy. In terms of clinical trials, the use of a single standard measure would make it easier to make meaningful comparisons of clearance results across multiple trials. Standardization of BSA estimation, accurately capturing patient-oriented measures and combining objective and subjective assessments may reduce variability in evaluation of disease severity.