Introduction

With an estimated prevalence of 10.6–12.4 cases per million [1], pulmonary arterial hypertension (PAH) is a rare chronic and progressive disease characterized by increased pulmonary vascular resistance that can result in death due to right heart failure [2]. Numerous available treatments for PAH [3] have been evaluated in clinical trials using a variety of endpoints [4,5,6]. In the past two decades, PAH study design and duration shifted from short-term trials assessing noninvasive endpoints to long-term event-driven trials [7, 8].

Six-minute walk distance (6MWD), functional class (FC), and indicators of right ventricular function (i.e., brain natriuretic peptide [BNP]/the active N-terminal cleavage product of its prohormone [NT-proBNP], described in Table 1) are among the commonly used short-term primary and secondary noninvasive endpoints in clinical PAH trials [9, 10]. In a literature review examining 126 pulmonary hypertension (PH) clinical trials (78% in PAH) from 1985 to 2013, surrogate measures were primary endpoints in 95% of trials and secondary endpoints in 33% of trials [9]. 6MWD and FC were among the noninvasive endpoints that were used significantly more frequently (P < 0.0001) [9]. The latest 2015 European Society of Cardiology/European Respiratory Society treatment guidelines [11] and registry studies such as COMPERA [12] and REVEAL [13] also include 6MWD, FC, and BNP/NT-proBNP as important components of risk assessment. Collectively, 6MWD, FC, and BNP/NT-proBNP serve as measurable prognostic indicators of the distal outcomes of morbidity and mortality that may be assessed early in order to determine treatment course and improve outcomes [14, 15]. However, the clinical relevance and ability of these noninvasive endpoints to consistently correlate with key indicators of disease progression, such as hospitalization and death, has received mixed support [16,17,18,19,20,21,22,23,24,25]. This, coupled with improvements in survival and quality of life that have allowed recent clinical trials to follow patients with PAH for 4–6 years [7], has led to the prominence of mortality and morbidity as endpoints.

Table 1 Description of noninvasive endpoints

Morbidity and mortality, a term reflecting clinical worsening and disease progression [16, 23, 26], provide a robust demonstration of efficacy, safety, and long-term benefits of treatments for PAH [23]. The use of clinical worsening or disease progression as a primary endpoint in phase 3 trials was endorsed by the Task Force on End Points and Clinical Trial Design of both the Fourth and Fifth World Symposium on Pulmonary Hypertension and the 2008 Dana Point Task Force on End Points and Clinical Trial Design [16]. However, comparison of treatment efficacy across trials may be hindered by the varying definitions used for clinical worsening, as seen when comparing the definitions used in the AMBITION [27], GRIPHON [28], SERAPHIN [29], and FREEDOM EV [30,31,32] clinical trials. While it has been argued that the composite endpoint of clinical worsening is more clinically meaningful than noninvasive endpoints [19, 25], all four trials include 6MWD and FC in their definitions of clinical worsening, despite other differences. Additional components in the definition of clinical worsening, such as death, hospitalization, and lung transplant, require long-term follow-up to assess, and despite clearly indicating clear and undisputable indicators of ultimate treatment efficacy and safety, they cannot be used to assess clinical risk in the day-to-day care of patients with PAH that guide treatment decisions.

There is a need to revisit the optimal duration of future trials [7] and include clinically meaningful endpoints that reflect how patients feel and function [33]. In addition, the use of universal endpoints in PAH clinical trials and observational studies would better inform health care providers, decision makers, and payers on the value of targeted pharmacotherapies and combination therapies for patients with PAH [16]. 6MWD, FC, and BNP/NT-proBNP are universal endpoints, routinely used in clinical risk assessment, that can be assessed short-term (12–16 weeks [23]). The present review examines the value of 6MWD, FC, and BNP/NT-proBNP by summarizing the literature supporting the relationship between these noninvasive endpoints and long-term clinical and economic outcomes.

Methods

A literature review was conducted on April 13, 2018, in PubMed, Embase, and the Cochrane Database of Systematic Reviews using a search strategy (Table 2) that included Medical Subject Headings (MeSH) and key words for disease (e.g., pulmonary arterial hypertension), endpoints (e.g., 6MWD, FC, BNP), clinical importance (e.g., survival, mortality), and economic importance (e.g., costs, readmission, economics). Inclusion criteria incorporated studies published after January 1, 1997, in English and human subjects; comments, letters, or editorials were excluded. Bibliographies of relevant review articles were reviewed for any pertinent articles unidentified in the original search. To obtain information from relevant unpublished studies, a search of 2016–2017 conference abstracts via Embase was performed, including the American Thoracic Society International Conference, American College of Chest Physicians (CHEST) Annual Meeting, CHEST World Congress Annual Meeting, and the International Society for Pharmacoeconomics and Outcomes Research International Meeting. Titles and abstracts of records were reviewed (level 1 screening) according to the objectives and inclusion and exclusion criteria. Included studies, defined using PICOS (population, intervention, comparison, outcome, study type), had the following:

  • A primarily adult population (≥ 18 years) with PAH from a WHO group 1 etiology

  • Any intervention or comparator

  • At least one of the noninvasive endpoints—6MWD, FC, BNP/NT-proBNP (Table 1)—and reported on the relationship between a noninvasive endpoint and a clinical or economic outcome of interest (note that the literature was searched for individual associations between each of the endpoints and PAH outcomes)

  • An interventional (e.g., randomized controlled trials) or noninterventional (e.g., observational, prospective, retrospective, database, and/or registry studies) design, with ≥ 75 patients, or a relevant literature review.

Table 2 PubMed search strategy (search conducted April 13, 2018)

Full texts of included studies were reviewed (level 2 screening) using the same relevance criteria applied at level 1. That is, full-text articles were reviewed in detail, and the inclusion and exclusion criteria applied at level 1 (title/abstract screening) were applied to evaluate the appropriateness for inclusion.

Results

Literature Review

Figure 1 summarizes the literature review, which identified 836 unique studies for level 1 screening, including 64 conference abstracts and 1 study identified through desktop research. Of these, 65 records were selected for level 2 screening; no conference abstracts were deemed eligible for inclusion. Twenty-five studies, summarized in Table 3, were selected for inclusion according to predefined inclusion/exclusion criteria. Although outcomes, such as risk of death or the combined endpoint of risk of death or lung transplant, were consistently defined across the literature, clinical worsening was defined differently in each of the three studies evaluating it as an outcome [2, 34, 35]; definitions used are noted in Table 3.

Fig. 1
figure 1

Literature review flow diagram

Table 3 Description of literature evaluating relationships between noninvasive endpoints and health outcomes

6MWD

Thirteen studies evaluated the relationship between 6MWD and death (n = 10), death or lung transplant (n = 3), PAH-related death or hospitalization (n = 1), and risk of experiencing a costly event indicative of clinical worsening (n = 2) (Table 4).

Table 4 Summary of relationships between noninvasive endpoints (6MWD, FC, BNP/NT-proBNP, and risk groups) and outcomes in the literature

6MWD and Risk of Death

Risk of death was increased among patients with 6MWD that was shorter [2, 36], below the median (< 380 m) [2], or in lower quartiles at baseline (reference: Q1 ≤ 300 m) [37]. Specifically, risk of death at 1 year was increased among patients with shorter 6MWD [38] or 6MWD < 165 m at baseline [38, 39] and decreased among patients with 6MWD ≥ 440 m at baseline [39]. Risk of death within 2 years was increased among patients with 6MWD that was shorter or in lower quartiles at baseline [40]. Further, 6MWD ≤ 250 m at baseline was independently associated with an increased risk of patients dying within 2 years relative to patients surviving ≥ 5 years [41]. Risk of death was also increased at 3 years [42] and 4 years [43] among patients with shorter 6MWD at baseline.

At follow-up, risk of death was increased among patients with 6MWD that was shorter [2], in lower quartiles (reference: Q1 ≤ 348 m) [37], or below the median (defined as ≤ 400 m [37] or < 418 m [2]). Within 1 year, risk of death was increased among patients with 6MWD that was < 165 m [38]. Risk of death within 2 years was increased among patients with 6MWD that was shorter or in lower quartiles at 12-week assessment [40]. At 3 years, risk of death was increased among patients with smaller improvements in 6MWD between baseline and the 12-week follow-up compared with patients with ≥ 20 m increases in 6MWD [44].

6MWD and Risk of Death or Lung Transplant

Risk of death or lung transplant was increased among patients with 6MWD that was shorter [34, 45] or below cutoff at baseline (≤ 342 m) [34]. Risk of death or lung transplant at 5 years was increased among patients with shorter baseline 6MWD [46].

At follow-up, risk of death or lung transplant was higher among patients with 6MWD that was shorter [45] or declined ≥ 35 m or ≥ 8% [34].

6MWD and Risk of PAH-Related Death or Hospitalization

Risk of PAH-related death or hospitalization was increased among patients with 6MWD in lower quartiles at baseline (reference: Q1 ≤ 300 m) [37]. At follow-up, risk was increased among patients with 6MWD in lower quartiles or below the median (≤ 400 m) [37].

6MWD and Risk of Clinical Worsening

Risk of clinical worsening increased among patients with 6MWD that was shorter [2, 34] or ≤ 342 m at baseline [34]. Risk was higher among patients with shorter [2] or decreasing 6MWD at follow-up [2, 34].

FC

Sixteen studies evaluated the relationship between FC and death (n = 9), death or lung transplant (n = 4), experiencing a costly event indicative of clinical worsening (n = 3), using health care resources (n = 2), and incurring health care costs (n = 1) (Table 4).

FC and Risk of Death

Patients with an increased risk of death at follow-up had more severe baseline FC (III/IV) [2, 36, 47], with highest risk of death within 1 year specifically among FC IV patients [39]. Further, FC IV symptomatology at baseline was independently associated with an increased likelihood of reduced survival (dying within 2 years) in univariate and multivariate analysis excluding initial PAH therapy [41]. Risk of death within 3 years was significantly higher for patients with more severe FC (III/IV) at baseline [42, 44]. Within 5 years, risk of death was higher for FC III or IV patients at baseline and increased 69% per class [48].

Risk of death was also predicted by more severe FC (III/IV) at follow-up [2]. Patients who improved at least one FC from baseline to follow-up had a similar risk of death compared with patients whose FC did not improve, but patients who improved from FC III/IV to I/II at follow-up had a reduced risk of death compared with patients who remained in FC III/IV at both time points [2]. Risk of death at 3 years was higher for patients whose FC worsened or remained unchanged compared with those whose FC improved within 1 year of enrollment [49].

FC and Risk of Death or Lung Transplant

For the combined endpoint of risk of death or lung transplant, risk was higher among patients with more severe FC (III/IV) at baseline in studies examining FC [35, 45]. Risk of death or lung transplant within 5 years was higher for patients with more severe FC (III/IV) at baseline [46].

Risk of death or lung transplant was also predicted by more severe FC (III/IV) [45] at follow-up and changes in FC between baseline and follow-up. Risk of death or lung transplant within 5 years was increased for patients whose FC remained III or IV or increased to III/IV during follow-up compared with patients who remained at FC I/II and patients who improved from FC III/IV to I/II [46]. Within 7 years, risk of death or lung transplant similarly increased for patients with stable or deteriorated FC compared with patients whose FC improved [50].

FC and Risk of Clinical Worsening

Risk of clinical worsening was increased among patients with more severe FC (III/IV) at baseline [2, 34, 35]. More severe FC (III/IV) at follow-up or FC that worsened (from I/II to III/IV) between baseline and follow-up also predicted clinical worsening [2].

BNP/NT-proBNP

Nine studies evaluated the relationship between BNP/NT-proBNP and risk of death (n = 7), experiencing a costly event indicative of clinical worsening (n = 2), and death or lung transplant (n = 2) (Table 4). The frequency of use of BNP (four studies) and NT-proBNP (five studies) was similar in the literature, with no clear temporal differences by year of publication or study start date.

BNP/NT-proBNP and Risk of Death

Risk of death was increased for patients with higher BNP [36] or NT-proBNP at baseline [2, 51] or NT-proBNP above the median (≥ 467 pg/mL) [2] or cutoff (>1256 pg/mL) at baseline [51]. Risk of death at 1 year was increased for patients with BNP higher than the threshold (> 180 pg/mL) at baseline [39]. Risk of death at 2 years was increased among patients with higher BNP at baseline [40]. At 4 years, risk of death was increased among patients with NT-proBNP > 704.5 pg/mL at baseline [43]. Increased risk of death at 5 years was associated with baseline levels of BNP > 340 pg/mL [52].

Risk of death also increased for patients whose NT-proBNP was higher [2], greater than the median (≥ 268 pg/mL) [2], or increased between baseline and follow-up [2, 51]. Risk of death at 2 years, in particular, was increased among patients with elevated BNP at the week 12 assessment [40]. Risk of death at 5 years was highest among patients whose BNP remained high (> 340 pg/mL), followed by patients with increased BNP at 1 year of follow-up; patients whose BNP decreased or remained low (≤ 340 pg/mL) at 1 year of follow-up had the lowest risk [52].

NT-proBNP and Risk of Death or Lung Transplant

Risk of death or lung transplant was increased for patients with elevated NT-proBNP at baseline [35, 46]. An optimal cutoff of NT-proBNP > 1105.5 pg/mL at baseline predicted risk of all-cause death or lung transplant [35]. Risk of death at 1, 3, and 5 years was increased among patients whose NT-proBNP remained high or increased to ≥ 1800 ng/L at follow-up [46].

NT-proBNP and Risk of Clinical Worsening

Risk of clinical worsening was increased among patients with NT-proBNP that was elevated [2, 35] or greater than the median (≥ 467 pg/mL) at baseline [2]. Risk of clinical worsening was also increased among patients with elevated or increased NT-proBNP at follow-up [2].

FC and Economic Outcomes

Mean total health care costs for patients with PAH were higher than costs for a Centers for Medicare and Medicaid Services managed care control group and increased with more severe baseline FC, with patients in FC IV having the highest costs [6]. Health care resource utilization, including inpatient admissions, longer average lengths of stay, and emergency department visits, was also greater for patients with FC IV than other FC subgroups [6]. In another study, patients with severe PAH who received medical therapy alone were more likely to have more severe FC (III/IV), which was associated with greater health care resource utilization compared with patients who received exercise training plus medical therapy [53].

6MWD, FC, and BNP/NT-proBNP Risk Groups

Three studies evaluated the relationship between risk group and death within 5 years (n = 2) and death or lung transplant (n = 1) (Table 4). The 2015 European Society of Cardiology (ESC)/European Respiratory Society (ERS) guidelines were used to stratify patients as low, intermediate, or high risk [12, 54, 55]. All three studies considered FC and 6MWD when determining risk [12, 54, 55] (Table 4). Two considered BNP/NT-proBNP in the primary determination of risk [12, 55], while one study considered the additive value of BNP < 50 ng L−1 or NT-proBNP < 300 ng L−1 low-risk criteria [54]. Additional factors considered in determining risk included right atrial pressure, cardiac index, mixed venous oxygen saturation [12, 54, 55], right atrial area, and pericardial effusion [55]. Risk was determined by a mean score calculated using the sum of the grades assigned to each risk factor from 1 (low risk) to 3 (high risk) and dividing by the total number of risk factors [12, 55] and by the number of low-risk criteria present at baseline and re-evaluation (FC I-II, 6MWD > 440 m, right atrial pressure < 8 mm Hg and cardiac index ≥ 2.5 min−1 m−2) [54].

Risk Groups and Risk of Death

Risk of death within 5 years was increased among patients with a higher proportion of “high-risk” variables at both baseline and follow-up, determined with the inclusion of 6MWD, FC, and NT-proBNP [55]. 6MWD followed by FC and BNP/NT-proBNP most strongly correlated with a patient’s risk of death within 5 years [12].

Risk Groups and Risk of Death or Lung Transplant

At baseline, all four low-risk criteria significantly predicted risk of death or lung transplant in univariable analysis; 6MWD > 440 m was the only low-risk criterion remaining significant in multivariable analysis [54]. At first re-evaluation, all four low-risk criteria significantly predicted risk of death or lung transplant in univariate and multivariable analysis [54]. Outcomes were similar for patients who had an increase in the number of low-risk criteria achieved between baseline and first re-evaluation (< 3 to having 3–4) and those who had 3–4 low-risk variables at both time points. Patients with less than three low-risk criteria at both baseline and first revaluation had the greatest risk of death or lung transplant; for patients with zero low-risk variables at follow-up, transplant-free survival was worse for those with two high-risk variables than for those with one. In the subgroup of patients who had all three noninvasive measurements at follow-up, risk of death or lung transplant was significantly lower for patients who achieved one or more low-risk criteria (FC I-II, 6MWD > 440 m, BNP < 50 ng L−1); in this multivariable model, hemodynamic low-risk criteria (right atrial pressure < 8 mm Hg and cardiac index ≥ 2.5 min−1 m−2) were no longer significant predictors of transplant-free survival [54].

Discussion

This review provides support for 6MWD, FC, and BNP/NT-proBNP as correlates of risk of long-term health outcomes (e.g., mortality and clinical worsening), costly events (e.g., lung transplants or hospitalization), and economic outcomes (e.g., costs and resource utilization) in PAH. Relative to patients with longer or increased 6MWD, patients with shorter or decreased 6MWD have a higher risk of death and experiencing costly events indicative of clinical worsening, such as hospitalization or lung transplant. Compared to patients with more favorable (I or II) or improved FC, patients with poorer (III or IV) or declined FC consume greater health care resources, incur higher health care costs, and are at an increased risk of death and clinical worsening. Patients with elevated or increased BNP/NT-proBNP have a higher risk of death and clinical worsening relative to patients with lower or decreased BNP/NT-proBNP. In addition, patients classified into more severe risk groups, determined by multiple noninvasive endpoints (e.g., 6MWD, FC, and BNP/NT-proBNP), have an increased risk of death and lung transplant.

These findings are important considering the current shift from using these noninvasive measures as primary and secondary endpoints in PAH clinical trials to evaluating mortality and clinical worsening in clinical trials. A related review has also concluded that 6MWD and FC are clinically meaningful trial endpoints associated with outcomes in patients with PAH and CTEPH [16]. While mortality and clinical worsening are robust and valuable long-term endpoints in clinical trials, 6MWD, FC, and BNP/NT-proBNP remain informative prognostic indicators that can be used in the clinic to assess and reduce risk of death or costly events for patients with PAH [15]. Whereas multiple definitions are used for clinical worsening across trials, 6MWD, FC, and BNP/NT-proBNP are universally defined and recognized, enabling comparisons of treatment efficacy across trials using these endpoints. 6MWD and FC are already being included in the composite endpoint of clinical worsening in many trials and in risk calculators such as REVEAL 2.0 [13]. BNP/NT-proBNP is also included in risk calculators [13] and is a prominent biomarker used in clinical management [56, 57] as well as part of the multiparametric risk assessment approach outlined in guidelines [58]. These noninvasive endpoints are simpler and more inexpensive to assess relative to event-based endpoints [14]. For instance, morbidity and mortality event trials require a large number of subjects in order to demonstrate effects even as early as 1 year [59, 60]. Looking to the future, consideration should be given to including 6MWD, FC, and BNP/NT-proBNP in the assessment of time to clinical improvement, a new endpoint proposed at the recent 6th World Symposium on Pulmonary Hypertension [61].

Conclusions from the present review should be considered with caution because no pooled analysis was performed to evaluate the observed relationships between the noninvasive endpoints and long-term outcomes for statistical significance. Previous meta-analyses and pooled analyses for 6MWD have found no correlation between 6MWD and mortality [21] and clinical events [22] or only modest validity for 6MWD as a surrogate endpoint [18]. Given that the majority of studies evaluating 6MWD included in this review were published after these studies were conducted (10 of 13 studies), an updated analysis is needed in order to confirm our observations. In addition, the paucity of literature evaluating health care resource utilization and costs in PAH limits the ability to draw conclusions on the predictive nature of noninvasive endpoints in terms of economic outcomes. However, given what is known about the relationships between noninvasive endpoints and events, such as hospitalization and lung transplant, the economic impact may be extrapolated using existing data on the costs of those events. Further, our review did not consider the predictive value of additional variables that are considered in risk assessments such as cardiopulmonary exercise testing, imaging (e.g., echocardiogram, cardiac magnetic resonance imaging), hemodynamics, and right heart catherization [13, 56]. While these variables are important, our review focused on the noninvasive endpoints most commonly used in clinical trials and risk assessments that are most efficient and inexpensive to assess.

Conclusions

Mortality and clinical worsening will continue to be valuable endpoints in assessing treatment efficacy and safety in PAH; however, these endpoints require lengthy follow-up and cannot be applied in clinical settings where risk reduction is the goal of treatment. Further, differing definitions used for clinical worsening across trials limit the ability of stakeholders to compare treatment efficacy. 6MWD, FC, and BNP/NT-proBNP are universally defined, low-cost, efficient, noninvasive endpoints that correlate with long-term health and economic outcomes in patients with PAH. Collectively, they are important components of risk assessments and will remain beneficial in the clinic to guide treatment decisions. Future research should meta-analytically examine the relationships between these noninvasive endpoints and long-term outcomes for statistical significance. Additional studies are needed examining the relationship between 6MWD, FC, and BNP/NT-proBNP and economic outcomes and the potential utility of 6MWD, FC, and BNP/NT-proBNP as a composite endpoint assessing risk.