Introduction

Heart failure (HF) imposes a substantial disease burden on affected individuals. Clinical symptoms impede daily functioning, and the impact of frequent hospitalizations, which exacerbate the progression of the disease, presents a significant challenge from the patient’s perspective while also diminishing their overall quality of life1,2,3. Despite notable advancements in HF treatment modalities, encompassing pharmacological interventions and implantable devices, hospitalizations remain pervasive. It is estimated that over 50% of HF patients experience at least one hospitalization within a year4. Congestion, the predominant cause of HF hospitalizations, arises from intricate mechanisms regulating the interplay between volume and intravascular pressure5,6. Thus, although the pulmonary artery pressure does not directly represent the total body volume, its changes are interpreted as some clues about the changing congestion status of HF patients6.

Following this simplification and understanding some caveats of that reasoning the continuous invasive monitoring of pulmonary artery pressure (PAP) is used as a surrogate of congestion status, to assess and treat changing congestion in ambulatory HF settings. PAP, thus is an easily measurable and convenient proxy for monitoring the fluid status in HF. Importantly, an increase in PAP precedes the appearance of clinical signs of congestion in physical examinations7.

Previous randomized controlled trials (RCTs) using two devices, CardioMEMS8,9,10,11 and Chronicle12,13, did not provide sufficient evidence of the efficacy. While the safety of using these devices has been confirmed, their effectiveness in preventing adverse clinical outcomes is still uncertain. Given the aforementioned context, we have decided to conduct a systematic review and meta-analysis of existing data regarding the efficacy of PAP monitoring devices in the population of HF patients.

Methods

A systematic review was conducted in adherence to the PRISMA checklist, consisting of 14 essential items14. The study protocol was preregistered with PROSPERO under the registration number CRD 42023449811. On the date of 12.07.2023, the initial screening and extraction of relevant papers were executed. Subsequently, two independent reviewers, namely M.G and K.F, evaluated the titles and abstracts, followed by a detailed screening of the selected full-text articles. Any disparities between the reviewers were resolved through discussion.

Search strategy

The initial review involved the screening of the following databases:

Academic Search Ultimate, ERIC, Health Source Nursing/Academic Edition, MEDLINE, Embase, Clinicaltrials.gov, Cochrane Library (trials).

No temporal constraints were imposed on the publication timeframe.

The details of the search including keywords are available in the supplement file (File S2).

Only studies with the status of completed, terminated, or unknown on clinicaltrials.gov were subjected to screening. Studies focusing on pediatric populations were excluded from the search. Subsequently, references and papers citing the initially included studies underwent screening for relevance. The screening process is illustrated in Fig. 1.

Figure 1
figure 1

Flowchart of results of the review process.

Inclusion and exclusion criteria

Only double-arm, randomized controlled trials (RCT)s, analysing devices estimating the PAP were deemed eligible for inclusion in the analytical phase of this study. Specifically, peer-reviewed papers presenting full-text articles in the English language were considered for inclusion. Notably, investigations conducted on animal models or within paediatric populations were deliberately excluded from the purview of this study. Studies assessing tools measuring other hemodynamic parameters e.g. left atrium pressure, were excluded from the analysis.

Data extraction and analysis

Two investigators, W.T and M.F, independently extracted pertinent data from the included studies, adhering to the pre-specified protocol. Preference was given to utilizing data obtained before the unmasking procedure in cases where both pre- and post-unblinding datasets were available. The extracted data is presented in Table 1.

Table 1 Characteristic of the included studies.

For the analysis of categorical variables, Mantel–Haenszel and Inverse Variance with random effects models were employed. Odds ratios (OR), rate ratios (RR), and hazard ratios (HR) with corresponding 95% confidence intervals were calculated for various outcomes. Odds ratios were computed for events where the count of patients experiencing occurrences was compared to the total patient count, such as all-cause and cardiovascular (CV) death. In instances where the potential number of events exceeded the total patient count, as observed in HF hospitalization, rate ratios were calculated and subsequently pooled according to the guidelines outlined in the Cochrane Handbook15. Hazard ratios were pooled if provided in the research manuscripts. Heterogeneity was assessed using the I2 statistic, with I2 values exceeding 50% considered notable.

Funnel plots, indicative of publication bias, were not generated due to the limited number of available papers16. Quantitative analysis utilized Review Manager version 5.4.1 (The Cochrane Collaboration, 11–13 Cavendish Square, London, W1G 0AN United Kingdom), while figures were created using Biorender. Language editing was conducted using ChatGPT.

The authors of the papers that presented the missing data were contacted to achieve all the necessary results.

Risk of bias and quality of evidence

The assessment of risk of bias was conducted utilizing the Cochrane Risk of Bias tool (version 2)17. Two independent reviewers, W.T and K.F, evaluated all included studies, and any discrepancies were resolved through discussion. A study was designated as high risk if any domain was classified as high risk or if some concerns were identified in multiple domains, substantially diminishing confidence in the study’s outcomes. Certainty of evidence underwent a similar evaluation process, employing the Grading of Recommendations Assessment, Development, and Evaluation (GRADE)18. GradePRO GDT (McMaster University and Evidence Prime Inc.) was employed for generating a Summary of Findings and Certainty of Evidence table.

Subgroup, sensitivity analysis, and meta-regression

We investigated the source of heterogeneity upon its attainment of statistical significance. Subgroup analyses which assessed the outcome in CardioMEMS and Chronicle devices separately and excluded the unblinded studies (MONITOR-HF), were performed. A sensitivity analysis was performed by excluding the largest study (GUIDE-HF) to assess the durability and resilience of the results.

We conducted a meta-regression to assess the influence of the year of publication and the duration of follow-up on the intervention’s effect.

Results

Characteristic of the studies

The outcomes of the review process are depicted in Fig. 1. This compilation of studies involved a combined cohort of 2572 participants, distributed with 1279 individuals in the intervention group and 1293 assigned to the control group. The average age of the participants was 61.3 years, and males constituted 69% of the overall study population. Notably, three of the investigations examined the utilization of the CardioMEMS device (n = 73.79% of the entire population), while the remaining two tested the Chronicle device (n = 26.21% of the total population). The systems employ distinct methodologies for obtaining PAP. The CardioMEMS system directly measures PAP, whereas the Chronicle system assesses a range of hemodynamic parameters, including right ventricular systolic and diastolic pressures and estimated pulmonary arterial diastolic pressure (ePAD). Notably, ePAD has demonstrated a high correlation with actual PAP across various physiological conditions19. Except for one study exclusively involving patients with HFrEF, all other studies included patients irrespective of their ejection fraction (EF). The range of symptom severity included NYHA classes II to IV.

In four of the studies, both study groups underwent device implantation before being randomized into PAP monitoring or control groups, while one study (MONITOR-HF) exclusively performed the procedure on the intervention group. Furthermore, the research design in four of these studies maintained single-blinded conditions, ensuring that patients remained unaware of their group allocation. Conversely, one study adopted an unmasked approach. The mean duration of follow-up across these investigations was 11.25 months. The characteristic of the included studies is displayed in Table 1. In the case of the GUIDE-HF study, where data was divided into pre-covid and overall analysis, the overall data was included.

Outcomes

Five outcomes were consistently documented in a minimum of three studies, including all-cause and cardiovascular (CV) mortality, heart failure (HF) hospitalizations, urgent HF visits, and the composite endpoint of HF hospitalizations and urgent visits termed HF events. A quantitative analysis of these parameters was conducted, and corresponding forest plots illustrating the outcomes are presented in Fig. 2. A summary of the outcomes across the studies is provided in Table 2.

Figure 2
figure 2

Forrest plot of the PAP monitoring versus control for the selected outcomes. CV cardiovascular, HF heart failure, PAP pulmonary artery pressure.

Table 2 Summary of findings of the included studies Number of events are displayed.

All-cause mortality

The all-cause mortality was reported in every study. The death occurred in the 152 (12%) patients in the study group and in the 166 (13%) in the control group. The PAP monitoring systems did not show a reduction in all-cause mortality (OR 0.91 [95% CI 0.72–1.16], p = 0.46). No significant heterogeneity was found.

CV-death

Cardiovascular death was assessed in 3 studies, 95 (10%) and 103 (11%) events occurred in the study and control arm, respectively. No significant difference in the CV death was reported (OR 0.92 [95% CI 0.68–1.25], p = 0.61), nor relevant heterogeneity was noticed.

HF-hospitalization

Heart failure-related hospitalization was evaluated in every study and occurred 611 (0.48 events per patient) versus 873 (0.68 events per patient) times in the study and control group respectively. The PAP monitoring reduced the HF-hospitalization rate (RR 0.72 [95% CI 0.6–0.87], p = 0.0006). Hazard ratio analyses, reflecting time-to-event outcomes, were conducted in four studies. The PAP monitoring reduced the risk of HF-hospitalization (HR 0.67 [95% CI 058–0.76], p < 0.00001).

Significant heterogeneity was noticed in the RR analysis (I2 = 68%). The results retained their significance in the subgroup analysis of CardioMEMS alone (RR 0.66 [95% CI 0.52–0.85], p = 0.001) and in studies with a blinded design (RR 0.77 [95% CI 0.66–0.90], p = 0.001). However, in studies that utilized the Chronicle device, there was no significant difference between the groups (RR 0.84 [95% CI 0.68–1.04], p = 0.1). Furthermore, even after excluding the largest study, GUIDE-HF, for the sensitivity analysis, the findings remained statistically significant (RR 0.69 [95% CI 0.56–0.86], p = 0.0008). Heterogeneity analysis results are displayed in Fig. 3.

Figure 3
figure 3

Heterogeneity analysis of the effect of PAP monitoring on HF-hospitalizations. PAP pulmonary artery pressure.

The metaregression revealed a significant association between the effect of PAP monitoring on HF hospitalization and the length of the follow-up (β = − 0.03, 95% CI = − 0.06 to 0.00, p = 0.039), but not with the year of the publication (β = − 0.01, 95% CI = − 0.05 to 0.02, p = 0.51).

HF events (composite of the HF hospitalizations and urgent HF visits)

The HF-events were reported in 3 studies and occurred 388 (0.47 events per patient) and 441 (0.52 events per patient) times in the study and control group respectively. The PAP monitoring significantly reduced the HF events rate in the intervention group (RR 0.86 [95% CI 0.75–0.99], p = 0.03). No significant heterogeneity was observed.

HF urgent visits

HF-related urgent visits were documented in four studies, with 63 events (0.06 events per patient) in the intervention group and 65 events (0.06 events per patient) in the control group. No statistical significance was observed on that field (RR 0.97 [95% CI 0.69–1.38], p = 0.88). No meaningful heterogeneity was observed.

Risk of bias and certainty of evidence

The results of the ROB assessment are shown in Fig. 4. The ROB was assessed as high, low or some concerns and the certainty of evidence was low or moderate depending on the outcome (Table 3).

Figure 4
figure 4

Risk of Bias in the included studies.

Table 3 Summary of findings and certainity of evidence table.

Discussion

In our meta-analysis, we have evaluated the value of continuous remote PAP monitoring in the HF population. The data comes from, the 5 RCTs analysing 2 different devices measuring the PAP. The principal findings of our analysis indicate that treatment based on these devices exhibited a statistically significant reduction in HF-related hospitalizations and HF events, albeit without a significant impact on overall mortality. Notably, this association remained statistically significant in the aggregate analysis of data from the CardioMEMS device, but not in the case of the Chronicle device. Importantly, this significance persisted even after we excluded the largest study within our dataset, (the GUIDE-HF trial). Furthermore, our findings were consistent regardless of the publication date of the studies and revealed a noteworthy association between the duration of follow-up and the extent of reduction in heart failure (HF) hospitalizations within the device arm. Specifically, studies with a more extended observation period demonstrated a more pronounced reduction in HF-related hospitalizations. This meta-analysis represents the most comprehensive examination to date, comprising a substantial cohort of 2572 participants. It encompasses a comprehensive review of all existing data regarding PAP estimating devices in the context of HF populations, drawing exclusively from RCTs.

The design of the included studies varied considerably. Notably, MONITOR-HF was the sole unblinded study, potentially influencing outcomes, particularly self-reported measures such as quality of life20. The impact of the COVID-19 pandemic on the outcomes of PAP monitoring warrants further exploration. Notably, the GUIDE HF trial has published analyses both pre-pandemic and during the pandemic. Enrollment for this trial concluded in 2019, and by the onset of the pandemic lockdown, 72% of the follow-up was complete, with 42% of patients having finished. Pre-pandemic analyses indicated positive outcomes, which lost significance during the pandemic. This change was primarily due to a reduction in events, especially heart failure-related hospitalizations and visits, in the control group, with no significant changes in the intervention arm. The pandemic also led to a decrease in cardiovascular events across multiple RCTs21. This reduction in heart failure-related hospitalizations could be attributed to patients’ reluctance to seek hospital care due to fear of COVID-19 infection and a greater focus on health maintenance, potentially improving adherence22.

The recent meta-analysis regarding the topic was published simultaneously with the last RCT—MONITOR HF23. The analysis evaluated the effect of the CardioMEMs devices on the selected outcomes and revealed a significant reduction of the: HF hospitalizations, the composite of HF hospitalizations, urgent visits and all-cause mortality, composite of HF hospitalizations and all-cause mortality and composite of HF-hospitalization and urgent HF visits. No significant interactions in the analysed subgroups were found, and no metaregression was performed.

The similar meta-analysis also analysed pooled results of studies regarding CardioMEMS and Chronicle device24. Laconelli et al evaluated the results of CHAMPION, COMPASS-HF, GUIDE-HF, and REDUCEhf trials. The use of PAP monitoring devices significantly reduced HF-hospitalizations, and did not affect all-cause mortality. No data regarding the subgroups or meta-regression were published. Our results complement the previous data with the information coming from the studies analysing all the available devices for PAP monitoring. The differences in the number of included studies stem from the fact that, compared to the study by Clephas et al., our meta-analysis, except the results from the CardioMEMs trials, incorporates results from studies evaluating the effectiveness of the Chronicle system. Additionally, unlike the analysis by Laconelli et al., our work also includes results from the MONITOR HF study, which was not available at the time of their analysis. In summary, the data from PAP monitoring and therapeutic interventions translate into a reduction in the risk of HF hospitalization. Interestingly, this does not appear to correlate with a reduction in mortality risk, which may stem from several factors. Firstly, it is challenging to expect that monitoring alone would reduce any endpoint because it merely provides information. The crucial factor is what actions the physician takes based on this information. So, if the primary intervention aimed at reducing PAP is an increase in diuretic dosage, as was the case in the included studies, it may not translate into a better prognosis, as we know that diuretics do not themselves improve prognosis but rather alleviate clinical symptoms of congestion. As demonstrated in the STRONG-HF study, the optimization of neurohormonal blockade proves to be a safe strategy, irrespective of the patient’s age, comorbidities, or natriuretic peptide level25,26,27,28. Therefore, the development of a response algorithm to elevated PAP based not only on the escalation of the diuretics but rather, on the optimization of the guideline-directed medical therapy (GDMT) i.e. renin–angiotensin–aldosterone system (RAAS) blockade and sodium–glucose cotransporter 2 (SGLT-2) inhibitors, both of which possess decongestion-promoting properties, including indirect reductions in PAP, might yield more favourable outcomes29,30,31. Furthermore, the absence of a positive effect on mortality may be attributed to the characteristics of the population included in the analysed trials, wherein a substantial proportion consisted of patients with HFpEF. Given that individuals with HFpEF frequently experience non-cardiovascular mortality32, demonstrating a mortality reduction with a device in this particular population proves challenging.

Although the results of PAP monitoring remained consistent across subgroups in the analysis by Clephas et al., the device appears to yield more favourable outcomes in the most vulnerable patient populations, such as elderly patients with NYHA class III symptoms, which aligns with intuition, as this group would inherently have a higher baseline hospitalization rate. Ultimately, the most appropriate and potentially beneficial patient population for PAP monitoring remains to be determined, and further research is needed to clarify its role in clinical practice. Subgroup analysis of specific populations in studies involving the Chronicle device was not conducted due to the absence of reported data. As a result, any additional subgroup analyses would have been limited to data from the CardioMEMS, thus duplicating the analysis previously published by Clephas et al. Our distinctive contributions, apart from the exclusive analysis of the Chronicle device, include analyses confined to blinded studies and a sensitivity analysis that excludes the largest study (Fig. 3). Notably, the positive effect persisted as significant in both analyses, which reinforces the reliability of our findings.

Novel devices designed for monitoring PAP are presently under scrutiny33, with one particularly promising solution being the Cordella. Apart from assessing hemodynamic parameters, the device acquires and transmits the vitals. Notably, the Cordella stands out as the inaugural device that permits and encourages patients to self-monitor PAP and autonomously adjust therapy. Prior endeavors involving the incorporation of patients into self-management based on hemodynamic parameters, as conducted in the LAPTOP-HF trial, yielded promising results34. Preliminary findings from the SIRONA-2 trial, focused on assessing the safety and efficacy of the Cordella device, exhibit a favorable outlook35,36,37.

There is a possibility that optimizing the results could be achieved by extending the duration of follow-up. Our findings suggest that the favourable impact of PAP monitoring may become more prominent with longer observation periods. Therefore, it is advisable to conduct further studies that include a more carefully selected patient population, potentially those at a higher risk of hospitalizations, extend the pharmacological intervention beyond the diuretic escalation, and prolong the blinded observation period. Such investigations are warranted to gain a deeper understanding of the benefits of PAP monitoring.

Notably, in the largest study (GUIDE-HF), the control group received bi-weekly calls from healthcare providers, and the outcomes did not show significant differences between the groups. Patients in the control group, often receive less frequent follow-up, which may explain the difference in events. Importantly, there is substantial evidence supporting the effectiveness of non-invasive telemonitoring systems for heart failure (HF) patients, which have demonstrated their efficacy in reducing more durable outcomes, including all-cause mortality38. Conversely, the landscape of telemonitoring methods remains heterogeneous, marked by instances of inefficacious telemonitoring approaches39,40,41,42. Consequently, additional trials are imperative to systematically assess and compare the impact of invasive hemodynamic monitoring against specifically chosen non-invasive telemonitoring systems, which have demonstrated proven efficacy.

Limitations

Our study is not devoid of limitations. Firstly, we were restricted to utilizing only aggregated data from the trials, and conducting a patient-level analysis could potentially provide more in-depth and nuanced conclusions. It is worth noting that all the trials were conducted in the United States and the Netherlands, which are both highly developed countries with relatively robust healthcare systems. The trails were not powered to detect the differences in mortality. Furthermore, it is important to acknowledge that the physicians were not blinded to the allocation of patients, which inherently introduces a vulnerability to performance bias, The risk of bias remains significant, even though it is challenging to conceive a study design in which the physician would not have access to data from the implanted sensor43. Finally, the Chronicle device is presently obsolete and lacks FDA approval. Additionally, it does not directly measure the PAP but provides estimates. This meta-analysis sought to assess the impact of remote PAP measurement-driven therapy on outcomes, focusing on the method rather than the specific device.

In conclusion, our meta-analysis contributes additional evidence regarding the potential efficacy of PAP monitoring devices in reducing HF hospitalizations. However, it is imperative to emphasize that the clinical impact of this procedure warrants further comprehensive evaluation and scrutiny. Further research is needed to better understand the true clinical significance and optimal application of PAP monitoring devices in the management of heart failure.