FormalPara Key Summary Points

Why carry out this study?

Three-drug antiretroviral regimens (3DR) have been the standard of care since 1996, but two-drug regimens (2DR) have been developed to avoid short- and long-term toxicities and reduce costs

This study compared outcomes in a large cohort of People living with HIV (PLHIV) who switched to integrase inhibitor-based 3DR or 2DR of dolutegravir + rilpivirine or dolutegravir + lamivudine

What was learned from the study?

Patients who switched to the dolutegravir-based 2DR were at greater risk of discontinuing treatment because of treatment failure and virologic failure than those who switched to integrase inhibitor-based 3DR

There was no difference between the two groups in the risk of discontinuing treatment because of adverse events

Introduction

HIV infection remains a major cause of morbidity and mortality worldwide; in its latest update, the World Health Organization reported 1.7 million new cases and 690,000 deaths during 2019 [1]. Without antiretroviral therapy (ART) most patients with HIV will show evidence of disease progression within 8–10 years, and nearly half will die within 2 years of the onset of AIDS [2, 3]. ART completely inhibits HIV replication, which is the absolute driver of HIV-disease pathogenesis, thereby transforming HIV to a chronic and manageable condition with which affected individuals may live in good health.

Regimens for ART evolved in the mid-years of the past century from single-drug regimens to three-drug regimens (3DR) since trials conducted in the early 1990s showed little benefit of two-drug regimens (2DR) compared with single-drug regimens [4, 5]. Since 1996, the standard of care for ART has been 3DR consisting of a backbone of two nucleoside analogue reverse-transcriptase inhibitors (NRTI) combined with a third drug [6,7,8,9]. However, long-term toxicities associated with the NRTIs tenofovir disoproxil fumarate (TDF) [10, 11] and abacavir (ABC) [12], as well as cost-related factors, have led to the development and evaluation of treatment options that can help people living with HIV (PLHIV) achieve optimal long-term health. One such option includes a variety of 2DR.

Most earlier clinical trials with older 2DR showed a higher incidence of virologic failure (VF) compared with 3DR [13,14,15,16,17]. However, since 2018 several randomized controlled clinical trials have demonstrated non-inferiority of 2DR consisting of dolutegravir (DTG) plus either rilpivirine (RPV) or lamivudine (3TC) compared with 3DR in patients without prior virological failure. The SWORD 1 and 2 trials showed that DTG + RPV was non-inferior to a 3DR for maintenance of virologic suppression [18]; the GEMINI 1 and 2 trials showed that DTG + 3TC was non-inferior to 3DR of DTG + TDF/emtricitabine (FTC) in ART-naïve adults [19]; and the TANGO trial showed that switching to DTG/3TC was non-inferior to remaining on a three- or four-drug tenofovir alafenamide (TAF)-based regimen [20]. Based on the aforementioned trials, some guidelines recommend a 2DR of DTG + 3TC as an initial treatment option for PLHIV meeting specific criteria and both DTG + 3TC and DTG + RPV as recommended options for PLHIV who switch regimens [7, 8]. Currently, 2DR are not recommended for rapid start, which is increasingly becoming the standard of care globally.

Selection of patients for HIV clinical trials can be stringent. The SWORD, TANGO, and GEMINI trials excluded patients with HBV co-infection and mutations associated with known drug resistance to any of the major drug classes. For example, 12% of potential enrolees in the GEMINI trial were excluded because of evidence of a pre-existing resistance mutation [19]. Previously treated patients in SWORD and TANGO were stably suppressed and had no history of VF [18, 20]. In the real-world, PLHIV may have comorbidities, co-infections, resistance mutations, and a range of viral loads and CD4 counts, which makes applying conclusions from clinical trials to real-world settings problematic [21]. Furthermore, adherence in the real world is often below 80% [22,23,24], whereas adherence in clinical trials is often at least 95% [25, 26]. Adherence has a direct impact on VF [27, 28], although evidence suggests that adherence of at least 80% to contemporary 3DR may be sufficient to maintain viral suppression [29, 30]. It is unknown whether this applies to DTG-containing 2DR.

In a previous study in the Spanish VACH cohort, we found that the risk of VF as reported in the database by the clinician (with or without a reason) was at least two times higher with protease inhibitor- or DTG-based 2DR vs integrase strand transfer inhibitor (INSTI)-based 3DR [31]. Furthermore, there was no difference in discontinuations due to adverse events (AEs) between the 2DR and 3DR groups. The current study updates the previous VACH 2DR versus 3DR analysis to include a more recent period of data after the availability of elvitegravir (EVG)/cobicistat (c)/TAF/FTC in Spain, a larger number of patients, and by focusing on the 2DR of most clinical relevance in the current landscape (DTG + RPV and DTG + 3TC) [7, 8]. It should be noted that at the time of the analysis, the single-tablet coformulations of those 2DR were not available. Therefore, all patients in the analysis on 2DR were on two-tablet regimens and not on the single-tablet 2DR that were subsequently approved.

Methods

A retrospective analysis was performed using data from the VACH cohort, a prospective Spanish cohort of 14,833 adult PLHIV from 23 investigational centers which has been enrolling patients since 1997 [32]. Patient data were prospectively collected in a standardized electronic case record form and electronically stored in the Aplicación de Control Hospitalario (AC&H™). All data were transformed into a standardized format and collected into a central data set. Patient data were deidentified and each patient assigned a unique code. The study was reviewed and approved by Ethics Review Board of Cantabria IDIVAL. The study was performed in accordance with the Helsinki Declaration. Consent to the registration of their data in the AC&H™ clinical management program and the use of anonymous data for epidemiologic studies was given by patients who were included in the analysis.

Patient Selection Criteria

All patients switching to INSTI-based 3DR or to DTG + RPV or DTG + 3TC at any point between May 2, 2016 and May 15, 2019 were included. 2016 was chosen as the starting date for the analysis because that was the year of the last approval for all the regimens being considered in the analysis. Patients were excluded if the reason for treatment switch in either the previous regimen or the study regimen was for a programmed interruption or intermittent treatment, re-initiation after being lost to follow-up, or for the intention of restoring wild-type virus. Patients were also required to have a minimum follow-up of 2 days.

Endpoints

Two primary analysis endpoints were defined: time to discontinuation due to treatment failure (TF; defined as clinician-assessed VF, immunologic failure, or disease progression) and time to discontinuation due to AEs. Additional endpoints were the risk of discontinuation due to TF or AEs. Reasons for switch or discontinuation were captured in the AC&H database via selections from a drop-down menu when the clinician reported a treatment switch or discontinuation. There was no definition for each reason (e.g., VF, immunologic failure, disease progression, AEs, intolerance) in the database. As such, VF, immunologic failure, disease progression, and AEs were per the clinician’s definition.

Statistical Analysis

Patients were censored at loss to follow-up, death, or end of observation period (June 20, 2019). Baseline continuous variables were compared by Student’s t test and categorical variables were compared by the χ2 test if all cells had expected counts more than 5, otherwise Fisher’s exact test was used.

To evaluate consistency of results, three secondary analyses were performed. First, patients included were restricted to those virologically suppressed (HIV RNA < 50 copies/mL) at the time of regimen switch; second, patients included were restricted to a subsample of matched patients who were virologically suppressed at switch in the two groups. Matching was performed as two 3DR patients for each 2DR patient by means of age, sex, number of previous VFs, and line of regimen as matching variables. Third, analysis of the TF endpoint was restricted to VF only.

In both primary and secondary analyses, times to discontinuation due to TF or AEs were estimated by Kaplan–Meier curves. Hazard ratios (HR) for risk of discontinuations due to TF or AEs were estimated using Cox proportional hazard models. Cox proportional hazard models were controlled for age, sex, viral load, CD4 counts, injection drug user (ever), HCV coinfection, HBV coinfection (as identified by HBsAg in patient sera), prior AIDS diagnosis, number of previous regimens, number of previous VFs, and years on ART.

All analyses were performed using STATA SE Version 15 (StataCorp LLC, College Station, TX).

Results

Patient Population

In all, 5047 3DR (8617 person-years) patients and 617 (756 person-years) 2DR patients were included in the analysis. Most patients in the 3DR group were on either EVG/c/TAF/FTC or DTG/ABC/3TC (Fig. 1a). The majority (67.9%) of the patients in the 2DR group were on DTG + 3TC (Fig. 1a). Baseline characteristics differed between groups (Table 1); patients on 2DR were older, had a longer duration of ART, had a higher number of previous regimens and of prior VF, and were more likely to be virologically suppressed at switch.

Fig. 1
figure 1

Regimen distribution of three-drug regimens and two-drug regimens in a the total analysis population, b the virologically suppressed at switch subgroup population, and c the matched virologically suppressed at switch subgroup population. 3TC lamivudine, ABC abacavir, c cobicistat, DTG dolutegravir, EVG elvitegravir, FTC emtricitabine, TAF tenofovir alafenamide, TDF tenofovir disoproxil fumarate, RAL raltegravir, RPV rilpivirine

Table 1 Patient demographics and clinical characteristics at baseline

In the population of virologically suppressed patients at regimen switch, 3996 3DR (7059 person-years) and 554 (681 person-years) 2DR patients were analyzed. The regimen distribution in the virologically suppressed population was similar to the full analysis population (Fig. 1b). Similar differences in baseline characteristics between the 3DR and 2DR patients were observed in the virologically suppressed.

In the matched population, 934 (1638 person-years) 3DR and 467 (575 person-years) 2DR virologically suppressed patients were included in the analysis. The regimen distribution was similar to the full analysis population and the unmatched virologically suppressed population (Fig. 1c). Patients on 2DR had a shorter duration on ART, and a lower proportion were HBV infected (Table 1).

Outcomes in the Full Analysis Population (Primary Analysis)

The majority of patients in both the 2DR (77.5%) and 3DR (72.3%) groups were still on treatment at the end of the study period. The main reason for discontinuation was treatment switch in both the 2DR (16.0%) and 3DR (19.5%) groups (Supplementary Material Fig. S1). The most common reason for switching was TF (18.2%) in the 2DR group and simplification (22.5%) in the 3DR group (Supplementary Material Table S1). Of those who switched, the proportion of patients who switched treatments due to VF was numerically higher in the 2DR (11.1%) versus the 3DR (5.8%) group (Supplementary Material Table S1). Treatment switch due to VF was 1.8% in the entire 2DR cohort and 1.1% in the entire 3DR cohort (Supplementary Material Table S1).

Time to discontinuation due to TF was significantly shorter for 2DR versus 3DR (P = 0.002; Fig. 2a), whereas there was no significant difference for time to discontinuation due to AEs (P = 0.908; Fig. 2b). After we controlled for demographic and clinical characteristics, the risk of discontinuation due to TF was 2.3 times higher on 2DR versus 3DR (HR = 2.33; P = 0.003; Table 2), and no difference was observed for discontinuations due to AEs (HR = 0.80; P = 0.488).

Fig. 2
figure 2

Time to discontinuation due to a treatment failure and b adverse events in the total analysis population

Table 2 Risk of discontinuation due to treatment failure (TF), adverse events (AEs), or virologic failure (VF) in patients switched to two-drug regimens (2DR) or three-drug regimens (3DR)

Outcomes in the Virologically Suppressed at Switch Population

In the secondary analysis of virologically suppressed patients at regimen switch, time to discontinuation due to TF was significantly shorter for 2DR versus 3DR (P = 0.001; Fig. 3a), whereas there was no significant difference for time to discontinuation due to AEs (P = 0.961; Fig. 3b). As in the full analysis population, after we controlled for demographic and clinical characteristics, the risk of discontinuation due to TF was 2.3 times higher on 2DR versus 3DR (HR = 2.28; P = 0.011; Table 2). No difference between 2DR and 3DR was observed for risk of discontinuations due to AEs (HR = 0.82; P = 0.575).

Fig. 3
figure 3

Time to discontinuation due to a treatment failure and b adverse events in the virologically suppressed at switch subgroup population

Outcomes in the Matched Virologically Suppressed at Switch Population

In the secondary analysis of matched virologically suppressed patients, time to discontinuation due to TF was significantly shorter for 2DR versus 3DR (P = 0.003; Fig. 4a). There was no significant difference for time to discontinuation due to AEs (P = 0.699; Fig. 4b). The risk of discontinuation due to TF was 3.0 times higher on 2DR versus 3DR (HR = 3.00; P = 0.017; Table 2). No difference between 2DR and 3DR was observed for risk of discontinuations due to AEs (HR = 0.85; P = 0.736).

Fig. 4
figure 4

Time to discontinuation due to a treatment failure and b adverse events in the matched virologically suppressed at switch subgroup population

Secondary Analysis of VF as Endpoint

In the full analysis population, time to discontinuation due to VF was significantly shorter for 2DR versus 3DR (P = 0.037; Fig. 5a) and risk of discontinuations due to VF was 2.2 times higher on 2DR versus 3DR (HR = 2.24; P = 0.024; Table 2). The times to discontinuation due to VF in the virologically suppressed and matched populations were similar to those in the full analysis population (Figs. 5b, c). The risks of discontinuations due to VF in the virologically suppressed and matched populations were approximately two times higher on 2DR versus 3DR, but the differences were not statistically significant (Table 2).

Fig. 5
figure 5

Time to discontinuation due to virologic failure in the a total analysis population, b virologically suppressed at switch subgroup population, and c matched virologically suppressed at switch subgroup population

Discussion

In this retrospective analysis of a large Spanish cohort of PLHIV, the time to discontinuation due to TF, and more specifically due to VF, was significantly shorter in patients switching to DTG + RPV and DTG + 3TC versus INSTI-based 3DR. The risk of discontinuation due to TF or VF with 2DR was approximately twofold higher compared with 3DR. There was no significant difference between time to or risk of discontinuation due to AEs between the 2DR and 3DR groups. The same general results were maintained when restricting the analysis to an unmatched and matched population of patients who were virologically suppressed at the time of switch. These data confirm the results from the initial analysis that demonstrated a shorter time to discontinuation and higher risk of discontinuation due to VF with 2DR than 3DR without a difference in discontinuation due to AEs [31]. Thus, there is a higher risk of TF with 2DR than 3DR, with no benefit from a safety or tolerability standpoint in the context of the limited study period. Long-term safety and tolerability benefits of a NRTI-sparing 2DR may be possible.

Evidence regarding differences in VF between 2DR and 3DR in a real-world setting is mixed. Consistent with the results in this analysis, one real-world observational study of over 250 French and Italian PLHIV found that among those with VF, the time to VF was almost 2 years shorter for 2DR versus 3DR [33]. A retrospective analysis of PLHIV in the OPERA observational database in the USA found that the difference in the incidence of VF between 2DR (7.9 per 100 person-years) and 3DR (6.0 per 100 person-years) was not statistically significant, although the difference in time to VF between 2DR and 3DR neared statistical significance (p = 0.06) [34]. Analysis of PLHIV in the European EuroSIDA database found that real-world virologic and immunologic outcomes of 2DR were similar to 3DR [35]. In contrast to the current study, these other real-world studies included a variety of 2DR, with a small proportion being DTG-based 2DR, and a wide range of 3DR rather than only INSTI-containing 3DR.

As in the real-world studies, there are mixed clinical trial data regarding the risk of VF between 2DR and 3DR. A meta-analysis published in 2016 found that the risk of VF in clinical trials of either treatment-naïve or virologically suppressed patients significantly increased with 2DR compared with 3DR in patients with baseline viral load greater than 100,000 copies/mL [36]. Furthermore, there was a twofold risk of selecting resistance mutations with 2DR versus 3DR [36]. In contrast, the SWORD clinical trials found that DTG + RPV was non-inferior to 3DR in the proportion of participants classified as experiencing VF, and in the TANGO trial no participants treated with DTG/3TC met the criteria for confirmed virologic withdrawal [18, 20]. The European AIDS Clinical Society in their treatment guidelines concluded that the DTG + RPV and DTG + 3TC regimens were not associated with more virologic rebounds than 3DR, although they noted that there have been a few cases of one- and two-class resistance emergence with DTG + RPV and two-class resistance with DTG + 3TC [8, 37, 38].

The discrepancy between the results for VF in the current study versus the SWORD and TANGO trials likely lies in the characteristics of the study population. The SWORD and TANGO trials selected participants who were virologically suppressed (viral load less than 50 copies/mL) and with no history of previous VF or resistance mutations [18, 20], while patients in the current real-world analysis were unselected for virologic characteristics. The proportion of patients who switched treatment due to TF or VF in the 2DR cohort was more than twice that of the 3DR cohort. Together these data suggest that carefully selected patients may have a comparable benefit with DTG + RPV as with 3DR. No historic genotypic data were available for the VACH cohort, and more research is needed on the risk of VF with 3DR and 2DR in PLHIV who have previous VF or resistance mutations.

The duration of the study periods may also explain the differences in VF between this real-world analysis and clinical trials. In the current analysis, time to discontinuation due to TF did not begin to diverge between the 2DR and 3DR until approximately 1 year, with sharper drop-offs appearing at approximately 2 years. However, the 2DR cohort in the second year of follow-up was small, which could have impacted the results. The SWORD, GEMINI, and TANGO trials were only 48 weeks in duration [18,19,20]. It is possible that more than 1 year of treatment is needed to see differences in VF between 2DR and 3DR. At the 148-week follow-up of the SWORD trials, participants who switched from 3DR to DTG + RPV generally maintained virologic suppression; 11 (1%) met the confirmed virologic withdraw criteria, but there was no corresponding 3DR group for comparison [39]. Real-world settings also present more challenges compared with randomized controlled trials, such as poorer adherence and less viral load monitoring, which can impact TF and VF [40, 41].

A strength of this updated analysis is that the time period of the study allows for the inclusion of more recent 3DR that were developed to reduce the toxicity (e.g., EVG/c/TAF/FTC) associated with some of the older TDF-and TAF-based 3DR, as well as single-tablet 3DR that improve ease of use. Since the completion of this study, additional TAF 3DR have become available that warrant further study. Also, the assessed 2DR are those that will likely increase in use based on the support of large randomized controlled trials [18,19,20] and revised guideline recommendations [7, 8]. The results of this study need to be considered within the context of a few limitations. At the time of analysis, the 2DR available were multi-tablet regimens, which may affect adherence. Baseline characteristics were different in the 2DR and the 3DR groups, with more treatment-experienced patients and more virologically suppressed patients at the time of switch in the 2DR group. This indicates potential confounding by indication. However, the results were maintained in the population of patients virologically suppressed at switch. There is also the possibility of residual confounding because of prescriber bias in the selection of patients for 2DR and because of reduced confidence in the robustness of 2DR. We recognize that factors such as previous regimens and number of previous VFs can impact VF [42,43,44]. Thus, an additional analysis of virologically suppressed patients at switch matched by age, sex, number of previous VFs, and treatment line was conducted, and the results from the full analysis population were maintained. An additional limitation is that the VACH database does not define VF, immunologic failure, or disease progression. They are merely drop-down choices that can be selected by investigators as a reason for treatment switches or discontinuation. Thus, the subjectivity of determining VF, immunologic failure, or disease progression could have impacted the results.

Conclusions

Three-drug therapy plays an important role in effectively managing HIV. In addition to clinical trial data, real-world data need to be considered when weighing the evidence for new ART regimens.