Introduction

Hormone receptor positive (HR +), human epidermal growth factor receptor 2 negative (HER2−) is the most prevalent subtype of invasive breast cancer and accounts for approximately 70% of all cases [1]. Metastatic breast cancer (MBC) remains a universally fatal disease, with overall survival (OS) limited to 2 to 3 years on average [2]. Despite the availability of endocrine therapy for the treatment of HR+, HER2− MBC, benefits progressively diminish with the development of resistance and progressive disease (PD).

Inhibitors of cyclin-dependent kinases (CDKs) 4 & 6, in combination with endocrine therapy, have been recommended by the National Comprehensive Cancer Network (NCCN) [3] and the European Society for Medical Oncology (ESMO) for treatment of HR+, HER2− advanced breast cancer (ABC) [4]. After progression on multiple lines of endocrine-based therapy, either alone or in combination with agents such as CDK4 & 6 inhibitors, sequential single-agent cytotoxic chemotherapy becomes the standard of care for most patients [3]. Cytotoxic chemotherapy is associated with substantially more toxicity compared to endocrine therapies [5,6,7,8]. The addition of CDK4 & 6 inhibitors to endocrine therapy has markedly improved disease control in both the first and second or greater line settings and recently have been shown to improve survival in several settings [9,10,11,12,13].

Abemaciclib is a potent and selective oral, small-molecule inhibitor of CDK4 & 6 which leads to sustained cell cycle arrest when dosed on a continuous schedule [14]. Abemaciclib has received regulatory approval globally in combination with an aromatase inhibitor as initial endocrine-based therapy for the treatment of postmenopausal women with HR+, HER2− ABC or MBC, in combination with fulvestrant for the treatment of women with HR+, HER2− ABC or MBC with disease progression following endocrine therapy, and in the United States (US) as a monotherapy for the treatment of adult patients with HR+, HER2− ABC or MBC with disease progression following endocrine therapy and prior chemotherapy in the metastatic setting [3, 4, 9]. Abemaciclib has demonstrated clinical activity as monotherapy in patients with HR+, HER2− MBC heavily pretreated with both endocrine and chemotherapy in the metastatic setting in MONARCH 1 (NCT02102490) [15].

MONARCH 1 was a single-arm phase II trial including patients with disease progression on or after endocrine therapy, with prior exposure to a taxane, and with at least 2 prior chemotherapy regimens including at least 1 in the metastatic setting. The objective response rate (ORR) was 19.7% (95% CI [confidence interval], 13.3, 27.5), and the median OS was 22.3 months [15]. Although this was a single-arm study, the ORR and OS observed in MONARCH 1 suggest single-agent abemaciclib may offer a more favorable benefit-risk profile than what might be expected in patients receiving cytotoxic chemotherapy [6, 8]. At the time of the MONARCH 1 trial, treatment options for this patient population were typically limited to chemotherapy [15]. However, without a comparator arm, it is difficult to put these findings into clinical context relative to available treatment options.

Traditionally, historical controls from previous clinical trials have been used to provide context for results from single-arm trials [16, 17], but this approach may be hampered by differing patient populations or lines of treatment. In contrast, real-world data (RWD) allow for selection of a more contemporaneous cohort of patients who match relevant trial criteria and patient-level data, and allow for matching between real-world and trial cohorts to balance patient characteristics [16, 18, 19]. This retrospective cohort study used RWD from patients treated in a clinical practice setting for MBC to create a single-agent chemotherapy control arm to help contextualize the results observed in MONARCH 1.

Materials and methods

Data source

This retrospective study utilized the Flatiron Health electronic health records (EHR)-derived database and included patients diagnosed with MBC from 01 January, 2011, to 28 February, 2018. The Flatiron Health database is a US-based longitudinal, demographically and geographically diverse database derived from EHR data from over 280 cancer clinics (~ 800 sites of care) representing more than 2.1 million active cancer patients. As of March 2018, the Flatiron MBC cohort included more than 15,000 patients with MBC from approximately 180 clinics. Patients were included in the cohort if they were stage IV at initial diagnosis or if they developed recurrent MBC after an initial diagnosis of early stage breast cancer. The database includes both structured and some unstructured EHR data elements, such as patient demographics (gender, race, birth year, and state of residence), type of cancer facility visited (community vs. academic), clinical diagnoses, laboratory data, biomarker tests and results, medications ordered and/or administered, line of therapy (derived), month and year of death, and other patient clinical characteristics including cancer stage at initial diagnosis and performance status (PS). Date of death was derived from EHR data, commercial death data, and the Social Security Death Index [20].

Real-world chemotherapy cohort key inclusion and exclusion criteria

The inclusion and exclusion criteria applied to the Flatiron Health MBC Cohort for this study were intended to recapitulate the MONARCH 1 trial key eligibility criteria [15], where it was feasible to do so, in order to provide the most relevant real-world comparator to the MONARCH 1 patient population (Table 1). The criteria consisted of female patients with a diagnosis of HR+, HER2− MBC who received monotherapy with capecitabine, gemcitabine, eribulin, or vinorelbine in second or later lines of therapy. These agents were selected based on ESMO and NCCN guidelines for single-agent chemotherapy for patients in this setting.

Table 1 Inclusion criteria for MONARCH 1 and real-world chemotherapy cohort

Patients received at least 1 but no more than 2 lines of chemotherapy for those in the advanced setting prior to receipt of the single-agent chemotherapies listed above. Those included had an Eastern Cooperative Oncology Group (ECOG) performance status of 0 or 1 (within 60 days before or 30 days after the eligible line of therapy). Patients who received CDK4 & 6 inhibitor drugs (palbociclib, ribociclib, and/or abemaciclib) in prior lines of therapy were excluded. Patients who received prior HER2 targeted therapy were excluded. Patients with diagnosis codes for central nervous system (CNS) metastasis (ICD 9: 198.3 and 198.4; ICD 10: C79.31, C79.32, and C79.49) on or before the eligible line of therapy were excluded.

Due to limited information within the data on the adjuvant setting, prior taxane and endocrine therapy in the adjuvant and/or metastatic setting were not required. Patients who received monotherapy with capecitabine, gemcitabine, eribulin, or vinorelbine in 1 or more lines of therapy qualified as eligible based on the above selection criteria. If a patient had only 1 eligible line of therapy, the index drug was the drug contained in the eligible line and the index date was the start date of the eligible line. If the patient had multiple eligible lines of therapy, a line was randomly selected from eligible lines and the index date was the start date of the randomly selected line of therapy (Table 2). The patients were followed from the index date until the date of death, loss to follow-up, or end of the database. The index date must have occurred at least 3 months prior to the end of the database.

Table 2 Attrition table for real-world chemotherapy cohort

Statistical analyses

Descriptive statistics were used to summarize baseline demographic and clinical characteristics for the MONARCH 1 and real-world chemotherapy cohorts. The primary endpoint of this exploratory analysis was OS, defined as the time from index date to either censoring or death. Patients without a date of death were censored at the last activity date. OS was analyzed by the Kaplan–Meier method and Cox proportional hazards regression in the matched cohorts. As the primary analysis, the Mahalanobis distance matching method [21] was used to match each patient from MONARCH 1 with a patient from the real-world chemotherapy cohort according to key baseline and disease characteristics. The following characteristics were incorporated into the matching process: age group, race group, number of prior chemotherapies in the metastatic setting, number of prior endocrine therapies in the metastatic setting, prior capecitabine use, and progesterone receptor status. The intent of the matching procedure is to correct for any observed imbalance due to differing baseline demographics and disease characteristics.

As a sensitivity analysis, entropy balancing was performed. Entropy balancing [22] provided a reweighting scheme used to adjust inequalities in distribution of baseline characteristics across the MONARCH 1 and real-world chemotherapy cohorts using pre-specified variables for reweighting. Variables for reweighting using entropy balancing included age group, race group, number of prior chemotherapy regimens in the metastatic setting, number of prior endocrine therapies in the metastatic setting, progesterone receptor status, prior capecitabine use, ECOG PS, and stage at initial diagnosis. The weighted Kaplan–Meier method was applied to the weighted real-world chemotherapy cohort to estimate median OS, and the bootstrap approach was used to estimate 95% confidence interval (CI) of the median OS.

The treatment effect was also evaluated among 2 subsets in additional sensitivity analyses. Since Flatiron Health data are US-based, the first subset was among US patients only, where treatment effect was evaluated between the entire real-world cohort and MONARCH 1 US patients. The second subset was US patients with prior endocrine therapy in the metastatic setting, where treatment effect was evaluated between the real-world cohort with prior endocrine therapy in the metastatic setting and the MONARCH 1 US patients with prior endocrine therapy in the metastatic setting to try and account for the prior endocrine therapy inclusion criterion within the trial.

Duration of therapy was defined as time from index date to last order or administration of the eligible drug during the line of therapy. If the drug was an oral medication, then 30 days were added to the last order date to assume a 30-day supply. Quartiles and median duration of treatment along with 95% CIs were estimated using the Kaplan–Meier method. The Cox model was used to estimate HR. All hypothesis tests were conducted at a 2-sided alpha level of 0.05, unless otherwise stated. All CIs were given at a 2-sided 95% level, unless otherwise stated. Data were analyzed using SAS version 9.2 or later (SAS Institute Inc.) and R (The R Foundation).

Results

After applying the inclusion and exclusion criteria, 281 patients were identified who received an eligible line of therapy and were included in the real-world chemotherapy cohort (Table 2).

Baseline demographics and clinical characteristics

Compared to the MONARCH 1 cohort (n = 132), the real-world cohort (n = 281) was more likely to be 65 years or older (43.4% vs. 31.8%, p = 0.03) and less likely to be white (68.3% vs. 93.9%, p < 0.0001) (Table 3). A majority of the patients in MONARCH 1 (70 patients, 53.0%) were enrolled at sites in the US. Unlike the real-world chemotherapy cohort which included all US patients, patients in MONARCH 1 were also enrolled at sites in Belgium (28 patients, 21.2%), Spain (23 patients, 17.4%), and France (11 patients, 8.3%). Only 17 (12.9%) patients in the MONARCH 1 cohort had not received prior endocrine therapy in the metastatic setting compared to 114 (40.6%) patients in the real-world chemotherapy cohort. More than half (76, 57.6%) had prior capecitabine in the MONARCH 1 cohort compared to only 73 (26.0%) in the real-world chemotherapy cohort (Table 3). The cohorts were similar with respect to number of prior lines of chemotherapy in the metastatic setting and progesterone receptor status.

Table 3 Selected baseline characteristics: before and after Mahalanobis distance matching

The Mahalanobis distance matching method was used to select patients who had the shortest Mahalanobis distances from the 132 patients in MONARCH 1 and the 281 patients in the real-world chemotherapy cohort. Patients with missing baseline disease characteristics were removed from the distance calculation. Following Mahalanobis distance matching, 108 patients from the MONARCH 1 cohort were matched to 108 patients from the real-world chemotherapy cohort, and the cohorts had similar patient and disease characteristics. In the matched cohorts, no statistically significant differences existed in age, race, lines of chemotherapy in the metastatic setting, prior endocrine therapy in the metastatic setting, progesterone receptor status, and prior capecitabine use (Table 3). The prior therapy profile between the matched MONARCH 1 and real-world chemotherapy cohort was also similar (Table 4). In the MONARCH 1 cohort there was greater tamoxifen (29, 26.9% vs. 12, 11.1%), letrozole (40, 37.0% vs. 20, 18.5%), and bevacizumab (13, 12.0% vs. 0) use compared to the real-world chemotherapy cohort in the metastatic setting (Table 4).

Table 4 Prior therapy comparison between matched MONARCH 1 and real-world chemotherapy cohort

Overall survival

Using the matching method, the adjusted median OS was 22.3 months (95% CI: 16.0, NR) in MONARCH 1 and 13.6 months (95% CI: 9.6, 16.6) in the real-world chemotherapy cohort. The estimated HR between the 2 matched adjusted groups was 0.536 (95% CI: 0.37, 0.77) (Table 5, Fig. 1).

Table 5 Overall survival between matched MONARCH 1 and real-world chemotherapy cohort
Fig. 1
figure 1

Overall survival between matched MONARCH 1 and real-world chemotherapy cohort. p-value (2-sided)—LOGRANK unstratified for comparing MONARCH 1 with real-world chemotherapy cohort. CI Confidence interval, HR hazard ratio, NR not recorded

The results of sensitivity analyses, using entropy balancing as a secondary analytic approach, were consistent with the Mahalanobis distance matching results. The adjusted median OS was 12.7 months in the real-world cohort (n = 281) with a HR of 0.56 (95% CI from bootstrapping: 0.44, 0.78). The sensitivity analysis in the real-world chemotherapy cohort (n = 281) and MONARCH 1 US patients only (n = 70) was consistent with a HR of 0.60 (95% CI: 0.45, 0.93). The sensitivity analysis of the patients with prior endocrine therapy for MBC in the real-world chemotherapy cohort (n = 167) and in the MONARCH 1 US cohort (n = 62) was also consistent with a HR of 0.52 (95% CI: 0.37, 0.82) (Fig. 2).

Fig. 2
figure 2

Sensitivity analyses of overall survival. aMethod in main analysis. bEntropy balancing method was applied. Sample sizes are as follows: Mahalanobis distance method: real-world chemotherapy cohort (n = 108) vs. MONARCH 1 (n = 108). Entropy balancing: real-world chemotherapy cohort (n = 281) vs. MONARCH 1 (n = 132). US patients only: real-world chemotherapy cohort (n = 281) vs. US patients only (n = 70). US patients who received prior ET: real-world chemotherapy cohort with prior ET (n = 167) vs. US patients who received prior ET in metastatic setting (n = 62). Abbreviations: ET endocrine therapy

Duration of therapy

Duration of therapy was not significantly different between the 2 cohorts. The median duration of therapy was 4.1 months in MONARCH 1 compared to 2.9 months in the real-world chemotherapy cohort (p = 0.050) with a HR of 0.764 (95% CI: 0.58, 1.00) (Table 6).

Table 6 Duration of therapy

Post-discontinuation therapy

Post-discontinuation therapy was consistent between the 2 cohorts. In total, 71 patients in the real-world chemotherapy cohort received at least 1 subsequent therapy compared with 77 patients in MONARCH 1. A total of 61 patients in the real-world cohort received chemotherapy as any post-discontinuation therapy compared with 65 patients in MONARCH 1. However, more patients in the real-world chemotherapy cohort received palbociclib (n = 32) or endocrine therapy (n = 50) versus patients in MONARCH 1 (n = 0 receiving palbociclib, n = 13 receiving endocrine therapy) as any post-discontinuation therapy (Table 7).

Table 7 Summary of post-discontinuation treatment

Discussion

Abemaciclib has demonstrated clinical activity as a monotherapy in patients with HR+, HER2− MBC heavily pretreated in the advanced setting with both endocrine and chemotherapy in MONARCH 1 (NCT02102490) [15]. Here, we compared OS and duration of treatment in MONARCH 1 with a matched real-world cohort of patients who received standard-of-care treatment with single-agent chemotherapy and had not previously received CDK4 & 6 inhibitors using the Flatiron database. Although MONARCH 1 did not include a control arm, this real-world chemotherapy cohort provided a data source representative of a control arm. The OS was significantly longer in the MONARCH 1 cohort at 22.3 months compared to 13.6 months in the matched real-world chemotherapy cohort. Duration of treatment was not significantly different between the two groups; however, it was numerically longer in the MONARCH 1 cohort (4.1 months) compared to the real-world cohort (2.9 months). Altogether, these results suggest a possible survival advantage in favor of abemaciclib as a monotherapy in these patients.

In certain cases, regulatory agencies have accepted the use of real-world control arms to contextualize results from single-arm trials to support regulatory decisions. The increasing accessibility of digital health data, in combination with rising costs and recognized limits of traditional trials, has renewed interest in the use of RWD to enhance the efficiency of research and bridge the evidentiary gap between clinical research and practice [23]. The FDA Real-World Evidence (RWE) Framework notes RWE can be used as the basis for external controls in some situations [16]. Furthermore, RWE has been pivotal in some European regulatory decisions involving conditions with significant unmet need and when a randomized clinical trial is unfeasible or unethical [24]. For example, in the case of blinatumomab, the real-world cohort was helpful in supporting accelerated approval for the treatment of acute lymphoblastic leukemia by the European Medicines Agency [25, 26]. In the case of avelumab as a monotherapy for metastatic Merkel cell carcinoma (mMCC), RWE was used to characterize the natural history of mMCC and was offered to regulators as a benchmark. A subset of trial patients who responded well to treatment was identified and the benefit documented through contrast with the RWE benchmark data, leading to regulatory approval in the US, European Union, and Japan for that subset [19, 27, 28].

There have been improvements in the quality of data over the past several years, with access to recent data containing rich clinical variables and relevant endpoints. For example, the Flatiron Health EHR-derived database represents a diverse group of community cancer clinics, ranging from small practices to large multicenter practices. A full copy of each patient’s medical record is pulled into a central repository for processing. Structured data such as demographics, medications, and routine laboratory tests are harmonized and normalized to a standard ontology and common data model. These structured data are processed and harmonized centrally by Flatiron’s technology-assisted data engine and made accessible for research and analytics. Unstructured data such as case notes, pathology reports, and complex laboratory tests are turned into discrete analyzable data using technology driven abstraction [29, 30].

We attempted to mimic the eligibility criteria for MONARCH 1 in the Flatiron cohort, and we used matching and balancing methods to control for differences in key measured confounders between cohorts. To adjust for potential differences, the Mahalanobis distance matching method [21] was used to match each patient from MONARCH 1 with a patient from the real-world chemotherapy cohort. The aim of this method was to select a subset of patients from the real-world chemotherapy cohort with the most comparable baseline and disease characteristic to the MONARCH 1 population. Following this procedure, known and measured baseline characteristics were balanced between the cohorts. Any observed imbalance due to differing baseline and disease characteristics was corrected, as illustrated by the lack of significance in any of the p-values.

Some of the potential limitations to this approach have been addressed through sensitivity analyses. However, we were unable to match the trial eligibility criteria exactly due to a lack of data availability in Flatiron (such as incomplete data on adjuvant therapy) and unmeasured confounders. Data on adjuvant therapy, including prior taxane use, are not complete in the real-world chemotherapy cohort because patients may have received care prior to adoption of an EHR at the practice, or patients may have received care at a practice outside the Flatiron Health network. Other variables such as sites of metastatic disease (e.g. visceral disease or liver metastases), previous cancers, and comorbidities were not included in the Flatiron Health data. Therefore, residual differences between the real-world chemotherapy and MONARCH 1 cohorts may have contributed to the observed outcomes, even after matching on key measured variables. Another limitation is orders for oral therapies may be incomplete in the structured EHR data, as subsequent refills may not be documented, thus duration of treatment may have been underestimated in the real-world chemotherapy cohort. Another limitation inherent to real-world studies is the potential for selection bias. The fact that clinicians chose to give 1 cohort of patients’ chemotherapy instead of endocrine or targeted therapy could imply patients in the chemotherapy only arm is a higher risk group of patients.

Furthermore, the MONARCH 1 and real-world chemotherapy cohort were not contemporaneous. MONARCH 1 enrolled patients from June 2014 through April 2015, while the real-world chemotherapy cohort included patients with index dates from January 2011 through February 2018. As a result, 29.6% of patients in the real-world chemotherapy cohort received a CDK4 & 6 inhibitor following discontinuation of the index therapy. This suggests the survival in the real-world cohort is potentially longer than it would have been in a truly contemporaneous cohort, thus the difference in overall survival observed in this analysis may be underestimated. Finally, MONARCH 1 included patients from Europe and the US, while the real-world chemotherapy cohort was from the US only. However, sensitivity analyses suggest results are consistent in the subset of MONARCH 1 patients from the US (Fig. 2).

Overall conclusion

Methodological advances in statistical analyses and improvements in data quality enable the use of a real-world single-agent chemotherapy cohort as an external comparator arm. This study demonstrated an approach to create a real-world chemotherapy cohort to serve as a suitable comparator for MONARCH 1. These exploratory results suggest a possible survival advantage in heavily pretreated patients with advanced MBC treated with abemaciclib monotherapy compared to those treated with single-agent chemotherapy.