Introduction

Multiple myeloma (MM) is one of the most common hematologic malignancies (National Cancer Center 2023), and its incidence is increasing with the aging population worldwide (Turesson et al. 2018). While it has traditionally been observed that the incidence of MM is lower in Asians compared to Western countries (Ludwig et al. 2020), recent trends show an increase in diagnoses among Asian populations. This growing number of MM cases in Asians, including in Korea where the incidence has doubled within a decade (Hong and Lee 2016; National Cancer Center 2023), emphasizes the need for further research and attention to this patient group.

In recent years, significant advancements have been made in the development of treatments for MM, including the introduction of proteasome inhibitors (Scheid et al. 2021). One such drug is carfilzomib, which is a second-generation proteasome inhibitor that received accelerated approval from the U.S. Food and Drug Administration (FDA) for the treatment of relapsed or refractory multiple myeloma (RRMM) (Herndon et al. 2013). Additionally, carfilzomib was approved in South Korea in March 2017 and has become the preferred treatment for RRMM.

While carfilzomib has demonstrated promise in enhancing patient outcomes, including progression-free survival, overall survival, and quality of life, concerns about its safety profile persist due to reports of significant adverse events such as heart failure in various clinical trials (Dimopoulos et al. 2016; Hájek et al. 2017; Stewart et al. 2015). Additionally, although no differences in the pharmacokinetics and metabolism of carfilzomib were identified between Asians and non-Asians (Ou et al. 2017), subgroup analyses of clinical trials reported a higher incidence of adverse events in Asians compared to the overall study populations (Dimopoulos et al. 2019). Nevertheless, the clinical trials of carfilzomib included only a limited number of Asian participants (Dimopoulos et al. 2016; Hájek et al. 2017; Stewart et al. 2015). These highlight the need for real-world safety studies of carfilzomib, especially among Asian populations that were underrepresented in randomized controlled trials (RCTs). To address these concerns and generate more substantial evidence, safety studies of carfilzomib were conducted in Japan (Kawaji‐Kanayama et al. 2022; Kawasaki et al. 2022) and Korea (Jang et al. 2022; Lee et al. 2021). Particularly, a previous study with a small cohort of 138 patients, which utilized the target trial emulation (TTE) method using electronic medical records of a tertiary hospital in Korea, identified significant increased risk of dyspnea, muscle spasm, and thrombocytopenia in carfilzomib users. However, these previous studies had limitations such as small sample sizes, lack of comprehensive data, or absence of an active comparator group, which limited their ability to provide conclusive evidence on the risk of carfilzomib. Therefore, this current study aims to address these limitations by analyzing the safety of carfilzomib using a larger sample size and Korean nationwide observational data. Through this study, the researchers intend to contribute valuable evidence regarding the safety of carfilzomib in Asian populations, specifically in Koreans, and enhance understanding of its risk profile.

In this study, the researchers utilized the TTE method to overcome the challenges associated with conducting an RCT to evaluate the safety of carfilzomib in Asian populations. The TTE method involves designing hypothetical RCTs that can be simulated using real-world data, providing a guide for conducting reliable observational studies (Hernán and Robins 2016). This approach has gained attraction in drug safety studies, with an increasing number of studies adopting this methodology (Ahn et al. 2022; Rossides et al. 2021; Takeuchi et al. 2021; Xie et al. 2020). By constructing a hypothetical RCT and emulating it using real-world data, this method enables more systematic analysis and allows for the interpretation of results similar to those obtained from RCTs (Zhao et al. 2020). Therefore, in this study, a retrospective cohort study was conducted within an RCT-like setting, utilizing a sufficiently large number of subjects. The purpose of this approach is to provide more clinically relevant information regarding the safety of carfilzomib and its use in Asian populations.

The objective of this study was to assess the safety of carfilzomib in the treatment of RRMM in the Korean patients and to evaluate if the results from the RCTs could be applied to the broader Asian population.

Materials and Methods

Data source

This study used the customized Health Insurance Review and Assessment Service (HIRA) data in South Korea from January 1, 2007 to May 31, 2020. HIRA data is a nationwide data including over 50 million people because almost all of the Korean population is covered by a single national insurance. This data includes information about patients’ demographic characteristics, diagnosis codes, and treatment (Kim et al. 2020). All patient information was anonymized. This study was approved by the Institutional Review Board of Seoul National University (IRB No. E2101/001–003).

Study design

This was a retrospective cohort study using a TTE method (Fig. 1, Table 1) modified with reference to the ASPIRE trial (Stewart et al. 2015). Individual cohorts were constructed for each outcome variable. The study followed the per-protocol method. The index date was set as the first administration date of the KRd (combination therapy comprised of carfilzomib, lenalidomide, and dexamethasone) or Rd (combination therapy comprised of lenalidomide and dexamethasone) regimens. Censoring dates were determined based on several factors, including the occurrence of the endpoint, the last date of data collection, 600 days from the index date, the study end date, 30 days from the last administration date, and the date of treatment change or stem cell transplantation. A grace period of 60 days was implemented, since the claims data provided for inpatient medical insurance may have a lag time due to monthly claim submissions.

Fig. 1
figure 1

Study design to evaluate adverse outcomes of KRd or Rd regimens. Rx, prescription

Table 1 Study protocol of the hypothetical target trial and emulated trial using claims data

Study population

The study included patients diagnosed with MM between January 1, 2007 and May 31, 2020 and had been previously received at least one treatment regimen other than KRd or Rd. To select new users of KRd or Rd for RRMM, the exclusion criteria were as follows: patients previously prescribed carfilzomib, or those who received KRd or Rd as first-line therapy. Furthermore, different exclusion criteria and screening periods were used according for each outcome variable in various organ systems as outlined in Supplementary Table 1. By excluding patients with pre-existing outcome variables before the index date, the study aimed to establish cohorts that specifically focused on the association between the drug regimen (KRd or Rd) and the occurrence of the outcome variables of interest.

Exposures

The KRd regimen was defined as the case treatment while the Rd regimen was defined as the comparator treatment. The exposure to the KRd regimen was defined as the presence of claims for carfilzomib, lenalidomide, and tablets or injectable dexamethasone on the same date. On the other hand, exposure to the Rd regimen was defined as the presence of claims for lenalidomide and tablets or injectable dexamethasone on the same date with no claims for any other anticancer drugs claims on that specific date.

In consideration of the reimbursement timeline for carfilzomib in South Korea, the exposure window in this study was set from January 1, 2018 to May 31, 2020. This timeframe was chosen to include the period when carfilzomib became eligible for reimbursement, ensuring that patients who initiated treatment with carfilzomib during this timeframe were included in the analysis. A maximum of 600 days from the date of first prescription was defined as the duration of exposure.

Outcomes

The primary focus of this study was to examine adverse events in various organ systems that occurred with a higher frequency in the carfilzomib group compared to the comparator group, specifically with a difference of at least 5 percentage points and other adverse events of interest based on the findings from the ASPIRE trial (Stewart et al. 2015). The study included comprehensive set of outcome variables consisting of 21 different adverse events on different organ systems. These adverse events encompassed a wide range of organ systems including blood and lymphatic, cardiac, gastrointestinal, general, infections, metabolism and nutrition, musculoskeletal and connective tissue, nervous, renal and urinary, respiratory, and vascular disorders.

The diagnosis of MM and other relevant medical conditions was defined according to the Korean Standard Classification of Diseases (KCD) version 8. For drug utilization, the Anatomical Therapeutic Chemical (ATC) Classification System code was used. The procedures were defined using procedure codes, which provided a standardized method for identifying and classifying specific medical procedures or interventions. The operational definitions for each adverse event outcome variable are shown in Supplementary Table 2 with references. The operational definitions were referenced by integrating the definitions used in previous research methods. The outcome occurrence date was defined as the first occurrence date.

Covariates

Basic patient information, comorbidity, MM treatments, and other medications were collected and included as covariates. The operational definitions and screening periods for the covariates apart from the outcome variables are summarized in Supplementary Table 3.

Statistical analysis

In the individually constructed cohorts, the propensity scores were calculated using logistic regression. The outcome variable was excluded from prescreening process and the remaining outcome variables were matched as covariates. Two groups were matched 1:1 using the 5-to-1-digit greedy matching algorithm (Parsons 2001) in order to balance the covariates between the carfilzomib and comparator groups. A standardized difference of less than 0.1 between the groups was considered an indication that the confounding factors had been adequately matched.

Descriptive analysis was performed for both covariates and outcome variables. Continuous variables were summarized using either median (range) or mean (± standard deviation). Categorical variables were presented as the frequency and percentages (%). To assess the relationship between treatment and the outcomes, the hazard ratio (HR) with 95% confidence interval (CI) was calculated using the Cox proportional hazard model. Additionally, Kaplan–Meier survival analysis was performed to estimate and visualize the survival probabilities over time for the carfilzomib and comparator groups.

Two sensitivity analyses were conducted to evaluate potential biases and further investigate the robustness of our findings. The first sensitivity analysis involved changing the comparator group to historical comparators who had received the Rd regimen before January 1, 2018 to evaluate the potential bias resulting from changes in treatment guidelines over time. The second sensitivity analysis involved performing the analysis without limiting the grace period to assess the potential bias resulting from the grace period setting and explored its influence on the observed outcomes. For the statistical analysis SAS Enterprise Guide 6.1 and SAS 9.4 (SAS Institute Inc., Cary, NC, USA.) were used.

Results

Baseline characteristics

The flow chart for subject selection process is shown in Fig. 2. Table 2 shows the baseline characteristics of the matched cohort, excluding patients with underlying heart failure. The median (range) follow-up period in the heart failure cohort was 121 (0–600) days in the KRd group and 153.5 (0–600) days in the Rd group. Before matching, the Rd group had a higher mean age compared to the KRd group. However, after matching, the mean ages of the KRd and Rd groups were more comparable with 66 (28–87) and 67 (25–86), respectively. The proportions of male subjects in the KRd and Rd groups were 53.38% and 55.34%, respectively. After propensity score matching, all covariates between the KRd and Rd groups were well balanced, with standardized differences less than 0.1, suggesting that potential confounding factors were adequately controlled. Supplementary Table 4 shows the distribution of important characteristics for each outcome variable after matching.

Fig. 2
figure 2

Flow chart of subject selection to evaluate 21 adverse outcomes of KRd and Rd regimens. The index date was the first administration date of KRd or Rd regimen. The outcome variables were anemia, neutropenia, thrombocytopenia, diarrhea, fatigue, cough, pyrexia, upper respiratory tract infection, hypokalemia, muscle spasms, peripheral edema, constipation, back pain, dyspnea, peripheral neuropathy, hypertension, acute renal injury, heart failure, deep vein thrombosis, ischemic heart disease, and pulmonary embolism. KRd, combination therapy of carfilzomib, lenalidomide, and dexamethasone; Rd, combination therapy of lenalidomide and dexamethasone; RRMM, relapsed and refractory multiple myeloma; aHistorical comparators

Table 2 Baseline characteristics of the KRd and Rd groups in the heart failure cohort before and after propensity score matching

Risk of safety outcomes

The incidence and HR for each outcome variable between the KRd and the Rd group are shown in Table 3. The risks of all of the blood and lymphatic system adverse events including anemia (HR 1.55; 95% CI 1.21–1.99), neutropenia (HR 1.62; 95% CI 1.32–1.98), thrombocytopenia (HR 2.19; 95% CI 1.59–3.02), and some non-hematological adverse events including cough (HR 1.32; 95% CI 1.09–1.60), hypokalemia (HR 1.56; 95% CI 1.14–2.13), constipation (HR 1.53; 95% CI 1.19–1.95), hypertension (HR 1.58; 95% CI 1.15–2.17), and heart failure (HR 2.04; 95% CI 1.24–3.35) were significantly higher in the KRd group compared to the Rd group. Among the non-hematological adverse events, heart failure and hypertension showed the greatest increase in risks in the KRd group compared to those in the Rd group. Kaplan–Meier survival curves for each outcome variable are shown in Supplementary Fig. 1.

Table 3 Incidences and risks of adverse outcomes in KRd and Rd groups with contemporaneous and historical comparator

Sensitivity analyses

Table 3 shows the results of the first sensitivity analysis, which utilized a historical comparator group consisting of individuals who received Rd before January 1, 2018. The utilization of the historical comparator group resulted in an HR value that was more aligned with the null hypothesis. This finding indicates that the risks of adverse events were comparatively less distinct between the KRd and Rd groups in the sensitivity analysis compared to the original analysis. Unlike the original analysis, the first sensitivity analysis did not find significant differences in the risks of cough, hypokalemia, and constipation between the KRd and Rd groups. Additionally, pyrexia and upper respiratory tract infection occurred more frequently in the historical Rd group. However, for most of the other outcome variables, the direction and the statistical significance of the HR remained consistent in the first sensitivity analysis compared to those in the original analysis.

Supplementary Table 5 shows the results of the second sensitivity analysis, where the grace period was not limited to 60 days. The risk of peripheral neuropathy was not found to be significantly different between the KRd and Rd groups in original analysis. However, peripheral neuropathy occurred more frequently in the KRd group than in the Rd group in the sensitivity analysis. On the other hand, for all other outcome variables included in the second sensitivity analysis, the direction and statistical significance of the HR remained consistent with the original analysis. This indicates that the results for those outcome variables were robust and not substantially affected by the changes in the grace period.

Discussion

In this study, the safety of carfilzomib, the medication used for treating RRMM, was evaluated using nationwide health insurance claims data from South Korea. The analysis employed the TTE method to evaluate the risk of various adverse events associated with carfilzomib treatment in an Asian population. The study findings showed a statistically significant increase in the risk of several adverse events when carfilzomib was used for treating RRMM. Specifically, the analysis showed a higher risk of anemia, neutropenia, thrombocytopenia, cough, hypokalemia, constipation, hypertension, and heart failure associated with carfilzomib in treating RRMM. These results emphasize the importance of considering the potential adverse events when utilizing carfilzomib in the management of RRMM. The use of nationwide health insurance claims data from South Korea and the application of the TTE method contribute to understanding the safety profile of carfilzomib in an Asian population.

In our study, the analysis revealed that the risks of hematologic adverse events including anemia, neutropenia, and thrombocytopenia, were significantly higher in the KRd group compared to the Rd group. These findings are consistent with previous RCTs (Dimopoulos et al. 2016; Hájek et al. 2017; Stewart et al. 2015) and observational studies (Imtiaz et al. 2021; Kawaji‐Kanayama et al. 2022; Rocchi et al. 2021) supporting the notion that carfilzomib treatment is associated with an increased risk of hematologic adverse events. The increased risks of anemia, neutropenia, and thrombocytopenia emphasize the need for the monitoring of hematologic adverse events when using carfilzomib.

The incidence of certain non-hematologic adverse events including cough, hypokalemia, constipation, hypertension, and heart failure was higher in the KRd group compared to the Rd group. These findings are consistent with several previous studies (Bishnoi et al. 2021; Dimopoulos et al. 2016; Fakhri et al. 2020; Hájek et al. 2017; Stewart et al. 2015; Waxman et al. 2018). In our study, hypokalemia events were of grade 2 or higher requiring the use of potassium supplements. Although hypokalemia does not present with symptoms, it holds clinical significance due to its potential to cause heart-related diseases such as electrocardiographic changes and cardiac arrhythmias (Kardalas et al. 2018). The exact mechanism by which carfilzomib induces hypokalemia has not been determined, however it is believed to involve the impact of carfilzomib on K+ channels and cell functioning (So et al. 2019). Additionally, the use of carfilzomib was associated with an increased risk of cardiovascular and pulmonary adverse events, particularly hypertension and heart failure (Bishnoi et al. 2021; Fakhri et al. 2020; Waxman et al. 2018). Cough was identified as a potential symptom of cardiopulmonary adverse events (Fakhri et al. 2020). In our study, the incidence of heart failure in the KRd group was higher (7.8%) compared to the Rd group (4.3%). These rates were slightly higher than those reported in the ASPIRE trial (KRd 6.4%, Rd 4.1%). Furthermore, the incidence of hypertension was also higher in our study (KRd 22.7%, Rd 15.4%) compared to the ASPIRE trial (KRd 14.3%, Rd 6.9%) (Stewart et al. 2015). These results might be due to the real-world setting of our study, which did not exclude patients with cardiovascular comorbidities and risk factors, in contrast to the ASPIRE trial (Stewart et al. 2015) which strictly excluded such patients. A previous observational study in Japan, which included 50 carfilzomib users, reported one case of heart failure and could not confirm an association between carfilzomib and heart failure (Kawaji‐Kanayama et al. 2022). In our current study, we conducted propensity score matching to ensure similarity between the KRd and Rd groups to emulate the design of the RCT using nationwide data in Korea. Our results were comparable to those reported in the ASPIRE trial (Stewart et al. 2015) and the subgroup analysis of RCTs including Asians (Dimopoulos et al. 2019). There were no discernible differences when comparing the outcomes between the Korean population and a non-Asian study population, which also indicated the robustness of our study.

Our findings support needs for monitoring of cardiovascular adverse events during treatment of carfilzomib. The 2022 European Society of Cardiology guidelines recommend regular monitoring of blood pressure, natriuretic peptides, and echocardiography at baseline and throughout therapy, particularly during the first six cycles (Lyon et al. 2022). These guidelines also recognize the limited availability of studies to establish an optimal monitoring schedule for cardiovascular adverse events, and they highlight a low level of evidence. Additionally, previous research utilizing data from the FDA’s adverse event reporting system indicated that cardiovascular adverse events tend to occur early, with a median onset time of 41 days (Zhai et al. 2021). This underlines the importance of closely monitoring cardiovascular adverse events within the first 40 days of treatment. Further research is necessary to determine the optimal frequency of monitoring for effectively preventing cardiovascular adverse events.

This study had several strengths that contribute to its robustness and applicability. Firstly, it was designed as a systematic study utilizing the TTE method, which involved careful selection of appropriate case and comparator regimens. This approach effectively enhanced the ability to draw causal inference, and mitigated the risk of immortal time bias. However, it is important to note that the absence of core elements like randomization limits the study’s ability to fully emulate an ideal clinical trial. To address the unmeasured confounding bias, we performed two sensitivity analyses. Although most of the outcomes agreed with the results of the original analysis, pyrexia, and upper respiratory tract infection showed higher risk in the historical comparator group, which was contrary to the ASPIRE trial (Stewart et al. 2015). We thought the reason for the difference might be attributed to the change in the clinical environments, specifically infection prevention measures had become more intensive in recent times. Overall, these results of our study should be considered as supportive evidence in clinical practice. Secondly, the utilization of claims data from South Korea allowed for generating evidence of carfilzomib safety in an Asian population. Previous RCTs predominantly included individuals of Caucasian ethnicity, making the inclusion of Asian individuals in this study particularly valuable. Thirdly, this study’s exclusion criteria were not overly restrictive, enhancing the generalizability of the findings to a wide range of clinical practice. This flexibility allows for greater relevance and applicability of the results to real-world settings. Finally, this study was strengthened by the inclusion of a large sample size of patients with RRMM by utilizing a nationwide database in Korea. The comprehensive coverage of approximately 97% of Korean population under the National Health Insurance system facilitated the thorough screening of 4,580 RRMM patients and the collection of all relevant medical data. By utilizing this real-world evidence methodology and the Korean insurance claims database, the study provided meaningful insights into the safety profile of carfilzomib in the context of the rare diseases RRMM.

This study also had few limitations that should be acknowledged. Firstly, the precise dosing information of carfilzomib could not be fully captured due to the nature of the claims database. The claims database lacks specific details regarding the exact dosing of injectable drugs like carfilzomib, as they are charged based on bottle units. Therefore, our study could not identify the reduction of carfilzomib dosage for managing intolerance. Additionally, the cycles of treatment regimens could not be identified using the claims database. However, the HR calculations were adjusted for exposure time to account for the lack of cycle information. Secondly, the severity of adverse events was not assessed. Additional information, including symptoms and laboratory test results, is required to determine the grading of adverse events. However, such information is not available in claims databases. Thirdly, our study primarily focused on evaluating the safety outcomes of carfilzomib, utilizing claims data, especially since its clinical efficacy has been demonstrated in the ASPIRE trial (Stewart et al. 2015). Consequently, we did not evaluate the real-world effectiveness of carfilzomib or the risk–benefit ratio for individual patients; instead, our focus was on the drug’s safety profile due to the notable adverse effects reported. As more long-term data become available, analyzing overall survival and quality of life in patients who have adverse reactions will be crucial, providing valuable insights that could enhance clinical decision-making. Fourthly, it was challenging to differentiate whether the observed events after drug administration were exacerbations of the underlying disease or adverse drug events. To mitigate this potential bias, the cohort entry date was set as the first prescription date of either KRd or Rd therapy, and patients with outcome variables before the index date were excluded from the analysis. This approach aimed to minimize the likelihood of attributing pre-existing conditions as adverse drug events. Despite these limitations, our study employed robust methods such as TTE and appropriate exclusion criteria to overcome these challenges and provide valuable insights into the real-world safety profile of carfilzomib.

Conclusions

The results of this study indicate that observational studies utilizing the TTE method can provide valuable estimates of results comparable to RCTs. In patients with RRMM in Korea, the incidence of hematologic adverse events, including anemia, neutropenia, and thrombocytopenia were significantly higher in the KRd group compared to that in the Rd group. These results indicate the importance of monitoring for hematologic adverse events when administering carfilzomib. Furthermore, the study revealed a significantly higher incidence of non-hematologic adverse events such as constipation, cough, hypokalemia, hypertension, and heart failure in the KRd group compared to the Rd group. These safety outcomes were consistent with previous RCTs, suggesting that carfilzomib can be used in Asians populations with appropriate monitoring for tolerability.