Introduction

Four large-scale placebo-controlled outcome trials have studied the effects of sodium–glucose co-transporter-2 (SGLT2) inhibitors in different populations of patients with chronic kidney disease (CKD). Meta-analysis has demonstrated clear net benefits of SGLT2 inhibitors in the CKD populations studied, irrespective of the presence or absence of diabetes. Nevertheless, some at-risk populations remain understudied [1]. Japan has one of the highest prevalences of kidney failure per million population internationally [2], and has historically been under-represented in trials [3,4,5]. The EMPA–KIDNEY trial assessed the effects of empagliflozin 10 mg once daily in patients with CKD, and included the broadest range of patients at risk of progression of the four reported large SGLT2 inhibitor trials in CKD [6]. EMPA–KIDNEY’s streamlined design supported recruitment of 6609 participants from only 8 countries, with 612 participants from 25 clinical sites in Japan. In this report, we aimed to explore post-hoc comparisons of the effects of empagliflozin in participants from Japan vs. non-Japan regions (and include analyses comparing effects by the other regions).

Methods

Details of EMPA–KIDNEY’s rationale [7], design [8], protocol, pre-specified data analysis plan, and main results have been reported previously [6]. The trial design was streamlined: extra work for collaborating doctors and hospitals was kept to a minimum, and only essential information was collected. Participant-reported information recorded by participant interview directly into bespoke computer systems and centrally measured creatinine were the main means of data collection. The trial was conducted at 241 centers. Regulatory authorities and ethics committees for each center approved the trial. Adults with a race-adjusted CKD–EPI [9] estimated glomerular filtration rate (eGFR) of ≥ 20 but < 45 ml/min/1.73m2 (irrespective of level of albuminuria); or an eGFR of ≥ 45 but < 90 ml/min/1.73 m2 with a urinary albumin-to-creatinine ratio (uACR) ≥ 200 mg/g at the screening visit were eligible. They were also required to be prescribed a clinically appropriate dose of single-agent renin–angiotensin system (RAS) inhibitor, where indicated and tolerated. Polycystic kidney disease was the only excluded primary kidney diagnosis. Additional procedures were required in Japan due to regulator requests. These included an extra scheduled visit within the first 4 weeks following randomization and reporting of all participant-reported non-serious adverse events throughout the trial.

The pre-specified primary outcome was time to first occurrence of the composite outcome of kidney disease progression or cardiovascular death. Kidney disease progression included end-stage kidney disease (ESKD), defined as commencing maintenance dialysis or receipt of a kidney transplant; a sustained decline in estimated eGFR to < 10 ml/min/1.73 m2; a sustained decline in eGFR of ≥ 40% from baseline; or death from kidney failure. Central laboratory serum creatinine measurements were used to calculate eGFR, with local laboratory creatinine measurement used when central results were missing. Kidney disease progression was a secondary outcome, and analyses of annual rate of change in eGFR (chronic and total slope) were tertiary outcomes, with exploratory analyses of these outcomes also pre-specified [6]. We emphasize the chronic slope over total slope (to account for the reversible acute dip in eGFR), and relative rather than absolute differences. Relative differences enable a direct test of any differences of the effects of empagliflozin between subgroups which is not usually possible with absolute differences, as the baseline absolute rate of eGFR decline in the placebo group usually varies importantly between subgroups.

Pre-specified subgrouping by region was into four groups: Europe, North America, China and Malaysia combined, and Japan. Results for all regions are provided in the supplement. This report’s focus is on the post-hoc subgroup comparison of Japan vs. the other non-Japan regions combined.

Statistical analyses

All analyses were performed according to the intention-to-treat principle. A pre-specified Cox proportional hazards regression model adjusted for baseline variables specified in the minimization algorithm (age, sex, prior diabetes, eGFR, uACR, and region) was used to estimate the hazard ratios (HRs) and 95% confidence intervals (CI) for empagliflozin vs. placebo for time-to-event analyses (i.e., the primary, kidney disease progression, cardiovascular and safety outcomes) [10]. Effects of empagliflozin on annual rate of change in eGFR were assessed using pre-specified shared parameter models. Tests for heterogeneity of empagliflozin’s observed proportional effects (or differences) in subgroups were performed through the inclusion of relevant interaction terms in models [11]. Effects of empagliflozin on continuous outcomes (e.g., albuminuria) used a pre-specified mixed model repeated measures approach. Full statistical details are provided in the previously published data analysis plan and primary report [6]. In sensitivity analyses, exploratory modelling of the primary outcome accounting for the higher baseline eGFR & albuminuria in participants from Japan were performed by including pre-specified covariates plus treatment interactions with diabetes status, categories of eGFR/uACR and recruitment in Japan. The supplement includes an additional sensitivity analysis which was conducted post-hoc excluding data from two sites (28 randomized participants) in Japan, after regulators became aware of data integrity concerns raised by staff from a Site Management Organization. On-site and central statistical monitoring found no evidence of falsification of trial data in EMPA–KIDNEY, but these sites’ data were removed from the marketing approval application.

Analyses used SAS software, version 9.4 (SAS Institute, Cary NY, USA) and R v3.6.2.

Results

Between May 2019 and April 2021, 6609 participants were randomized. Baseline characteristics by region are provided in Supplementary Table 1. Compared to the 5997 participants from Europe, North America, China and Malaysia combined, the 612 participants from Japan had a similar proportion without prior diabetes (53% vs. 54%) but differed with respect to several other baseline characteristics (Table 1). Participants in Japan were on average slightly older (mean age 65.3 ± 12 vs. 63.7 ± 14 years), less likely to be female (26% vs. 34%), reported less prior cardiovascular disease (15% vs. 28%), had a higher proportion of participants with glomerular disease (32% vs. 25%), a higher mean (SD) eGFR (45.2 ± 18.2 vs. 36.5 ± 13.8 mL/min/1.73m2), and higher median (IQR) uACR (683 [293–1514] vs. 290 [40–1030] mg/g).

Table 1 Baseline characteristics by Japan vs. non-Japan regions

Overall (i.e., all regions combined), study treatment was generally well tolerated. At 12 months, the approximate study midpoint, overall self-reported adherence to study treatment was similar in each of the treatment groups (89.6% empagliflozin vs. 90.3% placebo; reasons for stopping have been reported elsewhere [6].) During 2.0 years of median follow-up overall, allocation to empagliflozin reduced the risk of the primary composite outcome of kidney disease progression or cardiovascular death by 28% (HR = 0.72; 95% CI 0.64–0.82; P < 0.0001). There were broadly similar relative effects in key subgroup analyses by diabetes status (heterogeneity p = 0.06), eGFR (trend p = 0.78), and some evidence to suggest relative benefits may be larger in patients with higher levels of uACR (trend p = 0.02). There were also broadly similar effects across the four pre-specified categories of region which compared participants from Europe, North America, China/Malaysia, and Japan (heterogeneity p = 0.06; Supplementary Fig. 1) [6]. Effects of empagliflozin on annual rate of change in eGFR by pre-specified regions suggested benefits were consistent across regions, with perhaps some evidence of larger relative reductions in chronic slope in North America (Supplementary Fig. 2).

Study treatment was also well tolerated in Japan. At 12 months, self-reported adherence was 91.4% for empagliflozin vs. 95.4% for placebo. Median follow-up was 1.95 years in non-Japan regions vs. 2.19 years in Japan, respectively. Among the participants from non-Japan regions, there were 399 vs. 494 primary outcomes (0.75, 0.66–0.86), whilst in Japan there were 33 vs. 64 (0.49, 0.32–0.75; heterogeneity p = 0.06, Fig. 1). Estimates of relative effect on the primary outcome were also similar in participants in non-Japan vs. Japan regions in sensitivity analyses which took account of differences in baseline diabetes status, eGFR, and albuminuria between participants in non-Japan and Japan regions (sensitivity analysis heterogeneity p = 0.08), and excluding data from the 28 participants at 2 sites excluded from the marketing authorisation application (Supplementary Table 3). Overall, the annual rate of eGFR decline in patients in the placebo group appeared generally constant (overall, and in Japan). In the empagliflozin group, the expected acute decrease in the eGFR was evident on initiation of study treatment (non-Japan −2.1 vs. Japan –2.4 mL/min/1.73 m2; heterogeneity p = 0.50), followed by slowing of the chronic rate of annual eGFR decline. The relative difference in chronic eGFR slope was a −49% (95% CI −58 to −40%) in the non-Japan regions vs. −55% in Japan (95% CI −73 to −37%; heterogeneity p = 0.58, Supplementary Figs. 3 and 4). Averaged over time, the difference in geometric mean uACR was −19% in the empagliflozin group compared to the placebo group (95% CI −23 to −15%), which included a −18% (95% CI −23 to −14%) difference in the non-Japan regions and a −26% (95% CI −36 to −14%) difference in Japan; heterogeneity p = 0.24).

Fig. 1
figure 1

Effect of empagliflozin on primary composite outcome of progression of kidney disease or death from cardiovascular causes, by Japan vs. non-Japan regions (post-hoc exploratory analyses). Footnote: Hazard ratios and confidence intervals were estimated from Cox models, with pre-specified adjustment for age, sex, history of diabetes, estimated glomerular filtration rate (eGFR), urinary albumin-to-creatinine ratio (uACR), and region. The heterogeneity P value (Phet) is calculated from a relevant interaction term in the pre-specified Cox model. Results were robust in a sensitivity analysis following inclusion of pre-specified covariates plus treatment interactions with diabetes status, categories of eGFR/uACR & recruitment in Japan (exploratory model Phet = 0.08). These non-significant heterogeneity p values suggest the best estimate of the effect of empagliflozin on the primary outcome in the two subgroups is the overall result (i.e., there is no statistical evidence for a difference between the subgroup-specific HRs for Japan vs. non-Japan regions)

The effects of empagliflozin on pre-specified secondary outcomes were broadly similar when participants were compared across the different regions (Supplementary Table 2) including in analyses comparing non-Japan vs. Japan regions (Table 2). There was a relatively low absolute risk of serious safety outcomes among the participants recruited in Japan compared to non-Japan regions, with a safety profile consistent with the overall results of the trial (Table 2).

Table 2 Effect of empagliflozin on primary, secondary, and safety outcomes by Japan vs. non-Japan regions (post-hoc exploratory analyses)

Discussion

In the EMPA–KIDNEY population of 6609 patients with a wide range of eGFR, levels of albuminuria, and causes of CKD, empagliflozin led to a 28% lower risk of progression of kidney disease or cardiovascular death compared with placebo, and was well-tolerated. A particular feature of the trial was the recruitment of 612 participants from Japan, a country under-represented in trials and where lifetime risk of progression to kidney failure is high compared to many other high-income countries. Between-region comparisons have more limited power, but such subgroups analyses suggested relative benefits were broadly similar in patients from Europe, North America, China/Malaysia, and Japan. Exploratory analyses comparing the participants from Japan with those from other regions were consistent with this finding: there was no strong statistical evidence of differential effects between Japan and non-Japan regions before or after adjustments which accounted for higher baseline eGFR and albuminuria in Japan. These analyses should not be interpreted as suggesting greater reduction in risk of the primary outcome in Japan however, statistical power is limited by event numbers and, based on statistical tests of heterogeneity, we conclude that the best estimate of effect on the primary outcome in participants in Japan is the overall estimate of a 28% relative risk reduction.

Previous sub-analyses of DAPA–CKD [4] and CREDENCE trial data [5] included 244 and 110 participants from Japan, respectively, necessitating participants from Japan to be grouped with participants from other local countries. Such sub-analyses found that the relative effects of SGLT2 inhibitors on key outcomes were broadly similar, irrespective of the different types of participant represented across the different regions [4, 5]. Our report’s findings are consistent with data from these two other CKD trials. Effects on kidney outcomes from all three trials have been combined in a collaborative meta-analysis [1]. We conclude that the overall relative risk reductions from that meta-analysis are currently the most reliable estimates of effects of SGLT2 inhibition in patients with CKD, irrespective of region [1].

These analyses need to be considered in the context of their post-hoc/exploratory nature. The key limitations of EMPA–KIDNEY were the lower-than-expected number of cardiovascular outcomes and its relatively short follow-up. This highlights the need for future nephrology trial designs to aim for larger samples sizes. Globally, complex research governance and burdensome regulation has caused increased organizational and operational complexity, contributing to an unsustainable increase in the cost of trials. Such burdens may contribute to a reluctance of clinician investigators or patients to participate in trials and limit the ability to recruit at scale [13]. EMPA–KIDNEY’s streamlined design aimed to help reverse these trends. We used a risk-based approach focusing on critical-to-quality data. Future streamlined trial designs incorporating linkage to the Japan-Chronic Kidney Disease-Database could also facilitate longer-term low-cost kidney outcome follow-up [12].

In summary, it is reasonable to conclude that EMPA–KIDNEY’s main conclusions can be extrapolated to patients with CKD in Japan and non-Japan regions, with demonstrable net absolute benefits.

Disclosure

The analyses were performed and original full database developed and held by the Nuffield Department of Population Health at the University of Oxford and validated by the funder. The sponsor provided a grant to the University of Oxford and have members on the Steering Committee who are responsible for reviewing all trial publications. WGH accepts full responsibility for the content of the paper and the decision to submit. CTSU has a staff policy of not accepting honoraria or other payments from the pharmaceutical industry, except for the reimbursement of costs to participate in scientific meetings (www.ctsu.ox.ac.uk). EMPA–KIDNEY was mainly supported by a grant to the University of Oxford from Boehringer Ingelheim, which included funding from Eli Lilly. CTSU (authors WGH, KJM, RL, RH) received core funding to the Medical Research Council Population Health Research Unit at the University of Oxford, which is part of the Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), from the UK Medical Research Council (grant numbers MC_UU_00017/3 and MC_UU_00017/4). CTSU is also supported by Health Data Research UK and the National Institute for Health and Care Research Oxford Biomedical Research Centre. WGH was supported by a Medical Research Council Kidney Research UK Professor David Kerr Clinician Scientist Award (MR/R007764/1). MN reports research grants from Boehringer Ingelheim, Kyowa-Kirin, Daiichi-Sankyo, Astellas, Tanabe-Mitsubishi, JT, Chga, Torii, Takeda, Bayer; consulting fees from Kyowa-Kirin, Astellas, Daiichi-Sankyo, Tanabe-Mitsubishi, JT, Boehringer Ingelheim; and honoraria from Kyowa-Kirin, Astellas, Astra Zeneca, GASK, Daiichi-Sankyo, Tanabe-Mitsubishi, Chugai, Boehringer Ingelheim. SG reports consulting fees for ANTOS, Jansen Pharma, MSD and Amgen, and fees for membership of the EMPACT–MI Steering Committee from Boehringer Ingelheim. SM reports grants from Nippon Boehringer Ingelheim Co., Ltd. NK has received consulting or speaker honoraria from AstraZeneca, Kyowa Kirin, Novo Nordisk, Novartis, Astellas, Otsuka, and Novo Nordisk. JW has received speaker honoraria from Astra Zeneca, Bayer, Boehringer Ingelheim, Daiichi Sankyo, Kyowa Kirin, Novo Nordisk, and Mitsubishi Tanabe, and receives grant support from Bayer, Chugai, Kyowa Kirin, Otsuka, Shionogi, Sumitomo, and Mitsubishi Tanabe-Pharma. HW has received honoraria for lectures from Bayer Pharma Japan, Teijin Pharma Ltd, MSD, Sanofi-Aventis K.K., Novo Nordisk, Nippon Boehringer Ingelheim, Eli Lilly, Sumitomo Pharma, Mitsubishi Tanabe Pharma, Daiichi Sankyo Company, Ltd, Abbott, Kowa Co., Ltd, Taisho Pharmaceutical, and research activities for Teijin Pharma, Abbott Japan, Nippon Boehringer Ingelheim, Sumitomo Pharma, Mitsubishi Tanabe Pharma, Lifescan Japan, Sanwa Kagaku, Taisho Pharmaceutical, Takeda Pharmaceuticals, Soiken Inc. KU has received grants from Eli Lilly, Daiichi-Sankyo, Kyowa-Kirin and Abbott Japan, grants and honoraria for lectures from Novo Nordisk, Nippon Boehringer Ingelheim, MSD, Mitsubishi Tanabe Pharma, Sumitomo Pharma, Ono and Sanofi, and honoraria for lectures from Astellas and AstraZeneca. EN has received honoraria from Novo Nordisk, Sanfi K.K, Sumitomo Pharma, Abbott Japan, and Mitsubishi Tanabe Pharma. DM is a statistical consultant for Elderbrook Solutions on behalf of Boehringer Ingelheim Boehringer Ingelheim. SH is an employee of Boehringer Ingelheim. RH reports grants or contracts from Roche, GSK/Vir and Combiphar. TK reports research grants from Nippon Boehringer Ingelheim Co., Ltd, Eli Lilly Japan K.K., Kyowa Kirin Co., Ltd, MSD Corporation, Daiichi Sankyo Co., Ltd, Novo Nordisk Pharma Ltd, Sanofi K.K., Takeda Pharmaceutical Co., Ltd, Astellas Pharma Inc, Ono Pharmaceutical Co., Ltd, Mitsubishi Tanabe Pharma Corporation, Taisho Pharmaceutical Co., Ltd; and honoraria from MSD Corporation, Takeda Pharmaceutical Co., Ltd, Mitsubishi Tanabe Pharma Corporation, Astellas Pharma Inc., Teijin Pharma Ltd, Ono Pharmaceutical Co., Ltd, Astra Zeneca K.K., Sumitomo Pharma Co., Ltd, Sanofi K.K., Eli Lilly Japan K.K., Nippon Boehringer Ingelheim Co., Ltd, Novo Nordisk Pharma Ltd, Daiichi Sankyo Co., Ltd, FUJIFILM Toyama Chemical Co., Ltd, Kowa Co., Ltd, Taisho Pharmaceutical Co., Ltd.

Rights retention statement

For the purpose of open access, the author(s) has applied a Creative Commons Attribution (CC BY) license to any Author Accepted Manuscript version arising.