FormalPara Key Summary Points

Why carry out this study?

No specific treatment is recommended, and various antidiabetic agents are used as first-line treatments for patients with type 2 diabetes (T2D) in Japan.

Metformin has long been used at low doses for patients with T2D in Japan, and information regarding its effects at the maximum maintenance dose is lacking.

We conducted a systematic review and network meta-analysis (NMA) to compare the efficacy and safety of metformin at 1500 mg/day with those of the other oral anti-diabetes drugs (OADs) approved in Japan.

What was learned from the study?

Forty-six randomized controlled trials were identified in the systematic review, and 37 studies comprising 38 different types of treatments including placebos were selected for NMA of the reduction in hemoglobin A1c (HbA1c) from baseline.

Compared with metformin 1500 mg/day, 20 treatments showed weak effects in reducing HbA1c from baseline levels.

Metformin 1500 mg/day showed preferable efficacy and safety profiles compared with the other OADs approved in Japan.

Introduction

Currently, metformin is recommended as a first-line glucose-lowering agent for patients with type 2 diabetes (T2D) in Western countries [1]. However, in Japan, no specific treatment is recommended, and the first-line treatment is recommended to be chosen after considering the pharmacological and adverse effect profiles to assess the disease condition for each patient [2]. Consequently, various agents including biguanide, thiazolidinediones, sulfonylurea, glinide, dipeptidyl peptidase-4 inhibitor (DPP-4i), α-glucosidase inhibitor, and selective sodium-glucose transporter-2 inhibitor (SGLT2i) could be used as first-line treatments. Although the availability of these options may make their use preferable from the perspective of patient-centered care, physicians, particularly general practitioners, have difficulty selecting a treatment in real-world clinical settings. Therefore, information on the comparative efficacy of glucose-lowering agents is needed to select first-line treatment for Japanese patients with T2D.

Information regarding the effect of metformin at dosage levels similar to those used in Western countries compared with other treatments is lacking in Japan. Metformin has long been used at lower doses in Japan than in Western countries because of concerns related to the incidence of lactic acidosis caused by phenformin, an agent in the same biguanide class, in the 1970s [3]. Since 2010, the recommended dosage of metformin has been increased to 750–1500 mg/day as the normal maintenance dose and 2250 mg/day as the maximum prescription dose but the mean prescription dose typically remains below 1000 mg/day in clinical settings in Japan [4, 5]. Our internal survey on Japanese physicians showed that some Japanese physicians considered that the glycemic-lowering effect of metformin is lower than that of other treatments. A dose-dependent hypoglycemic effect of metformin compared with a placebo has been reported by clinical trials in Japan [3, 6]. However, the efficacy of metformin at doses similar to those used in Western countries in improving the hemoglobin A1c (HbA1c) level compared with other treatments approved in Japan has not been investigated in interventional or observational studies in Japanese patients with T2D. Such information could guide physicians in choosing first-line treatments.

This study was conducted to compare the efficacy and safety of metformin at the maximum maintenance dose, 1500 mg/day, with those of the other glucose-lowering agents in Japanese patients with T2D through a systematic review and network meta-analysis (NMA) of reported randomized controlled trials (RCTs). Considering that numerous first-line treatments are used in the Japanese clinical setting, NMA is a useful method for comparing treatments. We selected RCTs reporting results on the primary outcome of this study, that is, a change (Δ) in HbA1c from baseline at 12 weeks or later after the initiation of intervention. After summarizing the information of other outcomes reported in the included RCTs, we included the reduction rate of elevated HbA1c and incidence rate of hypoglycemia as additional outcomes because of publicly available data for treatment with metformin at 1500 mg/day.

Methods

Study Design

A systematic review and an NMA were conducted to compare the efficacy and safety of metformin with those of the other oral anti-diabetes drugs (OADs) as first-line treatment in patients with T2D.

The study protocol was registered in PROSPERO (CRD42020160779). We followed the Cochrane Handbook for Systematic Reviews of Interventions [7] for the systematic review and the Preferred Reporting Items for Systematic Reviews and Meta-Analyses for Network Meta-Analysis (PRISMA-NMA) statement [8] to report the results. This study was based on previously conducted studies and does not involve any new studies with human participants or animals performed by any of the authors. Therefore, ethical approval and informed consent were not required.

Eligibility Criteria

The criteria for study inclusion in the systematic review and meta-analysis are summarized in Table 1. Eligible studies were RCTs conducted in Japan that involved Japanese adult (≥ 20 years) patients with T2D and those either with no history of treatment with glucose-lowering agents or with a wash-out period before the RCT. Interventions in the RCTs were treatment for 12 weeks or longer with any OAD (from any class of biguanide, thiazolidinediones, sulfonylurea, glinide, DPP-4i, α-glucosidase inhibitor, or SGLT2i) approved in Japan at the approved dosages. Comparators were another type of the abovementioned interventions (treatments with another type of OAD or a different dose of the same type of OAD) or a placebo for 12 weeks or longer. The primary outcome was the Δ in the HbA1c level from baseline at 12 weeks or later (ΔHbA1c). The target publication period was the entire searchable period in each database described below.

Table 1 Criteria of study selection

Search Strategy and Literature Review

We comprehensively searched four databases: CENTRAL (Cochrane Library) on July 23, 2019, MEDLINE (via PubMed) on August 4, 2019, Ichushi Web (Japanese Medical Abstracts Society) on August 8, 2019, and EMBASE on November 29, 2019. The search terms included the disease name (T2D), names (generic names or compounds) or classes of OADs, study types, and languages. The search strategy for the CENTRAL database is shown in Supplementary Material Table S1. In addition, we searched three clinical trial registries: ClinicalTrials.gov, JAPIC Clinical Trials Information (JAPIC), and UMIN Clinical Trial Registry (UMIN) on May 13, 2020, to identify unpublished RCTs. The search terms applied to the manual search are shown in Supplementary Material Table S2.

We screened the identified RCTs to select eligible studies using the inclusion and exclusion criteria (Table 1). The inclusion criteria were (1) RCTs conducted in Japan, (2) patients and interventions/comparators met the eligibility criteria, (3) data of the primary outcome (ΔHbA1c) were available, and (4) publication in English or Japanese. The exclusion criteria were (1) no information necessary for review (e.g., an abstract) available, (2) patients overlapped with those in other studies, (3) no data on the primary outcome, and (4) RCTs targeting patients with specified comorbidities. We included all publication types, including articles in peer-reviewed journals, conference abstracts, and data recorded in a clinical trials registry, if the RCTs were included in the target databases. If the measurement of ΔHbA1c at 12 weeks or later was described in the report but numerical data were not available, we acquired the study data from other data sources (e.g., clinical trial registry) or requested them from the corresponding authors. Among the studies in which patients overlapped, the study with data at or the closest to 12 weeks if studies included the same patient group or the study with the largest sample size if the studies included patient groups of different sizes was included. Patient overlap was identified based on the trial identification numbers or the following information in the reports: authors, location and settings, type of interventions, numbers and background information of the patients, and date and duration of RCTs. We included RCTs in any of the population types: intent-to-treat (ITT), full analysis set (FAS), or per-protocol set (PPS), and prioritized data from ITT or FAS population if data in multiple population types were provided. RCTs based on PPS population were excluded from the sensitivity analysis. Two reviewers separately selected studies using the following procedure: first, potential studies were extracted based on the abstract, and then the extracted studies were assessed based on the full text. Disagreements regarding study selection between the reviewers were resolved via a discussion.

Data Extraction and Quality Assessment

For data extraction, two reviewers independently screened the included RCTs. Each reviewer extracted the following data from the selected RCTs: basic information in the literature (authors, title, journal information [name of the journal, volume, issue, year, and pages]) or clinical trial (principal investigator, affiliation, and study year); study design; patient numbers (at baseline and evaluation); patient inclusion/exclusion criteria; intervention (drug type and dose); outcome data; length of the evaluation period; population type; and additional information if necessary.

An intervention was defined as a certain combination of drug type and dosage, and RCTs in which the intervention groups comprised patients with different final dosages were excluded. As additional outcomes, data on Δ high-density lipoprotein cholesterol, Δ low-density lipoprotein cholesterol, Δ total cholesterol, Δ non-high-density lipoprotein cholesterol, Δ triglyceride, Δ fasting plasma glucose, Δ glycoalbumin, Δ fasting serum insulin, and the incidence rate of hypoglycemia were extracted if the data were available.

As outcome data for the change at evaluation from baseline, information on the distribution of the mean value was extracted. If outcomes were evaluated at several time points at 12 weeks or later from the commencement of intervention, data at or the closest to 12 weeks were included. It is because 12 weeks from the commencement of intervention is considered to be the appropriate time when the change in HbA1c level stabilizes after the initiation of treatment and is the required time to evaluate treatment effects. We also considered that it is preferable to evaluate outcomes at the same period among treatments as much as possible, and 12 weeks was the most popular evaluation point among RCTs. If outcome data at 12 weeks were only graphically reported, without numerical values or distributions, we requested the data from the corresponding authors. If we did not obtain the numerical data at 12 weeks from the authors, we used the values at time points other than 12 weeks for which numerical values were reported. The incidence rate of hypoglycemia, reported as an independent outcome or adverse event, regardless of severity, was included. The data were recorded in an Excel spread sheet. When essential information could not be obtained, the RCTs were excluded. Disagreements on the extracted data between the reviewers were resolved via a discussion.

The risk of bias of the selected RCTs was assessed by two independent reviewers using the Revised Cochrane risk-of-bias tool for randomized trials [9]. Risk was evaluated for the primary outcome in each RCT. The risk for five domains was assessed: (1) bias arising from the randomization process, (2) bias due to deviations from the intended interventions, (3) bias due to missing outcome data, (4) bias in the measurement of the outcome, and (5) bias in the selection of the reported result. The overall risk of bias based on the risk of the domains was considered as a high risk, some concern, or low risk. The assessment results were subjected to sensitivity analysis to exclude RCTs with a high risk of bias. Disagreements between individual judgments were resolved via a discussion.

Data Synthesis and Analysis

Since April 1, 2012, the HbA1c level has been expressed as National Glycohemoglobin Standardization Program (NGSP) values rather than Japan Diabetes Society (JDS) values in Japan [10]. Therefore, the HbA1c levels reported as JDS values or likely reported as JDS values based on basic study information, such as study time and country, were converted to NGSP values by adding 0.4 to the JDS values [11]. We used the values of Δ in the mean HbA1c level as reported; if the values were not indicated in the studies, they were calculated from the mean HbA1c level at baseline and after intervention. If standard deviations (SDs) were not reported, they were calculated from the data distribution information, such as the 95% confidence interval and standard error (SE), if available.

As an additional outcome, we compared the HbA1c reduction rate using the cutoff value of 6.5% [2, 12] for various OADs to reduce the effects of HbA1c differences at baseline, as the magnitude of HbA1c changes is reportedly associated with its baseline level [13]. That is, a higher baseline HbA1c level is associated with greater reductions. Assuming a model in which an OAD can reduce HbA1c level by a certain percentage, the reduction rate was calculated as \(1{ }{-}{ }\frac{{{\text{HbA}}1{\text{c at evaluation}}}}{{{\text{HbA}}1{\text{c at baseline}}}}\). We considered the SD of the reduction rate of elevated HbA1c to be constant among interventions and calculated this value from the claims data provided by Medical Data Vision Co., Ltd. Among the additional outcomes extracted, those reported as results from treatment with 1500 mg/day metformin were selected for NMA.

We prepared a network graph for each outcome for the main analysis or sensitivity analyses by the type of treatments consisting of drug types and doses. Treatments without a connection with any treatments were excluded from the NMA for each outcome or analysis.

The NMA was conducted using a Bayesian model with the Markov Chain Monte Carlo method. Two linear models were used to examine the effect of each treatment: a fixed-effect model written as a linear combination of the RCT and treatment effects for main analysis and a random-effect model expressing a linear combination of RCT, treatment, and random effects for sensitivity analysis. Prior probability distributions were assumed as flat likelihood functions using general(0) in the SAS MCMC procedure. For each outcome, the mean difference between treatment with metformin 1500 mg/day and each of the other treatments was calculated, which were then assessed based on the difference in the posterior probability distributions with a two-tailed probability of 0.05 (probability of < 0.025 or > 0.975) considered as significant. We also calculated the mean difference between treatment with metformin 750 mg/day and each of the other treatments to assess the difference in the effect depending on the dosage of metformin. As a Bayesian model was used, the consistency of the networks was reflected in the probabilistic distribution function of each variable. Subgroup analyses were not performed in this study.

We used SAS version 9.4 (SAS Institute, Cary, NC, USA) and Microsoft Excel 2016 (Microsoft Corporation, Redmond, WA, USA) for the analyses.

Sensitivity Analysis

We conducted three sensitivity analyses for ΔHbA1c. First, we applied a random-effects model to assess the model risk. Second, we excluded the RCTs considered as high risk based on the results of the risk of bias assessment. Finally, we excluded RCTs in which the outcomes were evaluated in a PPS population.

Results

Study Selection and Characteristics

A literature search identified 2375 studies from four databases, including 1556 unique studies, which were assessed for relevance (Fig. 1). Forty-five RCTs, including one RCT from a clinical trials registry database included in CENTRAL [14] and the others from articles in peer-reviewed journals, were selected (Fig. 1). The number of RCTs by reasons for exclusion is shown in Supplementary Material Table S3. Manual searching retrieved 98, 268, and 152 RCTs from ClinicalTrials.gov, JAPIC, and UMIN, respectively, using the search terms described in Supplementary Material Table S2, of which 9, 14, and 4 RCTs met or were likely to meet the eligibility criteria. After assessing the information in detail, all RCTs from ClinicalTrials.gov were found to be duplicates of RCTs already identified in the previous literature search; from JAPIC, three RCTs did not meet the criteria, ten RCTs were already included, and one RCT [15] was selected as eligible. Among those from UMIN, one RCT was already included and three did not show the results (Fig. 1).

Fig. 1
figure 1

Flow diagram of study selection. *Note: number of randomized controlled trials by reasons for exclusion is shown in Supplementary Material Table S3. NMA: network meta-analysis.

In the 46 RCTs, fasting plasma glucose was the most frequently reported additional outcome (in 40 RCTs), followed by the hypoglycemia incidence rate (36 RCTs). The number of RCTs reporting each additional outcome is shown in Supplementary Material Table S4 with detailed literature information for each RCT. In one RCT, treatment with metformin at 1500 mg/day was included as an intervention, and the outcomes reported were ΔHbA1c with baseline HbA1c and the incidence rate of hypoglycemia [15]. Therefore, we analyzed the reduction rate of elevated HbA1c and incidence rate of hypoglycemia as additional outcomes. Outcome data of each study are summarized in Supplementary Material Table S5.

Basic information on the RCTs obtained through data extraction is shown in Table 2. The treatment duration until ΔHbA1c evaluation extracted for the NMA was 12 weeks for 32 RCTs, 14 weeks for 1 RCT, 16 weeks for 1 RCT, 24 weeks for 11 RCTs, and 28 weeks for 1 RCT (Table 2). Among 14 RCTs without data at 12 weeks, we attempted to contact the corresponding authors or co-authors of 8 RCTs. However, we could not obtain the data because the authors of three RCTs responded that they do not have the data, and the others did not respond. We did not contact authors of six RCTs as the contact information was not available (for three RCTs) or we thought that they likely did not record the data at 12 weeks based on the protocol or the methods in the manuscript (for three RCTs). We excluded RCTs targeting patients with comorbidities (e.g., chronic kidney disease, chronic heart failure, coronary artery disease, and non-alcoholic fatty liver disease), whereas those of patients with metabolic syndrome [16, 17] or overweight [18] were included. Eight RCTs [19,20,21,22,23,24,25,26] evaluated the primary outcome in a PPS or a similar-to-PPS.

Table 2 Basic characteristics of the identified RCTs

Through a risk of bias assessment to determine the overall risk of bias, 10 RCTs [18,19,20,21, 23, 24, 26, 34, 48, 50] were considered high risk and 4 and 32 RCTs were considered low risk and some concern, respectively (Supplementary Material Fig. S1). The high-risk RCTs included six RCTs in PPS or similar-to-PPS [19,20,21, 23, 24, 26], two RCTs published in the 1990s [21, 26], one RCT with a cross-over design [50], two RCTs with an open-label design [18, 19], and one RCT in patients who were overweight [18].

Among the selected RCTs, those without SD data for the mean ΔHbA1c or other information to calculate the SD, including the SE or 95% confidence interval, were excluded from the ΔHbA1c analysis. In the analysis of the reduction rate of elevated HbA1c, RCTs without baseline HbA1c level and those with baseline HbA1c at < 6.5% were excluded. As a result, one RCT [37] was not included in the analysis of any outcome. The number of RCTs finally included in the analysis of each outcome, that is, ΔHbA1c, reduction rate of elevated HbA1c, and incidence rate of hypoglycemia, was 37, 42, and 36, respectively, which included 96, 106, and 92 treatment arms, respectively (Table 2). Among the RCTs included for each outcome, 38, 38, and 33 different types of treatments (including placebo), respectively, were included.

Treatments assessed for the primary outcome were one of the two biguanide drugs approved in Japan (metformin 750 or 1500 mg/day), one thiazolidinedione drug (pioglitazone 15, 30, or 45 mg/day), one of the six sulfonylurea drugs (glimepiride 2 mg/day), all three glinide drugs (nateglinide 270 mg/day, mitiglinide 30 or 60 mg/day, and repaglinide 0.75, 1.5, or 3 mg/day), all nine DPP-4i drugs (sitagliptin 100 or 50 mg/day, vildagliptin 100 mg/day, alogliptin 25 mg/day, linagliptin 5 mg/day, teneligliptin 20 and 40 mg/day, anagliptin 200 or 400 mg/day, saxagliptin 5 mg/day, trelagliptin 100 mg/week, and omarigliptin 25 mg/week), two of the three α-glucosidase inhibitor drugs (voglibose 0.6 and 0.9 mg/day, and miglitol 150 mg/day), and all six SGLT2i drugs (ipragliflozin 50 or 100 mg/day, dapagliflozin 5 or 10 mg/day, luseogliflozin 2.5 or 5 mg/day, tofogliflozin 20 mg/day, canagliflozin 100 mg/day, and empagliflozin 10 mg/day or 25 mg/day) (Fig. 2A). A network graph including the selected studies and treatments for each outcome is shown in Fig. 2. For the reduction rate of elevated HbA1c and the incidence rate of hypoglycemia, an RCT comparing glimepride 2 mg/day and voglibose 0.9 mg/day [55] was excluded because these treatments were not included in the network (Fig. 2). Network graphs for the sensitivity analyses of 31 studies after excluding those with a high risk of bias and 32 studies after excluding those in PPS or similar-to-PPS are shown in Supplementary Material Fig. S2.

Fig. 2
figure 2figure 2

Network graph of the treatments in the included randomized controlled trials for mean (Δ) hemoglobin A1c (a), reduction rate of elevated hemoglobin A1c (b), and incidence rate of hypoglycemia (c). The labels represent the study ID including each treatment arm

Change in the Hemoglobin A1c Level

The mean difference in ΔHbA1c in response to metformin 750 or 1500 mg/day (reference) for each treatment is shown as (mean ΔHbA1c (%) for each intervention) – (mean ΔHbA1c (%) for reference) in Fig. 3. When comparing metformin 1500 mg/day with the 36 other treatments (except placebo) in the RCTs included in the meta-analysis, for all treatments except glimepiride 2 mg/day and pioglitazone 45 mg/day, the difference was a positive value, demonstrating that the reduction in HbA1c from baseline was smaller in patients treated with the treatment than in those treated with metformin 1500 mg/day. The probability of observing inferior results compared with metformin 1500 mg/day was > 0.975 for 20 treatments, including 10 of 12 DPP-4i treatments selected for analysis. The difference from metformin 1500 mg/day ranged from 0.40% for metformin 750 mg/day to 0.96% for dapagliflozin 5 mg/day among the 20 treatments and from 0.43% for sitagliptin 100 mg/day to 0.60% for linagliptin 5 mg/day among the 10 DPP-4i treatments. For treatments with a negative value, the probability of showing superior effect compared with metformin 1500 mg/day was 0.911 (the difference from metformin 1500 mg/day: − 0.38%) for glimepiride 2 mg/day and 0.540 (− 0.03%) for pioglitazone 45 mg/day. Compared to metformin 750 mg/day, four treatments showed superior outcomes with a probability of > 0.975, and three treatments were inferior, showing a probability of > 0.975.

Fig. 3
figure 3

Mean outcome difference of each treatment to metformin 1500 or 750 mg/day (reference). Δ represents (mean ΔHbA1c (%) for each intervention) – (mean ΔHbA1c (%) for reference) in (1), (reduction rate of elevated HbA1c (%) for each intervention) – (reduction rate of elevated HbA1c (%) for reference) in (2), and (relative risk of hypoglycemia to placebo for each intervention) – (relative risk of hypoglycemia to placebo for reference) in (3). P represents the probability (0 to 1) that each intervention is superior to reference. The value of P was rounded off to the third decimal place. Green and red colors indicate superiority and inferiority of the intervention (p > 0.975 and p < 0.025) to the reference, respectively. α-GI α-glucosidase inhibitor, BG biguanide, DPP-4i dipeptidyl peptidase-4 inhibitor, HbA1c hemoglobin A1c, MET metformin, SGLT-2i selective sodium-glucose transporter-2 inhibitor, SU sulfonylurea, TZD thiazolidinedione

In the analysis of the reduction rate of elevated HbA1c using claims data, the SD was calculated to be 1.21; this value was applied in further analysis. All 36 treatments showed a lower reduction rate than metformin 1500 mg/day, and the probability of being inferior was > 0.975 for 11 treatments (Fig. 3). Compared to metformin 750 mg/day, only metformin 1500 mg/day showed a higher reduction rate and no treatment showed a lower reduction rate, with a probability of > 0.975.

Incidence of Hypoglycemia

The incidence rate of hypoglycemia was evaluated as a relative risk (RR) relative to a placebo; the incidence rate for each treatment was divided by the incidence rate for the placebo. The difference in RR compared with metformin 750 or 1500 mg/day is shown in Fig. 3. Of the 31 treatments, only pioglitazone 30 mg/day showed a lower RR (< − 0.01% as difference from metformin 1500 mg/day), with a probability of > 0.975, whereas most of other treatments showed a higher RR, including 23 and 21 treatments with a probability of > 0.975, than metformin 1500 and 750 mg/day, respectively. Glinide-based treatments, particularly, repaglinide, showed a higher difference in RR with increasing dosage. The difference in RR from metformin 1500 mg/day was 49.39, 50.71, and 66.71 at 0.75, 1.5, and 3 mg/day, respectively.

Supplementary Material Figure S3 shows the results of the ΔHbA1c sensitivity analyses. No significant differences were observed in the random-effect model analysis, analysis excluding high-bias-risk RCTs, and analysis excluding RCTs in PPS.

Discussion

We performed a systematic review and an NMA to compare the efficacy of glycemic control and safety against the incidence of hypoglycemia of metformin at the maximum maintenance dose (1500 mg/day) with those of other OADs approved in Japan in Japanese patients with T2D. In total, 46 RCTs were selected in the systematic review, and 37 studies, comprising 38 different types of treatments, including placebos, were selected for the NMA of ΔHbA1c. Compared with metformin 1500 mg/day, 20 treatments including metformin 750 mg/day were significantly less effective in reducing the HbA1c level from baseline levels during a period of ≥ 12 weeks. Two treatments (glimepiride 2 mg/day and pioglitazone 45 mg/day) showed greater mean reductions in HbA1c from baseline than metformin 1500 mg/day, although these differences were not significant. In addition, we evaluated the glycemic control effect, as determined by the reduction rate of elevated HbA1c, to prevent a bias induced by the differences in baseline HbA1c level among studies. No treatment caused a higher mean reduction rate than metformin 1500 mg/day, with a probability of > 0.975 for 11 of the 36 treatments evaluated.

The incidence rate of hypoglycemia was reported by 36 studies. Particularly, a higher RR of hypoglycemia was observed with three repaglinide treatments, which were reported in one RCT [59]. The treatment duration in this RCT, which may be associated with the incidence of hypoglycemia, was 12 weeks, which was not longer than the other RCTs using different OADs. A possible reason for the higher hypoglycemia risk may be related to the structure of repaglinide, which is similar to that of sulfonylurea [60], which is known to induce hypoglycemia [61,62,63]. Hypoglycemia caused by sulfonylurea was not evaluated because of the lack of studies on this treatment. Notably, the RCT involving these three treatments [59] reported a relatively higher hypoglycemia incidence, even in the placebo group (5.6%), than the other RCTs.

To assess the robustness of the main analysis, we conducted three sensitivity analyses on ΔHbA1c, applying a different model (random-effect model) and excluding studies with a high risk of bias and those conducted in PPS. The superiority/inferiority of the treatments compared to metformin 1500 or 750 mg/day did not change, supporting the robustness of the main findings.

Regarding the dosages of metformin, although the 1500 mg/day prescription showed significantly greater efficacy than many treatments, the efficacy of 750 mg/day was not significantly higher than that of most of the other treatments. The risk of hypoglycemia did not widely differ between the dosages evaluated. Considering the difference in efficacy between these dosages and the current Japanese clinical situation in which metformin has been used at lower levels (< 1000 mg/day on average), the use of metformin at 1500 mg/day may be beneficial for patients with insufficient glycemic control.

There are only a few studies comparing the efficacy and safety between metformin and other OADs as a first-line treatment for patients with T2D using NMA, while many studies have compared the efficacy and safety among OADs as an add-on therapy to metformin, or for patients inadequately controlled with metformin. This is probably because metformin monotherapy is recommended as an initial treatment for patients in Western countries [1]. A 2016 NMA reported that, compared with metformin, sulfonylurea, thiazolidinediones, DPP-4i, and α-glucosidase inhibitor were associated with a higher HbA1c level, and SGLT2i and meglitinides showed no significant difference in the HbA1c level at the end of treatment as a first-line monotherapy for ≥ 24 weeks [64]. This study also reported that sulfonylurea is associated with a higher risk of hypoglycemia, and thiazolidinediones and DPP-4i were associated with a lower risk of hypoglycemia than metformin. This study compared treatments as drug class without being divided by drug type and dosage, and metformin dosage varied among RCTs, including those with higher dosages, such as 2500 or 3000 mg/day as the maximum dose, compared with the approved dosage in Japan. Although we cannot compare the results with our results directly, a higher or similar effect of metformin in reducing HbA1c than that of other OADs is comparable between studies. A 2019 NMA compared 11 OADs by drug type (dapagliflozin, gliclazide, glimepiride, glipizide, glyburide, metformin, nateglinide, repaglinide, saxagliptin, sitagliptin, and vildagliptin), as a first-line monotherapy for patients with T2D, although various dosages were included for each treatment [65]. Repaglinide showed the highest efficacy in HbA1c reduction followed by gliclazide and metformin among the 11 OADs, and no significant difference was observed between repaglinide or gliclazide and metformin. The efficacy was significantly higher for metformin than for dapagliflozin, nateglinide, saxagliptin, sitagliptin, and vildagliptin. Although dosages in each drug were different, the results are consistent with our results for most of the drugs. Another NMA, comparing HbA1c reduction efficacy between metformin and sulfonylurea, SGLT2i, DPP-4i, or pioglitazone as a monotherapy reported that pioglitazone showed significantly lower efficacy after 12 weeks of treatment, whereas it showed higher efficacy after 104 weeks [66]. Our study did not show a significant difference in efficacy between pioglitazone with three different dosages and metformin; however, lower efficacy was suggested for pioglitazone as reduction rate of elevated HbA1c. Thus, pioglitazone might be associated with lower HbA1c-reducing effect at 12 weeks if baseline HbA1c level is adjusted. This study also reported that sulfonylurea showed a tendency of greater effect than metformin, although the difference was not significant, after 12 weeks of treatment. In our study, glimepiride 2 mg/day, a sulfonylurea, showed a similar tendency to that in this previous study. Taken together, although there are several differences in the included RCTs, including types and dosages of drugs, treatment periods, and countries, as well as analysis methods, our results are in line with those of the previous NMA studies.

There were some limitations to this study. We targeted all OADs using approved dosages in Japan. However, not all available drugs were included, and the dosage was not the highest for some drugs because of a lack of RCTs meeting the eligibility criteria. The RCTs included in this study used diverse study designs, such as the targeted patients and treatment duration. Different classes of OADs were assessed, and each class of OAD has a different mechanism of action, which may have led to differences in the study designs. Differences in calendar year in which the RCTs were conducted may also impact the study design and results. Moreover, we included reports written in Japanese and English, whereas reports published in other languages were not included.

The studies were selected based on a comprehensive search of four databases and manual search of three clinical trial registries; however, the latter did not include RCTs conducted prior to a certain year (2000 for ClinicalTrials.gov and 2005 for JAPIC and UMIN). Hence, access to unpublished studies was restricted to the availability in (and limitations of) the registries. Differences in statistical analysis methods and data quality between the RCTs may have also affected our results. To reduce the risk inflicted by such differences, we assessed the risk of bias and effect of studies with a high risk of bias.

Finally, we evaluated the outcomes at 12 weeks or the closest period > 12 weeks from the initiation of interventions. It is because 12 weeks is considered to be the time when the change in HbA1c level stabilizes and optimal period to evaluate the outcomes, and majority of RCTs evaluated the outcomes at the point. Selecting this period allowed us to compare a variety of treatments; however, the results may not reflect the efficacy and safety of long-term treatment. In addition, as we included only RCTs, the treatment situation might be different in RCTs from that in real-world settings. Consequently, further research is needed to compare the outcomes among treatments in real-world settings.

Conclusions

A systematic review and an NMA were performed to compare the efficacy and safety of metformin at 1500 mg/day with those of the other glucose-lowering agents available to Japanese patients with T2D. The results suggest that metformin 1500 mg/day has mostly a preferable efficacy and safety profile compared with the other OADs approved in Japan. The information provided in this study could shed insights for physicians to choose a suitable first-line treatment for their patients with T2D among the variety of options available in Japan. Further research is needed to compare the efficacy and safety in real-world settings.