Introduction

Pharmacotherapy for type 2 diabetes mellitus (T2DM) has made remarkable progress since the introduction of insulin, the first injectable treatment for diabetes, in the 1920s, and sulfonylureas, the first oral hypoglycemic agent (OHA), in the 1950s [1]. Though the progress of pharmacotherapy has contributed to improving blood glucose (BG) control in T2DM patients, a substantial number of patients are still poorly controlled. In a survey of T2DM adults in the USA, only 52.2% of the subjects achieved a glycemic target of HbA1c < 7.0% in 2007–2010 [2]. Increasing the number of T2DM patients with a sufficient level of BG control is an important medical objective, and one way of achieving this is to improve patient adherence to treatment. This is supported by a report which showed that a considerable portion of patients exhibited poor adherence [3] and some observational studies that also showed a positive correlation between adherence to medication and BG control level in the real world [4, 5].

Multiple factors are considered to affect non-adherence to diabetic treatment [6,7,8,9]. Patient factors include demographics (e.g., age, sex), psychological factors (e.g., health belief), physical factors (e.g., concomitant disease), and social and economic status (e.g., education, income). On the other hand, there are specific pharmacotherapy factors: efficacy, safety, cost, complexity (e.g., frequency and interval of dose), and convenience (e.g., ease of swallowing formulation, ease of handling injection device). In addition, T2DM patients, especially older adults, commonly have concomitant diseases, such as hypertension and dyslipidemia, and therefore have the greater risk associated with polypharmacy [10]. Considering the factors that might be causes of non-adherence, a less complex and more convenient pharmacotherapy would be preferable, and this together with a reduction of treatment burden is a key for improving treatment.

One possible solution for reducing treatment burden is the use of single-tablet fixed-dose combinations of multiple OHAs [11, 12]. Fixed-dose combination therapies require fewer tablets to be taken and lessen treatment complexity, and thereby reduce treatment burden by replacing multiple agents with one fixed-dose combination. Another solution is the use of a long-acting formulation with a hypoglycemic effect sustained for 1 week [13]. Currently, there are two available once-weekly (QW) formulations: an OHA DPP-4 inhibitor and an injectable GLP-1 receptor agonist [14,15,16,17]. Some observational studies reported that T2DM patients treated with QW GLP-1 receptor agonist exhibited better adherence than patients treated with a once-daily (QD) agent [18, 19]. QW agents are largely equivalent to the corresponding QD agents in terms of efficacy (i.e., hypoglycemic effect), safety (i.e., adverse events including hypoglycemia), and cost (i.e., drug price). Therefore, if switching from QD to QW agents improved adherence, the reason would be that the treatment burden was reduced by mitigating the complexity of QD treatment. However, there is currently no questionnaire available for measuring quantitatively the difference of treatment burden between QW and QD agents.

To measure burden on patients with diabetes, two scales, PAID (Problem Areas in Diabetes Survey) and DDS (Diabetes Distress Scale) were developed so far [20,21,22]. However, these scales focus on the generic burden attributable to diabetes, not on the treatment burden caused by pharmacotherapy. Therefore, they are not suitable for differentiating medicines according to their characteristics, such as dosing schedule which dominantly determines the complexity and convenience of implementation of pharmacotherapy. Thus, to measure the treatment burden caused by pharmacotherapy for T2DM, we developed DTBQ (Diabetes Treatment Burden Questionnaire) focusing on efficacy (i.e., BG control), safety (i.e., hypoglycemia), complexity (i.e., amount, frequency, timing, and interval of dose), and convenience (i.e., ease of taking or injecting medicine), and evaluated its reproducibility and validity. This questionnaire would make it possible to quantify the treatment burden on T2DM patients depending on the characteristics of medicines.

Methods

Development of DTBQ

We interviewed 16 patients with T2DM treated with pharmacotherapy about treatment burden. Based on the interview, pre-existing questionnaires, and opinions from diabetologists, the first version of questionnaire with 26 questions was drafted. This first version was completed by 20 patients for pilot testing. Based on these results, 18 questions were selected for the final version.

Each question is answered by choosing one of seven ordered categories: (1) strongly disagree, (2) mostly disagree, (3) slightly disagree, (4) neither agree nor disagree, (5) slightly agree, (6) mostly agree, (7) strongly agree. It should be noted that the relations between the category number and level of treatment burden are opposite for questions 1–10 and questions 11–18. In other words, a larger category number means heavier treatment burden in questions 1–10, but a smaller burden in questions 11–18. Therefore, item scores are defined by the category number converted so that 0 means “minimum treatment burden” and 6 means “maximum treatment burden”. Correspondence between the category number and the item score for each question of DTBQ is shown in Table S1 in the electronic supplementary material.

Evaluation of Reproducibility and Validity

All subjects were outpatients with T2DM treated for at least 12 weeks with pharmacotherapy corresponding to one of the following six types of diabetic treatment:

  • Injection of GLP-1 receptor agonist, once weekly ± OHA (Injection/QW)

  • Injection of insulin or GLP-1 receptor agonist, once daily ± OHA (Injection/QD)

  • Injection of insulin or GLP-1 receptor agonist, twice a day or more ± OHA (Injection/BID+)

  • Only OHA, once weekly (OHA/QW)

  • Only OHA, once daily (OHA/QD)

  • Only OHA, twice a day or more (OHA/BID+)

In addition, subjects had to be at least 20 years old and provide written informed consent. Moreover, the following patients were excluded: patients who were diagnosed with or suspected to have dementia, patients with psychiatric disorders, patients without sufficient judgment ability, patients who needed consent from their proxies, and other patients judged to be inappropriate by the attending physicians.

The target numbers of enrolled patients were the following: 210 subjects for validity evaluation who answered DTBQ at least once, and 70 subjects for reproducibility evaluation who answered DTBQ twice. The enrollment was adjusted to balance the number of patients among the different types of diabetic treatment. Patients enrolled as diabetic treatment including injectable medicines had the option to use OHAs together, so they were asked to answer the questions in DTBQ on the injectable medicines only.

The attending physician gave the “Report form for all patients” along with the 1st DTBQ and DTSQ to be filled out to the patient after confirming his/her eligibility; the “Report form for reproducibility evaluation”, which included the 2nd DTBQ, was then delivered only if the patient was a subject for reproducibility evaluation. The patient filled out the “Report form for all patients” without the presence of the attending physician in a room at the medical institute. Then the patient put the form in an envelope and posted it to the data center or gave it to the attending physician. The attending physician filled out the “Report form for physicians” and posted it to the data center with the “Report form for all patients” if he had received it from the patient. Patients who were subjects for reproducibility evaluation filled out the “Report form for reproducibility evaluation” and posted it to the data center after leaving the medical institute.

DTSQ is the most popular patient-reported questionnaire to evaluate patient satisfaction in diabetic treatment [23]. License and copyright agreement for using DTSQ questionnaire in this study was obtained from Health Psychology Research Limited (University of London, UK). The study was conducted from November 2016 to February 2017 at seven sites in Japan and was registered at University Hospital Medical Information Network (UMIN000026382).

A common protocol was approved at institutional review boards in Nara Medical University and TDE Healthcare Corporation TOSAKI Clinic for Diabetes and Endocrinology. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards. Informed consent was obtained from all individual participants included in the study.

Statistical Analyses

Enrolled patients who left any unanswered questions in the 1st DTBQ were excluded from the validity analysis set. Of the validity analysis set, patients who completed the 2nd DTBQ collected and without any unanswered questions were included in the reproducibility analysis set. Appropriate descriptive statistics were calculated for background factors, such as type of diabetic treatment, HbA1c, age, and sex, for the validity and reproducibility analysis sets. Mean and standard deviation were calculated for DTBQ item scores, and for the validity and reproducibility analysis sets. In addition, weighted kappa coefficients were calculated for the reproducibility analysis set where the weights (w) were assigned as follows: exact agreement, w = 1; one-level disagreement, w = 5/6; two-level disagreement, w = 4/6; three-level disagreement, w = 3/6; four-level disagreement, w = 2/6; five-level disagreement, w = 1/6; six-level disagreement, w = 0/6. Factor analysis of the DTBQ item scores was conducted for the validity analysis set. Based on the results, subscales were determined and named. Intraclass correlation coefficients (ICC) were calculated for the DTBQ subscale and total scores for the reproducibility analysis set. Correlation coefficients of the DTBQ subscale and total scores and the DTSQ scores were calculated for the validity analysis set. In addition, analysis of variance (ANOVA) or the t test was used to examine associations between DTBQ subscale and total scores and patient background factors.

Results

Patients

Patient disposition is shown in Fig. 1. Out of 240 enrolled patients, 236 were included in the validity analysis set; four were excluded because of unanswered questions in the 1st DTBQ. The numbers of patients by type of diabetic treatment were as follows: (1) Injection/QW, 40; (2) Injection/QD, 41; (3) Injection/BID+, 37; (4) OHA/QW, 38; (5) OHA/QD, 35; and (6) OHA/BID+, 45. Out of 236 patients in the validity analysis set, 48 patients filled out the 2nd DTBQ. One patient was excluded because of unanswered questions, and 47 were included the reproducibility analysis set.

Fig. 1
figure 1

Patient disposition. Full description of type of diabetic treatment: 1. Injection of GLP-1 agonist, once a week ± oral hypoglycemic agent; 2. Injection of insulin or GLP-1 agonist, once a day ± oral hypoglycemic agent; 3. Injection of insulin or GLP-1 agonist, twice a day or more ± oral hypoglycemic agent; 4. Oral hypoglycemic agent, once a week; 5. Oral hypoglycemic agent, once a day; 6. Oral hypoglycemic agent, twice a day or more. BID+ twice a day or more, DTBQ Diabetic Treatment Burden Questionnaire, OHA oral hypoglycemic agent, QD once daily, QW once weekly

Patient backgrounds reported by physicians and patients are summarized in Table 1 and Table S2 (in the electronic supplementary material), respectively. All background factors were similarly distributed in the validity and reproducibility analysis sets.

Table 1 Patient backgrounds reported by physicians

Summary and Reproducibility of DTBQ Item Scores

Descriptive statistics of the DTBQ item scores for the validity and reproducibility analysis sets, and weighted kappa coefficients for the reproducibility analysis set are shown in Table 2. For all questions, item scores of 0 to 2, which indicate a relatively small burden, were reported from more than half of the patients in the validity analysis set. The averages of 14 item scores out of the 18 questions were less than 2.0. Four questions whose average scores were greater than 2.0 were as follows: “7. Medication away from home”, “13. Allowing me to take a missed dose”, “16. Feeling less burden to follow diet therapy”, and “18. Satisfaction with my current BG control”.

Table 2 Reproducibility of DTBQ item scores (reproducibility analysis set)

Means of the 1st and 2nd item scores in the reproducibility analysis set were close to those in the validity analysis set. Weighted kappa coefficients for the reproducibility analysis set were estimated at 0.6 or higher for 14 of the 18 questions. Four questions with point estimates less than 0.6 were “3. Securing time for medication (κ = 0.550)”, “8. Concern about hypoglycemia (κ = 0.532)”, “17. Feeling less burden to continue diabetes treatment (κ = 0.560)”, and “18. Satisfaction with my current BG control (κ = 0.579)”, which indicated reproducibility good enough to use in practice.

Factor Analysis of DTBQ Item Scores

Table 3 shows the results of factor analysis of the DTBQ item scores for the validity analysis set. In the factor pattern for a four-factor model without rotation, the first factor loadings were more than 0.4 for all questions, which indicated a unidimensional feature of DTBQ. Therefore, the total score of all 18 questions was designated as the “total burden score”. On the other hand, the variance of three factors was greater than 1.0 in the four-factor model.

Table 3 Factor analysis of DTBQ item scores (validity analysis set)

In the factor pattern for three-factor model with varimax rotation, focusing on factor loading more than 0.4, a simple structure was obtained, where all questions belonged to one of the three factors. Considering the factor pattern and contents of the questions, three subscales were determined and designated:

  • Implementation burden score: sum of item scores 1–10

  • Flexibility burden score: sum of item scores 11–13

  • Blood control (BG) control burden score: sum of item scores 14–18

Reproducibility of DTBQ Subscale Scores and Total Burden Score

Scatter plots of the DTBQ subscale scores and total burden score (1st vs. 2nd) and the estimates of ICC for the reproducibility analysis set are shown in Fig. 2. The ICC estimates were 0.8 or higher, indicating high reproducibility of all scores.

Fig. 2
figure 2

Reproducibility of DTBQ subscale scores and total burden scores (reproducibility analysis set). a Implementation burden score (sum of scores for items 1–10), b flexibility burden score (sum of scores for items 11–13), c BG control burden score (sum of scores for items 14–18), d total burden score (sum of scores for items 1–18). CI confidence interval

Correlation Between DTBQ and DTSQ Scores

Table 4 shows the univariate descriptive statistics and correlations of the DTBQ and DTSQ scores for the validity analysis set. Correlations between the DTBQ subscale scores were 0.4 or higher for Spearman’s correlation coefficient (SCC) for every combination. The DTBQ total burden scores correlated positively with all the DTBQ subscale scores with 0.7 or higher SCC estimates.

Table 4 Correlation between DTBQ scores and DTSQ scores (validity analysis set)

The DTSQ hyperglycemic score correlated positively with the DTBQ scores: implementation burden (SCC = 0.31), BG control burden (0.36), and total burden (0.36). Weak correlations were observed between the DTSQ hypoglycemic score and the DTBQ scores within the range of ± 0.2 SCC. The DTSQ satisfaction score correlated negatively with all the DTBQ scores with − 0.3 SCC or less. In particular, the BG control burden score (SCC = − 0.61) and total burden score (− 0.53) were correlated strongly.

Association Between DTBQ Scores and Patient Backgrounds

For the validity analysis set, analysis results of the association between DTBQ scores and patient backgrounds reported by physicians and patients are shown in Table 5 and Table S3 (in the electronic supplementary material), respectively. In addition, the results of pairwise comparisons of the DTBQ/DTSQ scores between types of diabetic treatment are shown in Table S4 in the electronic supplementary material.

Table 5 Associations between DTBQ scores and patient backgrounds reported by physicians (validity analysis set)

DTBQ implementation burden score had significant associations with three items reported by physicians (i.e., type of diabetic treatment, age, frequency of dose: OHA) and five items reported by patients (i.e., BG control, hypoglycemic event, frequency of missed dose: injection, OHA, injection or OHA). DTBQ implementation burden scores were high in the following patients: treated with injection, frequently dosing with injection or OHA, younger age (< 65 years), with BG controlled poorly, with hypoglycemic events, frequently missing dose of injection or OHA.

DTBQ flexibility burden score had significant associations with seven items reported by physicians (i.e., type of diabetic treatment, HbA1c, age, duration of diabetes, frequency of dose: injection and OHA, diabetic retinopathy) and three items reported by patients (i.e., hypoglycemic event, frequency of missed dose: injection or OHA, time to leave home on weekdays). DTBQ flexibility burden score was high in the following patients: frequently dosing with injection or OHA, higher HbA1c (≥ 7.0%), older age (≥ 65 years), longer duration of diabetes (≥ 10 years), with diabetic retinopathy, with hypoglycemic events, frequently missing dose of injection or OHA, leaving home on weekdays late (8:30 or later).

DTBQ BG control burden score had significant associations with four items reported by physicians (i.e., type of diabetic treatment, HbA1c, diabetic nephropathy and retinopathy) and four items reported by patients (i.e., BG control, hypoglycemic event, frequency of missed dose: injection or OHA, communication with physician on diabetic treatment). DTBQ BG control burden score was high in the following patients: frequently missing dose of injection or OHA, higher HbA1c (≥ 7.0%), with diabetic nephropathy or retinopathy, with BG controlled poorly, with hypoglycemic events, frequently missing dose of injection or OHA, less communication with their physicians.

Discussion

We developed DTBQ, a patient-administered questionnaire with 18 questions, to measure the treatment burden caused by pharmacotherapy for T2DM. The reproducibility and validity of DTBQ were evaluated through statistical analysis of data collected using the questionnaire from patients with T2DM under pharmacotherapy. This included 236 patients as the validity analysis set and 47 patients as the reproducibility analysis set.

Weighted kappa coefficients were calculated as an index of reproducibility for the DTBQ item scores, with all 18 questions showing reproducibility high enough for practical use. ICCs were also calculated as an index of reproducibility for the DTBQ subscale scores and total burden score. All coefficients were 0.8 or higher, showing high reproducibility of three subscales and total burden score.

Factor analysis of the DTBQ item scores without rotation showed a unidimensional feature of DTBQ, which supported interpreting the total score of all 18 questions as “total burden”. On the basis of the result of the three-factor model with varimax rotation, a simple structure was obtained, and three subscales were determined and designated as follows: “implementation burden”, “flexibility burden”, and “blood control (BG) control burden”.

There were significant associations between type of diabetic treatment and DTBQ total burden score. In terms of administration route (injection vs. OHA), the score was higher for injection than oral administration. For the frequency of dose (QW, QD, BID+), the score increased with order of dosing frequency: QW, QD, and BID+. Comparing the means of total burden score among the type of diabetic treatment (combination of administration route and dose frequency), OHA/BID+ (33.9) was almost the same as Injection/QD (34.2), and Injection/QW (24.3) was smaller than OHA/BID+. These results suggested that the treatment burden of Injection/QW was lighter than that of OHA/BID+.

The DTBQ subscale scores were also significantly associated with type of diabetic treatment. In particular, the flexibility burden score rose significantly as the frequency of dose increased. Implementation burden score also increased with order of dosing frequency, and was higher for injection than OHA.

HbA1c was significantly associated with total burden score. Total burden score was higher in patients with higher HbA1c (≥ 7.0%). Of the DTBQ subscales, BG control burden score and flexibility burden score also showed significant differences with HbA1c. BG control reported by patients was significantly associated with DTBQ scores as well. These results suggested that patients with worse BG control felt a heavier burden for treatment of diabetes.

Age was not significantly associated with total burden score. However, among the DTBQ subscales, implementation burden score was significantly higher in younger patients (< 65 years). Younger patients might feel heavier implementation burdens because of their greater job demands. On the other hand, flexibility burden score was significantly higher in older patients (≥ 65 years).

Hypoglycemic events were significantly associated with total burden score and DTBQ subscale scores. All scores were significantly higher in patients with hypoglycemic events. Out of the six types of diabetic treatment, the percentage of hypoglycemic events and all DTBQ subscale scores were highest for Injection/BID+, which reflected the association between hypoglycemia and DTBQ scores. This suggests that patients with hypoglycemic events would probably feel a heavier treatment burden for controlling BG.

Daily frequency of dosing non-diabetic agents was not significantly associated with any DTBQ score. This result suggested that DTBQ measured the treatment burden caused by pharmacotherapy for diabetes only, as intended.

Frequency of missed dose of diabetic medication (injection or OHA) was significantly associated with the total burden score and all DTBQ subscale scores. All scores were significantly higher in patients with a missed injection or OHA dose. As the frequency of dosing diabetic medication increased, the frequency of missed dose and all subscale scores also increased simultaneously, which reflected the association between the frequency of missed dose and the DTBQ scores.

The treatment burden in patients with diabetic retinopathy was higher than that in patients without diabetic retinopathy. In the patients with diabetic retinopathy, 41 out of 61 subjects had injection therapy compared with 63 out of 150 subjects in the patients without diabetic retinopathy. Duration of illness of patients with diabetic retinopathy is relatively longer and insulin therapy is necessary for many patients. Since injection therapy has a higher burden overall, there was a statistically significant difference between the groups.

DTSQ satisfaction score was negatively correlated with all DTBQ scores. The expected results suggested that DTBQ validly measured diabetic treatment burden. Moreover, the results of pairwise comparisons of DTBQ/DTSQ scores among types of diabetic treatment (Table S4 in the electronic supplementary material) showed that the mean DTBQ total burden score and DTBQ satisfaction score were inversely related to the types of diabetic treatment. This indicated that the heavier burden of diabetic treatment might worsen treatment satisfaction.

In the studies comparing QW and BID formulations of GLP-1 receptor agonist, better treatment satisfaction was observed in patients treated with QW medication [24, 25]. Looking at another therapeutic area, some observational/database studies on pharmacotherapy of osteoporosis concluded that QW dosing was associated with better adherence and persistence than QD dosing [26,27,28]. These results were consistent with the differences among type of diabetic treatment observed in the DTBQ scores.

Recently, patient preference of diabetic treatment has been studied and reported intensively [29,30,31,32,33,34,35]. For OHAs, a daily dosing schedule was an important factor in addition to efficacy, safety, and cost. For injections, dosing frequency was the factor which most affected patient preference among all characteristics of GLP-1 receptor agonists, including type of delivery system, nausea, hypoglycemia, BG change, and dosing frequency where QW was clearly preferred to QD. The questions in DTBQ directly address factors affecting patients’ preference, so DTBQ can contribute to a better understanding of patient preference.

This study has some limitations. In the validity analysis set of this study, mean HbA1c was 6.98, which indicated that the study subjects controlled BG better than the general T2DM population. Another study should be conducted to evaluate the validity of DTBQ for poorly controlled T2DM patients. Because the type of diabetic treatment was not randomized, results based on comparisons of DTBQ scores with treatment type should not be interpreted as interventional effects. The treatment burden should be measured from various points of view, and there are some aspects that DTBQ does not cover. For example, side effects other than hypoglycemia (e.g., weight gain, gastrointestinal symptoms) and financial aspects (e.g., cost of drug and device) are not included. Other questionnaires measuring the uncovered aspects should be used together depending on the purpose, such as OHA-Q [36]. The original version of DTBQ was written in Japanese; we translated the questionnaire into English for this publication. If there are plans to use the English version with patients, a validation study will be needed.

It has been already shown that improving treatment satisfaction and quality of life of patients has a positive impact on their adherence and BG control [37, 38]. In this study, DTBQ total burden score correlated negatively with DTSQ satisfaction score. Combining these results, it is expected that increase of the treatment burden leads to worsening BG control. In “Standards of Medical Care in Diabetes—2017” released by the American Diabetes Association [39], the following statements were included:

  • Treatment decision should be timely, rely on evidence-based guidelines, and be made collaboratively with patients based on individual preferences, prognosis, and comorbidities.

  • Providers should consider the burden of treatment and self-efficacy of patients when recommending treatment.

These statements reflect the importance of patient preference and treatment burden for determining the most appropriate treatment option for diabetes. DTBQ is very much in the same direction indicated by these statements, and it enables quantitative measurement of the treatment burden caused by pharmacotherapy. Measuring the treatment burden on patients with poor adherence would help to better understand their reasons, and some countermeasures might be taken to improve their adherence.

In conclusion, on the basis of the above results and the considerations discussed, DTBQ has adequate reproducibility and validity as a measurement scale for treatment burden on T2DM patients.