Introduction

Type 2 diabetes mellitus (T2DM) is a common chronic disease and a growing worldwide epidemic [1]. According to a recent measure (in 2013), the prevalence of diabetes in Europe was 8.5% [2]. In Germany in 2010, the estimated number of individuals aged 55–74 years with T2DM was 2.4 million and is expected to rise to 3.9 million by 2030 [3]. In Spain, 30% of participants in the Di@bet.es cross-sectional study of a representative random sample of the Spanish population had some glucose disturbance, and the prevalence of diabetes mellitus was 13.8% [4]. Complications from diabetes lead to increased disability and high healthcare costs [2], highlighting the need for better preventive measures and treatment options.

A range of treatments are available to control patient blood glucose levels. This variety of treatment options naturally provides a diversity of clinical efficacy, modes of administration, and adverse event profiles.

Patient preferences are one of the most important factors to consider when making a drug choice. Because the variety of treatment options available for T2DM has increased, the importance of understanding patient preferences for available agents has similarly increased. With the introduction of medications with new mechanisms of action and side effect profiles, it is vital to determine how patient perceptions of specific treatment attributes vary in order to understand how such attributes might affect patient treatment choices [5,6,7,8,9,10].

Researchers have started using stated-preference surveys, such as discrete-choice experiments (DCEs), to evaluate the relative preference for features of medical interventions [11]. The basis for DCEs is the principle that products comprise a set of attributes and that the value of a product to an individual is a function of these attributes [12]. This approach provides information on people’s willingness to accept tradeoffs between features of multi-attribute products, recognizing that individuals place different levels of importance on different product attributes [13]. An added benefit of this approach is the ability to obtain information about new treatments or treatment attributes before the drugs enter the marketplace. The primary objective of the study reported here was to quantify preferences for attributes of T2DM treatments among patients with T2DM in Germany and Spain using a DCE survey.

Methods

Survey Development

An online survey was developed that included questions on demographics (age, sex, marital status, level of education, employment status), disease experience [time since diagnosis, current and target hemoglobin A1c (HbA1c) levels, and experience with T2DM treatments], and DCE questions about treatment choice.

The DCE questions posed a choice between two hypothetical treatments for T2DM, with each treatment being defined by a set of seven attributes at varying levels: (1) the chance of achieving the respondent’s HbA1c target; (2) reduced risk of serious heart attack or stroke; (3) frequency of hypoglycemic events; (4) risk of gastrointestinal (GI) problems; (5) changes in body weight; (6) mode of administration; (7) frequency of dosing (Table 1). In the final online survey, each of the seven attributes was described separately, and the descriptions were followed by questions about the attribute to ensure respondents’ understanding. The three probabilistic attributes, namely, the chance of achieving an HbA1c target, reduced risk of serious heart attack or stroke, and risk of GI problems, were explained graphically using risk grids in which colored dots identified the number of treated patients for whom the event occurred (Fig. 1). Research suggests that pictograms such as risk grids are an effective method of communicating risk [14]. Respondents were asked to review a sample risk grid, following which they were asked a comprehension question about the risk grid. Respondents who answered incorrectly were presented with the correct answer and an explanation before proceeding with the survey, while respondents who answered correctly were presented with the correct answer to reinforce their understanding. The survey was pretested to evaluate the survey instrument in 14 face-to-face interviews conducted in Berlin, Germany, and 15 face-to-face interviews conducted in Barcelona, Spain.

Table 1 Attribute levels for discrete-choice experiment
Fig. 1
figure 1

Example of discrete-choice experiment question. The images depict the number of patients out of 1000 who reached each goal as colored dots and those who did not as gray dots. The two blue dots in the reduction in risk of serious heart attack or stroke depict the 5% reduction in risk with Medication B versus Medication A. HbA1c Glycated hemoglobin A1, GI gastrointestinal

An experimental design was used to create the pairs of hypothetical treatments included in the DCE questions, following good practice guidelines [15]. The design was generated using SAS 9.3 analytics software (SAS Institute Inc., Cary, NC). The final design included 48 DCE questions divided into six blocks of eight questions each. Each patient was randomly assigned one block of DCE questions, and each DCE question asked the patient to indicate which treatment they would choose if the two treatments in the question were the only treatments available [14, 15].

Survey Populations

Patients were recruited from local communities for inclusion in survey pretesting and from an online national consumer panel for inclusion in the online survey. Patients were required to be residents of Germany or Spain, be aged ≥ 18 years, be able to read and understand German or Spanish, and provide informed consent. Eligible patients were also required to have a self-reported physician diagnosis of T2DM, have first started medication at least 2 years prior to taking the survey, and be receiving ongoing treatment with at least one T2DM prescription medication at the time of the survey. For the online survey, ≥ 200 respondents of both sexes in each country were sought. The study was approved by an institutional review board of RTI International (Research Triangle Park, NC, USA). All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1964, as revised in 2013. All patients provided informed consent prior to their inclusion in the study.

Statistical Analysis

The primary endpoint was the average preference weight for each attribute level of T2DM medications included in the survey (Table 1). The data from Germany and Spain were analyzed separately. The DCE choice data were estimated using random-parameters logit, a limited dependent variable regression model that avoids potential estimation bias from unobserved preference heterogeneity among respondents [16, 17]. All treatment attribute levels were included in the model as effects-coded categorical variables, except for the chance of reaching target HbA1c level, which was modeled as a linear continuous variable [12]. The coefficients on the independent variables from the random-parameters logit regression can be interpreted as relative preference weights, indicating the relative strength of preference for each attribute level. Larger positive coefficients indicated that respondents preferred that attribute level to levels that had smaller or negative coefficients. A Wald test was used to test for differences between adjacent attribute levels, and the NLOGIT 5 software package (Econometric Software, Inc., Plainview, NY) was used for the multivariate data analysis.

Separate random-parameters logit models were estimated and tested for differences in preferences between the following subgroups separately in each country: male versus female; age < 65 years versus age ≥ 65 years; self-reported T2DM diagnosis of < 7 years prior to survey versus ≥ 7 years prior to survey; use of injectable versus oral therapy at the time of the survey. The results of these exploratory analyses of preference heterogeneity are shown in the Electronic Supplementary Materials (ESM).

The estimated preference weights from the random-parameters logit model were used to calculate the relative preferences for medication profiles defined by the attributes and levels shown in Table 1. These medication profiles were created to approximate T2DM medications that would be the most likely comparators if a new injectable medication entered the market. For Germany, these were: a sulfonylurea (e.g., glipizide), a prandial insulin (e.g., insulin lispro), a once-daily glucagon-like peptide-1 receptor agonist (GLP-1 RA; e.g., liraglutide), and a once-weekly GLP-1 RA (e.g., albiglutide) (Table 2); for Spain, these were: a once-daily GLP-1 RA (e.g., liraglutide), a basal insulin (e.g., insulin glargine), a prandial insulin (e.g., insulin lispro), and a once-weekly GLP-1 RA (e.g., albiglutide) (Table 2). To calculate the proportion of the sample that may choose each of the four medication profiles, we calculated a net clinical benefit score for each medication as the weighted sum of each medication profile’s attribute levels, where the weights represent the relative strength of preference for the corresponding attribute level in the medication profile.

Table 2 Medication profiles that approximate actual treatments

A minimum acceptable benefit (MAB) for changes in attribute levels was also calculated. The MAB is interpreted as the minimum change in efficacy that respondents would require (on average) to accept changes in other attributes. In this case, it is the minimum increase in the probability that the medication will achieve the target HbA1c level required to compensate respondents for change to a less desirable level in another attribute. MAB was calculated as the difference between the preference weights for two levels of an attribute divided by the preference weight for increasing the probability of achieving the target HbA1c level by 1 percentage point.

Results

Study Populations

In Germany, 1198 individuals responded to the invitation to take part in the survey. Of those who responded, 531 (44.3%) were eligible to participate, of whom 474 (89.3% of those eligible) provided consent and completed the survey. In Spain, 1584 individuals responded to the invitation to take the survey. Of those who responded, 446 (28.2%) were eligible to participate, of whom 401 (89.9% of those eligible) provided consent and completed the survey. The German and Spanish respondents were generally similar, although they differed in that the Spanish group included proportionately more men, had an overall shorter self-reported duration of diabetes, and had more respondents with HbA1c level  > 8.0% (Table 3).

Table 3 Demographics and background characteristics

Results of DCE Model

The results from the DCE analyses are provided in Fig. 2a, b. The estimated preference weights for all attributes were generally consistent with the expected ordering of levels within each attribute; that is, better outcomes were preferred to worse outcomes. Two exceptions that did not follow the expected ordering were dosing frequency among German respondents (Fig. 2a) and risk of GI problems among Spanish respondents (Fig. 2b); however, there was no statistically significant difference between the attribute levels in either case (respondents were indifferent in their choices between the levels). For Germans (Fig. 2a), the most important change was from no risk of GI problems to a 30% risk of GI problems, whereas reducing the risk of serious heart attack or stroke by 5% was the least important change. For Spaniards (Fig. 2b), the most important change was a switch from oral to injectable mode of administration, and the least important change was also a 5% reduction in the risk of serious heart attack or stroke.

Fig. 2
figure 2

Overall preference weight results presented as the mean (symbol) and 95% confidence interval (whiskers). a German preference weights, b Spanish preference weights

Most of the differences in preference weights between adjacent levels within a single attribute were statistically significant (p ≤ 0.05), indicating that respondents, on average, distinguished between the levels when making medication choices. Exceptions among German participants were for comparisons of dosing frequency (once a week versus once a day, once a week versus twice a day, and twice a day versus more than twice a day). Exceptions among Spanish participants were in dosing frequency (once a day versus twice a day), risk of GI problems (10 versus 20% and 10 versus 30%), and in hypoglycemic events (no hypoglycemic events versus 1–2 hypoglycemic events per year).

Subgroup Analysis of DCE Results

Subgroup analysis revealed no statistically significant differences between subgroups based on sex or age in either the German or Spanish analysis (data not shown). For both nationalities, there were statistically significant differences in preferences between respondents using injectable treatments at the time of the survey versus those who were not. The detailed results of the subgroup analysis are presented in ESM Fig. S1.

Minimum Acceptable Benefit

The MAB levels required (in percentage point increases in the chance of reaching HbA1c target levels) to compensate respondents for worsening levels of other treatment attributes are presented in Table 4. Among German respondents, the highest MAB levels were seen for moving from a 0% risk of GI problems to a 30% risk of GI problems, which required a 56.0 percentage point increase in the chance of reaching target HbA1c to offset. Other substantial MAB levels included changes from 2-kg body-weight loss to 2-kg body-weight gain (44.4 percentage point increase), no risk of GI problems to 20% risk of GI problems (43.4 percentage point increase), oral to injectable mode of administration (42.0 percentage point increase), no hypoglycemic events to > 2 hypoglycemic events per month (37.3 percentage point increase), and no body weight change to a 2-kg body weight gain (37.1 percentage point increase).

Table 4 Minimum acceptable benefit for changes in attribute levels

Among Spanish respondents, the highest MAB levels were observed for avoiding injections, which required a 59.4 percentage point increase in the chance of reaching target HbA1c to offset, followed by changes in dosing frequency of once a week to more than twice a day (40.3 percentage point increase), risk of GI problems from 0% to 30% (37.4 percentage point increase), no hypoglycemic events to > 2 hypoglycemic events per month (37.3 percentage point increase), and a 2-kg loss in body weight to a 2-kg gain in body weight (35.4 percentage point increase) (Table 4).

For several comparisons of dosing frequency among German respondents, the MAB 95% confidence intervals included zero, suggesting that the respondents were not willing to trade-off efficacy in controlling HbA1c for changes in dosing frequency. Among Spanish respondents, 95% confidence intervals crossing zero were observed for comparisons between frequencies of hypoglycemic events, GI problems, and dosing frequency (Table 4).

Predicted Proportion of Sample Choosing Specific Medicines

The results from the random-parameters logit model were used to calculate preference shares for hypothetical medicine profiles, which indicate the proportion of the population expected to select a particular hypothetical medicine if given a choice between stated alternatives. Among German respondents (Fig. 3a), the oral sulfonylurea (glipizide-like) profile was predicted to have the greatest preference share among patients (44.4%). The second-ranked hypothetical medication was the injectable once-weekly GLP-1 RA (albiglutide-like) profile (27.1%). Among German respondents using injectable treatments (Fig. 3b), a once-weekly GLP-1 RA (albiglutide-like) profile had the largest preference share (45.1%).

Fig. 3
figure 3

Predicted preference shares for hypothetical treatments based on discrete-choice experiment results (mean with the 95% confidence interval in parenthesis). a German respondents, b German respondents using injectable treatments, c Spanish respondents. GLP-1 Glucagon-like protein-1 receptor agonist, QD once daily, SU sulfonylurea

Among Spanish respondents (Fig. 3c), the once-weekly GLP-1 RA (albiglutide-like) profile was most preferred (39.7%), followed by the once-daily GLP-1 RA (liraglutide-like) profile (35.6%). The model predicted smaller preference shares for basal insulin (19.2%) or prandial insulin (5.5%).

Discussion

As interest in understanding patient preferences grows, researchers have increased the use of stated-preference surveys, particularly DCE surveys, to aid in assessing the value of specific medical interventions. DCE surveys provide a systematic method of assessing the relative preferences of patients and other stakeholders for medication attributes and of quantifying the trade-offs they make between the benefits and risks associated with different treatment options [18, 19].

In this study, the results of the DCE model revealed that estimated preference weights were generally consistent with the expected ordering of attribute levels. However, there were differences in preferences between the two populations. German patients demonstrated the greatest preferences for a lower risk of GI problems, less weight gain, and oral versus injection mode of administration. Spanish patients demonstrated the greatest preferences for oral versus injection mode of administration, lower dosing frequency, and increasing probability of reaching HbA1c target. Subgroup analysis revealed significant differences in preferences among German respondents based on time since diabetes diagnosis. Both German and Spanish respondents showed subgroup preference differences based on current use of injectable treatments versus oral delivery, although how the preferences varied between subgroups was different in the German and Spanish respondents.

In the MAB analyses for both the German and Spanish respondents, respondents would require the largest increases in the probability of reaching the HbA1c target for increases in the rate of hypoglycemic events, the risk of GI problems, body weight, and dosing frequency. Interestingly, these patient relevant attributes were ranked higher than the clinically important attribute of risk of serious heart attack or stroke.

The findings from our study are consistent with those of similar studies. For example, Purnell et al. [18] conducted a review of studies that assessed patient preferences for non-insulin diabetes medications using a variety of quantitative methods, including DCE. When examining studies that included multiple attributes, the review found that weight loss or control, control of blood sugar, and avoidance of GI problems were identified as important. In a DCE survey of patients with T2DM in Canada, Hauber et al. [20] found that insulin-experienced respondents placed the most importance on glucose control, whereas insulin-naive respondents placed just as much significance on reducing the number of injections per day as on controlling glucose. Mohamed et al. [21] conducted a DCE survey of patients with T2DM in Sweden and Germany and reported that respondents placed the most importance on avoiding weight gain followed by glucose control. In a DCE survey of patients with T2DM in the USA comparing oral treatments, glucose control again was highly valued. The study also found that a more convenient dosing schedule (fewer pills and less frequent dosing) was more highly valued by patients who had a lower dosing burden at the time of the survey [22]. A recent DCE-based investigation in the USA and the UK focusing on physician preferences revealed the most important attributes to physicians were glucose control, risk of a fatal myocardial infarction, and weight change [19].

The current analysis also predicted preferences for treatment profiles that were similar to actual T2DM medicines. The predicted preferences support the results of the DCE analysis in that the preference for the glipizide-like and albiglutide-like medications by German respondents agrees with their preference for a non-injectable mode of administration or fewer GI problems. The preference by Spanish respondents for albiglutide-like and liraglutide-like medications agrees with their preference for less frequent dosing or no body weight gain.

The limitations of this study include reliance on the self-reported diagnosis of T2DM and T2DM treatment and the use of a sample of patients with T2DM drawn from the consumer panel, which might not be representative of the actual German or Spanish populations of such patients. Additionally, the data collected in DCEs were based on choices among hypothetical treatment alternatives, with a focus only on clinically relevant treatment attributes. These choices were intended to simulate clinical decisions; however, they did not have the same clinical or emotional consequences of actual decisions, and differences may arise between patients’ stated and actual choices.

Conclusions

Our findings suggest that both German and Spanish patients treated for T2DM were generally willing to trade efficacy for improvements in side effects, mode of administration, and dosing frequency—although differences in relative preferences were observed between the German and Spanish patient populations. Differences in preference profiles were also seen among subgroups of patients based on their current mode of administration and longer versus shorter time since diagnosis. Considering the variety of T2DM medications available, the results from this assessment suggest that engaging with patients about their specific preferences may help improve their satisfaction with T2DM treatments, which may ultimately result in improved outcomes.