FormalPara Key Points for Decision Makers

In Brazil, insulin glargine (GLA) has been available as a treatment option versus neutral protamine Hagedorn (NPH) insulin for a number of years to enhance glycemic control and patients’ quality of life as an easier preparation to administer than NPH insulin.

A systematic review failed to consistently show any significant difference in quality of life scores between GLA versus NPH insulin. Only in patients’ satisfaction measured by the Diabetes Treatment Satisfaction Questionnaire was a better result consistently seen with GLA versus NPH insulin. However, there were concerns with the quality of the studies.

Based on our findings, we believe the current price differential between the two insulin preparations in Brazil cannot be justified within its universal healthcare system.

1 Introduction

Diabetes mellitus (DM) is a chronic and complex disease that requires continuous medical care. DM can be subdivided into several etiological types, with Type 1 DM (T1DM) and Type 2 DM (T2DM) being the most prevalent [1]. T1DM is characterized by the destruction of the islets of Langerhans and insulin-secreting β1 cells in the pancreas mediated by the immune response, and its treatment is based on the replacement of deficient or non-existent insulin [2].

There are a range of insulins currently available for the treatment of patients with T1DM. These include both neutral protamine Hagedorn (NPH), which has an intermediate action profile, and longer-acting insulins such as analog glargine (GLA). NPH is usually the first-choice treatment when the diagnosis of T1DM is confirmed in view of typically appreciably lower costs than the analogs, and similar effectiveness [3,4,5,6,7]. GLA was developed as a better alternative to basal insulin, aiming for a peak-free preparation and with prolonged action, which mimics the insulin secretion of individuals without DM. This has resulted in a decrease in episodes of hypoglycemia as well as better glycemic control, mainly glycated hemoglobin (HbA1c), compared with NPH insulin in some publications [8,9,10,11,12], with HbA1c the main predictor of the effectiveness of T1DM treatment. The monitoring and maintenance of HbA1c at levels ≤ 7% is fundamental to reducing chronic microvascular (retinopathy, nephropathy, and neuropathy) and macrovascular complications (peripheral arterial disease, carotid disease, and coronary artery disease), as well as acute complications, which includes episodes of hypoglycemia and hyperglycemia associated with T1DM [2, 13].

However, in two systematic reviews (SRs), no benefits for GLA compared with NPH insulin were found for the main outcome measure, HbA1c, in terms of effectiveness as well as safety [4, 14]. Similar results were seen in a recent cohort study in Brazil [15]. The difference in the findings may partly be due to the study sponsors, with studies with conflicts of interest favoring GLA versus those studies without conflicts of interest [14]. Another important aspect is the price difference that can exist between GLA and NPH insulin. In the state of Minas Gerais, Brazil, there was an increase of just under 300% in the costs of insulins to the State’s Department of Health after the incorporation of GLA. This resulted in approximately US$6 million being spent on insulins in 2012, exacerbated by the price difference between NPH and GLA in Brazil at over 500% [4, 15].

Improved quality of life (QoL), defined by the World Health Organization (WHO) as a way of measuring the individual’s perception of their life position, cultural aspects, personal goals, and concerns [16], is fundamental to understanding the notion of health. Consequently, this is an important variable in clinical practice, as well as helping with decisions on priority setting and resource allocation. The measurement of the impact of different treatments on QoL, as well as its improvement, is one of the expected humanistic results of healthcare practices and public policies in the fields of health promotion and disease prevention, especially chronic diseases such as T1DM [16, 17]. Several instruments have been used to evaluate the impact of different treatments on the QoL of patients with diabetes [18]. Effective control of T1DM, and minimal problems with insulin therapy, tend to favorably influence patients’ QoL [19,20,21,22].

It is hypothesized that the use of GLA versus NPH insulin would promote better QoL, as GLA insulin could lead to a decrease in the episodes of hypoglycemia and would also cause less discomfort to patients. There have been several SRs to assess differences in effectiveness and safety of GLA versus NPH insulins [14, 23,24,25]. However, we did not identify any SRs in the literature measuring the impact on patients’ QoL as a primary outcome, although our unit and others have assessed the impact of T1DM on patients’ health-related QoL (HRQoL) [17]. In addition, we are aware that Plank et al. [23], in their SR, also assessed the impact of the different insulins on patients’ QoL but found no evidence of an improvement in patients’ scores with GLA versus NPH insulins with the instruments used. Vardi et al. [24] in their SR for the Cochrane Collaboration showed higher scores in patient satisfaction favoring GLA; however, when using the Audit of Diabetes-Dependent Quality-of-Life Questionnaire (ADDQoL) instrument, no differences were seen between the two insulins. Assessing the impact of the two insulins on QoL is important given the current differences in acquisition costs between GLA and NPH insulin in Brazil, and the need to maximize health gain within finite resources in the current economic situation. Consequently, we aimed to evaluate the impact on QoL of T1DM patients using either GLA or NPH insulin through an SR of randomized clinical trials (RCTs) and observational studies. The findings will be used to further guide pricing and other strategies for insulins in Brazil to help maximize the use of available resources for these patients. Within Brazil, CONITEC (The national Committee for Health Technology Incorporation) currently assess the potential listing and funding of new medicines based on their relative efficacy, safety, costs, and budget impact versus current standards in all or defined populations. This is based on published evidence. Subsequently, at the state level, committees including the Commission of Pharmaceuticals and Therapeutics (Comissão de Farmácia e Terapêutica–CFT) in Minas Gerais, further evaluate their relative value versus current standards for inclusion within state-wide agreed treatment guidance, with prescribing against the agreed guidance subsequently monitored [4].

2 Methods

An SR of the literature was undertaken according to the methodological guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) [26]. SRs are seen as important for scientific research as they typically reflect the best evidence on a topic, with the findings subsequently used to improve treatments and policy decisions where pertinent.

The SR protocol for this study was registered in the PROSPERO database (International Prospecting Register of Ongoing Systematic Reviews) under nº CRD42016046875 (https://www.crd.york.ac.uk/PROSPERO/display_reord.asp?ID=CRD42016046875).

2.1 Databases and Search Strategy

Available publications were selected until January 2017 in the following databases: MEDLINE (PubMed), Latin American and Caribbean Literature in Health Sciences (LILACS), Cochrane Library, and EMBASE. Several combinations of terms were used following the PICO (population, intervention, comparison and outcome) strategy: T1DM, GLA, NPH and QoL (Table 1). As a complement to the electronic search, a manual search was carried out in all included studies, as well as in the periodicals Diabetes Care and Quality of Life Research, in the years 2000 to December 2016, by two independent reviewers. A search of studies in the gray literature was also carried out in the Digital Library of the Federal University of Minas Gerais (UFMG), and in the thesis and dissertations bank of the Coordination of Improvement of Personnel of Higher Education (CAPES) and the Brazilian Digital Library of Theses and Dissertations. Table 1 describes the overview of the search strategy based on the Medline (PubMed) database, with more details contained in Appendix 1 (see electronic supplementary material). Key terms included T1DM, GLA, NPH, and QoL.

Table 1 Search strategies used to undertake the systematic review

2.2 Selection of Studies and Eligibility Criteria

RCTs and cohort studies (prospective and retrospective) were subsequently selected for the SR from the papers identified that assessed the impact of GLA versus NPH insulin on patients’ QoL.

We excluded cohort studies and RCTs that did not concentrate on QoL as the primary outcome measure, studies with oral hypoglycemic medicines concomitant with insulin in patients with T1DM, as well as studies with a sample of 30 individuals or less or that had a follow-up time of < 4 weeks. 4 weeks was considered the minimum time to effectively measure the impact of different treatments on patients’ QoL based on an earlier study from our group [14].

2.3 Data Collection and Analysis

Initially, the recovered studies in the databases were allocated on a single basis to exclude those duplicated by the EndNote software. Thereafter, two independent reviewers (PA and TS) evaluated the titles (Phase 1), the abstracts (phase 2) and the full text (phase 3). A third reviewer (VA) was asked to solve possible discordances. Sociodemographic characteristics of the patients, treatment period, and QoL were extracted in duplicate in a form, previously formulated and tested in Excel, for this purpose. A qualitative synthesis of the studies was performed, since the heterogeneity of the measurement instruments and the data did not make a quantitative synthesis possible as undertaken with previous SRs concentrating on issues such as effectiveness when measuring HbA1c levels and hypoglycemic episodes [14, 23,24,25].

The methodological quality of the cohort studies was evaluated using the Newcastle–Ottawa scale. This scale was originally developed to evaluate the quality of observational studies. It contains eight items. These include representativeness of the sample in the exposed cohort, selection of the unexposed cohort, exposure by the type of measure used (e.g., secure records or structured interviews), how the outcome of interest was assessed, whether the follow-up of the study was long enough for the hypothesis of the results to occur, and if there was adequate follow-up of the cohorts. Stars are assigned to each completed item, with the highest possible score being nine. A score above six means that the study has high methodological quality [27].

In order to assess the risk of bias in RCTs, the current recommendation of the Cochrane Collaboration is to use the Revman software (Review Manager 5.3). This program consists of two parts, in which seven domains are distributed: random sequence generation, concealment of allocation, blinding of participants and professionals, incomplete outcomes, reporting of outcome, and other sources of bias [28]. The sources of funding for the identified studies were examined for potential sources of bias, as the influence of this on the findings and their implications was seen in a previous SR comparing the different insulin types [14]. Comments regarding any conflict of interest, study financing by a manufacturer of any of the insulins, or if any of the authors were related to the pharmaceutical industry or had received fees from them, were examined and documented.

2.4 Quality-of-Life Scales

A number of QoL scales have been used in the literature to assess the impact of different treatment options on patients’ QoL in studies, including both generic and disease-specific instruments. The most prevalent scales used for studies assessing the QoL of patients with diabetes include:

  • The mixed Well-Being Questionnaire 28 (W-BQ28), which has the following domains: positive perception of well-being, negative perception of well-being, energy, and stress. For the specific part, these included positive perception of well-being related to DM, negative perception of well-being related to DM, and stress related to DM [29, 30].

  • The Well-Being Questionnaire (WBQ22), which provides a general welfare score comprising 22 items and has the following domains: depression, anxiety, energy, and positive perception of well-being. There is also WBQ12, which is a shortened version of WBQ22 with 12 items and four domains: negative well-being, energy, positive well-being, and general well-being [30].

  • The Diabetes Treatment Satisfaction Questionnaire (DTSQ). The DTSQs instrument evaluates treatment satisfaction at the beginning, with DTSQc used at the end of follow-up. DTSQ presents three domains: satisfaction with current treatment, perceived frequency of hypoglycemia, and perceived frequency of hyperglycemia. DTSQc uses the same domains as DTSQs, but it has different response options and asks respondents to assess changes in current treatment satisfaction (endpoint) compared with baseline in order to overcome a possible ceiling effect [31].

  • The Well-being Inquiry for Diabetics (WED) evaluates four areas of psychological well-being: (a) somatic symptoms related to diabetes and physical functioning; (b) concerns related to diabetes and emotional state; (c) mental health; (d) family relationships, friends network, and society [32].

  • The Diabetes Quality of Life Measure (DQOL) presents four domains: impact, satisfaction, concerns about DM, and concerns about social and vocational life [33].

  • The Audit of Diabetes-Dependent Quality-of-Life Questionnaire (ADDQoL), an instrument that has four domains: social and professional life, average weighted impact (AWI), specific QoL for DM, and current QoL that refers to the overall score [34].

3 Results

3.1 Study Selection for the Systematic Review

We found 634 publications in the electronic databases (Fig. 1). After exclusion of 99 duplicates, 535 articles were selected for title and abstract analysis. Of these, 513 publications were subsequently excluded for a variety of reasons (Fig. 1), resulting in 22 publications for complete reading. Manual searches, which included analyzing all volumes and chosen periodicals page by page without the help of any search engine and checking all references in the SRs, did not return any relevant further studies for inclusion. Finally, 8 studies with 11 publications (9 papers and 2 conference abstracts) remained for the SR (Fig. 1). Studies were excluded mainly because they included patients with T2DM; used insulins other than GLA and NPH, or concentrated on oral antidiabetics alone, or patients were taking these concomitantly with injectable insulins; were not cohorts or RCTs; or did not use any instrument to measure QoL.

Fig. 1
figure 1

Overview of the flowchart of the selection of included studies

3.2 Study Characteristics

From the eight included studies, four cohorts [35,36,37,38] (two retrospective [35, 38], two prospective studies [36, 37]) and four RCTs [39,40,41,42] were identified that met our criteria (Fig. 1). The follow-up time of patients in these eight studies ranged from 6 to 24 months; three cohort studies [35,36,37] had a long follow-up period (12 months or more) and one [38] had an intermediate time (6–12 months). All RCTs [39,40,41,42] had a long follow-up time (> 12 months). Three studies [35, 37, 39] reported no conflicts of interest and five [36, 38, 40,41,42] declared a conflict of interest with pharmaceutical companies funding the studies or fee-paying the authors. The studies evaluated approximately 1855 individuals to compare the QoL among patients with T1DM who used either GLA or NPH insulin.

Concerning the characteristics of individuals with T1DM in the identified studies, patients’ ages ranged from 5 to 40 years, on average. Two studies evaluated the QoL of pediatric individuals [35, 38] and six analyzed the impact of the different insulin preparations on the QoL of adult patients [36, 37, 39,40,41,42]. The studies had a majority of men, with the duration of T1DM ranging from 2.9 (SD ± 0.2) to 22.3 (SD ± 15.5) years (Table 2).

Table 2 General characteristics of included studies

3.3 Methodological Quality

None of the cohort studies scored a maximum of nine stars on the Newcastle–Ottawa scale, and in general, the studies obtained only a moderate quality score (Table 2). In addition, none of the studies had a low risk of bias in all seven domains. In the second domain, about allocation concealment, all studies had a high risk of bias, since the insulin bottles were identifiable. For the third and fourth domains, on blinding of participants, professionals, and evaluators, all studies had a high risk of bias (open-label). Only one study blinded the data raters. Regarding the sixth domain, about reporting of selective outcomes, one study did not report in the protocol that the QoL of patients would be collected in the Electronic case records, being considered a high risk of bias. The others were judged as having insufficient information. In the seventh domain, dealing with other sources of bias, all the studies had some kind of link with the pharmaceutical industry, declared or otherwise, being considered a high risk of bias. In general, the four RCTs had poor methodological quality (Fig. 2).

Fig. 2
figure 2

Cochrane Collaboration bias risk graph

3.4 Data Synthesis

The eight studies used eight instruments, in total, to evaluate the QoL of patients with T1DM comparing GLA and NPH insulins. Two generic instruments were used in the studies. These included W-BQ28 (mixed character) [36], and WBQ22 (generic) [40]. Four specific instruments were used to measure the QoL of patients with T1DM. These included the DTSQs and DTSQc [35, 36, 40,41,42], DQOL [38], and the ADDQol [42], with a mixed scale—the WED scale [37] —as well.

3.5 Findings

We will principally document the impact of the different insulin preparations on QoL with key details incorporated into Tables 2 and 3, which include the study design, study population and numbers, the QoL instruments used and their scores, any conflicts of interest, study duration, and the quality of the studies.

Table 3 Efficacy, effectiveness, and quality of life of included studies

In the study by Päivärinta et al. [35], there was no statistical difference between the insulins using the DTSQs. There was also no statistically significant difference in HbA1c levels between the two insulins.

Gallen and Carter [36] observed differences between baseline and the endpoints in some of the parameters studied using W-BQ28 and DTSQs, including well-being and energy, favoring GLA. The other domains showed no differences in the QoL of GLA compared with NPH insulin. When the DTSQs instrument was used, a statistically significant difference in satisfaction between the treatments was observed. In the other domains, no difference in QoL was observed between GLA and NPH. Overall, GLA had a significantly favorable impact on HbA1c compared with NPH insulins.

Manini et al. [37] found a statistically significant difference in favor of GLA insulin in three out of the four domains of the QoL scale (WED) used: symptoms, discomfort, and impact, with the overall total score statistically favoring GLA. A statistically significant difference was also observed for HbA1c favoring GLA (7.6%) compared with NPH insulin (8.4%).

None of the domains in the study of Dixon et al. [38] using DQOL showed any statistically significant differences in the scores between GLA and NPH insulin on QoL, although borderline significance when looking at HbA1c values.

Only one domain showed a difference in the study of Bolli et al. [39] using the WED, with no difference seen in HbA1c values between the two insulins.

In the three instruments used by Witthaus et al. [40], no statistically significant difference was seen in the domains of W-BQ12 and W-BQ22. In the DTSQs, a statistically significant difference was seen in favor of GLA insulin in two out of the four domains: satisfaction with treatment and perception of hypoglycemia.

Ashwell et al. [41] found statistically significant differences in favor of GLA insulin in two out of three domains in DTSQc, with one not recorded. Similarly, with DTSQc, there was one statistically significant difference in favor of GLA insulin with three not recorded. There was also a significant difference favoring GLA in two out of the three domains in the ADDQoL. There was also a statistically significant difference in terms of HbA1c favoring GLA.

Polonsky et al. [42] showed a statistically significant difference in favor of GLA insulin using the DTSQs in treatment satisfaction; however, the other domains were not recorded. No statistical difference was seen with HbA1c levels.

4 Discussion

The most used instrument to assess the impact of GLA versus NPH insulin in our SR was the DTSQs [35, 36, 40,41,42], similar to Plank et al. [23] and Vardi et al. [24].

In general, there appeared to be no difference overall in QoL comparing GLA and NPH insulins when the score of the domains of the different instruments were evaluated (Table 3), although significant differences were seen in favor of GLA insulin in a number of the domains in the studies by Gallen and Carter [36], Manini et al. [37], Bolli et al. [39], Witthaus et al. [40], Ashwell et al. [41] and Polonsky et al. [42]. This is similar to the SRs of Plank et al. [23] and Vardi et al. [24]. Our findings were also consistent with the SR performed by Singh et al. [25], which evaluated QoL as a secondary result of HbA1c outcome and GLA safety. However, the authors found that patients preferred treatment with insulin analogs over human insulins. This appeared to be due to the flexibility in the dosage between meals and satisfaction with the treatment. There were issues though with the quality of the assessed studies, with typically only moderate scores on the Newcastle–Ottawa Scale [27].

Conflicts of interest are a concern if the judgment of healthcare professionals is influenced by possible financial gain. There can also be concerns with possible ethical and bioethical considerations, centering on patients participating in the research and their well-being [43]. However, we recognize that this is not always the case. Five of the eight studies included in this SR reported some conflict of interest with pharmaceutical companies. These included Gallen and Carter [36]; Witthaus et al. [40]; Bolli et al. [39]; Ashwell et al. [41]; and Polonsky et al. [42]. However, the studies were not totally favorable towards GLA over NPH insulin, although they recorded domains that statistically favored GLA insulin. This can be a concern with one SR reporting that studies with pharmaceutical company sponsorship had a greater relative risk, approximately 27%, of publishing outcomes favorable to the investigated intervention when compared with studies that had independent sources of funding [44]. Favorable results towards GLA insulin were also seen in the SR by Marra et al. [14] in studies sponsored by pharmaceutical companies versus those studies where there was no such sponsorship. It should also be emphasized that QoL is a subjective measure and that these conflict of interest relationships between researchers, universities, and pharmaceutical companies can be persuasive, with the findings often favoring the funder [44, 45].

5 Limitations

We accept this SR only included cohort studies (prospective and retrospective), which may have limited study findings, since this study design has selection bias and confounding factors that are not controlled. There are also concerns with the external validity of RCTs. In addition, RCTs and cohorts were compared, but as no quantitative analysis was performed, statistical heterogeneity was not a problem for data comparison, although it made this more challenging.

Some studies did not present important information, which impaired a meta-analysis of the QoL instruments used in the primary studies. There was also considerable diversity among the instruments used to assess the QoL, which also precluded a full meta-analysis. Some authors also used instruments of satisfaction with treatment (DTSQs) and did not opt for QoL instruments. This is a concern as QoL is a broader construct, as defined by the WHO [16], than just satisfaction with the current treatment. The absence of an appreciable number of papers out of those initially sourced that treated QoL as a primary endpoint also made the meta-analysis and associated comparisons more difficult. Despite these limitations, we believe our findings are robust and provide guidance to the authorities in Brazil and other countries where there are still considerable acquisition cost differences between GLA and NPH insulins.

6 Conclusion

Despite the limited number of published studies comparing the QoL of patients with T1DM treated with GLA versus NPH insulin, we believe our study findings showing a relative lack of overall difference between the two are robust. This adds to the current debate about the inclusion of GLA insulin in the list of official reimbursed medicines in Brazil whilst there are still considerable acquisition cost differences. GLA, however, seems to be better accepted by users in the domain that assessed satisfaction in T1DM treatment in the DTSQs instrument. However, these findings, which are related to the patient’s therapeutic preference, should be viewed with caution and discussed in the light of other comparisons of GLA versus NPH insulin, which have principally focused on HbA1c and the safety of the different insulins. This especially given some of the concerns with the quality of the reviewed studies coupled with existing considerable differences in acquisition costs between the two insulins in Brazil.

In the future, in patients with chronic diseases such as diabetes, we would like to see more studies assessing the impact of different treatment approaches on the QoL of patients as a primary outcome measure. This can help to formulate appropriate treatment guidelines, including the place of different therapies in treatment regimens alongside considerations of effectiveness and safety. As a result, health authorities will be better able to maximize the health gain of patients within finite resources. These are considerations for the future.