figure b

Introduction

In the last 20 years, glucagon-like peptide-1 receptor agonists (GLP-1RA) have been used as one of the recommended pharmacotherapy options for individuals with type 2 diabetes mellitus, especially for large subgroups of patients with high cardiovascular risk and other comorbidities such as chronic kidney disease [1]. In several studies, GLP-1RA have been proved to reduce glucose levels, to ameliorate the metabolic syndrome and to provide cardiorenal benefits [2,3,4,5]. Furthermore, GLP-1RA have shown substantial weight-reduction effects [6], and in recent years several compounds have been authorised for weight management in people with overweight or obesity. Since then, GLP-1RA have received considerable media and influencer attention which, combined with intensive commercial practices, have led to a popularity storm and an explosion in the use of these drugs, resulting in global shortages of GLP-1RA drugs [7,8,9].

In contrast, there is a growing concern that GLP-1RA may be linked to increased risk of suicidal ideation and self-injury (SIS), following reports of the Icelandic Medicines Agency in July 2023 identifying a disproportionate number of cases of SIS in individuals taking liraglutide and semaglutide. Previously, other weight-loss drugs have been removed from the market for similar reasons [10], and regulators worldwide are currently reviewing safety data on GLP-1RA [11,12,13]. Although preliminary regulatory-led analyses of post-marketing data have not found conclusive evidence of a causal link [14], and in the absence of evidence from large, randomised clinical trials, observational studies adequately designed to assess the comparative safety of treatments, employing an active comparator, new user design, are warranted to evaluate the risk of SIS in individuals treated with GLP-1RA.

In this study, we aimed to assess the risk of SIS in new users of GLP-1RA to treat type 2 diabetes mellitus in individuals with obesity, when compared with new users of sodium-glucose cotransporter 2 inhibitors (SGLT-2i).

Methods

Design, setting and data sources

This is a real-world cohort study combining several population-wide databases from the Valencia Health System Integrated Database (VID). The Valencia Health System (VHS) is a comprehensive structure of hospitals, primary care facilities and other public resources managed by the government of the region of Valencia in Spain providing free, universal healthcare services (besides drug cost‐sharing) to 98% of the region's five million inhabitants. VID is a set of publicly owned, population-based healthcare, clinical and administrative electronic databases in the region that can be linked by means of a single personal identification number, and provides comprehensive information for the region’s population covered by the VHS since 2008. VID includes sociodemographic and administrative data (sex assigned at birth, age, nationality) as well as healthcare information such as inpatient and outpatient diagnoses, procedures, laboratory data, pharmaceutical prescriptions and dispensing linked at the individual level (including brand and generic name, formulation, strength and dosing schedule/regimen), hospitalisations, primary care electronic medical record data, mortality, healthcare utilisation and public health data [15].

Participants

We included all individuals aged 18 years and over who initiated either GLP-1RA or SGLT-2i treatment for type 2 diabetes from 1 January 2015 to 31 December 2021. Patients filling a prescription for either GLP-1RA or SGLT-2i medication and not having filled a prescription for a glucose-lowering drug (except for metformin) in the previous 365 days were considered new users. Taking into account that SGLT-2i and GLP-1RA are recommended as therapeutic alternatives by many clinical guidelines, usually as second-line treatments after first-line metformin, new users of SGLT-2i appear to be a suitable active comparator group. The index date was the day of first refill of a study drug. Participants were required to have obesity at baseline, defined as BMI ≥30, or a diagnosis of obesity using International Classification of Diseases, Ninth and Tenth Revision (ICD-9 [http://www.icd9data.com/2007/Volume1/default.htm] and ICD-10 [https://icd.who.int/browse10/2019/en]), registered in the VID medical record in the 365 days before the index date, and to be taking metformin on the index date. GLP-1RA are only authorised for and used in the study setting in individuals with a BMI ≥30; their prescription is subject to a prior authorisation procedure, compelling the prescriber to have its prescription validated—by a medical inspector who verifies that the prescription is compliant with the authorised indication—before it is accepted for public funding and dispensing [16]. In this way, we only included new SGLT-2i or GLP-1RA users with obesity when initiating therapy, in order to produce groups as similar as possible at baseline. Use of metformin in the index date was defined by having a prescription for metformin issued in the 60 days before the index date. Finally, people without general VHS pharmacy coverage (mainly certain Spanish government employees whose prescriptions are reimbursed by mutual societies for civil servants and are thus not included in the pharmacy databases of the VHS), and people not registered in the municipal census (non-residents or temporary residents), were excluded because of limitations on follow-up (see Fig. 1 flow chart).

Fig. 1
figure 1

Study flow chart

Variables

We included the following patient sociodemographic, lifestyle and clinical variables (see Table 1): age (18 to 44 years; 45 to 64; 65 to 74; 75 and over); sex assigned at birth; income level (retrieved from the pharmaceutical copayment system that establishes different levels of copayment based on the following income categories: people earning less than €18,000/year, €18,000 to €100,000/year, more than €100,000/year, population exempt from copayment due to socioeconomic vulnerability (labelled in our study as the ‘low resources’ group), and people without income data (labelled as ‘not available’); registered tobacco and alcohol use in VID in the 365 days preceding the index date; active comorbidities in VID at baseline (diagnostic codes in the VID electronic medical record may be activated at the onset of a disease and deactivated after; typically diagnostic codes for chronic conditions remain active for life), including several psychiatric comorbidities (heart failure, dementia, hypertension, liver disease, renal disease, coronary heart disease, congestive obstructive pulmonary disease, malignancies, depression, sleep disorders, anxiety, substance abuse, personality disorders, psychotic disorders, other psychiatric disorders), previous suicidal ideation or self-injury; and BMI at baseline (30–35, 35–40, and ≥40 kg/m2). See electronic supplementary material (ESM) Table 1 for ICD, and ESM Table 2 for Anatomical Therapeutic Chemical (ATC) codes used for covariates and drugs included in the study.

Table 1 Participant characteristics at baseline, overall and for SGLT-2i and GLP-1RA new users

Outcomes

Primary outcomes were SIS. Outcomes were retrieved from different real-world databases in VID (primary care electronic records, in-hospital and outpatient hospital records and emergency room records), via ICD codes registered during healthcare encounters (see ESM Table 1). In the main per-protocol analyses, participants were followed from the index date to first mutually exclusive incidence of SIS, discontinuation of study drug (defined as more than 60 days without drug supply), initiation of other study drug, or switch to other glucose-lowering drug (defined as filling a prescription for any glucose-lowering drug other than the baseline treatment or metformin and not refilling the assigned drug for 60 days), death, or end of follow-up. In secondary, intention-to-treat analyses, participants were followed from the index date to first SIS event, death, or end of follow-up, and were analysed according to the treatment assigned at baseline, irrespective of actual exposure during follow-up.

Analysis

First, we described patient characteristics, overall and for GLP-1RA and SGLT-2i users (see Table 1). The p values were estimated using χ2 tests for categorical variables and t tests for continuous variables. Second, as the main analysis, we used a per-protocol approach to estimate the increased risk of SIS in GLP-1RA vs SGLT-2i users. The per-protocol approach evaluates the comparative effect of actual exposure to the treatment strategies assigned at baseline. We estimated the incidence rates by 1000 person-years of the first SIS event, overall and for each group. Second, we carried out multivariable Cox regression modelling using our set of covariates: age (as a categorical variable, reference [ref.] 18 to 44 years old), sex assigned at birth (ref. men), all comorbidities and lifestyle variables included as categorical variables (yes/no), and BMI (ref. 30–35 kg/m2). To adjust for potential confounding, we used inverse probability of treatment weighting (IPTW) based on propensity scores, a method that allows balancing without losing generalisability [17]. Propensity scores for each outcome were calculated based on the probability of initiating treatment with GLP-1RA taking into account the same covariates, to generate patient-specific stabilised weights. Covariate balance between the weighted exposure cohorts was assessed using standardised mean differences, with standardised differences <0.10 suggesting adequate balance [18]. Third, we performed several sensitivity analyses. We provided an intention-to-treat analysis, where we compared the effect of being assigned to the treatment strategies at baseline, regardless of whether or not the individuals continued to follow the strategies during follow-up. We also conducted several per-protocol stratified analyses: by the more prevalent psychiatric comorbidities in our cohort (depression, anxiety and sleep disorders); by sex; and by obesity levels; and we provided p values for interaction. Finally, to deal with participants with no BMI information at baseline, we also carried out analyses excluding individuals with no BMI information, in addition to an analysis using multivariate imputation using chained equations for BMI missing values (see ESM Table 3 for code employed for analysis). Statistical significance was defined as p<0.05. Ethics approval was obtained from the Clinic University Hospital of Valencia research ethics board (2022/164) with a waiver of informed consent. All analyses were performed using STATA version 14 and R version 3.6.0.). The investigators were granted access to the databases used to create the study population and performed several quality checks (consistency assessments, range and logic checks) before the linkage of the databases using a pseudonymised single identification number. The study was reported using the RECORD checklist for observational studies using routinely collected health data.

Results

We included 3040 individuals initiating treatment with GLP-1RA and 11,627 with SGLT-2i (Fig. 1). Mean age was 59 years, 53.7% were men, 64.7% earned less than €18,000/year, 71.2% had hypertension, 43.1% had anxiety, 36.0% had sleep disorders and 23.7% had registered tobacco use (Table 1). When compared with participants initiating treatment with SGLT-2i, those in the GLP-1RA group were younger (55 vs 60 years, p<0.001), were more likely to be women (54.4% vs 44.2%, p<0.001), had more anxiety (49.4% vs 41.5%, p<0.001), sleep disorders (43.2% vs 34.1%, p<0.001) and depression (24.4% vs 19.0%, p<0.001), and were more obese (35.1% of individuals with BMI ≥40 vs 15.1%, p<0.001). After propensity score weighting, standardised mean differences between groups were <0.1 for all covariates, showing adequate balance between groups at baseline after adjustment (Table 1).

Weighted incidence rates were similar in both groups (Table 2). In the main per-protocol analyses, mean follow-up was 483 days in the GLP-1RA and 397 days in the SGLT-2i group. We found no evidence that GLP-1RA increased the incidence of SIS (HR 1.04; 95% CI 0.35, 3.14). Intention-to-treat analyses (mean follow-up: 970 days for SGLT-2i users and 992 days for GLP-1RA users) resulted in an HR of 1.36 (95% CI 0.51, 3.61). In analyses excluding individuals with no BMI information and using multiple imputation for BMI missing values, respective HRs were 0.89 (95% CI 0.26, 3.14) and 1.29 (95% CI 0.42, 3.92) (Table 2). Stratified analyses showed no differences in the risk of SIS when prescribed GLP-1RA or SGLT-2i between participants with and without depression, anxiety or sleep disorders, between women and men, and between individuals in different obesity categories (ESM Fig. 1).

Table 2 Risk of suicidal ideation or self-injury among new users of GLP-1RA vs SGLT-2i

Discussion

In this population-based, propensity-weighted cohort study, we found no evidence of an association between GLP-1RA and increased risk for SIS when compared with SGLT-2i, when prescribed to treat type 2 diabetes in patients with concomitant obesity. Our findings do not support an increased risk of SIS associated with GLP-1RA. All 95% CIs were compatible with no increase in risk. However, CIs of the observed associations are wide because of the small number of events. As a result of this, protective or adverse effects cannot be fully excluded. Because of the low frequency of SIS outcomes and their potential severity, and based on our estimates, we cannot rule out either a threefold increase or a decrease in SIS rates among people with type 2 diabetes and obesity treated with GLP-1RA. Large trials randomising tens of thousands of patients would be required to accurately detect an effect of GLP-1RA on SIS risk; in the absence of such trials, evidence from high quality pharmacoepidemiological studies employing population-based, observational data, such as this study, is warranted.

GLP-1RA are indicated for diabetes and obesity, which in turn are concomitant risk factors for depression and SIS. Available evidence, however, is difficult to interpret. For instance, a reciprocal association between obesity and overweight and depression has been observed [19], whereas in most studies elevated BMI is consistently associated with lower suicide completion [20]. Whether obesity is associated with suicide attempts or ideation remains unclear, as only a few, methodologically flawed studies have assessed these associations, providing heterogeneous findings, both between studies and in key risk groups [21]. Plausible mechanisms have been proposed that could explain an association of GLP-1RA with suicidality; other authors support a potential anti-depressant effect of GLP-1RA, others a lack of association. For instance, GLP-1RA have been tested in people with serious mental illness, who could be at a high suicidality risk, showing metabolic benefits with no increase in psychopathology or occurrence of SIS [22].

To date, evidence for an association between GLP-1RA and SIS is limited to a few preliminary reports from pharmacovigilance systems and pharmacoepidemiological studies. Randomised controlled trials involving GLP-1RA have not detected suicidality signals, but these studies were underpowered to detect rare events such as SIS, and they selectively exclude individuals with psychiatric comorbidity [23,24,25,26], which are prevalent in daily practice. Studies from the US Food and Drug Administration (FDA) have found a disproportionate reporting of suicidal ideation and ‘depression/suicidal’ thoughts with semaglutide and liraglutide, but not for suicidal behaviour, suicide attempts and completed suicide [27,28,29,30]. Reports from the European Medicines Agency (EMA) were due in November 2023, but the EMA asked for more data from manufacturers in December 2023, and final results are still unreleased [11, 31]. To date, only one study into this subject has been published, with more suicidal events being reported associated with semaglutide and liraglutide when compared to other GLP-1RA [32]. However, pharmacovigilance reporting in public databases is subject to many major limitations, such as heterogeneous reporting, including notoriety and ripple bias, double reporting, the lack of a population denominator or enough clinical detail to enable comparison between agents and over time [20, 22], therefore no causal association link can be inferred from such studies.

Two propensity score matched observational studies have been conducted to date, either evaluating suicidality risk in patients using semaglutide vs non-GLP-1RA glucose-lowering agents, or in those using any GLP-1RA vs users of dipeptidyl peptidase-4 (DPP-4) inhibitors. Both studies found a protective effect of GLP-1RA, which was consistent in several analyses for different indications (obesity or diabetes) and patient profiles (with or without previous depression or suicidal ideation) [33, 34]. In this way, available observational evidence to date refutes concerns of an increase in suicidality risk related to GLP-1RA treatment. However, both studies are based on a large but not representative sample of the US population, and generalisability of results is limited. Also, both provide intention-to-treat estimates only, and were unable to evaluate either medication adherence or the effect of actual exposure to GLP-1RA [33]. Our study adds to the scarce available evidence on the association between GLP-1RA and risk of SIS by providing evidence from a South European population of five million inhabitants. We employed a new user, active comparator design; we chose a suitable comparator group; we employed propensity scores based on IPTW to obtain balanced pseudo-populations at baseline; we provided both per-protocol and intention-to-treat estimates; and we performed several stratified analyses. In all analyses including stratified analyses per sex assigned at birth, we did not detect an association between SIS risk and GLP-1RA when compared with SGLT-2i. Our findings do not support an increased risk of SIS when taking GLP-1RA; however, rarity of SIS events calls for a cautious interpretation of our results.

Our study is subject to some limitations. First, the VID databases gather several sources of real-world clinical practice data and contain information as registered by health professionals during routine clinical practice, but data are not specifically prepared for research. In this sense, studies based on real-world clinical information like the VID are at risk of well-known biases such as differential recording, misclassification bias or missing data. For instance, identification of suicidal outcomes by means of ICD codes registered during routine clinical practice databases has been associated with a slight under-registration of cases, which may result in underestimation of the association between drug exposure and SIS outcomes [35], and studies validating suicidal classification have concluded that ICD codes have good positive predictive value but low specificity [36]. Second, despite accounting for many relevant covariates in the adjustment and employing IPTW techniques, we may have missed some potentially relevant information and thus we cannot rule out residual confounding. For instance, covariates such as income or BMI strata included categories for missing data, which may introduce bias in observational studies. However, results of sensitivity analyses were, overall, comparable to those of our main analysis, suggesting robustness of the observed associations. Also, given the observational nature of the study, we could expect the presence of indication bias, a type of confounding bias that occurs when a symptom or sign of disease is judged as an indication for a given therapy, and is therefore associated both with the use of a drug and with a higher probability of an outcome related to the disease for which the agent is indicated [37]. In this sense, our results must be interpreted with caution. Third, suicidal outcomes are rare events; because of the low incidence of SIS, and based on our findings, it is not possible to rule out either a threefold increase or a decrease in its occurrence among people treated with GLP-1RA. Therefore, association estimates should be interpreted with extreme caution. Fourth, coding decision-making in real-world practice may be subject to variability, a potential bias frequent in real-world studies. Fifth, information on inpatient medication is not available in VID, which may result in marginal misestimation. Sixth, stratified analyses resulted in some groups with a relatively low number of participants and events, where risk estimates should be considered as exploratory findings. Finally, the generalisation of our results to other settings outside Spain, or even to other Spanish regions, should be approached with caution. However, even if findings in one population cannot be generalised to a new setting if the association of interest is modified by patient characteristics, settings, or treatment variations which differ in the new setting, our findings are comparable to those observed in pharmacovigilance and pharmacoepidemiologic studies that do not find increased risk of SIS associated with GLP-1RA.

In view of the steady market growth of GLP-1RA drugs and prospects of potential expansion to hugely profitable and self-selling indications such as weight management or Parkinson’s disease [38], research is urgently needed to clarify the relationship between the use of GLP-1RA and mental health, as well as to identify individuals at a higher risk of suicidal ideation and self-injury. Further studies, including final evaluations from regulatory bodies, are warranted to discard a causal link between GLP-1RA and suicidality (shortly before publication of this article, EMA released a news report in line with our findings [39]).