Introduction

The ability to sleep decreases while insomnia increases in older individuals [1, 2]. These disturbances may result from normal age-related physiological changes, or they may be secondary to a variety of factors, such as underlying disease. Various drugs are commonly used to improve sleep disturbance. According to national registers, 33% of the population in Sweden aged at least 50 years has received at least one prescription for zopiclone, zolpidem, or zaleplon (z-drugs) [3], the drugs used most commonly for insomnia. However, according to recent practical guidelines of the American Academy of Sleep Medicine, the evidence for z-drugs and other drugs for sleeping disorders is weak [4]. Furthermore, the use of z-drugs and other hypnotics has been associated with unfavorable side effects, including drowsiness, gait impairment, reduced cognitive function [5], and an increased risk of fracture [6, 7]. Perhaps the most serious side effect associated with the use of hypnotics is hip fracture [8], as many of the patients lose independence and 25% die within the first year after the event [9].

Based on the risk of side effects, short-acting z-drugs are generally recommended over more long-acting hypnotics, such as benzodiazepines and neuroleptics [4, 10]. Z-drugs are also suggested to result in a more natural sleep pattern [11] and lack of tolerance and dependency compared with most other hypnotics [12]. However, to our knowledge, the risk of side effects associated with short-acting and long-acting hypnotics has not been compared in a randomized controlled study or large observational study. The association between hypnotics and fractures has also not been evaluated before therapy is started and after it is ended. The purpose of the present study was to compare the risks of fracture associated with the use of z-drugs and the long-acting hypnotic propiomazine. Another aim was to critically evaluate the association between hypnotic use and fracture.

Materials and methods

Data sources

This study was conducted with data from three Swedish databases: the Prescribed Drug Registry, the National Patient Registry (NPR), and the Cause of Death Registry. The Prescribed Drug Registry consists of records for all drug prescriptions filled at Swedish pharmacies since July 2005. The NPR includes records of all diagnoses made in inpatient care settings since 1987 and specialist outpatient care settings since 2001. The Cause of Death Registry contains data on causes and dates of death for all Swedish residents. These data were linked by Statistics Sweden using unique 10-digit civil registration numbers, which all Swedish residents have. Statistics Sweden also anonymized the data before receipt by the principal investigator (PN) and provided access to kinship and socioeconomic data, such as education, income, and early disability pension receipt. Diagnoses in the study cohorts were tracked in the NPR using International Classification of Diseases, 10th revision (ICD-10) codes (available since 1997). The present study was approved by the ethical committee of Umeå university, by the National Board of Health and Welfare and by Statistics Sweden.

Study cohorts

All persons aged ≥ 50 years on 31 December 2005 (n = 3,341,706) were considered for inclusion. We excluded individuals who filled prescriptions for propiomazine (n = 140,602, 4.3%) or z-drugs (n = 477,868, 14.3%) before 1 January 2007 to increase the likelihood that therapy began on the first recorded prescription date (data on prescriptions prior to July 2005 were unavailable). Thus, persons who filled prescriptions for propiomazine (n = 239,253, 7.2%) or z-drugs (n = 613,306, 18.4%) at least once between 1 January 2007 and 31 December 2016 were considered to be users. After excluding individuals with suspected inaccurate data, such as a fracture or drug dispensation recorded after a person’s date of death, 237,396 propiomazine users and 604,457 z-drug users remained. Each user was matched with a non-user from the rest of the cohort based on birth year, birth month, sex, and Swedish or non-Swedish citizenship. Non-users who died before the corresponding users were dispensed their last doses of propiomazine or z-drugs were excluded and replaced with other non-users from the rest of the cohort. Unmatched individuals were excluded. This process yielded 233,609 propiomazine user/non-user pairs and 591,136 z-drug user/non-user pairs. The third and fourth cohorts consisted of all full sibling pairs from the total cohort with discordant propiomazine (n = 83,594) and z-drug use (n = 153,314), respectively. The baseline date for each pair was the date of treatment initiation. Sibling pairs in which the drug-naïve sibling died before baseline were excluded. The purpose of the subcohorts of siblings was to adjust for potential uncontrolled confounding due to familial and genetic factors that would not be captured in medical records.

Outcomes

The main study outcome was fracture (ICD-10 codes S22–S82, excluding S62 [fracture of the hand]). The Swedish NPR does not differentiate between new diagnoses and follow-up examination data. To avoid counting the same fracture more than once, only the first and last fractures in each individual were analyzed, and these events had to be separated by at least 365 days for the second diagnosis to be included. In sensitivity analyses, we also examined the secondary outcomes of hip fracture (ICD-10 code S72), stroke (ICD-10 codes I61–I64), and dementia (ICD-10 codes F00, F01, F039). NPR data have been validated in detail, with positive predictive values (PPVs) of 85–95% [13,14,15]. Notably, the PPV for hip fracture in this register exceeds 95% [16].

Statistical analysis

Baseline characteristics are presented as means and standard deviations, unless indicated otherwise. The baseline for each matched case-control pair was set as the date on which the user filled his or her first prescription.

To test whether the association between propiomazine or z-drug use and the risk of fracture diagnosis was time dependent, we evaluated Schoenfeld residuals using the estat phtest command in the Stata software (version 12.1; StataCorp LP, College Station, TX, USA). After determining that it was, we used flexible parametric models, which unlike Cox regression models allow the inclusion of time-dependent covariates, for survival analyses. Retrospective and prospective fractures were analyzed independently using three degrees of freedom and knots at default positions [17]. The analyses were conditional, and thus adjusted for sex, age, and citizenship, but no other adjusting variable was used.

To investigate the magnitude of the association between drug use and fracture risk, conditional logistic regression models for six time frames were used. Associations were analyzed during days 1–90, 91–181, and 182–365 before and after the initiation of therapy. In each analysis, each pair in which a participant had a fracture or died in any time frame closer to the baseline date was excluded. Ninety days after the last prescription was filled, users and corresponding controls were excluded from further analyses. Finally, if not censored before, individuals were censored on 31 December 2016. The clogit command in the Stata software was used to fit maximum likelihood (fixed-effect) models to estimate odds ratios with the dichotomous dependent variable of interest, i.e., propiomazine or z-drug use. The likelihood was then calculated relative to each group, i.e., conditional likelihood was used. The first model was unadjusted, although adjusted for age, sex, and citizenship by design. The second model was adjusted for civil status (four categories), education (five categories), early retirement pension receipt, 17 diagnoses at baseline (all diagnoses listed in Tables 1 and 2, except fracture), and baseline use of four drugs. Associations in the year before baseline were adjusted for differences estimated at 1 year before baseline. Data from the sibling cohorts were analyzed in a similar fashion, but only fully adjusted models are presented. In the total matched cohorts, separate analyses were performed according to subgroups defined by sex, age group, depression diagnosis, use of certain medications, and receipt of one or several prescriptions for propiomazine or z-drugs. Unadjusted estimates are presented for all intervals in the subgroup analyses when the estimate for at least one interval was found to be unstable in fully adjusted models. The Stata software (versions 12.1 and 15.1) and SPSS software (version 23; IBM, Armonk, NY, USA) were used to fit the statistical models and to illustrate the results graphically. P values < 0.05 were considered to be significant.

Table 1 Baseline characteristics of the propiomazine study cohort
Table 2 Baseline characteristics of the zopiclone/zolpidem/zaleplon study cohort

Results

Baseline characteristics

The characteristics of the matched cohorts based on propiomazine and z-drug use are presented in Tables 1 and 2, respectively. The mean ages at baseline in the propiomazine and z-drug cohorts were 69 and 72 years, respectively. In both cohorts, all baseline characteristics differed significantly between users and non-users. The greatest differences at baseline included higher prevalences of benzodiazepine use, glucocorticoid use, and depression or use of antidepressant drugs in propiomazine and z-drug users compared with non-users.

Fracture risk

The risks of fracture in the propiomazine cohort before and after the initiation of therapy are presented in Fig. 1. The risk associated with propiomazine use was greatest in the 90 days preceding the initiation of therapy (odds ratio [OR], 2.94; 95% confidence interval [CI], 2.73–3.16; Table 3). This risk was somewhat attenuated after adjustment for confounders (OR, 2.52; 95% CI, 2.28–2.79). In the first 90 days after initiation of therapy, this risk was reduced by more than half (OR, 1.42; 95% CI, 1.27–1.60) and the risk remained constant thereafter (Figs. 1 and 2, Table 3). The pattern was similar for hip fracture, with the greatest risk seen in the 90 days before the initiation of propiomazine use (Table 3). The pattern of greater fracture risk before than after the initiation of therapy was consistent in all subgroups defined according to age, certain diagnoses, and one or several prescriptions for propiomazine (Table 3). The risk of fracture after termination of therapy could be analyzed in 182,589 pairs, where one individual in each pair from the original cohort had previous use of propiomazine (Fig. 3). The first year after last prescribed dose, previous users still had an increased risk of fracture (OR, 1.28, 95% CI, 1.21–1.36), after adjustment for all confounders.

Fig. 1
figure 1

Associations between propiomazine treatment initiation and fracture. Flexible parametric models for the matched cohort of propiomazine users and controls (n = 467,218). Conditional analyses were performed using four degrees of freedom and knots at default positions. The gray area represents the 95% confidence interval

Table 3 Odds ratios (ORs) for fracture in propiomazine users relative to non-users. Associations are presented for intervals before and after the index date (start of treatment in users), adjusted for confounders according to civil status, education, early retirement pension receipt, diagnoses, and drug use, as presented in Table 1
Fig. 2
figure 2

Associations between z-drug treatment initiation and fracture. Flexible parametric models for the matched cohort of z-drug users and controls (n = 1,182,272). Conditional analyses were performed using five degrees of freedom and knots at default positions. The gray area represents the 95% confidence interval

Fig. 3
figure 3

Associations between previous propiomazine use and fracture after end of therapy. Kaplan-Meier curve for the outcome of fracture in previous users of propriomazine and corresponding controls. Cumulative incidence of fracture and 95% confidence intervals (95% CI) are presented

The risks of fracture in the z-drug cohort before and after the initiation of therapy are presented in Fig. 2. As in the propiomazine cohort, the risk associated with the use of z-drugs was greatest in the 90 days preceding therapy initiation (OR, 4.10; 95% CI, 3.86–4.35; Table 4). This increased risk was more than halved after the initiation of therapy (OR, 1.67; 95% CI, 1.56–1.80; Table 4), and the risk declined further with increasing follow-up time (Table 4). The pattern was similar for the outcome of hip fracture, with the greatest risk seen before the initiation of z-drug use (Table 4). The pattern of greater risk before than after the initiation of therapy was consistent in all subgroups defined according to age, certain diagnoses, and one or several prescriptions for z-drugs (Table 4). The risk of fracture after termination of therapy could be analyzed in 370,213 pairs, where one individual in each pair from the original cohort had previous use of z-drugs (Fig. 4). The first year after last prescribed dose, previous users still had an increased risk of fracture (OR, 1.20, 95% CI, 1.15–1.26), after adjustment for all confounders. Finally, individuals on the higher strength of the respective z-drug (10 mg, n = 73,615) had a lower risk of fracture than individuals on a lower strength (n = 517,521) both the year before (OR, 0.82, 95% CI, 0.77–0.88) and after initiation of therapy (OR, 0.90, 95% CI, 0.84–0.97), after adjustment of all confounders.

Table 4 Odds ratios (ORs) for fracture in users of zolpidem/zopiclone/zaleplon (z-drug) users relative to non-users. Associations are presented for intervals before and after the index date (start of treatment in users), adjusted for confounders according to civil status, education, early retirement pension receipt, diagnoses, and drug use, as presented in Table 1
Fig. 4
figure 4

Associations between previous z-drug use and fracture after end of therapy. Kaplan-Meier curve for the outcome of fracture in previous users of z-drugs and corresponding controls. Cumulative incidence and 95% confidence intervals (95% CI) are presented

Sensitivity analysis results

The risk patterns associated with fracture in propiomazine users and z-drug users were tested in several sensitivity analyses. In the first analysis, the risk was investigated in the two subcohorts of sibling pairs with discordant drug use. The patterns associated with drug use in the sibling cohorts were similar to those in the main cohort, with the greatest risk of fracture seen before treatment was initiated (Tables 3 and 4). Finally, we investigated the risks of stroke and dementia associated with use of the drugs of interest. The patterns were similar to those observed for the outcome of fracture, with the greatest risk seen before the initiation of treatment with propiomazine and z-drugs (Tables 3 and 4).

Discussion

This study showed that the initiation of treatment with propiomazine and z-drugs is associated strongly with an increased risk of fracture. However, this risk was even higher before initiation of therapy. The strength of associations increased from 1 year before the baseline date, peaked close to this date, and then fell after the initiation of treatment with hypnotics. The associations had similar patterns in siblings, women and men, different age groups, and different subgroups defined according to certain diagnoses and use of certain drugs. It is also of interest that the association remained in the first year after therapy was ended for both propriomazine and the z-drugs. Finally, the patterns were similar in sensitivity analyses for the outcomes of stroke and dementia.

The associations after the initiation of therapy between hypnotic use and fracture confirm the results of previous observational studies of the use of other neuroleptics [18,19,20,21] and those of meta-analysis of z-drug use [6, 7]. However, to our knowledge, no previous large observational study has critically evaluated this association by also analyzing the risk of fracture before the initiation of therapy and after end of therapy. In our study cohorts, the risk of fracture was greatest in the last 3 months before the initiation of treatment. This risk was significantly higher than after the initiation of therapy, and the pattern was true for all four cohorts analyzed. This was especially true for the short-acting z-drugs, and the results strongly suggest that the association found between hypnotic use and fracture after initiation of therapy is not causal. Thus, the greater risk of fracture before beginning treatment with hypnotics probably explains the greater risk of fracture after treatment initiation. This explanation is supported by the fact that the associations remained after therapy with hypnotics was ended. This is also supported by the greater prevalence of all considered diseases in users of hypnotics. Yet, disease severity is not indicated in the registry data used, which likely contribute to residual confounding in the present study, as in other observational studies. Finally, register studies cannot measure factors such as general frailty, which increases the risks of many outcomes, including fracture [22]. Our results may represent real-life situations in which individuals have conditions that influence sleep, according to our results a fracture, and are prescribed hypnotics to improve accompanying sleep disorders because of fracture occurrence. These patients would have a greater risk of a new fracture, not necessarily because of hypnotic use, but due to existing conditions that increase the risks of falling and fracture. During treatment with hypnotics, the underlying condition may improve and the patients’ fracture risk is reduced compared to before therapy was started. Another less likely explanation for the main results is that different underlying factors influence the risk of fracture before and after the initiation of treatment with hypnotics. If so, it would still be difficult to explain the remaining increased risk after therapy is terminated, and the fact that a higher strength of z-drugs was associated with lower risk of fracture. It is of interest that we found a quite similar association between antidepressants and fractures in a recent study [23], drugs that previously also have been associated with an increased risk of fractures [24, 25].

We further evaluated the causality of the association between hypnotic use and fracture in sensitivity analyses. In particular, we evaluated the associations between hypnotic use and the outcomes of stroke and dementia. In these analyses, the patterns of association with hypnotic use were similar to that seen for fracture, with the greatest risk observed before the start of treatment. With respect to causality, hypnotic use could theoretically increase the risk of stroke [26, 27], although the results of the present study strongly suggest that the association after the initiation of therapy represents residual confounding and reverse causality. Irrespectively, hypnotic use cannot cause dementia within weeks after the initiation of treatment. Instead, undiagnosed early dementia was likely present in this cohort in the period before the initiation of therapy [28]; this condition is often accompanied by anxiety and depression, resulting in decreased sleep quality.

Despite the obvious pitfalls associated with the evaluation and comparison of side effects of different drugs using observational data, cohort studies are often used for this purpose. In a well-cited study of the risks of side effects of different types of antidepressants [29], selective serotonin inhibitors were associated with greater risks of falls and other side effects compared with tricyclic antidepressants. Another aim of the present study was to critically evaluate any difference in fracture risk after the initiation of propiomazine and z-drug use. If only the risk after the initiation of therapy was assessed, z-drug use would be associated with slightly greater risks not only of fracture, but also of stroke and dementia. However, examination of the associations also before treatment initiation shows that the associations after baseline likely reflect the increased risk before the initiation of therapy, and thus residual confounding and reverse causality.

Several limitations of this study should be considered. The registry-based nature of the study resulted in the lack of some important information, such as whether and how often patients took the medications they had obtained at pharmacies. In addition, no information was available on confounders such as physical activity, eating habit, weight, or smoking habit. Chronic obstructive pulmonary disease data were available as a surrogate for smoking, but this surrogate is inexact. Alcohol intoxication data were available as a surrogate for excessive alcohol use, but this surrogate is also inexact. Information was also lacking for some other drugs of potential interest, such as benzodiazepines, neuroleptics, histamine H1 blockers, melatonin, and anti-epileptics. Although access to data on more confounders and more accurate estimation thereof would be of interest, such differences would not likely have changed the results of this study, as the risk of fracture was compared before and after the initiation of treatment. The strengths of the present study include the examination of a nationwide cohort and the lack of data loss during follow-up, both of which increase the external validity of the results.

In conclusion, this study showed that use of hypnotics is associated with a greater risk of fracture after the age of 50 years in the Swedish population. However, this increased risk is not likely to be causal, as the risk of fracture was greater before than after the initiation of treatment and remained similar after end of therapy. Given these results, similar investigation of reported associations between the use of other drugs and fracture, such neuroleptics other than propiomazine, would be of interest. Finally, the results of the present study emphasize the importance of not making inferences based on observational data.