Introduction

Romosozumab, a monoclonal antibody that binds and inhibits sclerostin (a negative regulator of bone formation secreted by osteocytes), has a dual effect of increasing bone formation and decreasing bone resorption [1, 2]. In two large phase 3 clinical trials, romosozumab has proved superior to placebo and oral alendronate in its ability to reduce fractures [3,4,5,6]. In the FRAME study (NCT01575834), a pivotal fracture study in postmenopausal women with osteoporosis, a year of romosozumab, 210 mg subcutaneously (s.c.) monthly, followed by a year of denosumab (60 mg s.c. every 6 months) was compared to a year of placebo monthly injections followed by a year of denosumab [3]. The romosozumab arm showed a 73% lower relative risk of vertebral fractures and a 36% lower risk of clinical fractures, by the 1-year time point, comparable to the 75% and 33% lower relative risk in the respective outcomes by the 2-year time point. Post hoc subgroup analyses from FRAME have suggested that greater efficacy might occur in high-risk patients [7].

Several analyses in a number of placebo-controlled studies have explored the interaction between treatment efficacy and baseline fracture risk characterised using the FRAX® tool [8,9,10,11,12,13,14,15]. This approach avoids the limitations of subgroup analyses by seeking to determine if there is a significant interaction between efficacy and fracture risk, with the latter handled as a continuous variable. Evidence that the efficacy of some interventions is greater in patients with higher baseline FRAX® fracture probabilities has been reported for clodronate [9], bazedoxifene [8], and denosumab [13], but not for other agents such as raloxifene [10], strontium ranelate [11], teriparatide [14], and abaloparatide [16]. We wished to test the clinical hypothesis that there would be an interaction between the efficacy of romosozumab and fracture probabilities assessed at study entry. Given the design of the romosozumab trials described above, the analysis was confined to the first year of the FRAME study.

Methods

In this independent post hoc analysis of the phase 3 FRAME trial, we used prospectively collected baseline FRAX® risk variables to evaluate the efficacy of romosozumab on fracture outcomes as a continuous function of baseline fracture probability.

Details of the FRAME study are published elsewhere [3]. Briefly, in this international, double-blind, placebo-controlled, parallel group trial, women with postmenopausal osteoporosis were randomly assigned to receive romosozumab in a blinded fashion, at a dose of 210 mg, or placebo. Randomization was stratified according to age (< 75 years vs. ≥ 75 years) and prevalent vertebral fracture (yes vs. no). Romosozumab or placebo was administered s.c once monthly for 12 months, followed by open-label denosumab (Prolia, Amgen), at a dose of 60 mg which was administered s.c every 6 months for an additional 12 months. A total of 7108 women between the ages of 55 and 90 years with a BMD T-score less than − 2.5 but at least − 4.0 at the lumbar spine or total hip were studied; all received daily supplements containing at least 1000 mg of calcium daily with vitamin D (400 to 800 IU daily). Fully anonymised, individual level study data were provided by Amgen to the Centre for Metabolic Bone Diseases, University of Sheffield.

Baseline FRAX® probabilities

Ten-year fracture probability was assessed with the FRAX® tool (version 3.11) in all women at baseline. In brief, clinical risk factors were used to estimate 10-year probability of a major osteoporotic fracture (MOF: hip, clinical spine, forearm, or humerus fracture) or of a hip fracture alone [17]. The fracture probability can be calculated with clinical risk factors alone, or with femoral neck BMD included. The clinical risk factors comprised sex, age, body mass index (BMI), and dichotomised risk variables documenting a prior fragility fracture, parental history of hip fracture, current tobacco smoking, ever long-term use of oral glucocorticoids, rheumatoid arthritis, other causes of secondary osteoporosis, and alcohol consumption of 3 or more units daily.

For the purposes of this analysis, a prior fragility fracture excluded pathological fractures, any fractures arising before the age of 18 years and fractures at sites not considered to be osteoporotic (i.e. ankle, facial bones, fingers, foot, hand, knee cap, knee, skull, and toes) [18]. One or more prevalent vertebral fractures (grades 1–3 on the Genant scale) on baseline spine imaging were included as a prior fracture.

In addition to the dependence on clinical risk factors, fracture probability varies markedly in different regions of the world [19, 20] so that specific ethnic and country models were used as appropriate. For two countries where FRAX® models were not available (Dominican Republic and Latvia), appropriate surrogate models were used (Colombia and Estonia, respectively).

Fracture outcomes

All incident fractures during the study follow-up were adjudicated and confirmed via x-ray or radiology reports. Sites of fracture, but not ICD codes, were provided and clinical fracture outcomes, regardless of the level of trauma, were allocated to four categories: any clinical fracture (all fractures including those of the feet, ankles, hands, face, and skull); any osteoporotic fracture (as any clinical fracture but excluding those of the feet, ankles, hands, face, and skull); MOF; and hip fracture. Incident vertebral fractures were documented either as a clinical vertebral fracture (identified in the presence of back pain and usually requiring radiographic assessment outside of the prespecified schedule) or a morphometric vertebral fracture. The latter were assumed to have occurred mid-way between the date of the scheduled radiograph where the fracture was found and the date of a previous scheduled radiograph, an assumption that does not impact materially on the analysis of efficacy.

Statistical analysis

This was an intention-to-treat (ITT) analysis. A Poisson model was used to study the relationship between age, the time since baseline (months), treatment and calculated 10-year probability of a major osteoporotic fracture, and their effects on the risk of fracture with only the first of any particular fracture outcome of interest being counted per patient [21, 22]. The extended Poisson regression analysis can estimate effects that vary in time, either linearly or nonlinearly, by studying the interactions between the time in study and the variable of interest (i.e. fracture risk). When there is no interaction with time, this extended Poisson regression model gives results very similar to a Cox regression model. Person-years were used (in contrast to a linear logistic model). The hazard function for fracture was assumed to be:

exp(β0 + β1 current time from baseline + β2 current age + β3 10-year probability + β4 treatment + β5 10- year probability treatment).

The variable “10-year probability treatment” tests for an interaction between efficacy and baseline 10-year probability and was handled as a continuous variable by determining if β5 > 0. Treatment interactions considered the following fracture outcomes: all clinical fractures, osteoporotic fractures, MOF, clinical vertebral fractures, and morphometric vertebral fractures. A further analysis considered the composite fracture outcomes (e.g. any clinical fracture, osteoporotic fractures, and MOF) with the exclusion of vertebral fractures, thus providing variable representing nonvertebral fractures.

The 10-year probability of a major osteoporotic fracture without the inclusion of BMD was used as the base case in the treatment interaction analyses. Interactions other than 10-year probability treatment, such as BMD T-score treatment, previous fracture treatment, and age treatment, were explored in sensitivity analysis. A two-sided p value of 0.1 for the interaction between treatment and FRAX® was considered significant. Additional exploratory analyses included those where hip fracture or nonvertebral fracture was the outcome of interest; where FRAX® was calculated with the inclusion of BMD; and finally, an examination of the effects on the any fracture outcome comparing the Latin American region study centres to the whole study population. The latter was prompted by a post hoc analysis of FRAME that showed that the risk of nonvertebral fracture in the region of Latin America was higher in the romosozumab group than in the placebo group, albeit not significantly (hazard ratio [HR] = 1.25; 95% confidence interval [CI], 0.68–2.27) [3, 7]. By contrast, among the patients outside the region of Latin America, romosozumab significantly decreased the incidence of nonvertebral fracture (HR = 0.58; 95% CI, 0.37–0.89) [3, 7].

Hazard ratios for treatment effect and 95% CI were computed as a continuous variable. For tabular presentation, HRs are shown at the 10th, 25th, 50th, 75th, and 90th percentile of fracture probability. In figures, the associations are shown as a continuous function between the 1st and 99th percentiles except where indicated.

Results

A summary description of the data used in this FRAX® analysis is provided in Table 1. Data on the clinical risk factors used in FRAX® were available in 7163 (99.7%) of the women included in FRAME. The mean probability of a major osteoporotic fracture was 9.7% and for a hip fracture was 3.9% for the base case predictor of FRAX® calculated without BMD. When BMD was used in the FRAX® calculation, the probabilities tended to be greater than those based on clinical risk factors alone, reflecting the need for a relatively low BMD as an inclusion criterion for the trial.

Table 1 Summary description of the baseline data provided for calculation of FRAX® probabilities (n = 7163)

During the first year of FRAME, a total of 178 (2.5%) women sustained one or more clinical fractures, with 131(1.8%) and 89 (1.2%) experiencing at least one osteoporotic fracture or MOF, respectively. Seventy-five (1.0%) women sustained one or more new morphometric vertebral fractures, but only 12 (0.2%) sustained clinical vertebral fractures; 22 (0.3%) women sustained a hip fracture.

Overall effects of treatment

The efficacy of romosozumab on fracture outcomes was observed within the first year of the study when the treatment was compared with placebo (Table 2). Romosozumab significantly decreased the risk of osteoporotic fracture, clinical vertebral fracture, and morphometric vertebral fracture, with reductions in the other fracture outcomes fell short of statistical significance.

Table 2 Effects of romosozumab compared to placebo over 12 months according to the fracture outcome. The current analysis used a Poisson model adjusting for age and time since baseline

Treatment interaction

There was no significant interaction between any of the variables and time. There was a significant interaction between romosozumab antifracture efficacy and the outcomes of any clinical fracture, osteoporotic fracture, and MOF (Table 3) as shown by a decrease in HRs (i.e. greater efficacy) with increasing fracture probability at baseline. For example, the reduction in any clinical fracture was 19% with the 95% CIs embracing unity at the 10th percentile (FRAX® MOF probability 3.7%), whereas at the 50th percentile (FRAX® MOF probability 7.4%), the reduction was 29% (95% CI 3–49%) (Table 3). At the 90th percentile (FRAX® MOF probability 18.9%), the reduction was 54%. The effects of romosozumab for the three categories of fracture outcomes are shown as a continuous function in Fig. 1. For each, there was greater efficacy at higher baseline probabilities. In the case of any clinical fracture, treatment with romosozumab was associated with a significant reduction in fracture risk with probabilities of 7% or more. For the outcome of osteoporotic fracture and MOF, significant effects of treatment were observed with probabilities of 9% or more and 12% or more, respectively. The interactions were no longer significant (p = 0.11–0.21) for these outcomes when FRAX® was calculated with bone mineral density (BMD) (data not shown), but the trend was very similar in that higher fracture probabilities were associated with greater efficacy.

Table 3 Hazard ratio between treatments (romosozumab versus placebo) in the first year of FRAME for various fracture outcomes at different values of 10-year probability (%) of a MOF calculated without BMD. Italicised numbers indicate that the 95% CIs for the effect estimate exclude unity
Fig. 1
figure 1

Effect of romosozumab on any clinical fracture (top left panel), osteoporotic fracture (top right panel), MOF (lower left panel), and nonvertebral fracture (osteoporotic fractures excluding vertebral fractures) (lower right panel) compared with placebo expressed as HR with 95% CIs across the range of MOF probabilities at baseline calculated without BMD

In contrast to the interaction between baseline probability and fracture outcomes that were largely composed of fractures at nonvertebral sites, the HRs were stable across the range of baseline FRAX® probabilities for vertebral fracture, both morphometric and clinical fracture (Table 3), consistent with similar treatment efficacy regardless of baseline risk. As expected, when clinical vertebral fractures were excluded from the outcomes of any clinical fracture, osteoporotic fracture, and MOF, the significance of the interactions between romosozumab efficacy and baseline fracture probabilities became even more marked (p = 0.046, p = 0.046, and p = 0.039, respectively). This is illustrated for osteoporotic fractures without vertebral fractures (i.e. nonvertebral fractures) in Table 3 and Fig. 1. Given the relatively small number of incident hip fractures in the first year of FRAME, no significant interaction was seen for this outcome, though a similar trend was observed to that at overall nonvertebral sites.

In the additional analyses looking at potential interaction between romosozumab efficacy and individual risk factors, there were no statistically significant interactions with the continuous (age, BMI) or categorical (smoking, prior fracture, and parental history) baseline variables. An exception was an unexpected interaction between baseline femoral neck BMD and efficacy (p = 0.097) in the sense that the lower the T-score, the less the effect. The differences in efficacy at different T-scores were, however, modest. For example, at a T-score of − 2.99 the HR for clinical fractures was 0.54 (95% CI 0.38–0.76). At a T-score of − 2.62, the HR was 0.41 (95% CI 0.28–0.59).

Regional variation

The baseline median fracture probabilities were lower in the Latin American centres compared with other regions for MOF (5.0% vs. 10.0%, respectively, calculated without BMD) and hip fracture (1.9% vs. 3.2%, respectively). The result of the interaction analysis for the outcome of any clinical fracture, confined to the Latin American centres, is shown superimposed on the overall study analysis in Fig. 2. The data from the Latin American countries showed a similar qualitative relationship between efficacy and FRAX®, though this fell short of statistical significance (p > 0.30) with wide confidence limits. Similar findings were observed for the outcome of osteoporotic fracture (data not shown).

Fig. 2
figure 2

Effect of romosozumab on any clinical fracture compared with placebo expressed as the HR with 95% CIs across the range of MOF probabilities (1st to 99th percentile) at baseline calculated without BMD. Dotted lines are derived from the first year of the FRAME study. Solid lines depict the first-year data from the Latin American countries

Discussion

The aim of this analysis was to determine the interaction, if any, between baseline fracture probability and the efficacy of romosozumab to allow comparison with similar analyses from placebo-controlled trials of other osteoporosis treatments. The design of the phase 3 trials of romosozumab posed challenges to fulfil this aim in that only the first placebo-controlled year of the FRAME study provided the opportunity for such an assessment. The category of “any clinical fracture” (which included fractures of the hand, feet, face, and skull) was added in this analysis to ensure that the power of the study was optimised by accounting for all fractures occurring within that 1-year period. While this study’s approach to the analysis was not identical to that used in the original publications of the FRAME trial, it is important to note that this study’s results on overall fracture risk reductions in the first year were very comparable to those reported previously [3, 23]. Importantly, the present analysis also demonstrates significant interactions between efficacy and baseline FRAX fracture probability, calculated without BMD. Hazard ratios decreased (i.e. greater efficacy) with increasing fracture probability for any clinical fracture, osteoporotic fracture, and MOF, suggesting that the benefits of romosozumab will be most marked in patients at high risk of fracture. Notably, a similar pattern was observed when BMD was included in the calculation of FRAX® probabilities, though this did not reach statistical significance. The reasons for this are not clear, but as reported, we observed an unexpected relationship between femoral neck BMD with lower romosozumab efficacy at lower BMD. It is possible that the relationship between the risk of fracture and BMD was not as expected, given that low BMD was an inclusion criterion for the trial but associated risk factors such as prior hip and moderate/severe vertebral fractures were exclusion criteria, and this may have introduced confounding. In addition, the inclusion of patients with low BMD from countries with low risk of fracture might also have contributed to this observation.

The observation that the interaction is only observed for nonvertebral fractures, and not for vertebral fractures, appears to be a novel finding. It has been appreciated for many years that the responsiveness to osteoporosis treatments varies between vertebral and nonvertebral sites; certainly, the size and speed of onset of effect of treatment have frequently been observed as greater for vertebral than nonvertebral fracture. In the current study, the majority of analyses of the effects of treatment-induced reductions in vertebral fractures across the range of baseline FRAX® probability revealed no significant interactions; the similar and stable efficacy suggests a floor/ceiling effect for each drug on vertebral fracture outcomes. For example, in the setting of anabolic therapy, teriparatide administered daily or weekly has shown reductions in vertebral fractures of 66–79% without any interaction with baseline fracture risk [14, 15]. The interaction with baseline probability for nonvertebral fractures seen with romosozumab raises questions about the mechanism of fracture reductions in the setting of increased fracture risk.

Several studies have examined the interaction between FRAX®-based probabilities with effectiveness in placebo-controlled studies, including treatment with raloxifene [10], bazedoxifene [8], clodronate [9], denosumab [13], alendronate [12], daily and weekly teriparatide [14, 15], and abaloparatide [16]. Several of these studies have shown greater efficacy against fracture in individuals at higher risk when treated with clodronate, bazedoxifene, or denosumab—romosozumab can now be added. This FRAX® dependency has marked consequences for health economic analysis, illustrated when comparing the cost-effectiveness of the two selective oestrogen receptor modulators, raloxifene and bazedoxifene [24]. The appreciation of greater efficacy at higher baseline risks, as seen with bazedoxifene, can alter conclusions drawn about cost-effectiveness, with more expensive but more effective agents potentially preferred in those at highest risk of fracture. Superior and more rapid antifracture efficacy has now been shown for anabolic treatments in head-to-head trials with antiresorptive agents, including teriparatide compared with risedronate [25] and romosozumab compared with alendronate [4]. This knowledge is starting to influence clinical guidelines and approaches to management, where there is increasing recognition that anabolic agents might be seen as first-line agents in patients at very high risk of fracture, especially where the risk might be particularly high in the short term, for example after a recent osteoporotic fracture [26, 27].

These findings are helpful in explaining apparent regional differences in the efficacy of romosozumab on nonvertebral fracture [3]. The FRAME study reported that romosozumab did not significantly decrease the incidence of nonvertebral fractures in the first year (HR = 0.75; 95% CI 0.53–1.05) with a similar effect size at the end of the second year, when both groups had received denosumab for a year. A subsequent post hoc analysis showed that the effect of romosozumab on nonvertebral fractures differed between the Latin American region (HR = 1.25; 95% CI 0.68–2.27) and other regions (HR = 0.58; 95% CI 0.37–0.89) [3, 7]. The present findings indicate that apparent regional differences in efficacy are likely related to differences in fracture probability rather than to ethnicities or other reasons. This view is supported by the observation in the present analysis that high-risk patients in Latin America appear to respond to treatment in much the same way as in the FRAME cohort in general. A caveat is that the interaction term between efficacy and FRAX® was not significant in the analysis confined to women from Latin America, but this is limited by the low number of events in this population.

This analysis has a number of strengths and limitations. Subgroup analyses, especially those undertaken post hoc, reduce the statistical power of studies and increase the risk of type 1 errors [28]. Examining for interactions using a continuous risk variable, such as baseline fracture probability, reduces the erroneous impact of categorical subgroup analyses. For example, in a post hoc analysis of romosozumab efficacy on nonvertebral fracture risk using an intervention threshold in keeping with US treatment thresholds (FRAX® 10-year probability of major osteoporotic fracture ≥ 20% or hip fracture ≥ 3%) suggested that romosozumab was ineffective below this threshold [7]. The current analysis suggests that efficacy of romosozumab can be observed at substantially lower thresholds, for example at MOF probabilities of 9–12% for the outcomes of osteoporotic or major osteoporotic fractures. The present analysis is confined to a total treatment exposure of just 1 year, a substantially shorter duration than in other placebo-controlled studies in osteoporosis. Nonetheless, the rapid efficacy of romosozumab on vertebral and clinical fracture outcomes delivers an efficacy that can then be examined for interactions.

In conclusion, there are significant interactions between baseline fracture probability and the effect of romosozumab on clinical fracture, osteoporotic fracture, and MOF when compared with placebo in the first year of the FRAME study. Furthermore, the evidence supports the view that high-risk patients in Latin America appear to respond to treatment in much the same way as in the FRAME cohort in general. The increased efficacy of romosozumab at higher levels of baseline risk should be taken into account in health economic analyses.