Introduction

Several multi-factorial diseases (e.g., diabetes, hypertension) are defined on the basis of important risk factors for the clinical outcome. Hypertension, for example, is defined from the measurement of blood pressure that provides information on the likelihood of stroke. Hypercholesterolaemia and osteoporosis are also examples. In the case of osteoporosis, bone mineral density (BMD) is measured both to provide a diagnosis, and to yield information on fracture risk [1].

Many well-controlled prospective studies with dual energy X-ray absorptiometry (DXA), particularly in elderly women, indicate that the risk of fracture about doubles for each SD reduction in BMD [2, 3]. The measurement of a risk factor for diagnostic use, however, can only capture one aspect of the likelihood of the outcome when the disease is multifactorial, and in osteoporosis, assessment with BMD captures a minority of the fracture risk. For example, the annual incidence of hip fracture increases approximately 30-fold between the ages of 50 and 90 years; but, from the known relationship between BMD and fracture risk and the loss of bone with age, it is expected that hip fracture risk would rise only fourfold [46]. Thus, the increase in risk with age is approximately sevenfold greater than can be explained on the basis of BMD alone.

The imperfect capture of risk with BMD alone poses several problems for the clinical assessment of fracture risk. In the context of population screening with BMD alone, the performance characteristics of the test are less than optimal in terms of the trade-off between sensitivity and specificity [1, 7, 8]. Thus, osteoporotic fractures affect a substantial minority of the population, but intervention thresholds based on BMD alone lack sensitivity over most reasonable assumption, i.e., the detection rate is low. For example, at the age of 50 years the proportion of women with osteoporosis is approximately 5% [4]. The proportion of these who will fracture in the next 10 years (i.e., positive predictive value) is about 20%. The detection rate for these fractures (sensitivity) is, however, low, and 96% of fragility fractures would arise in women without osteoporosis given a test like BMD where the fracture risk doubled for each SD decrease [9]. Low sensitivity is one of the reasons why wide-spread population screening is not widely recommended in women at the menopause [1]. Moreover, a normal BMD measurement is no guarantee that a fracture will not occur.

The use of risk factors that add information on fracture risk independently of BMD improves the sensitivity of the assessment for any specificity [8, 9]. Over the past several years, we have undertaken a series of meta-analyses to identify clinical risk factors for fracture that provide independent information on fracture risk [3, 1016]. The analyses were based on the primary data from prospective population based studies. This permits the inter-dependence of each of the candidate risk factors to be examined so that they can be accurately combined for clinical use. In the case of the clinical risk factors, BMI is used as a continuous variable, so that its distribution is preserved by the addition of dichotomous variables [8]. For hip fracture prediction, the gradient of risk (increase in fracture risk per standard deviation increase in risk score) for the CRFs is comparable to the use of DXA alone, and the gradient of risk is further enhanced by the addition of BMD to the CRFs [17]. These inter-relationships, assessed from multiple populations and validated in independent cohorts, permit the more accurate identification of individuals who will fracture, so that the average risk in any given proportion of the population identified for treatment will be higher [8].

These considerations indicate that assessment can be improved by the integration of clinical risk factors with or without BMD. In other words, treatment should be directed not only on the basis of T-score for BMD, but also on the independent contribution of other validated risk factors. However, the use of different metrics (the T-score, gradients of risk, risk ratios, etc) is confusing for clinicians and patients alike. For this reason, there has been interest in the development of algorithms that express absolute risk, or the probability of fracture within a given time period [1820], as has been done for cardiovascular and other diseases [2125]. The aim of the present study was to develop a model for the clinical assessment of fracture probability in men and women based on the epidemiology of the UK.

Methods

Cohorts

In order to identify the relevant risk factors, we used baseline and follow-up data from nine prospective population-based cohorts comprising the Rotterdam Study, The European Vertebral Osteoporosis Study (later the European Prospective Osteoporosis Study (EVOS/EPOS), The Canadian Multicentre Osteoporosis Study (CaMos), Rochester, Sheffield, Dubbo, a cohort from Hiroshima and two cohorts from Gothenburg. Details of each of the cohorts have been recently published elsewhere [1016].

Baseline and outcome variables

Height and weight were measured using standard techniques in all cohorts. Body mass index (BMI) was calculated as weight divided by height squared (kg/m2) and used as a continuous variable. BMD was assessed at the femoral neck by DXA with the exception of the two Gothenburg cohorts in which BMD was measured elsewhere. Femoral neck BMD was used as a continuous variable (cohort-specific Z-scores excluding the two cohorts from Gothenburg). The clinical risk factors utilised were those identified from the previous meta-analyses [3, 1016]. These comprised a parental history of hip fracture, exposure to systemic glucocorticoids, a prior history of fragility fracture, current smoking, high intake of alcohol (3 or more units daily on average) and the presence of rheumatoid arthritis as an indicator for secondary osteoporosis.

Fracture ascertainment in the primary cohorts was undertaken by self-report (Sheffield, EVOS/EPOS, Hiroshima) and/or verified from hospital or central data-bases (Gothenburg, CaMos, DOES, Sheffield, EVOS/EPOS, Rochester, Rotterdam).

Models used

Four models were constructed from the risk factor analysis to compute fracture probabilities. These comprised the probability of hip fracture, with and without BMD, and the probability of other major osteoporotic fractures (clinical spine, forearm and proximal humerus), with and without BMD. For each model, fracture and death as continuous hazard functions were computed using a Poisson regression [26, 27] and detailed in the Appendix. In brief, for each risk factor, all significant interactions terms that were identified by meta-analysis were entered (with age, time, sex and the risk factor) with and without BMD [17]. Interactions that were significant for hip fracture risk were also entered into the model for other osteoporotic fractures, and also included in the model for death. Where interactions noted in the “mega-analyses” were no longer significant for both hip fracture and other osteoporotic fractures, these were omitted in a step-wise manner by dropping the interaction with the largest p value. For the death hazard, all significant interactions for fracture risk were included and thereafter omitted if appropriate in a step-wise manner, as described for the fracture hazard.

Epidemiology

The incidence of hip, forearm and proximal humerus fractures in the UK was taken from Singer et al. [28]. Vertebral fractures can be classified as clinically overt fracture, that is, a symptomatic fracture that comes to clinical attention, or a morphometric fracture, which includes both symptomatic and asymptomatic fractures. For the purposes of this study the clinical definition of a vertebral fracture was used. Because data on vertebral fracture risk in the UK are scarce, with a great deal of variation in reported rates [29], the clinical vertebral fracture incidence was calculated by assuming that the ratio of clinical vertebral fracture to hip fracture would be similar in the UK compared to Sweden [30]. The validity of the assumption cannot be directly tested, but in the case of long bone fractures, there is a close concordance of relative incidence at different fracture sites despite marked differences in absolute risk [31]. Swedish fracture risk data were taken from Kanis et al. [32]. The models were then calibrated so that the mean hazard functions of fracture (and death) equalled that of the UK, as detailed in the Appendix.

Input and output variables

Individual patient details comprise age (50 to 90 years), sex, weight (in kg) and height (in cm). BMI is automatically computed from height and weight. Dichotomised risk variables are then entered:

  • A prior fragility fracture (yes/no)

  • Parental history of hip fracture (yes/no)

  • Current tobacco smoking (yes/no)

  • Ever long-term use of oral glucocorticoids (yes/no)

  • Rheumatoid arthritis (yes/no)

  • Other causes of secondary osteoporosis (yes/no)

  • Daily alcohol consumption of three or more units daily (yes/no)

A distinction is made between rheumatoid arthritis and other secondary causes of osteoporosis. Rheumatoid arthritis carries a fracture risk over and above that provided by BMD [11]. Whereas this may hold true for other secondary causes of osteoporosis, the evidence base is weak. Of the many secondary causes of osteoporosis, the following have been consistently documented to be associated with a significant increase in fracture risk:

  • Untreated hypogonadism in men and women, e.g., bilateral oophorectomy or orchidectomy, anorexia nervosa, chemotherapy for breast cancer, hypopituitarism [3340]

  • Inflammatory bowel disease, e.g., Crohn’s disease and ulcerative colitis [4143]. It should be noted that the risk is in part dependent on the use of glucocorticoids, but an independent risk remains after adjustment for glucocorticoid exposure [44].

  • Prolonged immobility, e.g., spinal cord injury, Parkinson’s disease, stroke, muscular dystrophy, ankylosing spondylitis [4550]

  • Organ transplantation [5154]

  • Type I diabetes [5558]

  • Thyroid disorders, e.g., untreated hyperthyroidism, over-treated hypothyroidism [5963]

Whereas there is strong evidence for the association of these disorders and fracture risk, the independence of these risk factors from BMD is uncertain. It was conservatively assumed, therefore, that the fracture risk was mediated via low BMD, but with a risk ratio similar to that noted in rheumatoid arthritis. From an operational view, where the field for rheumatoid arthritis is entered as ‘yes’, a risk is computed with and without BMD. If the field for other secondary osteoporosis is also filled as ‘yes’ this does not contribute to the calculation of fracture probability. Conversely, where the field for rheumatoid arthritis entered as ‘no’, and the field for secondary osteoporosis is ‘yes’, the same β coefficients as used for rheumatoid arthritis contribute to the computation of probability where BMD is not entered. In the presence of BMD, however, no additional risk is assumed in the presence of secondary osteoporosis, since its independence of BMD is uncertain.

If any of the fields for dichotomous variables is not completed, a negative response is assumed. Fractures probability can then be calculated. The output (without BMD) comprises the 10-year probability of hip, clinical spine, shoulder or wrist fracture and the 10-year probability of hip fracture (Fig. 1).

Fig. 1
figure 1

Input and output for the FRAX™ model

Femoral neck BMD can additionally be entered either as a Z-score or a T-score. The transformation of Z- to T-score and vice versa is derived for the NHANES III database for female Caucasians aged 20–29 years [64]. When entered, calculations give the 10-year probabilities as defined above with or without the inclusion of BMD.

Results

Clinical risk factors

The contribution of single clinical risk factors is shown in Table 1 for men and women aged 65 years. In the example, the BMI is set at 25 kg/m2. In the absence of BMD, hip fracture probabilities were higher in women than in men. Each clinical risk factor had a different significance for hip fracture probability, with a family history having the least and a prior fracture the greatest weight in the absence of BMD. The rank order of weighting differed for the major osteoporotic fractures. For example, a parental history of hip fracture was a strong risk factor, close to that provided by a prior fragility fracture. The contribution of age to the assessment of probability is shown in Table 2. In both men and women, there was greater than 100-fold difference in hip fracture probability between the age of 50 years (no risk factors) and 80 years (parental history of hip fracture).

Table 1 Ten-year probability (%) of a major osteoporotic fracture or hip fracture in men and women aged 65 years according to the presence of a single clinical risk factor
Table 2 Ten-year probability of fracture (%) at the sites shown for men and women with a BMI of 25 kg/m2 according to age and the presence or absence of a single risk factor in the absence of BMD

The presence of more than one risk factor increased fracture probability in an incremental manner. For example, in women aged 65 years with a BMI of 25 kg/m2, the 10-year hip fracture probability was 1.3%. With one clinical risk factor, the probability ranged from 1.7% to 3.2%, depending on the risk factor. With two risk factors, the range was 2.5–6.6%, and with 3, 4 and 5 risk factors the ranges were 3.8–11%, 6.7–17% and 13–24%, respectively. In the presence of all six clinical risk factors, the 10-year probability of hip fracture was 30%.

The effect of several clinical risk factors on the 10-year probability of a major osteoporotic fracture is shown in Fig. 2 for a woman aged 65 years and a BMI of 25 kg/m2. In women with rheumatoid arthritis, there was a 33% increase in fracture probability compared with those without rheumatoid (from 9% to 12%). When women additionally took oral glucocorticoids and had a prior fracture there was a fourfold increase in fracture probability.

Fig. 2
figure 2

Effect of combinations of clinical risk factors on the 10-year probability of a major osteoporotic fracture in women aged 65 years and a BMI of 25 kg/m2. [05Ca201]

A higher or lower BMI had a marked effect on fracture probability (Fig. 3). For example, in women at the age of 65 years the 10-year hip fracture probability in the absence of risk factors was 2.3% with a BMI of 20 kg/m2, but was fourfold lower (0.6%) at a BMI of 40 kg/m2. At each level of BMI, fracture probability increased with an increasing number of clinical risk factors.

Fig. 3
figure 3

Ten-year probability of hip fracture (%) in women aged 65 years according to the number of clinical risk factors with a BMI of 20 kg/m2 (left-hand panels) and a BMI of 40 kg/m2 (right-hand panels). The intervals reflect the different weights afforded by different risk factors and provide a range of probabilities. [05Ca070]

Bone mineral density

The relationship between fracture probability and BMD is shown in Table 3. In general, fracture risk increased with increasing age and decreasing T-score. At low T-scores, however, fracture probabilities decreased with age, a phenomenon more marked in men than in women. For example, at the age of 50 years, the 10-year probability of hip fracture was 16.0% in men with a T-score of −4 SD and this fell progressively with age, so that at the age of 70 years, the probability was 12.9% and at the age of 90 years was 8.3%.

Table 3 Ten-year fracture probability (%) in men and women from the UK with a BMI of 25 kg/m2 and no clinical risk factors according to age and T-score at the femoral neck

The differences in probabilities between men and women were much less marked for any given age and T-score than the differences seen with the use of clinical risk factors alone. At a fixed BMD (−2.5 SD in Table 1), the difference in probability between men and women was less evident. For example, the hip fracture probability ratio (women/men) with glucocorticoid use was 1.8 in the absence of BMD but 0.9 at a T-score of −2.5 SD. In the case of all major osteoporotic fractures, probabilities remained higher in women than in men at a T-score of −2.5 SD but the difference in probabilities between men and women was less marked.

Figure 4 compares the probability of hip fracture in men and women with a prior fracture according to age. In the absence of BMD there was approximately a twofold difference in probability between men and women. When the BMD was set at an average for women (Z-score = 0), there was little difference in probabilities up to the age of 75 years. Thereafter, men had a lower probability than women due to the higher death hazard.

Fig. 4
figure 4

Ten-year hip fracture probability (%) in men and women with a prior fracture according to age. The left-hand panel gives probabilities in the absence of BMD. In the right-hand panel probabilities are shown at an average BMD for women at each specific age (i.e., a Z-score = 0). A BMI is set at 24 kg/m2. [05Ca065]

The clinical risk factors added to the information provided by BMD. Isopleths for 10-year fracture probability are shown in Fig. 5 for the combination of several of the clinical risk factors. The clinical risk factors were somewhat less predictive in the presence of BMD in the models. For example, in women aged 65 years and a BMI of 20 kg/m2, the 10-year hip fracture probability in the absence of BMD ranged from 2.3% in the absence of clinical risk factors to 27.9% with four risk factors. When BMD was set constant at a T-score of −2.5 SD, the range was from 2.8% with no clinical risk factor to 19.7% with four risk factors (Fig. 6). The effect of variations in BMI was even more markedly affected by BMD. For example, in women aged 65 years the 10-year hip fracture probability was 2.3% at a BMI of 20 kg/m2 and decreased progressively at higher levels of BMI to 0.6% with a BMI of 40 kg/m2. When BMD was fixed, hip fracture probability remained constant irrespective of BMI (Fig. 7).

Fig. 5
figure 5

10-year probability of a major osteoporotic fracture in men and women aged 65 years according to T-score and clinical risk factors. Body mass index is set at 25 kg/m2. [05Ca136]

Fig. 6
figure 6

Ten-year probability of hip fracture (%) in women aged 65 years, according to the number of clinical risk factors. The left-hand panel shows the probabilities without BMD at a BMI fixed at 20 kg/m2. The right-hand panel illustrates the effects with BMD fixed at the threshold for osteoporosis. [05Ca073]

Fig. 7
figure 7

Effect of variations in BMI on 10-year hip fracture probability (%) in women aged 65 years. Probabilities with BMD are computed at a T-score of −2.8 SD. [05Ca074]

Discussion

The present study provides a model for the assessment of fracture probability in men and women. The model (FRAX™) uses data derived from nine cohorts from around the world, including centres from North America, Europe, Asia and Australia and has been validated in 11 independent cohorts with a similar geographic distribution [17]. The use of primary (but anonymized) data for the model construct permits the interaction of each of the risk factors to be determined to improve the accuracy whereby fracture probability can be computed. The large sample permitted the examination of the general relationship of each risk factor by age, sex, duration of follow up and, for continuous variables (BMD and BMI), the relationship of risk with the variable itself in a manner hitherto not possible. The use of primary data also eliminates the risk of publication bias. The validity of the clinical risk factors identified are supported by the expected relationships between BMD and fracture risk [3].

In the present study, the FRAX™ model has been calibrated to the epidemiology of the UK, but could be calibrated to any country where the epidemiology of fracture and death is known (see Appendix). FRAX™ models for the UK and some other countries are available through the web (http://www.shef.ac.uk/FRAX/index.htm). The approach uses easily obtained clinical risk factors to estimate risk. The estimate can be used alone or with BMD to enhance fracture risk prediction.

Several previous studies have developed models to predict fracture risk from the combination of clinical risk factors and BMD [18, 6578]. The risk factors used include activities of daily living, impaired cognition, liability to falls, poor overall health, history of stroke, seizure disorder and several different medications. A limitation of many of these studies is that, with the exception of the SOF study [18], and one of the GPRD studies [76], they have not been tested in other cohorts. The model described in this paper has been validated in 11 independent prospectively studied cohorts with in excess of one million patient years [17].

The use of risk factors for case finding presupposes that the risk so identified is responsive to a therapeutic intervention. To test this hypothesis, it would be necessary to recruit patients selected on the basis of the risk factor(s) to a randomised controlled trial (RCT). The risk factor that is best evaluated in this way is BMD, and indeed the vast majority of therapeutic studies have recruited patients on the basis of low BMD as recommended by regulatory agencies in the US and Europe [79, 80]. In recent years, other trials have recruited patients on the basis of age, gender, a prior vertebral fracture and current exposure to glucocorticoids irrespective of BMD, and have shown therapeutic effects similar to those noted in RCT’s based on BMD selection [14, 8183].

For other risk factors, comparable data are lacking. In the absence of empirical data, an alternative approach is to demonstrate that the presence (or absence) of a risk factor does not adversely influence therapeutic efficacy against fractures. Several studies have shown no significant interaction between response to treatment and the presence or absence of the risk factors used in the present study including age, height, family history of fracture, low body weight or BMI, smoking, alcohol intake or prior non-vertebral fracture [8488]. In contrast, some risk factors may be associated with less therapeutic efficacy. For example, patients selected on the basis of risk factors for falling may respond less completely to agents that preserve bone mass than patients selected on the basis of low BMD [89]. This concern is greatest in models that omit BMD, because pharmacological agents may not be equally effective across the entire range of BMD [90].

The present model has several unique features. FRAX™ uses Poisson regression to derive hazard functions of death and fracture. Such hazard functions are continuous as a function of time, unlike Cox’s regression for which the corresponding hazard functions are zero except at the time points of a fracture or death. There are also several advantages of the Poisson model over logistic regression analysis. Logistic regression does not take account of when a fracture occurred, nor whether individuals without a fracture died or when death occurred. Secondly, for the assessment of 10-year probabilities by logistic regression, the observation period should be for 10 years. Moreover, information longer than the 10-year period cannot be used for analysis. The cost of ignoring information when fractures occur and whether and when deaths occur is on the precision of the estimate. In simulation experiments, the Poisson model gives the same precision as logistic regression with fewer numbers of individuals. In our own simulations in the present context (data on file), the Poisson model gave the same precision as logistic regression with half the number of individuals. Finally, the Poisson model allows adjustments to be made for time trends. The ability to use several Poisson models permits the use of data from different sources to integrate fracture and death hazards, and to calibrate to different countries.

A further feature of the FRAX™ model is that it takes account of deaths from all causes. In several recent models of disease probability, this has not been accounted for [2325]. For example, the probability of stroke has been determined as a function of age, race, smoking, body mass index, atrial fibrillation, HbA1c, systolic blood pressure, ratio of total to HDL cholesterol and duration of diabetes [25], but the risk of dying from other reasons was not taken into account. In the context of osteoporosis, fracture probabilities are markedly underestimated when no account is taken of the competing death hazard [27, 35, 36]. For example, in a study of men followed after orchidectomy, the cumulative incidence of fractures was 19% after 15 years, but the figure was 40% when deaths were considered as a competing event [36].

FRAX™ also takes account of the impact of risk factors on the death hazard. For example, smoking and low BMD are risk factors for fracture but also significant risk factors for death. Thus, at very low T-scores for BMD, hip fracture probabilities decrease with age (see Table 3), in part related to the higher mortality associated with the lower values for BMD.

There are several limitations that should be mentioned. As with nearly all randomly drawn populations, non-response bias may have occurred. The effect is likely to exclude sicker members of society, and may underestimate the absolute fracture risk for example by age. The analyses also have significant limitations that relate to the outcome variables and the characterisation of risk factors. The definition of what was considered to be an osteoporotic fracture was not the same in all cohorts, but the effect of this inconsistency is likely to weaken rather than strengthen the associations that were found. For the hip fracture outcome, the definition was similar in all cohorts, and may explain in part the higher risk ratios associated for this fracture rather than for osteoporotic fracture. Also, the analyses were confined to clinical fractures, and the results might differ from vertebral fractures diagnosed by morphometry or as an incidental radiographic finding.

There are also limitations with the risk factors themselves. In the case of BMI, this was chosen rather than weight as the measure for body composition. This has the advantage that there is less variability across countries and between sexes. A potential drawback is that BMI can be influenced by height loss associated with vertebral deformities. Therefore, in individuals with important loss of height, the risk conferred through BMI could be underestimated [91]. The use of maximal attained height, rather than current height, might be a solution in the future, if it were shown that fracture risk prediction could be improved.

Further problems relate to the construct of the questions to elicit the presence or absence of risk factors, which varied between cohorts. These included questions on family history, prior fracture, smoking and glucocorticoid use. The effect of this heterogeneity is likely to weaken rather than strengthen the associations found. Recall is also subject to errors and was not validated in any of these cohorts. This is particularly problematic in the elderly. In addition, the validity of self-reported alcohol intake is notoriously unreliable [92]. Indeed, alcohol consumption was significantly less in both men and women than that assessed in the UK [93]. Given that these studies were prospective, however, much of this error (with the exception of alcohol intake) should be random, giving rise to non-directional misclassification. Thus, the associations may actually be stronger than reported here. Any underestimate may have limited consequences for case-finding, since the populations to be tested are similar to the populations interrogated. Biases that arise have more significance where causality is inferred.

A further limitation is that several of the clinical risk factors identified take no account of dose-response, but give risk ratios for an average dose or exposure. By contrast, there is good evidence that the risk associated with excess alcohol consumption and the use of glucocorticoids is dose-responsive [14, 94]. In addition, the risk of fracture increases progressively with the number of prior fractures [95]. These limitations are nearly all conservative.

It should be acknowledged that there are many other risk factors that might be considered for incorporation into assessment algorithms. These include BMD at other skeletal sites, ultrasonography, quantitative computed tomography and the biochemical indices of bone turnover. The available information was too sparse to provide a meta-analytic framework, but they should be incorporated into risk assessment algorithms when they are more adequately characterised. Notwithstanding, the present model provides a mechanism to enhance patient assessment by the integration of clinical risk factors alone and/or in combination with BMD.

The application of this methodology to clinical practice will demand a consideration of the fracture probability at which to intervene, both for treatment (an intervention threshold) and for BMD testing (assessment thresholds). These are currently being developed for the UK, based on cost-effectiveness analyses [96]. Intervention thresholds developed for the United Kingdom may not be applicable to other countries. The 10-year probability of fracture varies markedly in different countries [97]. For countries with low hip fracture rates, as found in developing countries, the relative risk at which intervention is cost-effective will be higher, though the absolute risk at which intervention is cost-effective would not change assuming comparable costs. Intervention thresholds would, however, change with differences in costs, particularly fracture costs, which vary markedly world wide. There is also the issue of affordability or willingness to pay for a strategy. The gross domestic product (GDP) per capita provides an index of affordability. The GDP varies markedly in different regions of the world. In the UK, the GDP per capita is estimated at US$ 25,300 in 2002, as compared with US$ 7,000 in Turkey. Thus, for the same fracture risk and the same costs, treatment will be less affordable (at least to health services) in Turkey than in the UK. Nevertheless, individuals in Turkey, rather than society as a whole, may be willing to pay “United Kingdom prices” for health care. There is also a marked heterogeneity in the proportion of GDP devoted to health care, and in the proportion of the population at risk from osteoporotic fracture (i.e., elderly people) [98, 99]. For all these reasons, it is important to define intervention and assessment thresholds on a country by country basis that takes into account the setting for service provision and willingness to pay, as well as considerations of absolute costs.