Introduction

Osteoporosis is a systemic skeletal disorder characterized by low bone mass and micro-architectural deterioration of bone tissue, and as a consequence an increase in bone fragility and susceptibility to fracture [1]. It has enormous health and socioeconomic implications in terms of morbidity, mortality and disability worldwide [2]. In Morocco (30 million inhabitants), the age-adjusted 1-year cumulative incidence of hip fracture has been estimated to 52.1/100,000 [95% confidence interval (CI) 40.9–63.3] in women in 2002 [3].

Bone mineral density (BMD) is widely recognized as the strongest predictor for future fracture occurrence, similar to an increased blood pressure level for the risk of stroke. Screening for low BMD by using dual energy x-ray absorptiometry (DXA), the gold standard diagnostic tool for osteoporosis, is a widely accepted strategy for identifying subjects with an increased risk of fracture. However, mass screening with use of DXA scanning is not recommended without some selection of the target population. Furthermore, because of limited availability of BMD technology in some communities and cost considerations, it has been proposed that BMD measurements be targeted to subjects with risk factors for osteoporosis. Recently, many epidemiological studies have identified clinical risk factors for osteoporosis to develop risk assessment indices. The purpose of the risk assessment indices is not to diagnose osteoporosis or low BMD, but to identify women who are more likely to have low BMD. These patients can then be referred for BMD measurements. Such indices, while not identifying all cases of osteoporosis, increase the efficiency of BMD measurement by focusing on subjects who are at increased risk [47].

The easiest to use in clinical practice is certainly the Osteoporosis Self-assessment Tool (OST), which is based simply on age and weight [8]. It was developed and validated in several studies in Asian and white populations [913]. Other risk tools are also based on age and weight, in combination with up to four additional risk factors; these include the Osteoporosis Risk Assessment Instrument (ORAI) [14], the Simple Calculated Osteoporosis Risk Estimation (SCORE) [15], and the Osteoporosis Index of Risk (OSIRIS) [16]. These four risk assessment tools have been proposed for increasing awareness of osteoporosis and for encouraging more efficient use of BMD measurements in patients who are likely to have low bone mass, especially in asymptomatic postmenopausal women.

Our aims were to assess the validity of these risk indices in a population of 986 white women from Morocco, and to compare the performance of the four tools (OST, ORAI, SCORE and OSIRIS) in identifying women at risk of low BMD and who could benefit from definitive osteoporosis evaluation using DXA.

Methods

Patients

We analysed a database that included medical data on patients either consulted spontaneously or referred for a BMD measurement between April 2004 and July 2005 to our department located at the Rheumatology and Physical Rehabilitation Centre, Military Hospital Mohammed V (University of Rabat) in Morocco. Postmenopausal women >45 years at the date of consultation were included in our sample. Referral was based on diagnostic judgment of the referring physician. Informed consent was obtained from all eligible study participants. Patients with Paget’s disease and advanced osteoarthritis were excluded. All participants underwent a structured questionnaire which concentrated on those variables for which published evidence suggested an association with osteoporosis or low BMD: current treatment for health complaints (including the presence of low back pain), fractures, presence of a disease, use of birth control pills, use of other hormones, ovariectomy, age of menopause, years between menarche and menopause (fertile years), number of pregnancies, family history of osteoporosis, use of toxic substances (alcohol and tobacco), calcium intake, and occupational exercise and sports in the present and past.

Densitometry measurements

All BMD measurements were performed on a Lunar Prodigy Vision machine (General Electric Inc). The DXA scans were obtained by standard procedures supplied by the manufacturer for scanning and analysis. The BMD measurements were carried out by two experienced technicians. Daily quality control was carried out by measurement of a Lunar phantom. At the time of the study, phantom measurements showed stable results. The phantom precision expressed as the CV (%) was 0.08. Moreover, reproducibility has been assessed recently in clinical practice and showed a smallest detectable difference of 0.04 g/cm2 (spine) and 0.02 (hips) [17]. Patient BMD was measured at the lumbar spine (L1-4) (anteroposterior projection) and the femurs (dual femur). The mean result of the measure of the two femurs (total hip) was used. BMD values, expressed in g/cm2, were converted into T scores, expressed in standard deviations (SDs), using French reference values. We used the WHO classification range to categorize subjects as normal (T >−1), osteopenic (−2.5< T≤−1), or osteoporotic (T≤−2.5). A subcategory was defined as "low BMD", for all subjects with T ≤−2.0, to allow comparison to published results for some risk indices that were based on this cut-off. Further, this cut-off is widely used in many communities to detect pre-osteoporotic patients.

Variables and risk scores

The OST, ORAI, SCORE, and OSIRIS indices were derived according to the algorithms suggested by their developers (Table 1). The following dichotomous cut-offs for DXA referral were used: <2 for OST, >7 for SCORE, >8 for ORAI and <1 for OSIRIS. Also, three risk categories were used for each index, according to their developer’s recommendations and the validation of some indices in American and European populations.

Table 1 Method of calculation of the evaluated indices

Statistical analysis

Prevalence of osteoporosis in each of these three categories was determined using the WHO criteria. Receiver operating characteristic (ROC) analyses were performed to evaluate the discriminatory performances of OST, ORAI, SCORE, and OSIRIS, and the area under the curve (AUC) was computed for each. To assess the internal validity of the indices, sensitivity was defined as the proportion of the population with low BMD correctly classified by the risk index (true positive fraction) and specificity was defined as the proportion with normal BMD correctly identified by the risk index (true negative fraction). ROC curves provided a graphical representation of the overall accuracy of a test by plotting sensitivity against (1-specificity) for all thresholds, while the AUC quantified the accuracy of the test. We also calculated the positive predictive value (PPV) and negative predictive value (NPV) to evaluate the external validity of each tool. The PPV and NPV represent the proportion of women who tested positive or negative (as classified by the four tools) and who truly had, or did not have, BMD below the T-score threshold being tested, respectively.

We evaluated OST, ORAI, SCORE, and OSIRIS at the BMD T-score thresholds of −2.5 and −2.0, to assess the performance of those indices in predicting osteoporosis and low bone mass, respectively. The ability of the tools to detect different thresholds of low BMD was also evaluated for various anatomical sites (total hip, femoral neck, L1–L4) of densitometric measurement. Statistical analysis used SPSS statistical software (SPSS Inc, Chicago, Ill).

Results

The mean age of the women in our sample was 59.8 (±8.8) years, ranging from 46 to 92 years. Table 2 shows their basic demographic data. 15% reported a non-traumatic fracture after age 45 at the wrist, rib, or hip. The prevalence of osteoporosis at all sites increased progressively with age. Of the women in our study, 33% were osteoporotic (T<−2.5) at one or more skeletal site, 45% had a low BMD (T<−2.0), and 77% were classified as osteopenic according to the WHO operational definition. Table 3 shows the performance of the four risk indices in identifying patients at various BMD measurement sites and thresholds (T score values of −2 and −2.5). Increasing prevalence of osteoporosis (T<−2.5) with ascending risk category (low, medium, high) was apparent for all four risk tools. For example, the prevalence of osteoporosis based on total hip BMD was approximately 2%, 17%, and 58% at the low (70% of women), medium (27% of women), and high (3% of women) OST risk levels. Of the women in our study classified as high risk, 91% had low bone mass and 85% had osteoporosis at any site. At the considered thresholds, OST, SCORE, ORAI, and OSIRIS identified respectively, 75%, 88%, 77%, and 76% of the patients with normal BMD who subsequently should not have been recommended for densitometry, since according to their score they were ‘low risk’.

Table 2 Characteristics of the participants (n=986)
Table 3 Prevalence of low BMD and osteoporosis by BMD measurement site and risk category

Using the dichotomous cut-off value of <2, the sensitivity of OST in identifying individuals at increased risk of osteoporosis ranged from 82% for lumbar spine to 91% at the total hip region. The corresponding specificity of OST ranged from 24% at the total hip to 34% at any given site (Table 4). SCORE had a higher sensitivity than OST but lower specificity and NPV. At the OST cut-off point of 2, and using a BMD T-score threshold of −2.5 for any site, 51% of the subjects were misclassified (most of these were false positives); the proportion of misclassified patients for single BMD sites was 45% at the total hip, 32% at the femoral neck, and 49% at the lumbar spine site. The AUC was consistently high (around 0.8) for the two hip sites, and somewhat lower for the spine (Table 5), indicating good test performance. For each combination of BMD measurement and T-score cut-off, the AUC results were similar for all four risk tools.

Table 4 Performance of the risk indices by BMD measurement site and T-score cut-off (%)
Table 5 Areas under the ROC curves for the four evaluated assessment tools by BMD measurement site and T-score cut-off

Discussion

The National Institutes of Health consensus conference defined osteoporosis as a disease of increased skeletal fragility accompanied by low BMD (T score below −2.5) and micro-architectural deterioration [18]. The preferred sites for diagnostic purposes are BMD measurements made at the hip, either at the total hip or the femoral neck [19]. The availability of new effective treatments for osteoporosis emphasised to recommend BMD measurements for patients considered at high risk. Several guidelines have been developed to select which patients should undergo DXA testing [2023].

In our study involving white women aged 45 years and more, the OST successfully identified most women with hip osteoporosis and low BMD with a sensitivity of 85%, specificity of 62% and NPV of 98% at the total hip site. The OST, based only on age and weight, performed as well as the more complex risk assessment indices (SCORE, ORAI, and OSIRIS) in identifying women at low risk of osteoporosis who would not need DXA testing (98% of patients classified as low risk with OST do not have osteoporosis at the total hip). In a population-based sample of postmenopausal Japanese women [10], the OST had a sensitivity of 90% and specificity of 45%. This tool was similarly validated in Korean [9], Filipino [12] and Belgian women [11]. Table 6 shows the prevalence of postmenopausal women with hip osteoporosis by OST risk level in the current study and other published studies.

Table 6 Prevalence of postmenopausal women with osteoporosis by OST risk level in the current study and other published studies

Avoiding unnecessary testing among low risk patients can substantially reduce cost for the community and the patient (social health insurance only covers 15% of Moroccan population). For example, in this sample of Moroccan women, 69% of the women were classified as low risk using OST, and thus would not need to be referred to DXA testing. Of these, only 2.3% actually had osteoporosis based upon total hip BMD. At the same time, using OST to select the 31% of women at risk for BMD measurements, 80% of all women with osteoporosis at the hip level would be identified.

A risk assessment tool such as OST does not need to have both high sensitivity and high specificity. Indeed, there is no risk of harm to the patient from unnecessary treatment or invasive diagnostic testing in case of a false-positive result from OST. Although some women who do not have low BMD were classified as increased risk (false positives) and would be referred for testing, some of these women would have undergone testing anyway if OST were not used. Furthermore, treatment for low BMD would only be initiated upon confirmation by DXA: a safe and non-invasive diagnostic procedure.

Despite differences in ethnicity of the studied populations and the reference databases used to calculate T scores, we found the performance of OST in this sample similar to that reported among Asian, American and European women especially at the hip level. As it was the case in Richy et al. study [11], the results were slightly less concordant at the lumbar spine. We used the same risk tool categories as Geusens et al. [13] to compare the performance of the various tools in a different population, and found very similar results for OST at the T-score ≤−2.0 and ≤−2.5 cut-offs. The SCORE, ORAI, OSIRIS, and OST are validated risk indices that can help physicians and public health authorities to focus DXA testing on individuals at increased risk of osteoporosis. All four risk tools performed similarly, and identified a significant proportion of all women at low risk who would not benefit from BMD measurements. These indices have been studied and validated in several different large populations.

As with most studies, our study has limitations. For example, the subjects in our sample were either referred or came in spontaneously for osteoporosis evaluations, and may differ in some ways from the general population. Another limit is the use of French BMD reference range for calculating T-scores. However, it was felt that awaiting the Moroccan reference database, now under study in our centre [24], the French population has more similarities with the Moroccan population than the US population used in NHANES III. Moreover, hip incidence measured in Morocco showed similar results to the south of France [3]. Another limitation of this kind of studies is that it does not take into account the risk of fracture, which is the main purpose of treating osteoporosis. DXA itself has a low sensitivity and about half of patients who fracture do not have densitometric osteoporosis. However, the main objective of our study and similar studies is to identify patients with low BMD in order to avoid unnecessary exams, which is very important in developing countries, while developing a fracture risk assessment tool needs prospective longitudinal cohorts.

In summary, measuring BMD is the best method of identifying patients with osteoporosis to consider for treatment. However, measuring BMD in all postmenopausal women is not feasible especially in developing countries such as Morocco. The OST tool can help target BMD measurements to women at risk. Its high sensitivity, acceptable specificity and very high negative predictive value allows for the safe exclusion of healthy women, in order to reduce unnecessary bone density examinations in those who are at lowest risk. ORAI, SCORE, and OSIRIS provided similar performance in this setting. The main advantage in using the OST index is that it is the simplest and quickest to calculate, and thus to use in general practice as a first-line prescreening tool in post-menopausal women.