Appendix
Global analysis
In the following paragraph, we describe the methodology used to weight the odds ratio, the area under the curve, and to define the weighted T-score for a given specificity and sensitivity.
The average weighted odds ratio (ORw) is defined as follows for one standard deviation decrease:
$$OR_w = {{\sum\limits_{i = 1}^n {W_i \times e} ^{ - SD_i \times \beta _l } } \over {\sum\limits_{i = 1}^n {W_i } }}$$
(1)
where i represents each study included in the analysis, and SDi the corresponding standard deviation over the population. βi is the estimate extracted from the logistic regression for a given study and Wi is the weighted score which could be either 0.5, 1, 2, or 3 depending on the design of each study. Similarly, the 95% confidence interval of the ORw (95% CI ORw) is:
$$95\% .CI.OR_w = {{\sum\limits_{i = 1}^n {W_i \times e} ^{\left( { - SD_i \times \beta _i } \right) \pm \left( {1.96 \times SD_i \times SEE_i } \right)} } \over {\sum\limits_{i = 1}^n {W_i } }}$$
(2)
with SEE
i
being the standard error of estimate of the logistic regression for a given study.
Based on the same approach, the area under the curve was weighted and averaged as follows:
$$AUC_w = {{\sum\limits_{i = 1}^n {W_i \times AUC_i } } \over {\sum\limits_{i = 1}^n {W_i } }}$$
(3)
where i represents each study included in the analysis, AUC
i
the corresponding area under the curve, and W
i
the weighted score which could be either 0.5, 1, 2, or 3 depending on the design of each study.
Once a threshold was calculated for each study based on the 90% sensitivity and 80% specificity, the weighted average was then calculated for the raw data (using the ROC analysis) and the corresponding T-score. As such, the weighted T-score, for example, is the following:
$$T.score_w = {{\sum\limits_{i = 1}^n {W_i \times T.score_i } } \over {\sum\limits_{i = 1}^n {W_i } }}$$
(4)
where i represents each study included in the analysis, T.scorei the threshold corresponding T-score at either 90% sensitivity or 80% specificity, and W
i
the weighted score which could be either 0.5, 1, 2, or 3 depending on the design of each study.
Individual-level meta-analysis vs the "meta-like" analysis
To check if our developed approach gives results in the expected range, we compared the average weighted odds ratio and area under the curve with the one calculated from an individual-level meta-analysis. As an example for the hip fracture study, the results are given in Table 6. The results are very close to our approach. The small difference observed as compared with our model is that the individual-level meta-analysis does not take into account the different design of the study, thereby giving the same weight for a small cross-sectional study as for a large prospective study.
Table 6. Results of hip and fracture study. OR odds ratio, AUC area under the curve
Other thresholds
Besides the 80% specificity threshold, we also calculated the 90% specificity in both models (hip-fracture and DXA osteoporosis-based, respectively). Comparative results are given in Table 7.
Table 7. Comparative results of study models
It was decided that threshold based on the 90% specificity would be too selective for screening purposes; thus, we used the 90% sensitivity and 80% specificity for our model.