Introduction

Exercise (EX) and adequate physical activity is recommended by clinical practice guidelines worldwide as part of the management of osteopenia and osteoporosis [1]. Thereby, osteopenia is considered an intermediate stage between normal bone mineral density (BMD) and osteoporosis [2]. In 2017, 21 million US citizens aged over 50 [3] and 32 million European people across all ages in 2019 had an osteoporosis diagnosis [4]. Women are more affected than men [3, 4]. Direct medical costs of osteoporosis in the USA were predicted to be more than $16.9 billion in 2005 with estimated annually costs amounting to 25.3 billion dollars by 2025 [5]. In Europe, an economic burden of €56.9 billion was attributed to osteoporosis. Of these, 64%, 33%, and 3% were fracture, long-term disability, and pharmacological intervention costs [4]. According to the World Health Organization (WHO) [2], osteoporosis is defined as a progressive systematic skeletal disease characterized by decreased bone mass and micro-architectural deterioration of bone tissue or the presence of a fragility fracture. These fragility fractures are the fourth most common cause of chronic disease in Europe (EU) and occur mainly in the spine, pelvis, and femur. A loss of just 10% of hip bone mass can increase the risk of hip fractures by a factor of 2.5 [6]. It is estimated that 33% women and 20% men ≥ 50 years old worldwide will experience osteoporotic fractures during their lifetime [7,8,9]. In addition, women aged 65 years old with a previously diagnosed vertebral fracture are expected to have higher risk (one out of four women) of additional fractures within 5 years [10]. The risk for fractures can be reduced to one in eight with preventative treatment [10].

Several strategies exist to reduce fracture risk. Two recognized strategies are exercise and pharmacological therapies (PTs) [11]. To date, several PTs have shown efficacy for the prevention and treatment of osteoporosis and are used worldwide [12]. Corresponding drugs can be divided into antiresorptive and osteoanabolic agents. Bisphosphonates such as alendronate, ibandronate, risedronate, and zoledronic acid are the most frequently used antiresorptive agents [13]. Older, less frequently prescribed bisphosphonates (e.g., clodronate, etidronate) differ in their mode of action (nitrogen-containing bisphosphonates), appropriateness (administration), and clinical evidence, based on the effects on bone health parameters (such as dual-energy X-ray absorptiometry) [13, 14]. Additionally, denosumab (antiresorptive agent), teriparatide (osteoanabolic agent), and romosozumab (osteoanabolic agent) are often used for the treatment of osteoporosis [11, 14]. The choice of drug depends on patient and provider preference, comorbidities, side effects, costs, and the fracture risk level [15].

Previous studies have shown that bones respond positively to mechanical loading by muscle and ground reaction forces [16, 17]. As a result, exercise can improve bone density, muscle mass, and strength regardless of age [15, 16, 18]. Higher muscle forces maintain bone mass and promote bone formation [19]. Bone metabolic markers typically adapt to bone-anabolic exercise with an increase in bone formation markers and minimal or no reduction on bone resorption markers [20]. Exercise also promotes self-confidence and coordination, hence contributes to fall prevention and reduces the risk of fractures [21]. To date, meta-analyses have been published for isolated PT efficacy [22, 23] and effect of EX on osteoporosis [24, 25]. Recently, combined effects of EX + PT have also been considered. The meta-analysis by Zhao et al. [26] addresses the effect of combining different antiresorptives and EX on bone mineral density (BMD) in the population of postmenopausal women. Zhang et al. [27] investigate the effect of bisphosphonates, hormone replacement therapies (HRTs), and various exercise types in people with decreased bone mass on BMD. Although these previous studies have already provided information on the effectiveness of medications and EX interventions in osteoporosis, these did not permit specific commentary on whether EX has an additive effect to PT in osteoporosis due to their selection of patient groups, interventions, and/or outcomes. For practical reasons, the effect of all frequently used medications without any limitation in exercise mode and patient population with increased fracture risk needs to be investigated.

The aim of the present study was to conduct a systematic review and meta-analysis to investigate whether EX + PT has an additive effect over PT alone on BMD, bone turnover marker (BTM), fracture healing, and fractures in patients with increased fracture risk due to osteopenia or osteoporosis.

Methods

The review was prospectively registered with PROSPERO (https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42022336132). Data and statistical codes used were uploaded on the Open Science Framework (https://osf.io/yjhvz/?view_only=7f2dae1632c64ce1909ab391b34d2a6e). The systematic review was conducted and reported in accordance with 2020 Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines (PRISMA2020; Supplement 1) [28].

Eligibility criteria

Inclusion criteria followed the Participants, Interventions, Comparators, Outcomes, and Study (PICOS) design framework [29]. Studies that did not meet the following criteria were excluded.

Population

The population group of interest was patients with increased risk of fracture due to osteoporosis or osteopenia (T-score <  − 1 SD) [2]. There were no restrictions on age, sex, gender, or race.

Intervention

Involved the addition of exercise to pharmacological therapy. Exercise had to contain the following aspects: planned, structured, repetitive, and targeted [30]. The aim of exercise is the improvement or attainment of physical fitness components [30] (e.g., aerobic, walking, cycling, jogging, resistance, strengthening, endurance exercises, whole body vibration). The combination of different types of exercises did not lead to exclusion. Co-interventions in form of advisory and educational services were allowed (e.g., calls, web calls, education material in form of booklets). Other co-interventions that have a direct impact to bodily physiology such as nutrition other than vitamin D, cholecalciferol, or calcium to compensate potential deficiencies were excluded.

Pharmacological therapies included the following [12, 14] (Supplement 2): alendronate, risedronate, ibandronate, zoledronate, denosumab, teriparatide, and romosozumab.

Comparators

Received pharmacological therapy only without any exercise training (see Supplement 2 for details of included therapies). Supplementation with vitamin D, cholecalciferol, and/or calcium alone was not considered as an included pharmacological therapy. In the case that participants received different medications, decisions for inclusion/exclusion were made based on the most frequently used medication. One person could not act as both an intervention and a control group participant (e.g., crossover RCTs).

Outcomes

Included BMD (areal and volumetric), BTM, fracture healing (radiologically evaluated), and fractures. No bone strength measures were included (see Supplement 3 for further details and priority for extraction).

Study

Included studies were required to be randomized controlled trials (RCTs; parallel, cluster, or cross-over designs) published in German or English as a full peer-reviewed journal publication (i.e., grey literature including theses and conference abstracts were excluded). There were no restrictions regarding the date of publication.

Information sources and search strategy

An electronic database search of MEDLINE (via PubMed), EMBASE (via Ovid), CINAHL (via EBSCO Host), and CENTRAL (via the Cochrane Library) was conducted using keywords, Medical Subject Headings (MeSH) terms, as well as related text words. Searches were performed from the database inception to May 6, 2022. The full search strategy is contained in Supplement 4. Unpublished and ongoing trials were searched via International Standard Randomized Controlled Trial Number (https://www.isrctn.com/), US National Institutes of Health (https://clinicaltrials.gov/), EU Clinical Trials Register (https://www.clinicaltrialsregister.eu/), German Clinical Trials Register (https://www.drks.de/), and Australian New Zealand Clinical Trials Register (https://www.anzctr.org.au/). In addition, the reference lists of prior relevant systematic reviews were screened for additional studies. Forward and backward citation tracking of included studies was performed via Web of Science Core Collection. Hand searches were performed for older bisphosphonates (see Supplement 5).

Selection process

Two independent assessors (from a pool of three screeners: AKS, NKA, EAC) screened each record against predefined inclusion criteria based on title and abstract and subsequently full text. Covidence (www.covidence.org) was used for the screening process. Duplicate records were detected and removed via the Covidence software. Disagreements were resolved through discussions between reviewers and, if necessary, an adjudicator (NLM) and a senior member of the research team (DLB, PJO, BB) were consulted.

Data collection process and data extraction

Data was extracted independently by two extractors (from a pool of three extractors: AKS, NKA, EAC) using GoogleSheets. Differences and extraction errors for the measures of spread and/or impact of studies reporting were identified by a reviewer (AKS) and were resolved through discussions between reviewers and, if necessary, an adjudicator (NLM) and a senior member of the research team (DLB, PJO, BB) were consulted. Prior to commencing data extraction, this method was piloted on ten studies chosen at random and the results were collated by one team member (AKS). In the custom GoogleSheets for extraction, comment bubbles contained detailed information for the extractors as to what data to extract for each parameter and information on how to address unclear information in publications. The following study information were extracted: publication information (author, year, title, journal, funding), study design, study demographics (e.g., age, sex, number of participants), medication interventions (e.g., category, dose, and frequency), exercise interventions (e.g., types, session and overall intervention duration, intensity, and frequency), measurement time points, and outcomes (e.g., BMD, BTM). If multiple follow-ups existed within each timeframe, we extracted the follow-up data closest to 12 months for BMD outcomes and end of intervention if exercise period deviated. For BTM outcome, data closest to 3 months and end of intervention were extracted (see Supplement 3 for further information).

Data for the main outcomes were extracted as number of participants, mean, and standard deviation (SD) where possible. In case these were reported using other measures of center and spread (e.g., median and interquartile range reported instead), we used standard equations to convert the data [31]. All applied data handling methods are available for each study and outcome in Supplement 6. Where data was presented in a figure only, ImageJ (https://imagej.nih.gov/ij/) was used to extract the values by measuring the length of the axes in pixels followed by the length of the relevant data of interest [32]. If it was not possible to extract the required data, information was requested from the corresponding author with a minimum of three times over a 4-week period.

Study risk of bias assessment, reporting bias assessment, and certainty assessment

The Cochrane Collaboration Risk of Bias Tool 2 (RoB2) was used to examine potential bias from the randomisation process, deviations from intended interventions, missing outcome data, measurement of the outcome, and selection of the reported results for randomized trials [33]. An overall risk of bias judgement was made for BMD and BTM outcomes.

The Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach was used to assess the certainty of the evidence [34]. Risk of bias, imprecision, inconsistency, indirectness, and publication bias are considered in the GRADE approach to rate the certainty [34]. Further details on the criteria used for GRADE are available in Supplement 7. Due to the exclusive presence of RCTs, a high level of confidence in the evidence was assumed with a possibility of downgrade after the assessment. Two independent assessors (AKS, EAC) assessed risk of bias and GRADE. Conflicts were resolved by discussion and, if necessary, consultation of an adjudicator (NLM).

Synthesis methods and effect measures

All statistical analyses and forest plots were completed in R version 4.2.1. (https://www.r-project.org/) and the R packages meta [35], metafor [36], and metadat [37].

As all outcomes of interest were continuous, but could be measured on different scales, standardized mean difference (SMD) with Hedges correction (Hedges’ g) was used as the effect estimate [38]. Random-effects meta-analysis was used. To estimate the 95% confidence intervals (95%CI), the Hartung-Knapp-Sidik-Jonkman adjustment with ad hoc correction was used [39, 41]. Statistical significance was set as an alpha of ≤ 0.05.

Heterogeneity was assessed by I2 statistic via the reported 95% prediction intervals (95%PI) where possible. The heterogeneity parameter τ (tau) was estimated via restricted maximum likelihood (REML) estimation. Funnel plots and the Egger’s test modified by Pustejovsky [42] to assess the risk of publication bias and small study effects are only recommended if at least 10 studies were available [43]. Furthermore, this threshold also applies for meta-regression and subgroups analysis [43]. Accordingly, these statistical approaches would only be used if ≥ 10 studies were included. In order to test the robustness of the results, sensitivity analyses exploring the role of outliers and influential trials were included [44].

Results

A total of 2593 records were identified. After removing duplicates, screening titles, and abstracts of all remaining reports, 52 full text articles were assessed for eligibility (Fig. 1). Five records from five RCTs [45, 49] were included. Literature sources and reasons for exclusion of ineligible studies of studies at full-text stage are reported in Supplement 8. Additionally, three potentially relevant ongoing trials were found through registry search but were not included due to pending trial completion (Supplement 9). Three author groups [45, 47, 48] were contacted to request missing data and two [45, 47] responded (see Supplement 6).

Fig. 1
figure 1

PRISMA flow chart

Study characteristics

The details of the five included RCTs [45,46,47,48] are shown in Table 1 and Supplement 10. The sample size of each record varied from 35 to 249 participants with a total sample of 530 participants with a mean age from 46.6 to 81 years old. The length of interventions in included studies ranged from 6 to 24 months.

Table 1 Characteristics of included studies

Population

Two studies included both males and females [45, 49], whereas three studies [46,47,48] investigated females only. The participant groups consisted of chronic spinal cord injury with low bone mass [45], postmenopausal osteoporotic women [46], postmenopausal breast cancer survivors [48], women with osteoporosis or osteopenia [47], and patients with type 2 diabetic osteoporosis [49].

Intervention/comparator

Three of the included trials had two groups (PT alone, EX + PT) [46, 48, 49], one study had three groups (PT alone, EX alone, EX + PT) [45], and one study had four groups (PT alone, EX + PT, EX + PT, EX combined + PT) [47]. For trials with more than two groups, only data from PT alone and EX + PT/EX combined + PT group were extracted.

As pharmacological therapy, teriparatide was used in two trials [45, 46], two studies used alendronate [47, 49], and one study used risedronate [48]. As exercise intervention, vibration was used in two trials [45, 46], strength/weight training [48], Sinaki combined with Nordic walking [47], and individual exercise rehabilitation therapy [49] was each used in one study. As co-intervention, cholecalciferol was used in one study [45], vitamin D in three studies [46, 48], and calcium with health education in combination with diabetes treatment in one study [49].

Outcome

All studies [45,46,47,48,49] assessed BMD as an outcome, whereas only four studies [45, 46, 48, 49] assessed BTM outcomes. None of the included studies assessed radiological evaluated fracture healing or fractures.

Study

All five included studies were parallel arm RCTs with patients being the unit of randomization. Three studies received no/non-profit funding [45, 46, 48] and two studies gave no information about funding [47, 49] (see Supplement 10). The primary hypothesis of three studies [45, 46, 48] was to investigate EX + PT vs. PT alone, whereas one study evaluated the influence of different exercises on osteoporosis [47] and another one the influence of aerobic exercise on type 2 diabetic osteoporosis [49].

Risk of bias and GRADE assessment

A summary of the risk of bias assessment for each study is shown in Supplement 11. Summary risk of bias plots for BMD and BTM outcomes and a detailed risk of bias assessment for each included study and outcome were created (Supplement 11).

For the five trials assessing BMD outcome, 80% had a high risk of bias and 20% some concerns overall. There was low risk of bias for randomization process (40%), deviations from intended interventions (60%), measurement of outcome (40%), and selection of reported results (20%). Missing outcome data was rated as having high risk of bias in 80% of included studies.

For the four trials assessing BTM outcome, 75% had a high and 25% a low risk of bias overall. There was a low risk of bias for randomization process (50%), deviations from intended interventions (75%), missing outcome data (25%), measurement of outcome (50%), and selection of reported results (25%). The certainty of evidence was rated as very low for all study outcomes (Supplement 12). The main reasons for downgrading were risk of bias, inconsistency, and imprecision. Publication bias was assessed by funnel plots (Supplement 13) in addition to criteria used (Supplement 7).

Data handling

Three trials had the mean and SD extracted directly [45, 46, 49]. For one study [47], the mean and standard error were extracted from a figure. Standard error was converted to SD. One study [48] had mean and SD as percentage change from baseline data. Due to missing data and non-responsiveness of the authors, no transformation of the data was possible. Two trials [45, 47] required pooling of groups due to multiple groups and various recruitment sites in both intervention and comparator groups.

Meta-analysis of bone mineral density (BMD)

The meta-analytic findings of all BMD related meta-analyses are presented in Table 2.

Table 2 Effect sizes of BMD outcomes

Meta-analysis of hip bone mineral density (BMD)

At 12 months, three trials [45, 46, 48] involving 294 participants and at end intervention two trials [45, 48] with 237 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that hip aBMD increased at 12 months (SMD [95%CI]: 0.18 [− 1.71; 2.06]; P = 0.72; I2 = 90.3%, GRADE: very low) (Fig. 2) and at end intervention (0.55 [− 7.08; 8.18]; P = 0.53; I2 = 79.0%, GRADE: very low) (Supplement 14).

Fig. 2
figure 2

Forest plot of intervention effect on BMD at 12 months. The data are shown as pooled SMD with 95%CI for changes in experimental (EX + PT) and control (PT) groups. The prediction interval for tibia BMD could not be calculated because there were only two studies included in the meta-analysis

Meta-analysis of femoral neck bone mineral density (BMD)

At 12 months, three trials [45, 48, 49] involving 377 participants and at end intervention two trials [45, 48] involving 239 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that femoral neck aBMD decreased at 12 months (− 0.03 [− 1.80; 1.75]; P = 0.95; I2 = 93.9%, GRADE: very low) (Fig. 2) and increased at end intervention (0.13; [− 7.34; 7.60]; P = 0.86; I2 = 80.2%, GRADE: very low) (Supplement 14).

Meta-analysis of tibia bone mineral density (BMD)

At 12 months, two trials [45, 46] involving 61 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that tibia BMD increased (0.25 [− 4.85; 5.34]; P = 0.65; I2 = 57.8%, GRADE: very low) (Fig. 2).

Meta-analysis of lumbar spine bone mineral density (BMD)

At 12 months, four trials [45, 46, 48, 49] involving 412 participants and at end intervention two trials [45, 48] involving 237 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that lumbar spine aBMD increased at 12 months (0.20 [− 1.15; 1.55]; P = 0.67; I2 = 90.2%, GRADE: very low) (Fig. 2) and at end intervention (0.85 [− 5.02; 6.73]; P = 0.32; I2 = 57.6%, GRADE: very low) (Supplement 14).

Meta-analysis of forearm bone mineral density (BMD)

At 12 months, three trials [46,47,48] involving 313 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that forearm BMD increased (0.05 [− 0.35; 0.46]; P = 0.63; I2 = 0%, GRADE: very low) (Fig. 2).

Meta-analysis of bone turnover markers (BTM)

The meta-analytic findings of all BMT related meta-analyses are presented in Table 3.

Table 3 Effect sizes of BTM outcomes

Meta-analysis of bone ALP (bone-specific alkaline phosphatase)

At end intervention, three trials [45, 48, 49] involving 379 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that bone ALP decreased (− 0.68 [− 5.88; 4.53]; P = 0.63; I2 = 96.7%, GRADE: very low) (Supplement 14).

Meta-analysis of PINP (procollagen type I N propeptide)

At 3 months and end intervention, two trials [45, 46] involving 73 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that PINP decreased at both three months (− 0.74 [− 10.42; 8.93]; P = 0.51; I2 = 89.1%, GRADE: very low) and end intervention (− 0.39 [− 10.94; 10.16]; P = 0.72; I2 = 91.1%, GRADE: very low) (Supplement 14).

Meta-analysis of CTX-I (C-terminal crosslinking telopeptide of type I collagen)

At 3 months and end intervention, two trials [45, 46] involving 73 participants compared the effect of EX + PT with PT alone. Meta-analysis showed that CTX-I decreased at both 3 months (− 0.69 [− 9.61; 8.23]; P = 0.51; I2 = 87.4%, GRADE: very low) and end intervention (− 0.04 [− 7.05; 6.97]; P = 0.95; I2 = 81.1%; GRADE: very low) (Supplement 14).

Results of sensitivity analysis

A sensitivity analysis exploring the role of outliers and influential trials was conducted. When the trial from Waltman et al. [48] was excluded, as the results were only given as percentage change data and could not be transformed, results of the meta-analyses changed to a weaker impact of exercise with some results reversed results into negative findings and therefore were no longer effective. Further details of the sensitivity analysis can be seen in Supplement 15.

Results of subgroup analysis

A subgroup analysis was not possible for any outcome as the number of studies included was too low (n ≤ 10 studies) [50].

Results of meta-regression

A meta-regression was not possible for any outcome as the number of studies included was too low (n ≤ 10 studies) [50].

Protocol additions since PROSPERO registration

We decided to add a search for unpublished and ongoing trials in five trial registries and to search for potentially relevant studies with older bisphosphonates by performing an additional hand search in all databases used. No other protocol deviations occurred.

Discussion

This systematic review and meta-analysis found with very low certainty of evidence that while changes in BMD typically favored exercise added to pharmacological therapy, the effects were not statistically significant. Additionally, the different modes of action from anabolic agents (teriparatide) and antiresorptive agents (bisphosphonates) can be seen in Fig. 2. Anabolic agents stimulate bone formation and increase BMD [51], whereas antiresorptive agents inhibit bone resorption and preserve BMD [52]. BTMs were lowered at all timepoints, but confidence intervals were very wide and evidence certainty very low. No results were available for fracture healing or fracture. There was marked heterogeneity, both clinically and in the individual study outcomes and one study [48] appeared to be influential in the analysis.

Results in the context of other evidence

Previous systematic reviews and meta-analyses have largely focused on specific populations [19, 53, 54], pharmacological therapies [53, 55, 56], and/or types of exercises [57, 58, 58, 59]. The meta-analysis by Yan et al. (2021) [60] has proceeded in a comparable manner, as they also compare EX + PT versus PT alone. They also included different types of exercises with focus on BMD, BTM, and pain. No further details were given regarding the pharmacological therapy used. Their meta-analysis showed significant improvements for BMD, but no improvements for BTMs. The reason of different results may occur because they included more studies (n = 20) and therefore involved more participants (n = 1824). Additionally, any activity to enhance and improve physical health with intervention periods less than 6 months were included. Furthermore, they searched in Chinese databases and included reports only published in Chinese except of one German article. In comparison to the existing literature, our current study focusses on the comparison of exercises instead of routine activities, with currently prescribed pharmacological therapies. In addition, we included only studies that could reach clinically based results due to their duration of intervention (see Supplement 3).

Implications of the results for practice, policy, and future research

Recent clinical practice guidelines [61,62,63] give very similar recommendations in recommending exercise training in combination to pharmacological therapy for treatment of osteoporosis. On the basis of the findings of the current study, we cannot yet recommend additions to clinical practice guidelines that exercise can have an additive effect to pharmacological therapies. While it seems logical to prescribe both exercise and pharmacological therapy to patients with osteoporosis, the certainty of evidence is very low and heterogeneity very high, precluding a possible recommendation.

High-quality RCTs are required to examine the additive effect of exercise to antiresorptive and/or osteoanabolic medication. Specifically, future RCTs should be adequately powered and provide sufficient and useful data quality for a meta-analysis. In terms of participants, future studies should focus primarily on primary osteopenia/osteoporosis. Also, it is possible a greater benefit of exercise may be seen in less active individuals; this should be considered in study design and recruitment. Sex, age, race, treatment history, and comorbidities may moderate treatment effects and should be considered in study design and/or statistical analysis. In terms of the exercise intervention, high-impact exercise and high-intensity progressive resistance training have been shown to have with osteoanabolic effects [64]. Future primary RCTs could and should focus on these interventions, but also consider that these forms of exercise may not suit all populations and other forms of exercise and physical activity (e.g., aerobic dance) should also be evaluated. The frequency of exercise sessions and duration of exercise should be considered, and we recommend adhering to current guideline recommendations [64]. In terms of the pharmacological intervention, it should be investigated if the efficacy of currently in several guidelines suggested sequential therapies using an osteoanabolic agent first followed by an antiresorptive for individuals with very high fracture risk can be enhanced by exercise. For outcome measures, BMD is often taken as a surrogate marker of bone fracture risk, and is a more feasible (e.g., total hip or femoral neck BMD) outcome to assess than long-term fracture rates, even if the latter is the ultimate goal of osteoporosis management. Furthermore, the method of measuring BMD (DXA, QCT, pQCT, and/or QUS) should be considered versus their advantages and limitations. Further, in terms of study design, 9 months of intervention is a conservatively set time period to measure detectable changes in BMD via, for example, DXA [65]. Finally, while it is difficult to blind participants to exercise, the assessors of the bone outcomes must be blinded to the group allocation of the participants.

Strengths and limitations

This study has some limitations. Nearly all study outcomes had a high risk of bias. The lack of blinding participants to exercise treatment may overestimate the intervention effects. Due to the small number of included studies (n = 5), it was not reasonable to perform subgroup analysis or to assess the publication bias and meta-regression by using a statistical approach. A further limitation is the small population in the respective studies indicating further demand on studies [66]. Regarding the BMD outcome, we only included aBMD and vBMD to examine the effect of intervention on bone mineral density. This leads to limited findings [67]. Besides exercise and pharmacological therapies, BTM also depends on various other controllable factors such as nutrition, as well as non-controllable factors such as genetic predisposition [68]. The influenceable factors might also depend on social-economical aspects [69, 70]. These factors should be considered while interpreting the final results and deriving measurements according to the results.

The strength of the current study was a systematic review approach, which is the highest level of evidence. Additionally, the Hartung-Knapp-Sidik-Jonkman adjustment was used. This is important to implement in meta-analyses that include less than five trials [39,40,41]. The role of outliers in sensitivity analysis was explored to examine the robustness of the results. The addition of searches of trial registries and backward/forward citation tracking strengthened the search process.

Conclusion

While the mean effect sizes typically favored additional exercise to pharmacological treatment in improving BMD and lowering BTMs, confidence and prediction intervals were very wide and evidence certainty was very low. No results were found for fracture healing and fractures. Even though the study compiles all available information in the field of the research question, no explicit recommendation for additional exercise is possible. Targeted RCTs that are adequately powered, targeting primary osteopenia/osteoporosis with an osteoanabolic exercise program and are subsequently well reported are needed to derive more definitive recommendations.