Supervised Versus Unsupervised Exercise for the Improvement of Physical Function and Well-Being Outcomes in Older Adults: A Systematic Review and Meta-analysis of Randomized Controlled Trials

Background Unsupervised exercise intervention (UNSUP) appears to be a practical and beneficial strategy for older adults, although its feasibility and effectiveness compared to supervised exercise intervention (SUP) remains unknown. We aimed to compare the safety, attendance/adherence rates, and effectiveness of SUP versus UNSUP on physical function and well-being outcomes in older adults. Methods A systematic search was conducted in PubMed, Web of Science, CINAHL, SPORTDiscus, and APA PsycINFO up to September 2022 for randomized controlled trials comparing SUP versus UNSUP in older adults (≥ 60 years). Safety and attendance/adherence rates were registered as indicators of feasibility, and meta-analyses were performed for physical function and well-being outcomes. Sub-analyses were performed for those studies that applied a similar intervention in both groups and for those studies where participants performed ≥ 66% of the sessions in the assigned condition. Results Thirty-four studies were included (n = 2830). No serious adverse events were reported, with similar attendance rates (81%) for both SUP and UNSUP. Compared with UNSUP, SUP induced significant higher benefits on knee extension strength (standardized mean difference (SMD) = 0.18, p = 0.002), sit-to-stand test (STS, SMD = 0.25, p = 0.050), timed-up-and-go test (TUG, SMD = 0.21, p = 0.035), usual gait speed (SMD = 0.29, p = 0.026), lean mass (mean difference = 1.05 kg, p < 0.001) and health-related quality of life (HRQoL, SMD = 0.21, p = 0.035), albeit only knee extension strength remained significant in sensitivity analyses. Sub-analyses revealed superior benefits of SUP on knee extension strength when only considering those studies that applied a similar intervention in both SUP and UNSUP groups. However, no significant benefits were found for the remaining outcomes. Beneficial effects of SUP over UNSUP were also observed for knee extension strength, STS, functional reach test, TUG, usual gait speed, lean mass, and HRQoL when separately analyzing those studies in which participants performed ≥ 66% of the sessions in the assigned condition. Conclusions Current evidence suggests that both SUP and UNSUP programs are safe and could exert benefits on physical function and HRQoL. However, despite being associated with similar attendance rates, SUP might offer some additional benefits, although further high-quality research (i.e., accounting for confounding factors such as presence of supervised sessions in UNSUP or vice versa, as well as equating the exercise dose) is necessary to confirm these findings. PROSPERO Registration Number CRD42022326420. Supplementary Information The online version contains supplementary material available at 10.1007/s40279-024-02024-1.


Introduction
The population aged 60 years or over is rapidly growing, with the number of older adults worldwide expected to reach 1.4 billion in 2030 and 3.1 billion by 2100 [1].This epidemiological shift is accompanied by a concomitant increase in the so-called aging-related diseases, notably frailty [2,3].Consequently, efforts are needed to attenuate aging-related deterioration and its associated burden.
Strong evidence supports the benefits of regular physical exercise for attenuating aging-related multisystem deterioration [4,5].Despite being overall beneficial, supervised exercise intervention (SUP)-the most widely analyzed type of intervention in the scientific literature-might have some drawbacks, as older adults can face difficulties in joining these interventions due to variables such as physical or financial constraints, low availability of facilities, weather conditions, distance from home, time commitments, the intimidating gym environment or, more recently, the lockdowns imposed by the COVID-19 pandemic [6].In this context, unsupervised exercise intervention (UNSUP) appears to be a practical and potentially effective alternative [7,8].Indeed, a recent meta-analysis by our research group concluded that, despite being associated with modest adherence rates (67%), UNSUP might be effective for improving some important physical fitness outcomes in older adults compared with performing no exercise [9].Similar results were reported by a recent meta-analysis that found a beneficial effect of UNSUP on physical fitness measures in healthy older adults [10].

Study Selection
Eligibility criteria are reported according to the Population, Intervention, Comparison, Outcome and Study design (PICOS) approach [21].The review was limited to studies that met the criteria shown in OSM Table S2.
Studies were first retrieved and preliminarily screened by title and abstract, and the full texts of those studies that met the inclusion criteria were assessed (AM, PGR).Disagreements between authors were resolved through consensus or after consultation with a third reviewer (PLV).
An exercise session was considered supervised when participants received synchronous supervision from a professional (e.g., an initial instructional session showing the exercises to ensure the correct technique, or individual/group supervised sessions conducted over the intervention period) whether face-to-face or videocall format.On the other hand, an exercise session was considered unsupervised if it did not include synchronous supervision by a sports scientist (e.g., phone calls asking about the exercises performed, assessing exercise frequency).
In the sub-analysis performed in this study, a supervised exercise group (SUP) was considered applicable if most training sessions performed had synchronous supervision by a sports scientist (i.e., at least 66% of the training sessions were supervised).An unsupervised exercise group (UNSUP) was considered applicable if the main part of the training sessions was conducted without synchronous supervision by an exercise professional (i.e., at least 66% of the training sessions were conducted without real-time supervision).Two independent reviewers (JSM, PGR) checked information from the included studies to calculate ratios in Table 1.In cases of disagreement, a third author (PLV) was consulted for clarification.This 66% cut-off has been applied in previous systematic reviews and meta-analyses comparing supervised and unsupervised exercise training [13].

Outcomes Assessment
Safety included the number of adverse events (e.g., injury, pain, discomfort, worsening of an existing condition) as well as the number of falls during the intervention period.
Attendance rates refer to whether or not the participant carries out the exercise sessions.On the other hand, adherence refers to whether the participant, in addition to attending the exercise sessions, has achieved the intended objectives (i.e., volume, intensity, duration, exercises) [22].
To evaluate effectiveness, studies included should assess at least one of the following health-related endpoints: (1) muscle strength (e.g., knee extension strength, handgrip strength), (2) balance (e.g., one leg stance, tandem stance) (3) physical performance (e.g., timed-up-and-go test, maximum gait speed), (4) body composition (e.g., body fat, lean mass), or (5) health-related quality of life (e.g., European Quality of Life 5 Dimensions (EQ-5D-5L), 36-Item Short-Form Health Survey).If studies reported multiple variables within one of the endpoints categories, all variables were included.Only those variables that were included in at least three studies were used for meta-analysis.

Data Extraction
Two authors (JSM, PGR) independently extracted the following data from each study: participants' characteristics, characteristics of the exercise interventions, attendance and adherence rates, outcomes assessed, and main results.This information was reviewed by a third author (AM) to ensure accuracy and completeness.Data comparing baseline and post-intervention assessments were used.We contacted the authors when studies reported the calculated change.Data were extracted, when available, as mean, standard deviation (SD), and number of participants per group.When data were provided as intervention effects and/or using other measures of dispersion (e.g., standard error, 95% confidence interval (CI)), the required information was estimated following the guidelines reported elsewhere [23].When available, we used the results based on "intention-to-treat" analyses.We had to contact the authors of 30 studies [15,16,18, because the required data were not reported.Of these, the authors of 11 studies [15,18,25,26,31,36,37,39,44,49,50] provided the required information.

Quality Assessment
Two authors (AM, PGR) independently assessed the methodological quality of the included studies with the Tool for the assEssment of Study qualiTy and reporting in Exercise (TESTEX) scale [51].This is a 15-point scale specifically designed for use in exercise training studies, including 5 points for study quality and 10 points for reporting.Thus, the quality of the studies was classified according to their total TESTEX score as "high" (≥ 12 points), "good" (7)(8)(9)(10)(11), or "low" (≤ 6).All the studies were used for data synthesis independently of their methodological quality.A third author (JSM) resolved any potential disagreement.

Statistical Analysis
A random-effects meta-analysis (DerSimonian and Laird method) was performed when at least three studies assessed a given outcome.The pooled standardized mean difference (SMD, post-minus pre-intervention data) between interventions was computed along with the 95%CI, and if the studies reported a given outcome using the same measurement units (e.g., kg, meters), the absolute mean difference (MD) was computed.A conservative correlation coefficient (Pearson's r-value) of 0.7 between pre-and post-intervention data was used for the computation of the within-group SD, and sensitivity analyses with an r-value of 0.2 and 0.5 were performed when a significant result was found (not reported unless results became non-significant) [52].When a study provided effect sizes separately for a given outcome divided in different subscales (i.e., health-related quality of life (HRQoL) divided into its different subdomains), results for that study were combined following a conservative approach by using a random-effects model assuming total dependency between measures (r = 1) as explained elsewhere [53].Sensitivity analyses were also conducted by testing significance when removing one study at a time to check if findings were mostly driven by an individual study.Finally, sub-analyses were performed focusing solely on those studies with high and good quality according to the TESTEX scale, for those studies that applied a similar intervention in both SUP and UNSUP groups (e.g., both groups including exercise interventions targeting the same muscle groups and with similar characteristics) and for those studies in which participants performed more than two-thirds of the sessions in the assigned condition (i.e., the SUP group performed at least 66% of the sessions under supervision; Table 1, OSM Tables S3 and S4).Begg's test was used to determine the presence of publication bias, and the I 2 statistic was used to assess heterogeneity across studies.I 2 values > 25%, 50%, and 75% were considered indicative of low, moderate, and high heterogeneity, respectively.The level of significance was set at 0.05.All statistical analyses were performed using the statistical software package Comprehensive Meta-analysis 2.0 (Biostat, Englewood, NJ, USA).

Study Characteristics
From the retrieved studies, 34 studies derived from 30 RCTs (n = 2830 participants) met all eligibility criteria and were included in the systematic review (Fig. 1).Seven studies analyzed the same sample from three RCTs [15,18,25,26,28,29,54], and they were only counted once for the final sample size.The characteristics of the included studies are summarized in Table 2.

Quality Assessment and Publication Bias
The quality of the included studies was overall good (mean TESTEX score of 10, range 4-14; Table 3).Three (9%) of the studies showed low methodological quality, 17 (50%) were of good quality, and 14 (41%) were deemed to be of high quality.Most studies did not specify allocation concealment (76% of the studies) or did not report adverse events (56% of the studies).Also, only 41% of the studies had a completion rate of at least 85% and only 41% reported details on assessors' blinding.

Balance
Seventeen studies evaluated balancerelated endpoints [14, 16, 24, 28, 30, 35, 37-41, 43, 45, 46, 49, 50, 55], of which 14 could be included in the analyses (OSM Figs.S5-S9).Significantly superior benefits were found for SUP when pooling the four studies that assessed the Berg balance scale, but significance was not confirmed in sensitivity analyses.A non-significant trend towards beneficial effects of SUP was observed for the functional reach test (FRT), although when removing the study by Watson et al. [14], the result was far from significant (Table 4).No significant differences between interventions were found for one leg stance or tandem stance with eyes closed or open.

Body Composition
Ten studies assessed different markers of body composition [14,15,25,27,41,44,45,47,54,55] and seven could be included in the analyses (OSM Figs.S15-S18).No significant differences were found between SUP and UNSUP for body mass index, body mass, or body fat (Table 4).Nevertheless, significantly superior benefits of SUP were found for lean mass, although these

BMI body mass index, HRQoL
Health-related quality of life, STS Sit-to-stand test, TESTEX Tool for the assEssment of Study qualiTy and reporting in Exercise, TUG Timed-up-and-go test, VO 2peak maximal oxygen uptake, 6MWT 6-min walk test Results are shown as standardized mean difference (SMD) or absolute mean difference along with 95% confidence intervals (CI).A higher TES-TEX score indicates better quality.Significant p-values for the effect estimates are in bold font differences became non-significant in sensitivity analyses when removing the study by Watson et al. [14].Some studies analyzed other body composition variables such as bone mineral density [14] or body circumferences [15,41,44,54], but they could not be meta-analyzed (Table 2).
The results obtained in the main analyses remained essentially the same after removing the low-quality studies except for usual gait speed, which became non-significant, and the functional reach test, which became significant (see OSM Table S4).

Muscle Strength
Beneficial effects of SUP on knee extension strength were observed when separately analyzing those studies that applied a similar intervention [30,40,44,45,49] and in the nine studies [16,17,26,30,40,41,44,49,54] in which participants performed ≥ 66% of the sessions in the assigned condition in both groups (OSM Table S3).Subanalysis also confirmed significantly superior benefits of SUP in those studies [29, 43-45, 55, 56] where participants performed ≥ 66% of the sessions in the assigned condition (Table 1) for STS.No significant differences were found for handgrip strength in sub-analyses.

Body Composition
Three studies performed a comparable training intervention in SUP and UNSUP for lean mass [27,44,47], and no significant benefits were found when pooling these studies.In all studies, participants performed ≥ 66% of the sessions in the assigned condition and significant benefits of SUP over UNSUP were found for lean mass (OSM Table S3).

Health-Related Quality of Life
Four studies [43,44,46,48] applied a similar intervention in both SUP and UNSUP groups, and no differences were found when separately analyzing them.In seven [18,31,39,43,44,46,50] out of the nine studies participants performed ≥ 66% of the sessions in the assigned condition, and their separate analyses revealed significant benefits of SUP, whereas non-significant benefits were found for the two studies that did not meet this criterion (OSM Table S3).

Discussion
The present systematic review and meta-analysis compared the safety, attendance/adherence rates, and effectiveness of SUP versus UNSUP on measures of physical function and well-being outcomes in older adults.The incidence of adverse events and falls as well as the attendance to the program (81%) were similar in the SUP and UNSUP groups.Compared to UNSUP, SUP provided significantly superior benefits in knee extension strength, STS, TUG, usual gait speed, lean mass, and HRQoL, but only knee extension strength was still significant after sensitivity analyses.No benefits were found for the remaining outcomes.These results highlight the potential additional benefits that SUP can provide over UNSUP in older adults.However, for those unable to perform SUP, UNSUP may represent a safe and cost-effective alternative to ensure physical exercise.

Safety
We found that most of the included studies reported overall similar rates of adverse events and falls in SUP and UNSUP.For example, Almeida et al. [24], Costa et al. [17], and Lacroix et al. [56] performed a multicomponent exercise intervention (combining strength and aerobic training) during 12 weeks and reported no adverse events (e.g., falls, muscle soreness, or injuries) during the study for either group.Furthermore, one of the included studies with the longest duration (i.e., 35 weeks) did not register any adverse events as a result of the exercise programs [14].Our results are in accord with a systematic review and meta-analysis analyzing the safety and effectiveness of long-term (≥ 1 year) exercise interventions in older adults, which concluded that regardless of supervision or intervention structure (i.e., supervised group-based, unsupervised home-based, or a combination thereof), exercise reduces the number of falls and fall-associated injuries in this population [5].However, it must be noted that the number of adverse events and falls reported in this study may be underestimated given that exercise dose variables are generally not equated between SUP and UNSUP groups (i.e., more difficult exercise selection, higher intensity, and volume for SUP).

Attendance and Adherence to the Exercise Program
In the present study we observed attendance rates of 81% for both SUP and UNSUP groups.In this regard, Lacroix et al. [13] compared the effects of SUP versus UNSUP programs including resistance and balance exercises on different physical fitness measures in older adults, finding a lack of association between attendance rates and the total number of supervised sessions.However, it is worth noting that there are at least two factors that might bias attendance rates.Firstly, most of the studies reported attendance using diaries in the UNSUP group, so the data obtained may not be accurate.Secondly, the fact that 21 of the 34 studies involved some level of supervision in the UNSUP group may affect the attendance rates obtained.Another relevant finding is that only two studies considered whether participants complied with the prescribed parameters (i.e., intensity, duration, exercises) as well as their attendance to the training sessions (adherence).There are factors that can promote greater long-term adherence to the exercise program [57].Although in the present study attendance rates were similar between groups, it is unclear whether this attendance rate could be maintained in the long term.One of the studies that showed the greatest benefits of SUP versus UNSUP lasted 35 weeks and had attendance rates over 85% in both groups [14].Therefore, the limited duration of most interventions or the low attendance to the program in other studies may explain the lack of significant benefits observed in the remaining outcomes.In this sense, previous research has shown that people may be more likely to adhere to an UNSUP program compared to a SUP in the long term because UNSUP programs are easier to integrate into their lives [57].Nevertheless, other factors associated with SUP might also be of relevance, such as obtaining direct feedback from a professional, the social component of being with other participants, or having greater material resources.Further research is therefore needed to confirm the role of attendance of SUP versus UNSUP in the long term in addition to studies that analyze adherence to training rather than only attendance rates.

Effectiveness of Supervised Exercise
Intervention (SUP) Versus Unsupervised Exercise Intervention (UNSUP) Our findings based on preliminary meta-analytical evidence suggest that SUP could provide greater benefits compared to UNSUP in different physical functions (i.e., knee extension strength, STS, TUG, usual gait speed, and lean mass) and well-being (i.e., HRQoL) measures.In line with our results, the meta-analysis of Lacroix et al. [13] found that SUP could provide additional benefits on some strength/power and balance measures.However, most of our findings became nonsignificant after sensitivity analyses, with the exception of knee extension strength.The observed improvement in knee extension strength is potentially relevant, as this outcome has proven to be critical for preventing osteopenia or osteoporosis [58].Knee extension strength is also an important predictor of functional performance in older adults, as it is essential for activities of daily living and general well-being [59].Remarkably, a systematic review and meta-analysis including data from two million adults concluded that higher levels of knee extension strength were associated with a lower risk of mortality, regardless of age and follow-up period [60].
On the other hand, no significant benefits were found for the remaining physical function outcomes (i.e., handgrip, FRT, one leg stance, balance scales, tandem stance, maximum gait speed, 6-min walk test, maximal oxygen uptake, body mass index, body mass, and body fat).There are different hypotheses that may partially explain the lack of benefits obtained.Firstly, participants will improve to a greater extent what they specifically train in their workouts (e.g., if most training sessions include lower-body exercises such as the Otago Exercise Program [37], participants will improve more in outcomes such as knee extension or STS since exercises with similar movement patterns are included).Secondly, it is possible that significant improvements will only be observed in those outcomes in which participants show more possibility of improvement because they have a lower starting level.This is consistent with the results obtained since, as we age, we tend to lose power, strength, and muscle mass due to the natural phenomenon of sarcopenia [61], so participants may be more likely to improve outcomes such as knee extension strength or STS if their baseline level is low.Lastly, there was large heterogeneity in the characteristics of the included studies and the applied interventions, as well as some potentially confounding factors that may influence the lack of additional benefits observed.

Confounding Factors in SUP and UNSUP Exercise Interventions
Of note, in many studies the exercise intervention applied in SUP and UNSUP differed substantially.We observed that training variables (i.e., volume, frequency, intensity, and type of exercise) were overall better reported in the SUP group than in the UNSUP group, which hinders drawing strong conclusions on the influence of these factors.In line with previous systematic reviews and meta-analyses [12,62], a higher exercise intensity was usually applied in SUP than in UNSUP.For example, Iliffe et al. [37] reported that the SUP group trained at a higher intensity than the UNSUP group, and in the study by Watson et al. [14], the SUP group trained using weights equivalent to > 80-85% repetition maximum (RM) while the UNSUP group trained using a lower intensity (< 60% RM).This may be due to the fact that the target population are older people, which might lead professionals to be more conservative when prescribing intensities for the UNSUP group to avoid the potential risk of adverse events (e.g., injuries, falls) or to the participants themselves self-selecting a lower intensity during UNSUP.
In a few studies the exercise selection for the SUP group was similar to the UNSUP group, taking into account the limitations and advantages in terms of facilities and equipment involved when training at a center versus training at home [40,43].Additionally, in most studies the type of exercise intervention was not comparable between groups.For example, Cecchi et al. [32] compared a SUP multicomponent physical exercise program (i.e., strength, balance, aerobic, and stretching) versus an UNSUP program consisting solely of regular walking (only aerobic).To account for this issue, a sub-analysis was performed comparing those studies that equated the exercise intervention as closely as possible (i.e., similar volume, frequency, intensity, and type of exercise) between the SUP and UNSUP groups.Significant differences were found for knee extension strength in those studies that performed a similar exercise program in both groups.However, the number of studies included in each of the 12 analyzed outcomes ranged from three to nine, and the remaining six outcomes could not be analyzed due to the low number of studies available.Moreover, in some cases participants in the SUP or the UNSUP groups performed only part of the exercise sessions with or without supervision, respectively (only 13/34 studies did not include any supervised sessions during the intervention).Rarely, the two exercise groups only differed in the amount of guidance they received [30,49].Therefore, we conducted a second sub-analysis comparing those studies in which participants performed more than two-thirds of the sessions in the assigned condition (i.e., the UNSUP group performed at least 66% of the sessions without supervision) and significant differences in knee extension strength, STS, FRT, TUG, usual gait speed, lean mass, and HRQoL were observed.Eighteen outcomes could be analyzed in this subanalysis, but the number of studies included in each outcome was reduced (from three to 13 studies).These limitations derived from the reduced number of studies included in both sub-analyses make it difficult to reach definitive conclusions.
Future research should examine the safety, attendance/ adherence rates, and effectiveness of SUP versus UNSUP focusing on comparable training parameters, including volume, frequency, intensity, and exercise modality.These studies should specifically compare programs that differ solely in the presence or absence of supervision (i.e., fully supervised vs. fully unsupervised).This approach will provide valuable insights into the potential benefits and limitations associated with supervision, shedding light on the optimal design of exercise interventions for various populations.

Practical Implications
Our results have shown that SUP could provide additional benefits to UNSUP on some specific outcomes.The reason why UNSUP may not be as effective as SUP for improving some outcomes might be partly due to the fact that workouts conducted under the supervision of a professional may be performed with a higher quality.For example, SUP usually trains with better technical execution of the exercises, higher intensity and rating of perceived effort, better implementation of individualization and progression principles, and higher motivation due to direct feedback resulting in greater improvements [63,64].Therefore, given its potential superiority, SUP might be recommended over UNSUP when possible.However, there are some barriers usually associated with SUP in this population.For example, a systematic review conducted in the oldest old (i.e., people aged 80 years and over) showed that some of the main limitations to exercise identified were costs, transport, lack of access to exercise facilities, no exercise companion or being alone, care of siblings or others, fatigue, and embarrassment [65].In addition, Costello et al. [66] reported lack of time and discipline, potential for injury, inadequate motivation, boredom and intimidation as main barriers to regular physical activity.
Previous meta-analyses have shown that UNSUP is effective for improving health-related outcomes in older adults [7,10,67].Our research group showed that, compared with no exercise, UNSUP could be safe and effective for improving measures of muscle strength/power and balance in community-dwelling older adults, although the adherence to these programs was low [9].Similarly, a meta-analysis including 17 studies also reported that UNSUP was effective for enhancing physical fitness in healthy older adults [10].More recently, a meta-analysis including 12 studies (performed in both adolescents and adults) concluded that supervised resistance training could provide small additional benefits over unsupervised training on muscle strength, but no consistent differences were found for body composition [12].Thus, when SUP is not feasible, UNSUP could be a safe and cost-effective alternative for improving the fitness and health of older adults.

Strengths and Limitations
One of the main strengths of the present study is that it provides novel information, as we focused solely on older adults, including a large number of studies (34 RCTs and 2830 participants), and analyzed both physical function and well-being outcomes.Another major strength is that previous systematic reviews and meta-analyses have often focused on a single specific type of exercise (e.g., strength or balance alone) whereas the present review included studies that examined all exercise types (i.e., strength, balance, flexibility, aerobic, or a combination thereof).Conversely, some limitations of the present study should be acknowledged.Notably, the low number of available studies for some of the meta-analyzed outcomes made the conclusions preliminary.One potential confounding factor is that most studies did not equate all training variables (i.e., volume, frequency, intensity, and type of exercise) for the SUP and UNSUP groups.To account for this issue, we performed sub-analyses comparing those studies that performed a similar exercise intervention in both groups.The lack of a consistent terminology regarding the degree of supervision in exercise interventions can be considered another confounding factor, since most of the UNSUP programs included some supervised sessions.Therefore, we performed additional sub-analyses defining an objective concept of training supervision (≥ 66% supervised sessions in the SUP group and ≥ 66% unsupervised sessions in the UNSUP group).There was also a lack of homogeneity in the tests used for assessment, making it difficult to reach definitive conclusions due to the small number of studies included in each meta-analyzed outcome.Future studies should take all these limitations into consideration.Finally, it is worth noting that the included studies analyzed both healthy and diseased populations, but given the heterogeneity of the populations assessed within-and betweenstudies, and the low number of studies included, we were unable to perform sub-analyses to determine how participants' characteristics (i.e., healthy vs. clinical populations) moderate exercise benefits.Indeed, given the advanced age of the participants included, setting an objective definition of "healthy" is highly complex, since most participants presented some comorbidities (e.g., diabetes, hypertension, obesity, osteoarthritis, sarcopenia, frailty).

Conclusion
The present study suggests that SUP may offer certain advantages over UNSUP in enhancing physical function and well-being outcomes among older adults.Nevertheless, given that both interventions show high attendance rates and similar levels of safety, UNSUP appears to be an accessible approach for older adults, which might overcome some of the limitations associated with SUP.Future research should aim to examine the safety, attendance/adherence rates, and effectiveness of SUP versus UNSUP, focusing on equating training parameters between the two groups and differing only in the presence or absence of supervision (i.e., the UNSUP group cannot include supervised sessions or vice versa).These studies will provide valuable information about the benefits and limitations of supervision, informing the optimal design of exercise interventions.Figure 2 summarizes the findings obtained in this research.

Fig. 1
Fig. 1 PRISMA 2020 flow diagram for new systematic reviews which included searches of databases, registers, and other sources.RCT randomized controlled trials

Fig. 2
Fig. 2 Graphical summary of the study findings.CI confidence interval, HRQoL healthrelated quality of life, SMD standardized mean difference, SUP supervised exercise interventions, UNSUP unsupervised exercise interventions

Table 1
Supervised training sessions ratio over total number of training sessions in SUP and UNSUP groups SUP supervised exercise interventions, UNSUP unsupervised exercise interventions * Ratio was calculated based on supervised sessions/total number of sessions

Table 2
Characteristics of the included studies

SUP > UNSUP: ↑ SF-36 subscales: role- physical, bodily pain and vitality
* Data are shown as median ↑ Significant improvement in the outcome ↓ Significant worsening in the outcome

Table 4
Summary of pooled results