Introduction

In 2016, sarcopenia was recognised as a disease with an International Classification of Disease, Tenth Revision, Clinical Modification (ICD-10-CM) code (1). As global life expectancy continues to rise, sarcopenia presents itself as a significant public health challenge (2). The various consequences of sarcopenia, ranging from the development of physical disability to nursing home admission, depression, hospitalisation, and mortality, are anticipated to significantly impact the quality of life for affected individuals (3, 4). Within the context of health systems shifting towards a more patient-centred model of care, it is crucial to consider the impact of diseases, such as sarcopenia, on patients’ quality of life and directly perceived outcomes, (5). In this context, patient-reported outcome measures (PROMs) and patient-reported experience measures (PREMs) emerge as invaluable tools for capturing patients’ perspectives on their health and experiences. PROMs and PREMs aim to report on diseases and symptoms, treatment side effects (such as pain, fatigue, or anxiety), functional outcomes (physical, sexual, social, role, emotional, or cognitive functioning), or multidimensional constructs like HRQoL or health utility (6).

Using PROMs and PREMs as endpoints in clinical studies may improve the understanding of patient’s experience by providing information that may not be captured through biomedical methods due to the difficulty of observing certain aspects and their subjective nature (6). This approach may support healthcare professionals and future patients in choosing the most suitable treatment by giving a clearer view of patient’s experiences and identify any unmet needs or areas in healthcare that require improvement (6). Therefore, using PROMs and PREMs in interventional clinical studies on sarcopenia contribute to a more comprehensive understanding of clinically perceived benefits, fostering an assessment of treatment efficacy. Government regulatory agencies, including the Food and Drug Administration (FDA) and the European Medicines Agency (EMA) (7, 8), have advocated for the incorporation of PROMs as primary or secondary outcomes in interventional studies. The European Society for Clinical and Economic Aspects of Osteoporosis, Osteoarthritis and Musculoskeletal Diseases (ESCEO) working group further also recommends using co-primary endpoints, combining a measure of physical performance with PROMs in all phases III clinical trials for sarcopenia (9).

The current approach to managing sarcopenia involves a multifaceted strategy to mitigate its impact on individuals’ health and well-being. Interventions are diverse, incorporating a blend of nutritional, exercise, and pharmacological strategies. Many of these approaches raise numerous questions regarding their effectiveness, particularly in patient-centred management.

Although several studies demonstrate improvements in various parameters such as muscle mass, strength, and physical performance, there are still uncertainties about their impact on enhancing patients’ overall HRQoL (10).

This systematic review addresses the need to understand the impact of sarcopenia interventions from patient’s perspective. We therefore aimed 1) to identify all sarcopenia-designed interventional studies that used a PROMs/PREMs as the primary or secondary outcome; 2) to identify the different PROMs/PREMs used within those studies and 3) to summarize the effects of sarcopenia-designed interventions on PROMs/PREMs of sarcopenic participants.

Methods

The 2020 Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) statement (11) has been followed throughout the whole procedure of this systematic review (completed PRISMA checklist available in Appendix 1). The review protocol has been registered on the Open Science Framework (https://osf.io/zxgwm/).

Literature Search

The electronic databases MEDLINE, EMBASE, Review-Cochrane Central of Register of Controlled Trials, and APA PsycINFO (via OVID platform for all the mentioned bibliographic databases) were searched in September 2023 for any study who have used a PROMs/PREMs in a clinical trial aiming at the management of sarcopenia. The search strategy employed for searching in Medline (Ovid) is available in Appendix 2. The search strategy was further adapted to fit the requirements of each database. Additionally, a manual search within the bibliography of relevant papers was performed in order to complete the bibliographic search. Moreover, we also conducted forward references searching of included studies using Web of Science to identify other research that has referenced any article of interest. We also searched on clinical trial registries (www.clinicaltrial.gov) for potential unpublished studies.

Study selection

The search results from the electronic sources and hand searching were imported into Covidence software for data management. Covidence is a web-based collaboration software platform that streamlines the production of systematic and other literature reviews (https://support.covidence.org/help/how-can-i-cite-covidence). During the initial screening phase, three reviewers (G.L.D., F.O.F, V.S.) independently assessed the title and abstract of each obtained reference to eliminate articles irrelevant to the systematic review. Rigorous inclusion criteria were applied, as outlined in Table 1. In the subsequent step, the three reviewers individually examined the full text of each article that had not been excluded in the initial stage, selecting studies that fulfilled the inclusion criteria. Any discrepancies in article selection were resolved through discussion and consensus.

Table 1 Inclusion criteria

Studies were excluded if they included individuals with acute sarcopenia or diagnosed sarcopenia based solely on a single biomarker (e.g., muscle mass only). Additionally, exclusion criteria applied to studies that relied only on a screening tool (e.g., the SARC-F) without further diagnosing the condition, studies that recruited pre-/post-operative hospitalised or disease-specific participants, studies exclusively focused on the diagnosis of sarcopenic obesity, and studies that examined PROMs/PREMs using qualitative research methods.

Data extraction

Data extraction was carried out by three independent reviewers (G.L.D., F.O.F, V.S.) using a standardised form, which had been pretested on a sample of 4 studies.

The following data were extracted: article information (authors, journal name, years of publication, title, country), study description (objective, design, and duration), population characteristics (general description and sarcopenia diagnosis), outcomes (type of PROMs/PREMs), funding details, information on conflicts of interest, and the study’s conclusion. To include as many studies as possible in our systematic review, we systematically contacted authors or co-authors when information was missing in the full-text paper.

Risk of Bias Assessment

The same three independent reviewers assessed trials’ risk of bias using the Cochrane Risk of Bias Tool 2.0 (12). This tool assesses five domains for each study: randomisation process, deviation from intended interventions, missing outcome data, measurement of the outcome and selection of the reported results. When a study included various PROMs, the focus was primarily on assessing the quality-of-life outcome. For studies featuring a single PROM, the assessment specifically targeted that particular outcome. In case of conflicts, resolution was achieved through consensus, with the option of involving a third party (C.B.).

Data Synthesis

Due to the diverse range of treatments included in this systematic review, conducting a direct comparison using meta-analytic statistics was impossible. Consequently, the findings were summarised and explained narratively.

Results

The search strategy initially identified 2,006 records, with 1,646 records remaining after removing duplicates. Upon screening the titles and abstracts of these records, 77 were considered potentially eligible. Following a thorough evaluation of the full texts of these 77 articles, 60 were excluded and a total of 17 studies met the eligibility criteria and were included in this systematic review. A list of excluded studies and their reasons for exclusion is available on the Open Science Framework deposit (https://osf.io/zxgwm/). Authors from three included paper were contacted by email for additional information or details about their analyses. They all responded positively to our request, providing us enough information to ensure the inclusion of their paper in the current manuscript. Manual search yielded no new reference. Flowchart of study selection is available in Figure 1.

Figure 1
figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA 2020) flowchart of study selection

Studies characteristics

The 17 randomized-controlled trials (RCTs) included, published between 2015 and 2023, are detailed in Table 3 and Table 4 (1324). Trial durations ranged widely, spanning 8 (18) to 156 (20) weeks, and study design comprised 2 to 5 groups. The intervention was singular or combined, including medications (n=2, 11.76% (25, 26)), nutritional supplementation (n=6; 35.3% (1318)), exercise program (n=4, 23.53% (23, 24, 27, 28)) and a combination of exercise and nutritional supplements (n=5, 29.41% (1922, 29)). At the same time, the control group received isocaloric products/placebos, isocaloric products/placebo combined with an exercise program, or exercise programs alone. The number of participants ranged from 28 (23) to 1519 (20). Only one study exclusively enrolled females (19). Although the targeted population across included studies was sarcopenic, variations were observed in age categories (≥ 60 years, ≥ 65 years, ≥ 70 years) and living conditions encompassing community-dwelling individuals (76.47%), residents of care institutions (11.76%), and candidates for in-patient rehabilitation (5.88%). Regarding sarcopenia diagnosis, 13 studies adhered to a recommended definition by scientific societies, with 4 using AWGS (13, 21, 25, 29), 8 using EWGSOP 1 or 2 (16, 18, 19, 22, 23, 2628), and 1 using FNIH criteria (20). Other studies used at least two biomarkers to diagnose sarcopenia but did not mention their adherence to a scientific-societies consensual definition.

Table 2 General description of the 9 that showed an improvement of the primary outcome following the intervention
Table 3 General description of the 8 RCTs that showed no improvement of the primary outcome following the intervention
Table 4 Overview of PROMs used within included studies

None of the included studies presented any high risk of bias for any of the investigated domain (Figure 2).

Figure 2
figure 2

Results of risk of bias assessment of included studies

PROMs/PREMs characteristics

The 17 included studies reported PROMs as outcomes, but none of them reported any PREMs. PROMs utilized in the studies covered assessments of various concepts including HRQoL (SF-12, SF-36, EQ-5D, SarQoL) (1322, 2429), depressive symptoms (GDS, BDI) (17, 30), loneliness/social isolation (UCLA) (30), daytime sleepiness (ESS) (23), insomnia impact (ISI) (23), and sleep quality/disturbance (PSQI) (23, 24). The majority of PROMs employed in these studies were generic, with the exception of one study that utilized a sarcopenia-specific PROM called SarQoL (27). Table 2 provides a brief overview of these various PROMs.

PROMs were mainly used as secondary outcomes. Nevertheless, 5 studies listed multiple primary outcomes among which one or more PROMs were listed. Only Pinheiro et al. (19) used exclusively PROMs as primary endpoint (i.e. depression, loneliness and HRQoL).

Results of interventions on primary endpoint

The nature of the primary endpoint varied widely across trials, but in the majority of trials, primary outcome was defined by a physical measurement (i.e. change in gait speed, change in SPBB test, etc.). Further details are provided in Table 3 and Table 4.

Nine RCT reported a significant improvement of the primary endpoint following exercise-based intervention (20, 23, 24, 27, 28), nutrition-based intervention (15, 18, 31) or a combination of exercise and nutrition-based intervention (21) (Table 3). In four of these studies, no further improvement was observed on PROMs (neither HRQoL (18, 20, 21, 31) nor depression (31)). The other five RCTs highlighted both an improvement in the primary endpoint and in PROMs, reflecting that the proposed intervention may be effective in counteracting sarcopenia and its impact on PROMs. De Sa Souza et al. (23) reported a significant effect of a 12-week intervention with resistance exercise training on the Insomnia Severity Index (ISI) and the Sleep Quality Disturbance (PSQI) compared to the control group. Flor-Rufino et al. (28) reported that 26 weeks of high-intensity resistance training may improve muscle strength, gait speed, maximum expiratory pressure alongside the EQ-5D VAS, despite reporting no effect on the EQ-5D utility index. Tsekoura et al. (27) reported that a 12-week group-based exercise and individualized home-based exercise may be effective in improving calf circumference, Time up and Go test, 4-meter gait speed, muscle mass index, muscle strength, and HRQoL PROM, as reported by the SarQoL questionnaire. Tung et al. (24) reported that a 26-week vital acupunch exercise program may be effective in improving muscle mass, muscle strength, and sleep quality as reported by the PSQI PROM. Finally, one nutrition-based intervention, published by Bo et al. (15), using a 24-week intervention with supplements containing whey protein, vitamin D and vitamin E, reported a significant improvement in muscle strength and muscle mass index alongside a significant improvement in HRQoL physical and mental component scales of the SF36 questionnaire.

The other 8 RCTs did not report any significant improvement on the primary endpoint (i.e. SMI, handgrip strength, SPPB test 30-second chair stand test, gait speed) nor on the PROMs either (Table 4). None of the pharmacological intervention-based RCTs (i.e. using Bimagrumab 700mg monthly (25) or perindopril/leucine (26)) reported improvement in the primary endpoint or in PROMs.

Discussion

The objective of this study was to identify sarcopenia-focused interventional studies utilizing PROMs and PREMs as primary or secondary outcomes. This systematic literature review aimed to provide a comprehensive overview of the impact of sarcopenia-designed interventions on outcomes directly considered as relevant by patients suffering from sarcopenia. Seventeen sarcopenia-designed RCTs encompassing interventions such as medication (11.76%), nutritional supplementation (35.3%), exercise programs (23.53%), and a combination of exercise and nutritional supplements (29.41%) were identified. Surprisingly, none of the 17 studies incorporated PREMs but all of them utilized PROMs, mainly as secondary outcomes. The selected PROMs covered diverse concepts, including HRQoL (88.23%), depressive symptoms (17.65%), loneliness/social isolation (5.88%), and sleep quality/disturbance (11.76%).

Out of the 17 sarcopenia-targeting RCTs utilizing PROMs as primary or secondary endpoints, half of them (i.e., 9/17) reported a positive impact of the intervention on sarcopenia parameters (i.e., improvement of muscle mass, muscle strength, or physical performance), and approximately 30% of them (i.e., 5/17) further reported an improvement in PROMs. One hypothesis that could explain these results might be that, with the exception of one study employing a sarcopenia-specific PROM called SarQoL, most reported PROMs were generic. Generic instruments provide a broad assessment of HRQoL in populations and enable comparisons with other conditions, but their generic nature makes them less likely to reflect the impact of the intervention compared to a specific instrument. Specific questionnaires are more sensitive to change. Currently, only three specific PROMs for sarcopenia are documented in the scientific literature, which explains their limited utilization in clinical trials. First, the Age-Related Muscle Loss Questionnaire (ARMLQ - SARCOPRO) has been developed by Evans et al. (32) in 2011 to assess functional impacts of reduced muscle strength. Only the content validity of this PROM has been reported, leaving the other psychometric properties, such as validity, reliability and responsiveness to change unexplored. Second, the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical function item bank, funded by the NIH (33), has been validated for measuring physical function. However, the appropriate context of use and the fit-for-purpose measurement in sarcopenia have not been reported thus far. Thirdly, the SarQoL, a specific quality of life questionnaire for sarcopenia, stands as the only validated PROM currently available. Translated into more than 35 languages and validated across 19 different populations, this PROM is recommended for use in both clinical and research practices (3436).

The international COnsensus-based Standards for the selection of health Measurement Instruments (COSMIN) initiative (37) provides guidelines for selecting the most appropriate PROM in research and clinical settings. COSMIN taxonomy covers key psychometric properties, including reliability (such as reliability, measurement error, and internal consistency), validity (such as content validity, criterion validity, and construct validity), and responsiveness. However, the mere existence of a PROM doesn’t ensure its suitability for measuring a specific concept related to PROMs. In clinical trials targeting sarcopenia management using a PROM as a primary or secondary endpoint, it is crucial to verify that the PROM has been properly developed and validated according to established guidance. In the context of interventional studies, the PROM’s responsiveness to change of a PROM is paramount. The instrument should be sensitive enough to detect any change in the measured concept. For this reason, an ESCEO working group, composed by key experts in the field of clinical trials for sarcopenia, recommends the use of specific PROMs in clinical trials aiming at the management of sarcopenia (9).

The existing diversity in outcome measures underlined in this systematic review also underscores the critical need for standardization in sarcopenia research. The development and adoption of a Core Outcome Set (COS) play a pivotal role in achieving this standardization. The absence of a COS for sarcopenia significantly contributes to the observed variability in PROMs across studies. The COMET (Core Outcome Measures in Effectiveness Trials, https://www.comet-initiative.org/) initiative actively promotes the creation and use of COS, recognizing its numerous advantages (38). COS serves as a valuable tool in averting ineffective interventions and addressing outcome-reporting bias by providing a predefined list of essential outcomes for measurement in Randomized Controlled Trials (RCTs) (39). Its implementation enhances the ability to conduct more consistent systematic reviews or meta-analyses, facilitating robust comparisons across studies and improving the reliability and generalizability of research findings (39). Developing a COS specific to sarcopenia is indispensable not only for establishing uniformity in outcome reporting but also for enhancing the validity and reliability of study results.

This review also brings to light a significant aspect of sarcopenia research: Patient-Reported Experience Measures (PREMs) were not reported in any of the included studies. The underreporting of PREMs in sarcopenia can be attributed to several factors, including researchers’ skepticism, resource limitations, insufficient funding, reluctance to overburden patients, uncertainties about how to utilize PREMs results, and an evident lack of standardization (40). However, addressing the underreporting of PREMs in sarcopenia research requires a broader recognition of the importance of patient perspectives in evaluating intervention effectiveness. Adopting a patient-centered approach is crucial, as neglecting the patient’s experience during an interventional trial can impede shared decision-making in sarcopenia research. Shared decision-making (SDM) explicitly involves patients and clinicians in decisions regarding diagnostic and treatment options (41) by integrating patients’ values and preferences (42). Developing a standardized tool to address this issue could overcome barriers to PREMs implementation and contribute to a more comprehensive and patient-centered approach to sarcopenia studies.

Limitations

This study is constrained by its reliance solely on published literature, potentially overlooking additional interventional studies that are either ongoing or completed but unpublished. Furthermore, our inclusion criteria focused on interventional studies explicitly reporting PROMs or PREMs as primary or secondary outcomes, preventing us from offering a comprehensive prevalence of studies employing such outcomes. Consequently, it remains uncertain whether the use of PROMs is a common or less widespread practice in sarcopenia research. Additionally, the substantial variability in the PROMs used across these studies posed a challenge to conducting a meta-analysis. The differences in outcome measurement instruments between studies created obstacles in synthesizing quantitative data. This emphasizes the critical need for heightened standardization of measurement tools in sarcopenia research. Enhancing standardization would not only facilitate future meta-analyses but also promote comparability across studies, addressing a significant challenge in the current landscape.

Conclusion

In addressing the impact of sarcopenia intervention on PROMs/PREMs, this study identified 17 sarcopenia-designed interventional trials employing diverse strategies such as medication, nutritional supplementation, and exercise. None of these studies used a PREM as primary or secondary outcome. PROMs were exclusively use in those studies covering aspects like HRQoL, depressive symptoms, loneliness and sleep quality. The varied effect on PROMs highlights the need for standardization in sarcopenia research. Developing a Sarcopenia Core Outcome Set is important to ensure consistent, comparable findings, enhancing the intervention’s reliability and effectiveness.