Introduction

Cancer is a major global health issue and one of the leading causes of death in the world, placing a substantial economic burden on both the societal and individual levels [1, 2]. In 2020, an estimated 19.3 million new cancer cases and 10.0 million cancer-related deaths occurred worldwide [1]. With advancements in cancer diagnosis, prognosis, and treatment options, the cancer mortality rates in America have declined significantly between 1991 and 2019 by a total of 32% [3], and the number of cancer survivors has increased. The 2006 Institute of Medicine landmark report “From cancer patient to cancer survivor: Lost in transition [4]” highlighted the unique issues in survivorship care faced by all cancer survivors. The report presented the potential physical and psychosocial challenges caused by cancer and/or cancer treatment (including surgery, chemotherapy, radiation therapy, hormonal therapy, immunotherapy, or a combination of the above treatment options). Cancer-related impairments include but are not limited to cardiotoxicity [5], sarcopenia [6], cancer-related fatigue [7], lymphedema [8], pain, osteoporosis, and chemotherapy-induced peripheral neuropathy [9, 10]. They can occur during active cancer treatment (acute response) and persist after treatment (long-term effects) or appear months or years after treatment ends (latent effects) [11].

The positive role of PA has been profoundly researched in cancer-related impairments and survivorship. Physical activity refers to any bodily movement produced by skeletal muscles or requires muscular contraction and results in energy expenditure [12]. As early as 1938, animal experiments have demonstrated PA as an inhibitory factor to tumor growth. Nowadays, there has been large research effort devoted to investigating the role of PA in cancer survivors; many positive impacts have been shown including lowered cancer mortality risks and improved cancer-related health outcomes [13]. According to a systematic review [14], a combined resistance training with aerobic exercise program can reduce cancer-related fatigue and improve patients’ quality of life. A clinical trial suggested that active exercises prevented lymphedema in female breast cancer patients [15]. A high-quality controlled trial demonstrated that home-based aerobic exercise combined with supervised resistance training significantly reduced arthralgia associated with aromatase inhibitor therapy in breast cancer patients [16]. Emerging evidence suggests that exercise can harness the immune system to improve colorectal cancer survival rate [17, 18]. Given the promising effects of PA intervention in this patient population, an increasing number of initiatives for the integration of PA into cancer care continuum have been put forward [19].

Clinical practice guidelines (CPGs) are protocols developed based on systematic reviews of the current available evidence and the analyses of the benefits and harms. CPGs can assist clinicians with clinical decision-making and help identify gaps that may require further research [20]. Implementing good clinical practice guidelines could optimize clinical care quality while decreasing medical expenses and minimizing potential harm due to ineffective or unsafe interventions. Many guidelines and recommendations have supported PA intervention for cancer patients, but the wide variation in the quality of these guidelines may influence clinicians providing the optimal treatment for patients. At present, many quality assessment tools are frequently used to evaluate the guidelines [21]. As one of the most widely used instruments, the “Appraisal of Guidelines for Research and Evaluation (AGREE) II” was also recommended by WHO [22]. Numerous CPGs with PA recommendations for cancer patients have been appraised by this instrument and have met rigorous methodological quality criteria [23, 24]. However, previous studies [25] suggested that there may be a conflict between guideline methodologic quality and recommendation validity. Having high AGREE II scores does not guarantee that CPG recommendations are optimal, trustworthy, credible, or practical to implement, since some CPGs may have omitted common clinical situations or recommendations of uncertain clinical validity, which may affect the clinical application of recommendations [25, 26]. As a complementary tool to AGREE II, the Appraisal of Guidelines Research and Evaluation–Recommendations Excellence (AGREE-REX) was specifically developed to evaluate the clinical credibility and implementability of recommendations [26]. Therefore, the purpose of this review was to understand and appraise the quality of current CPGs with PA recommendations for cancer survivors and the clinical application of their recommendations using the AGREE II and AGREE-REX tools. In addition, we also wanted to identify research gaps through the synthesis of PA recommendations.

Materials and methods

Study design

We conducted a methodological appraisal of CPGs using the AGREE II and AGREE-REX instruments and reported our results according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The protocol was registered in the International Prospective Register of Systematic Reviews (PROSPERO) with the registration number CRD42021265329.

Search strategy

We conducted a systematic search for guidelines and employed three approaches to identify guidelines with PA recommendations for cancer survivors.

We searched electronic databases including PubMed, CINAHL, PEDro, and EMBASE. In addition, we supplemented database searches by a hand-search of guideline repositories. We also conducted a supplementary search by scanning the reference lists of review articles and relevant conference abstracts to identify eligible CPGs. A publication date limitation from 1 May 2016 to 1 June 2022 was set for all searches.

We adopted a combination of Medical Subject Headings (MeSH), free text terms, including “neoplasms, cancer, tumor, exercise, physical activity, rehabilitation, guideline, practice guideline, recommendation” as database search strategy. The detailed search strategies were provided in the Supplementary Table S1.

Selection criteria

The eligibility of CPGs was identified according to the following criteria.

Inclusion criteria were as follows: ① CPGs published in English; ② recently published or updated from 1 May 2016 to 1 June 2022; ③ published in peer-reviewed scientific journals; ④ exclusive to adult (aged ≥ 18 years) cancer population; ⑤ provided specific PA intervention recommendations on at least one PA parameter (frequency, intensity, duration and/or type); and ⑥ provided recommendations and explicit methodology.

Exclusion criteria were as follows: ① did not include PA as part of intervention strategy; ② published or updated prior to 1 May 2016; ③ CPGs for pharmacological or rehabilitation interventions without PA recommendations; ④ received commercial funding; ⑤ provided recommendations for PA referrals; and ⑥ focused on cancer prevention other than cancer or cancer-related symptoms and impairments management.

Study selection

We first imported all results into Endnote (version X9, Clarivate Analytics) reference manager program, and eliminated duplicates using the software and manually. Two independent reviewers (X. Zhou, C.H. Li) scanned the remaining records against the titles and abstracts. For records that were considered potentially relevant, we retrieved the full-text guidelines following the inclusion and exclusion criteria. If any disagreement arose, a decision was made by discussion with a third reviewer (X.G. Lai).

Data extraction and summary

Data screening and extraction were performed by two authors (X. Zhou, C.H. Li). One author performed data extraction which was confirmed for consistency by a second author. A third independent author (X.G. Lai) adjudicated unresolved discrepancies. We extracted and summarized the relevant information on the CPGs and recommendations on specific PA interventions. Main characteristics of the CPGs were collected, including title, organization/author, country of origin, publication year, methodological approach, and its scope.

Quality appraisal of the CPGs

The quality of each included guideline was appraised using the evaluation tools AGREE II and AGREE-REX by trained reviewers. All the reviewers received the online AGREE II training and evaluated CPGs on melanoma from published review [27] using the AGREE-REX tool to familiarize themselves with the instrument and compare their scores.

Both tools were rated on a 7-point scale ranging from 1 (strongly disagree) to 7 (strongly agree). Disagreement among reviewers of > 2 points for each item was discussed and resolved by consensus. Domain scores were calculated by adding scores of the individual items in a domain and standardizing the sum as a percentage of the maximum possible score for that domain, i.e., (obtained score – minimum possible score) / (maximum possible score – minimum possible score).

Two trained reviewers (C.H. Li, W.Q. Hou) independently appraised the methodological quality of each guideline by the AGREE II instrument. The instrument comprised 23 items within 6 domains: scope and purpose (items 1–3), stakeholder involvement (items 4–6), rigor of development (items 7–14), clarity of presentation (items 15–17), applicability (items 18–21), and editorial independence (items 22–23).

The Consortium of AGREE II does not set minimum domain scores or patterns of scores across domains to differentiate between high quality and low quality [28]. In this study, based on cut-off scores reported in previous guideline appraisals [23, 27, 29], we chose a cut-off “score of at least 60% for rigor of development (domain 3) as well as 60% in at least two other domains as a quality threshold. Guidelines meeting the cut-off scores were then evaluated by AGREE-REX.

Complementary to AGREE II, AGREE-REX is an instrument used for evaluating the clinical credibility and implementability of the recommendations. AGREE-REX consists of 3 domains, including “clinical applicability (Domain 1), values and preferences (Domain 2) and implementability (Domain 3)” composed of 9 items that must be considered to ensure that guideline recommendations were of high quality. This instrument was used by five independent reviewers (X. Zhou, C.H. Li, W.Q. Hou, X.G. Lai, and L.W. Zhai).

Similar to AGREE II, AGREE-REX does not provide a standard threshold to differentiate between high quality and poor quality. In this study, we set the threshold to be 60% and identified high-quality guidelines when all domain scores were above the threshold.

Statistical analysis

All domain scores were calculated by adding the item scores in each domain and converting the number into a standardized percentage of the maximum score for that domain. We also listed mean (± standard deviation, SD) for analysis of the descriptive statistics. We used the intraclass correlation coefficient (ICC) with a two-way random effects model [30] to test the inter-rater agreement (measure agreement among reviewers). The degree of agreement (ICC) was classified according to Cicchetti (1994): poor (< 0.40), fair (0.40–0.59), good (0.60–0.74), or excellent (0.75–1.00) [31]. P < 0.05 was considered statistically significant. Statistical analyses were conducted using SPSS (version 24, IBM Corporation, Armonk, NY, USA).

Results

CPGs characteristics

The literature search of the databases and supplementary sources identified 5108 records. We reviewed 182 full-text articles for eligibility after removing the duplicates and screening the titles and abstracts and 16 CPGs that met selection criteria were included in this systematic review (Fig. 1). Reasons for exclusion (n = 166) are provided in Fig. 1. The fourteen CPGs included four for breast cancer [32,33,34,35], one for head and neck cancer [36], one for multiple myeloma [37], one for cancer survivorship [38], three for nutrition management and/or physical activity in cancer patients [39,40,41], and six for symptom or condition management [42,43,44,45,46,47]. Twelve CPGs were published in the USA, of which four CPGs were developed by the American Society of Clinical Oncology (ASCO) and four from National Comprehensive Cancer Network (NCCN). The remaining were published in the UK (n = 1), in Europe (n = 1), in Canada (n = 1), and in Germany (n = 1). The basic characteristics of the sixteen CPGs are shown in Table 1.

Fig. 1
figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow chart

Table 1 Characteristics of included CPGs with PA recommendations for cancer patients

AGREE II

Scope and purpose

This domain assesses whether the guideline clearly described the main objectives, clinical questions, and target population. The guideline by Mohile et al. [44] scored the highest and fulfilled 100% of the criteria whereas the guideline [35] from Germany achieved only 41.67% (Fig. 2).

Fig. 2
figure 2

Heat-map showing an overview of the final AGREE II scores on guidelines with physical activity recommendations for cancer patients

Stakeholder involvement

This domain evaluates whether the guideline was developed by appropriate stakeholders and represents the views of its intended users. Furthermore, it covers whether the guideline clearly defined its target users. Two ASCO guidelines [43, 44] achieved the highest score fulfilling 100% of the criteria, while the guideline [35] from Germany showed no consideration of the views of the other stakeholders such as patients, the public, payers, and policy-makers and therefore received a low score of 13.89%.

Rigor of development

This domain assesses the systematic methods used for gathering and synthesizing evidence, formulating recommendations, the expert external peer review process, and the procedure for updating the guideline. The mean score was 6.21 (± 1.49), with individual scores ranging from 34.38% to 98.96% (Supplementary Table S2). Most guidelines provided search strategies and clearly described the selection criteria except for the guideline from Germany [35]. Three guidelines [34, 35, 37] did not describe clear procedures for updates.

Clarity of presentation

This domain is related to whether the guidelines provided clear recommendations and whether important recommendations were easily identifiable. This domain achieved the highest mean score (6.52 ± 0.78), and the individual scores ranged from 75% [35] to 100% [44]. This suggested all the included guidelines performed well in the presentation and clarity of the recommendations.

Applicability

This domain considers facilitators and barriers to guideline implementation, including potential cost implications, and presents key monitoring criteria for adherence to the guideline. Across the guidelines, AGREE II scores were lowest in this domain, with a mean score of 5.26 (± 1.66). The guideline by Shapiro et al. [43] achieved the best score of 98.58%, while the guideline from Germany [35] and the ASCO guideline [42] paid limited attention to the applicability, scoring only 33.33% and 50%, respectively.

Editorial independence

This domain focuses on whether guidelines are developed independently or have any existing conflicts of interest. The mean score was 6.37 (± 1.03). All of the guidelines fulfilled the criteria and scored above 75% except for the guideline from the UK.

AGREE-REX

Clinical applicability

This domain evaluates whether the guideline is evidence-based and the extent to which the recommendations apply to the guideline’s target users in terms of the practice context and patient population. Among all the included guidelines, the AGREE-REX domain clinical applicability scored the highest with a mean score of 6.10 (± 0.82) (Supplementary Table S3), individual scores ranging from 68.89% [40] to 93.33% [33] (Fig. 3).

Fig. 3
figure 3

Heat-map showing an overview of the final AGREE-REX scores on guidelines with physical activity recommendations for cancer patients

Values and preferences

This domain assesses whether the values and preferences of target users, patients, policy decision-makers, and guideline developers had been explored and considered in the development of the recommendations. This domain scored the lowest with individual scores ranging from 45.83% [34] to 74.14% [33] and seven guidelines scoring below 60%. Of the four items, the values and preferences of policy decision-makers and guideline developers had been omitted completely in several guidelines.

Implementability

This domain assesses the “purpose” and “local application and adoption” of the items. The mean score was 5.28 (± 1.11), with individual scores ranging from 55.00% to 88.33%. The NCCN guideline by Sanft et al. [38] scored the highest, at 88.33%. The NICE guideline [33] and the NCCN guideline by Swarm et al. [45] achieved 83.34%. Most guidelines paid limited attention to local application and adoption.

Among all, eight guidelines met the cut-off scores, achieving above 60% of all domains, and the NICE guideline earned the best scores in the three AGREE-REX domains, ranging from74.17% to 93.33%.

Summary of recommendations and levels of evidence

Fifteen of the sixteen guidelines met the cut-off scores of AGREE II, six of which provided PA intervention recommendations within a disease-specific context, and the remaining guidelines provided management recommendations for symptoms or functional impairments. Notably, two of the fourteen guidelines were published by specialized rehabilitation organizations, and one guideline endorsed by the Canadian Physiotherapy Association provided information specifically for physical therapists on the management of patients with multiple myeloma. PA recommendations, the level of evidence, and the strength of recommendation are outlined in Table 2. Method to collect the evidence, formulate recommendations, and articles type/number are outlined in Table 3.

Table 2 Summary of recommendations and levels of evidence
Table 3 Method to collect the evidence and formulate recommendations and articles type/number

Discussion

This study presents the first systematic review using AGREE II and AGREE-REX to appraise the quality of CPGs with physical activity recommendations for cancer patients. Sixteen guidelines were evaluated and eight guidelines published by ASCO, NCCN, NICE, and ESPN obtained high quality. This is likely due to the strict framework provided by these organizations, and the guideline makers had strictly adhered to the CPGs development process. For example, there is a representative from the ASCO Practice Guidelines Implementation Network (PGIN) and at least one patient representative in each expert panel of ASCO guidelines. Additionally, ASCO produces clinical tools and resources to assist with disseminating the recommendations contained in the guidelines. Based on the available clinical treatment resources, NCCN provides NCCN Framework, NCCN Harmonized Guidelines, and International Adaptations/Translations to extend the utility of the NCCN Guidelines. NICE incorporates economic evaluation in the development of guideline recommendations.

How might our findings impact CPGs development and future research?

Recommendations for guideline development

The potential benefits of clinical practice guidelines depend on the quality of the CPGs [28]. Ideally, guideline makers should strictly follow rigorous processes when developing CPGs. This review identified 16 relevant CPGs that showed acceptable quality in most AGREE II and AGREE-REX domains. Remarkably, some weaknesses have been revealed in AGREE II domain “applicability” and AGREE-REX domains of “values and preferences” and “implementability,” especially the items “values and preferences of policy/decision-makers” as well as “local application and adoption.” The lowest mean score (5.26 ± 1.66) was observed for the AGREE II domain “applicability,” and the evaluators gave low ratings (< 60%) to 4 out of 16 guidelines. Applicability refers to whether facilitators, barriers, and additional resources of the guideline application were presented and if monitoring or auditing criteria of the recommendations was provided. These parameters especially monitoring or auditing criteria were not clearly provided in most guidelines, which may explain the low values of this domain. However, ESPEN guideline did well in this domain, sufficiently providing all parameters [39]. The AGREE-REX domain “values and preferences” assesses whether the values and preferences of target users, patients, guideline developers, and policy decision-makers had been considered and how it influenced the recommendation formulation. Sackett [48] indicated that evidence‐based medicine should be equally based on service users’ values and expectations, individual clinical expertise, and the best available clinical evidence. Previous studies [49, 50] suggested that CPGs were more implementable when they considered patients’ and stakeholders’ values and preferences and provided information to support patient involvement in decision-making. However, most CPGs [32, 34, 37, 44] included in this study achieved low scores (< 50%) because there were missing information on the values and preferences of users, patients, guideline developers, and policy/decision-makers’ influenced on the recommendation formulation, which may ultimately influence the acceptability and adoption into clinical practice of PA recommendations. The findings are consistent with the previous studies [51,52,53]; many factors such as physical activity preferences (type, place, time), convenience (e.g., there is no travel to and from a hospital), cost, knowledge of physical activity guidelines, and accessibility to the program had led to a low level of PA adherence among cancer patients. For the AGREE-REX domain “implementability,” low values were consistently rated on item 9 “local application and adoption” in most included guidelines. This item assesses the suitability of the guideline recommendations for the setting, patient population, and/or the healthcare system in which they are being implemented. Professor Robbie Foy suggested that successful implementation of a guideline depends on both the national and local resources/action [54]. Most of the included guidelines did not consider resources at the local setting or adjust recommendations to tailor local adaptations when developing CPGs. This may further influence applicability to implement across the different local practice settings. Importantly, this consequence can be found in a systematic review involving ninety-eight studies and reported that barriers to cancer patients’ PA engagement included competencies of healthcare professional guidance, social support, and inaccessibility to fitness facilities [53]. However, NCCN guidelines are the notable exception. They provided NCCN Framework for Resource Stratification, NCCN Harmonized Guidelines targeting the regional resources, and International Adaptations/Translations to extend the utility of the NCCN Guideline internationally.

In conclusion, this analysis shows there is substantial room for improvement in the following domains “applicability,” “values and preferences,” and “implementability” to facilitate the guideline implementation. It is essential to comprehensively consider the values and preferences of users, patients, policymakers, and guideline makers and provide advice or tools and resources to facilitate the implementation of the recommendations. In addition, providing a “Guideline Implementation Survey” and a clear “monitoring and auditing criteria” may facilitate application of guidelines. Furthermore, the transparency of the guideline development process should also be addressed.

Suggestions on study design

This study identified eight high-quality CPGs; notwithstanding, many studies that formed the exercise evidence base contained flaws in the study design. We reviewed the quality of studies included in 16 guidelines and summarized the main problems in their design as follows: ① small sample sizes, ② low adherence and high attrition rates, ③ short intervention durations, ④ observational studies or feasibility studies, and ⑤ serious inconsistency owing to various interventions and outcomes. For example, the trials in the ASCO guideline [42] “Management of chronic pain in survivors of adult cancers” presented with some limitations, such as pain as the secondary outcome, a small sample size, and different measurement tools. Some studies included in the CPA guideline [37] were retrospective design, and we also found that most of the latest research for patients with multiple myeloma are feasibility studies [55]. Future research must be conducted using large sample sizes, randomized design, and rigorous method to decrease the risk of bias. Community or home-based exercise mode combined with a web-based guidance is also needed so as to enhance exercise compliance in patients.

Opportunities for future research

Findings of a bibliometric analysis suggested that the overall trend of cancer rehabilitation publications is optimistic [56]. However, this literature study revealed the presenting research deficits in some areas of physical activity interventions: ① Researches were mostly performed on the breast cancer, prostate cancer, colorectal cancer, and lung cancer populations. Of the 16 included guidelines, up to 25% CPGs were tailored to breast cancer survivors; ② previous studies on cancer rehabilitation emphasized on the cognitive, behavioral, and psychological therapies, whereas research on the health effects of PA in cancer rehabilitation is lacking; ③ there is still minimal research exploring the safety, feasibility, and potential benefits of exercise in patients with advanced cancer and cachexia [57].

Therefore, future study efforts should be devoted to ①the effectiveness of PA interventions on a wider range of cancer population other than breast, lung, prostate, and colorectal cancer; ② the immediate and long-term benefits of different modalities of PA on the physical and mental fitness of cancer survivors; and ③ the optimal PA prescription (frequency, intensity, duration, or type) for different symptoms or functioning impairments management. The previous studies [58] suggest that the optimal effects of PA are dependent on the dose appropriate to the individual level of physical fitness and health status. The PA prescription should follow FITT principles: frequency, intensity, time, and type [59]. Some guidelines included in this review are lacking specific PA prescriptions; therefore, for further in-depth exploration of the benefits of PA for cancer patients, future research should continue to explore specific FITT principles tailored towards cancer patients; ④ future research must focus on the effect of exercise on clinical symptoms in patients with advanced cancer and cachexia, especially for sarcopenia and pain.

Recent studies [60, 61] have demonstrated that telehealth-based exercise prescription is both feasible and effective for cancer survivors during the COVID-19 pandemic. Multiple novel web-based exercise systems are emerging as an excellent option to provide individualized PA interventions for cancer survivors, such as breast cancer and prostate cancer [62, 63]. In addition, other factors such as cost-effectiveness, less expenditure on travel time, patients’ preference to home-based exercise, and less absence from work may be considered in favor of the patient [64]. Therefore, efforts should be performed to continue evaluating the value of web-based rehabilitation mode across the continuum of cancer care and survivorship.

Additionally, we also noted that some available evidence included in the CPGs suggested a potential benefit from exercise for some physical impairments or symptoms, yet latest evidence remain insufficient for a clear direction on recommendations. For example, exercise for chemotherapy-induced peripheral neuropathy in adult cancers with insufficient evidence to formulate a recommendation [65, 66]; Tai-Chi for depression with shallow evidence to support a recommendation [67, 68]; and yoga for chemotherapy-induced nausea and vomiting with insufficient data to support specific recommendations [69]. Therefore, future well-designed clinical trials are needed to confirm the effectiveness and safety of the PA interventions for treatment-related issues, especially chemotherapy-induced peripheral neuropathy, nausea, and cachexia.

What challenges are faced when integrating physical activity into oncology care?

There are numerous challenges in applying PA into routine oncology care practice. One solution to overcome the challenges is implementing practice guidelines. The potential benefits of guidelines depend on the quality [28]. In this review, we found some common issues in the content of selected guidelines not relating to methodological flaws. First, much of the guidelines covered information on medical management, while less information was relevant to non-pharmaceutical management. Professional guidance should be provided by experts in different disciplines; however, most expert panels which developed the selected guidelines [32, 33, 35, 39] did not consist of a rehabilitation specialist or exercise physiologist required during the development of PA recommendations. These existing problems may affect the process of integrating physical activity into routine oncology care practice. Hence, there is a compelling need to promote more collaboration between oncologists and rehabilitation specialists, establish multidisciplinary expert panels, improve the referral system, and eventually gain the recognition and promotion of PA intervention in oncology clinical setting. Besides, we also observed that most included CPGs did not provide relevant factors and resources to promote its successful dissemination, which may lower knowledge about PA guidelines in daily life. Previous studies demonstrated that most cancer patients did not know the PA guidelines, and providing more detailed knowledge regarding PA guidelines in the oncological setting was an important facilitators to motivate their PA participation [51, 70, 71]. This phenomenon was also reported among healthcare professionals[72]. It’s important to improve knowledge of PA guidelines in for narrowing the knowledge-to-practice gap. According to the AGREE-REX tool and relevant guidelines implementation projects [73, 74], we suggest that the PA guidelines promoting-strategies include the following: (1) Present information from the PA guidelines in an easy-to-understand format, such as developing PA guidelines for patients; (2) The PA guidelines for cancer patients should be sent to hospital decision-makers, managers, practitioners, and explained to them by the researchers; (3) provide routine training sessions and manual for healthcare professionals, education, and handbook for patients and their caregivers of PA guidelines; and (4) disseminate PA guidelines by holding public outreach events (public forums, reports, conferences, etc.).

Strengths and limitations

As with all systematic reviews, our review has several strengths. It is the first review of guidelines with PA recommendations for cancer patients using AGREE II and AGREE-REX instruments. We evaluated the quality and applicability of included CPGs, synthesized PA recommendations, which could guide future research. In this review, the study selection and data extraction were performed by two independent reviewers, and all included guidelines were assessed by five independent evaluators. Before this article, reviewers were trained to ensure the validity and reliability of the guideline evaluation. Our review team also consists of an expert who has experience in guidelines development.

There are some limitations in our review. First, albeit a comprehensive literature search strategy and covering grey literature, some CPGs may still be missed. Second, given that CPGs only published in English were included, there may be selection bias in our study. Finally, the methodological quality appraisal of guidelines may be somewhat subjective; AGREE II and AGREE-REX did not set defined thresholds to differentiate between high quality and low quality guidelines, we adopted cut-off scores based on previous studies in our review [27]. Other evaluators may interpret AGREE domains scores differently, so the results of AGREE II and AGREE-REX appraisal should be interpreted with discretion. The AGREE II and AGREE-REX only focus on the methodological quality and applicability of guidelines.

Conclusions

This review revealed several deficiencies in domains including “applicability,” “values and preferences” and “implementability,” so this study can serve as a baseline from which to benchmark future development and improvement of cancer rehabilitation guidelines. Furthermore, this analysis indicated that more rigorous, high-quality studies are needed to generate trial-based evidence for supporting PA recommendations and provide guidance on research gaps in the field of cancer rehabilitation.