Introduction

Favorable health outcomes among cancer survivors are increasingly being attributed to lifestyle factors, especially to physical activity (PA), which is promoted in clinical guidelines [1]. PA has been shown to improve physical fitness [2] and quality of life [3] and has been suggested to affect mortality and recurrence rates favorably [4]. However, the latter findings are based on observational evidence and need to be interpreted with caution. Moreover, evidence suggests that PA has the potential to reduce fatigue among survivors of cancer [5], though most studies examining the effect of PA on fatigue have been conducted among heterogeneous survivor groups that mainly comprise survivors of breast cancer. Whether PA is equally effective in reducing fatigue among survivors of other cancers, such as colorectal cancer (CRC), has not been thoroughly examined to date. This distinction is important because these patients are generally older and have many disease-specific side effects related to their treatments and co-morbid conditions.

CRC is the third most common cancer worldwide, having age-standardized incidence rates in developed countries of 36.3 and 23.6 per 100,000 for men and women, respectively [6]. Mortality rates, however, are decreasing in these countries thanks to improved screening programs, treatments, and follow-up regimes [6,7,8]. Consequently, the number of survivors of CRC is expected to rise [9], which presents challenges to the provision of follow-up care [10, 11]. Survivors of CRC show a variety of late and long-term side effects, such as bowel dysfunction [12], anorectal dysfunction, sexual dysfunction, and psychosocial problems [13], which can significantly affect quality of life (QoL) and functioning.

A frequently occurring problem that affects QoL is cancer-related fatigue [14], which is reported to be twice as prevalent among survivors of CRC compared with an age- and gender-matched reference population [15]. Moreover, it can persist for a prolonged period, being present up to 10 years after diagnosis [16, 17]. The relationship between PA and fatigue has been studied in cross-sectional studies of CRC survivors, indicating that higher levels of moderate-to-vigorous PA are associated with lower levels of fatigue [18,19,20,21]. In other cross-sectional research, it was also suggested that fatigue could mediate the relation between PA and QoL [22]. Due to the design of these studies, no conclusions can be made about causality.

In this study, we aimed to conduct a systematic review of the effect of posttreatment PA on fatigue among survivors of CRC, using data from randomized controlled trials (RCTs) or prospective cohort studies.

Methods

Protocol and registration

The protocol for this review is available at PROSPERO, the international database of prospectively registered systematic reviews for health and social care (registration number RD42015029702).

Eligibility criteria

Studies were included if they met the following criteria: they must be a randomized clinical trial, controlled clinical trial, or prospective cohort study; use longitudinal data collection and analysis; report data about patients with CRC; include patients treated with curative intent; use a PA intervention (for the clinical trials) or an assessment of PA (for the cohort studies); assess fatigue using a validated fatigue questionnaire or a fatigue subscale from a validated QoL questionnaire; and have the data published in peer-reviewed journals.

Information sources and search strategy

The databases of Medline, CINAHL, and PsycINFO were systematically searched in December 2015, using combinations of MeSH and free-text words for “colorectal cancer,” “physical activity,” and “fatigue.” Table 1 shows the full search details for all databases; the search criteria were combined with the “AND” statement. No date restrictions or language restrictions were defined a priori. In addition, we checked the references of included studies for additional publications, and if necessary to obtain the relevant data, we contacted the authors of studies that did not report the results for CRC separately.

Table 1 Search strategy for different databases

Data collection

Studies were screened for eligibility based on titles and abstracts, after which full-text articles were reviewed, by two researchers independently (DB and JK). Data collection was performed in Microsoft® Access®, using a piloted and digitized data extraction form. We identified authors, titles, study types, populations, inclusion and exclusion criteria, intervention details, questionnaires used, and outcomes measured.

Statistical analysis for meta-analysis of randomized controlled trials

Because of expected heterogeneity of study participants, interventions used, and timing of outcome measurements, we used a random-effects model to pool the effect of PA on fatigue. We calculated standardized mean differences (SMDs) using mean scores and standard deviations at 6 months for the intervention and control groups in all studies. When these data were not available in the original articles, they were calculated by the researchers. When studies varied in time points of measurement of outcomes, we used the SMDs for the time points closest to 6 months. Statistical heterogeneity was evaluated by the χ 2 and I 2 tests and was determined statistically significant for P < 0.1 and I 2 > 50%, respectively.

Risk of bias assessment

Risk of bias assessment was performed using the Cochrane collaboration’s risk of bias assessment tool for the controlled trials [23] or the Newcastle–Ottawa quality assessment scale for the cohort studies [24]. Two researchers (DB and JK) independently reviewed the articles. Disagreement was calculated by Cohen’s kappa, and discrepancies were discussed until consensus was reached, with arbitration by a third researcher (AJB) when necessary. Publication bias was assessed by evaluating a funnel plot. All analyses were conducted using the computer program Review Manager (version 5.3. Copenhagen, The Cochrane Collaboration, 2014).

Results

Selection of articles

Figure 1 summarizes the search and article selection process. In total, we found 1196 records and screened the titles and abstracts of 1118 records after removing duplicates; agreement between researchers was moderate (Cohen’s kappa of 0.558). After removing unsuitable articles, we screened 61 full texts (Cohen’s kappa was 0.693, which was considered good), which led to the exclusion of another 54 records. Therefore, 7 records were included for data extraction: 5 were RCTs (one of which was a pilot), and 2 were cohort studies in which data were analyzed prospectively [25,26,27,28,29,30,31].

Fig. 1
figure 1

Flow diagram showing the path of article selection

Randomized controlled trials

Patients

Table 2 shows the characteristics of the five included RCTs. In total, 630 CRC survivors were included, with sample sizes ranging from 18 to 410 patients. The average ages of survivors ranged from 57.3 to 69 years, most patients were male, and all patients were included within 5 years after treatment. In all studies but that by Pinto et al., patients with metastatic cancer were excluded [28].

Table 2 Characteristics of the included randomized controlled trials

The baseline characteristics of the intervention and control groups are summarized in Table 3. Mean ages ranged from 59.5 (11.2) to 68.7 (9.13) years in the intervention groups, and from 55.6 (8.24) to 70.3 (8.7) years in the control groups. In three of five studies [25,26,27], patients in the intervention groups were younger than those in the control groups. Four of five studies [25,26,27,28] included more women in the intervention group (44–60%) compared to the control group (22.2–54%).

Table 3 Baseline characteristics and fatigue outcomes among the intervention and control groups in the included randomized controlled trials

Physical activity interventions

Interventions in the included RCTs (Table 2) ranged from 10 weeks to 36 months, with a median duration of 13 weeks. Three interventions were home-based, requiring participants to exercise themselves, and focused on either moderate cardiovascular activity for up to 30 min 3–5 times per week [25, 28] or encouraging participants to increase their annual number of steps walked to 10,000 [27]. To enhance adherence, the home-based programs used monitoring or counseling telephone calls. The other two interventions consisted of supervised exercise sessions combined with advice for participants to exercise in their own time [26, 30]. One intervention involved a combination of cardiovascular exercise and resistance training [26], whereas the other consisted of traditional hatha yoga [30]. In most of the studies, controls received care as usual, but, in one case, this was supplemented by weekly calls to monitor symptoms and a survivorship tip sheet [28]. The study by Cramer et al. used patients on a waiting list as their control population [30]. Dropout rates in included interventions ranged from 6 to 21%, with the larger studies having larger dropout rates.

Outcome measures

The primary outcome measures are detailed in Table 2. All RCTs used the Functional Assessment of Cancer Therapy-Fatigue (FACT-F) questionnaire to measure fatigue as a secondary outcome measure. We contacted the corresponding author of one study with unusual low fatigue scores, to find out that the scores had been recoded [25]. We transformed these scores back to the original scale and used them for the meta-analysis.

The fatigue outcomes for the RCTs are summarized in Table 3. All studies showed improvements in fatigue scores from baseline to after the intervention. However, the effect of the intervention on fatigue, when compared with the control group, was only statistically significant in one of the studies [26].

Figure 2 shows the results of the random-effects meta-analysis of the effect of the PA interventions on fatigue, which confirmed that there was no significant effect on fatigue in patients with treated CRC (SMD = 0.21 (− 0.07 to 0.49), I 2 = 42%, Z = 1.46, p = 0.14). No statistically significant heterogeneity was found between the studies (χ 2 = 6.95 (p = 0.14), I 2 = 42%).

Fig. 2
figure 2

Forest plot for the meta-analysis of the randomized controlled trial results

Risk of bias

Figure 3 depicts the risk of bias assessment for each included RCT. Because participant and personnel blinding was not possible in these studies, blinding was evaluated based on that of the outcome assessors. The studies by Hawkes et al. and Cramer et al. were considered to have the lowest risks of bias [27, 30], whereas the studies by Courneya et al. and Pinto et al. had the highest risks of bias [25, 28]. These latter two studies also scored high on other bias assessments. For example, contamination (i.e., the control group also increased their levels of PA) was an issue in the study by Courneya et al. [25], and failure to obtain the required numbers based on the sample size calculation was an issue in the study by Pinto et al. [28]. As can be inferred from the funnel plot in Fig. 4, publication bias was unlikely.

Fig. 3
figure 3

Risk of bias assessment of the randomized controlled trials

Fig. 4
figure 4

Funnel plot for publication bias

Cohort studies

Patients

The two longitudinal cohort studies comprised 3904 patients with CRC, and longitudinal data was available in 2828 of these [29, 31]. The mean age of patients was 69.1 ± 9.42 years, 44.4% were female, and the average time since diagnosis was 5.2 ± 2.8 years. Most had non-metastatic CRC.

Outcomes

Both cohort studies measured fatigue using the Fatigue Assessment Scale. The cohort study by van Putten et al. (n = 2451) examined the correlation between PA and symptoms or function-related barriers (including fatigue), clinical or sociodemographic variables [31]. While levels of moderate-to-vigorous PA were lower for fatigued CRC survivors compared with those who were not, changes in fatigue scores among survivors were not significantly associated with changes in PA levels (β: − 0.25, 95% CI: − 0.50 to 0.01, p = 0.05). The other cohort study by Husson et al. (n = 1453) analyzed the course of fatigue and its correlates [29]. However, this study did show that increased levels of moderate-to-vigorous PA among CRC survivors were significantly associated with decreased levels of cancer-related fatigue over time (β: − 0.03, 95% CI: − 0.05 to − 0.01, p < 0.01).

Quality assessment

The studies were performed using comparable methodology; therefore, the quality assessments were comparable. In both studies, participants were somewhat representative of the average CRC survivors in their communities (1 star), and the exposed and non-exposed cohorts were also enrolled from the same communities (1 star). Ascertainment of exposure (fatigue or moderate-to-vigorous PA) was based on written self-reporting (no star), as were the outcome assessments (no star), and the outcome of interest could have been present at the start of the study (no star). Both studies controlled for most important factors (1 star) and had a sufficiently long follow-up for the outcome to occur (1 star). Both studies suffered from substantial loss to follow-up (approximately 60%), thereby receiving no star for this criterion. Therefore, the studies scored four out of eight possible stars.

Discussion

Summary

In this systematic review, we performed a meta-analysis of five RCTs comprising 630 survivors of CRC. Interventions based on increasing PA were shown not to have any significant effect on fatigue levels. However, despite the lack of statistical significance, a trend for reduced fatigue was observed in all intervention groups. We considered the combined effect not clinically relevant. Concerning the longitudinal cohort studies, one showed that increased levels of moderate-to-vigorous PA were significantly associated with decreased fatigue, but another showed that decreasing levels of fatigue were not associated with increasing levels of PA.

Comparison with the literature

Our meta-analysis is the first to focus solely on the effects of PA interventions on fatigue after treatment for CRC. Findings from meta-analyses that included survivors of all types of cancer showed that interventions to increase PA had significant effects on fatigue [3, 32,33,34], as did a meta-analysis of supervised exercise interventions among breast cancer survivors [35]. However, most of the meta-analyses were over-represented by breast cancer survivors, and they showed considerable heterogeneity. A subgroup analysis in one meta-analysis indicated that the results did not differ by type of cancer [3], though this comparison was only between survivors of breast cancer and survivors of other cancers, without further specification of cancer type.

PA has been shown to be effective in reducing fatigue among people who are healthy or have long-term conditions [36, 37]. It has been hypothesized that PA can positively affect key biological variables associated with fatigue in cancer survivors, including loss of skeletal muscle mass and strength, anemia, increased proinflammatory cytokines levels, and psycho-behavioral problems [38]. There is no reason to assume that these should be radically different in survivors of CRC.

A possible explanation for not finding an effect in this meta-analysis is that none of the included trials was performed among fatigued survivors. Average fatigue scores at baseline ranged from 38.5 to 43, yet it is recommended that a score less than 34 be used for the diagnosis of fatigue when using the FACT-F [39]. Given that the scores observed in the included studies were close to average fatigue scores in a normative population (43.6 ± 9.4) [40], this may have led to ceiling effects in the analyses. This was observed in earlier meta-analyses [32]. Despite randomization protocols, the studies in our meta-analysis (except the studies by Courneya et al. [25] and Hawkes et al. [27]) showed higher FACT-F scores in the intervention groups that could have led to even larger ceiling effects.

In previous meta-analyses, the populations tended to be younger compared with those in the current review. For example, in the meta-analysis by Mishra et al. [3], the average age varied ranged from 39 to 70 years, whereas it ranged from 60 to 69 in the current meta-analysis. It has been suggested that older survivors of cancer benefit less from PA interventions [41, 42], which is possibly related to lower adherence among the elderly [43]. Further research is needed into the optimal types of interventions designed to increase PA among older survivors of cancer [41, 44].

Three of the five studies in our meta-analysis used home-based exercise protocols with telephone monitoring, focusing on aerobic exercise [25, 27, 28], while a fourth studied the effect of hatha yoga [30]. Earlier research suggested that the effects of PA on fatigue were larger when the intensity of PA was higher [34], and these four studies arguably did not include exercise of sufficient intensity. One trial that combined aerobic and strength exercises, possibly leading to a higher intensity, did show a significant effect. However, this was only a pilot study and was not powered to assess the effects on fatigue, indicating that the results need to be interpreted with caution. It has been suggested that lower intensity intervention might only offer beneficial effects if the exercise routines become a part of daily life [34]. Considering the short duration of most trials in our current meta-analysis, it is questionable whether this could be assessed.

The results from the included cohort studies showed that fatigue levels decreased when survivors of CRC increased their moderate-to-vigorous PA. However, the other showed that PA did not significantly increase when fatigue levels decreased. Thus, PA could positively affect fatigue, but it is likely that other clinical or demographic variables have confounding effects. Although causality cannot be inferred from observational research, selection bias was less of a problem compared with the RCTs. This is particularly relevant because the RCTs of the type we included were more likely to appeal to survivors who were already interested or participating in exercise, and because earlier research has shown that current exercise status is the best predictor for adherence to exercise interventions [45]. The opposite is true of fatigued survivors, those who are not interested in PA, and those who feel more ill, because such people may be less likely to accept an invitation to participate in trials that require intense PA.

Strengths and weaknesses

A major strength of our review and meta-analysis is that all the included RCTs used the FACT-F questionnaire as the outcome measure when assessing fatigue. The FACT-F has been shown to be sensitive to changes over time [46, 47]. Indeed, although earlier meta-analyses showed large heterogeneity in fatigue outcomes, this was probably related to the differences in the questionnaires used to evaluate fatigue [32, 34]. Also, we did not limit the review to RCTs, which is important because these are especially prone to selection bias in studies of PA, as discussed. Other risks in trials of PA include selective attrition and the potential for contamination, with the possibility that participants randomized to the control group could improve their PA, leading to further underestimation of the effect of the intervention.

Heterogeneity was classified as moderate in our meta-analysis (I 2 = 41%), possibly because of the differences in the types and intensities of PA, or because of the differences in the timings of the outcome measures. We analyzed outcomes that were measured between 6 weeks and 6 months after the intervention, but earlier research has shown that the effects of PA on fatigue varied between these time points. For example, improvements have been seen at 12 weeks and between 3 and 6 months after intervention, but not at 6 months [3]. Another noticeable weakness of our study was the small number of included studies, making it impossible to perform subgroup analyses to compare the effects of either aerobic and strength exercises or self-guided and supervised exercise.

Conclusion

The results of this systematic review and meta-analysis do not allow for definitive conclusions about the effect of PA on fatigue among survivors of CRC. More research is needed and should be conducted in trials powered to find differences in the reduction of fatigue and that are performed among fatigued survivors of CRC. However, considering the many other beneficial effects of PA on fitness, QoL, and survival, physicians should not hesitate to encourage survivors of CRC to engage in more PA.