Background

Colorectal cancer (CRC) is the third most common cancer globally, with incidence rates that positively correlate with Human Development Index levels (HDI), a measure of societal development based on the average health, education, and income of a population [1, 2]. Several CRC risk factors are associated with socioeconomic development including smoking, physical inactivity, unhealthy diets, and excess body weight [3]. Incidence rates are declining among older adults in high-income countries due to screening and early removal of precursor lesions but increasing among young adults (age < 50 years) [4]. The reason for this increase is unknown, but lifestyle exposures in childhood and adolescence are considered drivers [5]. Economically transitioning countries are seeing a rapid increase in CRC incidence, albeit from low levels, reflecting a global shift toward more Westernized lifestyles [6].

Less is known about the impact of lifestyle factors on CRC prognosis. Smoking [7, 8], physical inactivity [9, 10], and unhealthy diets [11, 12] have all been associated with increased mortality in CRC patients. The impact of overweight and obesity on CRC survival is, however, highly debated [13]. Some studies have reported improved survival among CRC patients with excess body weight compared to normal weight patients [14], the so-called obesity paradox [15]. Others report worse CRC-specific outcomes in the obese group [16, 17], and yet other studies find no associations between body mass index (BMI) and survival [18]. Reverse causality due to illness-induced weight loss may account for these differences, highlighting the importance of timing when assessing body weight [19].

Few studies have examined the combined effects of lifestyle factors on CRC recurrence and CRC-specific survival [20,21,22,23]. Previous studies have reported conflicting results, which may reflect methodological differences, including the timing of exposure assessment (pre/post-diagnosis), the use of different lifestyle scores, and differences in study populations. This study aimed to investigate the associations of the combined impact of pre-diagnostic modifiable healthy lifestyle factors, including avoidance of smoking, moderate to high levels of physical activity, high adherence to a healthy diet, and BMI within the healthy range, with CRC recurrence and overall survival in a cohort of Swedish CRC stage I-III patients.

Methods

Study design

The Colorectal cancer low-risk study cohort consists of more than 3300 participants diagnosed with all-stage CRC in 14 hospitals in Middle Sweden from 2003 to 2009, as described elsewhere [24]. Participants were consecutively recruited or identified using data provided by Regional Oncologic Centers. The latter received letters of invitation to participate in the study, and those interested were contacted over the telephone for informed consent and inclusion. A subset of participants included in 2004–2006 received a self-administered questionnaire on lifestyle habits (n = 1767), with a response rate of 93% (n = 1639).

We conducted a cohort study including participants from the Colorectal cancer low-risk cohort with stage I–III CRC who had completed the lifestyle questionnaire. A healthy lifestyle was the exposure of interest and recurrence-free survival (RFS) was the primary outcome of this study, using overall survival (OS) as a secondary outcome.

Participants

Participants with a radically resected adenocarcinoma of the colon or rectum that had surgery in 2003–2006 were eligible for inclusion. Stage IV CRC patients were excluded due to dismal prognosis, as were participants with unavailable patient files or missing data on the American Joint Committee of Cancer (AJCC) TNM stage. Patients were staged according to version 5 of the AJCC TNM [25].

Two investigators (S.B and P.R) collected treatment and follow-up data for the participants during the years 2017–2020, including date of surgery, American Society of Anesthesiologists (ASA) classification, oncological treatment (neoadjuvant radiotherapy/adjuvant chemotherapy), time to CRC recurrence, time to last recurrence-free follow-up visit, and time to all-cause death.

Exposure assessment

A semiquantitative questionnaire was used for the collection of information on smoking, physical activity, and anthropometric markers. Participants were asked to report their cigarette smoking status and history including the number of cigarettes per day and duration of smoking for current and ever-smokers. Data on physical activity including leisure time exercise was collected using a validated set of questions with 5 pre-defined duration categories ranging from less than 1 h to more than 5 h/week, a validated method for assessing physical activity [26]. Self-reported weight 5 years before diagnosis was used to calculate BMI by dividing the weight in kilograms by the square of height in meters. Weight 5 years prior to diagnosis was chosen to minimize the risk of reverse causation, as CRC can induce weight loss.

The lifestyle questionnaire included a food frequency section designed to assess a typically Swedish diet. Participants were asked to report serving size and average intake frequency of 96 commonly eaten foods and beverages 5 years before diagnosis. A similar validated questionnaire, where participants report eating habits over the last year, has been used in previous studies [27, 28].

Mediterranean diet score

The Mediterranean diet (MD) is one of the most scientifically evaluated dietary patterns in the field of nutritional epidemiology [29, 30]. Several studies have reported inverse associations between MD adherence and CRC risk and mortality [31,32,33,34,35]. A diet adhering to the MD pattern was considered healthy in this study.

We used the modified Mediterranean diet scale (mMED) defined by Tektonidis et al. and developed further by Larsson et. al to compute a diet variable [36, 37]. This is a modification of the Mediterranean diet scale originally constructed by Trichopoulou, to better suit the intake habits of the Swedish population. The mMED score was created by categorizing the intakes of the following 6 food groups into quintiles: vegetables and fruits, legumes and nuts, whole grains, fish, dairy products, and red and processed meats. Participants received a score from 1 to 5 for being in the first five groups’ lowest to highest quintiles of intake. The score was reversed for the last group, red and processed meats, assigning 5 points to the lowest quintile. The use of olive- or rapeseed oil was assigned 5 points; conversely, 1 point was assigned for non-use. The mMED has previously included intakes of alcoholic beverages. However, the health effects of moderate alcohol consumption are widely debated [38], prompting us to exclude alcohol consumption from the score. The total mMED score thus ranged from 7 (low adherence) to 35 (high adherence).

Healthy lifestyle and BMI score

A healthy lifestyle and BMI (HL) score was created by dichotomizing each of the four lifestyle variables into a pre-defined healthy and less healthy/unhealthy alternative [39]. Never smokers and former smokers with > 1 year of cessation time were considered non-smokers, as opposed to current and former smokers with ≤ 1 year of cessation. Current but not former smoking has been associated with poorer CRC-specific survival [7]. According to the WHO recommendations for adults, participants with ≥ 150 min/week of leisure time exercise were considered physically active, versus < 150 min/week [40]. A low-risk diet was defined as an mMED score > the cohort median, versus an mMED score ≤ cohort median. Participants with a BMI of 18.5–24.9 were considered to have healthy body weight, as opposed to those with underweight (BMI < 18.5) and pre-obesity or obesity (BMI ≥ 25.0 m2), according to the WHO classification [41]. One point was allocated for each healthy lifestyle factor, and 0 points for the less healthy or unhealthy alternative. The total score thus ranged from 4 (most adherent to a healthy lifestyle) to 0 (least adherent).

Outcome assessment

CRC recurrence was defined as locoregional recurrence, distant metastasis, or the occurrence of a new colorectal tumor. Observation time started on the date of curative surgery and ended on recurrence or the date of the last follow-up visit to the surgical or oncological clinic. In the OS analysis, participants were observed from the date of curative surgery to the date of all-cause death or the last known date of contact.

Statistical methods

We categorized participants into four groups based on HL points. Those with an HL score of 0 and 1 were combined into one group due to low numbers in the former category (n = 20). Those missing data on smoking (0.8%) were coded as non-smokers. Participants who had left the entire diet section of the FFQ blank were considered non-responders and excluded (0.9%). Median imputation was used to replace missing values for single food groups (6.5%), physical activity (2.8%), and BMI (2.6%).

The distribution of demographic variables across categories of exposure was tested using the chi-square test for categorical and the Kruskal–Wallis test for continuous variables. We used the Kaplan–Meier (K-M) method to assess median RFS and OS in each group and Cox proportional hazards model analysis to estimate univariate and multivariable-adjusted hazard ratios (HRs) and 95% confidence intervals (CIs) for CRC recurrence and all-cause death. The survival analysis was right censored. The least healthy group served as the reference category.

The pre-defined confounders age, sex, and educational level were included in the multivariable model (Figure S1). Tumor stage, oncological treatment, and tumor site were considered potential mediators of the effect of a healthy lifestyle on RFS (Figure S1). Diabetes and cardiovascular disease (CVD) were considered potential mediators of the effect of a healthy lifestyle on OS (Figure S2).

Using the Wald test, we tested for interactions between the HL score and tumor site, oncological treatment, and tumor stage. Since most participants with rectal cancer had received neoadjuvant radiotherapy (RT) before surgery, that is before the start of observation time, we analyzed rectal cancer patients separately using an RT variable as a covariate in the multivariate regression model. A complete cases-only analysis was conducted excluding those missing data on any of the HL score variables.

All analyses were done using SPSS version 28.

Results

Participants missing an exact date of recurrence (n = 29), with recurrences occurring ≤ 6 months of diagnosis (n = 18) or follow-up time ≤ 6 months (n = 11) were excluded from the RFS analysis (n =58 ). A total of 1040 participants were included in the RFS analysis and all 1098 participants were included in the OS analysis (Fig. 1).

Fig. 1
figure 1

Flow diagram of participants illustrating the study design according to Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) recommendations. N  number, CRC  colorectal cancer, tumor stage = stage according to AJCC TNM 5th edition, R1 microscopically positive resection margins, RFS recurrence-free survival, OS overall survival, m months

Demographic characteristics of participants are shown for the total population and by HL score category in Table 1.

Table 1 Characteristics of 1098 patients with stage I–III colorectal cancer in the Colorectal cancer low-risk cohort, for the population as a whole and by Healthy lifestyle and Body Mass Index score group

The group with the healthiest lifestyle (HL 4) consisted of 157 participants (14%). These were more likely to be women, of higher age, with a higher educational level, and less often diabetics, as compared to the 233 participants (21%) in the least healthy group (HL 0–1) who were predominantly male and tended to be younger at CRC onset. There were no differences in cancer stage, tumor site, oncological treatment, or other clinical factors. The composition of the HL score is further outlined in Table 2.

Table 2 Healthy Lifestyle and Body Mass Index Score composition, showing the distribution of score components for the study population as a whole and by score groups

We observed 221 events of cancer recurrence among 1040 participants during a median follow-up time of 4.3 years (Fig. 2).

Fig. 2
figure 2

a and b Kaplan–Meier estimates for a recurrence-free survival and b overall survival by Healthy lifestyle and Body Mass Index score groups among patients with stage I-III colorectal cancer. Time since curative surgery is expressed in years

A healthy lifestyle was associated with improved RFS. The crude and adjusted HRs of recurrence and death were significantly lower than the reference for all score categories above the reference. Compared to participants with an HL 0–1 (least healthy), the HL 4 (most healthy) category had a crude HR for recurrence of 0.51 (95% CI 0.32–0.81) and an adjusted HR for recurrence of 0.51 (95% CI 0.31–0.83), with sex, age, and educational level included in the multivariate model. The adjusted HRs for recurrence of participants with an HL 2 and HL 3 were 0.57 (95% CI 0.40–0.81) and 0.66 (0.47–0.92), respectively (Table 3). There were 542 deaths among 1098 participants during a median follow-up time of 6.3 years (Fig. 2). The crude HR for all-cause death for participants with HL 4 vs HL 1 was 0.65 (95% CI 0.48–0.87) and the adjusted HR for all-cause death was 0.52 (95% CI 0.38–0.70) (Table 3). The adjusted HRs for death of HL 2 and 3 vs HL 1 were 0.66 (0.50–0.79) and 0.72 (0.57–0.90). We found no significant interactions between the covariates included in the multivariate model when using the Wald test.

Table 3 Univariate and Multivariate survival analysis using Cox proportional hazards regression models for colorectal cancer recurrence and overall death by Healthy lifestyle and BMI score

In the K-M curves for recurrence (Fig. 2), the curve of the least healthy group was found to have a disproportional course in relation to the others with events occurring sooner in this group, indicating changes in HR over time. The proportional hazards assumption was not valid (Log-Rank test p-value = 0.001). The HRs for recurrence are thus to be interpreted as average estimates for the whole time of observation.

Sensitivity analysis

The violation of the proportional hazards assumption prompted us to conduct a Cox proportional hazards analysis with time-dependent covariates, in order to estimate the HRs for time periods < 24 months, ≥ 24 months – < 36 months, ≥ 36 months – < 48 months, and ≥ 48 months – < 60 months. The strongest effect of the HL score on RFS was seen in the interval of ≥ 24 months – < 36 months (Table S1).

We found no significant interaction effects between the HL score and cancer stage (p-value: 0.39), tumor location (p-value 0.68), and oncological treatment (p-value:0.66) when using the Wald test in the RFS analysis, A complete cases-only analysis was performed, excluding all cases with missing values in score components. This only slightly affected estimated HRs and 95% CIs (Table S2).

We analyzed the rectal cancer group separately, including a covariate for radiotherapy in the Cox model. Participants had received a total dose of either 25 Gy (80% of those treated with neoadjuvant RT) or 50,4 Gy. We coded a categorical variable with three levels (0 = no RT, 1 = 25 Gy, 2 = 50.4 Gy), and tested it in a Cox model for rectal cases only. The p-value of the RT-covariate was non-significant.

When including the individual score components as covariates in a multivariate Cox regression analysis, only smoking, and exercise were significantly associated with a reduced HR of CRC recurrence and death (Table S3). Sex, age, and level of education were all significantly associated with OS, but not RFS.

Discussion

In this cohort study, patients with CRC stage I-III and a healthy lifestyle (HL 4) had a 49% lower HR of cancer recurrence and a 48% lower HR of all-cause death compared with the least healthy (HL 1). As we found the proportional hazards assumption to be violated in our survival analyses, indicating changes in hazard rate over time, we investigated the time-varying effects of the score on RFS. The results indicate a stronger effect in the interval of 24–36 months. However, only a small number of recurrences occurred after this period and the results for survival > 36 months should thus be interpreted with caution.

This is one of the first studies to report a statistically significant decrease in the risk of CRC recurrence in patients adhering to a healthy lifestyle pre-diagnosis. A small number of previous studies have reported inverse associations between a healthy lifestyle and overall mortality, but results for RFS or CRC-specific mortality have often been weaker.

Among 5727 all-stage CRC cases, Pelser et al. reported a statistically significant reduction in the risk of all-cause death for those with a pre-diagnostic healthy lifestyle (including a BMI within the normal range, smoking avoidance, physical activity, a healthy diet, and a low intake of alcohol) [20]. Reduced risk of CRC-specific death was seen only among rectal cancer cases, whereas our results indicate a protective effect of a healthy lifestyle irrespective of the anatomical subsite.

In a cohort of CRC patients from the Nurses’ Health Study (NHS) and the Health Professionals Follow-up Study (HPFS), pre-and post-diagnostic adherence to the World Cancer Research Fund/American Institute for Cancer Research (WCRF/AICR) score was significantly associated with a lower risk of all-cause, but not CRC-specific, death [21]. This score includes physical activity, diet, and body weight, but not smoking. Non-smoking was our study’s strongest individual risk-reducing factor, which may account for the differences. Among 3292 cases of all-stage CRC within the European Prospective Investigation into Cancer and Nutrition (EPIC) study, pre-diagnostic concordance with the WCRF/AICR recommendations was however associated with reduced CRC-related and overall mortality [22].

Among 992 colon cancer stage III cases, finally, a healthy lifestyle post-diagnosis (including normal body weight maintenance, physical activity, and a healthy diet) according to the guidelines issued by the American Cancer Society (ACS) was associated with a significant improvement in OS and a significant trend toward improved RFS over a 7-year median follow-up time [23]. Stratifying for tumor stage did not indicate stage-specific associations between lifestyle and survival in our study.

A recently published large study on the associations between healthy lifestyles and cancer morbidity and mortality in diabetics, including 1904 participants with CRC, found a 45% lower risk of cancer mortality among those with the healthiest lifestyle, compared to the least healthy [42]. Lifestyle and dietary factors have also been included in recurrence and survival prediction models for colon cancer stage III, resulting in significantly improved predictions [43].

The suggested biological mechanisms conferring a protective effect of a healthy lifestyle on CRC risk include decreases in inflammation and oxidative stress, modulation of gut microbiota, decreased bowel transit time, and increases in insulin sensitivity [44,45,46]. The same mechanisms may be involved in reducing the risk of recurrence. Traditional models of tumorigenesis have considered systemic tumor spread to be a late event in the process of primary tumor progression. This is being challenged by studies showing that dissemination can occur also in the early stages of this process [47, 48] even in preneoplastic lesions [49]. Environmental exposures during the process of tumorigenesis could thus influence the risk of dissemination.

We’ve used a diet score modified to suit the intakes of a Swedish population, which may impair the generalizability of our results. However, the other lifestyle variables were assessed using internationally established criteria and the results may thus apply to other high-income or even transitioning populations. Our results indicate that pre-diagnosis lifestyle has an impact not only on CRC risk but also on disease-specific survival, which underlines the importance of primary preventive measures. Further studies are warranted to confirm our results.

Strengths and limitations

Our study has several strengths including a long follow-up time with many observed events, detailed clinical data, and a design that may have reduced the risk of reverse causation in lifestyle assessment. The study is based on high-quality questionnaires, and the method has been evaluated previously. Further, the proportion of questionnaire responders was high (93%) decreasing the risk of selection bias and missing data was scarce, increasing the internal validity. There are also weaknesses to consider, including the risk of misclassification bias in self-reported data. Unmeasured lifestyle changes post-diagnosis may be reflected in our results. Studies on lifestyle changes in CRC survivors report conflicting results, with some finding shifts towards more healthy dietary habits [50], and smoking cessation [51], while others report little or no change in lifestyle [52].

Our exposure variable, the HL score, is based on four dichotomized lifestyle factors, each given equal weight within the score. Our results however indicate that non-smoking and physical activity have a stronger association with an improved recurrence-free and overall survival than a healthy diet and BMI within the normal range. Future studies could thus consider using a weighted score. We chose to dichotomize the BMI variable, placing the underweight participants in the same “unhealthy” category as the overweight and obese. The impact of underweight, overweight, and obesity on CRC recurrence and survival may however differ, which should be considered in future studies. Using additional anthropometric markers may further improve body weight assessment [6]. Confounding due to additional unmeasured factors cannot be ruled out.

Conclusions

Our study indicates that adherence to a healthy lifestyle may increase the RFS and OS of patients with stage I-III CRC. Avoidance of smoking and being physically active were independent risk-reducing factors for these outcomes.