Background

Colorectal cancer is the fourth most commonly diagnosed cancer in the United States [1]. While there is high potential for dietary patterns as a modifiable risk factor for colorectal cancer development [2, 3], very limited evidence exists among colorectal cancer survivors [4]. For example, in a recent review, we identified 50 articles published up to 2016 that reported on the association between dietary patterns and colorectal cancer development [3] but only about five articles on the association between dietary patterns and outcomes among colorectal cancer survivors [4,5,6,7,8,9]. The evidence showed that the Western dietary pattern, often characterized by high intakes of refined grains, red and processed meats, desserts, and potatoes, is associated with higher risk of all-cause mortality, but generally not with colorectal cancer-specific mortality in patients with colorectal cancer. The prudent dietary pattern, often characterized by high intakes of fruits, vegetables, whole grains, and poultry, showed similar results, with inverse associations for all-cause mortality but no consistent association with colorectal cancer-specific mortality [5,6,7,8,9]. Higher adherence to other dietary patterns including the Mediterranean diet score, dietary approaches to stop hypertension meal plan, American Cancer Society cancer prevention guidelines score, healthy eating index score, were generally associated with lower risk of all-cause mortality, but the associations were inconsistent across studies [5, 6, 8, 9].

Further research is therefore needed to clarify if dietary patterns are important for colorectal cancer prognosis and if dietary changes can maximally impact overall and cancer-specific survival. Biomarker-based dietary patterns may be helpful in this regard. For example, hyperinsulinemia and insulin resistance are considered important underlying mechanisms linking poor dietary patterns and lifestyle behaviors, to the development of multiple chronic diseases, including colorectal cancer [10,11,12]. Studies have shown positive associations between circulating c-peptide concentrations (a marker of beta cell secretory activity) and colorectal cancer risk and prognosis [13,14,15,16]. Therefore, a dietary pattern associated with hyperinsulinemia may be more predictive of outcomes following colorectal cancer diagnosis, than a dietary pattern not associated with this pathway. We previously derived the empirical dietary index for hyperinsulinemia (EDIH) score, to assess the potential of dietary patterns to influence insulinemia [17], which has been extensively applied in large cohort studies and found to be predictive of non-fasting c-peptide concentrations [17, 18], long-term weight gain [19], risk of developing colorectal cancer [20], other digestive system cancers [21], and other cancers [22]. However, the influence of dietary insulinemic potential on cancer survival outcomes has not yet been evaluated. The objective of the current study is to apply the EDIH score in a large cohort of colorectal cancer patients to assess the insulinemic potential of their dietary patterns after diagnosis and determine its influence on survival outcomes.

Methods

Study population

We used data from the Nurses’ Health Study (NHS) and the Health Professionals Follow-up Study (HPFS) - two ongoing cohorts in the United States. HPFS was initiated in 1986 and enrolled 51,529 male health professionals between the ages of 40 and 75 years [23]. NHS, initiated in 1976, enrolled 21,701 registered female nurses aged 30 to 55 years [24] (Fig. 1). Data on medical, lifestyle, and other health-related factors was collected at baseline and have been updated every 2 years thereafter. Ethical approval for our study was provided by the Harvard T.H. Chan School of Public Health, and those of participating registries as required and the institutional review boards of the Brigham and Women’s Hospital. Study participants provided consent by completing and submitting study questionnaires. Participants were free to terminate participation in the study at any time.

Fig. 1
figure 1

Flow chart describing the flow of participants from the full cohorts to the final analytic sample in the Nurses’ Health Study (NHS) and Health Professionals Follow-up Study (HPFS)

Assessment of diet and the empirical dietary index for hyperinsulinemia (EDIH) score

In both cohorts, diet was assessed using a validated self-administered food frequency questionnaire (FFQ) that assessed how often, on average, participants consumed a standard portion size of various foods in the past year. In the NHS, diet was assessed in 1980, 1984, 1986 and every 4 years thereafter, whereas in the HPFS diet was assessed in 1986 and every 4 years thereafter [25]. The EDIH score, developed to empirically measure the insulinemic potential of whole diets using food groups, has been described in detail [17]. Briefly, thirty-nine food groups [26] were entered into stepwise linear regression models to identify a dietary pattern most predictive of plasma c-peptide levels. The EDIH score represents a weighted sum of 18 food groups, with higher scores indicating hyperinsulinemic diets (hyperinsulinemia) and lower scores indicating low insulinemic diets. The food groups contributing to lower EDIH scores are: wine, coffee, full-fat dairy products, whole fruit and green leafy vegetables; whereas the food groups contributing to high EDIH scores are: low-fat dairy products, French fries, low-energy beverages, cream soups, processed meat, red meat, margarine, poultry, non-dark fish, high- tomatoes, energy beverage and eggs [17].

In the current study, we calculated EDIH scores for each participant based on the self-administered FFQs. Post-diagnosis EDIH score was calculated based on the first FFQ returned at least 6 months but not more than 4 years after colorectal cancer diagnosis, thus avoiding diet assessment during active cancer therapy. The median time from diagnosis to post-diagnosis diet assessment was 2.1 years. Pre-diagnosis EDIH score was calculated based on the cumulative average of EDIH scores up to the last diet assessment before colorectal cancer diagnosis. The median time from pre-diagnosis diet assessment to diagnosis was 1.9 years.

Patients with colorectal cancer and mortality assessment

When a colorectal cancer diagnosis was reported during the previous 2 years on the follow-up biennial questionnaires, we requested permission to obtain hospital records and pathology reports. Blinded study physicians then reviewed these records and recorded data on tumor characteristics. For non-respondents, the National Death Index was used to identify deaths and ascertain any diagnosis of colorectal cancer that contributed to death. After 30 years of follow-up for disease diagnoses (1980–2010 in NHS and 1986 to 2010 in HPFS), we identified 4219 patients with pathologically confirmed colorectal cancer. We excluded participants who died before 1980 in NHS or 1986 in HPFS, had reported any cancer (except nonmelanoma skin cancer) before colorectal cancer diagnosis, who died at diagnosis, who did not have pre-diagnosis diet or post-diagnosis diet, patients who did not complete a diet assessment between 6 months and 4 years after diagnosis or had diet assessed outside of this period, who had diabetes at colorectal cancer diagnosis, and patients with stage IV or unknown stage at diagnosis. Therefore, the current analysis included 1718 patients with stage I, II or III colorectal cancer, including 600 participants from HPFS and 1118 from NHS (Fig. 1). Deaths were ascertained through reporting by family. For persistent non-responders, we queried the National Death Index with their names, up to June 2014 for NHS and January 2014 for HPFS [27]. Cause of death was assigned by blinded physicians.

Covariate assessment

Both cohorts assessed covariate data (e.g., medical history, lifestyle and health factors) through self-administered questionnaires every 2 years. These factors included physical activity, smoking habits, alcohol intake, multivitamin use, endoscopy status, regular use of aspirin and other non-steroidal anti-inflammatory drugs (NSAIDs), family history of colorectal cancer, weight, height, menopausal status and postmenopausal hormone use (only for women), in both cohorts as previously described. Diet assessment was conducted every 4 years [26, 28, 29].

Statistical analysis

We categorized the EDIH score into quintiles, with cohort-specific cutoffs, then pooled the data for analysis. Person-time of follow-up was calculated from the date of post-diagnosis diet assessment to death or to last follow-up date (January 2014 in HPFS or June 2014 in NHS), whichever was first. We used the Kaplan-Meier method to generate survival curves by quintiles of EDIH score, and tested group differences (highest vs. lowest quintile) using the log-rank test. For this test the EDIH score was adjusted for total energy intake and BMI using the residual method.

Cox proportional hazards regression was used to calculate hazard ratios (HRs) of colorectal cancer-specific death or all-cause death in EDIH quintiles. Quintile cutpoints were created separately by sex and applied in the pooled sample. Given that participants must survive from diagnosis until post-diagnosis diet assessment, we used time since diagnosis as the underlying time scale to account for left truncation due to staggered entry. The Cox models were tested for the assumption of proportionality using time*covariate interaction terms and stratified by age, sex and stage. We fitted two models to the data as follows: model 1 (minimally adjusted model) included BMI, demographic factors (sex, age at diagnosis), and tumor characteristics (stage, subsite within the colon, grade of tumor differentiation). Model 2 (fully adjusted model) included all the covariates in model 1 and post-diagnosis lifestyle factors: pack-years of smoking, physical activity, regular aspirin use, pre- to post-diagnosis weight change, total alcohol intake, and pre-diagnosis dietary pattern (EDIH score). Test for linear trend of risk across EDIH quintiles was performed using the median post-diagnosis EDIH score in each EDIH quintile as a continuous variable in the Cox regression models and interpreting the p-value of this variable as the p-value for linear trend.

To determine how changes in the insulinemic potential of diet before and after diagnosis influence survival, we dichotomized pre and post-diagnosis EDIH scores at the median and used to create a change variable with low indicating a score below the median and high, a score above the median: Low-Low: consistently low dietary insulinemic potential before and after diagnosis (i.e., both scores below the median); Low-High: patients consuming low insulinemic diets before diagnosis and more hyperinsulinemic diets after diagnosis; High-Low: patients consuming hyperinsulinemic diets before diagnosis and then changed towards low insulinemic diets after diagnosis and High-High: patients who consistently consumed hyperinsulinemic dietary patterns before and after diagnosis. We then applied these dietary pattern changes in multivariable-adjusted Cox models to examine risk of death from colorectal cancer and from other causes.

We conducted exploratory subgroup analyses in categories of the following potential effect modifiers: sex, weight change pre- and post-diagnosis, and pre-diagnosis EDIH score. We categorized pre-diagnosis EDIH at the median (< median and ≥ median). Weight change was calculated by subtracting pre-diagnosis weight from post-diagnosis weight, the continuous weight change variable was categorized as follows: those who gained more than 2.5 kg, had a stable weight (− 2.5 kg to 2.5 kg) or lost more than 2.5 kg. We also conducted subgroup analyses by time since diagnosis (< 5 years, ≥5 years) and age at diagnosis (< 65 years, ≥65 years). Tests of interaction between post-diagnosis EDIH score and the potential effect modifiers were assessed by entering in the model the cross product of post-diagnosis EDIH score and the stratification variable and evaluated by the Wald test. All analyses were performed using SAS 9.4 for Unix. All p-values were two sided.

Results

Characteristics of patients (65.1% women) with colorectal cancer after diagnosis are shown in Table 1. Mean age at diagnosis was 68.3 years and mean post-diagnosis BMI was 26.1 kg/m2, with 58.5% of patients classified as overweight or obese. Regarding disease stage, 71.5% had stage I or II and 28.5% had stage III disease. During a median follow-up of 9.9 years, there were 1008 all-cause deaths, which included 272 colorectal cancer-specific deaths. Median overall survival by cancer stage was 10.7 years for those with stage I disease, 9.9 years for those with stage II and 8.0 years for those with stage III disease. Forty-one percent of patients maintained a stable weight between − 2.5 kg and + 2.5 kg between the pre-diagnosis and post-diagnosis period, while 34% lost more than 2.5 kg body weight and 25% gained 2.5 kg body weight in the same period. Colorectal cancer patients with the most hyperinsulinemic dietary patterns after diagnosis (quintile 5) tended to have higher body weight and lower physical activity. For example, the average BMI among those classified in quintile 5 was 27.3 kg/m2 and the average physical activity was 13.8 MET-hour/week compared with 25.1 kg/m2 and 21.8 MET-hour/week among those in quintile 1. Also, patients consuming the most hyperinsulinemic dietary patterns were less likely to have stage I disease and they experienced shorter survival times (Table 1).

Table 1 Postdiagnosis characteristics of colorectal cancer patients by quintiles of postdiagnosis empirical dietary index for hyperinsulinemia (EDIH) score; n = 1718

Patients consuming low insulinemic dietary patterns had higher intakes of wholegrains, nuts, vegetables, whole fruits and coffee; and lower intakes of refined grains, cream soup, eggs, French fries, butter, margarine, sugar-sweetened beverages, red meat and processed meat. In terms of the nutrient profile resulting from this post-diagnosis dietary pattern, patients consuming a low insulinemic dietary pattern had higher intakes of total carbohydrates and total fiber and lower intakes of total fat, total protein and branched-chain amino acids (Table 2).

Table 2 Median (5th, 95th percentile) food and nutrient intake profiles of colorectal cancer patients by quintiles of post-diagnosis empirical dietary index for hyperinsulinemia (EDIH) score

Kaplan-Meier curves by quintiles of EDIH score are shown in Fig. 2, with patients consuming a low insulinemic diet (quintile 1) experiencing better survival for colorectal cancer-specific and overall mortality, compared to those consuming hyperinsulinemic diets (quintile 5). In the multivariable-adjusted analyses, we found that a hyperinsulinemic post-diagnosis dietary pattern was associated with higher risk of colorectal cancer-specific mortality and all-cause mortality (Table 3). Comparing colorectal cancer patients classified in the highest EDIH quintile to those in the lowest quintile, there was a 66% higher risk of colorectal cancer-specific death: HR, 1.66; 95%CI, 1.03, 2.69; P-trend = 0.07; and a 24% higher risk of all-cause death: HR, 1.24; 95%CI, 0.97, 1.58; P-trend = 0.08, after accounting for pre-diagnosis dietary insulinemic potential among other confounding variables (Table 3).

Fig. 2
figure 2

Kaplan–Meier curves of (a) colorectal cancer-specific and (b) overall survival among patients with colorectal cancer by quintile of post-diagnosis empirical dietary index for hyperinsulinemia (EDIH) score. Log-rank p-values were calculated to test group differences (quintile 1 vs. 5) and adjusted for post-diagnosis total energy intake and post-diagnosis body mass index

Table 3 Hazard ratios (95% CI) for colorectal cancer-specific and all-cause mortality among patients with colorectal cancer by quintile of post-diagnosis EDIH score

In relation to changes in the insulinemic potential of the diet before and after diagnosis, patients who consumed a more hyperinsulinemic dietary pattern consistently before and after diagnosis were at higher risk of dying from colorectal cancer (HR, 1.51; 95% CI, 0.98, 2.32) and from other causes (HR, 1.31; 95% CI, 1.04, 1.64), compared to patients who consistently consumed a low insulinemic dietary pattern before and after diagnosis (Fig. 3).

Fig. 3
figure 3

Hazard ratios for the association of change in dietary insulinemic potential between pre-diagnosis diet and post-diagnosis diet and risk of dying form colorectal cancer (CRC-survival) and from all causes combined (overall survival). EDIH scores were dichotomized at the median: Low-Low, the reference category, represents participants who persistently consumed low insulinemic diets (below the median EDIH) from pre to post-diagnosis period; Low-High are those who changed from low insulinemic diets towards more hyperinsulinemic diets; High-Low represents those who changed from consuming hyperinsulinemic diets prior to diagnosis towards consuming low insulinemic diets after diagnosis, whereas High-High represents those who persistently consumed hyperinsulinemic diets prior to diagnosis and after diagnosis. The number of deaths / patients alive in the four categories were as follows: CRC-survival: Low-Low 97/541, Low-High 33/196, High-Low 30/205, High-High 112/504; Overall survival: Low-Low 347/291, Low-High 140/89, High-Low 139/96, High-High 382/234. Models were adjusted for age at diagnosis, post-diagnosis body mass index, total energy intake, sex, race, year of diagnosis, cancer stage, grade of tumor differentiation, location of primary tumor within the colon, post-diagnosis physical activity, post-diagnosis pack years of smoking, post-diagnosis regular aspirin use, weight change pre to post-diagnosis, post-diagnosis total alcohol intake, and pre-diagnosis EDIH score

In subgroups of potential effect modifiers, risk of colorectal cancer-specific mortality was significantly elevated among women, and among those who lost body weight, those who were consuming a hyperinsulinemic dietary pattern before diagnosis and those younger than 65 years. For these subgroup analyses, interactions were statistically significant only for sex in all-cause mortality (Table 4).

Table 4 Subgroup analyses of the association between dietary insulinemic potential and colorectal cancer-specific and all-cause mortality

Discussion

In the current study, we showed that habitual consumption of hyperinsulinemic dietary patterns after colorectal cancer diagnosis, or consumption of a hyperinsulinemic dietary pattern consistently before and after diagnosis, was associated with higher risk of dying from colorectal cancer and from all causes combined.

The insulinemic potential of diet was first estimated by the insulin index [30], which is based on a concept similar to the more widely used glycemic index, that characterizes carbohydrate-containing foods according to their ability to raise blood glucose concentrations postprandially compared with a reference food (glucose or white bread) [31]. Though carbohydrate content is one important factor influencing insulin response, foods can also stimulate insulin secretion in a carbohydrate-independent manner. The insulin index directly quantifies the postprandial insulinemic potential of a food, and takes into account foods with a low or no carbohydrate content [30]. It is important to understand that the insulin index, which was used in most previous studies of dietary insulinemic potential and colorectal cancer survival, is conceptually and technically different from the EDIH, and essentially uncorrelated (Spearman r = − 0.03). The principle of the insulin index is how a particular food item stimulated insulin secretion independent of underlying insulin resistance, whereas the EDIH is primarily driven by insulin resistance. For colorectal cancer, the only other paper using the EDIH was on cancer incidence [20].

Both the insulin index and glycemic index assess the postprandial (short-term) effects of the diet, unlike the EDIH score which predicts integrated insulin exposure (i.e., both fasting and non-fasting), based on habitual (long-term) dietary intake [18]. Post-diagnosis insulin index and insulin load have been linked to higher risk of dying from colorectal cancer [32, 33]. Higher dietary insulin load and insulin index after diagnosis of colorectal cancer were associated with increased risk of colorectal cancer-specific and overall mortality [33]. The association of post-diagnosis glycemic indices with colorectal cancer prognosis has been inconsistent. Whereas one study found higher risk of colorectal cancer recurrence and death associated with higher glycemic load but not higher glycemic index [34]; another found no association between glycemic load or glycemic index and colorectal cancer survival [35]. Glycemic scores are primarily reflective of the postprandial glucose responses of carbohydrate-containing foods, whereas the EDIH score directly reflects insulin increases induced by components of the dietary pattern that may or may not be contributing to calories (e.g., coffee). Current study findings therefore suggest that the direct effect of the diet on insulin may be more important than the effect of diet on glucose for colorectal cancer prognosis. Though the glycemic index is a measure of the short-term (postprandial) effect of the diet on glucose concentrations, it is possible that such a habitual dietary pattern could, over time, lead to sustained hyperinsulinemia and insulin resistance, which could then mediate colorectal cancer prognosis. However, a previous study in these cohorts did not observe an association between an overall low-carbohydrate diet score and colorectal cancer or overall mortality; although those who consumed a plant-rich, low-carbohydrate diet, which emphasized plant sources of fat and protein with moderate consumption of animal products, had lower risk of colorectal cancer-specific mortality [36].

Insulin is a growth factor and major regulator of cell metabolism, and its effects in target cells are mediated by the insulin receptor, a transmembrane protein with enzymatic activity [37]. Evidence suggest that insulin stimulates growth mainly via its own receptor and not the IGF-1 receptor, and that in many cancer cells, the insulin receptor is overexpressed and the A isoform, which has a predominant mitogenic effect, is more represented than the B isoform [38]. The metabolic pathway stimulated by the activated insulin receptor to regulate glucose, protein, and lipid metabolism involves the PI3K/Akt pathway [39]. These characteristics provide a selective growth advantage to cancer cells when exposed to insulin. Therefore, all conditions of hyperinsulinemia, both endogenous (e.g., type 2 diabetes, metabolic syndrome, obesity) and exogenous (e.g., hyperinsulinemic diets; which also influence some of the endogenous conditions) [19, 40], will increase cancer risk and mortality [37].

Although interactions for most of the subgroup analyses were not statistically significant, some of the findings merit some discussion. The associations were stronger among women than among men, which may be related to several factors: the larger sample size and statistical power in our evaluation of women, potential confounding with age as women were younger on average than men, and a true biological interaction based upon endocrine and associated metabolic factors. We also observed that there were worse outcomes among patients who lost weight than among those who maintained a stable weight from pre- to post-diagnosis period, which may be consistent with complications of progressing disease leading to poor diet intake.

Major strengths of our study include the use of a food-based EDIH score that is correlated with circulating c-peptide concentrations [17, 18]. We had access to comprehensive pre- and post-diagnosis data on diet and important covariates, which reduces the potential for residual confounding and recall bias. Our findings also accounted for potential bias from staggered entry due to differences between participants in the time between diagnosis and post-diagnosis diet assessment. Limitations to be considered in interpreting our findings include: potential measurement error in the self-reported dietary and lifestyle data, though prior studies in the HPFS and NHS that evaluated the relative validity of FFQ data have shown reasonably good correlations between FFQ and diet records [23, 25, 41]. Though we adjusted for several potential confounding variables, a hyperinsulinemic dietary pattern may be associated with other factors not included in the current study. Therefore, we cannot completely rule out confounding by unmeasured variables. Given that we did not have information on cancer treatment, which could influence dietary choices of cancer patients or modify the diet and survival association, we adjusted all analyses by cancer stage at diagnosis, which is the principal determinant of colorectal cancer treatment.

Conclusion

In this large prospective study, a higher EDIH score, reflecting higher insulinemic potential of the diet, was associated with higher risk of death from colorectal cancer and from all causes. Taken together, our results suggest that this association may be mediated partly through mechanisms involving hyperinsulinemia. Interventions with dietary patterns to reduce insulinemia may enhance survivorship among colorectal cancer patients.