Introduction

The association between cigarette smoking and colorectal cancer (CRC) is not established. In Norway, CRC is the second most common cancer found after prostate cancer among men and breast cancer among women with a total of 1,767 incident cases among women in 2006 [1]. For both genders, the age adjusted incidence rate of CRC has been doubled during the last 50 years. In 2002, Norwegian women had the second highest incidence of CRC in the world, only surpassed by women in New Zealand [2].

There has been, so far, no obvious explanation for the increase in risk and top ranking among Norwegian women [3]. Worldwide CRC is one of the most common cancers with a high mortality [2].

In 1996, Giovannucci et al. [4] hypothesized that smoking is an initiator of colorectal carcinogenesis, but that the increased risk only emerges 30–40 years after the smoking initiation. The notion that smoking is a risk factor for CRC has been supported by others [5]. However, two reports from 2004, one from the International Agency for Research on Cancer [6] and the other from the US Surgeon General [7], found that there was not enough evidence to conclude that the relationship between smoking and CRC is causal. Because the use of tobacco is rising globally among women [8, 9] smoking may result in large numbers of CRCs if a causal association exists.

For more than 30 years, from the late 1960s until the year 2000, the prevalence of daily, current smokers was more than 30% among Norwegian women. In 2005, daily smokers represented 40% among those with only primary school education [10]. A large proportion of Norwegian women are ever smokers, i.e., either current or former smokers. Data on smoking and CRC from prospective studies are still sparse. The purpose of this study was to examine the association between cigarette smoking and CRC incidence, overall and by location, in a countrywide, and population-based, prospective cohort study. Subsequently, we estimated the number of avoidable CRCs if no Norwegian women had smoked.

Materials and methods

Study population—The Norwegian Women and Cancer (NOWAC) study

As previously documented [11, 12] the NOWAC study is a prospective, country wide, and population-based cohort study comprising a representative sample of the Norwegian female population. All women were randomly selected from the Central Population Register according to year of birth. A letter of invitation to participate in the study contained a questionnaire and a pre-stamped return envelope. The National Data Inspectorate and the Regional Committee for Medical Research Ethics approved the study. All women gave an informed consent.

The NOWAC study was initiated in 1991 recruiting 57,600 women aged 34–49 years (response rate 57.6%) who answered a postal questionnaire. In 1996, women aged 30–69 were invited to respond to a postal questionnaire. The cohort expanded with 44,843 women (56.8% of the subjects). In 1998, a similar postal questionnaire was mailed to the initial sub-sample, of whom 46,971 women (81.5%) responded. More details about the study population may be found elsewhere [11, 12] (http://uit.no/kk/NOWAC/).

Exposure information

The questionnaire included detailed assessment of smoking habits, hormonal contraceptive use, postmenopausal hormone therapy (HT) use, physical activity (PA), and other lifestyle habits as well as height and current weight (allowing us to calculate body mass index (BMI) as weight in kilograms divided by the square of height in meters). It also comprised 82 different food frequency questions including alcohol consumption and dietary supplements.

The questionnaires asked if the women have ever been smoking, and those answering “yes” were asked for number of cigarettes smoked daily at different age intervals. Subsequently, they were asked if they smoke on a daily basis at the moment. We categorized ever smokers according to current and former smoking status, age at smoking initiation, smoking duration, average number of cigarettes smoked daily, pack-years of smoking (i.e., number of cigarettes smoked per day, divided by 20, multiplied by the number of years smoked), and latency (i.e., age at enrolment in the study minus age at smoking initiation) all at enrolment. Former smokers were classified according to years since quitting smoking. All women not being current or former smokers were classified as never smokers.

Women who reported a natural menopause or a bilateral oophorectomy at cohort enrolment were considered postmenopausal during follow-up. All other women were considered pre-menopausal, regardless of age, hysterectomy, or use of postmenopausal HT. We calculated average daily consumption of alcohol in grams based on the content of pure alcohol in different sorts of beverages among drinkers. Women who reported to be teetotallers and those answering “seldom” or “never” in the frequency table had their alcohol consumption set to zero.

Follow-up and endpoints

The study population comprised women, aged 30–69 years, who completed a questionnaire in 1996 together with the responders of the second questionnaire in 1998, for a total of 91,814 women. We followed the women with linkages to the Cancer Registry of Norway and the Central Population Register, utilizing the unique national birth number to identify all cancer cases and deaths/emigrations, respectively. The national registries are both accurate and virtually complete [13]. Woman-years were calculated from the start of follow-up to the date of diagnosis of CRC, the date of any incident cancer (except skin basal cell carcinoma) diagnosis, emigration, death, or the end of follow-up, i.e., 31 December 2005, whichever occurred first.

The colorectal tumors were classified according to the Seventh Revision of the International Statistical Classification of Diseases. Women with colon cancer were further categorized according to location of the tumors proximal (codes 153.0/153.1) and distal (codes 153.2/153.3) colon cancers. Tumors that were overlapping (code 153.4), specified as appendix (code 153.6), or unspecified (code 153.9) were classified as “others” and included in the analyses for the whole colon only.

We excluded 3,734 subjects who were diagnosed with any invasive cancer prior to the start of the study, 17 women who had emigrated or died, one woman with missing vital status, 2,264 women with insufficient information on smoking history, leaving 85,798 women. Altogether 17,638 women, including 116 cases, with missing information for any of the covariates considered a priori as relevant were excluded from the multivariate models. The study population comprised the remaining 68,160 women.

Statistical analysis

We calculated crude CRC incidence rates by dividing the number of cases by the total number of woman-years in that exposure category. The rates were then age-adjusted to the world standard population [14]. Rate ratios (RRs) and 95% confidence intervals (CIs) associated with different measures of smoking exposure for CRC overall, and according to location, i.e., in proximal- or distal colon or in rectum, were estimated by fitting Cox proportional hazards models [15] with never smokers as the reference group.

The RR of each of the following factors was estimated in both univariate and multivariate analyses and evaluated as a potential confounder of the relation between cigarette smoking and CRC: age (years, continuous), education (years, continuous), menopausal status (pre, post), both current and ever user of hormonal contraceptive (yes, no), ever user of postmenopausal HT (yes, no), mean recreational and work related PA score on a 10-level scale (1–10), BMI (continuous), and alcohol consumption (grams/day), all at enrolment. Several additional dietary factors, i.e., meat, fruit, vegetables, fiber, calcium, and folate, were also evaluated, but none were related to CRC in this cohort and therefore not kept in the final analyses.

Factors changing the RR estimate with 5% were included in the final multivariate models, i.e., age, menopausal status, ever user of hormonal contraceptive, ever user of postmenopausal hormone therapy, BMI, and alcohol consumption. The Cox analyses were performed with the PHREG procedure in the SAS statistical package [16]. Tests for linear trend were obtained by creating an ordinal exposure variable with equally spaced scores and including it in the models. The population attributable fraction (PAF) was calculated using the formula \( \hbox{PAF}\,=\,\frac{{P_{\text{e}} (RR_{\text{e}} - 1)}}{{P_{\text{e}}\,\times\,RR_{\text{e}}\,+\,(1 - P_{\text{e}} )}} \) quoted in Breslow and Day [17], where the notation P e = The proportion of persons in the population exposed to the risk factor and RR e = The relative risk in the exposed group compared to the unexposed. All statistical tests were two-sided and were considered statistically significant at p ≤ 0.05. Finally, we rerun the main analyses in the final models excluding the cases (n = 38) which occurred during the first year and also including a category for missing values for each of the covariates. The results did not materially change (data not shown).

Results

During the 533,786 woman-years of observation, 425 incident cases of histologically confirmed primary invasive colorectal [284 (67%) colon and 141 (33%) rectal] cancers were identified. Among the colon tumors 137 (48%) were classified as proximal, 108 (38%) as distal, and the remaining 39 (14%) cases as “others.” Altogether, 43,578 (64%) of the women reported to have been ever smokers of which approximately one half (51%) reported being current smokers. The proportion of women with a latency period of 30 or more years was higher among the former (62%) compared with the current (59%) smokers (p < 0.05).

Table 1 shows the distribution of selected characteristics at enrolment among cases and non-cases, and according to anatomical distribution of tumors. The cases were older and less educated, and were more likely to be post-menopausal. They were less likely to have a history of ever use of hormonal contraceptive and more likely to have used postmenopausal hormonal therapy compared with non-cases (all p values <0.05). There was no difference between colon and rectal cancer cases or between proximal and distal colon cancer cases for the variables displayed in the table (Table 1).

Table 1 Distribution of selected characteristicsa of the study population (n = 68,160) at cohort enrolment among cases, non-cases and according to tumor localization

Table 2 shows that the distribution of the selected characteristics varies according to the women’s smoking status. Current smokers differed from never smokers on all the listed variables in Table 2 (all p values <0.001), except for height. The age-adjusted incidence rates of CRC were 83 and 65 per 100,000 woman-years among ever and never smokers, respectively.

Table 2 Distribution of selected characteristicsa of the study population (n = 68,160) at cohort enrolment according to smoking status

Table 3 shows the multivariate RR estimates of CRC overall and separately for colon and rectal cancers among ever smokers, compared with never smokers. Former smokers had a 30% increased risk for CRC (RR = 1.3; 95% CI = 1.0–1.6) and a 40% increased risk of colon tumors (RR = 1.4; 95% CI = 1.1–1.9), whereas only non-significantly increased risks were found for current smokers. When the two groups were collapsed, ever smokers had a 20% increased risk of CRC (RR = 1.2; 95% CI = 1.0–1.5), a 30% increased risk of colon cancer (RR = 1.3; 95% CI = 1.0–1.7), and a non-significantly increased risk of rectal (RR = 1.1; 95% CI = 0.7–1.5) cancers, compared with never smokers. Ever smokers with a latency period of more than 40 years had a similar 30% increased risk for colorectal (RR = 1.3; 95% CI = 1.0–1.7), colon (RR = 1.3; 95% CI = 0.9–1.8), and rectal (RR = 1.3; 95% CI = 0.8–2.1) cancers. The displayed RR estimates for smoking initiation (≥20 and <20 years), number of pack-years smoked (0–9, 10–19, and ≥20), and duration of smoking (number of years smoked (1–19, 20–29, and ≥30)) showed a significant dose response association with colorectal tumors when a category for never smokers was included in the trend test (all p values <0.05), whereas the number of cigarettes smoked per day did not (= 0.14). Similar significant results were found for colon cancer, while none of the trend tests achieved statistical significance for rectal cancer. The RR estimates for rectal cancer in the upper exposure categories (latency ≥40, smoking initiation <20; number of pack-years ≥20, and number of years smoked ≥30) were all increased, although not significantly. No meaningful association was found between time since quitting smoking and colorectal tumors overall or by anatomical subsite (Table 3).

Table 3 Multivariatea rate ratios estimates and 95% confidence intervals of colorectal cancer overall, and by location among ever smokers according to various measures of smoking exposure at enrolment compared with never smokers among 68,160 women

Table 4 shows that a model including 137 proximal and 108 distal colon cancers found that former smokers had a significantly increased risk for both the proximal (RR = 1.6; 95% CI = 1.1–2.4), and distal (RR = 1.7; 95% CI = 1.1–2.7) colon tumors (Table 4). The PAF was estimated to be 12%.

Table 4 Multivariatea rate ratios estimates and 95% confidence intervals for proximal and distal colon cancer according to smoking status among 68,160 women

Discussion

The results from our study suggest that in Norway one out of eight CRCs would have been prevented annually if women had not been smoking. The increased risk of CRC is found foremost for colon cancer, but is also indicated for rectal cancer. In support of a causal relationship between smoking and CRC, our results show a dose–response relationship between age of smoking initiation, number of years, and of pack-years smoked. The increase in risk of CRC caused by smoking seems to require a long latency period. Thus our results are in accordance with the prevailing hypothesis described previously [4].

Our study has several major strengths. The study is a countrywide, truly population-based, and a prospective cohort representing the general female population in Norway. Also, the smoking histories were obtained at enrolment and, hence, are not subject to recall bias. Our cohort has virtually complete follow-up and we are able to examine the association with smoking according to site-specific cancer subgroups. We have a high proportion of both current and former smokers [18, 19], and the smoking habits found in our study reflect known smoking patterns among Norwegian women [20]. In addition, we have detailed information on, and are able to control for, established risk factors for CRC, many of which vary according to smoking status. CRC screening practices are not yet common in Norway and are therefore unlikely to affect our results. Furthermore, the cumulative incidence rates during follow-up for all cancer sites have been shown to be almost identical to those reported to the national cancer registries in Norway during the same period [12]. Since our cohort is representative both according to exposure and outcome, we can justify our estimation of PAF due to smoking.

Our study also has several limitations. We have a short follow-up period resulting in a limited number of cases. This gives us more unstable estimates, results that are more prone to chance and we are unable to stratify the analyses in detail according to different measures of smoking exposure. Since time of smoking initiation, latency, duration, and pack-years of smoking are highly correlated; we cannot separate their effects. We lack information on depth of inhalation, type of cigarettes smoked, and passive and occasional smoking status. We assume that women exposed to passive smoking have been included in the reference group. Around 10% of the Norwegian female population reported to be occasional smokers at the period of our enrolment of whom approximately half were former daily smokers [20]. We believe that in our study, some of these women have been excluded due to insufficient smoking information; some have been included in the reference group, while those answering “yes” to ever and “no” to current daily smoking have been classified as former smokers. Including women exposed to passive smoking and those who smoke occasionally in the reference group would have attenuated the associations between smoking and CRC. Categorizing occasional smokers as former smokers would most likely have the same effect. A limitation is that we do not have information on changes in smoking habits during follow-up. However, few Norwegian women start to smoke after the age of 30, which is the lowest age of enrolment in our study. We know that current smokers quit smoking and former smokers reengaged in smoking [20]. If a large proportion of the ever smoking women changed exposure category during follow-up, this would have resulted in an underestimation of the RR among current and an overestimation of the RR among former smokers. Since current smokers have an increased risk of dying from any major cause during follow-up and CRC is assumed to take many years, competing causes of death may decrease the impact of smoking among current smokers. Both circumstances could explain why the association with CRC is quite similar for current and former smokers. In the multivariate analysis, alcohol consumption was the covariant with most impact on the RR estimates. We lack information on changes regarding the covariates during the follow-up. There may be some residual confounding due to these and other unknown risk factors. Nevertheless, the dose response observed is suggestive of a causal association.

In the latest review by the International Agency for Research on Cancer in 2004 [6], most of the studies examined for a possible causal relationship between smoking and risk of CRC did not have the required hypothesized latency or induction period. The results from the few previous epidemiological studies finding weak positive associations between smoking and CRC have been explained as confounded by alcohol and other dietary risk factors [6, 21]. Indeed both, alcohol consumption [22, 23] and intake of read meat [23] were recently established as risk factors for CRC.

In Norway, women began smoking in substantial numbers in the late 1960s and early 1970s [10]. As a consequence our cohort study is the first in which a high proportion of Norwegian women could have accrued long enough induction time for the smoking and CRC relationship to take place. Nevertheless, a previous study from Norway, including older cohorts with fewer smoking women, did find a non-significant increased risk of both colon and rectal cancer among former, but not among current smoking women [24].

It has been suggested that colon and rectal cancer may have different risk factors [25]. Our results indicate that smoking is a risk factor for both proximal and distal colon cancer as well as for rectal cancer. The association is more consistent for colon cancer, which may be due to the fact that we had twice as many colon cancers as rectal cancer cases. We find the overall associations to be similar among former and current smokers. We find no association between time since smoking cessation and risk of CRC. These results are consistent with the hypothesis that smoking is an initiator rather than a promoter of CRC [26, 27].

Results from other cohort studies are sparse and somewhat inconsistent, and they do not represent the general population as ours do. In 1994, Giovannucci et al. [26] reported that ever smokers in the Nurses’ Health Study had an increase in risk of both colon and rectal cancer and that most importantly this increase in risk was not revealed until 35 years after smoking had begun. Using data from the Canadian National Breast Cancer Study, Terry et al. [28] found a significantly increased risk of rectal cancer, but not of colon cancer among women with a smoking latency of more than 40 years. The Iowa Women’s Health Study found that ever smokers among the postmenopausal women had an overall increased risk of CRC [29]. In a recent report examining the participants in the Women’s Health Initiative study, Paskett et al. [30] found a statistically significant increased risk among the less than seven percent current smokers for rectal cancer, but not for colon cancer. Among the approximately 42% former smokers the results showed a borderline significantly increased risk of 12% for colon cancer, and a non-significant increased risk of 15% for rectal cancer.

Two European cohort studies [31, 32], also found an increased risk for CRC overall. The Swedish study reported that after examining the relationship by subsite, ever smokers had a 60% non-significant increased risk for colon (n = 318) and a fivefold increased risk for rectal cancer which almost all of the 180 cases occurred among the men [31]. The other study from the UK only included 95 cases and did not report by subsite [32]. These results were, as ours, from multivariate analyses adjusting for, among other factors and intake of alcohol [26, 2832]. In a recent report from the US [33], examining two cohorts from 1963 to 1975 with both genders, a non-significant increased risk for rectal cancer was found among women in the most recent cohort. This study based on data from two private censuses could only adjust for a few socioeconomic factors.

Some of the case–control studies that show results for women separately have found a positive relationship between ever smoking and colon cancer, but not rectal cancer [34] for rectal, but not colon cancer [35], and for both [36] cancers. In a systematic review including six cohort and 15 case–control studies in Japan, the authors conclude that ever smokers may have an increased risk for CRC, and particularly for rectal cancer. The few results displayed for women were in agreement with this [37]. Several recent studies have examined if the relatively weak association between smoking and CRC is depending on genetic susceptibility [38] or on molecular aspects of the tumor [39, 40]. So far no clear pattern has emerged.

Cigarette smoking has been shown to be a precursor for colorectal adenomas which are established risk factors for CRC [41, 42]. Cigarette smoke contains a mixture of at least 60 established mutagenic carcinogens including arsenic, cadmium, ammonia, formaldehyde, and benzopyrene [6]. These and other genotoxic compounds, including polynuclear aromatic hydrocarbons, heterocyclic amines, nitrosamines, and aromatic amines may reach the colorectal mucosa through direct ingestion or through the blood stream [27]. Presence of anti-benzo(a)pyrene diolepoxide-DNA adduct formation in human colon mucosa have been suggested as an evidence of a direct carcinogenic effect on the colon derived from diet/and or tobacco smoke [43]. Benzo(a)pyrene [B(a)p] is an incomplete combustion product from cigarette smoke that has been shown to have powerful carcinogenic activity and to act locally. Hecht has recently published a model for understanding the mechanisms of tobacco carcinogenesis. This model conceptualizes the complex pathways that lead to genomic instability and ultimately to cancer due to persistence of un-repaired DNA adducts in tissues of people who smoke cigarettes [44, 45]. The above listed carcinogens in tobacco smoke are plausible risk factors for colorectal carcinogenesis and support the biological plausibility of a positive association between smoking and CRC.

The smoking epidemic among women in the last four decades can explain some of the marked increase in incidence of CRC in Norway. In conclusion, our results support the hypothesis that cigarette smoking is a preventable cause of CRC among women.