Background

Lung cancer is the most common cancer and the leading cause of cancer-related deaths worldwide, with an estimated 1.82 million lung cancer cases and 1.59 million deaths in 2012 [1]. Incidence and mortality rates for lung cancer are higher among men than women, with 1.2 million cases and 1 million deaths estimated in men and 580,000 cases and 490,000 deaths estimated in women in 2012 [2]. The incidence of lung cancer varies by age, sex, geographical location and histological type [3, 4]. These variations are mostly determined by differences in smoking patterns and exposures to other lung carcinogens [5,6,7,8]. Smoking, second-hand smoke, air pollution, asbestos, radon, and occupational exposure to chemical carcinogens are well-known risk factors for lung cancer [9,10,11,12,13]. Furthermore, a comprehensive review of epidemiological evidence revealed that low consumption of fruits and vegetables contribute to an increased risk of lung cancer [14, 15].

Obesity is linked to an increased risk of many cancers, including cancers of the breast (in post-menopausal women), endometrium, esophagus, gallbladder, kidney, colorectal, and pancreas [16]. By contrast, body mass index (BMI, a proxy measure of obesity) of ≥30 kg/m2, has been inversely associated with the risk of lung cancer in several case-control and cohort studies [17,18,19,20,21,22,23,24,25,26,27]. Besides, some of these studies have also shown that low BMI is associated with an increased risk of lung cancer [19, 20, 25, 28, 29]. Two recent meta-analyses have provided more evidence supporting that excess weight could significantly decrease the risk of lung cancer [30, 31]. There are some methodological issues in examining the association between BMI and lung cancer risk. Firstly, smoking is an established risk factor for lung cancer and is also associated with body weight, which may confound the relation between BMI and lung cancer [32, 33]. Smokers tend to be leaner than non-smokers; heavy smokers tend to have greater body weight than light smokers, which likely reflects an unhealthy lifestyle (for instance, poor diet and low level of physical activity) [32]. In fact, studies that restricted the analysis to never smokers, the association between BMI and lung cancer disappeared [34, 35]. Secondly, preclinical effects of lung cancer and associated weight loss may distort the association between BMI and lung cancer, which is often referred to as reverse causation [20, 36]. Studies that had a short follow-up or studies in which weight was reported shortly before cancer diagnosis are more prone to reverse causality. To our knowledge, few studies have attempted to tackle these methodological issues using Mendelian randomization approach [37,38,39]. However, this method has not been extended to evaluate non-linear associations. Apart from these, some epidemiological studies have failed to find the inverse association between BMI and lung cancer risk [40, 41]. In addition, histological types of lung cancer may exemplify largely divergent diseases with different etiologies, but studies examining the association between BMI and lung cancer by histological type are limited [24, 42, 43].

Hence, the aim of the present study was to examine the association between BMI and lung cancer risk in a pooled analysis based on nested case-control studies from four cohort studies in USA, Europe, China and Singapore. The large sample size of this nested study allowed us to assess the association by gender, smoking status and histological types of lung cancer.

Methods

Study population

This project was conducted under the framework of the International Lung Cancer Consortium (ILCCO). ILCCO was established in 2004 with the objective to pool equivalent data and maximize resource sharing and statistical power of epidemiological studies of lung cancer [44]. Four ILCCO studies are included in this pooled analysis. The collaborating cohorts have been described in detail previously [45,46,47,48,49,50,51]. These are the Carotene and Retinol Efficacy Trial (CARET), European Prospective Investigation into Cancer and Nutrition Study (EPIC), Shanghai Cohort Study (SCS), and Singapore Chinese Health Study (SCHS). A summary of selected characteristics of these cohorts is presented in Table 1.

Table 1 Characteristics of participating cohorts

Cases ascertainment and data collection method

Cases included were all incident primary lung cancer (International Classification of Diseases-Oncology (ICD-O) 3rd edition and included all invasive cancers coded to C33–34). All histological types were included. Case ascertainment varied among studies but included linking participants to cancer registries, health insurance records, medical records, self-report, and next of kin reports. Most of the cases among studies were histologically confirmed.

In each study, two lung cancer-free controls were matched per case (controls were cancer-free at the time of diagnosis of the matched case). Mostly, controls were matched to cases on age (plus/minus 5 years) and sex. Some cohorts used more stringent matching on other variables (Table 1). In each study, two lung cancer-free controls were matched per case.

Data on demographics and possible confounders were collected among studies through a self-administered written questionnaire (EPIC and CARET) or in- person interviews (EPIC, SCS and SCHS). At recruitment, measurements of height and weight were taken for all the participants of the CARET study and for most of the EPIC cohort (Table 1). In the SCS and SCHS cohort and for some of the EPIC participants (mainly for Oxford cohort, Norwegian cohort and approximately two-thirds of the French cohort) height and weight at baseline were self-reported. A detailed description of data collection methods has been published previously by the individual studies [45,46,47,48,49,50,51]. From each study, baseline information on anthropometric measurements (height and weight), history of cigarette smoking, sex, age at enrollment and diagnosis, year of last observation/follow-up, and level of education was requested.

Statistical analysis

Unconditional logistic regression models were used to estimate odds ratios (ORs) and 95% confidence intervals (CIs) for the association between BMI and lung cancer risk. BMI at baseline was calculated as weight in kilograms divided by the square of the height in meters (kg/m2) and classified into 4 categories according to the WHO international classification: underweight (BMI < 18.5), normal weight (18.5 ≤ BMI < 25), overweight (25 ≤ BMI < 30) and obese (≥30). Normal weight was used as the reference category. Pack-years of smoking were computed by using the formula: (number of years smoked x mean number of cigarettes smoked per day)/20. In cases, time elapsed was computed as the difference between the age at enrolment and diagnosis, whereas in controls, it was calculated as the difference between age at enrolment and last follow-up/observation.

All models were adjusted for sex, study center, age (< 45, 45–49, 50–54, 55–59, 60–64, 65–69, ≥70), time elapsed (< 2, 2–8, 9–14, 15–20, ≥20), pack-years of smoking (0, < 20, 20–29, 30–39, 40–49, and ≥50), and education level (none, primary school, middle/vocational, secondary school, postsecondary/technical and university). Subgroup analyses were performed for gender, smoking status and histologic types of lung cancer. Deviation of multiplicative interactions of BMI with sex and smoking status was explored by including an interaction term along with the main effect term in the adjusted model. The statistical significance of the interaction term was evaluated using likelihood ratio tests. To investigate possible reverse causation, sensitivity analysis was performed by excluding lung cancer cases diagnosed in the first 3 years of follow up. Additional, sensitivity analysis was also conducted by eliminating two studies (SCS and SCHS), where height and weight were self-reported. We tested for heterogeneity across studies using the Q and I2 statistic [52]. To graphically display odds ratios representing the dose-response association for BMI and lung cancer risk, we used the restrictive cubic spline (RCS) function with 4 knots (5, 10, 20, and 40 percentile) in a multivariate unconditional logistic regression model as described above. The selection of model (4 knots) was based on the lower Akaike Information Criteria (AIC). This analysis was performed using the RCS_Reg SAS Macro created by Desquilbet and Mariotti [53]. All analyses were performed using the SAS 9.3 software (SAS Institute, Cary, NC) and a p-value < 0.05 was considered as statistically significant.

Results

The study included 4172 lung cancer cases and 8471 controls aged 35 to 74 years (Table 1). Baseline characteristics of participants are presented in Table 2. Of the 4172 lung cancer cases, 3043 were men and 1129 were women. Compared with controls, cases were slightly older, had a lower education level and higher prevalence of current smoking. The average age at lung cancer onset in cases was 68.0 years, and the average time elapsed from enrollment to diagnosis of lung cancer in cases was 8.3 years.

Table 2 Selected characteristics of participants

In the total participants, cases had slightly lower mean weight compared with controls (68.2 and 69.7 kg). Mean height was similar (1.67 m). Fifty-two percent of cases and 51% of controls had BMI in the normal range, 27% of cases and 32% of controls were overweight, and 9% of cases and 11% of controls were obese.

Table 3 displays adjusted ORs and 95% CIs for lung cancer according to baseline BMI categories. Considering all participants, and using normal weight as the reference group, a decreased risk of lung cancer was observed for those who were overweight (OR 0.77, 95% CI: 0.68–0.86) and obese (OR 0.69, 95% CI: 0.59–0.82) whereas no statistically significant association was observed for underweight subjects (OR 1.03, 95% CI: 0.84–1.25). When stratified by gender, the inverse association observed between BMI and lung cancer risk was similar for overweight and obese men (OR 0.71, 95% CI: 0.62–0.81 for overweight group; and OR 0.63, 95% CI: 0.52–0.78 for obese group); the association for women was slightly attenuated (OR 0.80, 95% CI: 0.63–1.02 for overweight group; and OR 0.70, 95% CI: 0.51–0.97 for obese group) (Table 3).

Table 3 Adjusted odds ratio of lung cancer according to BMI categories

To further investigate the association between BMI and lung cancer risk among subgroups, we stratified the analyses by smoking status (Table 4). In both genders combined, the decreased risk for lung cancer was observed among current, former, and never smokers (P for interaction 0.002). The adjusted ORs for overweight and obese groups were 0.79 (95% CI: 0.68–0.92) and 0.75 (95% CI: 0.60–0.93) for current smokers, 0.70 (95% CI: 0.53–0.93) and 0.55 (95% CI: 0.37–0.80) for former smokers, 0.77 (95% CI: 0.59–0.99), and 0.71 (95% CI: 0.44–1.14) for never smokers, respectively. When separate analysis was performed by gender, the decreased risk for lung cancer was observed among both former and current male and female smokers but it did not reach statistical significance among female smokers, which could be explained by few number of female cases (Table 4).

Table 4 Adjusted odds ratio of lung cancer by smoking status according to BMI categories

We performed RCS regression to describe the nonlinear dose-response association between BMI and risk of lung cancer (Fig. 1). In all the participants, we found a significant nonlinear dose-response association between BMI and risk of lung cancer (P nonlinearity 0.001; Fig. 1a). After stratifying by gender, the evidence of a nonlinear association was observed in men (P nonlinearity 0.009; Fig. 1b) but not in women (P nonlinearity 0.11; Fig. 1c). After stratifying by smoking status, the nonlinear association was observed in former and current smokers (P nonlinearity 0.006; Fig. 1d,e respectively) but not in never smokers (P nonlinearity 0.14; Fig. 1f).

Fig. 1
figure 1

Adjusted dose-response association between BMI and risk of lung cancer: (a) All participants (b) Men (c) Women (d) Former smokers (e) Current smokers (f) Never smokers. BMI was coded using an RCS function with four knots arbitrarily located at the 5th, 10th, 20th and 40th percentiles. The y-axis represents the adjusted odds ratio for lung cancer risk for any value of BMI compared to individuals with a BMI of 22.0 kg/m2 (median value of BMI). Dashed lines are 95% CI. Knots are represented by dots

We also examined the association between BMI and risk of histological types of lung cancer (Table 5). When we stratified the analysis by histological types, the reduction in risk was observed for all histological types but it was statistically significant for adenocarcinoma and large cell carcinoma. The adjusted ORs for overweight and obese groups were 0.74 (95% CI: 0.62–0.87) and 0.65 (95% CI: 0.50–0.85) for adenocarcinoma, 0.69 (95% CI: 0.48–0.99) and 0.49 (95% CI: 0.26–0.92) for large cell carcinoma, respectively.

Table 5 Adjusted odds ratio of lung cancer by histological type according to BMI categories

The risk estimates did not change substantially in the sensitivity analysis after exclusion of lung cancer cases diagnosed in the first 3 years of follow-up (Additional file 1: Table S1). In addition, analyses using measured BMI, which included two studies (CARET and EPIC) yielded similar results (data not shown).There was mild heterogeneity between cohorts for the BMI-lung cancer results (P value = 0.12, I2 = 50%) (Additional file 2: Figure S1).

Discussion

In this pooled analysis of cohorts involving 12,643 subjects (4172 lung cancer cases and 8471 controls), we found a statistically significant inverse, dose-dependent association between BMI and lung cancer risk. This inverse association was present in current, former and never smokers and the effect was more evident for the subjects with a BMI of > 30 (kg/m2).

Our study found that being overweight or obese is associated with a decreased risk of lung cancer. This is consistent with a recent meta-analysis including 31 studies (20 cohorts, 11 case-control). The pooled relative risks in this study were 0.74 (95% CI: 0.68–0.80) for overweight (BMI 25–29.9 kg/m2) and 0.71 (95% CI: 0.68–0.80) for obese (BMI ≥30 kg/m2), compared with normal weight (BMI 18.5–24.9 kg/m2) [30]. More recently Duan et al. conducted a dose-response meta-analysis, which included 29 cohort studies and found evidence of a non-linear association between BMI and lung cancer risk (P nonlinearity < 0.001) [31]. Compared with individuals with a BMI of 18.5–24.9 kg/m2, the summary relative risks for those with a BMI of 30 kg/m2 and BMI 35 kg/m2 were 0.91 (95% CI: 0.85–0.98) and 0.81 (95% CI: 0.72–0.91), respectively [31]. A cohort study conducted in the UK, which was not included in this meta-analysis, also showed an inverse association between higher BMI and lung cancer risk [54].

In gender-stratified analysis, we observed similar results in both men and women. The findings of previous meta-analyses also indicated no gender differences in the association between BMI and lung cancer risk [30, 31].

Given that smoking is the most important risk factor for lung cancer and associated with body weight, we stratified our analyses by smoking status and found a significant inverse association between BMI and lung cancer risk among current, former and never smokers. This is in line with results of previous meta-analysis [31]. In this meta-analysis, the pooled RRs for overweight and obese groups were 0.91 (95% CI: 0.85–0.98) and 0.77 (95% CI: 0.69–0.85) for ex-smokers, 0.79 (95% CI: 0.71–0.87), 0.72 (95% CI: 0.66–0.78) for current smokers and 0.86 (95% CI: 0.78–0.94) and 0.86 (95% CI: 0.75–0.98) for non-smokers respectively [31].

In our study, stratification by histological subtype showed that overweight and obese was significantly inversely associated with risk of adenocarcinoma and large cell carcinoma. The results of stratified analysis in the previous meta-analysis reported a lower risk for adenocarcinoma and squamous cell carcinoma [30, 31].

Our study found no association between being underweight and risk of lung cancer. However, stratification by smoking status showed a non-significant increased risk for lung cancer in current smokers who were underweight (OR 1.24, 95% CI: 0.98–1.58). Results of a recent meta-analysis reported a significant positive association between low BMI and lung cancer risk (pooled RR 1.24; 95% Cl: 1.20–1.27; for underweight vs. normal weight) [31]. But, this association was confined to current smokers (RR 1.31, 95% CI: 1.10–1.57) and no statistically significant association was found in ex-smokers and never smokers (RR 1.40, 95% CI: 0.82–2.36 and RR 1.18, 95% CI: 0.90–1.54, respectively) [31].

A few biological mechanisms support the plausibility for the inverse association between BMI and lung cancer risk. Environmental Geno toxicants like polycyclic aromatic hydrocarbons (PAHs) that derived from smoking and occupational exposure, are known to cause DNA damage that results in a dose-dependent risk of lung cancer [55]. Among PAHs, benzo-α-pyrene is the most widely studied element, and its ability to induce lung tumors upon inhalation is well recognized [56]. Interestingly, studies have found inverse associations between BMI and benzo-α-pyrene DNA adduct levels among smokers, suggesting that increased body fat impacts adduct levels, possibly by affecting the distribution of the carcinogen [57, 58]. In addition, inverse associations have been reported between BMI and levels of urinary 8-hydroxydeoxyguanosine (8-OHdG), a marker of oxidative DNA damage in smokers [59, 60]. Brennan et al. reported that FTO genotype, a genetic marker of obesity which is related to increased BMI, was associated with a decreased risk of lung cancer [37]. On the other hand, two recent Mendelian randomization (MR) analyses for BMI and lung cancer, showed increased BMI was positively associated with lung cancer risk [38, 39]. However, the MR assumptions can be confounded by the potential pleiotropic effects of genetic variants associated with both BMI and smoking behavior [38, 39]. Recently Dik et al. conducted a large-scale genome-wide analysis of the association between BMI and DNA methylation and found increased BMI is associated with increased methylation at the HIF3A locus in blood and in adipose tissue [61]. HIF3A is an element of the hypoxia-inducible transcription factor (HIF) that controls a wide variety of cellular and physiological responses to reduced oxygen concentrations by controlling the expression of several target genes [62]. Studies have observed that HIF3A can regulate many genes associated with angiogenesis, in addition to cell survival and apoptosis [63, 64]. These observations suggest that HIF3A may play a role in lung carcinogenesis [60]. However, further molecular-epidemiological studies are needed in exploring the underlying carcinogenic mechanisms associating BMI with lung cancer risk.

The major strengths of this study include its prospective cohort-based nested case-control design, large sample size, and available information on potential confounders. As the study population was largely a nested sample from different prospective cohort studies and BMI was measured before lung cancer diagnosis, hence the possibility of selection and recall bias is minimal. In addition, we were able to perform the analyses by gender, smoking status and histological types. Our study also has some limitations. First, the use of some self-reported exposure information. However, separate analyses using measured BMI yielded similar results. In addition, previous studies have noted that even though self-reported height tends to be overestimated and weight tends to be underestimated, the self-reported values are highly correlated with the measured values [65,66,67]. Second, our analyses were based on self-reported cigarette smoking at baseline, and information on change in smoking habits during follow-up was not available from studies; if smoking habits varied over time, this could have had some effect on the results. However, we observed similar patterns for former and current smokers; hence it is unlikely to alter the results if current smokers became former smokers during follow-up. Third, adjustment for pack-years of smoking may not adequately control for confounding effect of cigarette smoking, thus residual confounding by smoking may still exist. Last, there was a wide range of time elapsed between BMI measurement (collected at baseline) and the date of diagnosis. However, sensitivity analysis examining this time elapsed by excluding the first 3 years of follow-up did not change risk estimates substantially. Hence, it argues against an effect of preclinical disease-related changes in anthropometric measures (reverse causation).

Conclusions

In conclusion, the results of this study provide additional evidence that obesity is associated with a decreased risk of lung cancer. Further biological studies are needed to address this association.