Background

Lung cancer is the leading cause of cancer deaths worldwide, and the third most common cancer in women, after breast- and colorectal cancer.1 In most countries, including Korea, the incidence of lung cancer in women is increasing, whereas in men, it has decreased or stabilised.2,3

Lung cancer in women has several characteristics that differ from lung cancer in men. Although smoking is the most important risk factor, a significant portion of lung cancer in women occurs in non-smokers.4 Global statistics estimate that 20% of lung cancer in Western populations develops in non-smokers.1 In Korea, 38% of female non-small-cell lung cancer patients reported themselves to be non-smokers.5 Female lung cancer patients have a higher proportion of adenocarcinoma histology,6 and a higher frequency of mutations in the epidermal growth factor receptor (EGFR)7 than male lung cancer patients. Such sex-related differences suggest that hormonal factors might influence the development of lung cancer.

Hormonal involvement in lung carcinogenesis was first suggested by experimental studies that identified oestrogen and progesterone receptors in human lung cancers.8 Steroid hormone-related receptors, including oestrogen receptors, progesterone receptors and human epidermal growth factor receptor 2, are frequently expressed in tumour lung tissue,8,9,10 and have been linked to survival in lung cancer patients.11,12 Oestrogen receptor α nuclear expression correlates significantly with adenocarcinoma histology, female sex and a history of never smoking.9 In addition, a recent study revealed a sex-related difference in oestrogen receptor β nuclear expression.8

As a result, it has been hypothesised that reproductive factors and hormone use could affect women’s risk of developing lung cancer. However, the epidemiological evidence is inconsistent and does not fully support that hypothesis. For example, with respect to late age at menopause, studies have variously shown an increase,13,14 a decrease15,16 or no significant change17,18,19 in lung cancer risk; early age at menarche has been shown to increase15 or have no significant effect16,18,19,20 on risk; a small number of pregnancies has been shown to increase,19,21,22 decrease20 and have no significant effect15,18 on risk; late age at first birth might decrease20 or have no significant effect15,16,18,22 on risk. Studies on the use of hormones have also produced heterogeneous results: the use of oral contraceptives has been shown to increase16 or have no significant effect18,19,20,22,23,24,25,26 on risk, and the use of hormone replacement therapy (HRT) might increase,27,28 decrease17,29,30 or have no appreciable influence15,18,19,25,31,32 on risk.

Previous cohort studies have been limited by small sample size (cohort studies published between 2001 and 2015 included 87–3512 lung cancer cases15,16,17,18,19,20,21,22,27,29,30,31,32,33,34), and case–control designs prone to recall bias. Moreover, the potential association between hormone exposure and lung cancer has not been comprehensively elucidated because several studies focused exclusively on exogenous hormone use.25,27,28,29,30,31,32,35

Furthermore, even though lung cancer in East Asia shows unique patterns in epidemiology7 and treatment response,36 few studies have been conducted there. East Asian females with lung cancer generally show lower smoking prevalence (9.9% in Korea,5 5.2% in China7 and 17–25.6% in Japan37) than is common in Western countries. Several case–control studies have been carried out in China and Singapore, but only a few cohort studies have been conducted in Asian countries—1 in Japan,18 2 in China22,33 and 2 in Singapore.19,21

In this study, we investigate the association between reproductive factors and the risk of lung cancer among postmenopausal women in a nationwide population cohort in Korea.

Methods

Data source and study setting

The Korean National Health Insurance Service (NHIS) is the single insurer that provides universal insurance for ~97% of the South Korean population. The remaining 3% of the population (in the lowest-income bracket) is covered by the Medical Aid program financed by the government, but it is also administered by the NHIS.

The NHIS provides national health check-up programmes, including a general health examination focused on cardiovascular risk factors, for all citizens aged 40 and above, and all employees regardless of age.38 It records the results of those general health examinations and questionnaires on lifestyle behaviour. In addition, the NHIS provides the National Cancer Screening Program (NCSP),39 which screens all individuals for stomach, liver, colorectal, breast and cervical cancers, as indicated by age.39 During our study period, Korean women were eligible to participate in biennial screening for breast cancer beginning at age 40 and cervical cancer beginning at age 30.39

The NHIS has qualification data on all its enrolees (age, sex, income level and mortality), diagnosis codes from the International Classification of Disease 10th revision (ICD-10), medical treatment information (medical services provided for medical expense claims) and prescription information.40,41 To manage enrolee qualification, the NHIS updates its mortality data through frequent links with mortality data from the Korean National Statistical Office. The NHIS database is widely used in epidemiological studies investigating disease incidence,42,43,44 and the details of the database profiles are described elsewhere.38,40,45,46 This study was approved by the Institutional Review Board of Samsung Medical Centre (IRB File No. SMC 2018-05-020).

Study population and data collection

From the NHIS database, we collected data for women older than 40 who had undergone both a cardiovascular health screening and an NCSP (breast/cervical cancer) screening between 1 January 2009 and 31 December 2014. Among 4,775,398 postmenopausal women who qualified, those who had previously been diagnosed with cancer (n = 167,910), participants diagnosed with any cancer within 1 year after the date of examination (n = 55,937), individuals who had previously undergone a hysterectomy (n = 17,667), participants who died within 1 year after examination (n = 4565) and those missing data in a key variable (n = 4116) were excluded. We excluded those who had previously undergone a hysterectomy because the questionnaire provided no information about the timing of the hysterectomy, and whether a bilateral oophorectomy had been performed concurrently. After those exclusions, a total of 4,525,203 individuals were included in our analyses (Fig. 1).

Fig. 1
figure 1

Flow chart of the cohort study design.

Information about health-related behaviours, and the menstrual and reproductive histories of participants, was obtained using a self-administered questionnaire. Participants came to the medical centre on the day of the screening examination, and filled out the questionnaire prior to the exam. According to the NCSP guidelines, women were asked about their age at menarche and age at menopause as continuous variables. For analysis purposes, we categorised age at menarche as ≤14 years, 15–16 years, 17 years and ≥18 years, and age at menopause as <40 years, 40–44 years, 45–49 years, 50–54 years and ≥55 years. The reproductive period was calculated as the interval between the age at menarche and age at menopause. Information on parity number, total lifetime breastfeeding history, HRT history and the use of oral contraceptives was collected categorically: parity number (0, 1 or ≥2 children), total lifetime breastfeeding history (never, <6 months, 6–12 months and ≥12 total months), duration of HRT (never, <2 years, 2–5 years, ≥5 years and unknown) and duration of oral contraceptive use (never, <1 year, ≥1 year and unknown).

Participants’ history of smoking was classified as never smoked, former and <10 pack-years smoker, former and ≥10 pack-years smoker, current and <10 pack-years smoker and current and ≥10 pack-years smoker. Drinking was divided into three levels: none, mild (<30 g of alcohol/day) and heavy (≥30 g/day). Regular exercise was defined as moderate physical activity for more than 30 min, and more than 5 days per week during the past week. Body mass index (BMI) was calculated using weight (kg) divided by height in metres squared (m2), and classified as low (<18.8 kg/m2), normal to overweight (18.5–24.9 kg/m2) and obese (≥25 kg/m2). Hypertension, diabetes and dyslipidaemia were defined using physicians’ diagnoses or the use of medication, based on self-reporting. Household income was categorised into quartiles based on insurance premium levels (in Korea, insurance premiums are determined by income level), with those covered by Medical Aid (poorest 3%) being merged into the lowest-income quartile.

Study outcomes and follow-up

The primary endpoint of this study was newly diagnosed lung cancer, defined as new claims for inpatient or outpatient care with the ICD-10 diagnosis code of C34 (malignant neoplasm of lung and bronchus), and registration in the special co-payment reduction programme for critical illnesses. The NHIS special co-payment reduction programme for cancer and some other intractable diseases relieves the large financial burden of patients with serious and rare diseases. For example, cancer patients pay only 5% of the total medical bill incurred for cancer diagnostic tests or treatment. As enrolment in this programme requires a medical certificate from a physician, the cancer diagnosis variable in our study is considered sufficiently reliable, and has been used in other studies.42,44 The cohort was followed from 1 year after the health check-up date to the date of incident lung cancer, death or the end of the study period (31 December 2016), whichever came first.

Statistical analysis

Continuous variables are presented as the mean ± standard deviation, and categorical variables are presented as the number and percentage. The incidence rates of lung cancer were calculated by dividing the number of incident cases by the total follow-up period. Hazard ratios (HRs) and 95% confidence interval (CI) values for lung cancer were analysed using Cox proportional hazard models for various reproductive factors. A multivariable-adjusted proportional hazard model was applied: (1) model 1 was age-adjusted; (2) model 2 was adjusted for age at menarche and menopause, parity, duration of breastfeeding, duration of HRT, duration of oral contraceptive use, alcohol consumption, smoking, regular exercise, income, BMI, hypertension, diabetes mellitus, dyslipidaemia and cancer; (3) model 3 was adjusted for reproductive period instead of age at menarche and menopause separately, as in model 2. Statistical analyses were performed using SAS version 9.4 (SAS Institute Inc., Cary, NC, USA), and a P value <0.05 was considered statistically significant.

Results

Baseline characteristics of study participants

During a mean follow-up of 4.4 years (19,910,975 person-years), 16,556 women (15,223 non-smokers) were diagnosed with lung cancer, for an incidence rate of 83.15 cases per 100,000 person-years. Table 1 provides the baseline information and characteristics of the study population. The mean age of the study participants was 61.2 years. Among all participants, only 3.1% were current smokers, 91.9% had breastfed and 14.6% had received HRT.

Table 1 Selected baseline characteristics of the study population.

Compared with the women not diagnosed with lung cancer, lung cancer cases had a higher mean age at study entry (65.9 vs. 61.3 years), a higher proportion of current and past smokers (6.5 and 1.6 vs. 3.1 and 1.1%), a tendency to exercise less (35.5 vs. 40.2%), more hypertension (51.4 vs. 42.3%) and more diabetes (16.7 vs. 13.8%). With respect to reproductive factors, only the duration of breastfeeding differed; the proportion of women breastfeeding for ≥1 year in lung cancer patients was 73.8%, which was higher than that among women not diagnosed with lung cancer (66.1%). The mean ages of menarche and menopause for lung cancer patients and the non-case group were 16.6 vs. 16.3 and 50.0 vs. 50.2 years, respectively, and the mean reproductive period was 33.4 vs. 33.9 years. Supplementary Table 1 provides a baseline characteristic analysis according to smoking status.

Associations between reproductive factors and the risk of lung cancer

None of the menstrual or reproductive variables analysed showed a statistically significant association with lung cancer (Table 2). In multivariate-adjusted model 2, the HRs for lung cancer were 1.03 (95% CI, 0.98–1.09) for those whose age at menarche was ≥18 years compared with those aged ≤14 years, 1.02 (95% CI, 0.91–1.14) for those whose age at menopause was ≥55 years compared with those <40 years old, 0.95 (95% CI, 0.84–1.06) for the group having ≥2 parity compared with those with nulliparity, 0.95 (95% CI, 0.88–1.02) for the group breastfeeding for ≥1 year compared with those with no breastfeeding history, 1.04 (95% CI, 0.95–1.15) for the group on HRT for ≥5 years compared with the never-HRT group and 1.06 (95% CI, 0.99–1.13) for the patients who had used oral contraception for ≥1 year compared with never-users. Furthermore, when model 3 was adjusted for reproductive period instead of age at menarche and menopause as in model 2, similar trends were observed. The reproductive period was calculated as the interval between the age at menarche and the age at menopause, indicating a strong correlation among those variables. Therefore, we analysed them separately. Moreover, a sensitivity analysis restricted to non-smokers showed no statistically significant association between reproductive factors and the risk of lung cancer (Supplementary Table 2).

Table 2 Hazard ratios (HR) and 95% confidence intervals (CI) for the association between reproductive factors and the risk of lung cancer among postmenopausal women.

Other factors associated with lung cancer

Compared with never-smokers, current smokers were at a significantly increased risk for lung cancer (aHR 1.98; 95% CI, 1.80–2.17 for <10 pack years and aHR 3.40; 95% CI, 3.12–3.69 for ≥10 pack years). The risk of lung cancer in women who were former smokers also remained elevated compared with never-smokers (aHR 1.27; 95% CI, 1.08–1.49 for <10 pack years and aHR 2.59; 95% CI, 2.14–3.14 for ≥10 pack years). Hypertension was associated with a slightly elevated risk of lung cancer (aHR 1.05; 95% CI, 1.01–1.08). Alcohol consumption, physical activity, BMI, diabetes mellitus, dyslipidaemia and income level were not statistically significant in the multivariable analyses.

Discussion

In the largest cohort study to date, we found that a wide range of reproductive history measures made no consistent contributions to lung cancer risk. The strengths of our study include an unprecedentedly large representative sample size and outcome incidence (more than 4.5 million women with 16,556 incident lung cancer cases), a comprehensive evaluation of the reproductive variables of interest and minimal follow-up losses (1.3%).

None of the menstrual variables analysed in our study showed a statistically significant association with lung cancer. Decades ago, a case–control study reported that a later age at menopause was associated with an increased lung cancer risk,13,14 whereas more recent studies found that a later age at menopause was associated with a reduced risk of lung cancer.15,47 With respect to menarche, Brinton et al.15 reported that early menarche was associated with an increased risk of adenocarcinoma, and a few other studies reported an increased risk for lung cancer with a later age at menarche, but the risk estimates were not statistically significant.19,22 Most other studies reported null associations.16,20,22,23,26,48,49,50,51,52,53,54 Weiss et al. reported that a longer reproductive period was associated with a decrease in the risk of lung cancer (HR 0.60 for 36 vs. <31 years; 95% CI, 0.39–0.93),22 but other studies reported null associations.18,21,33 Overall, previous studies showed that menstrual variables were not consistently associated with lung cancer risk, and our results support that finding.

Concerning parity, Kabat et al. reported an increased risk with increasing parity in Canada,20 but most other cohort studies reported an inverse association19,21,22,23,55 or null effect.15,17,18,26,48,50,51,53,56,57 A recent meta-analysis of 11 case–control and 5 cohort studies also found no overall relationship with parity.58 Our result of no association with parity concurs with that meta-analysis.

Only a few epidemiologic studies have examined the association between duration of breastfeeding and lung cancer, and their results showed null effects.18,21,33 Those studies were all conducted in East Asia, and our null result is consistent with theirs.

Previous findings about the association between HRT and lung cancer are conflicting, with studies reporting a reduced risk,17,29,30 an increased risk27,28 and a null effect.15,16,19,23,25,31,32,49 A systemic review that analysed 6 case–control studies showed a protective association between HRT and lung cancer,59 but a recent meta-analysis of 14 cohort studies found no statistically significant association between HRT and lung cancer.60 Our largest-ever cohort study confirms the null association.

The epidemiologic literature is also inconsistent with regard to the role of oral contraceptives in lung cancer risk. Some case–control studies14,53,59 noted that women who had used an oral contraceptive had a decreased risk relative to women who had not used one. But most other studies reported a null association.18,19,20,22,23,24,25,26,51 A systemic review61 that analysed nine case–control and five cohort studies showed no statistically significant association between oral contraceptive use and lung cancer, which is consistent with our findings.

While not the focus of our study, our data show that smoking is a definite risk factor for the development of lung cancer, producing a clear dose–response relationship. In East Asia, smoking prevalence in females has been relatively low due to a social atmosphere that deters women from smoking.62 Indeed, in our study, only 3.1 and 1.1% women identified themselves as current or past smokers. The other 95.8% were never-smokers at cohort entry, and 92.0% of lung cancer patients were never-smokers. However, the female smoking rate is increasing, especially in younger populations,63 and there are many hidden smokers.62 Some studies suggest that women are more susceptible to tobacco carcinogenesis than men.64,65,66 In the absence of other potent risk factors for women, our study’s results emphasise the importance of abstinence from smoking to prevent lung cancer in women.

In our study, having hypertension was slightly associated with an increase in lung cancer risk (aHR 1.05; 95% CI, 1.01–1.08) in women. Lindgren et al. reported that elevated blood pressure was associated with increased risk of lung cancer in smoking, hypertensive men,67 but most other cohort studies reported a null effect.68,69,70,71,72 The biological mechanism of such an association has rarely been investigated, but one recent study suggested that angiotensin-converting enzyme inhibitors are linked to a small increase in the risk of lung cancer and could be a potential confounder.73 Further studies are required to clarify this issue.

This study has several limitations. First, all the primary variables of interest were based on self-reported reproductive histories, and we therefore cannot exclude the possibility of bias caused by inaccurate recall. However, our data showed a null association, and it seems unlikely that recall bias could noticeably change that conclusion. Second, information on the components of HRT and oral contraceptives (e.g. oestrogen alone or oestrogen–progesterone) was unavailable. However, many studies, including the Women’s Health Initiative trials, reported that neither the use of oestrogen plus progestin nor oestrogen alone was associated with lung cancer incidence,28,74,75 and the HRT formula is unlikely to affect our conclusion of no association. Third, information about other possible risk factors for lung cancer (second-hand smoke exposure,76,77,78 indoor pollutants79,80 and a family history of lung cancers14,81) was also unavailable. Fourth, our study participants were limited to health-screening participants and might be healthier and more engaged in having a healthy lifestyle than the general population. Fifth, the follow-up period was relatively short at 4.4 years. Sixth, we could not include the pathological type of lung cancer because our data could not be linked to cancer registry data.

Conclusions

Overall, this study found no statistically significant association between reproductive factors or hormone use, and the risk of lung cancer in postmenopausal Korean women, among whom smoking prevalence is very low. Further studies are warranted to discover additional risk factors for lung cancer in the female population.