Introduction

Breast cancer poses a serious threat to female health worldwide. Based on GLOBOCAN data, in 2020 alone, the number of new cases of breast cancer reached 2.26 million, and the incidence of breast cancer ranks first among female malignant tumours [1]. Although breast cancer is highly dangerous, its aetiology remains unclear, and its known risk factors include genetic factors [2], lack of physical activity, unhealthy weight, alcohol intake, and hormone replacement therapy [3].

Notably, the incidence of malignant tumours varies with the education level (EL). A negative correlation has been observed between EL and the risk of developing gastric, liver, lung, and oesophageal cancer in humans [4, 5], the association of EL with breast cancer remains controversial. EL is associated with an increased risk of breast cancer[5, 6], while other studies have yielded contradictory results [7]. The association between EL and breast cancer was determined by analysing the effect of EL on breast cancer in the population from the Kailuan Study.

Materials and methods

Study cohort

The Kailuan Study, which began in 2006, is an ongoing observational cohort study of a functional community population. The EL information of the Kailuan Study subjects was obtained at baseline (from May 2006 to December 2007). Then, continuous follow-up data, including breast cancer data, were obtained from the establishment of the cohort to the present, allowing us to study the association between EL and breast cancer. More details about the Kailuan Study can be found in the literature [8, 9]. Subjects who met the following criteria were recruited: (1) participation in the baseline survey of the Kailuan Study; (2) complete data on EL; and (3) signed the informed consent form. Those with history of malignancy were excluded. The study was approved by the Ethics Committee of Kailuan General Hospital and was conducted in accordance with the Declaration of Helsinki.

Collection of exposure information

Face-to-face questionnaire interviews and health examinations were performed by well-trained physicians or nurses by using a standardized protocol to gather information at baseline. These baseline data included sociodemographic characteristics (e.g., age and EL), lifestyle characteristics (smoking, alcohol consumption, salt intake and physical activity), history of previous diseases (hypertension, diabetes mellitus and malignant tumour) and physical examination data (height and weight). Participants were asked to take off their hats and shoes and wear thin clothes during the measurement. Height and weight were measured, and body mass index (BMI) was calculated as weight (kg)/height2 (m2).

Definition of variables

EL was classified into three categories, namely, low EL (illiteracy and primary), medium EL (middle school) and high EL (college and above). Smoking was defined as smoking one or more cigarettes per week in a recent year, and participants were classified into never smokers, former smokers and current smokers. Alcohol drinking was defined as drinking one or more alcoholic drinks per month for no less than six consecutive months, and participants were classified into never drinkers, former drinkers and current drinkers. Physical activity was classified according to the frequency of physical activity (20 min = 1 instance of activity) performed during leisure time. The categories of physical activity include exercise occasionally (< four instances of activity per week) and frequent exercise (≥ four instances of activity per week).

Collection of endpoint event information

The follow-up period was considered from the establishment of Kailuan Cohort (May 2006 – December 2007) to December 31, 2019. The follow-up endpoint was newly diagnosed breast cancer or death in the observed subjects (whichever came first). First, information on the participant medical records was obtained through the Tangshan City health insurance system. Then, the medical histories of subjects were collected by professionally trained investigators. Pathology, imaging (including magnetic resonance imaging, computed tomography, and colour Doppler ultrasonography) and blood biochemical examination results were verified by clinicians to confirm the diagnosis of breast cancer. Tumour cases were coded according to the International Classification of Diseases-10 (ICD-10), and breast cancer was coded as C50. Information on fatal events was obtained from the Kailuan Group social insurance system.

Statistical methods

SAS 9.4 was used to perform statistical analysis of the data. Continuous variables were expressed as mean ± standard deviation, while categorical variables were expressed as the number of cases (percentage). Differences in baseline demographic characteristics among the low, medium and high EL groups were analysed using variance analysis and Chi square test. A multivariable Cox proportional hazards regression model was applied to analyse the associations between EL and the risk of developing breast cancer. Subgroup analysis was performed according to age, BMI, physical activity and hypertension history, and the associations between EL and the risk breast cancer were explored in different subgroups. Sensitivity analysis was conducted to evaluate the consistency of our findings after excluding participants new breast cancer within 2 years of the start of the follow-up period. All statistical tests were considered statistically significant at P < 0.05 (two-sided).

Results

Baseline characteristics

The number of female participants in the 2006 baseline survey was 20,400. A total of 116 individuals with a history of malignancy and 155 participants lacking baseline EL data were excluded, and 20,129 female participants were eventually included in the statistical analysis. The mean age of the observed subjects was 48.75 ± 11.50 years. The low, medium and high EL groups included 1,078, 16,833 and 2,218 subjects, respectively. In comparison to the medium- and high-EL groups, the mean age, BMI and the proportions of subjects with current smoking, heavy salt intake, frequent physical activity, hypertension and diabetes were higher in the low EL group, whereas the proportions of current drinkers were lower (all P values < 0.0001; Table 1).

Table 1 Baseline characteristics of the participants

Association between EL and risk of breast cancer

The cumulative follow-up period in this study was 254,386.72 person-years, with a median follow-up time of 12.96 years. A cumulative total of 279 new breast cancer (n = 8 in the low EL group; n = 244 in the medium EL group; n = 27 in the high EL group) were detected during the follow-up period. Cox proportional risk regression analysis was carried out with the occurrence of breast cancer as the dependent variable and EL difference as the independent variable. In comparison with the low EL group, model 1 was adjusted for age, while model 2 was adjusted for smoking, alcohol consumption, BMI, physical activity, salt intake, hypertension and diabetes mellitus based on model 1. The hazard ratios (HRs) and 95% confidence intervals (95% CI) for breast cancer in the medium and high EL group were 2.23(1.12–4.64) and 2.52(1.12–5.70), respectively. In the sensitivity analysis, after excluding participants who developed breast cancer within 2 years after the start of the follow-up period, the risk of breast cancer was significantly higher in the medium and high EL group (Table 2).

Table 2 Associations between EL and breast cancer risk

Subgroup analysis of EL and risk of breast cancer

Subgroup analysis was performed according to age, BMI, physical activity and hypertension history, and the associations between EL and the risk breast cancer was explored in different subgroups. In comparison with the low EL group, in participants aged 50 years and above and the exercise occasionally subgroups, an increased risk of breast cancer was observed in the patients in the medium and high EL group, while an increased risk of breast cancer was observed in high EL’s subjects with BMI ≥ 28 and hypertension subgroup (Table 3).

Table 3 Subgroup analysis of the association between EL and breast cancer risk

Discussion

This prospective study based on the Kailuan cohort explored the association between EL and breast cancer risk. The results show that the risk of breast cancer is significantly increased in the medium and high EL groups compared with the low EL group. This study provides reference for the prevention and screening of breast cancer.

The risk of breast cancer significantly increased among subjects with medium and high EL, and this result is consistent with previous research results [5]. In a meta-analysis [6] that includes 18 cohorts, compared with low EL participants, the breast cancer risk of subjects with high EL significantly increased [pooled RR(95%CI) = 1.22 (1.14–1.30)]. In another study, higher EL was associated with a lower risk of breast cancer after adjusting for several confounding factors [7]. These differences in findings may be related to differences in subject race and the length of the follow-up period.

Subgroup analysis results showed that the risk of developing breast cancer is increased in medium and high EL subjects aged ≥ 50 years and exercise occasionally and in high EL subjects with BMI ≥ 28 and hypertension. In a meta-analysis, although the effect was different for pre- and post-menopausal women and for women of different geographic and ethnic groups, a statistically significant association was still observed between hypertension and increased breast cancer risk [10]. Therefore, prevention and screening should be prioritised in the older women. Enhancing physical exercise, controlling BMI level and keeping blood pressure normal are of essential for the prevention and control of breast cancer.

Although the association between EL and breast cancer risk has not been fully understood, several following aspects may tentatively explain the relationship between EL and breast cancer development. Firstly, women with more education are likely to have advanced age at first pregnancy, birth and fewer children, thus increasing their risk of developing breast cancer [11, 12]. Secondly, women with a higher EL are associated with a later menopause onset [13], which is also a breast cancer risk factor [14]. Moreover, more education was correlated with higher alcohol intake and higher prevalence of hormone therapy, and both factors may contribute to the development of breast cancer [6]. In addition, people with higher EL may have better screening compliance, and increased screening can reduce breast cancer mortality but increase incidence [15, 16].

This study has some limitations, such as the absence of some confounders (e.g., use of hormone therapy and age at menopause). The larger study cohort and longer follow-up time aims to minimise the effect of the above missing factors, and the abovementioned variables will be increased in subsequent studies. Secondly, only baseline data of EL were used for the analysis of the association between EL and breast cancer risk, and time-dependent exposures during follow-up were not considered. Some subjects may have received on-the-job education, so changes in EL may affect the risk of developing breast cancer, which should be explored in depth in subsequent studies.

Conclusion

An increased risk of breast cancer was associated with a higher EL. This condition was more prominent in the sensitivity analysis, and certain factors such as alcohol use and hormone therapy may play a mediating role. Breast cancer can be reduced by reducing alcohol intake, maintaining a normal BMI, controlling blood pressure, increasing physical activity and improving screening compliance.