Introduction

Colorectal cancer (CRC) is one of the most common malignancies and a leading cause of cancer-related mortality worldwide1. Recent studies on the development of CRC has focused on the conventional adenoma-carcinoma pathway2. Nevertheless, accumulating evidence suggests that the serrated pathway is an alternative pathway, accounting for 15–30% of sporadic CRC cases3.

Conventional adenomas and sessile serrated lesions (SSLs), the two main precursor lesions of these two distinct pathways of CRC, have different genetic and epidemiologic characteristics4. Conventional adenomas progress to invasive carcinoma by the accumulation of mutations in genes such as APC, CNNTB1, KRAS, SMAD4, and TP42. Moreover, these adenomas tend to be more prevalent in the distal colon and rectum5. Because of age-related changes in the DNA repair mechanism and other environmental factors, the risk of conventional adenomas increases with age and male sex. Conversely, the serrated pathway involves genetic mutation of BRAF, high CpG island methylator phenotype (CIMP), and escape of cellular senescence via methylation of tumor suppressor genes or mismatch repair genes6. Additionally, SSLs, unlike conventional adenomas, are less strongly associated with age and sex. Although some studies have reported a lower risk of SSLs in patients < 50 years, the risk did not increase substantially with age7. Similarly, the association between SSLs and sex is unclear with the risk of SSLs being approximately equal in men and women4. These differences in genetic and epidemiologic characteristics suggest that conventional adenomas and SSLs may have distinct risk factors of development and progression.

Additionally, lifestyle and modifiable risk factors play crucial roles in the development and progression of colorectal polyps including adenomas and SSLs. Previous studies have linked smoking, red meat consumption, low-fiber diet, and obesity as risk factors for adenomas8,9,10,11. Few studies have evaluated the risk factors associated with SSLs; the meta-analysis by Bailie et al., reported that smoking increased the risk of SSLs by more than three times (risk ratio, 3.40; 95% confidence interval [CI] 1.90 − 6.07)12. Other studies found that compared to never drinkers, alcohol consumption is associated with the risk of SSLs, with risk ratios from 1.1 to 1.813. However, most previous studies did not clearly differentiate between SSLs and hyperplastic polyps because of the rarity and ambiguity of the definition of SSLs.

Therefore, in this study, we evaluated the modifiable risk factors associated with the development of three different precursor lesions of CRCs: non-advanced adenomas (NAs), advanced adenomatous lesions (ADLs), and SSLs, using a large screening colonoscopy database.

Results

Baseline characteristics of the study population

After exclusions, a total of 9025 patients were enrolled in the final analysis (Fig. 1). Table 1 presents the baseline characteristics of the study population. The median age of the enrolled participants was 48 years (interquartile ranges [IQR], 40–57 years), and 59.6% were male (n = 5375). Overall, 3641 (40.3%) participants had NAs, 836 (9.3%) had ADLs, and 533 (5.9%) had SSLs. Among the participants, 37.0% (n = 3338) had obesity (body mass index [BMI] over 25 kg/m2), 34.0% (n = 3064) had central obesity, 25.2% (n = 2277) had metabolic syndrome, and 4.8% (n = 436) had sarcopenia. Nearly half of the participants (47.2%, n = 4257) were current or former smokers: 29.0% were current smokers, and 18.2% were former smokers. Furthermore, 76.2% (n = 6876) of the participants were alcohol drinkers: 70.2% were light drinkers, and 6.0% were heavy drinkers.

Figure 1
figure 1

Flow diagram for the enrolled study subjects. In total, 9025 patients who underwent a comprehensive medical checkup were selected for statistical analysis after excluding 26,713 participants according to our exclusion criteria. NA was defined as 1 or 2 tubular adenomas < 10 mm in size. ADL was defined as any adenoma > 10 mm, ≥ 3 non-advanced adenomas, any adenoma with villous component, high-grade dysplasia, or invasive colorectal cancer. SSL was defined as one or more SSLs, with or without synchronous NAs.

Table 1 Baseline characteristics of the study population.

Risk factors associated with the development of non-advanced adenoma, advanced adenomatous lesions and sessile serrated lesions

Multivariable analysis was performed to assess the risk factors associated with the development of precursor lesions in CRC (Fig. 2). For the development of NAs, significant risk factors included older age (aOR 2.360, 95% CI 2.086–2.668, P = 0.044), male sex (aOR 1.317, 95% CI 1.098–1.580, P < 0.001), hypertension (aOR 1.437, 95% CI 1.216–1.638, P < 0.001), obesity (aOR 1.227, 95% CI 1.077–1.398, P = 0.002), current smoking (aOR 1.358, 95% CI 1.215–1.517), current alcohol consumption (aOR 1.164, 95% CI 1.038–1.306), and high physical activity (aOR 1.112, 95% CI 1.037–1.250, P = 0.048). Additionally, appendicular skeletal muscle mass (ASM) was a significant protective factor against the development of NAs (aOR 0.981; 95% CI 0.963–0.999; P = 0.044). In the development of ADLs, older age (aOR 2.987, 95% CI 2.496–3.576, P < 0.001), male sex (aOR 1.630, 95% CI 1.193–2.225, P = 0.003), hypertension (aOR 1.254, 95% CI 1.075–1.463, P = 0.004), obesity (aOR 1.600, 95% CI 1.300–1.969, P < 0.001), and current smoking (aOR 1.695, 95% CI 1.396–2.059, P < 0.001) were significant risk factors. Furthermore, ASM was a significant protective factor against the development of ADLs (aOR 0.968; 95% CI 0.939–0.998; P = 0.034). Current smoking was the only significant risk factor for SSL development (aOR 1.394; 95% CI 1.012–1.764, P = 0.006).

Figure 2
figure 2

Association of risk factors with the development of non-advanced adenomas, advanced adenomatous lesions, and sessile serrated lesions. (a) Non-advanced adenomas (b) Advanced adenomatous lesions (c) Sessile serrated lesions. BMI, body mass index; DM, diabetes mellitus; Hx, history; HTN, hypertension. Odds ratio was adjusted for age, sex, underlying diseases, laboratory findings, alcohol consumption, smoking status, physical activity, obesity, central obesity, and metabolic syndrome.

Modifiable risk factors associated with the development of advanced adenomatous lesions and sessile serrated lesions

We evaluated the odds ratio (OR) for modifiable risk factors by adjusting for potential covariates to reveal the impact of precursor lesions on CRC. In the model adjusted for age, sex, underlying diseases, laboratory findings, alcohol, smoking status, physical activity, central obesity, and metabolic syndrome, an increasing degree of obesity was positively associated with increased risk of NA (P for trend < 0.001) and ADLs (P for trend < 0.001), and was more strongly associated with the development of ADLs: the adjusted OR (95% CI) for obese II (BMI ≥ 30) was 1.459 (1.134–1.871) for the development of NA and 1.729 (1.168–2.558) for the development of ADLs. However, obesity was not associated with a risk of developing SSLs (Table 2). In the model adjusted for age, sex, underlying diseases, laboratory findings, obesity, central obesity, physical activity, and metabolic syndrome, smoking status was positively associated with an increased risk of NA, ADLs, and SSLs (P for trend < 0.001, P for trend < 0.001, and P for trend = 0.003, respectively). The risk was highest in current smokers and was attenuated in former smokers for all three precursor lesions of CRC. The association between smoking status and risk was highest for the development of ADLs (aOR for current smoking = 2.070; 95% CI 1.656–2.587), followed by SSLs (aOR for current smoking = 1.577; 95% CI 1.202–2.070) (Table 2). There was no increasing trend in the association between alcohol drinking status, ASM, and physical activity, and the development of all precursor lesions of CRC.

Table 2 Association of Modifiable Risk Factors with the development of Non-advanced adenoma, Advanced adenomatous lesions, and Sessile serrated lesions.

Subgroup analysis was performed to identify specific characteristics associated with modifiable risk factors for the development of CRC precursor lesions; the aORs for the development of ADLs were the highest in former and current smokers (3.803 and 5.175, respectively) among those with sarcopenia. Additionally, the aOR for the development of SSLs in current smokers was further increased in participants with metabolic syndrome (aOR 1.706, 95% CI 1.050–2.772) (Table 3).

Table 3 Adjusted risk analysis of development of adenoma in subjects with sarcopenia, metabolic syndrome, and central obesity.

Discussion

Our study evaluated distinct modifiable risk factors associated with the development of each of the three precursor lesions of CRC: NAs, ADLs, and SSLs. Obesity and current smoking were identified as the two most significant modifiable risk factors, whereas higher ASM emerged as a significant protective factor against the development of NAs and ADLs. Notably, these risk factors exhibited a more robust association with the development of ADLs, indicating a heightened effect on this particular lesion. The association between current smoking and development of ADLs was more significant in patients with sarcopenia. Our study revealed that current smoking was the only modifiable risk factor for the development of SSLs, and this risk was further increased in patients with metabolic syndrome.

In South Korea, individuals can voluntarily undergo medical screening, including colonoscopy, either by paying for it themselves or having it sponsored by their company, independent of the national cancer screening program. Additionally, because Korean colonoscopists are highly specialized professionals who have undergone rigorous qualification examinations, a fact well recognized by the public, and because of the relatively low cost of colonoscopy, it is easily accessible14. Therefore, data from these large-scale medical health checkups, including colonoscopies, have been published in previous Korean studies. A study by Kim et al. analyzing screening colonoscopy data from 70,428 individuals showed that men with both metabolic abnormalities and obesity are at an increased risk of ADLs15 Other studies reported the association between low skeletal muscle mass or visceral obesity and the risk of developing any adenoma or advanced adenomas/cancers16,17,18. However, none of these studies have evaluated the risk according to the subtypes of CRC precursor lesions.

In this study, obesity was one of the most significant modifiable risk factors for the development of precursor lesions of CRC in the conventional adenoma-carcinoma pathway. The degree of obesity was strongly associated with an increased risk: the highest risk was observed in severely obese patients with a BMI ≥ 30 kg/m2. Moreover, the influence of obesity on risk was particularly pronounced for ADLs. Several previous studies have reported an association between obesity and the risk of colorectal adenoma, most of which were conducted in Western populations2,13,19. In a previous meta-analysis including 36 studies, a 5-unit increase in BMI was reported to be associated with a 19% increased risk of colorectal adenomas, and this risk was similar across races20. Few studies have also reported this association in Asian patients. In a recent Chinese population-based study, obesity (BMI ≥ 28 kg/m2) increased the risk of colorectal adenomas by approximately 1.5 fold21. Although the mechanism by which obesity increases the risk of adenoma remains unclear, endocrine and inflammatory effect of adipose tissue has been suggested as a possible explanation. Among cytokines, leptin, which is highly expressed in adipose tissue, triggers the division of colonic epithelial cells through the activation of p42/44 mitogen-activated protein kinase in colorectal cancer cell lines and adiponectin stimulates colorectal carcinogenesis22. Proinflammatory cytokines such as tumor necrosis factor, interleukin-6, and macrophage migration inhibitory factor, which are secreted by the immune response in adipose tissue, are thought to have direct tumorigenic effects on colorectal mucosa23,24. Significantly obesity, metabolic syndrome, and central obesity exhibited a synergistic impact on the risk of adenoma. Although the reason remains unclear, our results emphasize that the impact is not solely attributable to a single metabolic disease but rather underscores the tumorigenic effect of an unhealthy lifestyle associated with the occurrence of multiple metabolic diseases. Considering that a previous study reported a significant reduction in the risk of metachronous adenoma with a 5% weight loss, high-risk patients with multiple metabolic diseases should be considered for intensive lifestyle intervention and monitoring25.

In our study, smoking was the sole significant factor for SSL and the lone common risk factor between SSL and adenoma. Few studies have suggested a more pronounced association between smoking and the serrated pathway than between smoking and conventional pathway. Previous studies demonstrated that smoking is a stronger risk factor for SSLs than for adenomas19,26. Furthermore, Anderson et al. reported that the risk of synchronous clinically significant serrated polyp and conventional high-risk adenoma was higher in current smokers, when compared to the risk of conventional high-risk adenomas alone27. The molecular pathological characteristics of microsatellite instability (MSI), CIMP, and DNA mutations such as BRAF and KRAS are widely recognized in the context of smoking and CRCs28. Given that MSI-high and CIMP-positive status are prominent features in CRC associated with the serrated pathway, the strong link between SSLs is not surprising29. In this study, current smoking was found to be a significantly stronger risk factor for SSLs than for conventional adenoma (aOR 1.660, 95% CI 1.264–2.180), although the risk was not as much for ADL. Notably, this risk is particularly prominent in patients with metabolic syndrome. Considering that the detection of SSL is difficult and associated with interval cancer, lifestyle modifications, including smoking cessation, should be considered in high-risk patients with recurrent SSLs.

Third, higher ASM played a significant protective role in the development of both NA and ADLs. Consistent with our findings, a previous large cross-sectional study found that low muscle mass is associated with an increased prevalence of colorectal neoplasia in both men and women, and an increased prevalence of advanced neoplasia in men16. Although the mechanisms linking lower ASM and adenoma occurrence remain uncertain, several potential explanations can be suggested. First, low muscle mass is associated with higher levels of inflammation-related serum proteins and cytokines. Fleming et al. reported that patients with low ASM had significantly higher systemic levels of C-reactive protein, interleukin-6, vascular endothelial growth factor, and expression of cell surface receptor CD1430. These inflammatory mediators and cell receptors have been shown to promote dysplastic transformation and cancer cell survival. Second, skeletal muscle is the major site of insulin-mediated glucose disposal and myokines synthesis31. The low skeletal muscle mass in patients reduced insulin-mediated glucose uptake and, subsequently, induced insulin resistance18, which has been linked to hyperinsulinemia, higher levels of growth factors such as insulin-like growth factor I, and alterations in nuclear factor kappa B and peroxisome proliferator-activated receptor signaling, which may promote colon neoplasia through their effects on colonocyte kinetics32. Considering the protective effect of higher ASM against colonic adenoma, vigorous physical activity, particularly strength training to increase muscle mass, should be considered in high-risk patients with low muscle mass.

This study has several strengths. First, this large-scale study evaluated the association between different modifiable risk factors and the development of colorectal polyps. Second, we standardized the diagnosis of colorectal polyps by uniformly applying current pathology criteria. Third, because this was a cohort study based on screening colonoscopy, the findings can be broadly applied to the general population. Fourth, within the same population group, a risk assessment for each type of colon polyp, such as NAs, ADLs, and SSL, was performed. However, several limitations should be considered when interpreting the results. First, because specific dietary patterns were not studied in detail, the effects of inappropriate diets such as high-fat or low-fiber diets on the risk of polyp development were not investigated. Second, the study questionnaire could not fully capture all aspects of smoking history, including whether the patient was a light smoker or a heavy smoker, owing to the complexity of smoking patterns. Third, there was no information available on medication history, including the use of aspirin, nonsteroidal anti-inflammatory drugs, or statins, which have been shown to reduce the incidence of colorectal cancer.

This large cross-sectional study demonstrated an association between various modifiable risk factors and colorectal polyp development, on screening colonoscopy, and showed that smoking and obesity are strongly linked to an increased risk of developing NA and ADL. Particularly, smoking was the only significant modifiable lifestyle risk factor for SSL. Further prospective, long-term follow-up studies are required to confirm these observations.

Methods

Study population

In this cross-sectional cohort study, participants who underwent a comprehensive medical checkup between February 2019 and December 2021 at a tertiary university-affiliated hospital were considered eligible. All individuals voluntarily underwent the medical checkup, either paying for it themselves or having it sponsored by their company. Patients who underwent qualifying colonoscopy, in which the appendiceal orifice was visualized, were enrolled in the study. The exclusion criteria were as follows: (1) participants who did not undergo colonoscopy; (2) insufficient data for the assessment of risk factors including obesity, alcohol consumption, smoking, sarcopenia, and physical activity; and (3) incomplete colonoscopy due to inadequate bowel preparation and insertion failure. Finally, 9025 subjects were selected for statistical analysis (Fig. 1). The study protocol was approved by the Institutional Review Board of Ewha Seoul Hospital (IRB No. 2023-02-040), and was performed in accordance with the ethical guidelines of the World Medical Association’s Declaration of Helsinki.

Definition of non-advanced adenoma, advanced adenomatous lesion, and sessile serrated lesion

NA was defined as 1 or 2 tubular adenomas < 10 mm in size. ADL was defined as any adenoma > 10 mm, ≥ 3 non-advanced adenomas, any adenoma with villous component, high grade dysplasia, or invasive colorectal cancer. The diagnosis of SSL was confirmed through histological examination, conducted by the pathologists, when there was at least one unequivocal aberrant crypt. Individuals who had one or more SSLs, with or without synchronous NAs, were classified under the SSLs group.

Definitions of study parameters

A standardized, self-administered questionnaire was used to collect information on demographic characteristics, smoking status, alcohol consumption, preexisting medical conditions, and medication use. Smoking status was categorized as current, former, or never smoked. Alcohol consumption status was categorized as heavy, light, or never drinkers. Heavy alcohol consumption was defined as a daily alcohol intake > 30 g/day for men and > 20 g/day for women. Patients who did not meet these criteria were classified as light drinkers.

The participants were categorized based on BMI according to the Korean Society for the Study of Obesity practice guidelines: underweight (BMI < 18.5 kg/m2), normal weight (18.5 kg/m2 ≤ BMI < 23 kg/m2), overweight (23 kg/m2 ≤ BMI < 25 kg/m2), obese I (25 kg/m2 ≤ BMI < 30 kg/m2), and obese II (BMI ≥ 30 kg/m2) kg/m2.33 Central obesity was defined as waist circumference ≥ 90 cm for men and ≥ 85 cm for women34. Metabolic syndrome was defined as the presence of three or more of the following: (1) waist circumference ≥ 90 cm for men and ≥ 80 cm for women; (2) triglyceride (TG) ≥ 200 mg/dL or the use of TG lowering medication; (3) high-density lipoprotein-cholesterol (HDL-C) < 40 mg/dL for men and < 50 mg/dL for women or the use of HDL-C raising medication; (4) blood pressure ≥ 130/85 mmHg or the use of antihypertensive medication; (5) fasting glucose ≥ 100 mg/dL or the use of antidiabetic medication35. Hypertension was defined as systolic blood pressure ≥ 140 mmHg, diastolic blood pressure ≥ 90 mmHg, or the use of antihypertensive medication36. Diabetes was defined as fasting glucose level ≥ 126 mg/dL or glycated hemoglobin A1C (Hb A1C) ≥ 6.5%, or the use of insulin of oral hypoglycemic agents37. Dyslipidemia was defined as one or more of the following: (1) total cholesterol ≥ 240 mg/dL; (2) TG ≥ 200 mg/dL; (3) HDL-C < 40 mg/dL (4) low-density lipoprotein-cholesterol (LDL-C) ≥ 160 mg/ dL; or (5) specific drug treatment38.

For sarcopenia, ASM was measured using multi-frequency bioelectric impedance analysis (InBody770; InBody Co., Ltd., Seoul, Korea), which showed excellent agreement with dual-energy X-ray absorptiometry and magnetic resonance imaging39. The sarcopenia index (SI) was calculated using the following formula: SI = ¼ total ASM (kg)/BMI (kg/m2). Sarcopenia was defined as SI < 0.789 in men and < 0.521 in women, based on a recent recommendation40. Sarcopenic obesity is defined as the presence of sarcopenia and BMI ≥ 25 kg/m241. Detailed physical activity assessments were initially obtained through self-reported questionnaires, which were structured identically to those used in the National Health Information Data base of the National Health Insurance Service42. On the basis of this survey, weekly physical activity-related energy expenditure (metabolic equivalent task [MET]-min/week) was calculated by summing the product of frequency, intensity, and duration43.

Statistical analysis

Baseline characteristics were expressed as frequencies and percentages for categorical variables and as medians and IQR for continuous variables. Student's t-test, χ2 test, and analysis of variance were used to compare variables. We performed multivariable logistic regression analysis to evaluate risk factors for the development of CRC precursor lesions. In addition, multivariable logistic regression analysis was performed to determine the independent association between modifiable lifestyle factors including obesity, appendicular muscle mass, smoking status, and alcohol drinking status and the risk of NAs, ADLs, and SSLs. This association was examined using models adjusted for potential confounding and mediating variables, including age, sex, underlying diseases, laboratory findings, alcohol consumption, smoking status, physical activity, obesity, central obesity, and metabolic syndrome. The severity of modifiable lifestyle factors was categorized according to the previously described definitions. In subgroup analysis, this analysis was repeatedly performed in individuals with sarcopenia, metabolic syndrome, and central obesity. All the adjusted OR and 95% CIs were calculated using this model. All statistical analyses were performed using SPSS (version 23.0; IBM, Armonk, NY, USA). A P-value of < 0.05 was considered statistically significant.

Ethics declarations and informed consent

The study protocol was approved by the Institutional Review Board of Ewha Seoul Hospital (IRB No. 2023 02 040), and was performed in accordance with the ethical guidelines of the World Medical Association’s Declaration of Helsinki. Informed consent was waived from the IRB due to the retrospective design.

Conference presentation

Presented as a poster presentation at the Korea Digestive Disease Week 2023.