Introduction

Coffee is one of the most commonly consumed beverages in the United States1. It consists of hundreds of compounds including caffeine, melanoidins, diterpenes, polyphenols, trigonelline, lipids, and complex polysaccharides, each with a wide range of effects on normal physiology and pathophysiology of human diseases2,3. Several natural compounds found in coffee exert inhibitory effects on cancer initiation and development4,5. For example, caffeine is a well-characterized psychostimulator and diuretic that decreases cancer cell proliferation6, while polyphenols exert antioxidant activity (chlorogenic and caffeic acids)7,8 and inhibit DNA strand breaks (ferulic acid)9.

Cancer accounts for a major burden of disease in the United States, with over 1.7 million new cases and 600,000 deaths estimated in 201910. There are clear disparities in cancer incidence and mortality rates across racial/ethnic groups11. Incidence rates of lung and breast cancer are highest among non-Hispanic whites, with ~ 40% and ~ 110% higher rates compared to Hispanics, respectively. Non-Hispanic Blacks have ~ 50% higher incidence rates of colorectal cancer and > 200% higher incidence rates of prostate cancer as compared to the lowest rates in Asians/Pacific Islanders. The mortality rates for non-Hispanic Blacks are also the highest among all racial/ethnic groups, with percent increases ranging from ~ 98% for lung cancer to ~ 350% for prostate cancer compared to Asians/Pacific Islanders. These racial/ethnic disparities in incidence and mortality rates are not fully understood, but may be due, in part, to differences in environmental exposures and nutritional habits12.

Epidemiologic studies have reported an inverse association between coffee consumption and many cancer types, which is consistent with the anti-carcinogenic activity of coffee components13. A protective association for coffee has been demonstrated in several cancers including colorectal, prostate, liver, endometrial, and oral/pharyngeal14,15,16,17,18. However, coffee consumption was found to be associated with elevated risk of lung cancer, although the association was substantially attenuated after adjustment for smoking19,20. The association between coffee consumption and cancer may differ across racial/ethnic groups with different consumption patterns and distributions of other risk factors. To date, most epidemiological studies of coffee consumption and cancer risk have been conducted in non-Hispanic White or Asian populations. To our knowledge, few studies have investigated this relationship in African Americans, the racial group that suffers the greatest burden of many cancers11. A prospective study in the Multiethnic Cohort reported a reduced risk of overall cancer associated with high coffee consumption in African Americans after adjustment for smoking and other potential confounders21, although the inverse association was not statistically significant for individual cancer sites21,22. Similarly, the Black Women’s Health Study found no association between coffee consumption and the risk of breast cancer in African American women23.

We designed a nested case–control study within the Southern Community Cohort Study (SCCS) to investigate the association between coffee consumption and the risk of the most prevalent cancers in the US (lung, prostate, breast and colorectal) in African Americans. The SCCS is a large prospective cohort comprised largely of medically underserved participants in the southeastern United States, with nearly 70% of study participants self-reporting as African American24.

Results

Characteristics of the study participants

Demographic and exposure characteristics of participants analyzed in this study are summarized and compared between cases and controls in Table 1. Cases included African American participants with one of the major cancer types: lung cancer (n = 524), prostate cancer (n = 503), breast cancer (n = 458), and colorectal cancer (n = 316) (Table 1). Controls were 3,337 African American participants without cancer. Characteristics of study participants summarized by categories of total coffee consumption are presented in Supplementary Table S1.

Table 1 Characterization of Southern Community Cohort Study African American cancer cases and controls included in the coffee consumption analysis.

Coffee consumption and risk of cancer

Lung cancer cases consumed more regular (P = 1.4 × 10–4) and decaffeinated (P = 0.033) coffee than controls; 33.7% of lung cancer cases consumed regular coffee ≥ 1 times/day as compared to controls (25.4%) and other cancer cases (from 21.8% to 27.1%) (Table 1). Similarly, 12.8% of lung cancer patients consumed decaffeinated coffee one or more times per day as compared to 9.2% of controls and 8.0%-9.4% for cases of other cancer sites. No statistically significant difference in coffee consumption was observed for other cancer cases compared to controls.

The association between coffee consumption and risk of cancer in African Americans was estimated for lung, prostate, breast, and colorectal cancers separately and combined. There are two multivariable models presented in Table 2: (1) adjusted for the case–control frequency matching factors only (age and sex) and (2) fully adjusted for the covariates relevant to each specific cancer type. In the minimally adjusted model, consumption of regular or decaffeinated coffee was statistically significantly associated with increased risk of lung cancer (P = 3.3 × 10–4 for 1–2 times/day and 4.1 × 10–4 for ≥ 2 times/day). However, this association was no longer statistically significant in the fully-adjusted model including BMI and smoking (P = 0.063 for 1–2 times/day and 0.726 for ≥ 2 times/day). No statistically significant associations between coffee consumption and cancer risk were observed for prostate, breast, colorectal, or all four cancers combined. The fully-adjusted ORs for the associations with ≥ 2 versus < 1 times/day were all below 1.0, indicating a tendency towards a reduced cancer risk with higher coffee consumption. Acknowledging our limited statistical power, similar results were observed when each type of coffee was considered separately (Supplementary Tables S2 and S3) or when the reference category included only non-drinkers (data not shown).

Table 2 Association between total coffee consumption and cancer risk in Southern Community Cohort Study African Americans.

Discussion

To our knowledge, the present report is among the first to investigate the impact of coffee consumption on the risk of cancer at multiple sites specifically in African Americans. A major strength of this analysis is the size of the SCCS cohort, which enables careful examination of cancer risk factors among African Americans in the southeastern US. We found no association between coffee consumption and risk of cancer overall or by cancer site after adjustment for cancer-specific risk factors.

In prostate, breast, and colorectal cancers, we observed a trend towards a protective association for coffee intake 2 or more times a day, although none of the results were statistically significant. This trend is in line with the recent large case–control and cohort studies that found a protective association between coffee consumption and the risk of cancers of the breast25, colon26, and prostate27 in Whites. In addition, our findings are consistent with a recent case–control study of White British individuals from the UK Biobank cohort28.

It has been suggested that coffee exerts its protective effect through a variety of potentially bioactive compounds. As an example, diterpenes such as cafestol and kahweol have been confirmed to exert anticarcinogenic effects by modulating the expression of proteins involved in cellular antioxidant defense or DNA repair29,30. Polyphenols such as chlorogenic and caffeic acid are also important constituents of coffee that were reported to inactivate various pathways involved in tumorigenesis (i.e. cell cycle regulation, apoptosis, inflammatory and stress response) through regulation of DNA methyltransferase activity31. In addition, chlorogenic-derived constituents have been shown to alter glucose metabolism and insulin sensitivity in rats32, which are established markers for some cancer sites in humans33.

The Women's Health Initiative Observational Study reported a borderline significant increased risk of colorectal cancer associated with higher coffee consumption, which is in contrast with the potential anticarcinogenic effect of coffee constituents34. A lack of association between regular or decaffeinated coffee and colorectal cancer has also reported in different racial/ethnic groups35, including African Americans21. The species of coffee bean, method of preparation, roasting degree, and serving size all influence the level and potency of bioactive compounds in coffee, and therefore, coffee’s potential health implications36. In addition, coffee is a major source of caffeine which has been reported to both stimulate and suppress tumors37. These could be potential reasons for differences in association between coffee consumption and cancer risk across studies. Further, the broad geographical distribution of the populations studied (including African Americans) could serve as a source of variability between association analyses due to the presence of region-specific lifestyle or environmental factors that may modify associations between coffee and cancer risk.

Similar to our results in African Americans, a positive association between coffee consumption and lung cancer risk has been reported by several prospective studies38,39 including the NIH-AARP Diet and Health Study20. However, the association was no longer statistically significant after adjustment for smoking. It is important to note that the NIH-AARP Diet and Health Study was comprised of 90% Whites overall and 96% whites among coffee drinkers of > 2 times a day. Our results are also in line with a previous study in the Multiethnic Cohort based on 964 African American cases with lung cancer21. Of note, any positive association between coffee consumption and lung cancer risk may be attributable to either residual confounding by smoking or potential interaction with unknown risk factors, and prospective studies in African Americans with complete exposure assessment are needed to confirm this hypothesis.

Coffee intake can be difficult to measure accurately, as epidemiologic studies typically rely on self-reported data from questionnaires which are hampered by recall bias and measurement error40, and most studies (including ours) do not have information on duration of exposure. Further, low consumption of coffee in a population may preclude accurate characterization of a dose–response relationship, as only 7.0% of the African American SCCS controls in our study reported drinking regular coffee ≥ 2 times per day. The same trend was observed for decaffeinated coffee consumption, with only 2.1% of African American SCCS controls drinking coffee ≥ 2 times per day. When considering overall coffee consumption (regular coffee + decaffeinated coffee), 31.6% of healthy African Americans (versus 59.8% of Whites in the SCCS) reported that they were daily coffee drinkers (≥ 1 times/day). Consistent with our observations, it has been reported previously that African Americans drink 2–3 times less coffee than Whites or Mexican Americans regardless of age and sex41. This is also in agreement with the low coffee consumption patterns in African countries as compared to European countries42. Thus, the low number of coffee drinkers among African Americans suggests the importance of large cohorts for a powerful investigation of coffee’s impact on cancer risk in this population.

In this study, non-drinkers were combined with those who consumed coffee < 1 times/day based on previous studies that reported no significant differences in cancer risk between non-drinkers and very low coffee drinkers in African Americans21,22,23. Having a larger reference group including both non-drinkers and extremely low drinkers increases the statistical power to detect an association between daily coffee consumption and cancer risk. Although previous prospective studies reported significant associations between very high levels of coffee consumption (i.e. ≥ 5 times/day) and cancer risk that were not observed in lower categories of intake20,25,38,43,44, the upper end of the administered FFQ limited our ability to investigate the highest levels of coffee consumption in relation to cancer risk. Other study limitations include potential measurement error in dietary coffee consumption due to reporting bias, incomplete food tables and a lack of information on the serving size (e.g. fluid ounces), type, and brewing method of coffee consumed. These factors influence the level of exposure to different bioactive compounds with potential anticarcinogenic effects. Further, the lack of evaluation of other known or unknown confounders (e.g. family history of cancer) that could potentially influence the association between coffee and cancer risk is a limitation.

In conclusion, we confirmed the relatively low consumption of coffee among African Americans in the SCCS and showed that coffee consumption was not strongly associated with risk of any of the four major cancer types in African Americans. Demographic, cultural, environmental, and genetic factors that differ across racial/ethnic groups may modify the association of coffee consumption with cancer risk, and thus, larger cohorts with a higher proportion of coffee drinkers and complete exposure assessment are needed in future studies to further evaluate the relation between coffee consumption and cancer in African Americans.

Methods

Study participants and data retrieval from the Southern Community Cohort Study (SCCS)

A detailed description of SCCS methods is available on the study website (https://www.southerncommunitystudy.org/) and in previous publications24. Briefly, the SCCS is an ongoing prospective study across 12 southeastern states in the US aiming to investigate chronic diseases in approximately 85,000 adults, two-thirds of whom are African Americans. Recruitment of participants (aged 40–79) began in March 2002 and ended in September 2009. A total of 84,797 participants were enrolled in the SCCS (65% African Americans, 30% White, 5% other racial/ethnic groups). Participants were followed from the date of enrollment until first cancer diagnosis, death, emigration or end of the follow-up period, whichever occurred first. Incident cancer cases were ascertained via linkage to state cancer registries, while deaths were ascertained via linkage to the National Death Index and Social Security mortality files.

At enrollment, participant characteristics including demographic variables, socioeconomic status, medical history, and lifestyle exposures were assessed using a standardized computer-assisted personal interview (available at https://www.southerncommunitystudy.org/). Epidemiological data from the SCCS examined in this study included anthropometric measurements (height, weight, BMI), sociodemographic factors, physical activity, vegetable consumption, smoking, daily alcohol (i.e. beer, wine, liquor), sweet beverage (i.e. soft drinks, Kool-Aid, or other sweetened drinks) and tea consumption, total energy intake, reproductive factors, medication use (e.g. aspirin), and hormone replacement therapy (HRT). Dietary intake, including regular and decaffeinated coffee consumption, was assessed using a validated, semi-quantitative food frequency questionnaire (FFQ) that was developed specifically for this study population45. For each food item, the FFQ elicited information on frequency of consumption using the following scale: never, rarely, once a month, 2–3 times a month, once a week, 2–3 times a week, 4–6 times a week, once a day and two or more times a day. Daily regular coffee intake, decaffeinated coffee intake, and total coffee intake (regular coffee + decaffeinated coffee) were each converted into a derived variable including three ordered categories of coffee drinking: < 1 times/day, ≥ 1 and < 2 times/day, and ≥ 2 times/day.

A nested case–control study was designed among recruited participants. Of the 55,362 African American participants recruited to the SCCS, a total of 1,801 cases with incident primary cancers of the lung (N = 524), prostate (N = 503), breast (N = 458), or colorectum (N = 316) were identified. For participants with multiple cancers, we included only the first diagnosed cancer, or if age at diagnosis was the same, we included the cancer occurrence most likely to reflect the primary site based on established metastasis patterns. Controls were randomly selected in a ratio of 5 cancer-free controls per 1 incident lung cancer (the most frequent site in our dataset) case of any racial/ethnic background, employing frequency matching on age at enrollment in 5-year categories, sex, and race (White, Black, other). African Americans among the selected controls (N = 3,337) served as the comparison group in this study. The same control group was used to assess the association with each individual cancer and all cancers combined. Only males (females) among the control group were included in prostate (breast) cancer analyses. The focus on the four major cancers was primarily due to lack of statistical power for analysis of less common cancers.

All methods were performed in accordance with relevant guidelines/regulations. Informed consent was obtained from all participants in the SCCS, and the study was approved by the Institutional Review Boards of Vanderbilt University Medical Center and the Meharry Medical College.

Statistical analysis

Data are presented as means (± SD) for continuous variables or percentages for categorical variables. We conducted Student’s t-tests and chi-squared tests of independence to compare demographic and risk factor distributions between cases and controls with regard to continuous and categorical variables, respectively. Quantile–quantile (QQ) plots were used to test for deviations from a normal distribution. All analyses were conducted separately by cancer type. Only female controls were included in breast cancer-specific analyses, and only male controls were included in prostate cancer-specific analyses. We estimated the odds ratios (OR) and 95% confidence intervals (CI) for the associations between categories of total coffee consumption and risk of each of the four major cancers (lung, prostate, breast, and colorectal) as well as all four cancers combined using unconditional logistic regression models adjusted for age and sex. Models were further adjusted for known cancer-specific risk factors: lung (BMI, total energy intake, smoking status and pack-years), prostate (BMI, physical activity, smoking status and pack-years), breast (BMI, physical activity, smoking status and pack-years, hormone replacement therapy (HRT)), colorectal (BMI, vegetable consumption, low-dose aspirin use, nonsteroidal anti-inflammatory drugs (NSAID) use, physical activity, smoking status and pack-years), and all cancers (BMI, smoking status and pack-years). BMI (1.3% unknown) and physical activity (total activity MET-hours; 2.4% unknown) were treated as continuous variables, whereas the remaining variables were categorized: smoking status (never, former, current; 0.7% unknown), pack-years (defined as number of packs of cigarettes smoked per day times the number of years the person has smoked; 20 + and < 20 packs/years; 2.8% unknown) vegetable consumption (5 + and < 5 times/day; 0.8% unknown), low-dose aspirin use (2 + and < 2 years; 2.5% unknown), NSAID use (Aspirin, OTC, or Rx; 1 + and < 1 year; 2.5% unknown), and, for women, HRT (ever and never use; 0.6% unknown). Covariate selection was informed by a combination of univariate association analyses in our data (Supplementary Table 1) and/or evidence of established risk factors in the literature. Cases and controls with missing values for the above-mentioned variables were excluded. Sensitivity analyses were conducted to assess the individual associations of regular coffee intake and decaffeinated coffee intake with cancer risk and to test the association with total coffee consumption using only non-drinkers as the reference category (N = 870 among controls).

Statistical significance in two-sided tests was set at P < 0.05, which indicates a 5% probability of a false positive association. All analyses were conducted using the R statistical computing software.