Introduction

COVID-19 is a global pandemic with an accumulated death toll of 6.328 million persons worldwide (Worldmeter 2022; available at https://www.worldometers.info/coronavirus/). There are many risk factors associated with the spread of the COVID-19 pandemic, including (1) environmental factors: low wind speed and fossil fuel energy production (Coccia 2020; Coccia 2021a); (2) urban and political structure: high population size and/or worse public governance, promoting lower expenditures in health system (Arbel et al. 2022; Coccia 2022); (3) professions: police officers (De Santi et al. 2021); and (4) medical factors: older age cohorts (García-Esquinas et al. 2021), cardio-vascular diseases (De Santi et al. 2021), obesity (Yanover et al. 2020), first vs. second wave (Coccia 2021b), and smoking (De Santi et al. 2021).Footnote 1

The tobacco epidemic is one of the biggest public health threats the world has ever faced, killing more than 8 million people a year around the world (World Health Organization (WHO) 2020c). With the outburst of the COVID-19 pandemic (World Health Organization (WHO) 2020a; Wu and McGoogan 2020), several researchers identified smoking as one of the serious risk factors causing coronavirus-related complications.Footnote 2 Smoking is also strongly associated with respiratory diseases such as bronchitis (Balte et al. 2020). Yet, two recent studies found an underrepresentation of current smokers among coronavirus patients in China, only 1.4% (Zhang et al. 2020) and 16.9% among severe patients (Guan et al. 2020), compared with the 50.5% prevalence of smokers among the overall adult Chinese population. Moreover, the current literature does not support smoking as a predisposing factor in men or any subgroup for infection with SARS-CoV-2. Also, differences in COVID-19 disease prevalence and severity are associated with differences of expression of ACE2 receptor in smokers, responsible for severe acute respiratory syndrome coronavirus (Cai 2020).

Indeed, there is a debate in the literature over whether current smoking increase individuals’ risk of developing symptomatic COVID-19. On the one hand, Hopkinson et al. (2021) support the conclusion that smoking increases individual risk from COVID-19 in England. Based on information from the UCLA hospital, Monteiro et al. (2020) suggest that obesity, smoking history, and elevated inflammatory markers were associated with increased need for IMV (intermittent mandatory ventilation) in patients with COVID-19. Rosoff et al. (2021) provide strong genetic evidence in support of smoking as a risk factor of COVID-19 and suggest that prevention programs aimed at reducing smoking may be important for the COVID-19 pandemic and have substantial public health benefits.

On the other hand, based on meta-analysis of China and US studies, Farsalinos et al. (2021) mention the unusually low prevalence of smokers (approximately 1/4th the expected prevalence) among hospitalized COVID-19 patients. The authors conclude that pharmaceutical nicotine or other nicotinic cholinergic agonists should be explored as potential therapeutic options, based on a recently presented hypothesis.Footnote 3 According to Yanover et al. (2020), and based on reports from Israeli hospitals, there is no statistical evidence in support of the null hypothesis that smoking and respiratory diseases are associated with COVID-19 complications. Finally, De Santi et al. (2021) demonstrated that smoking habit is associated with only 33.56% chance of infection from SARS-CoV2 (see footnote 1).

Another strand of the literature investigated the impact of COVID-19 lockdowns on prevalence of smoking and smoking cessation. Jackson et al. (2022) demonstrated that in England, the first COVID-19 lockdown was associated with increased smoking prevalence among younger adults and increased high-risk drinking prevalence among all adults. Smoking cessation activity also increased: more younger smokers made quit attempts during lockdown and more smokers quit successfully.

The objective of the current study is to address the following research questions: (1) can smoking prevalence explain COVID-19 indicators (cases, mortality, and recovery)? Are these relationships monotonically increasing or decreasing?Footnote 4 The collected indicators are updated to April 28, 2020, and include number of corona infections, number of severe cases of illness from corona, number of recovered cases from corona, the number of deaths from corona, coronavirus as the cause of death (percent of the total number of deaths), and the population of the country as of July 2019. The Organization for Economic Cooperation and Development (OECD Report 2020) issued a report, which specifies for each member country the prevalence of smoking among the population between 1960 and 2020 (available at https://data.oecd.org/healthrisk/daily-smokers.htm).

In an attempt to test the counter-intuitive possibility of a non-linear relationship between the prevalence of smoking and coronavirus indicators, the empirical model in the study relaxes the assumption of monotonic change by applying the quadratic design and testing which one of the two competing models (quadratic or linear) better fits the data. To the best of our knowledge, this possibility has not been examined previously in the context of smoking patterns and COVID-19.

In line with Lombardi et al. (2021), results indicate more complex relationships between corona indices and the prevalence of smoking than previously considered. Indeed, as anticipated, there is a linear increase of projected mortality rate from corona and corona as the cause of death with an elevated prevalence of smoking. Nevertheless, other projected indicators, such as the number of infected per 100,000 persons, rates of severe corona cases, and rates of recovered corona patients, exhibit quadratic, rather than linear, patterns. The reasons for these non-linear patterns might be explained by several conditions such as attenuation of hypercytokinemia for mild levels of smoking prevalence compared with non-smokers, which offsets the lack of sufficient lung reserves in heavy smokers, elevated social distancing of smokers in countries with lower smoking prevalence, and unidentified factors that should be examined in future research.

The remainder of the article is organized as follows. The “Material and methods” section portrays the methodology and the “Results and discussion” section presents the results. Finally, the “Conclusions” section concludes and summarizes.

Material and methods

Sample and data

The Organization for Economic Cooperation and Development (OECD Report 2020) issued a report, which specifies for each member country the prevalence of smoking among the population between 1960 and 2020 (available at https://data.oecd.org/healthrisk/daily-smokers.htm).

The collected COVID-19 indicators are updated to April 28, 2020, and include number of corona infections, number of severe cases of illness from corona, number of recovered cases from corona, the number of deaths from corona, and coronavirus as the cause of death (percent of the total number of deaths). To normalize these indicators based on the population of the country we used information from July 2019 (on the eve of COVID-19 pandemic outburst).

Measures of variables

Table 1 provides the descriptive statistics of all the normalized variables latter incorporated in the regression models. INFECTED are Corona cases per 100,000 persons. SEVERE are severe divided by total corona cases. RECOVERED are recovered divided by total corona cases. CAUSE OF DEATH indicates the ratio between the number of deaths from corona divided by total number of deaths in the country. MORTALITY RATE is the ratio between number of dead and infected cases from corona. SMOKERS is the prevalence of smokers in OECD countries.

Table 1 Descriptive statistics

The mean prevalence of smokers among the OECD countries is 26.26–26.37% and the median is 24.1–24.5%. The implication is that the distribution is right-tailed and the modal prevalence is lower than the median. While Brazil exhibits the minimum prevalence of 7.2%, Denmark exhibits the maximum prevalence of 57.5% (SMOKERS). Further regression outcomes (available upon request) demonstrate that in 1960, the projected prevalence of smokers in OECD countries is 52.27%. This percent steadily drops by 0.6479% per annum. The 99% confidence interval is 0.6043–0.6915% annual drop. Consequently, this percent is anticipated to drop to 13.40 percent in 2020.

Models and data analysis procedure

Consider the following two competing empirical models:

$${Y}_j={\alpha}_{1j}{\mathrm{SMOKERS}}^2+{\alpha}_{2j}\mathrm{SMOKERS}+{\alpha}_{3j}+{u}_{1j}$$
(1)
$${Y}_j={\beta}_{1j}\mathrm{SMOKERS}+{\beta}_{2j}+{u}_{2j}$$
(2)

where j = 1,2,3,4,5; \({Y}_1=\mathrm{INFECTED}=\mathrm{100,000}\bullet \frac{\mathrm{CORONA}\ \mathrm{CASES}}{\mathrm{POPULATION}}\); \({Y}_2=\mathrm{SEVERE}=100\bullet \frac{\mathrm{SEVERE}\ \mathrm{CASES}}{\mathrm{CORONA}\ \mathrm{CASES}}\); \({Y}_3=\mathrm{RECOVERED}=\frac{\mathrm{RECOVERY}\ \mathrm{CASES}}{\mathrm{CORONA}\ \mathrm{CASES}}\); Y4 = CORONA (the cause of death from the total number of dead persons); \({Y}_5=\mathrm{MORTALITY}=100\bullet \frac{\mathrm{DEATH}\ \mathrm{CASES}}{\mathrm{CORONA}\ \mathrm{CASES}}\); SMOKERS is the prevalence of smokers among the population; α1j, α2j, α3j and β1j, β2j are the parameters; and u1j, u2j are the stochastic random disturbance terms. All of these models are estimated via regression analyses based on the Ordinary-Least-Square (OLS) procedure, the conventional methodology to identify correlations between variables. Unlike Eq. (2), Eq. (1) permits non-monotonic change of the corona indicator with the SMOKERS variable.

To test which of these two models better fits the data for each indicator, the Ramsey RESET test is employed (e.g., Ramanathan 2002, pp. 270–271). The test is based on estimating the models: \({Y}_j={a}_j+{b}_jX+{c}_j{\hat{Y}}_j^2+{\mu}_j\), where \({\hat{Y}}_j={\hat{d}}_j+{\hat{e}}_jX\); j = 1,2,3,4,5; X = SMOKERS; H0: cj = 0; and H1: cj ≠ 0. Rejection of the null hypothesis suggests that compared to the linear model, the quadratic model fits the data better.

Results and discussion

Table 2 provides the regression outcomes. Based on these regressions, Figures 1 and 2 are based on projections obtained from regression outcomes reported in columns (1) and (2) of Table 2 and 95% confidence intervals. Appendices Figure 3 and 4 are based on projections obtained from regression outcomes reported in columns (3) and (5) of Table 2 and 95% confidence intervals.

Table 2 Regression outcomes
Fig. 1
figure 1

Projected corona cases per 100,000 persons vs. prevalence of smokers

Fig. 2
figure 2

Projected rate of severe corona patients vs. prevalence of smokers. Note that Figs. 1 and 2 are based on projections obtained from regression outcomes reported in columns (1) and (2) of Table 2 and 95% confidence intervals

Figure 1 displays the projected number of corona cases per 100,000 persons vs. the prevalence of smokers in OECD countries. Based on column (1) in Table 1, the RESET test supports the quadratic model in favor of the linear model. The figure demonstrates that at the 5–30.56% prevalence of smokers, projected corona cases per 100,000 persons drop from 322 to 100 cases. At the 30.56–60% prevalence of smokers, projected corona cases per 100,000 persons rise from 100 to 394 cases.

Figure 2 reports the projected rate of severe corona patients vs. the prevalence of smokers in OECD countries. Based on column (2) in Table 1, the RESET test supports the quadratic model in favor of the linear model. The figure demonstrates that at the 5–26.6% prevalence of smokers, the projected rate of severe corona patients drops from 1.92 to 1.12%. At the 26.6–60% prevalence of smokers, the projected rate of severe corona patients rises from 1.12 to 3.015%.

Appendix Figure 3 reports the projected rate of recovered corona patients vs. the prevalence of smokers in OECD countries. Based on column (3) in Table 1, the RESET test supports the quadratic model in favor of the linear model. The figure demonstrates that at the 5–34.20% prevalence of smokers, the projected rate of recovered corona patients rises from 18.47 to 47.417%. At the 34.20–60% prevalence of smokers, the projected rate of recovered corona patients drops from 47.47 to 24.82%.

Finally, Appendix Figure 4 reports the projected mortality rate from coronavirus vs. the prevalence of smokers in OECD countries. Based on column (5) in Table 1, the RESET test supports the linear model in favor of the quadratic model. The figure demonstrates a steady rise from a mortality rate of 2.07%, where 5% of the population are smokers to 19.49%, where 60% of the population are smokers.

Smoking is considered by the WHO as one of the biggest public health threats the world has ever faced, killing more than 8 million people a year, and causing respiratory illness, such as bronchitis (World Health Organization (WHO) 2020c; Balte et al. 2020). Coronavirus disease 2019 (COVID-19) is a declared global pandemic and is considered as a respiratory illness (World Health Organization (WHO) 2020a; Wu and McGoogan 2020). Consequently, given that (1) the act of smoking means that fingers (and possibly contaminated cigarettes) are in contact with lips which increases the possibility of transmission of virus from hand to mouth and (2) the lack of sufficient lung reserves among heavy smokers; smoking is considered a serious risk factor for both coronavirus disease infection and complications (World Health Organization (WHO) 2020b: Are smokers and tobacco users at higher risk of COVID-19 infection?). Yet, recent studies found a significantly lower prevalence of smokers in China among corona patients compared with the high prevalence of smokers in the total population (e.g., Guan et al. 2020; Zhang et al. 2020). Moreover, the current literature does not support smoking as a predisposing factor in men or any subgroup for infection with SARS-CoV-2. Also, differences in COVID-19 disease prevalence and severity are associated with differences of expression of ACE2 receptor, responsible for severe acute respiratory syndrome coronavirus (Cai 2020).

The objective of the current study is to analyze the potential relationships between different corona indicators and smoking prevalence on a statewide level, based on an OECD report. To test the counter-intuitive possibility of non-linear relationship, the proposed empirical model relaxes the assumption of monotonic change by applying the quadratic design and testing which one of the two competing models (quadratic or linear) better fits the data.

Conclusions

This section concludes and summarizes the manuscript. Findings of this article suggest more complex relationships between corona indices and prevalence of smoking than previously thought. Indeed, as anticipated, there is a linear increase of the projected mortality rate from corona and corona as the cause of death with an elevated prevalence of smoking. Nevertheless, other projected indicators, such as the number of infected per 100,000 persons, rates of severe corona cases, and rates of recovered corona patients, exhibit quadratic, rather than linear, patterns. The reasons for these non-linear patterns might be explained by several conditions such as attenuation of hypercytokinemia for mild levels of smoking prevalence compared with non-smokers, which offsets the lack of sufficient lung reserves in heavy smokers, elevated social distancing of smokers in countries with lower smoking prevalence, and unidentified factors that should be examined in future research.

The short-run public policy repercussions would be warning of the public by publicity campaigns to avoid crowding and reduce the exit from home in the face of the pandemic. If possible, the public should be encouraged to relocate on countryside regions with reduced levels of pollution.

The long-run public policy repercussions of our study are the following. In countries with high smoking prevalence, it would be of great importance to convince the public of smokers to quit smoking or at least reduce the consumption of cigarettes. This recommendation is strengthened following our findings, indicating a monotonic increase of COVID-19 mortality with smoking prevalence.

The limitation of our study includes the potential supplement of control variables, such as the level of pollution in each OECD country. If the pollution variance in OECD countries is high, this might lead to biased coefficients. This should be the subject of future studies.