Introduction

Cotinine has been proved to be an exposure biomarker for monitoring tobacco use, which is most often determined in urine, serum and saliva, has longer half-life (16-h) than its precursor nicotine (2-h), and reflects the amount of smoking and metabolic status of the body1,2. The serum cotinine level increased in a dose-dependent manner with the increase of tobacco use or exposure3. A study showed that smokers exposed to the same amount of tobacco were more likely to become addicted if the body absorbed more nicotine, resulting in the decline of smokers' ability to quit smoking4. Exposure to nicotine, a precursor of cotinine in tobacco, could lead to green tobacco sickness and DNA damage5. Cotinine could also affect the reproductive function of men and women and increase the risk of cancer in offspring6.

Tobacco could produce a large number of toxic substances under incomplete combustion, and the most typical was VOCs7,8. Studies have shown that benzene was one of the carcinogenic and popular toxins in cigarette smoke, which belonged to VOCs and was related to smokers' leukemia9,10. One study measured the individual cotinine exposure of 72 smokers, and then calculated the benzene exposure. Compared with non-smokers, smokers have higher benzene exposure11. In addition, researchers also found other VOCs types in tobacco smoke, such as monoaromatic hydrocarbons (including ethylbenzene, toluene, styrene and xylene), and these might pose a certain risk to health12,13. Smoking, a social behavior, would lead to the absorption of nicotine and VOCs. At the same time, VOCs absorbed into the blood could inhibit the metabolism of cotinine by cytochrome P45014,15,16,17. It showed that when the body absorbed the above two at the same time, there could be a synergistic effect. Therefore, The quantitative measurement and joint analysis of VOCs and cotinine or nicotine are essential, because all of these toxins can circulate to important organs and tissues with the blood18.

Obviously, different VOCs have the same status of harm to human health, but the relationship between VOCs and cotinine is also indivisible, because their joint harm to health may be greater than that caused by them alone. For example, one study found that 2,5-dimethylfuran in VOCs could be used as a smoking monitoring biomarker comparable to cotinine19. In addition, Pauwels C's study pointed out the correlation between VOCs and cotinine, but did not show the one-way effect of VOCs on the metabolic level of cotinine20. Therefore, now it is urgent to use more representative samples to identify the synergistic effect of VOCs content in bleeding fluid on cotinine absorption and metabolism levels, because a few small sample studies only confirmed the correlation between them, rather than the causal relationship18,21. The key subjects we selected were smokers living with human immunodeficiency virus (HIV), hepatitis B virus (HBV) or hepatitis C virus (HCV), whose tobacco exposure was much higher than that of ordinary smokers, which was an effective and special place for us to solve this research question.

Generally speaking, the smoking rate of some patients with blood infectious diseases was often higher than that of the general population. According to a study report based on the United States integrated health system, people living with HIV were more likely to smoke and more vulnerable to the harmful effects of smoking than people without HIV22. Another study showed that hepatitis patients in the United States were mainly drug users, non-Hispanic blacks, the poor, people with low education and people with mental health disorders23. The typical feature of these groups was the high smoking rate24. Therefore, in this study, smokers living with HIV, HBV and HCV were studied as a group of high tobacco exposure people, in order to find out the more real association between serum VOCs and cotinine metabolism than the general smokers, compared with non-infected smokers.

Methods

Study sample selection

The materials used in this study were 14-year long-term follow-up survey data from Centers for Disease Control and Prevention (CDC) in the United States, which were a biennial survey conducted by the National Center for Health Statistics (NCHS). The 2005–2018 National Health and Nutrition Examination Survey (NHANES) data sets required for the analysis were downloaded from the NHANES website and combined using SAS software version 9.4. A total of 13,652 eligible subjects aged 21–59 was included in this study (Fig. 1). They had complete general sociodemographic information, as well as serological measurements that mainly included cotinine and VOCs.

Figure 1
figure 1

Flow chart of research subjects selection.

HIV, HBV, and HCV diagnosis

The HIV-1 antibody status was determined by the blood test results of NHANES. According to the test results, the subjects were divided into two categories: HIV positive or HIV negative. The VITROS Hepatitis B surface antigen (HBsAg) test using by NHANES was used to determine the infection status of HBV, which was divided into HBV positive and HBV negative according to the test results. In addition, in vitro nucleic acid amplification test (COBAS AMPLICOR HCV MONITOR Test, version 2.0) was used to quantitatively determine HCV RNA in human serum or plasma to determine HCV infection status (HCV positive and HCV negative).

Measurement of exposure and outcome

Participants' blood VOCs concentrations were automatically analyzed by capillary gas chromatography (GC) and mass spectrometry (MS) combined with selective ion monitoring (SIM) detection and isotope dilution. Serum cotinine was determined by isotope dilution high performance liquid chromatography / atmospheric pressure chemical ionization tandem mass spectrometry (ID HPLC-APCI-MS / MS). Smoking status was assessed from one self-reported item in the questionnaire: “Have you ever smoked cigarettes?”25 Thus, it might be described as “Smoker: yes/no” (as indicated in Supplementary Table S1), including smokers who reported they smoked at least 100 cigarettes in their lifetime and non-smokers who reported they had not smoked 100 cigarettes in their lifetime26. It was worth emphasizing that cotinine was defined as an effective biomarker of smoking in this study, while VOCs blood indicators were exposure biomarkers of cigarettes. We used the median (0.14 ng/mL) of serum cotinine in 3114 smokers as the boundary to divide it into a binary dependent variable (‘ ≥ 0.14 ng/mL’ was defined as high cotinine group of smokers; ‘ < 0.03 ng/mL’ was defined as low cotinine group of smokers), and the measured values of VOCs were included in the follow-up analysis as continuous independent variables. Similarly, the median (0.02 ng/mL) of serum cotinine in 10,538 non-smokers was defined as the boundary between high cotinine group (≥ 0.02 ng/mL) and low cotinine group (< 0.02 ng/mL) of non-smokers.

Classification of confounders

Potential confounders in the relationship between VOCs and cotinine were selected for inclusion into multivariable logistic regression model. This study identified age, gender, race/ethnicity, education, family poverty index ratio (PIR), depression status, and substance use behavior. The sociodemographic variables included in this study were described in Supplementary Table S1. Only age (in years) was continuous variable, and the rest were classified variables.

Statistical analysis

Independent Samples t-test was used for continuous variables that obeyed normal distribution, while Nonparametric test (Mann–Whitney U test) was used for continuous variables that did not obey normal distribution. All categorical variables were tested by Chi-square test. The single factor correlation analysis results were presented through the heat map based on Spearman algorithm. Correlation coefficient greater than 0.6 was defined as a strong correlation. Multivariable logistic regression analysis, adjusted for all potential confounders, was performed to determine the final relationship between VOCs and cotinine and to identify the independent risk factors leading to the increase of cotinine among smokers living with HIV, HBV, or HCV (HIV|HBV|HCV). The variance inflation factor was used to diagnose the multicollinearity between independent variables in the regression model (the multivariable regression model established in this study did not have multicollinearity, and all VIF values were less than 10). Cohen’s criteria for gauging small, medium and large effect sizes of R2 were 0.2, 0.5 and 0.8, which were calculated from Cohen’s d values27. A two-tailed P value less than 0.05 was considered statistically significant.

Data processing, statistical analysis, and graphic drawing were carried out with SAS version 9.4 (SAS Institute Inc., Cary, NC), IBM SPSS version 22.0, and R version 4.0.2 (http://www.R-project.org, The R Foundation).

Institutional review board statement

Ethical review and approval were waived for this study, since all the data from NHANES is publicly accessible.

Informed consent

Informed consent from all subjects was obtained by NHANES.

Results

This study preliminarily compared the differences of sociodemographic variables, blood VOCs and cotinine between people living with HIV|HBV|HCV and those without HIV, HBV, and HCV (HIV&HBV&HCV). The sociological characteristics of the subjects and the test results of the differences of these blood indexes were shown in Supplementary Table S1. Compared with the HIV&HBV&HCV non-infected group, people living with HIV|HBV|HCV were more likely to be older (44.6 years vs 40.2 years, P < 0.001), males (67.4% vs 45.5%, P < 0.001), non-Hispanic Black (36.0% vs 19.3%, P < 0.001), lower educational level (45.0% vs 37.2%, P = 0.013), lower economic level (21.9% vs 15.7%, P = 0.010), depression (42.9% vs 32.9%, P = 0.009), drug users (60.3% vs 44.4%, P < 0.001), current smokers (44.6% vs 22.4%, P < 0.001). In addition, we also conducted a statistical test on the blood indexes between above two groups, and the results showed that the serum Cotinine (ng/mL), 1,2-Dichlorobenzene (ng/mL), Tetrachloroethene (ng/mL), Carbon Tetrachloride (ng/mL), Methylene Chloride (ng/mL), 1,1,1-Trichloroethane (ng/mL), and Nitrobenzene (ng/mL) were significantly (P < 0.05) higher in current smokers than in non-smokers.

In order to further find possible differential serum VOCs, which might be associated with cotinine, we subdivided HIV|HBV|HCV infected group and HIV&HBV&HCV non-infected group into the following four subgroups (Supplementary Table S2): (1) Smokers living with HBV| HCV|HIV; (2) Non-smokers living with HIV|HBV|HCV; (3) Smokers living without HIV&HBV&HCV; (4) Non-smokers living without HIV&HBV&HCV. The results showed that the content of cotinine in the blood of the four groups was significantly different, that is, there was a statistical difference between each two groups (“Smokers living with HIV| HBV|HCV” vs. “Non-smokers living with HIV|HBV|HCV”, P < 0.01; “Smokers living with HIV| HBV|HCV” vs. “Smokers living without HIV&HBV&HCV”, P < 0.01; “Smokers living with HIV| HBV|HCV” vs. “Non-smokers living without HIV&HBV&HCV”, P < 0.001; “Non-smokers living with HIV|HBV|HCV” vs. “Smokers living without HIV&HBV&HCV”, P < 0.05; “Non-smokers living with HIV|HBV|HCV” vs. “Non-smokers living without HIV&HBV&HCV”, P < 0.05; “Smokers living without HIV&HBV&HCV” vs. “Non-smokers living without HIV&HBV&HCV”, P < 0.001). Figure 2 visualized above differences of serum cotinine content between each two groups. The results demonstrated that smokers living with HIV|HBV|HCV had significantly higher serum cotinine level than non-smokers living with HIV|HBV|HCV, and that smokers living without HIV&HBV&HCV had also significantly higher serum cotinine level than non-smokers living without HIV&HBV&HCV. Most obviously, smokers living with HIV|HBV|HCV had the highest serum cotinine level among the four groups. In addition, Supplementary Table S2 showed that four differential blood VOCs indicators, including 1,2-Dichlorobenzene (ng/mL), Tetrachloroethene (ng/mL), Benzene (ng/mL), and 2,5-Dimethylfuran (ng/mL), had positive dose effects on serum cotinine level.

Figure 2
figure 2

The distribution of serum cotinine levels in the different infection groups (***P < 0.001; **P < 0.01; *P < 0.05).

Subsequently, this study conducted Spearman correlation analysis on exploring the correlation between serum cotinine and VOCs among the above four groups. The results showed that the correlation heatmap between them was significantly different between different groups. From a macro perspective, the intensity and density of the correlation between blood indexes in the HIV|HBV|HCV infected group were higher than those in the HIV&HBV&HCV noninfected group. Compared to non-smokers living with HIV|HBV|HCV, the correlation between different blood indexes was more and stronger in smokers living with HIV|HBV|HCV. Compared to non-smokers living without HIV&HBV&HCV, the correlation between different blood indexes was more and stronger in smokers living without HIV&HBV&HCV. Specifically, there were 12 pairs of exclusively strong interaction items in smokers living with HBV|HCV|HIV (correlation coefficient greater than 0.6), including 1 pair of negative correlation indicators (correlation coefficient r = −0.83) and 12 pairs of positive correlation indicators (correlation coefficient fluctuation range is 0.81–0.92). All the above correlation coefficients mentioned were lower than the level of statistical significance (all P < 0.05), please see Fig. 3 for the details.

Figure 3
figure 3

Spearman correlation analysis between serum VOCs and cotinine in different groups (a. Smokers living with HIV, HBV, or HCV; b. Non-smokers living with HIV, HBV, or HCV; c. Smokers living without HIV, HBV, and HCV; d. Non-smokers living without HIV, HBV, and HCV).

As for the association between serum VOCs and cotinine levels in 3114 smokers, multivariable binary logistic regression model, adjusting for age, gender, race/ethnicity, education, PIR, depression status, and drug use history, was conducted to identify potential associated VOCs of cotinine (Table 1). The results showed that 1,2-Dichlorobenzene, Benzene, Carbon Tetrachloride, 2,5-Dimethylfuran are all independent associated factors of cotinine. That was to say, high concentrations of 1,2-Dichlorobenzene (OR:1.036, CI:1.009–1.124), Benzene (OR:1.478, CI:1.036–2.292), Carbon Tetrachloride (OR:1.576, CI:1.275–2.085) and 2,5-Dimethylfuran (OR:1.091, CI:1.030–1.157) in blood might be independent risk factors leading to the increase of serum metabolite cotinine in smokers. Similarly, in 10,538 non-smokers, the results of the multivariable binary logistic regression model in Table 2 showed that high cotinine (≥ 0.02 ng/mL) group had higher levels of 1,2-Dichlorobenzene (OR:1.135, CI:1.079–1.193) and Carbon Tetrachloride (OR:1.493, CI:1.209–1.846), which might also be independent risk factors leading to the increase of serum metabolite cotinine in non-smokers.

Table 1 Association between serum VOCs and cotinine levels in 3114 smokers.
Table 2 Association between serum VOCs and cotinine levels in 10,538 non-smokers.

Discussion

The harm of smoking to human health was well known, but few studies had mentioned the harmful effects of the synergistic factors (specifically VOCs in this study). Even though the studies of Pauwels C and others only studied the association between VOCs and smoking, they did not emphasize interactions or causal relationship between them20,28,29. Cotinine might be used as a biomarker of tobacco exposure for smokers because it was a main metabolite of nicotine (a compound whose concentration in the human body was produced in a high percentage by exposure to tobacco) and could be quantified in the body up to 72 h after the last cigarette30. However, Castellanos M believed that VOCs often accompanied by smoking were also the focus of studies, and could even be used as a biomarker of tobacco exposure19. It might be the reason that a large number of VOCs were produced in the process of smoking and competing with nicotine for cytochrome P450, which affected the metabolic efficiency of nicotine in the body, leading to smokers' addiction to tobacco and difficulty in quitting smoking31,32.

In this study, all subjects were divided into different groups (“People living with HBV, HCV, or HIV” and “People living without HBV, HCV, and HIV”), and the results showed that serum cotinine of smokers living with HBV, HCV, or HIV was the highest (Supplementary Table S1 and Fig. 2). In addition, we also found that all infected people had more and more dangerous sociological characteristics than non-infected people, which could explain that the distribution level of cotinine was actually affected by different socio-demographic characteristics. For example, compared with the HIV&HBV&HCV non-infected group, people living with HIV|HBV|HCV were more likely to be older, males, non-Hispanic Black, lower educational level, lower economic level, depression, drug users, current smokers. Consistently, one study also suggested that age, gender and race might affect the metabolism of nicotine and cotinine33. Therefore, this study adjusted above potential confounding factors, so as to obtain more accurate and reliable analysis results.

We also found a strong correlation between serum cotinine and eight VOCs (1,2-Dichlorobenzene, 1,2-Dichloroethane, Chlorobenzene, Carbon Tetrachloride, Methylene Chloride, Trichloroethene, 1,1,1-Trichloroethane, Nitrobenzene) in smokers infected with HIV|HBV|HCV. Studies showed that the smoking rate and tobacco exposure dose of this population were much higher than that of the general population22,23,34,35, so there were higher concentrations of cotinine and VOCs in the serum. Since the tobacco exposure of smokers infected with HIV|HBV|HCV was much higher than that of the general population, it could be inferred that the harm of serum cotinine and VOCs in the population would increase with the increase of tobacco exposure level. Our study also indicated a more obvious relationship between serum cotinine and VOCs in the population exposed to higher exposure of tobacco than lower the figure for tobacco. Therefore, we speculated that there might be a similar regularity in the non-infected population with the same unhealthy behavior (especially smoking in this study). In addition, there could be a possibility that the smoking cessation measures applied to the smokers of infected persons might also be extended to the non-infected smokers, which might indirectly provide new ideas for solving the public health problem of smoking. However, it requires more prospective multicenter cohort studies to prove whether this is possible in the future.

Our research also had some limitations. First of all, although the smokers among HIV|HBV|HCV infected people whose smoking rate was higher than that of the general population were integrated into the analysis, the sample size was still small. However, our study was consistent with other research results. For example, there was a significant positive correlation between serum cotinine and VOCs among smokers18. Secondly, the clear causal relationship between serum VOCs and cotinine needs to be further explored, because our study only found that the association between them in the body, and the relevant mechanism of serum cotinine level co-affecting by the external environment or other biochemical factors in the body might be more complicated than we expected. Finally, although NHANES had accurately measured the internal exposure dose of smoking, that exposure dose was far lower than the external exposure dose of subjects, so the existing data results might be underestimated.

Conclusions

Among smokers living with HIV|HBV|HCV, the serum cotinine was the highest, and the association between serum VOCs and cotinine was also very significant. Therefore, there might be a dose–response relationship between the impact intensity of four VOCs (such as 1,2-Dichlorobenzene, Benzene, Carbon Tetrachloride, 2,5-Dimethylfuran) on serum cotinine metabolism, that is, the higher the tobacco exposure, the stronger the impact intensity. In addition, this study used molecular epidemiology to indirectly provide new ideas and perspectives on smoking hazards for smokers.