Skip to main content

Advertisement

Log in

Building a predictive model for hypertension related to environmental chemicals using machine learning

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

Hypertension is a chronic cardiovascular disease characterized by elevated blood pressure that can lead to a number of complications. There is evidence that the numerous environmental substances to which humans are exposed facilitate the emergence of diseases. In this work, we sought to investigate the relationship between exposure to environmental contaminants and hypertension as well as the predictive value of such exposures. The National Health and Nutrition Survey (NHANES) provided us with the information we needed (2005–2012). A total of 4492 participants were included in our study, and we incorporated more common environmental chemicals and covariates by feature selection followed by regularized network analysis. Then, we applied various machine learning (ML) methods, such as extreme gradient boosting (XGBoost), random forest classifier (RF), logistic regression (LR), multilayer perceptron (MLP), and support vector machine (SVM), to predict hypertension by chemical exposure. Finally, SHapley Additive exPlanations (SHAP) were further applied to interpret the features. After the initial feature screening, we included a total of 29 variables (including 21 chemicals) for ML. The areas under the curve (AUCs) of the five ML models XGBoost, RF, LR, MLP, and SVM were 0.729, 0.723, 0.721, 0.730, and 0.731, respectively. Butylparaben (BUP), propylparaben (PPB), and 9-hydroxyfluorene (P17) were the three factors in the prediction model with the highest SHAP values. Comparing five ML models, we found that environmental exposure may play an important role in hypertension. The assessment of important chemical exposure parameters lays the groundwork for more targeted therapies, and the optimized ML models are likely to predict hypertension.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data Availability

The datasets analyzed during the current study are available in the NHANES repository (https://wwwn.cdc.gov/nchs/nhanes/Default.aspx).

Abbreviations

AUC :

Area under the curve

BMI :

Body mass index

BP3 :

Benzophenone-3

BPA :

Bisphenol A

BUP :

Butylparaben

Cd :

Cadmium

COT :

Cotinine

DBP :

Diastolic blood pressure

EPB :

Ethylparaben

Hg :

Mercury

LOD :

The limit of detection

LR :

Logistic regression

MBzP :

Monobenzyl phthalate

MCNP :

Mono-carboxynonyl phthalate

MCOP :

Mono-carboxyoctyl phthalate

MCPP :

Mono-(3-carboxypropyl) phthalate

MECPP :

Mono-2-ethyl-5-carboxypentyl phthalate

MEHHP :

Mono-(2-ethl-5-hydrxhxyl) phthalate

MEHP :

Mono-(2-ethyl)-hexyl phthalate

MEOHP :

Mono-(2-ethyl-5-oxohexyl) phthalate

MEP :

Monoethyl phthalate

MiBP :

Monoisobutyl phthalate

MiNP :

Monoisononyl phthalate

ML :

Machine learning

MLP :

Multilayer perceptron

MMP :

Mono-n-methyl phthalate

MnBP :

Phthalates mono-n-butyl phthalate

MPB :

Methylparaben

MVPA :

Moderate-to-vigorous physical activity

NCHS :

The National Center for Health Statistics

NHANES :

The National Health and Nutrition Survey

P01 :

1-Hydroxynaphthalene

P02 :

2-Hydroxynaphthalene

P03 :

3-Hydroxyfluorene

P04 :

2-Hydroxyfluorene

P05 :

3-Hydroxyphenanthrene

P06 :

1-Hydroxyphenanthrene

P07 :

2-Hydroxyphenanthrene

P10 :

1-Hydroxypyrene

P17 :

9-Hydroxyfluorene

PAH :

Polyaromatic hydrocarbons

Pb :

Lead

PIR :

Poverty income ratio

PPB :

Propylparaben

RF :

Random forest classifier

SBP :

Systolic blood pressure

SHAP :

SHapley Additive exPlanations

SVM :

Support vector machine

TCS :

Triclosan

XGBoost :

Extreme gradient boosting

2,4-DCP :

2,4-Dichlorophenol

2,5-DCP :

2,5-Dichlorophenol

References

  • Aramjoo H, Arab-Zozani M, Feyzi A, Naghizadeh A, Aschner M, Naimabadi A, Farkhondeh T, Samarghandian S (2022) The association between environmental cadmium exposure, blood pressure, and hypertension: a systematic review and meta-analysis. Environ Sci Pollut Res Int 29:35682–35706

    Article  CAS  Google Scholar 

  • Bae S, Samuels JA, Flynn JT, Mitsnefes MM, Furth SL, Warady BA, Ng DK (2022) Machine learning-based prediction of masked hypertension among children with chronic kidney disease. Hypertension 79:2105–2113

    Article  CAS  Google Scholar 

  • Carnethon MR, Evans NS, Church TS, Lewis CE, Schreiner PJ, Jacobs DR Jr, Sternfeld B, Sidney S (2010) Joint associations of physical activity and aerobic fitness on the development of incident hypertension: coronary artery risk development in young adults. Hypertension 56:49–55

    Article  CAS  Google Scholar 

  • Chen T, Guestrin C (2016) XGBoost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining 2016:785–794

  • Chowdhury MZI, Leung AA, Sikdar KC, O’Beirne M, Quan H, Turin TC (2022) Development and validation of a hypertension risk prediction model and construction of a risk score in a Canadian population. Sci Rep 12:12780

    Article  CAS  Google Scholar 

  • Desvergne B, Feige JN, Casals-Casas C (2009) PPAR-mediated activity of phthalates: a link to the obesity epidemic? Mol Cell Endocrinol 304:43–48

    Article  CAS  Google Scholar 

  • Forman JP, Stampfer MJ, Curhan GC (2009) Diet and lifestyle risk factors associated with incident hypertension in women. JAMA 302:401–411

    Article  CAS  Google Scholar 

  • Garner RE, Levallois P (2017) Associations between cadmium levels in blood and urine, blood pressure and hypertension among Canadian adults. Environ Res 155:64–72

    Article  CAS  Google Scholar 

  • Houston MC (2007) The role of mercury and cadmium heavy metals in vascular disease, hypertension, coronary heart disease, and myocardial infarction. Altern Ther Health Med 13:S128–S133

    Google Scholar 

  • Hung MH, Shih LC, Wang YC, Leu HB, Huang PH, Wu TC, Lin SJ, Pan WH, Chen JW, Huang CC (2021) Prediction of masked hypertension and masked uncontrolled hypertension using machine learning. Front Cardiovasc Med 8:778306

    Article  CAS  Google Scholar 

  • Jacobs L, Buczynska A, Walgraeve C, Delcloo A, Potgieter-Vermaak S, Van Grieken R, Demeestere K, Dewulf J, Van Langenhove H, De Backer H, Nemery B, Nawrot TS (2012) Acute changes in pulse pressure in relation to constituents of particulate air pollution in elderly persons. Environ Res 117:60–67

    Article  CAS  Google Scholar 

  • Jeong YW, Jung Y, Jeong H, Huh JH, Sung KC, Shin JH, Kim HC, Kim JY, Kang DR (2022) Prediction model for hypertension and diabetes mellitus using Korean public health examination data (2002–2017). Diagnostics 12:1967

    Article  Google Scholar 

  • Kaur S, Garg N, Rubal R, Dhiman M (2022) Correlative study on heavy metal-induced oxidative stress and hypertension among the rural population of Malwa Region of Punjab, India. Environ Sci Pollut Res Int 29:90948–90963

    Article  CAS  Google Scholar 

  • Lawlor DA, Nordestgaard BG, Benn M, Zuccolo L, Tybjaerg-Hansen A, Davey Smith G (2013) Exploring causal associations between alcohol and coronary heart disease risk factors: findings from a Mendelian randomization study in the Copenhagen General Population Study. Eur Heart J 34:2519–2528

    Article  CAS  Google Scholar 

  • Leung AA, Bushnik T, Hennessy D, McAlister FA, Manuel DG (2019) Risk factors for hypertension in Canada. Health Rep 30:3–13

    Google Scholar 

  • Lim SS, Vos T, Flaxman AD, Danaei G, Shibuya K, Adair-Rohani H, Amann M, Anderson HR, Andrews KG, Aryee M, Atkinson C, Bacchus LJ, Bahalim AN, Balakrishnan K, Balmes J, Barker-Collo S, Baxter A, Bell ML, Blore JD, Blyth F, Bonner C, Borges G, Bourne R, Boussinesq M, Brauer M, Brooks P, Bruce NG, Brunekreef B, Bryan-Hancock C, Bucello C, Buchbinder R, Bull F, Burnett RT, Byers TE, Calabria B, Carapetis J, Carnahan E, Chafe Z, Charlson F, Chen H, Chen JS, Cheng AT, Child JC, Cohen A, Colson KE, Cowie BC, Darby S, Darling S, Davis A, Degenhardt L, Dentener F, Des Jarlais DC, Devries K, Dherani M, Ding EL, Dorsey ER, Driscoll T, Edmond K, Ali SE, Engell RE, Erwin PJ, Fahimi S, Falder G, Farzadfar F, Ferrari A, Finucane MM, Flaxman S, Fowkes FG, Freedman G, Freeman MK, Gakidou E, Ghosh S, Giovannucci E, Gmel G, Graham K, Grainger R, Grant B, Gunnell D, Gutierrez HR, Hall W, Hoek HW, Hogan A, Hosgood HD 3rd, Hoy D, Hu H, Hubbell BJ, Hutchings SJ, Ibeanusi SE, Jacklyn GL, Jasrasaria R, Jonas JB, Kan H, Kanis JA, Kassebaum N, Kawakami N, Khang YH, Khatibzadeh S, Khoo JP, Kok C, Laden F et al (2012) A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380:2224–2260

    Article  Google Scholar 

  • Lu L, Ni R (2023) Association between polycyclic aromatic hydrocarbon exposure and hypertension among the U.S. adults in the NHANES 2003–2016: A cross-sectional study. Environ Res 217:114907

    Article  CAS  Google Scholar 

  • Lundberg S, Lee SI (2017) A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems 2017:4768–4777

  • Mateos A, Dopazo J, Jansen R, Tu Y, Gerstein M, Stolovitzky G (2002) Systematic learning of gene functional classes from DNA array expression data by using multilayer perceptrons. Genome Res 12:1703

    Article  CAS  Google Scholar 

  • Miao H, Liu Y, Tsai TC, Schwartz J, Ji JS (2020) Association between blood lead level and uncontrolled hypertension in the US population (NHANES 1999–2016). J Am Heart Assoc 9:e015533

    Article  Google Scholar 

  • Mills KT, Bundy JD, Kelly TN, Reed JE, Kearney PM, Reynolds K, Chen J, He J (2016) Global disparities of hypertension prevalence and control: a systematic analysis of population-based studies from 90 countries. Circulation 134:441–450

    Article  Google Scholar 

  • Mills KT, Stefanescu A, He J (2020) The global epidemiology of hypertension. Nat Rev Nephrol 16:223–237

    Article  CAS  Google Scholar 

  • Nash D, Magder L, Lustberg M, Sherwin RW, Rubin RJ, Kaufmann RB, Silbergeld EK (2003) Blood lead, blood pressure, and hypertension in perimenopausal and postmenopausal women. JAMA 289:1523–1532

    Article  CAS  Google Scholar 

  • Rahman HH, Niemann D, Munson-McGee SH (2022) Environmental exposure to metals and the risk of high blood pressure: a cross-sectional study from NHANES 2015–2016. Environ Sci Pollut Res Int 29:531–542

    Article  CAS  Google Scholar 

  • Sonne-Holm S, Sørensen TI, Jensen G, Schnohr P (1989) Independent effects of weight change and attained body weight on prevalence of arterial hypertension in obese and non-obese men. BMJ 299:767–770

    Article  CAS  Google Scholar 

  • Steyerberg EW, Eijkemans MJ, Harrell FE Jr, Habbema JD (2000) Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small data sets. Stat Med 19:1059–1079

    Article  CAS  Google Scholar 

  • Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9:293–300

    Article  Google Scholar 

  • Tran NK, Albahra S, May L, Waldman S, Crabtree S, Bainbridge S, Rashidi H (2021) Evolving applications of artificial intelligence and machine learning in infectious diseases testing. Clin Chem 68:125–133

    Article  Google Scholar 

  • Trasande L, Sathyanarayana S, Spanier AJ, Trachtman H, Attina TM, Urbina EM (2013) Urinary phthalates are associated with higher blood pressure in childhood. J Pediatr 163:747–53.e1

    Article  CAS  Google Scholar 

  • Trasande L, Zoeller RT, Hass U, Kortenkamp A, Grandjean P, Myers JP, DiGangi J, Bellanger M, Hauser R, Legler J, Skakkebaek NE, Heindel JJ (2015) Estimating burden and disease costs of exposure to endocrine-disrupting chemicals in the European union. J Clin Endocrinol Metab 100:1245–1255

    Article  CAS  Google Scholar 

  • Vaziri ND (2008) Mechanisms of lead-induced hypertension and cardiovascular disease. Am J Physiol Heart Circ Physiol 295:H454–H465

    Article  CAS  Google Scholar 

  • Wang NY, Young JH, Meoni LA, Ford DE, Erlinger TP, Klag MJ (2008) Blood pressure change and risk of hypertension associated with parental hypertension: the Johns Hopkins Precursors Study. Arch Intern Med 168:643–648

    Article  Google Scholar 

  • Wang F, Wang Y, Wang Y, Jia T, Chang L, Ding J, Zhou L (2022) Urinary polycyclic aromatic hydrocarbon metabolites were associated with hypertension in US adults: data from NHANES 2009–2016. Environ Sci Pollut Res Int 29:80491–80501

    Article  CAS  Google Scholar 

  • Whelton PK, Carey RM, Aronow WS, Casey DE Jr, Collins KJ, Dennison Himmelfarb C, DePalma SM, Gidding S, Jamerson KA, Jones DW, MacLaughlin EJ, Muntner P, Ovbiagele B, Smith SC Jr, Spencer CC, Stafford RS, Taler SJ, Thomas RJ, Williams KA Sr, Williamson JD, Wright JT Jr (2018) 2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the prevention, detection, evaluation, and management of high blood pressure in adults: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. J Am Coll Cardiol 71:e127–e248

    Article  Google Scholar 

  • Wu W, Jiang S, Zhao Q, Zhang K, Wei X, Zhou T, Liu D, Zhou H, Zeng Q, Cheng L, Miao X, Lu Q (2018) Environmental exposure to metals and the risk of hypertension: a cross-sectional study in China. Environ Pollut 233:670–678

    Article  CAS  Google Scholar 

  • Yang S, Taylor D, Yang D, He M, Liu X, Xu J (2021) A synthesis framework using machine learning and spatial bivariate analysis to identify drivers and hotspots of heavy metal pollution of agricultural soils. Environ Pollut 287:117611

  • Zhou S, Lu H, Zhang X, Shi X, Jiang S, Wang L, Lu Q (2022) Paraben exposures and their interactions with ESR1/2 genetic polymorphisms on hypertension. Environ Res 213:113651

    Article  CAS  Google Scholar 

Download references

Acknowledgements

This work was supported by the Heilongjiang Provincial Science and Technology Department (2022ZXJ03C05), the Military Healthcare Program (19BJZ24), and the National Natural Science Foundation of China (82271770).

Author information

Authors and Affiliations

Authors

Contributions

Shanshan Liu: formal analysis and writing—original draft; Lin Lu: methodology and formal analysis; Fei Wang: formal analysis; Bingqing Han: data curation; Lei Ou: writing—review and editing. Xiangyang Gao: data curation; Yi Luo: formal analysis; Wenjing Huo: data curation and methodology; Qiang Zeng: project administration and writing—review and editing.

Corresponding author

Correspondence to Qiang Zeng.

Ethics declarations

Ethical approval

Approval from the ethical board for this study was not required because of the public nature of all the data.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Responsible Editor: Lotfi Aleya

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, S., Lu, L., Wang, F. et al. Building a predictive model for hypertension related to environmental chemicals using machine learning. Environ Sci Pollut Res 31, 4595–4605 (2024). https://doi.org/10.1007/s11356-023-31384-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-023-31384-w

Keywords

Navigation