Skip to main content

Leveraging Supervised Machine Learning for Determining the Link between Suboptimal Health Status and the Prognosis of Chronic Diseases

  • Chapter
  • First Online:
All Around Suboptimal Health

Abstract

Identification of people at risk of cardiometabolic diseases is a major clinical need. Such individuals can benefit from tailored treatments that can potentially reduce their risk, while bringing precision to predictive, preventive and personalised medicine (PPPM). Central to preventive medicine is the new concept of suboptimal health status (SHS), which captures individuals with subclinical conditions across five subscales using a 25-item psychometric-like instrument (SHSQ-25). Each of the subscales represents aspects of a person’s health status that could be explored in a disease continuum. Machine learning (ML) lends itself as a feasible approach to explore SHSQ-25 screening data. It can be used to transform such data into clinically useful information while enabling better health planning, disease forecasting and characterisation of disease risk. ML methods can analyse and interrogate the data in a manner not previously possible with conventional statistical methods. ML algorithms can offer robust and more streamlined means of predicting diseases, identifying individuals with the greatest clinical need and earmarking them for treatment. However, its potential to reveal suboptimal health and cardiometabolic diseases is yet to be explored. This chapter provides an overview of supervised learning, a subset of ML and how it can be applied in subclinical disease prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbreviations

AI:

Artificial intelligence

ANN:

Artificial neural network

BMI:

Body mass index

DT:

Decision tree

HbA1c:

Glycated haemoglobin

KNN:

k-neural neighbour

ML:

Machine learning

NB:

NaĂŻve Bayes

PCA:

Principal component analysis

PPPM:

Predictive, preventive and personalized medicine

RF:

Random Forest

ROC:

Receiver operating characteristic

SHS:

Suboptimal Health Status

SHSQ-25:

Suboptimal Health Status Questionnaire-25

SVM:

Support vector machine

T2DM:

Type II diabetes mellitus

TC:

Total cholesterol

References

  1. Russell A, Adua E, Ugrina I, Laws S, Wang W (2018) Unravelling immunoglobulin G Fc N-glycosylation: a dynamic marker potentiating predictive, preventive and personalised medicine. Int J Mol Sci 19(2):390

    Google Scholar 

  2. Wang W, Yan Y (2012) Suboptimal health: a new health dimension for translational medicine. Clin Transl Med 1(1):e28

    Google Scholar 

  3. Adua E, Roberts P, Wang W (2017) Incorporation of suboptimal health status as a potential risk assessment for type II diabetes mellitus: a case-control study in a Ghanaian population. EPMA J 8(4):345–355

    Article  PubMed  PubMed Central  Google Scholar 

  4. Adua E, Afrifa-Yamoah E, Frimpong K, Adama E, Karthigesu SP, Anto EO, Aboagye E, Yan Y, Wang Y, Tan X, Wang W (2021) Construct validity of the suboptimal health status questionnaire-25 in a Ghanaian population. Health Qual Life Outcomes 19, 180

    Google Scholar 

  5. Yan YX, Dong J, Liu YQ, Yang XH, Li M, Shia G, Wang W (2012) Association of suboptimal health status and cardiovascular risk factors in urban Chinese workers. J Urban Health 89(2):329–338

    Google Scholar 

  6. Adua E, Memarian E, Russell A, Trbojević-Akmačić I, Gudelj I, Jurić J, Roberts P, Lauc G, Wang W (2019) Utilization of Nglycosylation profiles as risk stratification biomarkers for suboptimal health status and metabolic syndrome in a Ghanaian population. Biomark Med 13(15):1273–1287

    Google Scholar 

  7. Adua E, Roberts P, Sakyi SA, Yeboah FA, Dompreh A, Frimpong K, Anto EO, Wang W (2017) Profiling of cardio-metabolic risk factors and medication utilisation among type II diabetes patients in Ghana: a prospective cohort study. Clin Transl Med 6(1):32

    Google Scholar 

  8. Anto EO, Roberts P, Coall D, Turpin CA, Adua E, Wang Y, Wang W (2019) Integration of suboptimal health status evaluation as a criterion for prediction of preeclampsia is strongly recommended for healthcare management in pregnancy: a prospective cohort study in a Ghanaian population. EPMA J 10(3):211–226

    Google Scholar 

  9. Kupaev V, Borisov O, Marutina E, Yan YX, Wang W (2016) Integration of suboptimal health status and endothelial dysfunction as a new aspect for risk evaluation of cardiovascular disease. EPMA J 7(1):1–7

    Google Scholar 

  10. Mogaka JJ, James SE, Chimbari MJ (2020) Leveraging implementation science to improve implementation outcomes in precision medicine. Am J Transl Res 12(9):4853–4872

    PubMed  PubMed Central  Google Scholar 

  11. Wang Y, Liu X, Qiu J, Wang H, Liu D, Zhao Z, Song M, Song Q, Wang X, Zhou Y, Wang W (2017) Association between ideal cardiovascular health metrics and suboptimal health status in Chinese population. Sci Rep 7(1):14975

    Google Scholar 

  12. Zhu J, Ying W, Zhang L, Peng G, Chen W, Anto EO, Wang X, Lu N, Gao S, Wu G, Yan J, Ye J, Wu S, Yu C, Yue M, Huang X, Xu N, Ying P, Chen Y, Tan X, Wang W (2020) Psychological symptoms in Chinese nurses may be associated with predisposition to chronic disease: a cross-sectional study of suboptimal health status. EPMA J 11(4):551–563

    Google Scholar 

  13. Hou H, Feng X, Li Y, Meng Z, Guo D, Wang F, Guo Z, Zheng Y, Peng Z, Zhang W, Li D, Ding G, Wang W (2018) Suboptimal 1,2 3 4 health status and psychological symptoms among Chinese college students: a perspective of predictive, preventive and personalised health. EPMA J 9(4):367–377

    Google Scholar 

  14. Anto EO, Roberts P, Coall DA, Adua E, Turpin CA, Tawiah A, Wang Y, Wang W (2020) Suboptimal health pregnant women are associated with increased oxidative stress and unbalanced pro-and antiangiogenic growth mediators: a cross-sectional study in a Ghanaian population. Free Radic Res 54(1):27–42

    Google Scholar 

  15. Sun Q, Xu X, Zhang J, Sun M, Tian Q, Li Q, Cao W, Zhang X, Wang H, Liu J, Zhang J, Meng X, Wu L, Song M, Liu H, Wang W, Wang Y (2020) Association of suboptimal health status with intestinal microbiota in Chinese youths. J Cell Mol Med 24(2):1837–1847

    Google Scholar 

  16. Libbrecht MW, Noble WS (2015) Machine learning applications in genetics and genomics. Nat Rev Genet 16(6):321–332

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Ge S, Xu X, Zhang J, Hou H, Wang H, Liu D, Zhang X, Song M, Li D, Zhou Y, Wang Y, Wang W (2019) Suboptimal health status as an independent risk factor for type 2 diabetes mellitus in a community-based cohort: the China suboptimal health cohort study. EPMA J 10(1):65–72

    Google Scholar 

  18. Yan Y-X, Dong J, Liu YQ, Zhang J, Song MS, He Y, Wang W (2015) Association of suboptimal health status with psychosocial stress, plasma cortisol and mRNA expression of glucocorticoid receptor α/β in lymphocyte. Stress 18(1):29–34

    Google Scholar 

  19. Leclercq M, Vittrant B, Martin-Magniette ML, Scott Boyer MP, Perin O, Bergeron A, Fradet Y and Droit A (2019) Large-scale automatic feature selection for biomarker discovery in high-dimensional OMICs data. Front Genet 10:452

    Google Scholar 

  20. Wang Y, Adua E, Russell AC, Roberts P, Ge S, Zeng Q, Wang W (2016) Glycomics and its application potential in precision medicine. American Association for the Advancement of Science

    Google Scholar 

  21. Adua E, Memarian E, Russell A, Trbojević-Akmačić I, Gudelj I, Jurić J, Roberts P, Lauc G, Wang W (2019) High throughput profiling of whole plasma N-glycans in type II diabetes mellitus patients and healthy individuals: a perspective from a Ghanaian population. Arch Biochem Biophys 661:10–21

    Google Scholar 

  22. Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E (2015) Deep learning applications and challenges in big data analytics. J Big Data 2(1):1–21

    Google Scholar 

  23. Adua E, Kolog EA, Afrifa-Yamoah E, Amankwah B, Obirikorang C, Anto EO, Acheampong E, Wang W, Tetteh AY (2021) Predictive model and feature importance for early detection of type II diabetes mellitus. Transl Med Commun 6(1):17

    Google Scholar 

  24. Buskirk TD, Kirchner A, Eck A, Signorino CS (2018) An introduction to machine learning methods for survey researchers. Surv Pract 11(1):1–10

    Google Scholar 

  25. Kuhn M, Johnson K (2013) Applied predictive modeling in RR. Springer, New York

    Book  Google Scholar 

  26. Machado G, Mendoza MR, Corbellini LG (2015) What variables are important in predicting bovine viral diarrhea virus? A random forest approach. Vet Res 46(1):1–15

    Article  Google Scholar 

  27. Yu C-S, Lin YJ, Lin CH, Wang ST, Lin SY, Lin SH, Wu JL, Chang SS (2020) Predicting metabolic syndrome with machine learning models using a decision tree algorithm: retrospective cohort study. JMIR Med Inform 8(3):e17110

    Google Scholar 

  28. Krittanawong C, Virk HUH, Bangalore S, Wang Z, Johnson KW, Pinotti R, Zhang H, Kaplin S, Narasimhan B, Kitai T, Baber U, Halperin JL, Tang WHW (2020) Machine learning prediction in cardiovascular diseases: a meta-analysis. Sci Rep 10(1):1–11

    Google Scholar 

  29. Zhang L, Wang Y, Niu M, Wang C, Wang Z (2020) Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan rural cohort study. Sci Rep 10(1):1–10

    Google Scholar 

  30. Kolog EA, Montero CS, Toivonen T (2018) Using machine learning for sentiment and social influence analysis in text. In International Conference on Information Technology & Systems. Springer

    Google Scholar 

  31. Chiu M-H, Yu Y-R, Liaw HL, Chun-Hao L (2016) The use of facial micro-expression state and Tree-Forest Model for predicting conceptual-conflict based conceptual change. In Jari Lavonen, Kalle Juuti, Jarkko Lampiselkä, Anna Uitto & Kaisa Hahl (Eds.). Science Education Research: Engaging learners for a sustainable future (ESERA eproceeding, ISBN 978-951-51-1541-6)

    Google Scholar 

  32. Tang Z, Bo L, Liu X, Wei D (2021) An autoencoder with adaptive transfer learning for intelligent fault diagnosis of rotating machinery. Meas Sci Technol 32(5):055110

    Google Scholar 

  33. Choe EK, Rhee H, Lee S, Shin E, Oh SW, Lee JE, Choi SH (2018) Metabolic syndrome prediction using machine learning models with genetic and clinical information from a nonobese healthy population. Genomics Inform 16(4):e31

    Google Scholar 

  34. Webb GI, Keogh E, Miikkulainen R (2010) Naïve Bayes. Encyclopedia Mach Learn 15:713–714

    Google Scholar 

  35. Bishop CM (2006) Pattern recognition. Mach Learn 128:9

    Google Scholar 

  36. Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185

    Google Scholar 

  37. Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I (2017) Machine learning and data mining methods in diabetes research. Comput Struct Biotechnol J 15:104–116

    Google Scholar 

  38. Zhang Z, Beck MW, Winkler DA, Huang B, Sibanda W, Goyal H; written on behalf of AME Big-Data (2018) Opening the black box of neural networks: methods for interpreting neural network models in clinical applications. Ann Transl Med 6(11):216

    Google Scholar 

  39. Mehdy MM, Ng PY, Shair EF, Saleh NIM, Gomes C (2017) Artificial neural networks in image processing for early detection of breast cancer. Comput Math Methods Med 2017:1–15

    Google Scholar 

  40. Du Vaure CB, Dechartres A, Battin C, Ravaud P, Boutron I (2016) Exclusion of patients with concomitant chronic conditions in ongoing randomized controlled trials targeting 10 common chronic conditions and registered at ClinicalTrials.gov: a systematic review of registration details. BMJ Open 6(9):e012265

    Google Scholar 

  41. Afrifa-Yamoah E, Adua E, Anto EO, Peprah-Yamoah E, Opoku-Yamoah V, Aboagye E, Hashmi R. (2023) Conceptualised psychomedical footprint for health status outcomes and the potential impacts for early detection and prevention of chronic diseases in the context of 3P medicine. EPMA J. 10.1007/s13167-023-00344-2

    Google Scholar 

  42. Roumen C, Corpeleijn E, Feskens EJ, Mensink M, Saris WH, Blaak EE et al (2008) Impact of 3-year lifestyle intervention on postprandial glucose metabolism: the SLIM study. Diabet Med 25(5):597–605

    Google Scholar 

  43. Vlaar EM, Nierkens V, Nicolaou M, Middelkoop BJC, Busschers WB, Stronks K, van Valkengoed IGM (2017) Effectiveness of a targeted lifestyle intervention in primary care on diet and physical activity among south Asians at risk for diabetes: 2-year results of a randomised controlled trial in The Netherlands. BMJ Open 7(6):e012221

    Google Scholar 

  44. Sarker IH (2021) Machine learning: algorithms, real-world applications and research directions. SN Comput Sci 2(3):1–21

    Article  Google Scholar 

  45. Weng SF, Reps J, Kai J, Garibaldi JM, Qureshi N (2017) Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One 12(4):e0174944

    Google Scholar 

  46. Zhao J, Feng Q, Wu P, Lupu RA, Wilke RA, Wells QS, Denny JC, Wei WQ (2019) Learning from longitudinal data in electronic health record and genetic data to improve cardiovascular event prediction. Sci Rep 9(1):717

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eric Adua .

Editor information

Editors and Affiliations

Ethics declarations

EA1,2, EA-Y3 and EA4 conceived the study and wrote the manuscript. All authors edited the ideas and concepts presented. All authors read and approved the final manuscript.

Competing Interests

Authors have no competing interest to declare.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Adua, E., Afrifa-Yamoah, E., Kolog, E.A. (2024). Leveraging Supervised Machine Learning for Determining the Link between Suboptimal Health Status and the Prognosis of Chronic Diseases. In: Wang, W. (eds) All Around Suboptimal Health . Advances in Predictive, Preventive and Personalised Medicine, vol 18. Springer, Cham. https://doi.org/10.1007/978-3-031-46891-9_9

Download citation

Publish with us

Policies and ethics