Skip to main content
Log in

Handling of derived imbalanced dataset using XGBoost for identification of pulmonary embolism—a non-cardiac cause of cardiac arrest

  • Original Article
  • Published:
Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Abstract

Relationship between pulmonary embolism and heart failure is presented in this paper. The proposed research is divided into two phases. The first phase includes the establishment of a novel database with the help of a Cleveland’s database for cardiology in order to establish a link between pulmonary embolism and heart failure. The connectivity is based on the relationship between the stroke volume and the pulse pressure (Pp < 25% (ap_hi)). The second phase includes the applicability of machine learning on the novel database. Novel database formed in this work is imbalanced, resulting in the overfitting problem. XGBoost has been used to get rid of overfitting problem. Efficiency has been increased by formulating an ensemble technique by combining extreme learning machines, IB3 tree, logistic regression, and averaged neural network (avNNet) models.

Graphical abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Abbreviations

ap_hi :

Systolic blood pressure

ap_lo :

Diastolic blood pressure

Pp :

Pulse pressure

sv :

Stroke volume

pe :

Pulmonary embolism

HF:

Heart Failure

References

  1. Chayakrit K, Zhang H (2017) Artificial intelligence in precision cardiovascular medicine in J. of American College of Cardiology

  2. Shrestha S, Sengupta PP (2018) Machine learning for nuclear cardiology: the way forward

  3. Shashikant R, Chetankumar P (2019) Predictive model of cardiac arrest in smokers using machine learning technique based on Heart Rate Variability parameter. In: J. of applied computing and informatics

  4. Alizadehsani R, Habib J, Javad M, Hosseini Mashayekhi H, Boghrati R (2013) A data mining approach for diagnosis of coronary artery disease

  5. Hinton G (2018) Deep learning: a technology with the potential to transform health care. JAMA

  6. Cowger J, McLaughlin V (2018) Acute right ventricular failure in the setting of acute pulmonary embolism or chronic pulmonary hypertension. In: Bentham Science Publication

  7. Ebrahim LA (2018) Cardiac arrest due to pulmonary embolism, Science Direct - Indian Heart Journal

  8. Bizopoulos P, Koutsouris D (2019) Deep learning in cardiology, IEEE Review

  9. Kim J, Kang U, Lee Y (2017) Statistics and deep belief network based cardiovascular risk prediction. In: Healthcare informatics research, vol. 23, no. 3, pp. 169–175

  10. Cano-Espinosa C, Cazorla M, Gonzalez G (2020) Computed aided detection of pulmonary embolism using multi-slice multi-axial segmentation, MDPI

  11. Singh S, Pandey S, Pawar U, Janghel RR (2018) Classification of ECG arrhythmia using recurrent neural networks, Science Direct

  12. Rucco M, Sousa-Rodrigues D, Merelli E, Johnson JH (2015) A neural hyper network approach for pulmonary embolism diagnosis. BMC Res Notes 8(1):617

    Article  Google Scholar 

  13. Agharezaei L, Agharezaei Z, Nemati A, Bahaadinbeigy K, Keynia F, Baneshi MR (2016) The prediction of the risk level of pulmonary embolism & deep venus thrombosis through artificial neural network. Acta Information Med 24(5):354–359

    Article  Google Scholar 

  14. Chen MC, Ball RL, Yang L, Moradzadeh N, Chapman BE, Larson DB, Langlotz CP, Amrhein TJ, Lungren MP (2017) Deep learning to classify radiology free-text reports. Radiology 286(3):845–852

    Article  Google Scholar 

  15. Liu W, Liu M (2020) Evaluation of acute pulmonary embolism & clot burden on CTPA with deep learning. In: Imaging informatics & artificial intelligence- Springer

  16. Jardin R, Martin Faivre (2020) Machine learning & deep neural network application in thorax. In Journal of Thoracic Imaging

  17. Kannan R, Vasanthi V (2018) Machine learning algorithms with ROC curve for predicting & diagnosing the heart disease. In: Springer Briefs in Applied Science and Technology

  18. Atallah R, Mousa A (2019) Heart disease detection using machine learning majority voting ensemble method. In: IEEE

  19. Krishnani D, Kumari Dewangan A (2019) Prediction of coronary heart disease using supervised machine learning algorithm, IEEE

  20. Ali L, Khan SU (2019) Early detection of heart failure by reducing the time complexity of machine learning based predictive model. In 1st International Conference on Electronics & Computer Engineering

  21. Ashier Zhou S, Yongjian L (2019) An intelligent learning system based on random search algorithm & optimized random forest model for improving heart disease detection. In IEEE Explore

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naira Firdous.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Firdous, N., Bhardwaj, S. Handling of derived imbalanced dataset using XGBoost for identification of pulmonary embolism—a non-cardiac cause of cardiac arrest. Med Biol Eng Comput 60, 551–558 (2022). https://doi.org/10.1007/s11517-021-02455-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11517-021-02455-2

Keywords

Navigation