External Validation of a “Black-Box” Clinical Predictive Model in Nephrology: Can Interpretability Methods Help Illuminate Performance Differences?

da Cruz, Harry F.; Pfahringer, Boris; Schneider, Frederic; Meyer, Alexander; Schapranow, Matthieu-P.

doi:10.1007/978-3-030-21642-9_25

Harry F. da Cruz¹¹,
Boris Pfahringer¹²,
Frederic Schneider¹¹,
Alexander Meyer¹² &
…
Matthieu-P. Schapranow¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11526))

Included in the following conference series:

Conference on Artificial Intelligence in Medicine in Europe

3484 Accesses
2 Citations

Abstract

The number of machine learning clinical prediction models being published is rising, especially as new fields of application are being explored in medicine. Notwithstanding these advances, only few of such models are actually deployed in clinical contexts for a lack of validation studies. In this paper, we present and discuss the validation results of a machine learning model for the prediction of acute kidney injury in cardiac surgery patients when applied to an external cohort of a German research hospital. To help account for the performance differences observed, we utilized interpretability methods which allowed experts to scrutinize model behavior both at the global and local level, making it possible to gain further insights into why it did not behave as expected on the validation cohort. We argue that such methods should be considered by practitioners as a further tool to help explain performance differences and inform model update in validation studies.

A. Meyer and M-P. Schapranow—Share the senior authorship.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Andrew, A.: Git-crypt (2013). https://github.com/AGWA/git-crypt
Che, Z., Purushotham, S., Khemani, R., Liu, Y.: Interpretable deep models for ICU outcome prediction. In: AMIA Symposium 2016, pp. 371–380 (2016)
Google Scholar
Freitas da Cruz, H., Schneider, F., Schapranow, M.P.: Prediction of acute kidney injury in cardiac surgery patients. In: Proceedings of the 12th International Conference on Biomedical Engineering Systems and Technologies, vol. 5, pp. 380–387 (2019)
Google Scholar
Doshi-Velez, F., Kim, B.: Towards A Rigorous Science of Interpretable Machine Learning. arXiv e-prints arXiv:1702.08608, February 2017
Eyck, J.V., et al.: Data mining techniques for predicting acute kidney injury after elective cardiac surgery. Crit. Care 16(Suppl 1), P344 (2012)
Article Google Scholar
Flechet, M., et al.: AKIpredictor, an on-line prognostic calculator for acute kidney injury in adult critically ill patients. Intensive Care Med. 43(6), 764–773 (2017)
Article Google Scholar
Glas, A.S., et al.: The diagnostic odds ratio: a single indicator of test performance. J. Clin. Epidemiol. 56(11), 1129–1135 (2003)
Article Google Scholar
Guidotti, R. et al.: A Survey of Methods for Explaining Black Box Models. arXiv e-prints arXiv:1802.01933, February 2018
Hall, P., Gill, N.: An Introduction to Machine Learning Interpretability. O’Reilly, Boca Raton (2018)
Google Scholar
Johnson, A., et al.: MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035 (2016)
Article Google Scholar
Kate, R.J., et al.: Prediction and detection models for acute kidney injury in hospitalized older adults. BMC Med. Inform. Decis. Mak. 16(1), 39 (2016)
Article Google Scholar
Knöpfel, A., Gröne, B., Tabeling, P.: Fundamental Modeling Concepts: Effective Communication of IT Systems. Wiley, Hoboken (2005)
Google Scholar
Kraskov, A., Stögbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. 69(6), 16 (2004)
MathSciNet Google Scholar
Lee, H.C., et al.: Derivation and validation of machine learning approaches to predict acute kidney injury after cardiac surgery. J. Clin. Med. 7(10), 322 (2018)
Article Google Scholar
Legrand, M., et al.: Incidence, risk factors and prediction of post-operative acute kidney injury following cardiac surgery for active inefective endocarditis: an observational study. Crit. Care 17(5), R220 (2013)
Article Google Scholar
Louppe, G., Wehenkel, L., Sutera, A., Geurts, P.: Understanding variable importances in forests of randomized trees. In: Neural Information Processing Systems, pp. 1–9 (2013)
Google Scholar
Moons, K.G.M., Altman, D.G., Vergouwe, Y., Royston, P.: Prognosis and prognostic research: application and impact of prognostic models in clinical practice. Brit. Med. J. 338, b606 (2009)
Article Google Scholar
Murdoch, W.J., et al.: Interpretable Machine Learning: Definitions, Methods, and Applications. arXiv e-prints arXiv:1901.04592, January 2019
O’Neal, J.B., et al.: Acute kidney injury following cardiac surgery: current understanding and future directions. Crit. Care 20(1), 187 (2016)
Article Google Scholar
Ribeiro, M., Singh, S., Guestrin, C.: "Why should i trust you?": explaining the predictions of any classifier. In: Proceedings of 22nd ACM SIGKDD, pp. 1135–1144, NY, USA (2016)
Google Scholar
Rossum, G.V., Drake, F.L.: Python tutorial. History 42(4), 1–122 (2010)
Google Scholar
Thakar, C.V., et al.: A clinical score to predict acute renal failure after cardiac surgery. J. Am. Soc. Nephrol. 14(8), 2176–7 (2004)
Google Scholar
Thottakkara, P., et al.: Application of machine learning techniques to high-dimensional clinical data to forecast postoperative complications. PLoS ONE 11(5), 1–19 (2016)
Article Google Scholar
Toll, D., Janssen, K., Vergouwe, Y., Moons, K.: Validation, updating and impact of clinical prediction rules: a review. J. Clin. Epidemiol. 61, 1085–1094 (2008)
Article Google Scholar
Wyatt, J.C., Altman, D.G.: Commentary: prognostic models: clinically useful or quickly forgotten? Brit. Med. J. 311(7019), 1539–1541 (1995)
Article Google Scholar

Download references

Acknowledgments

Parts of the given work were generously supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 780495.

Author information

Authors and Affiliations

Digital Health Center, Hasso Plattner Institute, Rudolf-Breitscheid-Straße 187, 14482, Potsdam, Germany
Harry F. da Cruz, Frederic Schneider & Matthieu-P. Schapranow
Department of Cardiothoracic and Vascular Surgery, German Heart Center Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
Boris Pfahringer & Alexander Meyer

Authors

Harry F. da Cruz
View author publications
You can also search for this author in PubMed Google Scholar
Boris Pfahringer
View author publications
You can also search for this author in PubMed Google Scholar
Frederic Schneider
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Matthieu-P. Schapranow
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Harry F. da Cruz .

Editor information

Editors and Affiliations

Universitat Rovira i Virgili, Tarragona, Spain
David Riaño
Poznan University of Technology, Poznan, Poland
Szymon Wilk
VU Amsterdam, Amsterdam, The Netherlands
Annette ten Teije

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

da Cruz, H.F., Pfahringer, B., Schneider, F., Meyer, A., Schapranow, MP. (2019). External Validation of a “Black-Box” Clinical Predictive Model in Nephrology: Can Interpretability Methods Help Illuminate Performance Differences?. In: Riaño, D., Wilk, S., ten Teije, A. (eds) Artificial Intelligence in Medicine. AIME 2019. Lecture Notes in Computer Science(), vol 11526. Springer, Cham. https://doi.org/10.1007/978-3-030-21642-9_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-21642-9_25
Published: 30 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21641-2
Online ISBN: 978-3-030-21642-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics