To Err is (only) Human. Reflections on How to Move from Accuracy to Trust for Medical AI

  • Conference paper
  • In: Exploring Innovation in a Digital World

Abstract

In this paper, we contribute to the deconstruction of the concept of accuracy with respect to machine learning systems that are used in human decision making, and specifically in medicine. We argue that, by taking a socio-technical stance, it is necessary to move from the idea that these systems are “agents that can err” to the idea that they are just tools by which humans can interpret new cases in light of the technologically mediated interpretation of past cases, as if they were wearing a pair of tinted glasses. In this new narrative, accuracy is a meaningless construct, while it is important that beholders can “believe their eyes” (or spectacles), and therefore trust the tool enough to make sensible decisions.


Notes

  1. Available at https://covid19-blood-ml.herokuapp.com/.

  2. This is the acronym for reverse transcriptase-polymerase chain reaction, a laboratory technique for the quantification of viral RNA in research and clinical settings.


Author information

Correspondence to Federico Cabitza.


Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper


Cite this paper

Cabitza, F., Campagner, A., Datteri, E. (2021). To Err is (only) Human. Reflections on How to Move from Accuracy to Trust for Medical AI. In: Ceci, F., Prencipe, A., Spagnoletti, P. (eds) Exploring Innovation in a Digital World. Lecture Notes in Information Systems and Organisation, vol 51. Springer, Cham. https://doi.org/10.1007/978-3-030-87842-9_4
