Avoid common mistakes on your manuscript.
Artificial intelligence (AI) is expected to broadly reshape medicine; however, the vast majority of developed AI models in the intensive care unit (ICU) still remains within the testing and prototyping phase [1,2,3]. AI, defined as a machine’s ability to mimic human-like capabilities such as reasoning, learning, planning, and creativity [4], faces several adoption challenges in clinical settings. One of the primary issues is that the integration of AI into clinical practice encounters challenges, including concerns over data privacy, sharing, transparency, and explainability [5]. These are crucial to overcome because recent advancements have led to the development of sophisticated AI models capable of diverse tasks, ranging from text interpretation to image generation. For optimal performance, such advanced models, also known as foundation models, require training on extensive and diverse datasets. A well-known example of a foundation model is ChatGPT, released by OpenAI in 2022, able to generate human-like natural language responses that are more empathetic to patient questions compared to clinicians and can assist ICU clinicians with tasks such as summarizing unstructured medical notes and preparing accurate discharge summaries [6, 7]. This however can only be achieved when data are shared among healthcare providers and institutions to achieve proper volume, is standardized, and is de-identified. The process of anonymizing data, while critical, is not foolproof against AI-driven attacks that can potentially reconstruct sensitive information. Thus, traditional anonymization techniques may fall short in fully protecting patient privacy [8]. Also, acquiring such data is a challenge because healthcare data are siloed within hospitals and data sharing is subject to ethical, organizational, and legal complexities. Currently, the absence of a robust framework for cross-border data sharing in ICUs (and hospitals in general) hinders the standardization of data sharing and access, thereby affecting the effective training and implementation of foundation models.
Federated data access and federated learning
To address these challenges, federated data access and federated learning (FL) offer innovative solutions. Federated data access enables the analysis and extraction of insights from health data without the need for its physical transfer. This methodology enables the aggregation of diverse data sets from multiple sources while ensuring that each dataset remains securely within its original location. Consequently, it not only safeguards privacy but also breaks down the barriers created by data silos. Building upon this, FL extends the capabilities of federated data access by enabling AI models to be trained and refined directly within this secure data infrastructure. This approach allows for data analysis and model training to occur at the data’s source, circumventing common data privacy concerns in healthcare. This makes FL a crucial tool in enhancing patient care and medical research. Collaborative data access and model training are orchestrated via a central server (e.g., a service provider). Hospitals have the flexibility to utilize their own infrastructure or a virtual private cloud for data storage and facilitating model training (Fig. 1) [9]. Imagine a network of ICUs across various hospitals, each accumulating vital patient data and providing a large, diverse medical dataset. For a model to be trained with such data, all data across the federated data network need to be harmonized to facilitate that data elements have a similar structure and meaning. Common data models like the Observational Medical Outcomes Partnership (OMOP) facilitate data harmonization, offering a standardized framework that can be used to map raw data from various sources into a common structure. This facilitates data harmonization and the possibility to train models on large amounts and diverse patient data from multiple institutions, eventually improving the model [10]. In a FL framework, each data controller not only defines its own governance processes and associated privacy policies, but also controls data access and has the ability to revoke it. This ensures that local ethical standards, organizational policies, and legal requirements can be met without the need of fully harmonizing these across many countries and institutions [11].
Federated learning and bedside support
Imagine a night shift at the ICU, where an alert is raised stating, “Caution: patient X has an increased risk of deteriorating towards septic shock within the next 24 h at 78% likelihood.” Further interaction with the bedside monitoring model is possible, with queries like, “[healthcare professional] why was this alert issued?” and the model responding, “[model] the patient’s respiration rate and heart rate have increased over the last hour, and blood cultures have just returned positive results.” This example demonstrates the advanced functionality of a foundation model. For such advanced predictive functionality to be realized, the model must be trained on large, diverse medical datasets. Also, a foundation model must be fine-tuned with validated prompts and must be ‘grounded’ to local data prior to verification and testing and ultimately deployment to the ICU. Grounding involves taking the pre-trained foundation model and tailoring it to address specific real-world challenges and tasks. As such AI models will offer bedside decision support by harnessing clinical expertise and delivering comprehensive textual explanations and data summaries.
Advantages of federated learning
One of the benefits of FL is its capability to enable the swift and real-time analysis of diverse, sensitive clinical data. This feature supports the local implementation of foundation models, allowing them to be continually updated with the most recent data from a variety of sources. Such dynamic updating enables the models to rapidly adapt to evolving clinical situations, providing more accurate and timely support for critical decision-making. This is particularly crucial in ICU settings. In broader healthcare scenarios like pandemics, different institutions possess unique knowledge, resources, or datasets that are vital for an effective response. FL allows these institutions to contribute their specialized expertise and data while retaining control over it. The decentralized nature of FL is instrumental in developing responsive AI models and strategies, thus improving our collective capacity to manage emergency health crises.
Challenges of federated learning
Despite its promising approach to data privacy, FL does not completely resolve data governance issues. Instead, robust data governance becomes a fundamental requirement for enabling effective FL. To begin with, data sharing and the adoption of common data models necessitate that all participating data providers have well-defined data management policies. These policies should ensure data lineage, uphold high data quality, and establish clear responsibility and accountability for the data. Furthermore, robust agreements for the type of standardization across hospitals are essential. Institutions must establish consensus on a unified data model and semantic standards for coding key concepts like diagnoses, medications, and clinical lab results. Additionally, technical limitations related to the availability of hardware and cloud resources for data storage, as well as computational power for model training, may pose challenges for hospitals in adopting federated learning. These technical constraints can be barriers for some hospitals in adopting FL, particularly those with limited IT infrastructure. Moreover, and perhaps more importantly, because their output reflects their training data, foundation models can perpetuate biases due to disparities in gender, race, and socio-economic status. Ensuring an adequate representation of hospitals from various regions worldwide could lead to more diverse and inclusive health datasets.
Data-driven ICU medicine
Currently, most ICU models are trained on small, narrowly scoped clinical datasets and are evaluated on tasks that do not provide meaningful insights on their usefulness to health systems [12]. A recent successful example of FL in the ICU is a collaborative effort across 20 global institutes to predict clinical outcomes in patients with coronavirus disease 2019 [13]. Despite this, clinical examples are limited and foundation models in the ICU remain a vision until (cross-border) data access between various institutions to enable proper model development is achieved. To facilitate such a network, the European Commission issued a tender to initiate a pan-European level ICU data infrastructure [14]. We are currently bringing together different stakeholders to create a federated infrastructure for ICU data across Europe, with the European Society of Intensive Care Medicine as key stakeholder, and welcome expressions of interest from professionals with diverse backgrounds and an interest in ICU.
Take-home message
FL is a machine learning setting where a model is collaboratively trained under the orchestration of a central server, while keeping the training data decentralized. It is key to the development of foundation models for healthcare. In the ICU field this innovative approach ushers in a future marked by safer, more effective, and globally interconnected healthcare. This paradigm shift ensures that data are standardized, privacy is preserved, regulatory compliance is maintained, and healthcare institutions retain control over their invaluable patient data, while disease detection and knowledge sharing is enhanced.
References
van de Sande D, van Genderen ME, Huiskens J, Gommers D, van Bommel J (2021) Moving from bytes to bedside: a systematic review on the use of artificial intelligence in the intensive care unit. Intensive Care Med 47(7):750–760
Moor M, Banerjee O, Abad ZSH, Krumholz HM, Leskovec J, Topol EJ, Rajpurkar P (2023) Foundation models for generalist medical artificial intelligence. Nature 616(7956):259–265
Rajpurkar P, Chen E, Banerjee O, Topol EJ (2022) AI in health and medicine. Nat Med 28(1):31–38
European Commission (2021) Regulation of the European parliament and of the council laying down harmonised rules on artificial intelligence (Artificial Intelligence Act) and amending certain union legislative acts
Rodemund N, Wernly B, Jung C, Cozowicz C, Koköfer A (2023) Striking the balance: privacy protection and data accessibility in critical care research. Intensive Care Med 49(8):1029–1030
Ayers JW, Poliak A, Dredze M, Leas EC, Zhu Z, Kelley JB, Faix DJ, Goodman AM, Longhurst CA, Hogarth M et al (2023) Comparing physician and artificial intelligence chatbot responses to patient questions posted to a public social media forum. JAMA Intern Med. https://doi.org/10.1001/jamainternmed.2023.1838
Komorowski M, del Pilar Arias López M, Chang AC (2023) How could ChatGPT impact my practice as an intensivist? An overview of potential applications, risks and limitations. Intensive Care Med 49(7):844–847
Rocher L, Hendrickx JM, de Montjoye Y-A (2019) Estimating the success of re-identifications in incomplete datasets using generative models. Nat Commun 10(1):3069
Wen J, Zhang Z, Lan Y, Cui Z, Cai J, Zhang W (2023) A survey on federated learning: challenges and applications. Int J Mach Learn Cybern 14(2):513–535
Blacketer C, Defalco FJ, Ryan PB, Rijnbeek PR (2021) Increasing trust in real-world evidence through evaluation of observational data quality. J Am Med Inform Assoc 28(10):2251–2257
Rieke N, Hancox J, Li W, Milletari F, Roth HR, Albarqouni S, Bakas S, Galtier MN, Landman BA, Maier-Hein K et al (2020) The future of digital health with federated learning. npj Digit Med 3(1):1–7
Wornow M, Xu Y, Thapa R, Patel B, Steinberg E, Fleming S, Pfeffer MA, Fries J, Shah NH (2023) The shaky foundations of large language models and foundation models for electronic health records. npj Digit Med 6(1):135
Dayan I, Roth HR, Zhong A, Harouni A, Gentili A, Abidin AZ, Liu A, Costa AB, Wood BJ, Tsai C-S et al (2021) Federated learning for predicting clinical outcomes in patients with COVID-19. Nat Med 27(10):1735–1743
European Commission–Directorate-General for Communications Networks CaT (2023). Cloud-data and AI digital-2023-cloud-AI-04. In: (Digital) DEP(ed)
Acknowledgements
We thank Jan AJG van den Brand and Lisanne van Prooyen–Schuurman for their input and specific knowledge of federated data access and federated learning.
Author information
Authors and Affiliations
Contributions
MvG and CJ conceived the idea and drafted the manuscript. MC critically reviewed and edited the manuscript. All authors approved the final manuscript and had final responsibility for the decision to submit for publication.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/.
About this article
Cite this article
van Genderen, M.E., Cecconi, M. & Jung, C. Federated data access and federated learning: improved data sharing, AI model development, and learning in intensive care. Intensive Care Med 50, 974–977 (2024). https://doi.org/10.1007/s00134-024-07408-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00134-024-07408-5