1 Introduction

The Medical Internet of Things (MIoT) is crucial component for enabling the Industry 4.0 revolution across healthcare industry. MIoT connects large num- ber of medical devices to the internet. The rapid growth of medical IoT has transformed the healthcare system through the use of sensors and smart medical systems [1]. The inadequacies in our present healthcare systems have been highlighted by a number of natural disasters, including the COVID-19 pandemic. IoT and data analysis using machine learning (ML) methods can considerably increase the cost-effectiveness, accessibility, and resilience of our healthcare sys- tem by converting some of the conventional physical healthcare methods to their online remote equivalents, such as remote patient health monitoring [2]. ML in healthcare would augment physicians in making critical decisions quickly. How- ever, ML approaches are data-hungry. An approach to machine learning (ML) training that protects data privacy is essential given the significance of protecting patient health data. The use of FL in smart healthcare can solve the problem of limited healthcare data availability, especially for rare diseases, and patient data privacy during training by collaborating the ML model training among various hospitals.

A healthcare organization consists of several patient health records that vary from hospital to hospital. Electronic medical records generated during clinical practices are captured at various points throughout the hospital, beginning at the registration desk and continuing until the patient is discharged. However, these records of different kinds, like medical imaging, are stored in hospital servers not accessible to stakeholders outside a particular hospital/ healthcare provider in- stitution to adhere to the privacy and security regulations of a particular country. However, the disease spectrum varies worldwide; for example, medical students or lecturers in the western countries do not observe tropical diseases as well as different symptoms for the same diseases across various parts of the world. In this situation, FL technology holds great significance in cross training physicians on various diseases while ensuring the security and privacy of patient records, as the records do not move outside a hospital server. Clinical training and therapeu- tic aspects can be taught to trainees, and management skills can be imparted. Personalized FL [3] can jointly train distributed ML models from the knowl- edge of the data across all the different geographical locations, still producing personalized models specific to each geographical location simultaneously.

In 1976, philosophy professor Patrik Hill proposed the term FL to form a Federated Learning Community (FLC) for bringing people from different re- search universities to learn together [4]. McMahan et al. [5]. used the term FL for the first time in 2017 to preserve patient data privacy by training the ML model in a distributed fashion.

Figure 1 describes a general privacy-preserving FL framework. Multiple clients in FL receive a global model with weight parameters from the server. The clients then train their models from local data using various stochastic optimization methods (for example, stochastic gradient descent [6]) and send the trained model to the server. The server then aggregates these weights in order to learn a global model. Consequently, the global model acquires all the local clients’ knowledge without even accessing their local datasets [5]

Fig. 1
figure 1

Federated learning framework

FL can reduce the computational budget required to train a high-quality global model due to the resource-constrained MIoT devices by distributing the workload and training it in parallel [7]. Because FL training occurs on edge de- vices, it can accelerate the inference step where network bandwidth is limited. FL can also reduce bandwidth requirements by transferring only weights rather than data. Data breaches are prevented because each edge server performs local computing, ensuring privacy.

This paper discusses the importance of FL in healthcare systems, how it can solve several disease detection problems, and how it can ensure patient health- related data privacy. The remainder of the paper is structured as follows: The overview of FL in healthcare is described in Sect. 2. Sect.3 discusses the current state of FL in IoT-based healthcare applications. Sect. 4 describes the difficulties encountered by FL-based ML models. Following that, future work is discussed in Sect. 5. Finally, Sect.6 concludes the paper.

2 System model for FL in healthcare

Figure 2 illustrates a generic framework for FL in IoT-enabled healthcare sys- tems. The IoT-enabled medical devices collect the data on the respective edge servers of each hospital. Afterward, local computation of the model is done on the edge servers that provide the optimal weights. Each server’s optimal weights are then sent to the global server, which aggregates all the weights and trains and evaluates the model. This way, weights are shared in parallel from all the hospitals to the centralized server. The training of the FL model can be used for classification and detection approaches. For example, hospitals that contain the patient’s ECG measurements can have several classes depending on the heart- beat measurements. Few patients have an average heart rate, while others can have catastrophic heart diseases like arrhythmia and ventricular tachycardia. To distinguish between normal and diseased patients, local computing at each client can be trained for the classification task. Later, the local weights can be aggregated on the global server for final evaluation.

Fig. 2
figure 2

A Federated learning system model in healthcare

According to the Health Insurance Portability and Accountability Act (HIPAA) [8], patient safety is essential when applying any ML algorithms. The FL ap- proach can ensure patient privacy and secure patient health records (PHI). Dur- ing the transmission of health data, the data gets transmitted from medical devices to the cloud and is susceptible to cyber-attacks like spoofing, informa- tion disclosure, denial of service (DoS), and privilege escalation attacks. The first crucial aspect of using FL in healthcare is preventing data sharing with the gateway or a cloud server. This way, the safety of the patient data is also ensured.

3 Related work

The FL model for IoT-enabled healthcare delivery provided several solutions for interconnected IoT devices’ diverse, low-power, and distributed nature. The ear- lier research discusses the architecture of federated learning, system problems, and a method for protecting privacy [9]. With the advancement of wearables and sensor technologies, enormous data is now evolving, which is essential to be trained using different ML algorithms. Syreen et al. [9] proposed several clas- sification and clustering techniques for FL-enabled IoT systems. The resource- constraint nature of IoT makes these devices share data with edge nodes using different communication protocols, which are prone to cyber-attacks. To address this, Nguyen et al. [10] address the extensive range of FL applications in IoT services, including data sharing, attack detection, and IoT privacy and security. Similarly, Alam et al. [11] emphasize the importance of robust FL classifiers that aid in the non-disclosure of information, resulting in highly secure privacy rules. The industry 4.0 revolution has upgraded the healthcare system by incorpo- rating AI into edge devices to enable training. FL can facilitate the connectiv- ity of various industrial internet of things (IIoT) devices at the gateway while maintaining patient confidentiality. However, the previously stated study demon- strates future uses of FL in numerous technological sectors. These papers focus primarily on the role of FL and the challenges it can confront in all application domains.

Recently, the FL concept is becoming increasingly prevalent in clinical care. Table 1 describes different FL healthcare applications implemented until to-day. With the growth of IoT devices that offer affordable healthcare monitoring, the FL method can expedite the detection of diseases based on the patient’s physiological characteristics. Elayan et al. [12, 13] offer a deep federated learn- ing system that utilizes transfer learning to detect skin disorders. The authors evaluated their results using the evaluation metrics of precision (the number of optimistic predictions), recall (the number of correctly classified accurate pre- dictions), and the F1 score (the weighted average of precision and recall). The FL model increases the area under the curve, a classifier’s ability to distinguish between classes, to 0.97. The higher the value, the better the performance. The author, however, did not resolve the issue of limited data availability. Singh et al. [14] offer a blockchain-based distributed FL architecture for IoT-enabled med- ical devices. This research aims to train machine learning models at various IoT device locations without transmitting patient data to the cloud.

Table 1 Federated learning applications in healthcare

The early diagnosis of certain fatal diseases is crucial. Li et al. [14] proposed an ADDetector, a detector for Alzheimer’s disease. The author introduces three layers to privacy: users, clients, and gateway. A classification model with a dif- ferential privacy mechanism was designed to protect model aggregation between clients and, clouds. The accuracy of their ADDetector is 81.9%. Brisimi et al. [15]. suggest a binary supervised learning system to predict hospitalizations of car- diac events through distributed algorithms. Without explicitly exchanging raw data, the study aims to enable diverse data holders to interact and converge on a shared predictive model. Blanquer et al. [16]. propose a hardware-enhanced ar- chitecture for encrypting sensitive data in memory and on discs that can only be accessed within trusted execution contexts. The use of federated clouds enables the collection of resources needed for high performance and security.

Securing local models from violating the user’s privacy is also crucial. The current locally trained models are exposed to adversaries such as inversion and membership inference attacks. Stephanie et al. [17]. present a blockchain-based heterogeneous model to learn from healthcare institutions collaboratively. Each hospital owned an edge server where data collected from the medical devices were trained. The trained model is verified and stored in a private blockchain. The author took two datasets: Colon Pathology and Breast Cancer. The model used for training purposes is centralized Fl, FedAvg, TEE-based FL, and Ensemble- Fed Avg. All algorithms gave an accuracy of more than 80%.

Zhang et al. [18]. used cryptographic primitives, including masks and homo- morphic encryption, to protect the local models. Private medical data can be attacked using a model reconstruction attack. Along with data protection, the author analyzed the communication and computation costs. The model was trained on HAM1000 medical dataset for skin lesion classification. The maxi- mum accuracy obtained was around 76.9%.

Khoa et al. [19]. uses Encode Depth Convolutional Network as both server and client for recognizing human activity. They compared their approach with LSTM and CNN in the FL setting. A DEEP-FEL model was proposed by Lian et al. [20]. This method teaches medical centers their local model and aggregates the sub-model by maximizing the topological ring structure. Sub-global model information was transmitted using differential privacy. The model was evaluated using FL and ML techniques on COVID-19 CT scan images. An estimation and classification model for a diabetes-monitoring healthcare system was proposed by Castillo et al. [21]. The estimation model forecasts the glucose level in the patient, while the classification model detects the anomalous points in patient data. The authors implemented the FL and independent learning (IL) approach with a recall rate of 98.69% and 97.87%, respectively.

The current state of research describes the significance of FL in IoT-enabled healthcare systems. The authors discussed numerous applications of FL in health- care domains in the literature survey. With the progress of this healthcare field, several limitations like data heterogeneity, loss of essential data, and modifica- tion of model weights need to be addressed, as discussed in Sect. 4, for making these FL-deployed healthcare systems robust.

4 Challenges in FL for IoT-enabled healthcare systems

FL for the healthcare system can be proved to be the most influential future contributions to the medical industry. However, with current FL approaches, there are several limitations:

  • Data heterogeneity: Healthcare organizations contain diverse information. When training a local model on edge devices, the differential weights shared by all the local servers will have a significant disparity, leading to inefficient model prediction. Because of heterogeneous patient metadata, obtaining essential data comparable to other edge devices from hospitals can take time and effort.

  • Heterogeneous healthcare systems: Most healthcare organizations train mod- els with varying processing capabilities during local computing on edge servers. Consequently, the GPU and hospital processing unit are essential for local training. With hospitals with less computing power, the local model may be less efficient, and the shared weights from these hospitals may con- tribute less to the central server’s global aggregation.

  • Loss of important healthcare data: During the sharing of weight from local clients to the global server, packets with the most significant contribution to the model prediction may be dropped during the communication. As a result, if the FL was trained on critical illness detection applications, data packet loss during communication can cause severe concerns about patient health and medicine.

  • Modifications with the model weights: The exchanged weights during FL are vulnerable to cyber-attacks. Any adversary can use reverse engineering to discover the shared weight’s attributes and modify them maliciously. Even if the data is not shared, it is essential to safeguard the model weights.

  • Single server problem: Until now, most FL approaches in healthcare appli- cations utilize a single server for global weights aggregation and training. However, relying on a single server cannot be trustworthy. A single point of failure may result in the complete failure of critical infrastructure like health- care. Privacy issues are still present on the server. A cyber-attack can ever occur on the server, directly affecting the final model weights and resulting in a considerable loss for hospitals. Therefore, distributed systems or backup can help overcome these challenges and prepare them for fault tolerance.

  • Trade off between noise and accuracy: As the client’s model weights can be reverse-engineered to extract meaningful information about the client’s knowledge, differential privacy can make the weight update more robust. However, adding noise to the model parameters will temper the knowledge contained in the parameters, which would eventually decrease the accuracy of the aggregated global model. Thus, there exists a trade-off between adding noise and the final accuracy of the model.

5 Future work

FL sees a tremendous expansion. It is possible to extend it further by following the paths of study.

  • Detection of malicious weights at the client level: Malicious clients have previously been identified at the server level during aggregation [22, 23]. However, determining the specific corrupted weights at the client level has yet to be investigated.

  • Energy efficient communication: Several research papers have investigated more efficient methods of communicating client weights to the server. Trans- ferring only the top few percentages of weights as opposed to all weights is known as sparsification of weight. This significantly reduces communication expenses. However, the proof of convergence of such communication-efficient federated learning frameworks needs to get investigated. [24].

  • Statistical data heterogeneity: The signs and symptoms of a disease can vary significantly across time and space. Thus, it is necessary to design federated aggregation algorithms that can capture healthcare data’s temporal and spa- tial variations and efficiently learn from data spanning a large geographical area and time.

  • Resource constrained training: Medical devices are made to be used in a hospital or clinic. However, they need more computing power to train the

  • ML algorithms, which need considerable data to learn. This problem can be fixed by looking into FL training for IoT devices with limited resources [25]. Neural architecture search can be used to find the best model size based on resources available, and FL can be used to train those models. [26]

  • Novel communication scheme: Federated learning approaches can be divided into synchronous and asynchronous (bounded delay) approaches. This Fed- erated learning training approach is practical when the clients are only ded- icated to the task of federated learning. However, real-world clients are only sometimes available for training, e.g., mobile phones on low batteries, etc. Thus, a device-centric training approach is highly needed for the Federated learning training [27,28,29].

  • Necessity of training medical professionals: In the current scenarios, FL train- ing is only performed to train ML models to improve patient health. However, medical professionals are not comfortable using these technologies. Thus, training medical professionals on how to use these technologies are neces- sary. They also need to be trained in the abilities of these ML algorithms so that the problems can be communicated better with the technical persons to solve the problems in the healthcare domain efficiently. [30, 31]

6 Conclusion

FL is a novel AI methodology that has generated attention for developing scal- able and secure smart healthcare applications. The use of FL in various health- care settings is analyzed in this article. We discussed diverse reasoning and how and why it could be used in smart healthcare. After that, there was a discus- sion about new FL ideas that, if implemented, would make federated intelligent healthcare work better. Then, critical FL applications in innovative healthcare were carefully looked at and defined. Some of these were federated EHR manage- ment, federated remote patient health monitoring, and medical image diagnosis and, illness detection. Last, we discussed some intriguing difficulties and the future of FL-smart healthcare.