An intelligent healthcare system for predicting and preventing dengue virus infection

Sood, Sandeep Kumar; Sood, Vaishali; Mahajan, Isha; Sahil

doi:10.1007/s00607-020-00877-8

An intelligent healthcare system for predicting and preventing dengue virus infection

Special Issue Article
Published: 08 January 2021

Volume 105, pages 617–655, (2023)
Cite this article

Download PDF

Computing Aims and scope Submit manuscript

An intelligent healthcare system for predicting and preventing dengue virus infection

Download PDF

Sandeep Kumar Sood¹,
Vaishali Sood²,
Isha Mahajan² &
…
Sahil³

4960 Accesses
14 Citations
Explore all metrics

Abstract

Dengue is a mosquito-borne pandemic viral infection, which transmits to humans from Female Aedes albopictis or Aedes agypti mosquitoes. It progressively deteriorates the health of infected individuals and poses a high threat of human morbidity and mortality. This paper proposes an intelligent healthcare system which identifies, monitors, and alerts dengue virus (DeV) infected individuals and other stakeholders in real-time and control the DeV infection outbreak using cloud computing, internet of things and fog computing paradigms. The proposed system uses Naive Bayesian Network (NBN) for diagnosing the possibly DeV infected individuals and generating real-time alerts for suggesting and alerting the concerned stakeholders for taking on-time necessary actions at the fog subsystem. The proposed system also uses Social Network Analysis at the cloud subsystem, to provide Global Positioning Systems (GPS)-based global risk assessment of the DeV infection on Google Maps (Google-based web map service) and control DeV infection outbreak. The analysis of the experimental results acknowledges the efficiency of the NBN-based DeV infection diagnosis, alert generation, and GPS-based risk assessment functionality, of the proposed system, via various statistical measures and experimental approaches.

An intelligent and secure healthcare framework for the prediction and prevention of Dengue virus outbreak using fog computing

Article 28 March 2019

An intelligent system for predicting and preventing MERS-CoV infection outbreak

Article 08 July 2015

IoT and cloud-based COVID-19 risk of infection prediction using hesitant intuitionistic fuzzy set

Article 07 January 2024

1 Introduction

Dengue fever is considered as one of the world’s most common vector-borne diseases. It is caused by the transmission of the Dengue Virus (DeV) into human beings, from the infected mosquitoes through mosquito bites. Female Aedes agypti or Aedes albopictis mosquitoes are the primary sources of DeV transmission into humans. When the mosquito bites a person having DeV in his/her blood, the mosquito becomes infected with the DeV. An infected mosquito can later transmit that infection to healthy people by biting them. An infected person can transmit the DeV from 4 to 10 days via female Aedes agypti mosquito after the symptoms appear. The DeV incidence can cause human morbidity and mortality in case the due attention is not paid. The DeV infection affects the tropical and subtropical areas, especially in developing and underdeveloped countries, where poor water management, substandard housing, inadequate waste management, poor vector control, and deteriorating disease prevention programs become its main drivers [6]. The symptoms of DeV infection are severe fever, headache, eye pain, rashes on the body, bleeding in nose or gums, and loss of appetite [47]. Sometimes, the DeV infection can lead to more severe conditions like hemorrhagic fever, dengue shock syndrome, and various permanent health losses.

According to the World Health Organization [52], approximate 2.5 thousand million people or 40% population of the world is living in the areas, where the DeV transmission risk is high. The report states that the incidence of DeV infection has increased by 30 times over the last half of the century. An estimation states that more than 50 million DeV infection occurs worldwide in a year, of which 1% severe dengue cases are hospitalized, and 2.5% of the hospitalized people die. Another report [53] states that only 96 million out of 390 million dengue infection cases are being clinically manifested per year. Further, the report states that the 3.9 thousand million people belonging to 128 countries are at a high risk of DeV infection. Henceforth, its real-time diagnosis and control are emerging as one of the most critical issues for healthcare agencies, especially in developing and underdeveloped countries.

1.1 Contribution

The diagnosis of DeV infection, provision of curative aids to infected individuals, and preventive aids to the uninfected individuals at the nascent stage of the infection can reduce the health severity consequences and prevent the outbreak of DeV infection. However, the existing healthcare systems are not so efficient in controlling such types of infectious diseases. These systems are having various limitations like the readily unavailable diagnostic procedures, people’s ignorance in treating every fever as a simple fever and even their reluctance in providing their vital health information, their traditional methods to treat diseases, and various environmental conditions. Even the reluctance and unawareness of patients to visit hospitals for routine checkups and doctors’ inability to monitor each patient regularly by visiting them, especially in those places where doctor to patients ratio is very low, are some significant challenges for the existing healthcare systems. Hence, to control the DeV infection outbreak, its real-time and remote detection, and monitoring can play an important role in healthcare-related services.

The immense potential and advancement of Information and Communication Technologies (ICT) and their diffusion in various domains have enabled the communities to build sustainable solutions for improving their lives [34] . The ICT paradigms viz. cloud computing, Internet of Things (IoT), and fog computing have revolutionized the world in recent times, with the development of smart solutions in various fields [8, 17, 27] and healthcare [3] is one such field. These paradigms have equipped the healthcare sector with the features of On-time Remote Diagnostics, Tele-Medicine, Remote Healthcare Monitoring, Timely Health Awareness and alike, and evolved the legacy healthcare systems into intelligent and responsive system.

The Cloud-assisted IoT-based healthcare systems have made it feasible to remotely diagnose and monitor the health attributes of the individuals and provide appropriate treatment. The effectiveness of healthcare services require real-time health diagnostics and alert generation to the concerned stakeholders: individuals, hospitals and doctors. However, in the Cloud-IoT environment, the scale of data generated by IoT-sensors present the challenges viz. the latency due to the communication overhead and transmission delay, real-time data processing, and privacy of the sensitive health data of the patients. Here, the concept of fog computing resolves the issues like location awareness, high latency, and reliability by providing computing, communication and storage resources at the edge devices of the network. It enables the individuals to use computing services for processing the data in their mobile devices, and the transmission of only required data that too in a protected manner to the cloud servers. Fog computing provides various merits to the healthcare systems, such as quality of service, low latency, location awareness, and real-time notification of services [35].

These significant benefits can assist in real-time identification of the DeV infected individuals, on-time alert generation to the concerned stakeholders and assessing the DeV infection geographical risk. Hence, the challenges of early and remote detection of DeV infection, controlling of its outbreak, and the motivation of ICT diffusion in the healthcare systems have derived us to propose a Fog-Cloud-IoT based intelligent healthcare system for controlling DeV infection outbreak.

In this paper, a Fog-Cloud-IoT based healthcare system is proposed, which analyzes individuals’ health attributes and the surrounding environment for diagnosing the possible DeV infected individuals, monitoring their health, and controlling the outbreak of DeV infection. The system acquires the individuals’ personal and health-related attributes via a mobile application on their phones, and environment-related attributes through environmental sensors, at the data acquisition subsystem. The system keeps the personal information of the individuals private using a two-stage mechanism: information granulation and secret sharing scheme. On the Fog subsystem, the system uses Naive Bayesian Network (NBN) for diagnosing the individuals for possible DeV infection, which classifies them into the category of possibly infected (In) or uninfected (Un), using their health attributes. Once the individuals are diagnosed with possible DeV infection, the fog subsystem immediately generates alerts and aware individuals about the disease and suggest them for medical examination (if require) to medically confirm the incidence of the infection by consulting with doctors and through proper recommended laboratory tests.

On the cloud subsystem, the system uses the location of infected individuals using their GPS-enabled mobile phones to track their movement for Social Network Analysis (SNA) and for assessing the geographical risk assessment of the DeV infection using Google Maps. This subsystem depicts the geographical risk by using the information of actually infected and uninfected individuals from hospital databases and cloud storage. This geographical risk assessment alerts the individuals through advice and precautions, guides the individuals for choosing safer paths, and helps the various stakeholders (individuals, government agencies, and health organizations) in controlling the DeV outbreak. The primary objectives of the proposed system are as follows.

To early screen the possible DeV infected individuals and suggest them for the on-time medical examination for confirming the incidence of the infection.
To generate real-time diagnostic and suggestive alerts to individuals, and emergency alerts to doctors or caregivers for taking on-time appropriate actions.
To identify the infected and risk-prone areas geographically for effective and on-time controlling of the DeV infection outbreak.
To predict the impeding DeV epidemic for generating early warning alerts to individuals for enhancing the effectiveness of the preventive measures.
To protect the individuals’ sensitive health information.

The proposed system does not substitute the doctors and medical procedures, but instead, provide the individuals and healthcare agencies the ability of early and possible diagnose of the DeV infection, so proper medical procedures for diagnosing the actual DeV infection could be adopted, especially in developing and under-developed countries, where most of people are below the poverty line and treat dengue fever as a simple fever, which may sometime cause life-threatening complications to them. The system also helps in avoiding any chaos or panic among individuals due to the rumors during the peak season of dengue by distinguishing dengue fever from simple fever, and subsequently suggesting for proper medical procedures to confirm its incidence.

The proposed system presents an intelligent healthcare system for controlling the DeV infection outbreak using IoT, cloud, and fog paradigms. The significant contributions of the proposed system are as follows.

The proposed system provides early remote-diagnostic functionality for the possible DeV infection, based on the physiological and environmental parameters.
The proposed system bridges the gap between the required healthcare infrastructure and available infrastructure, especially in the countries where doctor to patients ratio is very low.
The proposed system provides real-time diagnostics, suggestive, and curative alerts to the concerned stakeholders (individuals, government agencies and health organizations).
The proposed system geographically assesses the global DeV infection risk and helps in controlling its outbreak.

The paper has been organized into five sections. In the second section, dengue-based healthcare, Cloud-IoT based healthcare, Fog computing, and Infectious diseases-based intelligent healthcare related significant works have been discussed. The third section discusses the architecture level details of the proposed system. In the fourth section, the experimental evaluation has been performed along with various results discussions. The fifth section discusses the conclusion of the paper.

2 Related work

This section discusses significant related works based on four different directions. The first subsection discusses the works related to dengue-based healthcare. The second subsection discusses the works related to Cloud-IoT based healthcare. The third subsection discusses the works related to fog computing, and the fourth subsection discusses the works related to infectious diseases-based intelligent healthcare.

2.1 Dengue-based healthcare

This subsection discusses various significant works related to DeV infection, which have explored the laboratory features and clinical symptoms of dengue fever and the impact of climatic factors (temperature, humidity, etc.), socioeconomic factors as well as socio-geographical factors on the incidence of DeV infection. In this regard, Huang et al. [18] had identified the clinical symptoms as well as laboratory features for distinguishing the patients diagnosed with influenza from the patients diagnosed with DeV or other pyretic diseases. Sirisena and Noordeen [41] had presented the role of climatic factors in the DeV outbreak. Machado-Machado [24] had evaluated the contribution of both the climatic and socioeconomic factors in determining DeV infection. The results of the work has depicted the more relevant significance of parameters related to climate as determinants for DeV infection than the variables related to socioeconomics. Other than symptoms and environmental factors, many research works have indicated the impact of socio-geographical factors [48, 50] as well as human movement [23, 32, 44] on the transmission of DeV and its outbreak. One such work by Tao et al. [46] had presented a model that depicts the role of the infected humans’ movement in contributing DeV infection outbreak and suggested that the restricted movement in infected areas can lower the diffusion risk of DeV infection.

Based on aforementioned factors and various advancements in data analysis-based techniques, various research works have proposed computer-aided DeV detection and prevention systems in the domain of dengue-related healthcare. In this direction, Rao and Kumar [31] had developed a computer-aided diagnostic system for identifying dengue fever based on the clinical and laboratory symptoms. The system has employed data analytics approaches: wrapper-based feature selection method with genetic search and decision tree method for extracting and classifying the symptoms for identifying the incidence of dengue fever. Vinarti and Hederman [51] had proposed a DeV infection risk prediction system that analyzes individual’s personal attributes like age, gender and visiting locations, and environmental attributes like specific locations features (aedes aegypti mosquito dwell), climatic conditions (favorable mosquito breeding conditions) to predict the risk of contracting dengue infection by using Bayesian network. Zhang et al. [56] had proposed a DeV infection outbreak surveillance system that analyzes the dengue-related information from online news sources to determine the dengue outbreak. The system has employed a text mining approach, which extracts the dengue-related articles and apply statistical analyses to determine the number of cases for dengue. The entire intention of this work was to work on the alternative sources of dengue-related information to determine the dengue outbreak, other than the clinical investigation of patients. In this regard, another work by Missier et al. [26] had depicted the extraction of dengue-related information from the social networking platform Twitter, where people use to discuss information online with others, for analyzing the outbreak of dengue infection in particular areas. Sousa et al. [43] had designed a user-oriented platform for reporting mosquito-borne diseases and breeding sites, which helps in supporting the authorities for preventing and controlling mosquito-borne diseases. However, for reducing the dependence on information from the users, which is based on the willingness of the users, the platform has also explored alternate source of information through the mining of geolocated content from social networks.

These motivations of exploring the sources of information other than the clinical investigations and that too for the remote detection of dengue infection have led to the integration of distributed computing paradigms: IoT prototype of wireless sensor network and cloud computing, for establishing various arenas for smart healthcare systems.

2.2 Cloud-IoT based healthcare

The recent years have witnessed the development of a number of smart healthcare systems, which have provided the remote healthcare services through the diffusion of information and communication technologies, in which IoT prototype of wireless sensor networks and cloud computing have played a significant role. The current subsection discusses such various distinctive advancements in the healthcare domain. In this regard, Sandhu et al. [37] had developed a cloud-based DeV infection detection system, which identifies the infection among individuals using keyword aware domain thesaurus, which extracts the relevant keywords from the symptoms submitted by individuals through their mobile devices in any format and converts the message into system-specified format. The system then identifies the dengue cases from these system-formatted data and shares the information with the concerned stakeholders: doctors and caregivers. Liu et al. [22] had discussed an approach for healthcare using digital twin of the physical healthcare approach, which could remotely monitor patients using their data through sensor devices, at the remote servers and provide useful insights, diagnoses and predictions to the concerned stakeholders: doctors and individuals. The approach has identified the key technologies viz. IoT and cloud computing for such a remote healthcare approach, which could be effective in many scenarios, where, doctor to patients ratio is low, patients are not able to visit doctors on regular basis, inconvenience of elderly patients to visit doctors, and continuous monitoring of patient health is required. Kaur et al. [19] had developed machine learning based remote health monitoring approach that doesn’t require the presence of doctor with the patient, instead cloud-based decision support system evaluates the IoT-based acquired health data of the individuals and diagnose the disease. The authors have determined the performance of different machine learning algorithms for thyroid, breast cancer diabetes and liver disorder. Abdel-Basset et al. [2] had proposed wireless body area network-based emergency remote healthcare system that continuously monitors the heart-related data of the patient though multi-criteria decision making data analysis. This system works in cloud-assisted IoT environment and if the system diagnoses that a person is suffering from heart failure, then the system immediately sends the ambulance to the patient. Chung et al. [11] had proposed a IoT-based health risk assessment approach that uses individuals’ personal, health and surrounding data for creating a context of an individual and using deep neural network for unwinding context patterns. The approach further employs Minkowski distance formula for evaluating the health risk of an individual with analyzed context patterns by measuring the similarity with the patients having chronic disease. This approach helps in alerting/preventing the individuals with higher risk factors. Latif et al. [21] had developed an integrated healthcare monitoring approach using IoT and cloud computing, that continuously monitors vital health parameters, ensure timely medicinal dosage and emergency prediction. It also provides physicians the data analytics to decide/diagnose the medication errors, medication side effects and prescribe medication accordingly. The approach has employed ranking instances by maximizing the area under ROC curve (RIMARC) for long term medical assessment and classification approaches for short term medical assessment.

The most of the approaches in intelligent healthcare had primarily adopted cloud-centric model for processing the data, where the edge devices were only collecting and pushing the raw data to cloud. Such data off-loading had tended to neglect the computational capabilities of the edge devices in supporting the data analysis at their own and resulted in delaying of data analysis tasks, instead of accomplishment of these tasks immediately near to the data source. As a result, such approaches had suffered overall increased reaction time due to network latency and burdened the cloud with the increased workload, and increased overall renting and running cost of the cloud services.

2.3 Fog computing

As the result of various quality of service issues in cloud-IoT environment, the recent advancements of distributed computing in the form of fog computing and its application in diverse domains viz. disaster management [35], agriculture [17], transportation [33], security [1, 4], and alike, have also motivated the utilization of fog computing in healthcare domain too. In this regard, Dautov et al. [14] had discussed the requirement of processing data at the edge devices for better service provisioning in the Cloud-IoT environment. The authors have proposed a hierarchical data fusion approach in the domain of intelligent healthcare, where the acquired data from the physical world is processed at the different level of hierarchy from IoT sensors to the remote cloud servers at the different edge devices based on their processing capabilities. The approach has employed complex event processing technology for detecting complex event patterns in the stream of data. This approach helps in providing response in real-time and location-dependent. Abdel-Basset et al. [3] had developed a Fog- IoT based remote diagnostic framework for detecting and monitoring type-2 diabetes using VIšekriterijumsko KOmpromisno Rangiranje (VIKOR, a muli-criteria decision making technique). This approach employs fog computing for accessing real-time information from cloud and cloud computing for suggesting a compromised decision which will be closed to the ideal solution, in the situations where all criterias are not settled simultaneously. Pace et al. [28] had proposed a mobile phone-based fog computing architecture called as BodyEdge, which provides easiest way to bridge the body sensor network and cloud computing, along with providing data privacy at the fog level, for providing cardiac monitoring services for athletes and factory labourers. Pravin et al. [29] had proposed a fog-cloud computing-assisted IoT framework for predicting and preventing DeV infection. The proposed framework has depicted how DeV-related information moves in a fog-cloud-IoT based distributed system, but, without any further details about the appropriateness of the employed mechanisms for providing various analytics. The system also lacks in providing geographical-locations based DeV outbreak analytics and their visualization to the concerned stakeholders, which are vital for preventing and controlling DeV infection outbreak. Singh et al. [40] had developed a fog-Cloud-IoT based DeV infection monitoring system, which has employed decision tree-based approach at the fog layer for classifying the infection status of the system users and cloud storage-based health information sharing mechanism for providing health analytics to the concerned stakeholders. The major drawbacks of this system are the cloud-based alert generation, which is not so responsive in providing real-time alerts to the concerned stakeholders as compared to the the fog layer. The system also doesn’t consider any health severity during generating alerts, and even doesn’t explore the infection outbreak controlling perspective of the monitoring approach.

The aforementioned related works depict how Internet of things, cloud computing and fog computing can be used for intelligent healthcare and how their collaborated frameworks can improve the quality of services in various healthcare scenarios. Henceforth, their application in the infectious disease-based healthcare systems can also present effective solutions for helping the world in fighting with global epidemics.

2.4 Infectious diseases-based intelligent healthcare

The recent years have witnessed various incidences of global epidemics such as MERS-CoV and Ebola. For controlling such global epidemics, several key factors have played crucial roles. Among them, speed of diagnosing the infected individuals, tracking their movement and tracing of their contacted persons, and depicting the risk level of infection at different locations, have remained the focus of epidemic controlling strategy. In this regard, Majumdar et al. [25] had developed a fog-based approach to diagnose Kyasanur Forest Disease (KFD) using extremal optimization tuned neural network (EO-NN) at the fog devices and GPS-based visualization of each infected user and risk prone region on Google Maps. However, the authors haven’t represented the epidemic perspective of the infection through maps. Sandhu et al. [36] had presented a BBN-based prediction model that predicts the incidence of the MERS-CoV using Cloud-IoT and provides timely medical support. The system has used the concept of population density of infected and uninfected individuals in a particular area to ascertain the risk level of infection in those areas. Raghav and Dhavachelvan [30] had proposed a fog-based cyber physical system for remotely diagnosing the severe acute respiratory syndrome cronavirus (SARS-CoV) and analyzing its outbreak. The system has employed J48 decision tree for classifying the type of infection and big data-based approach MongoDB for storing, modeling, analyzing and visualizing the SARS outbreak.

Based on the aforementioned discussion and observation from it, a comparison-based performance analysis of the proposed system with some significant existing systems has been shown in Table 1. The performance has been analyzed based on nine parameters viz. IoT, Cloud Computing, Fog Computing, Health monitoring, Environment Monitoring, Real-Time Perspective, Epidemic Susceptance Analysis, Pandemic Susceptance Analysis, Healthcare Data Security, and Geographical Risk Analysis. The values “Y” or “N” depicts the inclusion of mentioned technology or functionality in a particular system or not. The analysis depicts that only the proposed system includes all the mentioned parameters, and henceforth acknowledges the performance edge as compared to other existing systems.

Table 1 Performance comparison of the existing systems with the proposed system

Full size table

3 Proposed system

The proposed system comprises three subsystems, as shown in Fig. 1, namely data acquisition subsystem, fog subsystem, and cloud subsystem. The responsibility of the data acquisition subsystem is to collect the data related to individuals’ personal and health attributes from their mobile phone applications, and data regarding the surroundings from environmental sensors. This subsystem transmits the collected data to the fog subsystem for the real-time diagnosis of the possibly infected individuals. On diagnosing the possibly infected individuals, the fog subsystem immediately generates diagnostic and suggestive alerts to the possibly infected individuals’ mobile phones for suggesting to follow proper medical procedures, and emergency alerts to the doctors and caregivers, if individuals’ health becomes sensitive. The alert generation on the fog subsystem enables the on-time efforts for taking precautionary measures. The compiled medical information and DeV classified data of each individual, along with the actual diagnose of DeV infection from hospital databases, are stored on the cloud subsystem. The cloud subsystem deploys Social Network Analysis (SNA) to analyze the geographical information of each registered individual to identify the infected and risk-prone areas, and its pinpointing on Google Maps, which aids the efficient controlling and prevention of DeV outbreak. The cloud subsystem also protects the privacy of the personal health-related information of the individuals using information granulation and secret sharing. The notations used throughout the paper are enlisted in Table 2. Each subsystem of the proposed system has been explained ahead.

Table 2 List of used notations

Full size table

Table 3 Personal, environmental and health attributes

Full size table

3.1 Data acquisition subsystem

This subsystem collects the data regarding individuals’ personal and health attributes from their mobile phone applications and the data regarding the surroundings from the environmental and mosquito sensors. Initially, the individuals register with the system by providing personal information via a mobile application on their phones. Then, the system generates a unique identification (uId) for each registered individual and acquires the DeV infection-related health data. The values of the personal attributes remain the same for most of the period and only changed when any attribute changes, whereas the attribute-values related to the individual’s health and environment change over time. The health data is acquired periodically through the registered individuals’ mobile phones in the form of “Yes” or “No”. For infected individuals, the data is acquired on an hourly basis; for recovered individuals, the data is acquired on alternative days for a fortnight (after which the recovered individual is declared uninfected), and for uninfected individuals, the data is acquired weekly. Table 3 depicts the personal, environmental, and health attributes, which are acquired by the system. The personal attributes are stored on secure locations to avoid any disclosure of the personal identity of the individual to any unauthorized stakeholder and to avoid any unnecessary discrimination and panic among the people. The information regarding the sites of mosquito density and mosquito breeding are captured using different site-located wireless mosquito sensors. These mosquito sensors identify and count the number of mosquitoes in a specific area. The attributes sensed by environmental sensors like air temperature, humidity and carbon dioxide around the standing water sites signify the favorable conditions for mosquitoes breeding. The higher humidity, temperature, and carbon dioxide values are the favorable conditions for mosquitoes’ breeding. So, these environmental attributes are continuously acquired and stored in the Cloud database for identifying the breeding sites.

3.2 Fog subsystem

This subsystem plays the role of a bridge between the Cloud and the data acquisition subsystems. It pre-processes the data accumulated from the data acquisition subsystem for classifying the individuals as possibly infected (In) or uninfected (Un) and subsequently generates the real-time diagnostic, suggestive, and emergency alerts to individuals and other concerned stakeholders. This classification diagnoses the possibly DeV infected individuals and suggests them for following proper medical procedures to confirm the incidence of the infection by consulting with the doctors and through recommended laboratory tests (if requires). At the same time, this layer alerts the caregivers or doctors, if the health sensitivity with respect to the various environmental events, surpasses the threshold. This subsystem comprises of two components: (I) Fog-based DeV Infection Diagnosis, and (II) Time-based alert generation. The Fog components are explained as follows.

3.2.1 Fog-based DeV infection diagnosis

This component utilizes a probabilistic classification model called Naive Bayesian Network (NBN) [45] for classifying the individuals based upon their health attributes as In or Un. In this model, X1, X2...Xn are considered as the conditionally independent predictor variables of a given class variable D. Since the entire set of DeV infection-related health attributes are independent of each other, the NBN classifies the individuals under the category of In or Un. The training dataset estimates the probability parameters using the maximum likelihood estimator. Figure 2 shows the graphical structure of the NBN for categorizing registered individuals.

We assume, D as a class variable for representing two categories: In and Un, based on the symptom vector SyV${}_{k}$ = (F, SH, PE, NA, JP, V, MP, SR, MB, AP). To classify an individual u${}_{k}$ $\in $ X; k = 1, 2, ..., n as In or Un based on his/her SyV${}_{k}$, the probability of each classified category is calculated using the Bayes’ rule as given in Eqs. 1 and 2.

$$\begin{aligned} P(D=(In/SyV_{k}))= & {} \frac{P(In)P(SyV_{k}/In)}{P(SyV_{k})} \end{aligned}$$

(1)

$$\begin{aligned} P(D=(Un/SyV_{k}))= & {} \frac{P(Un)P(SyV_{k}/Un)}{P(SyV_{k})} \end{aligned}$$

(2)

where P(D = In/SyV${}_{k}$) represents the probability of an individual classified as a possibly infected with DeV infection based on his/her symptom vector SyV${}_{k}$; P(D = Un/SyV${}_{k}$) represents the probability of an individual classified as an uninfected of DeV infection based on his/her symptom vector SyV${}_{k}$; P(In) and P(Un) represent the probabilities of having DeV infection, and P(SyV${}_{k}$) represents the probability of having SyV${}_{k}$ symptoms. The comparison of the probabilities of two categories determines the category of an individual. The category of the individual is decided from the fact that the class, which is having higher probability, is the category of the individual.

3.2.2 Time-based alert generation

This component is used to generate the diagnostic and suggestive alerts to the individuals based on their diagnosed category and emergency alerts to the doctors or caregivers based on the health sensitivity of infected individuals with respect to the environment so that timely action for individuals’ care can be taken. Fog-based DeV infection diagnosis component determines the health category of an individual as In (Unsafe state) or Un (Safe state). Mathematically, an individual’s state is denoted by S = $\{$D, E, $\Delta $T$\}$ where ‘D’ is the classified category of an individual, ‘E’ represents the environmental events which happens in ‘$\Delta $T’ time window. In another way, ‘E’ is represented as $\{$Humidity, Carbon Dioxide, Air Temperature, Breeding Sites, and Mosquito Density$\}$. Determination of an individual’s health category is important for the medical emergency alert generation to doctors or caregivers. If the individual is classified in the category In, then the occurrences of any undesired environmental event leads the individual’s health state to a more severe unsafe state. In this case, emergency alerts are generated based on the occurrence of the environmental events in $\Delta $T time-space. It is determined on the basis of the Environmental Event Index (EEI). The probability of EEI is calculated, as shown in Eq. 3.

$$\begin{aligned} P(EEI)= \delta \left( \frac{P(E)}{P(In)}\right) \end{aligned}$$

(3)

where P(E) is the probability for the occurrence of undesired environmental events, P(In) is the probability of an individual’s classified category as possibly infected, $\delta $ is the differentiation function, and P(EEI) is the probability of environmental event index.

P(EEI) determines the occurrence rate of sensitive environmental events with respect to the infection of an individual. P(EEI) helps in depicting the health sensitivity of a possibly infected individual with respect to the occurrence of environmental events. The step by step procedure for generating emergency alerts is depicted in Algorithm 1. In this algorithm, the health attributes are examined using NBN for an individual’s DeV infection category determination. If an individual’s DeV infection category is found as In, a checkpoint is created and saved on the fog subsystem. Then, IoT devices continuously transmit the data regarding surrounding environmental events during the infected period of the individual to temporary log events, at fog storage. These log values are examined for asserting the severity of the environmental events during the individual’s infection period by calculating P(EEI). P(EEI) helps in determining the individual’s health sensitiveness due to environmental factors E, based on predefined thresholds ($\beta $). If P(EEI) $>\beta $, the emergency alert is generated to the doctors or caregivers. After predefined time window $\Delta $T, the subsystem re-examines the individual’s health state. At any instant, if an individual is found in unsafe state, the P(EEI) is determined for $\Delta $T, and emergency alerts are generated to the doctors or caregivers along with the log information if P(EEI) surpasses its threshold. After $\Delta $T, the checkpoint is terminated, and the process of monitoring is repeated.

Based on the suggestive and emergency alerts, and in consultation with doctors, the actual confirmation of the DeV infection is determined at the hospitals’ laboratories. The possible diagnoses by the fog subsystem help the individuals in getting on-time medical aid. Subsequently, the information of confirmed DeV infected and uninfected individuals at hospitals and uninfected individuals analyzed at the Fog subsystem, along with generated alerts, is transmitted to the Cloud subsystem.

3.3 Cloud subsystem

The Cloud subsystem stores and processes the data, which was not possible to be processed at the fog subsystem due to computation and space constraints. This subsystem comprises of four components: (I) Cloud storage, (II) Information Protection, (III) GPS-based risk assessment, and (IV) Health communication. The detailed explanation of each component is as follows.

3.3.1 Cloud storage

This component stores each individual’s compiled medical information, fog analyzed diagnosed DeV infection category, generated alerts (accumulated from fog subsystem), and the actual diagnosis of the DeV infection of the individuals (accumulated from hospital databases), for further analysis at the cloud subsystem. Based on the data received from the fog subsystem and hospital databases, this component updates the actual DeV infection category of the individuals. Cloud storage enables the secure sharing of personal information of individuals among authorized stakeholders like individuals, hospitals, government agencies, and healthcare professionals. The visiting locations related information of the individuals is also stored there and is used to create the SNA graph and synthesize various useful analytics related to the DeV outbreak analysis.

3.3.2 Information protection

The data of an individual viz. personal and health attributes and environmental data are stored at the Cloud storage. This sensitive information needs to be shared with authorized stakeholders only. Any unintentional disclosure of information to any unauthorized person can result in a distress situation among individuals. Hence, a two-stage information protection mechanism is proposed in this system: (I) information granulation [55], and (II) secret sharing scheme [38]. In the first stage, the individual’s data is divided into three different data tables, i.e., L-I, L-II, and L-III. In L-I, the table contains the utmost confidential data, which can identify an individual precisely, i.e., name, age, sex, residential address, and mobile number. In L-II, the table contains medium-level confidential information, which can identify a DeV outbreak affected location. It includes environmental attributes: mosquitoes density and its site’s location, temperature, humidity, carbon dioxide. In L-III, the table contains the least sensitive information, which cannot create any panic or distress on its unauthorized access. It includes health attributes like fever, skin rashes, headache, bleeding, muscle pain, joint pain, etc. If someone access L-II or L-III information in an unauthorized way, the system still preserves the sensitive information because the precise identity of the individual still cannot be ascertained. To identify the individual and his/her location, the only key is to access all three fragments and correlate them. That is why the acquired information is fragmented into appropriate data fragments in the proposed system. Figure 3 represents the proposed data granulation. A DeV infection data table is defined as DeV${}_{DT}$ = (w, at), where w depicts the registered individuals’ count and at depicts the set of personal, environmental, and health attributes of each individual. The data table is fragmented into L-I, L-II, and L-III, i.e., DeV${}_{DT}$ = (w, L-I U L-II U L-III). These fragments are combined by using a unique identification number. After granulation, mechanism of secret sharing is applied to split the value of highly sensitive attributes of L-I (personal attributes) into secret shares. These secret shares are kept on secure servers so that no one can access information of L-I. If anyone is able to collect the information of all three fragments, the exact identity of the individual still can not be retrieved due to the generation of secret shares for L-I information.

3.3.3 GPS-based risk assessment

Female Aedes albopictis or Aedes agypti mosquitoes’ biting are the main sources of DeV transmission in the human blood. This virus replicates in human blood and transmitted to other uninfected mosquitoes on bitting DeV infected human beings. These infected mosquitoes can lay infectious eggs and further transmit this virus. Because of the limited flight of the mosquitoes, they cannot fly to farther locations, but by transmitting the infection into humans, wide-area spread of DeV happens through infected humans traveling to various and far away locations by further transmitting the DeV into uninfected mosquitoes. Hence, the Social Network Analysis (SNA) can be a significant approach to analyze and represent the geographical risk of DeV infection by analyzing the location-related information of the individuals and synthesizing various useful insights.

In this component, SNA has been used to analyze the registered individuals’ movement-related geographical information through their mobile phone-based GPS for identifying the infected and risk-prone regions, and its pinpointing on Google Maps. SNA also helps in synthesizing various useful insights like Epidemic Susceptance Index (ESI) of each individual and region, and Pandemic Susceptance Index (PSI) of each region to ascertain the severity of the DeV outbreak. Subsequently, SNA helps in providing preventive measures for controlling its outbreak. For ascertaining the behavior of DeV infection outbreak within the region or from one region to another region, following scenarios have been considered.

Scenario 1 An individual is infected in a region x, if the mosquitoes of the region x bite an uninfected individual, who is: (I) resident of the region x and present in region x, or (II) resident of the region y but visited region x.

Scenario 2 A Mosquito is infected in a region x, if the susceptible mosquito bites an infected individual, who is: (I) resident of the region x and present in region x, or (II) resident of the region y but visited region x.

By considering the above-discussed scenarios as the basis, various useful analytics can be synthesized if the global SNA graph is created efficiently. Therefore, Algorithm 2 is used for the systematic creation of the global SNA graph using GPS-based geographical location of individuals through their phones and DeV infection status from cloud storage. Here, in the SNA graph, three-node (residence, workplace, and user) for an individual is created and colored according to the proposed color scheme, as shown in Fig. 4a. The permanent locations (i.e., residence and workplace) and visiting locations are traced and labeled as P and V respectively. The edges are drawn from the user node to all his/her visiting locations and colored according to the color of the user node. The population percentage of infected P and infected V nodes is calculated in each region to identify them as infected or risk-prone regions using Fig. 4b, c and Table 4.

Table 4 Population-based risk level color scheme

Full size table

Once the risk level of various regions is identified on the SNA graph, several metrics can be synthesized and the outbreak of infection can be checked, and the individuals in those regions (like residents or visitors) can be alerted in real-time by sending warning alert messages. Also, various precautionary measures and suggestions can be provided to them to control the infection rate and transmission process. The various significant metrics that can be drawn from the SNA graph are explained as follow.

I.
Metric 1 Epidemic Susceptance Index (ESI).

ESI determines the individual’s or region’s probability to spread or contract DeV infection based on the SNA graph.
1. (a)
  ESI${}_{InR}$ (for Infected Region)
  
  It states the ratio of the number of infected visitors and infected residents in an infected region to the total number of visitors and residents in that area, as shown in Eq. 4.
  $$\begin{aligned} ESI{}_{InR} = \frac{\text {Number of infected visitors and infected residents in an infected region}}{\text {Total number of visitors and residents in that region}} \end{aligned}$$
  (4)
2. (b)
  ESI${}_{InI}$ (for Infected Individual)
  
  It states the ratio of the number of infected regions visited by the infected individual to the total number of regions visited by the same individual, as shown in Eq. 5.
  $$\begin{aligned} ESI{}_{InI}=\frac{\text {Number of infected regions visited by an infected individual}}{\text {Total number of visits by the same individual}} \end{aligned}$$
  (5)
3. (c)
  ESI${}_{RpR}$ (for Risk Prone Region)
  
  It states the ratio of the number of infected individuals’ visits in a risk-prone region to the total number of individuals’ visits in that region, as shown in Eq. 6.
  $$\begin{aligned} ESI{}_{RpR}=\frac{\text {Number of infected individuals' visits in a risk-prone region}}{\text {Total number of visits in that region}} \end{aligned}$$
  (6)
4. (d)
  ESI${}_{UnI}$ (for Uninfected Individual)
  
  It states the ratio of the number of infected and risk-prone regions visited by the uninfected individual to the total number of regions visited by the same individual, as shown in Eq. 7.
  $$\begin{aligned} ESI{}_{UnI}=\frac{\text {Number of infected and risk-prone regions visited by an uninfected individual}}{\text {Total number of regions visited by the same individual}} \end{aligned}$$
  (7)
If ESI${}_{InR}$ of an infected region is high, it means many infected individuals have visited or/and infection among residents is high in that region. So, visitors are required to be warned to refrain their visits to that region, and residents are required to be warned to have immediate medical checkups for determining their health status and required precautions. At the same time, nearby hospitals are also needed to be alerted about the high ESI${}_{InR}$ of that region. If ESI${}_{InI}$ of an infected person is very high, it means that the person is highly infected because of his/her visits to many infected regions. That person is required to be quarantined for proper treatment and restricted to travel, for controlling DeV spread. Hence, the alerts are required to be generated to that person and hospitals near his/her residence for his/her proper treatment. If ESI${}_{RpR}$ of a risk-prone region is high, it means many infected visitors are visiting that region. So, that region is very susceptible to spread DeV infection to its residents and future visitors. So, people with low immunity are required to be warned to refrain their visits to such regions, and at the same time, residents of those regions are required to be warned to take necessary precautions for avoiding the contraction of DeV infection. If ESI${}_{UnI}$ of an uninfected individual is high, it means that a person is visiting many infected locations and risk-prone regions. His/her susceptance is very high to contract DeV infection. So, warning alerts are required to be generated to that individual for precautions and immediate checkup at the hospital.
II.
Metric 2 Pandemic Susceptance Index (PSI).

PSI determines how fast a particular region is spreading the DeV infection over time to the other regions. The spread of DeV infection from a particular region can be analyzed by creating a tree, based on the catching of infection by the neighbor regions or neighbors of neighbor regions or, like so, on the temporal scale. The number of nodes in a temporal infection tree depicts the pandemic spread of infection to that extent (see Algorithm 3).

The prediction of the DeV infection outbreak can control its pandemic nature and enable early warning alerts to the people who are living in those regions and to the healthcare agencies for taking effective preventive measures in its early stage. The prediction uses the following methods to predict the dengue cases, which are explained as follows.

(a)
Autoregressive Method Autoregressive prediction method forecasts the dengue cases by using past dengue cases of the given region. Dengue cases in month ‘t’ are computed as shown in Eq. 8.
$$\begin{aligned} Y{}_{t}=\emptyset {}_{0} + \emptyset {}_{t-1}Y{}_{t-1} + \emptyset {}_{t-2}Y{}_{t-2} + \emptyset {}_{t-3}Y{}_{t-3} \ldots \ldots \ldots \ldots \ldots \emptyset {}_{t-p}Y{}_{t-p} \end{aligned}$$
(8)
where ${\emptyset }_0$ is the constant number of dengue cases, $Y_t$ is the number of dengue cases at time ‘t’ , $Y_{t-1}$ is the number of dengue cases at time ‘t-1’ , $Y_{t-p}$ depicts the number of dengue cases at time ‘t-p’. ${\emptyset }_{t-1}$, ${\emptyset }_{t-2}$, ${\emptyset }_{t-3}$........${\emptyset }_{t-p}$ are the autoregressive parameters in last ‘p’ months. This forecasting helps in determining the dengue cases based on the trend of past months’ dengue cases.
(b)
Weather Prediction Method The dengue infection is profoundly impacted due to the weather conditions as the population of dengue mosquitoes grow in particular weather conditions. The climatic variables, such as temperature, rainfall and humidity, impacts the incidence of dengue cases. Here, we consider only two climate variables, i.e., temperature and rainfall, as these are the strong predictors for dengue prediction. Based on the climate lag [10], the incidence of dengue appears after some days, weeks or months, of exposure to climatic conditions. Dengue can occur several days after the favorable weather conditions. The impact of monthly mean temperature on a number of dengue cases is used to predict dengue incidence as shown in Eq. 9.
$$\begin{aligned} C{}_{Temp}= \emptyset {}_{0} + \sum ^{t=l}_{m=t-(l+p)}{\emptyset {}_{mn}Temp{}_{mn}} \end{aligned}$$
(9)
where ${\emptyset }_0$ is the constant number of dengue cases. ${\emptyset }_{mn}$ is the parameter of mean temperature at lag term m in n mean temperature range. $m = t - (l + p)$ depicts the start of the lag period, t depicts month, l depicts lag term, p depicts the monthly mean temperature’s data cycle period. The determination of $C_{Temp}$ helps in asserting the number of dengue cases well before its incidence based on the climate lag effect using monthly mean temperature. Similarly, using monthly cumulative rainfall, the number of dengue cases can be asserted as shown in Eq. 10.
$$\begin{aligned} C{}_{Rain}= \gamma {}_{0} + \sum ^{t=l}_{a=t-(L+q)}{\gamma {}_{ab}Rain{}_{ab}} \end{aligned}$$
(10)
where ${\gamma }_0$ depicts the constant number of dengue cases. ${\gamma }_{ab}$ depicts the parameter of cumulative rainfall at lag term a in b cumulative temperature range. $a = t - (L + q)$ depicts the start of the lag period, t depicts month, L depicts lag term, q depicts the monthly cumulative rainfall’s data cycle period.

From these two metrics, the prediction of dengue cases using the influence of temperature and rainfall can be computed a much time ahead. A weather-based prediction in dengue early warning system can help in local vector surveillance and control in many ways. Firstly, it can enhance the effectiveness in efforts to check the scale of the DeV infection outbreak. This will result in decreasing the disease transmission, avert possible mortality, and subsequently lower the burden and operating cost of healthcare agencies. Secondly, it can use the publicly available weather variables without investing in weather-based predictive methods. Thirdly, it can allow the vector control units to focus more deliberately and sensibly during high-risk periods.

The identified infected and risk-prone regions based on the population of infected residents (Red P) and infected visitors (Red V) in the SNA graph, reflects the current global DeV outbreak. Apart from this, the areas where there is a high probability of occurrence of dengue cases in the future are also considered as high risk-prone areas. Based on the assessed risk level of DeV infection, the regions are mapped and updated on Google Maps, as shown in Algorithm 4. It helps the proposed system in generating real-time warning alerts to the individuals residing and visiting in various regions based on the respective risk levels and providing suggestions for precautions and appropriate treatments. At the same time, the nearby hospitals, health professionals, and other government agencies are also get alerted to deal with the global situation and control the DeV outbreak.

Once the up to date and timely information about risk-prone and infected regions are available online on Google Maps web service, it helps the healthcare agencies, government agencies, and residents in preventing the spread of the DeV outbreak, based on the generated warning alerts, as shown in Algorithm 5.

Table 5 Advices and precautions

Full size table

3.3.4 Health communication

The health communication component sends the system generated alerts and health awareness tips to the registered individuals. Instant messaging (IM) or email service is used for this communication. Alert messages and health awareness suggestions are sent repetitively to individuals. It aids in enforcing preventive measures awareness and reduction in the DeV infection risk. Alerts are also generated to the government healthcare agencies as well as nearby hospitals based on the patient’s GPS location. It helps in providing first aid and organizes the required free medical camps to control the DeV outbreak. The advices and cautionary measures for individuals have been represented in Table 5.

4 Experimental setup

The Experimental setup of the proposed system has been divided into six subsections viz. data generation, Fog-based DeV infection diagnosis evaluation, alert generation evaluation, dengue cases prediction evaluation, information protection security analysis, and GPS-based risk assessment evaluation.

4.1 Data generation

The experiment for the proposed system had required DeV infection-related symptoms-based health data. However, the thorough internet search and even the consultation with different hospitals did not find any symptoms-based dengue dataset for evaluating the proposed system. So, the dataset had been systematically generated in collaboration with Dr. Arvind Manchanda. He is an M.D. in medicine and working in Punjab government health services. Based on the health records of 177 patients, checked for DeV infection [9], a dataset had been generated in collaboration with the doctor. The generated dataset had been further consulted with the community health professionals to validate its closeness with the real data. The generated dataset considers all possible combinations of symptoms, having ten attributes with value as either Yes or No to diagnose any individual for possible DeV infection. A few possible combinations of 10 dengue-related health attributes have been shown in Table 6. The overall permutation of these cases had been resulted in 2${}^{ 10}{}^{ }$ = 1024 combinations. The probability of the occurrence of a particular combination of all symptoms is unique in the generated dataset. So, the symptoms have been categorized into three groups based on their occurrence in DeV infected cases: (I) Most Probable Symptoms i.e., fever, (II) Probable Symptoms viz. severe headache, vomiting, pain behind eyes, severe abdominal pain, and skin rash, and (III) Least Probable Symptoms viz. joint pain, muscle pain, soft bleeding, and nausea. The combined occurrence of these symptoms in DeV infected cases has been shown in Table 7.

Table 6 Sample of different combinations of dengue-related health attributes

Full size table

Table 7 Occurrence-based symptoms combinations

Full size table

For the systematic generation of the environmental dataset, an IoT-based scenario had been created in Java-based simulator CupCarbon U-one 3.8.2 [12]. The created IoT scenario is based on Pathankot, the ninth most populated city of the Punjab province in India. A total of 1240 sensors had been deployed on various locations to capture the environmental attributes such as the location of breeding and mosquito dense sites, humidity, carbon dioxide, and the temperature. These sensors had been programmed with senscript (scripting language for CupCarbon) for generating random values of environmental attributes. The environmental dataset had been acquired from these sensors, which comprises the Sensor Location, Sensor ID, and respective attribute-value. The proposed system uses Algorithm 6 to map all possible combinations of DeV infection-related health records with the environmental dataset.

4.2 Fog-based DeV infection diagnosis evaluation

The timely diagnosis of the DeV infection is critical in saving the lives of individuals and real-time depiction of infection outbreak. Hence, the proposed system has deployed Fog-assisted NBN-based probabilistic model for classifying the individuals as In or Un. The Bayesian network has been created using R-Studio’s bnlearn package [39] with 1024 individuals’ health data, using different learning algorithms viz. incremental association (iamb), grow-shringk (gs), interleaved incremental association (inter.iamb), fast incremental association (fast.iamb) and hill climbing (hc), and evaluated based on the parameters viz. number of nodes, number of directed arcs, number of undirected arcs, average branching factor, average neighborhood size, penalization coefficient and number of tests used in the learning procedure. The evaluation of the created networks through these algorithms has been compared and shown in Table 8, and found the performance of ’hc’ algorithm better as compared to the others because of its higher number of directed arcs. Therefore, the fog-based DeV infection diagnosis has used the ’hc’ algorithm for training the Bayesian network.

Table 8 Performance of different learning algorithms for NBN

Full size table

Table 9 10-Fold cross validation of employed classification approaches

Full size table

The current evaluation has further employed Weka 3.6 for testing the trained NBN [54] with 1024 health records. The evaluation has also compared the classification performance of NBN with other state of the art classification approaches viz. Neural Network (NN), Fuzzy System (FS), and Bayesian Belief Network (BBN), as shown in Fig. 5. For NN, the multilayer perceptron function of the weka has been employed with two hidden layers, learning rate of 0.3 and momentum of 0.2, whereas the FS has been implemented with R-Studio’s sets package, and the BBN with R-Studio’s bnclassify package. The evaluation has included a set of statistical performance measures [16, 20] viz. accuracy, sensitivity, precision, specificity and F-measure, for evaluating the classification performance of the employed classifiers. Figure 5a depicts the classification accuracy of all four employed classifiers, where NBN has the best classification accuracy throughout the process, whereas the Fig. 5b represents the different statistical measures viz. sensitivity, precision, specificity, and F-measure of the employed classification approaches. The obtained results in Fig. 5b are the average performance outcomes from the ten folds of the 10-fold cross validation method, as shown in Table 9. The performance analysis from Fig. 5b and Table 9, has found that the NBN has performed better in all measures as compared to the other classifiers. It has been noticed that the performance of NBN yielded better results than other classifiers in every parameter and hence, found satisfactory for employing in the proposed system.

The execution (or classification) time of all four employed classifiers for their training and testing phases, have been shown in Fig. 6a, b. From the analysis of their performances in terms of execution time, it has been well-depicted that NBN has taken the overall least time in classifying the DeV infection status of the individuals in both phases, as compared to the other employed classifiers.

The evaluation has also presented the detailed class-wise performance analysis of the NBN-based classification in Table 10. The results depict the higher accuracy and lower false classification rate of NBN in classifying the DeV infected individuals. The results depict that the overall classification accuracy of NBN is 98.24%. and weighted average Sensitivity (True Positive rate) of NBN is 0.940. These higher value of accuracy and sensitivity generates higher precision, which is 0.898 and enable the system to minimize error rate [15]. Along with this, the overall higher value of specificity i.e. 0.951 leads the NBN in generating less classification errors. Similarly, the higher values of F-Measure (0.977) and ROC (0.853) indicates the accuracy of the NBN-based classification [5]. Hence, it is safe to state that the level of performance conceived through above-mentioned parameters justifies the utility of NBN in fog-based DeV infection classification in the proposed system.

Table 10 Detailed class-wise accuracy of Naive Bayesian network

Full size table

4.3 Alert generation evaluation

The proposed fog-based alert generation provides timely information about the DeV infection diagnosis to the concerned stakeholders. The response time and the reliability of the generated alerts viz. sensitivity, precision and specificity, are considered as the critical parameters for estimating the efficiency of the alert generation [15]. That’s why, the proposed fog-based alert generation has been systematic examined for evaluating its effectiveness in providing accurate and on-time delivery of the generated alerts to the concerned stakeholders.

Figure 7 depicts the architectural details of the evaluating model, where, the proposed fog-based alert generation has been evaluated on a smartphone with 4 GB memory and snapdragon 636 1.8 Ghz Octa-core processor, and compared with a system having same alert generation functionality (with same NBN-based DeV diagnosis), but at EC2-based Cloud instance and having no provision of fog computing. Both the systems have used the same set of health data samples ranges 100 to 1024 in size. The both systems have considered the time taken from the occurrence of an event (diagnose of DeV infection status or determination of individual’s severe condition) to the instance on which the alert about the event occurrence delivered to the individual, as the response time (or delay). Whereas, the reliability of the alerts has been indicated in terms of the ability of the DeV infection diagnosing process in determining the true alerts.

Table 11 depicts the results of various parameters viz. minimum delay, maximum delay, average delay, standard deviation in delay, sensitivity, specificity, coverage, precision, root average square error, mean absolute error, root relative squared error and relative absolute error for comparing the alert generation performance of the proposed and Cloud-based system. The results has indicated that the proposed system has far better alert generation functionality with taking almost half time on average for delivering alerts, as compared to the cloud-based system. On the reliability end, the result evaluation has found that the higher values of sensitivity and precision has helped in lowering the false classification rate and minimizing the error rate

The availability of computation resources in the proximity of individuals, avoidance of any unnecessary network communication delay to cloud subsystem, and availability of the low latency and higher bandwidth in the fog-subsystem, as provided by the proposed system, has enabled the immediate alert generation from the edge of the network of the individuals and tended the reduced data volume transmission over the network for classification and subsequently led to lesser congestion and chances of error. On the other side, the higher value of sensitivity, precision and specificity has indicated the reliability of the generated alerts . The result comparison acknowledges the utility of the fog-based alert generation with improved average response time (or delay), better statistical measures, lesser error rates, and lower false-positive alerts.

Table 11 Comparative analysis of alert generation component

Full size table

4.4 Dengue cases prediction evaluation

A prediction of impeding epidemics helps in enhancing the effectiveness of preventive measures against a disease. Hence, the presented system has employed autoregressive method-based and a weather-based approach for prediction of dengue cases. For the evaluation of dengue cases prediction, monthly data of mean temperature and cumulative rainfall and data about monthly dengue cases from 2011 to 2017 have been obtained from [13, 42] and the local civil hospital of the Gurdaspur district (of which, Pathankot city was a sub division), respectively. The past cases of infectious diseases influence the current and future cases. Hence, autoregression had been employed in R-studio to investigate the serial relationship between already occurred cases and currently occurring cases. The possible lag time using Autocorrelation function (ACF), Partial Autocorrelation Function (PACF) and prior information about cases, in R-Studio, had depicted that the spikes were gradually decreasing in ACF analysis and cut off after 2nd spike in PACF analysis, suggesting a lag of 2 months. This was a strong indication of correlation between already occurred cases and currently occurring cases.

The data pattern depicted that the maximum rainfall occurred in the month of July and August, whereas the maximum mean temperature in May and minimum in January, of every year, and the Dengue incidence was mainly higher during September–December, of every year. The analysis of the cross correlation had depicted the symmetrical sine wave around the zero line (between dengue cases and temperature) with 6 months per cycle. This symmetrical pattern is the evidence of stable and consistent relationship between the incidence of dengue and mean temperature. On the other side, the cross correlation between the dengue cases and cumulative rainfall had depicted asymmetric pattern with less consistent time cycles.

The analysis of the data had depicted that the dengue cases with 2 months serial relationship fits appropriate in the model. Further, the analysis of the model performance depicted that the weather time cycle of 5–6 months at the lag of 4 months performed with consistency, as compared to other time cycles and lag terms. The selected model had exhibited high prediction precision, consistent performance and lowest standardized root mean square error (SRMSE). The SRMSE of the selected model were 0.293 and 0.31 for period of 2011–2016 and 2017 respectively. The included weather time cycles for temperature and rainfall were 6 months and 5 months respectively in the selected model. Based on these parameter setting, the model has been fitted to predict the number of dengue cases from 2011 to 2017 and compared against the actual number of cases in that period, as shown in Fig. 8. The R${}^{2}$ of 0.853 depicts that the selected model explains the 85.3% variance in the monthly dengue cases. The time series of predicted cases against the actual cases exhibits a good performance of the selected model.

4.5 Information protection security analysis

In the information protection component, the concept of granulation by dividing information into three-level: L-I, L-II, and L-III (as shown in Fig. 3) has been used to protect the sensitive or personal information of the registered individuals. Furthermore, the personal information present at L-I has been protected using secret sharing concept. The secret shares derived from L-I have also been stored at the secure servers. Hence, the proposed system protects the sensitive information of the individuals.

4.6 GPS-based risk assessment evaluation

The GPS-based risk assessment in the proposed system has been visualized over the ninth population-wise largest city of Punjab province in India i.e., Pathankot. The generated data of 1024 individuals’ health records and 310 mosquito dense and mosquito breeding sites of the mapped area (a specific area of Pathankot) have been maintained into three different .csv formatted files, and utilized by the Netlogo [49] for creating the global SNA graph of DeV infection. There after, a Google API has been used to represent the SNA determined DeV infection risk on Google Maps, as shown in Fig. 9. The different colored hexagonal-shaped regions have depicted different DeV infection risk levels, as per mentioned in Table 4, where non-colored regions means there’s no infected resident and infected visitor in that region, hence having no DeV infection risk in that non-colored region.

Figure 9a, b depicts the tracing of an individual’s taken path without any knowledge of DeV infection risk on the underlying path, from a specific location to another specific location. Figure 9a depicts the tracing of the path followed, while Fig. 9b depicts the same tracing if it is represented on the risk-visualized map. The risk of catching the DeV infection has increased in this taken path as the path lies in the infected regions. However, once the risk-assessment is visualized on the map, it guides the individuals to re-route their path through safer regions. The same has happened in Fig. 9c, d, where, now the same individual has re-routed his path (which starts from the same source location to the destination location) because of the DeV infection risk visualization on map. In the proposed approach, an individual is free to opt any route, but because of the DeV infection risk visualization on map, an individual in Fig. 9c, d has taken an alternative route (suggested by Google map based on the distance in order of best to worst), so the route could be passed through safer regions in most of its distance. Here, the proposed approach has only visualized the DeV infection risk on the map, whereas the Google Maps has only suggested him the second best route, and the individual is person, who has taken that route, so he could travel to alternative route which has lesser risk of DeV infection and at the same time having shorter distance.

Figure 9c depicts the tracing of the safer taken path without risk assessment representation, while Fig. 9d depicts the same tracing on the risk map. The blue line in Fig. 9b traces the normal path, while the red line in Fig. 9d traces the re-routed safer path using Google Maps services. The GPS-based risk assessment visualizes the DeV infection risk in various regions in such a manner that one can plan his re-routed path in such a manner that the visits through infected areas can be avoided to much extent and exposure to infection can be minimized.

5 Conclusion

A fog-based intelligent healthcare system has been proposed in this paper, which diagnoses the possible DeV infection of the individuals using Naive Bayesian Network and generates real-time diagnostic, suggestive, and emergency alerts to the concerned stakeholders (individuals, government agencies and health organizations). The proposed system aware and suggests the individuals diagnosed with possible DeV infection to medically confirm the incidence of the infection by consulting with the doctors and through proper recommended laboratory tests. The proposed system has utilized the environment event index (EEI) to ascertain the health sensitivity of the possibly infected individual with respect to the occurrence of undesired environmental events, and generate emergency alerts to the doctors or caregivers for taking timely remedial actions. The proposed system has also pinpointed the DeV infected and risk-prone areas on Google Maps using SNA and provided efficient warning alert system for the visitors or residents in those areas. The system helps in preventing the further spread of DeV by alerting uninfected individuals and government healthcare agencies and aids in effective and precautionary control of the infection. The experimental evaluation of the proposed system has acknowledged the classification efficiency of NBN viz. highest sensitivity, precision, recall, and F-measure, as compared to the other employed classifiers, and utility of NBN in achieving better execution time. The evaluation has also acknowledged the utility of fog computing through the real-time functionality of the alert generation. The evaluation of the GPS-based risk assessment has depicted the efficiency of the proposed SNA-based warning alert system. The proposed system has a significant future scope in the domain of technology-based healthcare research, where common symptoms based disease monitoring platform can be developed using the current proposed system.

References

Abbas N, Asim M, Tariq N, Baker T, Abbas S (2019) A mechanism for securing IoT-enabled applications at the fog layer. J Sens Actuator Netw 8(1):16–33
Article Google Scholar
Abdel-Basset M, Gamal A, Manogaran G, Long HV et al (2019) A novel group decision making model based on neutrosophic sets for heart disease diagnosis. Multimedia Tools Appl 79:1–26
Google Scholar
Abdel-Basset M, Manogaran G, Gamal A, Chang V (2019) A novel intelligent medical decision support model based on soft computing and IoT. IEEE Internet of Things J 7(5):4160–4170
Article Google Scholar
Baker T, Asim M, MacDermott Á, Iqbal F, Kamoun F, Shah B, Alfandi O, Hammoudeh M (2019) A secure fog-based platform for Scada-based IoT critical infrastructure. Softw Pract Exp. https://doi.org/10.1002/spe.2688
Article Google Scholar
Baldi P, Brunak S, Chauvin Y, Andersen CA, Nielsen H (2000) Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16(5):412–424
Article Google Scholar
CDCP: Dengue. http://www.cdc.gov/dengue/. Accessed 24 Dec 2019
Chang AY, Parrales ME, Jimenez J, Sobieszczyk ME, Hammer SM, Copenhaver DJ, Kulkarni RP (2009) Combining google earth and gis mapping technologies in a dengue surveillance system for developing countries. Int J Health Geogr 8(1):1–11
Article Google Scholar
Chauhan V, Patel M, Tanwar S, Tyagi S, Kumar N (2020) Iot enabled real-time urban transport management system. Comput Electr Eng 86:106746
Article Google Scholar
Chien TW, Chow JC, Chou W (2019) An app detecting dengue fever in children: using sequencing symptom patterns for a web-based assessment. JMIR mHealth uHealth 7(5):e11461
Article Google Scholar
Chowell G, Sanchez F (2002) Climate-based descriptive models of dengue fever: the, (2002) epidemic in Colima, Mexico. J Environ Health 68(10):40–44
Google Scholar
Chung K, Yoo H, Choe DE (2018) Ambient context-based modeling for health risk assessment using deep neural network. J Ambient Intell Humaniz Comput 11:1–9
Google Scholar
CupCarbon: Cupcarbon u-one 3.8. http://www.cupcarbon.com/. Accessed 24 Dec 2019
Data, statistics, analysis, & visualization. https://knoema.com/
Dautov R, Distefano S, Buyya R (2019) Hierarchical data fusion for smart healthcare. J Big Data 6(1):19
Article Google Scholar
Devarajan M, Subramaniyaswamy V, Vijayakumar V, Ravi L (2019) Fog-assisted personalized healthcare-support system for remote patients with diabetes. J Ambient Intell Humaniz Comput 10(10):3747–3760
Article Google Scholar
Hassan MK, El Desouky AI, Badawy MM, Sarhan AM, Elhoseny M, Gunasekaran M (2019) Eot-driven hybrid ambient assisted living framework with Naïve Bayes-firefly algorithm. Neural Comput Appl 31(5):1275–1300
Article Google Scholar
Hsu TC, Yang H, Chung YC, Hsu CH (2018) A creative IoT agriculture platform for cloud fog computing. Sustain Comput Inf Syst. https://doi.org/10.1016/j.suscom.2018.10.006
Article Google Scholar
Huang SY, Lee K, Wang L, Liu JW, Hung SC, Chen CC, Chang TY, Huang WC (2014) Use of simple clinical and laboratory predictors to differentiate influenza from dengue and other febrile illnesses in the emergency room. BMC Infect Dis. https://doi.org/10.1186/s12879-014-0623-z
Article Google Scholar
Kaur P, Kumar R, Kumar M (2019) A healthcare monitoring system using random forest and internet of things (IoT). Multimedia Tools Appl 78(14):19905–19916
Article Google Scholar
Lakshmanaprabu S, Mohanty SN, Krishnamoorthy S, Uthayakumar J, Shankar K et al (2019) Online clinical decision support system using optimal deep neural networks. Appl Soft Comput 81:105487
Article Google Scholar
Latif G, Shankar A, Alghazo JM, Kalyanasundaram V, Boopathi C, Jaffar MA (2019) I-CARES: advancing health diagnosis and medication through IoT. Wirel Netw 26:1–15
Google Scholar
Liu Y, Zhang L, Yang Y, Zhou L, Ren L, Wang F, Liu R, Pang Z, Deen MJ (2019) A novel cloud-based framework for the elderly healthcare services using digital twin. IEEE Access 7:49088–49101
Article Google Scholar
Lopez LF, Amaku M, Coutinho FAB, Quam M, Burattini MN, Struchiner CJ, Wilder-Smith A, Massad E (2016) Modeling importations and exportations of infectious diseases via travelers. Bull Math Biol 78(2):185–209
Article MathSciNet MATH Google Scholar
Machado-Machado EA (2012) Empirical mapping of suitability to dengue fever in Mexico using species distribution modeling. Appl Geogr 33:82–93
Article Google Scholar
Majumdar A, Debnath T, Sood SK, Baishnab KL (2018) Kyasanur forest disease classification framework using novel extremal optimization tuned neural network in fog computing environment. J Med Syst 42(10):187
Article Google Scholar
Missier P, Romanovsky A, Miu T, Pal A, Daniilakis M, Garcia A, Cedrim D, da Silva Sousa L (2016) Tracking dengue epidemics using twitter content classification and topic modelling. In: International conference on web engineering. Springer, pp 80–92
Neelam S, Sood SK (2020) A scientometric review of global research on smart disaster management. IEEE Trans Eng Manag. https://doi.org/10.1109/TEM.2020.2972288
Article Google Scholar
Pace P, Aloi G, Gravina R, Caliciuri G, Fortino G, Liotta A (2018) An edge-based architecture to support efficient applications for healthcare industry 4.0. IEEE Trans Ind Inf 15(1):481–489
Article Google Scholar
Pravin A, Jacob TP, Nagarajan G (2019) An intelligent and secure healthcare framework for the prediction and prevention of dengue virus outbreak using fog computing. Health Technol 10:1–9
Google Scholar
Raghav R, Dhavachelvan P (2019) Bigdata fog based cyber physical system for classifying, identifying and prevention of sars disease. J Intell Fuzzy Syst 36(5):4361–4373
Article Google Scholar
Rao VSH, Kumar MN (2012) A new intelligence-based approach for computer-aided diagnosis of dengue fever. IEEE Trans Inf Technol Biomed 16(1):112–118
Article MathSciNet Google Scholar
Reiner RC Jr, Stoddard ST, Scott TW (2014) Socially structured human movement shapes dengue transmission despite the diffusive effect of mosquito dispersal. Epidemics 6:30–36
Article Google Scholar
Sahil, Sood SK (2019) Smart vehicular traffic managemen: an edge cloud centric IoT based framework. Internet of Things https://doi.org/10.1016/j.iot.2019.100140
Sahil, Sood SK (2020) Bibliometric monitoring of research performance in ICT-based disaster management literature. Qual Quant. https://doi.org/10.1007/s11135-020-00991-x
Sahil, Sood SK (2020) Fog-cloud centric IoT-based cyber physical framework for panic oriented disaster evacuation in smart cities. Earth Sci Inf. https://doi.org/10.1007/s12145-020-00481-6
Sandhu R, Sood SK, Kaur G (2016) An intelligent system for predicting and preventing MERS-CoV infection outbreak. J Supercomput 72(8):3033–3056
Article Google Scholar
Sandhu R, Kaur J, Thapar V (2018) An effective framework for finding similar cases of dengue from audio and text data using domain thesaurus and case base reasoning. Enterpr Inf Syst 12(2):155–172
Article Google Scholar
Sareen S, Sood SK, Gupta SK (2016) Towards the design of a secure data outsourcing using fragmentation and secret sharing scheme. Inf Secur J Glob Perspect 25(1–3):39–53
Article Google Scholar
Scutari M, Ness R (2019) bnlearn: Bayesian network structure learning, parameter learning and inference. https://www.bnlearn.com/documentation/bnlearn-manual.pdf. Accessed 24 Dec
Singh S, Bansal A, Sandhu R, Sidhu J (2018) Fog computing and IoT based healthcare support service for dengue fever. Int J Pervasive Comput Commun. https://doi.org/10.1108/IJPCC-D-18-00012
Article Google Scholar
Sirisena P, Noordeen F (2014) Evolution of dengue in Sri Lanka-changes in the virus, vector, and climate. Int J Infect Dis 19:6–12
Article Google Scholar
Socio-economic statistical information about India. https://www.indiastat.com/
Sousa L, de Mello R, Cedrim D, Garcia A, Missier P, Uchôa A, Oliveira A, Romanovsky A (2018) Vazadengue: an information system for preventing and combating Mosquito-Borne diseases with social networks. Inf Syst 75:26–42
Article Google Scholar
Stoddard ST, Forshey BM, Morrison AC, Paz-Soldan VA, Vazquez-Prokopec GM, Astete H, Reiner RC, Vilcarromero S, Elder JP, Halsey ES et al (2013) House-to-house human movement drives dengue virus transmission. Proc Nat Acad Sci 110(3):994–999
Article Google Scholar
Sucar LE, Bielza C, Morales EF, Hernandez-Leal P, Zaragoza JH, Larrañaga P (2014) Multi-label classification with bayesian network-based chain classifiers. Pattern Recogn Lett 41:14–22
Article Google Scholar
Tao H, Wang K, Zhuo L, Li X, Li Q, Liu Y, Xu Y (2020) A comprehensive framework for studying diffusion patterns of imported dengue with individual-based movement data. Int J Geogr Inf Sci 34(3):604–624
Article Google Scholar
Thein TL, Ng EL, Yeang MS, Leo YS, Lye DC (2017) Risk factors for concurrent bacteremia in adult patients with dengue. J Microbiol Immunol Infect 50(3):314–320
Article Google Scholar
Tipayamongkholgul M, Lisakulruk S (2011) Socio-geographical factors in vulnerability to dengue in thai villages: a spatial regression analysis. Geospat Health 5(2):191–198
Article Google Scholar
Tisue S, Wilensky U (2004) Netlogo: design and implementation of a multi-agent modeling environment. In: Proceedings of agent, pp 7–9
Toledo ME, Rodriguez A, Valdés L, Carrión R, Cabrera G, Banderas D, Ceballos E, Domeqc M, Peña C, Baly A, Vanlerberghe V, Van der Stuyft P (2011) Evidence on impact of community-based environmental management on dengue transmission in Santiago de Cuba. Trop Med Int Health 16(6):744–747
Article Google Scholar
Vinarti RA, Hederman LM (2019) A personalized infectious disease risk prediction system. Expert Syst Appl 131:266–274
Article Google Scholar
WHO: Dengue (2018) https://www.who.int/denguecontrol/en/. Accessed 24 Dec 2019
WHO: Dengue and Severe Dengue (2018). https://www.who.int/en/news-room/fact-sheets/detail/dengue-and-severe-dengue. Accessed 24 Dec 2019
Witten I, Frank E, Hall M, Pal C (2016) Data mining: practical machine learning tools and techniques
Yao Y (2001) Information granulation and rough set approximation. Int J Intell Syst 16(1):87–104
Article MathSciNet MATH Google Scholar
Zhang Y, Ibaraki M, Schwartz FW (2020) Disease surveillance using online news: dengue and zika in tropical countries. J Biomed Inform 102:103374
Article Google Scholar

Download references

Acknowledgements

We sincerely acknowledge the worthy efforts of Dr. Arvind Manchanda for providing his experiences and guidance in understanding the root causes of Dengue Virus (DeV) Infection outbreak, which has helped us in formulating the problem for our research and preparing DeV infection-related symptoms-based health data. At the same time, we also want to acknowledge the efforts of Dr. Pankaj Sood, who is actively working in various community-based healthcare projects, in arranging the consultation of various health professionals regarding our proposed approach.

Author information

Authors and Affiliations

Department of Computer Science and Informatics, Central University of Himachal Pradesh, Dharamsahala, HP, India
Sandeep Kumar Sood
Department of Computer Science and Engineering, Guru Nanak Dev University Regional Campus, Gurdaspur, PB, India
Vaishali Sood & Isha Mahajan
School of Computing, Indian Institute of Information Technology, Una, HP, India
Sahil

Authors

Sandeep Kumar Sood
View author publications
You can also search for this author in PubMed Google Scholar
Vaishali Sood
View author publications
You can also search for this author in PubMed Google Scholar
Isha Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Sahil
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sahil.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sood, S.K., Sood, V., Mahajan, I. et al. An intelligent healthcare system for predicting and preventing dengue virus infection. Computing 105, 617–655 (2023). https://doi.org/10.1007/s00607-020-00877-8

Download citation

Received: 05 June 2020
Accepted: 19 November 2020
Published: 08 January 2021
Issue Date: March 2023
DOI: https://doi.org/10.1007/s00607-020-00877-8

Keywords

Mathematics Subject Classification

68T99

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

An intelligent healthcare system for predicting and preventing dengue virus infection

Abstract

Similar content being viewed by others

An intelligent and secure healthcare framework for the prediction and prevention of Dengue virus outbreak using fog computing

An intelligent system for predicting and preventing MERS-CoV infection outbreak

IoT and cloud-based COVID-19 risk of infection prediction using hesitant intuitionistic fuzzy set

1 Introduction

1.1 Contribution

2 Related work

2.1 Dengue-based healthcare

2.2 Cloud-IoT based healthcare

2.3 Fog computing

2.4 Infectious diseases-based intelligent healthcare

3 Proposed system

3.1 Data acquisition subsystem

3.2 Fog subsystem

3.2.1 Fog-based DeV infection diagnosis

3.2.2 Time-based alert generation

3.3 Cloud subsystem

3.3.1 Cloud storage

3.3.2 Information protection

3.3.3 GPS-based risk assessment

3.3.4 Health communication

4 Experimental setup

4.1 Data generation

4.2 Fog-based DeV infection diagnosis evaluation

4.3 Alert generation evaluation

4.4 Dengue cases prediction evaluation

4.5 Information protection security analysis

4.6 GPS-based risk assessment evaluation

5 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation