Introduction

A pandemic is a disease that spreads across a large area including multiple countries, continents, or worldwide. The novel Coronavirus outbreak late in 2019 and early 2020, known today as COVID-19 or SARS–CoV-2 (which refers to the virus causing the disease), is with us. The World Health Organization (WHO), on 11 March 2020, declared COVID-19 disease a pandemic (Stang et al. 2020). It has horribly spread across the stretches of the world, and too many people are suffering. COVID-19 is a disease that people are not immune to, spreading throughout the world beyond expectation. Almost 211 countries are suffering (Xu et al. 2020). Countries all over the world are declaring Health emergencies and taking whatever possible steps to keep the citizens safe (Xu et al. 2020). By June 2021, worldwide deaths are approaching 3.85 million, and more than 178 million cases of COVID-19 are confirmed. That is why the main objective of the present investigation is to identify the significant risk factors in transmitted COVID-19.

Common symptoms include fever, cough, headache, fatigue, shortness of breath, persistent pain or pressure in the chest, bluish lips or face, and loss of smell and taste (Lu et al. 2020). The majority of patient instances show mild symptoms most of the times; nevertheless, some progress to Acute Respiratory Distress syndrome (ARDS) possibly precipitated by multi-organ failure and blood clots (Murthy et al. 2020; Huang et al. 2020). It takes four-five days after the exposure to the virus for some symptoms to be revealed, but this time may also range from two to fourteen days (Lu et al. 2020; Murthy et al. 2020).

The virus is primarily spread between people during close contact, most often via small droplets produced by coughing, sneezing, and talking. The droplets usually fall to the ground or onto surfaces rather than traveling through air over long distances (Galbadage et al. 2020). It is most contagious during the first three days after the onset of symptoms, although the spread is possible before symptoms appear, and from people who do not show symptoms. Estimates of the number of people infected by one person with COVID-19 are referred to as the basic productive number, R0 have varied widely (Battiston and Gamba 2021). Elderly human beings with underlying diseases and affected by COVID-19 had a higher mortality rate. Younger ages and especially children are not severely affected by the disease. For some reason, males are more susceptible to the disease (Bwire 2020; Huang et al. 2020).

At a time when humanity is facing the alarming spread of the new pandemic, one feels that despite the predictions and measures, the tools that the evolution of human thought has acquired are not yet capable of arming us with both knowledge and know-how to deal with pandemics in a timely and effective manner. Statistical prediction models, although studied in great depth, are capable of modeling a single snapshot and, moreover, have the well-known weaknesses about their flexibility, their ability to handle situations involving uncertainty, and manage a multitude of interdependent factors that affect the rate of disease spread (Groumpos 2021; Ala’raj et al. 2021). Moreover, the statistical models and the correlation between variables of the pandemic are commonly based on linear assumptions. However, most, if not all, medical problems, especially new and not known pandemics, are nonlinear in their nature (Groumpos 2021).

Since the pandemic has caught the whole world by surprise, no one was ready to deal with it. Very few studies have been conducted and all research articles are either on SARS and MERS (Huang et al. 2020; De Wit et al. 2016) or on COVID-19 the last 3–4 months (Toğaçar et al. 2020; Hijikata et al. 2020; Gralinski and Menachery 2020; Wrapp et al. 2020; Li and De Clercq 2019; Ling 2020). Again, most of them use statistical methods and probability theories. New theories are needed to address the many challenging questions for COVID-19.

In our study, we analyzed and conducted research on the factors that affect the extent and capacity of the spread of the pandemic, isolating the most important ones. Our goal is to develop a decision-making system that can predict the potential development of pandemics in a given region and/or country, based not only on correlation factors, but also on causality. Besides, this system, in addition to forecasting, will be able to be for the supervision or the effectiveness of measures.

Methods

For this study, the modeling methodology provided by Fuzzy Cognitive Maps, is proposed as a solution. In this section, we provide a detailed description of the research methodology, as well as the parameters of the proposed model.

Classical theory of Fuzzy Cognitive Maps

Cognitive Maps (CM) were firstly introduced by R. Axelrod in 1976 (Axelrod 1976) as a formal way of representing social scientific knowledge and modeling decision in social and political systems. However, CM, as had been developed by Axelrod, were not sufficient to address the dynamic behaviors of complex systems, especially in the presence of uncertainty and ambiguity. They were not able to capture the true knowledge of the complex dynamic systems, especially as the time was evolving. Thus, an extension of CM was needed and was introduced ten years later 1986 by Kosko (Kosko 1986), bringing in the computational fuzzy logic. It was a new soft computing method had been named the Fuzzy Cognitive Map (FCM) approach. FCMs have become popular and used extensively with excellent results since: (1) complex dynamic systems have many parameters and they are difficult to be modeled with precise mathematical methods, (2) they are nonlinear, and time-varying, (3) depend on many real data and on the evolution of time. On the other hand, FCMs are very simple and are close to human reasoning. In addition, they do not need to have a precise mathematical model of the process. They can address any complex dynamic system without using complex mathematics. FCM methods depend heavily on accumulated human knowledge. This is due to the use of experts’ opinion and their human Decision-Making Support Systems (DMSS).

Fuzzy Cognitive Maps came as a combination of methods of fuzzy logic and neural networks. They constitute a computational method that is able to examine situations during which the human thinking process involves fuzzy or uncertain descriptions. An FCM presents a graphical representation utilized to describe the cause and effect relations between nodes, thus giving us the opportunity to describe the behavior of a system in a simple and symbolic way.

The FCM theories embody the accumulated knowledge and experience from experts who know how the medical system behaves in different circumstances. This is one of the strongest points of the FCM approach that rely mainly on the experience of the experts. This knowledge is extracted using linguistic variables, which then are transformed to numeric values using a defuzzification method. Figure 1 shows a representative diagram of an FCM. During the simulation, the value of each concept is calculated using the following rule:

Fig. 1
figure 1

A simple Fuzzy Cognitive Map (FCM)

$${A}_i\left(k+1\right)=f\ \left({k}_2\times {A}_i(k)+{k}_1\times {\sum}_{j=1,\kern0.5em j\ne i}^N{A}_j(k)\times {W}_{ji}\right)$$
(1)

Where N is the number of concepts, \({A}_{i}\) (k + 1) is the value of the concept Ci at the iteration step k + 1, Aj(k) is the value of the concept Cj at the iteration step k, Wji is the weight of interconnection from concept Cj to concept Ci. The constant “k1” expresses the influence of all the other interconnected concepts on the configuration of the new value of the concept Ai, and “k2” represents the proportion of the contribution of the previous value of the concept in computing the new value. Their values usually are set equal to 1, unless the experts of a particular dynamic system can determine these influences and contributions to the behavior of the system. Meanwhile, function f is the sigmoid function:

$$f=\frac{1}{1+{e}^{(-\lambda \times x)}}$$
(2)

where λ > 0 determines the steepness of function f. The FCM’s concepts are given some initial values, which are then changed depending on the weights Wij and the way the concepts affect each other. The calculations stop when a steady state is achieved, i.e., when the concepts’ values become stable.

However, despite the many theoretical developments of FCMs, based on the material presented in the previous section and their success to many medical applications (and not only), FCMs still have a number of limitations and drawbacks (Groumpos 2018; Mpelogianni et al. 2018). All theories are essentially based on the material of the “Methods” section, which will be referred from now on as the “classical FCM theories”. They do not go into the depth of the dynamic behavior of complex systems, especially the medical ones. In addition, the initial system structure described by experts and the learning principles-algorithms cannot follow the evolution of the medical complex dynamic system. The natural-human world evolves and the human-made systems progress by applying knowledge derived from observations of and familiarity with repeatable human events and phenomena of nature. Our perceptions, understanding, and ability to model medical problems enable us to develop policies, processes, and products that are invented and required in solving them. This requires new advanced interdisciplinary theories and technologies (Groumpos 2018; Mpelogianni et al. 2018).

The main problem stems from the existing approach that all concepts (say N) are grouped as one vector A. The values A of all N concepts at all instant of times, k, are calculated using Eqs. (1)–(2). Similarly, the learning-training algorithms for upgrading the causality coefficients Wij, are performed for all N concepts and for all instant of times, k (Anninou et al. 2018; Groumpos 2010). The equations utilized for Hebbian learning are using all N concepts. This is not mathematically correct since, from the total number N of concepts, there is a number of concepts whose values remain constant for a long period of time when other concepts are changing slowly or fast. There are also exogenous variables that are not easily identified in the first place and thus cannot be part of the classical learning algorithms. Another issue is that, in the classical FCM theories, the algorithm was always converging to the same final values, regardless of the initial conditions. This, in return, forces us to increase the number of simulations consulting the behavior of the changes of the weights Wij through their changes based learning algorithms (Anninou et al. 2018).

Another drawback has to do with the Nonlinear Hebbian Learning (NHL) learning method (Groumpos 2010; Papageorgiou et al. 2002, 2003a, b). While running several simulations, we have observed that, due to the way weights are being calculated, if the number of iterations of the algorithm is increased, in order to reach a steady-state, the causality sometimes reverses all or some of the Wij values. This is a very serious drawback as it changes the causality between concepts (Mpelogianni et al. 2018) and in several occasions, instead of having a lower, we are going to have a larger result causing serious problems not only in the interpretation of the obtained results but also on stability issues to a number of real-life systems. This is very crucial to medical problems and especially for COVID-19.

All the above drawbacks and some other ones were addressed by the research team of the Laboratory for Automation and Robotics (LAR) of the University of Patras under the supervision of the author. The new approach addressing these drawbacks and deficiencies of FCMs is called the state space Advanced Fuzzy Cognitive Maps (AFCM) approach. The basic mathematical approach is to separate the total number of concepts N to (1) N1-State Concepts, (2) N2-Input Concepts, and (3) N3-Output Concepts.

The Advanced Fuzzy Cognitive Maps (AFCM) approach is an evolution of the classic methodology, which promises more accurate results for a large variety of complex medical systems. This methodology, which is thoroughly analyzed in (Groumpos 2010), overcomes major issues of the classic FCMs. Those issues are: (a) the presence of concepts of different nature is ignored in the traditional approach and, (b) the utilization of the classic sigmoid normalization function was fuzzing the system, especially in cases of the existence of several concepts. The AFCM approach was used in two medical applications with excellent results (Antigoni 2013; Anninou et al. 2018).

The proposed State–Space AFCM. Basic aspects and input concepts

The new proposed AFCM approach uses the basic theory of FCM with the difference that not all variables-concepts of the system are treated in the same way as in the “classical FCM theories”.

The procedure of constructing the AFCM is remaining the same (Mpelogianni et al. 2018; Apostolopoulos et al. 2021). First, experts should define all the basic system’s variables-concepts. Then, the main change and innovation are that after this procedure, variables-concepts should be divided into three categories:

  • Fuzzy concept states

  • Fuzzy concept inputs

  • Fuzzy concept outputs

The fuzzy concept states of a dynamic system refer to a minimum set of variables, known as state variables, which fully describe the system and its response to any given set of inputs. The fuzzy concept inputs concern signals that stimulate the system. The fuzzy output variables constitute those that we should examine their behavior. In that way, we take into consideration what exactly each concept does. We no more treat outputs in the same way as inputs, and inputs are separated from states. The mathematical description of the system and the combination of initial states and inputs are sufficient to provide information about both the future states and outputs. Mathematically, the standard form of the new model is described by the system general equations (Mpelogianni et al. 2018),

$$\mathrm{X }\left[\mathrm{k }+ 1\right]=\mathrm{ Ax}\left[\mathrm{k}\right]+\mathrm{ Bu}\left[\mathrm{k}\right]$$
(3)
$$\mathrm{Y }[\mathrm{k}+1] =\mathrm{ Cx}[\mathrm{k}] +\mathrm{ Du}[\mathrm{k}]$$
(4)

where k = 0,1,2,3……..until convergence

$$Y\left(Z\right)=Ex(Z)$$
(5)

where Z = 1,2,3…. until the final desired time. In addition, x(k) ∈ Rnxn is a concept state vector, u(k) ∈ Rrxr is an exogenous known concept input vector and is in the discrete-time unit “k” and y(k) ∈ Rmxm the output concept vector. In addition, y(Z) ∈ Rpxp is the concept output vector, and “Z” is the discrete instant of time(s) Z that we are interested to know the concept outputs. For example, the discrete times can be every 12, 18 or 24 h. The discrete times k and Z can change according to each individual case. The matrices A, B, C, and D are constant matrices having been determined by the engineers in close consultation with the physician staff. A similar approached for the FCM application for the knee case.

Please note that the discrete-time Z of Eq. (5) is different from the discrete times of k = k1,k2,k3……not only of Eqs. (3) and (4) but also of Eq. (1). The discrete-time k of Eqs. (3) and (4) can also be different from the k of Eq. (1). A lot of discussions are required for this topic and need to be addressed in future research.

The discrete-time Z is set by the medical doctors. This is in stark contrast with the “k” variable in Eq. (1). The latter expresses the iteration steps until convergence. In the new model, the number of iteration steps depends on the samples that have been available at all times, and the final time Z when the outcome result is desired. In the new AFCM method, Eqs. (3) and (4), the problem of convergence is complicated and needs to be addressed in future research.

The algorithm that describes the new model in detail, adjusted in the prediction of the pandemic spread 19, is presented as follows: (with the term experts are the medical personnel).

  1. Step 1:

    Experts define the basic concepts (known until today).

  2. Step 2:

    Experts separate them into the Fuzzy States, Fuzzy Inputs, and Fuzzy Outputs.

  3. Step 3:

    Experts construct an AFCM schema-diagram drawing the appropriate interconnections.

  4. Step 4:

    Experts construct weight matrices between States-States, States-Inputs, States-Outputs, Inputs-Outputs using fuzzy variables, The matrices A, B, C, and D of Eqs. (3) and (4) are determined by the engineers in close consultation with the medical doctors.

  5. Step 5:

    Convert weight matrices into numeric values using a defuzzification method.

  6. Step 6:

    Determine Initial states in time unit “0” (Patients Initial Symptoms)-initial conditions.

  7. Step 7:

    Determine also all Inputs for discrete times “k”.

  8. Step 8:

    Run simulations for the AFCM till convergence.

  9. Step 9:

    Obtain the output Y(Z). The concept that is associated with the spread prediction.

For creating the proposed model, the advanced methodology, as proposed in Mpelogianni and Groumpos (2019); Apostolopoulos et al. (2021), was followed. Based on the recent literature for the pandemic of COVID-19, we propose some fundamental concepts to describe the spread of the disease. The proposed concepts are illustrated in Table 1.

Table 1 The concepts of the system

Concept #1: This factor may be utilized to estimate the current situation (Kucharski et al. 2020). For example, if the days since the first death was recorded are below 30, we can assume that the spread has not generally reached its peak yet and we can expect an ascending behavior in the future. This concept will only affect a particular state concept, as described later.

Concept #2: This factor can be used as an indicator of the pandemic trend, although its one-sided assessment does not provide much information. Similar to the first concept, this input element affects a particular state concept described later.

Concept #3: This concept behaves as an estimator of the significance of the pandemic at a particular period of time (week) and could affect (Giuliani et al. 2020).

Concept #4: The number of intubated in an intensive care unit may both affect the quality of the healthcare system, considering a possible insufficiency in available units.

Concept #5: The restriction policy applied to the habitants plays a vital role in handling the outbreak, as several real-life facts demonstrated during the recent months. More specifically, the stricter the measures, the more controllable the spread of the disease will be, given that the measures have been taken in time (Sattenspiel and Herring 2003).

Concept #6: The restriction policy referring to the industrial zones of a state is perhaps the most decisive factor to control the spread of the disease, due to the high congestion observed in large industries.

Concept #7: The state concept referred to as the “impact of restrictions” aims to handle both the individual and the industrial restrictions to produce a combined factor affecting the spread of the disease.

Concept #8: The number of intensive care units per thousand populations is a factor influencing the sufficiency and the quality of the healthcare system and may constitute an essential parameter, as a possible insufficiency in available units may result in a growth of the outbreak.

Concept #9: Highly related to the previous concept, the number of medical experts per thousand populations is correlated to the mortality rate (Gulliford 2002).

Concept #10: The ranking of the specific country, according to World Health Organization, is utilized as a factor affecting state concept #11 (i.e., the effectiveness of the healthcare system).

Concept #11: Concepts #8–10 are contributing to the state concept “Effectiveness of the Healthcare System,” which is related to the spread of the virus to the society.

Concept #12: The estimation of the current situation is an optional parameter supplied to the model, aiming to increase its perception of the current situation.

Concept #13: The output concept, which attempts to predict the spread of the disease the next week, based on the inputs and the states of the last week.

Revised sigmoid function

In order to apply the FCM methodology, the values of all the input concepts must be between the interval [0,1], where 0 (zero) denotes that the value of the concept is very small and 1 (one) that the value is very big. This is the reason why the sigmoid function (Eq. (2)) is needed; to suppress the result to the interval [0,1]. But due to the shape of the curve any concept which, because of the additive calculation method (Eq. (1)) is assigned a value beyond 3 leads the sigmoid function to correspond it to the value one which is greatly problematic as the final output is interpreted as “high” even if this is not always the expected or correct result. Continuing on the subject of the sigmoid function, there is another drawback that leads to high output values. This is the fact that the center of the curve, instead of being on the (0,0) point on the xy axis, it is on the (0.5,0) point. This means that each concept’s lowest value can be 0.5. This problem, combined with the first one, makes it difficult to interpret the result even with the use of the experts’ interpretation criterion (Eq. (3)).

$$R \left(x\right)= \left\{\begin{array}{c}0, x<0.5\\ \frac{x-0.5}{0.5}, x\ge 0.5\end{array}\right.$$
(6)

However, this is not efficient for medical problems. For this reason, we use the new approach proposed in Mpelogianni and Groumpos (2018). The sigmoid function (Eq. (2)) is changed to Eq. (7); so to fit the values to the desired interval, the slope of the curve changes depending on the variable we want to fit.

$$f\left(x\right)=m+ \frac{M-m}{1+{e}^{-r*(x-to)}}$$
(7)

where x ∈ R and f(R) = (m,M).

In the above-mentioned equation, we have.

  • m is the lower limit of the curve

  • M is the upper limit of the curve

  • r is the slope of the curve and

  • to is the symmetry to the y-axis

Based on the simulations, the slope of the sigmoid function was set to 1.5.

Weight suggestions

Ideally, the weight tables are defined by the experts according to their knowledge and experience. For the needs of this proposal and to make a first assessment of the model, the weight tables are arbitrarily determined by the authors, based on their own assessment, but also in direct proportion to the response of the algorithm.

Τhe weight definition field was defined for this task as follows: (a) very weak impact (VW), (b) weak impact (W), (c) medium impact (M), (d) strong impact (S), and (e) very strong impact (VS). The weights may take negative values according to the mentioned categories.

The overall weight table referring to the proposed AFCM system is presented in Table 2. As the values of the weights reflect real-time relations, a dynamic approach is considered to alter the weight values over time, according to a specific set of rules, which is explained in the “The proposed State – Space AFCM. Basic aspects and input concepts” section. In Table 2, information regarding which weight is also affected by a certain rule is given. For example, the impact of the first concept on the output is by default set to “Very Strong”, however, this impact may alter later when the pandemic is heading towards the end of its cycle.

Table 2 Table of weights

Rules

There are several approaches to achieve the appropriate dynamism to transform the model from a static one to a robust one, able to adapt to changing environments. In machine learning, this is achieved through extensive training on large sets of data. In this way, the connections between the concepts are adjusted according to the situation, aiming to reflect the behavior of the training data. The AFCMs can be benefited from a similar approach, Hebian learning (Papageorgiou et al. 2003a, b, c); however, since the pandemic is not over, no complete data are available to explore this strategy.

To overcome this issue, it is proposed to apply a set of rules which are designed with respect to the epidemic curve, which is a graph in which the number of new cases of an outbreak disease is plotted against time. Epidemic curves are ordinarily constructed after the disease outbreak is over. However, in the case of Coronavirus disease, good estimates have been extensively utilized by the stated and the epidemiologists to assess the situation and plan possible measures. We propose to fit the model under consideration to the epidemic curve, i.e., to let the model know that the disease will approximately follow this distribution. The reader should note that only the general trend of the pandemic curve is adopted here as a guidance for the model. In essence, the model is developed to seek and identify a pandemic peak, before it alternated its behavior. Depending on the current situation, which is defined by the values of the concepts, this general trend, curve, slope, standard deviation may be altered.

The set of rules proposed for a more precise approach to reality is presented in Table 3.

Table 3 Set of rules

Predicting the peak

If the total number of patients is increasing, but at a declining rate, then there is a serious indication that the outbreak is reaching its peak value. To record this trend in our model, a global variable denoted as reach_peak is introduced, with values zero or one. To define the actual peak, the following conditions must be true: (a) the rate of increase in confirmed disease patients is below 1%, and (b) the value of reach_peak is one.

Experimental cases

To evaluate the conception, specific simulations were performed, utilizing the available statistics related to the spread of COVID-19 for Greece, South Korea, and Germany. The simulations were performed over one month, separated into four weeks. The inputs of the model are the corresponding weakly averages, while the prediction (i.e., the output) refers to the severity of the spread for the following week. We selected to perform the simulations at different times for each country to investigate the performance over a larger period.

Greece

For Greece, the period of simulation is from 12 March 2020 until 12 April 2020. The statistics were collected from the National Health, while the information related to the quality of the healthcare system was collected from the website of the Ministry of Health. The rate of confirmed new cases is illustrated in Fig. 2.

Fig. 2
figure 2

The rate of new confirmed diseased cases for Greece between 12/3/2020 and 12/4/2020

South Korea

For South Korea, the period of simulation is from 1 March 2020 until 1 April 2020. The statistics were collected from the Centers for Disease Control & Prevention, while the information related to the quality of the healthcare system was collected from the website of the Ministry of Health and Welfare. The rate of confirmed new cases is illustrated in Fig. 3.

Fig. 3
figure 3

The rate of new confirmed diseased cases for South Korea between 1/3/2020 and 1/4/2020

Germany

For Germany, the period of simulation is from 15 March 2020 until 15 April 2020. The statistics were collected from the Robert Koch Institute, while the information related to the quality of the healthcare system was collected from the website of the Federal Ministry of Health. The rate of confirmed new cases is illustrated in Fig. 4.

Fig. 4
figure 4

The rate of new confirmed diseased cases for Germany between 1/3/2020 and 1/4/2020

Results

In the following Table, the weekly predictions of the AFCM model are presented for each country. Besides, the numeric predictions for each week are provided. Due to interconnections between the input concepts and the state concepts, some iterations were necessary for the system to reach a steady-state for each week. Those iterations are also presented in Table 4. The column “Actual Situation” is computed by the statistics and corresponds to the real progress of the pandemic for the week under consideration. The numeric output is the model’s actual response after a stable equilibrium is achieved. This equilibrium is reached after the number of iterations provided in the relevant column. The numeric output may be interpreted for comparison reasons to a verbal statement describing the prediction of the spread for the following week.

Table 4 Results of the simulation

The response of the system demonstrates its effectiveness in two situations: (a) the correct prediction of the alteration of the situation from steady-ascending to steady in the example of Greece, (b) the correct prediction of the alteration of the situation from steady-ascending to ascending and then back to steady ascending in the example of the outbreak in Germany. Due to the absence of available information related to a descending behavior of the spread, the model could not be evaluated further. In Table 5, the input values for the first two weeks in Greece are given.

Table 5 Input values for Greece for the first two weeks

The system was able to capture the disturbance caused by the predictor concepts 1–6 and concept 7 to the response concept 13 and turn this disturbance to an interpretable outcome connected to the risk of the spread of COVID-19 the next week. For the system to reach a steady-state and to respond in the desired way, the translation of the input values to desired intervals was mandatory. For example, the value of concept 1 (days) is expected to grow every week by 7, which must be interpreted as a small change in the value of the concept handled by the algorithm; therefore, the actual number of days were divided by 180 to fit the interval (0,1). The same strategy was followed for all the input concepts, as it is shown in Table 4.

Results in terms of predicted patient cases

To further evaluate the system, we simulate the COVID-19 outbreak in Greece, South Korea, and Germany. Tables 6 and 7 summarize the results.

Table 6 Statistical analysis of confirmed and predicted COVID-19 patient cases for each country
Table 7 Errors and correlation between confirmed and predicted COVID-19 patient cases per country

For Greece, the proposed methodology is slightly underestimating the potential spread of the pandemic. With a mean absolute error of 210 out of the 2114 total confirmed cases of the period under consideration, an estimation error is calculated at 9.9%. The time series of confirmed versus predicted patients for Greece is illustrated in Fig. 5.

Fig. 5
figure 5

Confirmed and Predicted COVID-19 patients time series for Greece

For South Korea, the model is underestimating the potential spread of the pandemic, as in Greece. With a mean absolute error of 1752 out of the 9976 total confirmed cases of the period under consideration, an estimation error is calculated at 17.9%%. The time series of confirmed versus predicted patients for South Korea is illustrated in Fig. 6.

Fig. 6
figure 6

Confirmed and Predicted COVID-19 patients time series for South Korea

For Germany, the model is underestimating the potential spread of the pandemic in the beginning and overestimating the spread later. With a mean absolute error of 7985 out of the 127,584 total confirmed cases of the period under consideration, an estimation error is calculated at 6%. The time series of confirmed versus predicted patients for South Korea is illustrated in Fig. 7.

Fig. 7
figure 7

Confirmed and Predicted COVID-19 patients time series for Germany

Despite the observed errors, the model is accurately predicting the trend of the spread, yielding remarkable Pearson’s Correlation Coefficient (r) and Coefficient of Determination (R2). For Greece, r and R2 are 0.9976 and 0.9952, respectively. For South Korea, r and R2 are estimated at 0.997 and 0.9941, respectively. Finally, for Germany, r and R2 are 0.989 and 0.9782, respectively.

Discussion

A new method was proposed, the Advanced Fuzzy Cognitive Map (AFCM), for the simulation of the Coronavirus pandemic characteristics in various countries. The simulations were performed utilizing the statistical data from South Korea, Germany, and Greece. The model was able to capture the progress of the disease and to predict the change of the situation very accurately from an ascending spread to a steadier situation occurring when the spread is reaching its peak value. This behavior can be observed particularly in Figs. 6 and 7, where the two lines are simultaneously changing their trend. The reader should note that the particular modeling methodology is not depending its behavior on any pandemic curve. While compiled to expect a certain peak of the spread at some point of time, this would not always be the case. Altering specific concepts describing the current situation (e.g. impact of restrictions, restriction policy, and effectiveness of the healthcare system) would change the behavior of the model and could overcome the need of assigning a peak. More simulations are expected in the near future to investigate the response of the methodology in very uncertain conditions.

The obtained results demonstrate promising effectiveness of the proposed system despite the fact that the weights of the input and the state concepts were not analyzed or defined by experienced epidemiologists or data analysis experts. The approach of the current work exploits the benefits of the AFCMs, such as the convenient modeling of the causal relations between factors of different nature and their ability to perform in changing real-time environments by making full use of the disturbance caused by the one concept to the other. The idea of studying the known causality between pandemic factors and not predicting and discovering new ones is the core of AFCMs. The statement that “correlation does not imply causality, while causality always implies correlation” can be better understood with a specific example. Car accidents were reduced during the COVID-19 pandemic, due to restriction measures. One can notice that a negative correlation between the pandemic spread and car accidents is indeed present. However, this correlation does not imply any direct cause and effect relation between those two phenomena. For example, we do not expect that if the car accidents are increased, the pandemic will decline.

The study has some limitations. Firstly, the model was not fully evaluated regarding its performance of predicting the spread after the peak value is reached. This is due to the absence of real data and conformed data at the time this research was conducted. Secondly, the weights were defined by the authors and did not necessarily reflect the actual relations between the factors incorporated into the AFCM. A more scientifically valid weight definition, which can be produced through cooperation between epidemiologists and committees, could enhance the robustness of the model in the future. The reader should note that the desired output depends heavily on the assigned weights and the pre-defined input concepts. In the present simulation, although arbitrarily defined, the selected concepts and weights yield very promising results. Thirdly, for a complete and accurate estimation, more factors have to be included. For example, a number of environmental factors may be incorporated into the model to describe the spread of communicable diseases that are prone to cause epidemics, such as the water supply facilities, the sanitation facilities, the quality of food, and the climate. Climate can affect disease transmission in a variety of ways. The distribution and population size of disease vectors can be heavily affected by the local climate. One more fact must be taken into consideration, which is many more real data are needed and form more countries.

Nevertheless, the proposed methodology may be appropriate to overcome the limitations of the statistical analysis, wherein several predictors are not taken into consideration (for example, the quality of the healthcare system). As it is demonstrated by the first simulation results, the AFCM modeling methodology succeeds in providing a reliable prediction for the spread of the disease both in early situations and in near-peak conditions. The decent handling of seemingly tough-to-handle attributes increases the dynamism of the predictions and may be able to circumvent the relatively static approach of statistical methodologies.

This study gives strong evidence that the FCM theories are probably the only ones exploring the causality between the variables of medical problems in a sound mathematical and scientific foundation.

The response of the model in unexpected pandemic reactions and trends is to be investigated in the near future. More specifically, the effectiveness of the AFCMs in predicting situations where the pandemic trend is not following an expected pandemic curve, or the pandemic trend is showing heavy instability due to a number of reasons, is a future research opportunity.

Physicians are requested to handle information of different nature, e.g., patient’s history, clinical diagnostic tests, medical images, personal health problems, and demographic characteristics (Apostolopoulos et al. 2020). The interpretation of these results involves ambiguity, fuzziness, and uncertainty, which plays a critical role in decision-making to a wide and diverse set of medical problems. The COVID-19 pandemic has all these characteristics. Although these approaches provide us with answers that are needed and used to study medical problems, they are still not sufficient and adequate to provide us with acceptable and convincing solutions. Therefore, new advanced scientific approaches and new mathematical models are urgently needed.

Conclusions

Αccurate and reliable Medical Decision Systems predicting the spread of dangerous pandemics can be a critical tool for society. Modern public health emergency management methods do not exhaust the necessary medical measures, such as improving health systems, but extend to the whole spectrum of social and economic life. For this reason, it is necessary to develop powerful simulation tools that are able to take many variables of different nature, to be able to respond to uncertain and unpredictable situations by exploring the causation between them. Such tools can be irreplaceable and helpful for scientists, especially to medical professionals.

The proposed AFCMs appear to be a robust methodology for creating and monitoring such complex modeling systems, while, in this study, their advantages and effectiveness are proven in practice. It is the authors’ opinion that these methods can be further developed and gain the trust and attention of scientists, especially the medical professionals in the near future.

Future research should address the aforementioned limitations and perhaps explore the possibilities of machine learning approaches, such as the backpropagation utilized to train the networks. One option is the development of a hybrid AFCM-Neural Network-Statistical approach, wherein the advantages of each method are combined to constitute a robust predicting model. Methods of Artificial Intelligence (AI) such as machine learning, deep learning, and probabilistic methods for uncertain reasoning, should also be used to study COVID-19 (Apostolopoulos and Groumpos 2020; Apostolopoulos and Mpesiana 2020).

New concepts that the medical doctors can propose for the spread of the pandemic would develop new AFCM models. Close cooperation with medical doctors is needed. Models must be developed for different COVID-19 affected people and for different geographical regions.