Inverse problem solver for epidemiological geographic profiling

Maeno, Yoshiharu

doi:10.1007/s40844-024-00281-3

Inverse problem solver for epidemiological geographic profiling

Article
Open access
Published: 30 March 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Evolutionary and Institutional Economics Review Aims and scope Submit manuscript

Inverse problem solver for epidemiological geographic profiling

Download PDF

Yoshiharu Maeno ORCID: orcid.org/0009-0008-5399-7330¹

230 Accesses
Explore all metrics

Abstract

This work studies machine-learning-based inverse problem solvers for a reaction–diffusion process. The study focus is on the performance of a state-of-the-art convolutional neural network in discovering the source of disease spreading. This problem is called epidemiological geographic profiling. The performance is investigated with synthetic datasets for SIR epidemiological compartments on a square grid geo-space. The convolutional neural network works effectively in discovering a single source and achieves the largest time average of accuracy for growing infection in a heterogeneous geo-space. The hit score remains near the lower bound over time. Discovering multiple sources is feasible potentially as well by learning the dataset for a single source.

Explaining Causal Influence of External Factors on Incidence Rate of Covid-19

Article 18 September 2021

Deep Spatial Q-Learning for Infectious Disease Control

Article 08 July 2023

PolSIRD: Modeling Epidemic Spread Under Intervention Policies

Article 14 June 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Spreading of an infectious disease is a stochastic reaction–diffusion process in a heterogeneous space. Infection of susceptible individuals is a reaction process in a subpopulation. Removal (or recovery) of infectious individuals is also a reaction process. Movement of infectious individuals is a diffusion process across subpopulation boundaries. A subpopulation is a group of individuals in a small distinct geographic area. Similar processes underlie many investigation topics in data-driven mathematical sciences, econophysics, and sociophysics. Some examples include dissemination of online fake-news (Vosoughi et al. 2018), spreading of rumors on social networking services (Grinberg et al. 2019), fluctuating prosperity of firms in an economic system (Maeno 2013), structural changes in the international trade network (Ikeda and Iyetomi 2018), and a crash of financial markets (Maskawa et al. 2013). Uncertainty in such processes results in the emergence of systemic risk (Kuyyamudi et al. 2019) and often imposes a large adverse impact on the entire system (Daglis et al. 2022).

Public health authorities are interested in the inception of the spreading. Once they detect a possible outbreak and get a big picture on the spatial distribution of the spreading, it becomes a pressing task for both the first-aid intervention and long-term policy making to discover the index case (Cesar Henrique and da Fontoura 2011). The index case is a so-called patient-zero as a source (individual or subpopulation) of spreading. This problem is called epidemiological geographic profiling (Le Comber et al. 2011). It is inspired by criminological geographic profiling which aids crime investigators in searching for a home base of an offender from serial crime sites on a map (Canter et al. 2000). Similarly, it is a significant task for financial regulatory authorities to discover the source of circulating disinformation and transmitting financial distress in the markets. Discovery of a source is an inverse problem for identifying the initial state retrospectively from the current snapshot of variables.

In general, temporal evolution of time-dependent variables is formulated by a Langevin equation (stochastic differential equation). Equivalently, temporal evolution of the probability density function of stochastic variables is described by a Fokker-Planck forward equation (multivariate partial differential equation). The Fokker-Plank formulation is analogous to a Boltzmann transport equation in describing non-equilibrium transient behaviors. The Boltzmann transport equation formalizes collisions and diffusion of molecular species under external forces. Susceptible and infectious individuals in epidemiology are such molecular species in thermodynamics. Note that relaxation to equilibrium, or non-decreasing entropy in the second law of thermodynamics, is an informational process to lose the memory of the initial state irreversibly. Unless the spreading terminates in the equilibrium, the most probable candidate for the initial state can be obtained in principle by analyzing a corresponding Fokker-Planck backward equation. But practical reaction-diffusion processes are too complex to obtain a tractable solution even numerically.

This work studies machine-learning-based inverse problem solvers for epidemiological geographic profiling. The study focus is on the performance of a state-of-the-art convolutional neural network (CNN). The other 3 solvers for performance comparison include a naive Bayes classifier (NBC), a random forest classifier (RFC), and a multinomial logistic regression (MLR). They are one of the most widely used solvers in predicting probability for Bayesian inference, supervised learning, and statistical modeling respectively. For example, some works on molecular transports (Janczura et al. 2020; Kowalek et al. 2019) investigate a random forest classifier in detail. A trajectory of time-dependent variables is computed numerically from a Langevin equation for SIR epidemiological compartments on a square grid geo-space. Snapshots of variables are recorded from the trajectory and organized as a synthetic dataset. The solvers are applied to discovering either a single source or multiple sources in the dataset. Accuracy and hit score are investigated to quantify the performance of the 4 solvers.

2 Related works

Recent works present that machine learning can play a vital role for data analysis in mathematical sciences. One example is approximating a solution of a complicated diffusion equation for polymer microphase separation (Wei et al. 2018). Another is classifying a diffusion mode in single particle tracking for molecular transports (Janczura et al. 2020; Kowalek et al. 2019). On the other hand, many works study global intercity aviation and human-to-human contacts as a medium of spreading of an infectious disease. The key idea is founded on modeling to discretize the medium as a complex, temporal, or multi-layered network (Li and Saad 2021; Ortega et al. 2022; Torrisi et al. 2021).

Some of the works propose to analyze network topologies and apply meta-heuristics to epidemiological geographic profiling (Menin and Bauch 2018; Nguyen and Vural 2017; Paluch et al. 2021; Shi et al. 2022). But few of them study machine-learning-based inverse problem solvers, which are applicable potentially to a multi-dimensional geo-space and any other models of the medium. This is the motivation of this work.

3 Materials and methods

3.1 Reaction–diffusion process for disease spreading

Time-dependent variables $I_{j}(t) \ (j=1, 2, \cdots , N)$ is the number of infectious individuals in the j-th subpopulation at continuous time $t \ge 0$. N is the number of subpopulations. $J_{0}$ is the set of source subpopulations. Temporal evolution of $I_{j}(t)$ is described by a Langevin equation (system of stochastic differential equations) in Eq. (1) (Maeno 2016).

$$\begin{aligned} \frac{\textrm{d} I_{j}(t)}{\textrm{d} t}= & {} (\alpha I_{j}(t) + \xi _{j}^{\mathrm{[ \alpha ]}} (t)) - (\beta I_{j}(t) + \xi _{j}^{\mathrm{[ \beta ]}} (t)) \nonumber \\{} & {} + \sum _{k \ne j} (\gamma _{kj} I_{k}(t) + \xi _{kj}^{\mathrm{[ \gamma ]}} (t)) - \sum _{k \ne j} (\gamma _{jk} I_{j}(t) + \xi _{jk}^{\mathrm{[ \gamma ]}} (t)) \nonumber \\{} & {} (j=1, 2, \dots , N). \end{aligned}$$

(1)

The coefficients $\alpha $ and $\beta $ are the probability of infection and removal (or recovery) per unit time. These are the state transitions in SIR epidemiological compartments ($S \rightarrow I \rightarrow R$). In the SIR epidemiological compartments, an individual is assigned to one of 3 compartments with labels S (susceptible), I (infectious), and R (removed (or recovered)). It is assumed in this work that $I_{j}(t) \ll S_{j}(t)$ which is the number of susceptible individuals. The decrease in $S_{j}$ is negligible. The term for infection becomes $\alpha ^{\prime } S_{j} I_{j}(t) \approx \alpha I_{j}(t)$ in Eq. (1) approximately. Basic reproductive ratio is given by $R = \alpha / \beta $. The Hurst exponent (Vilk 2022) for $I(t) = \sum _{j=1}^{N} I_{j}(t)$ is $h = 0.5$ (normal diffusion) if $\alpha = \beta \ (R=1)$. The coefficients $\varGamma = \{ \gamma _{ij }\}$ are the probability of unidirectional movements (from the j-th to the k-th subpopulation) per unit time. The probability of staying in the current subpopulation is $\gamma _{jj} = 1 - \sum _{k \ne j} \gamma _{jk}$. The terms $\xi _{j}^{\mathrm{[ \alpha ]}} (t)$, $\xi _{j}^{\mathrm{[ \beta ]}} (t)$, and $\xi _{jk}^{\mathrm{[ \gamma ]}} (t)$ are Gaussian white noise. Their functional forms are not known.

Subpopulations are placed on a two-dimensional square grid geo-space. The size of the geo-space is $32 \times 32 \ (N = 1024)$. A subpopulation is surrounded by 8 neighbors. Heterogeneity parameter $r_{\textrm{ht}}$ is the ratio of missing connections. A missing connection means that the probability of movements is negligible there. The geo-space is sparser and more disjoint for larger $ r_{\textrm{ht}}$. This work studies only non-disjoint connections. It is assumed that $\gamma _{jk} = \gamma $ is a constant for connections. In other words, the probability distribution is $P(\gamma _{jk} = 0) = r_{\textrm{ht}}$ and $P(\gamma _{jk} = \gamma ) = 1- r_{\textrm{ht}}$. The mean and standard deviation of $\gamma _{jk}$ over j, k in $\varGamma $ are $\mu = (1- r_{\textrm{ht}}) \gamma $ and $\sigma = \sqrt{(1- r_{\textrm{ht}}) r_{\textrm{ht}}} \gamma $.

3.2 Dataset for epidemiological geographic profiling

The Langevin equation in Eq. (1) is integrated with a Monte-Carlo simulation for a single source ($|J_{0}| = 1$). The initial state is $I_{j}(0) = 10^{4}$ for $j \in J_{0}$ and $I_{j}(0) = 0$ otherwise. Time t is discretized into a step $\varDelta t$. Multinomial random number generators $x \sim \mathcal {M} (\alpha , \beta , \varGamma )$ reproduce stepwise state transitions and fluctuating changes in $\varDelta I_{j}$. Variable snapshot vectors ${\varvec{I}} = (I_{1}, I_{2}, \dots , I_{N})$ are recorded at an observation interval $\tau $. The simulation conditions are $t \le 200$ and $\tau = 10$ (20 snapshots per a trajectory). The snapshots ${\varvec{I}}_{j}$ are the instances of N features with a source identifier $j \in J_{0}$ as a category label. The pairs of the features and category label are assembled into a training dataset ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}} = \{ ({\varvec{I}}_{j}, j) \}$. ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ includes $28 \times 28 = 784$ different trajectories (initial states). The source excludes the edges of the geo-space ($1024-784=240$ subpopulations). The instances ($|{\varvec{\mathcal {D}}}^{\mathrm{[tr]}} | = 20 \times 784 = 15680$) are stored in an arbitrary order. Note that ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ is not informative explicitly on the parameters ($\alpha , \beta , \varGamma , t, \varDelta t, \tau $).

A test dataset ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$ is assembled similarly. The values of the probability parameters $(\alpha , \beta , \varGamma )$ are the same. There is either a single source ($| J_{0} |=1$) or multiple sources ($| J_{0} |>1$) initially. The sources are chosen randomly so that the distance between any pair in $J_{0}$ is larger than the mean (0.52 in a unit square) because they are indistinguishable if they are too close. The initial state is $I_{j}(0) = 10^{4} / |J_{0}|$ for $j \in J_{0}$. The observation interval is $\tau = 25$ (8 snapshots per a trajectory). Figure 1 shows some instances in ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$ for $r_{\textrm{ht}}=0.45$ and $R=1.2$. Each time series (a), (b), (c) represents a trajectory ($t \le 200$) for $|J_{0}| = 1$ (single source), 2 (multiple sources), and 3 (multiple sources). The trace of the initial state gets fainter gradually (correlation is lost) as fluctuation accumulates and relaxation to the equilibrium progresses. They visualize how the signal level falls while the noise level rises.

3.3 Machine-learning-based inverse problem solver

For an instance ${\varvec{I}}$ in ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$, $p_{j}=P(j \in J_{0} | {\varvec{I}})$ is the probability that the j-th subpopulation is a source. The vector ${\varvec{p}} = (p_{1}, p_{2}, \cdots , p_{N})$ is a posterior probability. The role of a machine-learning-based solver is to find the optimal mapping function ${\varvec{p}} = f({\varvec{I}})$ with respect to ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ and predict the source $j_{0}$ for ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$ by eq.(2).

$$\begin{aligned} j_{0} = \underset{1 \le j \le N}{{\text {argmin}}} \ p_{j}. \end{aligned}$$

(2)

Prediction succeeds if $j_{0} \in J_{0}$. An accuracy measure A is defined as the number of successful predictions as a fraction of $| {\varvec{\mathcal {D}}}^{\mathrm{[te]}} |$. A hit score measure H is a cumulative search area, which is investigated according to the ranking of $p_{j}$ in a descending order to discover the source, as a fraction of the total area (Le Comber et al. 2011). The area is measured by the number of grids. Larger accuracy and smaller hit score indicate more effective prediction. The most effective prediction achieves $A = 1$ for $|J_{0}| = 1$ and $H = |J_{0}|/N$ for $|J_{0}| \ge 1$. Mere guesswork results in $H = |J_{0}|/(|J_{0}|+1)$ regardless of N.

A convolutional neural network (CNN) is a state-of-the-art deep learning algorithm (Kowalek et al. 2019; Wei et al. 2018). ${\varvec{I}}$ and ${\varvec{p}}$ are processed as if they were the input pixels in a monochrome image ($32 \times 32 \ (N = 1024)$) and the output likeliness that the image contains a particular image object ($C=1024$ categories). Figure 2 shows the configuration of an 8-layer convolutional neural network ($L=8$) in this work. Large kernels ($K=8 \times 8$) are applied in calculating convolution. Filters decide the number of parallel channels ($F=32, 64$) in the output. A convolutional neural network with 10 or more hidden layers works as an excellent image classifier empirically. Based on the rule of thumb, the configuration in Fig. 2 is decided by an exhaustive search to improve A and H around $L=10$, $K=10$, and $F=10 \sim 10^{2}$. L, K, and F are kept as small as possible. Note that the configuration is not proven the best but verified effective experimentally. The hidden layers 2 through 7 employ leaky rectified linear functions. The activation and output layer employs a soft-max function $f({\varvec{I}}) \propto \exp (\sum _{k=1}^{N} a_{jk} I_{k} + b_{j})$ where $a_{jk}$ and $b_{j}$ are weight parameters. The number of weight parameters is $W_{\textrm{CNN}} = \mathcal {O}(CLKF)$. $W_{\textrm{CNN}} \sim 10^{6}$ if $L, K, F \sim 10$. In total, $7.2 \times 10^{5}$ weight parameters in Fig. 2 are optimized by an adaptive-moment algorithm with cross-entropy $E = - \sum _{j=1}^{N} p_{j} \log p_{j}$ as a loss function.

A naive Bayes classifier (NBC) is based on a maximal likelihood estimation. The likelihood $L(j \in J_{0}) = P ({\varvec{I}} | j \in J_{0})$ (Li and Saad 2021; Torrisi et al. 2021) cannot be computed reliably unless ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ is dense enough. Instead in this work, it is evaluated approximately by $L(j \in J_{0}) \propto \max _{j} {\varvec{I}} \cdot {\varvec{I}}_{j} $ where ${\varvec{I}}_{j} \in {\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$. It does not have any weight parameters. Optimization of weight parameters is omitted, but maximization of ${\varvec{I}} \cdot {\varvec{I}}_{j}$ for prediction consumes more time as $| {\varvec{\mathcal {D}}}^{\mathrm{[tr]}} |$ increases. A random forest classifier (RFC) is a bootstrap aggregation algorithm for an ensemble of decision tree classifiers. An ensemble of $T=100$ decision tree classifiers is optimized with respect to cross-entropy. The number of weight parameters is $W_{\textrm{RFC}} = \mathcal {O}(CT) \sim 10^{5}$. It is nearly in the same order as $W_{\textrm{CNN}}$ in Fig. 2. A multinomial logistic regression (MLR) is a generalized linear model with a soft-max function. The weight parameters are optimized by a Newton conjugate gradient algorithm. The number of weight parameters is $W_{\textrm{MLR}} = \mathcal {O}(C^{2}) \sim 10^{6}$. It is as large as $W_{\textrm{CNN}}$.

4 Results and discussion

4.1 Discovery of single source

Table 1 Time average of accuracy $\bar{A}$ of the 4 machine-learning-based solvers (CNN, NBC, RFC, MLR)

Full size table

Table 1 shows the time average of accuracy $\bar{A}$ (over the all instances in ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$) of the 4 machine-learning-based solvers (CNN, NBC, RFC, MLR) for discovering a single source ($|J_{0}|=1$). The geo-space is either homogeneous ($r_{\textrm{ht}} = 0.05$) or heterogeneous ($r_{\textrm{ht}} = 0.45$). Infection is either growing ($R=1.2$), steady ($R=1$), or declining ($R=0.9$). The combinations of the values of $r_{\textrm{ht}}$ and R are referred to by the 6 test cases a, b, $\cdots $, f. Note that the solvers do not rely on any prior information on SIR epidemiological compartments, probability parameters, and heterogeneity of a space ($\alpha , \beta , \varGamma , t, \varDelta t, \tau , r_{\textrm{ht}}$) in executing optimization and prediction.

The convolutional neural network achieves $\bar{A} \ge 0.85$ for growing infection (test cases a and d) and outperforms the others. Growing infection in a heterogeneous geo-space (test case d) is of much practical significance. The convolutional neural network works most effectively ($\bar{A}=0.88$) in this test case. The performance deteriorates to $\bar{A}=0.51$ for declining infection (test case f). The signal gets less conspicuous. But it still achieves the best performance. The multinomial logistic regression fails to work in contrast. The solvers must be capable of learning and expressing a complex mapping function ${\varvec{p}} = f({\varvec{I}})$. The convolutional neural network performs a variety of non-linear computations for any correlated subpopulations in each hidden layer while the others iterate similar computations for the all subpopulations. The naive Bayes classifier and random forest classifier work moderately. They can be an alternative for the test cases d and b, c respectively. But the performance tends to be unstable on a case-by-case basis. It may be difficult to decide confidently when to apply them if $r_{\textrm{ht}}$ and R are not known. The convolutional neural network is not the only choice but the first choice for public health practitioners in investigating a variety of cases.

It is also verified that the convolutional neural network works comprehensively for $\varGamma $ in other probability distributions ($\ne \varGamma $ in 3.1). For example, the time average of accuracy in the test case d is $\bar{A}=0.91$ for a uniform distribution $\gamma _{jk} \sim \mathcal {U} (0, 2\mu )$, where $\mu = (1-r_{\textrm{ht}}) \gamma $ and $\sigma =\mu / \sqrt{3}$, and $\bar{A} = 0.93$ for a normal distribution $\gamma _{jk} \sim \mathcal {N} (\mu , \mu /2)$. An interesting finding is that the convolutional neural network works more accurately for heterogeneous geo-spaces. It may ensue because the speed of time evolution is different. The probability of movements is a constant for connections. But the nodal degree is smaller and the hop-count between subpopulations is larger for larger $r_{\textrm{ht}}$. The effective flux is smaller, the consequent relaxation time is larger, and the trace of the initial state may be kept more recognizable. This may no longer be true after relaxation progresses.

Figure 3 shows the accuracy A(t) as a function of time $t \ (\tau = 25)$. The scatter plots (a), (b), $\dots $, (f) correspond to the 6 test cases a, b, $\dots $, f in Table 1. The 4 curves in a scatter plot represent the 4 solvers (CNN, NBC, RFC, MLR). The convolutional neural network almost always works most effectively of the 4 solvers. Note that the naive Bayes classifier works as effectively in the test case d, and the random forest classifier works slightly more effectively at $t=150$ through 200 in the test cases b, c. These solvers can be alternatives under particular conditions of spreading. In terms of the overall performance in a variety of test cases, it is concluded that the convolutional neural network is still the first choice, and the naive Bayes classifier can be the second choice for helping the practitioners verity the prediction. Performance degradation for declining infection (test cases c and f) at $t \ge 100$ is evident as the correlation with the initial state ${\varvec{I}} (0)$ is almost lost (normalized autocorrelation coefficient $\rho (t) \le 0.05$).

4.2 Discovery of multiple sources

Figure 4 shows the hit score H(t) of a convolutional neural network as a function of time t for discovering a single source ($|J_{0}|=1$ for the scatter plot (a)) and multiple sources ($|J_{0}| =2, 3$ for the scatter plots (b), (c)). The test cases a, b, $\cdots $, f are the same as in Table 1 and Fig. 3. In the scatter plot (a), the hit scores $H(t) \le 1.5 \times 10^{-3}$ remains near the lower bound ($1/N = 9.8 \times 10^{-4}$) over time for growing infection in both homogeneous and heterogeneous geo-spaces (test cases a and d). It means that the second or third candidates from ${\varvec{p}}$ are promising nearly equally even if the first candidate $j_{0}$ in Eq. (2) fails and impairs the accuracy A.

The performance characteristics in the scatter plots (b) and (c) are different from those in the scatter plot (a). The performance worsens as $|J_{0}|$ increases. But the hit score H is still below 0.01 and 0.1 in the early stage of disease spreading, and converges steadily to the upper bound 0.67 and 0.75 for $|J_{0}|=2$ and 3 respectively. The solver always works more effectively for a heterogeneous geo-space (test cases d, e, and f). Note that such instances as in Fig. 1b and c are absent in ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ and the mapping function ${\varvec{p}} = f({\varvec{I}})$ is optimized with respect to ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$. The number of patterns for the initial state $\mathcal {O} (N^{|J_{0}|})$ is too large for $|J_{0}| > 1$. Those findings imply that discovery of small number of sources is potentially achievable, at least in the early stage of spreading, by learning a limited number of instances for a single source.

5 Conclusion

In this work, it is demonstrated that a convolutional neural network works effectively as an inverse problem solver for epidemiological geographic profiling. In terms of the time average of accuracy $\bar{A}$, it outperforms the other 3 machine-learning-based solvers in any test cases of growing, steady, and declining infection in heterogeneous and homogeneous geo-spaces. The largest accuracy ($\bar{A} \ge 0.85$)) is achieved and the hit score ($H \le 1.5 \times 10^{-3}$) remains near the lower bound over time in discovering a single source ($|J_{0}| = 1$) for the most significant test case (growing infection in a heterogeneous geo-space). Discovering multiple sources ($|J_{0}| > 1$) is feasible potentially as well merely by learning of a limited number of instances in the dataset for a single source.

It is anticipated that growing infection in a heterogeneous geo-space generates a unique complex spatio-temporal pattern in the trajectories. The complex pattern may be left as a hidden signal on the source after a simple linear pattern (correlation) is lost in the noise. It is known as a profound design principle that the complexity of a solver is managed to match that of the dataset. Therefore, exploring less complex meta-heuristics as in some previous is a futile effort. Those findings corroborate the conclusion that the complex nature of a convolutional neural network deserves the first-choice solver for detecting such a hidden signal. Note that the number of hidden layers and filters, and the kernel settings are arbitrarily flexible, and the configuration in this work is not proven the best. Performance limits with respect to the nature of inverse problems and the capability of solvers are yet to resolve as future theoretical, experimental, and empirical works.

Recently, large datasets become available by collecting real-time data from sensors in every corner of human activities. Then in turn, machine learning aids investigators in identifying the initial state (and even boundary conditions, external forces, stochastic terms, and equational forms possibly) and in solving miscellaneous inverse problems for a reaction-diffusion process. For example, the inverse problem solvers in this work can be applied to analyzing such an economic system as a financial market, a stock exchange, a supply chain, and a trade network. The price of a stock or a group of stocks in an industry sector is a time-dependent variable (similar to the variables $I_{j}(t)$ in 3.1). Variable snapshot vectors represent the entire market conditions (similar to the instances ${\varvec{I}}$ in the datasets ${\varvec{\mathcal {D}}}^{\mathrm{[tr]}}$ and ${\varvec{\mathcal {D}}}^{\mathrm{[te]}}$ in 3.2). Retrospective nowcasting is an inverse problem to discover the recent source incident on a stock which causes a big impact on the current market volatility. Such an analysis is beneficial to regulatory authorities, financial institutions, and investors in making a decision to deal with systemic risk, financial crisis, and a crash of markets. An inductive approach as in this work can be an essential investigation tool for data-driven mathematical sciences, econophysics, and sociophysics, and may replace conventional reductionistic modeling approaches.

Data availability

The datasets in this work are not shared openly but available from the author on reasonable request.

References

Canter D, Coffey T, Huntley M, Missen C (2000) Predicting serial killers’ home base using a decision support system. J Quant Criminol 16:457
Article Google Scholar
Cesar Henrique C, da Fontoura CL (2011) Identifying the starting point of a spreading process in complex networks. Phys Rev E 84:056105
Article Google Scholar
Daglis T, Melissaropoulos IG, Konstantakis KN, Michaelides PG (2022) The impact of COVID-19 on global stock markets: early linear and non-linear evidence for Italy. Evolut Inst Econ Rev 19:485
Article Google Scholar
Grinberg N, Joseph K, Friedland L, Swire-Thompson B, Lazer D (2019) Fake news on Twitter during the 2016 U.S. presidential election. Science 363:374
Article Google Scholar
Ikeda Y, Iyetomi H (2018) Trade network reconstruction and simulation with changes in trade policy. Evolut Inst Econ Rev 15:495
Article Google Scholar
Janczura J, Kowalek P, Loch-Olszewska H, Szwabinski J, Weron A (2020) Classification of particle trajectories in living cells: machine learning versus statistical testing hypothesis for fractional anomalous diffusion. Phys Rev E 102:032402
Article Google Scholar
Kowalek P, Loch-Olszewska H, Szwabinski J (2019) Classification of diffusion modes in single-particle tracking data: feature-based versus deep-learning approach. Phys Rev E 100:032410
Article Google Scholar
Kuyyamudi C, Chakrabarti AS, Sinha S (2019) Emergence of frustration signals systemic risk. Phys Rev E 99:052306
Article Google Scholar
Le Comber SC, Rossmo DK, Hassan AN, Fuller DO, Beier JC (2011) Geographic profiling as a novel spatial tool for targeting infectious disease control. Int J Health Geogr 10:35
Article Google Scholar
Li B, Saad D (2021) Impact of pre symptomatic transmission on epidemic spreading in contact networks: a dynamic message-passing analysis. Phys Rev E 103:052303
Article Google Scholar
Maeno Y (2013) Transient fluctuation of the prosperity of firms in a network economy. Phys A 392:3351
Article Google Scholar
Maeno Y (2016) Detecting a trend change in cross-border epidemic transmission. Phys A 457:73
Article Google Scholar
Maskawa J, Murai J, Kuroda K (2013) Market-wide price co-movement around crashes in the Tokyo Stock Exchange. Evolut Inst Econ Rev 10:81
Article Google Scholar
Menin OH, Bauch CT (2018) Solving the patient zero inverse problem by using generalized simulated annealing. Phys A 490:1513
Article Google Scholar
Nguyen VAT, Vural DC (2017) Morphological inversion of complex diffusion. Phys Rev E 96:032314
Article Google Scholar
Ortega E, Machado D, Lage-Castellanos A (2022) Dynamics of epidemics from cavity master equations: susceptible-infectious-susceptible models. Phys Rev E 105:024308
Article Google Scholar
Paluch R, Gajewski LG, Suchecki K, Holyst JA (2021) Impact of interactions between layers on source localization in multilayer networks. Phys A 582:126238
Article Google Scholar
Shi C, Zhang Q, Chu T (2022) Source estimation in continuous-time diffusion networks via incomplete observation. Phys A 592:126843
Article Google Scholar
Torrisi G, Annibale A, Kühn R (2021) Overcoming the complexity barrier of the dynamic message-passing method in networks with fat-tailed degree distributions. Phys Rev E 104:045313
Article Google Scholar
Vilk O et al (2022) Unraveling the origins of anomalous diffusion: from molecules to migrating storks. Phys Rev Res 4:033055
Article Google Scholar
Vosoughi S, Roy D, Aral S (2018) The spread of true and false news online. Science 359:1146
Article Google Scholar
Wei Q, Jiang Y, Chen JZY (2018) Machine-learning solver for modified diffusion equations. Phys Rev E 98:053304
Article Google Scholar

Download references

Funding

Open Access funding provided by Meiji University.

Author information

Authors and Affiliations

School of Interdisciplinary Mathematical Sciences, Meiji University, Tokyo, 1648525, Japan
Yoshiharu Maeno

Authors

Yoshiharu Maeno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yoshiharu Maeno.

Ethics declarations

Conflict of interest

The author declares that the author has no Conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This research was supported partly by Tokutei-Kojin (Specific Individual) Research Funds from Meiji University.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Maeno, Y. Inverse problem solver for epidemiological geographic profiling. Evolut Inst Econ Rev (2024). https://doi.org/10.1007/s40844-024-00281-3

Download citation

Received: 11 September 2023
Accepted: 16 March 2024
Published: 30 March 2024
DOI: https://doi.org/10.1007/s40844-024-00281-3

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Inverse problem solver for epidemiological geographic profiling

Abstract

Similar content being viewed by others

Explaining Causal Influence of External Factors on Incidence Rate of Covid-19

Deep Spatial Q-Learning for Infectious Disease Control

PolSIRD: Modeling Epidemic Spread Under Intervention Policies

1 Introduction

2 Related works

3 Materials and methods

3.1 Reaction–diffusion process for disease spreading

3.2 Dataset for epidemiological geographic profiling

3.3 Machine-learning-based inverse problem solver