1 Introduction

Are spatial econometric methods suitable to model the recent global spread of the coronavirus (COVID-19)? Overall, there is a vast literature on spatial data analysis with a rather heterogeneous treatment of spatial dependence and spillover structures (see, for example, Zoller 2004). Spatial econometric specifications (LeSage and Pace 2009) make the spatial dependence structures among the observations particularly explicit (Bivand et al. 2015). These approaches aim at highlighting the importance of directly accounting for spatial interdependencies among the observations under scrutiny and have recently gained momentum particularly in the regional science and economics literature. Spatial econometric specifications use so-called spatial weight matrices in order to augment standard classical linear model specifications by allowing for spatial spillovers among the observations (see, LeSage and Pace 2009; Anselin 2013). Multiple previous studies highlight the importance of spatial econometrics for capturing disease transmission pathways and network effects, as well as in quantifying the magnitude of spatial spillovers (see, e.g. Emch et al. 2012; Wang et al. 2015; Chagas et al. 2016).

For modelling the recent coronavirus pandemic, allowing for spatial dependence appears of predominant importance. In the beginning of the crisis (January 2020), the virus was often seen as a Chinese and later an Italian problem. However, due to the rapid spread of the virus across the globe, almost all Western countries reacted by employing drastic measures to contain or delay the further spread of the virus. These measures entail considerable restrictions in every day social and economic life. Most notable policy measures comprise closings of borders and general curfews to curb the spread. At the beginning of the outbreak, national linkages thus appear of particular importance to explain the spread across the globe.

In this paper we use spatial econometric frameworks to model COVID-19 infections across the globe. Our spatial econometric specification pays particular attention to different types of spatial dependence including information on geographic neighbourhood, travel linkages as well as trade ties. By using daily data on country-specific infections, we moreover allow the strength of spatial dependence to vary over time. Both features appear to be of particular importance for adequately modelling the spread of the virus.

2 A spatial dynamic panel model

We make use of a spatial autoregressive (SAR) dynamic panel model by particularly focussing on time-dependent spatial dependence structures during the spread of the virus, which we aim to model for N countries over T days. The model can be written as follows:

$$\begin{aligned} \varvec{y}_t = \rho _t\varvec{W}(q_t) \varvec{y}_{t} + \varvec{\alpha } + \varvec{y}_{t-1}\beta + \varvec{\varepsilon }_t \end{aligned}$$
(2.1)

where \(\varvec{y}_{t}\) is an \(N \times 1\) vector of country-specific infections at time t (\(t=1,\ldots ,T\)), \(\varvec{W}(q_t) \varvec{y}_{t}\) denotes the so-called spatial lag, and \(\varvec{\alpha }\) contains country-specific trends. The \(N \times 1\) vector of innovations \(\varvec{\varepsilon }_t\) is assumed iid normal with zero mean and variance \(\sigma ^2\).

Note that the spatial autoregressive term in (2.1) comprises a time-dependent \(N\times N\) spatial weight matrix \(\varvec{W}(q_t)\). The (scalar) parameter \(\rho _t\) measures the strength of spatial autocorrelation and is also modelled time-variant. Positive (negative) values of \(\rho _t\) indicate positive (negative) spatial autocorrelation, with sufficient stability condition \(\rho _t \in (-\,1, 1)\) for all t (see LeSage and Pace 2009).

The spatial weight matrix \(\varvec{W}(q_t)\) captures spatial linkages between countries i and j (with \(i,j = 1,\ldots ,N\)) at time t. \(\varvec{W}(q_t)\) is non-negative and row-stochastic, with entries treated as known constants. Its typical element \(\left[ \varvec{W}(q_t)\right] _{ij} = 0\) for \(i = j\) and \(\left[ \varvec{W}(q_t)\right] _{ij} > 0\), if there is a considered link between countries i and j. Moreover, \(\left[ \varvec{W}(q_t)\right] _{ii} = 0\), as no region is considered to be its own neighbour.

The time-variant discrete parameter \(q_t \in \{1,\ldots ,P\}\) governs the choice of a spatial weight matrix from P alternatives, eventually of different classes, and is to be estimated. In the spirit of work by Piribauer and Crespo Cuaresma (2016) and Fischer and LeSage (2015), such a specification allows to trace the nature of global spatial spillovers over time in a flexible way. Bayesian estimation techniques allow to efficiently deal with such flexible mixture specifications of spatial weight matrices. In addition to the time-variant spatial autoregressive parameter, the proposed model specification thus also allows to study the nature of spatial spillover processes over time.

In the spatial econometrics literature, spatial spillovers are defined as the impacts to a region’s outcome variable due to shocks in other regions. By reformulating Eq. (2.1) to its reduced form representation, \(\varvec{y}_t = \left( \varvec{I}-\rho _t\varvec{W}(q_t)\right) ^{-1}( \varvec{\alpha } + \varvec{y}_{t-1}\beta + \varvec{\varepsilon }_t)\), the spatial multiplier matrix \(\left( \varvec{I}-\rho _t\varvec{W}(q_t)\right) ^{-1}=\sum _{r=0}^\infty \rho _t^r\varvec{W}(q_t)^r\) for a given spatial weight matrix is governed by the spatial autoregressive parameter \(\rho _t\) (for a thorough discussion, see LeSage and Pace 2009). By accounting for both spatial and non-spatial components, our proposed model thus allows to distinguish between intra- and cross-regional transmission processes.

Since country-specific infections are non-negative count data, one may argue that a spatial econometric specification for count data might be more suitable. However, in such cases explicit spatial autoregressive specifications are much more difficult to handle (see Bivand et al. 2014 or LeSage and Pace 2009). As a benchmark model, we therefore also apply a popular alternative in the spatial econometric literature put forward by Fischer et al. (2006), by using a Bayesian Poisson framework with an explicit spatial autoregressive error structure. This model (henceforth labelled Poisson spatial error model—Poisson SEM) can be written as follows:Footnote 1

$$\begin{aligned} \varvec{y}_t&\sim {\mathcal {P}}(\varvec{\lambda }_t) \nonumber \\ \varvec{\lambda }_t&= \exp \left( \varvec{\alpha } + \varvec{y}_{t-1}\beta + \varvec{\nu }_t \right) , \end{aligned}$$
(2.2)

where \({\mathcal {P}}(\cdot )\) denotes the Poisson distribution and the \(N \times 1\) vector \(\varvec{\lambda }_t\) is the mean of the Poisson process. The \(N \times 1\) vector \(\varvec{\nu }_t\) captures country specific random effects. We follow work by LeSage et al. (2007) and introduce a source of spatial dependence via the random effects vector \(\varvec{\nu }_t\), which is assumed to follow a first order spatial autoregressive process:

$$\begin{aligned} \varvec{\nu }_t&= \phi _t \varvec{W}(q_t) \varvec{\nu }_t + \varvec{\eta }_t \qquad \varvec{\eta }_t = {\mathcal {N}}(0,\tau _t^2 \varvec{I}). \end{aligned}$$

The scalar \(\phi _t \in (-\,1,1)\) measures the strength of spatial autocorrelation at time t, and \(\varvec{W}(q_t)\) is specified as above. The disturbance error vector \(\varvec{\eta }_t\) is assumed to be iid normally distributed with zero mean and \(\tau ^2\) variance.Footnote 2

3 Data, spatial weights and estimation

Fig. 1
figure 1

First confirmed cases by country

We make use of the COVID-19 database provided by the Johns Hopkins University (Dong et al. 2020). The data set contains information on daily case counts of confirmed infections for 99 countries. Our dependent variable is the logged daily number of confirmed cases per 100,000 inhabitants per country from January 23rd 2020 to March 28th 2020. Population data is obtained from the World Bank development indicators database. Figure 1 depicts a map of the countries in our sample along with the country-specific timing of the coronavirus outbreak.Footnote 3

We allow for four alternative types of spatial weight matrices: first, the presence of common borders (Source: Eurostat). Second, the intensity of bilateral flight connections, measured in terms of the total number of weekly commercial flights between country pairs (Source: openflights.com). Third, trade intensity, using aggregate trade in 2010 USD to construct a k-nearest neighbour type spatial weight matrix, where the seven partners with highest aggregate trade value were considered (Source: WITS trade database).Footnote 4 Finally, international agreements guaranteeing the free movement of people, where if a country pair has signed a treaty allowing for free movement of people, they are considered to be neighbours (Source: Krisztin and Fischer 2015).

For the SAR model in Eq. (2.1), estimation is carried out using well-known Bayesian Markov-chain Monte Carlo (MCMC) sampling techniques. For the Poisson SEM model in Eq. (2.2), we use the sampling algorithm laid out in Fischer et al. (2006) and Frühwirth-Schnatter et al. (2009). For both specifications we use rather non-informative Gaussian priors for the parameters \(\varvec{\alpha }\) and \(\beta\), with zero mean and a large variance of \(10^8\). For \(\sigma ^2\) and \(\tau ^2_t\), we similarly elicit a common inverted gamma specification \(\mathcal {IG}(0.001,0.001)\).Footnote 5 For \(\rho _t\) and \(\phi _t\) we use a standard beta prior specification as suggested in LeSage and Pace (2009). For the choice of the alternative spatial weight matrices \(q_t\), a non-informative uniform prior specification is used.

Sampling for the parameters \(\varvec{\alpha }\), \(\beta\), \(\sigma ^2\), and \(\tau ^2\) is done using standard conditional posteriors (see, LeSage and Pace 2009; Fischer et al. 2006). To account for the heterogeneity of \(\rho _t\) and \(\phi _t\), we use a sampling strategy discussed in LeSage and Chih (2018). Sampling for \(q_t\) is discussed in Piribauer and Crespo Cuaresma (2016).Footnote 6

4 Results

We present the MCMC estimation results obtained from 20,000 posterior draws, where 10,000 were discarded as burn-ins.Footnote 7 Estimation results are summarised in Fig. 2 and Table 1. Table 1 summarizes estimation results for the SAR and Poisson SEM specifications. The country-specific intercepts are excluded for the sake of brevity.

A first inspection reveals that the temporal autoregressive parameter \(\beta\) is—as expected—statistically significant in both specifications. Forecasting our fitted SAR model results—on average across countries and time—in a doubling rate of confirmed infections every 4 days. The \(R^2\) values of 0.905 (SAR) and 0.979 (Poisson SEM) indicate that the proposed models appear to fit the data very well.

Table 1 Posterior parameter estimates for the spatial dynamic panel SAR specification and Poisson SEM specification

Section (ii) of Table 1 contains the average posterior inclusion probability for the pre and post lockdown period of the four spatial weight matrices under scrutiny. Our models indicate that the main channel of virus transmissions before restrictive actions were taken can be attributed to international flight passengers and to a lesser degree to treaties guaranteeing free movement of people. However, international trade and common land borders played a comparatively minor role. Also, our models suggest that for the period after most countries entered a certain form of lockdown, neither of the four transmission channels predominantly explains the further spreading.

Turning attention to Fig. 2, panel (a) reveals a more differentiated picture of spatial spillovers in the SAR specification. The top panel contains the daily posterior inclusion probabilities of the four spatial weight matrices under scrutiny. The bottom panel depicts the smoothed daily posterior median estimate for the spatial autoregressive parameter \(\rho _t\).

First, the daily estimates of spatial dependence confirm that initially spatial spillovers played a key and also statistically significant role in virus transmission: at the end of February only 4 countries within the sample (and only Italy within the EU) introduced flight suspension and \(\rho _t\) is significant. After March 11th 2020, when the majority of European countries started introducing quarantine policies, closed their border crossings and reduced air travel, spatial autocorrelation becomes insignificant. Interestingly, in the mid of March, the spatial autoregressive parameter even becomes negative for a rather short period. One explanation for the short-term negative degree of spatial autocorrelation could be that some countries employed particularly tight travel bans to regions with high infection rates in order to reduce own-country virus transmission, resulting in dissimilarities (negative spatial autocorrelation) among spatial units. By March 24th, when over 3.5 billion people were living in some form of quarantine, spatial spillovers have become insignificant.

Second, the key role of flight travel is revealed to be only of importance in the first two months of our sample. Coupled with the suspension of international airline traffic in the beginning of March, the posterior importance of flight connections across all country significantly decreases. This is accompanied by a slight increase in the posterior importance of other measures of neighbourhood, particularly free movement of people within the EU.

The results of the Poisson SEM specification in Fig. 2, panel (b) largely confirm our findings from the SAR model. Spatial dependence markedly decreased after lockdown was implemented in the majority of countries and stayed insignificant in the following months. Furthermore, flight travel was the most significant spatial transmission channel between countries. The wider error margins and comparatively higher spatial dependence parameters are a direct result of the fundamental differences between the SAR and Poisson SEM specifications.

Fig. 2
figure 2

Posterior parameter estimates for the spatial dynamic panel SAR (a) and Poisson SEM (b) specifications. Top panels indicate posterior inclusion probability of spatial weight matrices over time. Bottom panels indicate the smoothed posterior median of the spatial autoregressive parameter \(\rho _t\) and \(\lambda _t\), respectively

Overall, we find that spatial dependence notably decreased over time as countries elected to implement regional and global movement restrictions in the form of border closures, flight suspensions and even complete lockdowns. Additionally, air travel connections played a particularly important role in the early stages of the virus, suggesting that the shutdown of flight connections was indeed an important measure in mitigating early transmissions.

5 Concluding remarks

We examine the virtues of spatial econometric specifications to study the spread of the recent coronavirus pandemic. Our results indicate that cross-country spatial spillover processes, specifically via international flight connections, played a particularly important role in the early stages of the virus spread. When countries began restricting airline traffic, the relative importance of flight connections as well as spatial autocorrelation decreased. Overall, our results imply that the shutdown of international airports and border closures were important policies to prevent further spillovers across countries. Moreover, the estimated spatial dependence structures seem to trace the process of the virus spread very well. Recent spatial econometric methods thus appear useful tools to model the global spread of coronavirus.