1 Introduction

By the end of 2019, a novel coronavirus, the severe acute respiratory syndrome coronavirus 2, in short Covid-19, was detected in the city of Wuhan in China (Guan et al. 2020; Zhang et al. 2020). What seemed first to cause only a country wide epidemic has fast developed into a worldwide pandemic, which is by now judged as the greatest human challenge since the Second World War. By the time of October 2021, the pandemic has caused about 246 million infections and more than 4.9 million deaths worldwide.

As a policy response, several countries implemented severe lockdowns, including school and store closings, curfews and travel bans to reduce human interaction and thereby the spread of the virus. While aiming to save human life, such restrictions also limit human freedoms. Hence, they require a continuous evaluation of whether they are sufficient to protect vulnerable populations and whether more or less strict measures can and should be considered. One quantitative indicator of policy effectiveness and pandemic severity is the case fatality ratio (WHO 2020; Ioannidis 2021), which is based on the current number of deaths relative to the lagged number of infections. The appropriate length of the time lag to be used, however, is heavily debated (Baud et al. 2020; Kim and Goel 2020). An accurate measure is essential as both, over- and underestimation can have severe consequences, such as not taking the pandemic seriously or causing redundant panic (Kim and Goel 2020).

The literature reflects the uncertainty about the appropriate length. Chrusciel and Szybka (2021), for example, investigate the time lag in several European countries and find that the lag between reported cases and deaths averages around 7 days. Vanella et al. (2020) also investigate the appropriate time lag for European countries to calculate an unbiased case fatality ratio and conclude that a lag between five and ten days should be used. Testa et al. (2020) determine the lag for US counties and find a substantially longer lag. They conclude that deaths often occur two to eight weeks after the onset of the first symptoms. Wilson et al. (2020) determine the case fatality ratio for China and find that a 13-day lag best describes the pattern of the data.

To add to this debate, I provide rigorous evidence for the length of the time lag between a rise in infections and a subsequent rise in death cases using daily panel data for Germany’s 16 federal states from May 2020 to December 2020. In comparison to the studies outlined above, however, I rely on a spatial econometric approach to consider the dynamic spatial spread of the virus. I thereby also add to the literature that analyses the spatial dynamics of Covid-19 and quantifies the spatio-temporal interactions and spillovers of the virus (e.g., Guliyev 2020; Krisztin et al. 2020; Ehlert 2021). Specifically, I estimate different spatial econometric models that allow not only to consider the infections in a given state but also those in neighboring states, so-called spatial lags. Spatial econometric models are useful to model interaction effects between geographical units (Elhorst 2021) and are hence especially useful to model the global-spreading and infectious nature of the coronavirus.

I determine the lag between the wave of infections and the wave of Covid-19 death cases in Germany’s federal states and their geographical spread with four spatial models: I use (1) a model with spatial lags in the independent variables (SLX model), (2) a non-dynamic and (3) a dynamic model that include spatial lags in both, the dependent and independent variable, together with space and time fixed-effects (Spatial Durbin Models with fixed-effects), and (4) a dynamic Spatial Durbin Model with common factors to capture potential strong cross-sectional dependence (i.e., cross-sectional averages instead of time fixed-effects). All of these spatial models allow to derive the direct (same-state) and indirect (other states) effects of infections on death cases. Specifically, the direct effect shows the marginal effect on the number of death cases driven by a change in the number of infections in the own state, while the indirect effect is the marginal effect on the number of death cases driven by an increase in infections in all other states, while both effects taken together form the total effect (Golgher and Voss 2016).

All four spatial econometric models have the advantage that the spillover effects are fully flexible, i.e., they can take any value, which makes them more suitable for economic research focusing on spillover effects in comparison to a Spatial Autoregressive Model (SAR) or Spatial Autoregressive Combined Model (SAC) (Elhorst 2021). Moreover, the two dynamic models allow to differentiate between short-run (SR) and long-run (LR) effects, i.e., is possible to assess whether the effect of an increase in infections on the number of death cases fades out over time, if, for example, effective policies are implemented to protect the most vulnerable.

The remainder of this article proceeds as follows. In Sect. 2, I present the data used for the analysis. In Sect. 3, I outline the empirical strategy and briefly discuss the differences in the spatial econometric models. I present the results in Sect. 4 and conclude in Sect. 5.

2 Data

2.1 Daily infections and daily death cases

The main variables used in this article are the number of daily new infections and the number of daily reported death cases due to Covid-19 in Germany at the federal state level. There are in total 16 federal states and the German Robert Koch Institut (RKI) provides and updates the respective numbers on a daily basis since the very beginning of the pandemic.Footnote 1 Yet, as the number of cases in the first and second wave are not directly comparable (due to different testing strategies and measures taken), I make use of the data from the period between May 1, 2020 and December 31, 2020, hence dropping observations from the first wave’s peak as well as those that were recorded after the vaccination campaign started. The latter is done to get an estimate of the time lag between infections and deaths without the vaccine being available, hence, an unbiased measure of when the rise of deaths should be expected after a rise of infections.

2.2 Population and geospatial data

To make the numbers of infections and deaths comparable across the different states, I calculate them per 100,000 inhabitants in the respective state. The corresponding population numbers are derived from the Bundesamt für Kartographie und GeodäsieFootnote 2 which provides geospatial shape files for Germany, including population numbers for each state. The same source is used to derive the geographical coordinates for each state, which are essential to construct the spatial weight matrix used in spatial models (see Sect. 3).

2.3 Intensive care cases

To be able to derive an unbiased estimate of the time lag between infections and death cases, I control for the number of daily new intensive care (IC) cases per federal state, i.e., the number of patients that have contracted the corona disease and are under intensive care. This variable is also calculated per 100,000 inhabitants. The number of daily new IC cases might be positively related to both variables of interest, the number of infections and deaths. Hence, including the variable in the regression avoids an upward omitted variable bias. Information on the number of patients in IC is provided by the RKI and the German Interdisciplinary Group for Intensive and Emergency Care (DIVI).Footnote 3 The data are available on a daily level and per federal state.

2.4 Temperature data

The medical literature reports that respiratory diseases and infections follow seasonal cycles and are susceptible to temperature (e.g., Shaman et al. 2010; Martinez 2018). Similar patterns have recently been confirmed for the coronavirus. Ma et al. (2020), in their analysis on temperature and humidity effects on Covid-19 deaths in China, find that higher temperatures lead to increases in death cases. Wu et al. (2020) contrarily identify a negative relationship between temperature and new cases and deaths in a study on 166 countries. To control for the potential confounding factor of temperature, I include the daily mean temperature on the state level in my analysis. Data on daily meteorological conditions is received from the German Weather Service (DWD). The DWD provides daily time series data for 538 weather stations across Germany, including the coordinates of each station. I merge each station to the corresponding federal state, using their respective coordinates, and average the mean temperature across all stations.

Figure 1 shows the spatial distribution of the number of infections per 100,000 inhabitants per state cumulated over the observation window from May 1, 2020 to December 31, 2020. The figure shows a clear variation in the number of cumulative Covid-19 cases across all federal states, with states in the south and south-west being the most severely affected. From a first glimpse, cross-sectional dependence seems to be present as there is some clustering of states with similar infection rates.

Fig. 1
figure 1

Cumulative Covid-19 infections in Germany. Notes: The map shows the cumulative number of Covid-19 infections per 100,000 inhabitants between May 1, 2020 and December 31, 2020 for the 16 federal states in Germany

To confirm the presence of cross-sectional dependence statistically, I use Pesarans’s CD-test for panel data (Pesaran 2004, 2015), which is based on the pairwise correlation coefficients of the different geographical units. I also estimate the standardized cross-sectional exponent \(\alpha\) (Bailey et al. 2016) to determine the degree of cross-sectional dependence. The results are shown in Table 1 and confirm the cross-sectional dependence for the number of infections. Cross-sectional dependence is also confirmed for deaths, the number of IC patients and temperature. The correlation is in each case significant at the 1% level and very strong as the exponent \(\alpha\) of nearly one indicates. This suggest that a model with common factors instead of time fixed-effects might be better suited to capture the strong cross-sectional dependence (Ciccarelli and Elhorst 2018; Elhorst et al. 2021). Proxying for common factors can be empirically implemented by either cross-sectional averages (pioneered by Pesaran 2006) or principle components (e.g., Bai 2009; Shi and Lee 2017; Bai and Li 2021); the former approach following Pesaran (2006) is applied in this study.

Table 1 Pesaran’s CD test for cross-sectional dependence and Bailey et al.’s cross-sectional exponent α

3 Empirical strategy

To identify the time lag between the number of cases and the number of deaths, I rely on a spatial econometric approach, which allows me to model direct effects within the geographical unit of interest, while accounting for possible interaction effects with neighboring spatial units. Elhorst (2021) differentiates between three types of interactions: (1) exogenous interactions effects (i.e., the independent variable in one spatial unit can affect the dependent variable in other spatial units), (2) endogenous interaction effects (i.e., the dependent variable in one spatial unit can affect the dependent variable in other spatial units), and (3) interaction effects among the error terms (i.e., the error term in one unit can affect the error term in other units). In the modeling approach outlined below, I consider endogenous and exogenous interaction effects.

I use the number of daily deaths per 100,000 inhabitants within each state as dependent variable and regress it on the same-day number of new infections per 100,000 inhabitants in the same state as well as on the number of new infections per 100,000 inhabitants up to 14 days lagged in time in the same state. As explained above, I do control for the number of new patients with the Corona-disease per 100,000 being treated in IC as well as for mean temperature. Both variables also enter the regression with in total 14 time lags. Moreover, I include state fixed-effects to control for all non-time-varying effects that are specific to each state (such as being located at the sea or at the border to a different country) and which might impact the number of daily death cases and infections. Also, day fixed-effects are included to control for time effects affecting all states similarly. This will, for example, control for the effect of the whole country being in lockdown as well as for week-day specific patterns, such as fewer tests during the weekend.

So far, this model corresponds to a ‘standard’ distributed lag model with fixed-effects. To account for the spatial dependence, I further add the spatial components to the model. The full model reads

$$\begin{aligned} Deaths_{it} & = \tau Deaths_{it - 1} + \rho \mathop \sum \limits_{j = 1}^{N} w_{ij} Deaths_{jt} + \eta \mathop \sum \limits_{j = 1}^{N} w_{ij} Deaths_{jt - 1} \\ & \quad + \mathop \sum \limits_{k = 0}^{14} \beta_{k} Infections_{it - k} + \mathop \sum \limits_{k = 0}^{14} \theta_{1k} \mathop \sum \limits_{j = 1}^{N} w_{ij} Infections_{jt - k} \\ & \quad + \mathop \sum \limits_{k = 0}^{14} \delta_{k} Temperature_{it - k} + \mathop \sum \limits_{k = 0}^{14} \theta_{2k} \mathop \sum \limits_{j = 1}^{N} w_{ij} Temperature_{jt - k} \\ & \quad + \mathop \sum \limits_{k = 0}^{14} \gamma_{k} Intensive_{it - k} + \mathop \sum \limits_{k = 0}^{14} \theta_{3k} \mathop \sum \limits_{j = 1}^{N} w_{ij} Intensive_{jt - k} \\ & \quad + \alpha_{i} + \sigma_{t} + u_{it} , \\ \end{aligned}$$
(1)

where \(w_{ij}\) are the elements of the matrix \({\varvec{W}}\), which is a 16 × 16 row-normalized binary contiguity matrix, with its elements being equal to one when state i and state j are neighboring states and zero otherwise, and all diagonal elements \(w_{ij}\) with \(i = j\) equal to zero.Footnote 4\(\alpha_{i}\) and \(\sigma_{t}\) are the federal state and time fixed-effects, respectively, i.e., a binary dummy variable is included for each state i and each day t (minus one to avoid the dummy variable trap). \(Deaths_{it}\), \(Infections_{it}\) and \(Intensive_{it}\) are the respective numbers of death cases, infections and intensive care patients per 100,000 inhabitants at day t in state i. \(Temperature_{it}\) is the mean temperature at day t in state i. The subscript j always denotes all states excluding state i. \(u_{it}\) is the error term. \(k\) is an index running from 0 to 14 and indicates the respective time lag.

I estimate five different specifications of the outlined model by constraining several of the parameters. In Model (1), I set \(\tau = \eta = \rho = \theta_{1k} = \theta_{2k} = \theta_{3k} = 0\), resulting in a standard distributed lag model with spatial and time fixed-effects. With these restrictions, no spatial spillover effects can occur.

In Model (2), I set \(\tau = \eta = \rho = 0\), such that the model is the SLX model (spatial lag in independent variables). This model allows only to derive long-run and only local spillover effects, i.e., spatial effects cannot change in size over time and only neighboring states can affect each other—which is, however, an implausible assumption given the global spread of the virus.

In Model (3), therefore, I allow \(\rho\) to be different from 0 and thereby for global spillover effects. I only set \(\tau = \eta = 0\), leading to the static Spatial Durbin Model with fixed-effects. Finding the parameter \(\rho\) to be significantly different from zero implies global spillover effects, since also non-neighboring states can (through other states) affect each other. The static model, however, does not allow to differentiate between short-run and long-run effects, i.e., it does not allow to investigate whether the effects become smaller or larger over time.

Allowing all parameters to differ from zero results in Model (4), the dynamic Spatial Durbin Model with spatial and time fixed effects. As outlined above, adding the dynamic element allows also to differentiate between short-run and long-run effects, i.e., to assess whether the effect size and significance changes over time or remains constant. In Model (5), I drop the time fixed-effects and instead account for common factors by including the cross-sectional averages (following Pesaran 2006) for \(Deaths_{it} , Deaths_{it - 1}\), \(Infections_{it}\), \(Intensive_{it}\) and \(Temperature_{it} .\) This approach should be more suitable to account for the strong cross-sectional dependence that was indicated by the high cross-sectional exponent α.

The parameters of the model in Eq. (1) are estimated via Ordinary Least Squares (OLS) if no endogenous interaction effects are included (i.e., in Model (1) and (2)), and by quasi maximum likelihood (QML) if endogenous interaction effects are considered (Model (3)-(5)), since estimating spatial models with endogenous interaction effects via OLS will result in inefficient estimates (LeSage and Pace 2009). The bias correction approach of Yu et al. (2008) is applied in the two dynamic models to yield centered confidence intervals (see also Lee and Yu 2010).

One drawback of the bias-corrected QML estimator is that it will be inconsistent under heteroskedasticity (Bai and Li 2021). While the standard errors for all models are adjusted for possible heteroskedasticity by applying the robust standard error approach proposed by Driscoll and Kraay (1998) (see also Hoechle 2007; Belotti et al. 2017), this will not eliminate the possibility that the QML estimator is not centered around the true value. A quasi-maximum likelihood estimator for dynamic spatial panel data models with common factors that is consistent in the presence of heteroskedasticity has been proposed by Bai and Li (2021), who also show that the bias can reach large magnitudes, especially if the sample size is small. I therefore also present the results of their proposed QML estimator and contrast it with the results from the non-heteroskedasticity robust estimator.Footnote 5

The parameters \(\eta ,{ }\rho ,{ }\theta_{1k} ,{ }\theta_{2k}\) and \(\theta_{3k}\) jointly determine the spatial interaction effects. The main parameters of interest to determine the time lag are the parameters \(\beta_{k}\) and \(\theta_{1k}\). However, the point estimates cannot directly be interpreted as the direct effects and the spillover effects (except in the SLX model), but have to be calculated separately. Specifically, as soon as endogenous interaction effects enter the regression, the direct effect is calculated as the mean diagonal element of the models’ N × N matrix of partial derivatives ((1-\(\tau\))I − (ρ + \(\eta\))W)−1k I + Wθk], where I is the identity matrix. The indirect effect is the mean row sum of the non-diagonal elements of the same matrix (LeSage and Pace 2009; Debarsy et al. 2012; Elhorst 2014, 2021). To differentiate between the short- and long-run effects, the respective parameters \(\tau\) and \(\eta\) have to be set equal to zero.Footnote 6 The corresponding standard errors are computed via Monte Carlo simulations (LeSage and Pace 2009). A complete and detailed overview for the formulas of the marginal effects (for short- and long-run direct and indirect effects) as well as for the respective t-values or standard errors can be found in Elhorst (2014, pp. 25 and 105) and in Belotti et al. (2017, p. 146).

4 Results

Table 2 presents the regression coefficients for each of the models outlined above, while Table 3 presents the corresponding (short-run and long-run) marginal direct effects and spillover effects.

Table 2 Estimation results of the lag between Covid-19 infections and deaths using different model specifications
Table 3 Direct effects and spillover effects for infections in the short-run (SR) and long-run (LR)

Column (1) in Table 2 displays the results for the standard linear distributed lag model with fixed-effects, in which the point estimates of the parameters can be interpreted directly. It shows, as expected, that the number of daily infections is a strong determinant of the number of deaths cases. An interesting pattern can be observed: the number of the same day infections positively relates to the same day number of deaths cases, while the first lag is negative. This suggests that some form of mortality displacement is present, i.e., a temporary increase in death cases is followed by days with decreased death cases. While this phenomenon has especially been observed during heat waves (see for example Deschênes and Moretti 2009; Karlsson and Ziebarth 2018), studies also investigate this phenomenon in the context of Covid-19 (Michelozzi et al. 2020; Cerqua et al. 2021). Afterwards, the effects become again significantly positive at lag eight, followed once by negative effects at nine and ten days lagged, and then turning once again positive and significant at lag 11–14. Hence, this first basic model suggests that the number of Covid-19 death cases consistently rises after an increase in number of infections after about 11 days. The coefficients for the daily mean temperature are insignificant, while the coefficients for the intensive care variable are first partly negative and become positive after about one week (coefficients for temperature and intensive care cases are shown in Table 4 in the Appendix). This model, however, does not yet account for spatial interdependence, reflected also in the value of the CD-test for the residuals. It shows that the test statistic does not lie within the interval [− 1.96; 1.96], which is needed to conclude that there is no further cross-sectional dependence in the residuals. At this point it is to mention that the CD-test significantly loses in power as soon as time fixed-effects or common factors are included in a model. Nevertheless, it is used in empirical studies to decide on model fit (see for example Halleck Vega and Elhorst 2016; Ciccarelli and Elhorst 2018), but should always be interpreted carefully and only be considered jointly with other test statistics, namely R-squared, the log-likelihood, and the cross-sectional exponent \(\alpha\). The latter supports the rejection of the model as it is not significantly different from 1.

Moving to the SLX model in Column (2) already improves the model fit as the increased values of R-squared and the log-likelihood indicate. It also allows to make a first statement about local spillovers. They show that the number of infections in neighboring states j also significantly relates to the death cases in state i, yet the coefficients are primarily significantly negative, and only turn positive after a lag of eleven days. Specifically, the spillover effects suggest that an increase in infections in neighboring states leads first to a decrease in own-state death cases with a three to ten-day lag and afterwards to an increase in own-state death cases with an eleven to 13-day lag. In terms of the size, an increase in infections of 1000 per 100,000 inhabitants within a given federal state would lead to about five to six more deaths after 12–13 days, while an increase in infections of 1000 per 100,000 inhabitants in neighboring federal states would lead to one to two more deaths after eleven to 13 days. As before, the direct and indirect effects for temperature are insignificant and those for the intensive care patients support the positive effect after about one week (shown in Table 5 in the Appendix). The residual CD-test statistic jointly with the cross-sectional exponent \(\alpha\) being indifferent from 1, however, do again not allow to conclude that this model captures fully the cross-sectional dependence.

Column (3) displays the results for the non-dynamic SDM with fixed effects, which includes now also an endogenous interaction effect. The coefficient ρ is significantly negative and several of the \(\theta_{1k}\) coefficients also remain negative. Calculating the indirect long-run effects (Table 3) shows that the majority of them are likewise significantly negative, in line with the rather negative than positive spillover effects, as suggested by the results of the SLX model. Exploring the various test statistics reveals that the log-likelihood ratio clearly favors Model (3) over Model (2), yet, the R-squared value is not significantly larger. Moreover, the residual CD statistic is still significant, and \(\alpha\) is still not significantly different from 1, which jointly lead to a rejection of the model in terms of fully capturing the spatial dependence.

Introducing the dynamic model in Column (4) shows a somewhat different picture for the ρ coefficient, which now turns significantly positive. The likewise positive significant value of \(\tau\) suggests serial correlation in the number of deaths. The direct effects are again consistently positive and statistically significant after eleven days, while the indirect effects are again first primarily negative, both in the short- and in the long-run and only turn positive after a lag of eleven days. R-squared becomes somewhat larger in comparison to the static model and also the log-likelihood ratio test rejects the static model in favor of the dynamic model.Footnote 7 Nevertheless, the CD-test statistic still lies outside of the [− 1.96; 1.96] interval and the confidence intervals of the cross-sectional exponent \(\alpha\) still include the value 1, making the dynamic SDM with fixed effects also unsuitable to fully capture the strong cross-sectional dependence. Another aspect leading to a rejection of the model is the increasing size (in absolute terms) of both the direct and indirect effect. Given that several federal states implemented cautionary measures to de-link the number of infections and death cases, one would plausibly expect a decrease in the effect sizes over time.

The only model that is well suited to fully capture the cross-sectional dependence in the data is the dynamic SDM with cross-sectional averages presented in Column (5), which aligns with the cross-sectional exponent α of nearly one that was estimated for the raw data. R-squared and the log-likelihood value both take on the largest value in this model and the residual CD-test statistic lies within the [− 1.96; 1.96] interval and is, hence, no longer significant. The average correlation coefficient of the residuals is now close to zero (− 0.005) and the cross-sectional exponent alpha is now significantly different from 1, with the point estimate below the threshold of 3/4, which indicates that common factors have adequately been addressed (Ciccarelli and Elhorst 2018). Moreover, the sum of \(\tau , \eta\) and \(\rho\) is smaller than 1, which is essential for the model to be stationary. \(\tau\) is statistically significant and negative, supporting again the hypothesis of mortality displacement, i.e., days with high numbers of death cases are followed by days with on average lower numbers of death cases.

Several of the \(\theta_{1k}\) coefficients as well as the coefficient ρ and are statistically significant, η remains insignificant, yet, a LR-test rejects that this variable can be dropped from the model. The significant coefficients jointly indicate that global short-run and long-run spillover effects are present. Interestingly, and in contrast to the model in Ciccarelli and Elhorst (2018), the model does only account for the full cross-sectional dependence when cross-sectional averages of both the dependent and the independent variables are included. Including only the cross-sectional averages of the current and lagged independent variable results in a residual CD-test statistic that is still outside the required interval.

As discussed above, the applied QML estimator might be inconsistent in presence of heteroskedasticity. I therefore present the results of the dynamic spatial Durbin model with common factors using the QML estimator proposed by Bai and Li (2021) that explicitly allows for heteroskedasticity in Table 6 in the Appendix. Common factors in this case are modeled as principal components instead of cross-sectional averages. To ease the comparison with the main model in Column (5) of Table 2, which includes five cross-sectional averages, the model estimated with the Bai and Li (2021) estimator includes five common factors (Column (2), Table 6). For further robustness checks, I also present a model with only three common factors (Column (3), Table 6) and a model where the space–time lag is excluded (Column (4), Table 6). The results of these estimations show that some form of bias seems indeed to be present in the main results, primarily for the η, \(\beta_{k}\) and \(\theta_{1k}\) coefficients. While τ and ρ are fairly similar, the coefficient for the space–time lag (η) turns negative in Columns (2) and (3) of Table 6. Also, the coefficients for infections (\(\beta_{k} )\) and spatially weighted infections (\(\theta_{1k}\)), especially the later lags after 11 or 12 days, are somewhat smaller (in absolute terms) when estimated with the QMLE of Bai and Li (2021). Overall, this implies that the main results presented in Tables 2 and 3 should be cautiously interpreted as upper-bound estimates.

Turning back to the main results, Table 3 shows that the direct effects of increases in the own-state infections are, as expected, positive and significant, which is in line with the hypothesis that increases in infections are followed by increases in deaths. Regarding the size, an increase in the number of infections by 1000 per 100,000 leads to an additional one to three deaths cases in the same state. The time lag between an increase in infections and a subsequent increase in death cases can now be determined at approximately 12 days. Before the 12th lag, the coefficients for both, the direct and the total effect switch between a negative and positive sign, but become consistently positive afterwards in each of the five models.

The estimated indirect effects also remain significantly negative (now even for the longer lags after eleven days) in Model (5), supporting the previous two models in the hypothesis that increases in infection rates in other states decrease own-state death cases. These negative spillover effects might seem counterintuitive at first, but could be an indicator that states choose to implement stricter preventive policy measures in their own state when the infections in other states rise (assuming that own infection rates would remain constant). This finding is also well in line with previous studies. Ehlert (2021), for example, finds that in a given German district the number of deaths and infections shrink with higher numbers in early Covid-19 infections in neighboring districts. Krisztin et al. (2020) also find a temporarily negative degree of global spatial autocorrelation and explain their finding with temporary travel bans to regions with excessive infection rates to prevent transmission to the own country. Nevertheless, these negative spillovers seem to contradict the infectious nature of the virus and should be investigated in further detail in future studies to assess whether the hypothesis of negative spillovers caused by stricter policy measures can be confirmed.

The size of the short-run (SR) and long-run (LR) effects for the preferred Model (5) is now also in line with the theoretical considerations, as they show that the size of all three types of effects becomes smaller in size over time. While this is somewhat blurred in Table 3 given the decimal rounding, taking a closer look reveals that the direct effect after 14 days in the short-run (equal to 0.0036) is reduced by 10% to 0.0033 in the long-run. The same pattern is found for the indirect effect; for example, the indirect short-run effect after 14 days is equal to − 0.0022 but is reduced in absolute terms to − 0.0020. This reduction in absolute size indicates that policy measures that have been taken in the time horizon considered in this study seem to have been effective in protecting the most vulnerable individuals and thereby have slightly weakened the link between the rising number of infections from the rising number of death cases.

The coefficients for temperature remain insignificant in every model and both, the short-run and long-run direct and indirect effects are essentially zero (shown in Table 5 in the Appendix). Hence, in contrast to studies in different settings (Ma et al. 2020; Wu et al. 2020), temperature seems not to affect the number of daily deaths in Germany. Yet, this can at least to some extent be explained by the inclusion of time fixed-effects or common factors, respectively, as they absorb a substantial part of the within-day variation in temperature across states. The number of patients being treated in intensive care seems to relate to the number of deaths (Table 5 in the Appendix), yet no clear positive or negative pattern emerges.

5 Conclusion

The severe restrictions and policies that are implemented to limit the dynamic spread of Covid-19 require a continuous evaluation to assess their effectiveness. In this regard, the case fatality ratio is one of the most important quantitative measures (WHO 2020; Ioannidis 2021). Yet, its calculation requires an exact estimate of the time lag between infections and deaths. In this article, I analyze this lag using daily spatial panel data of the 16 German federal states over the period May 2020 to December 2020. My results suggest that the curve of death cases follows the curve of infections with a lag of approximately 12 days.

Moreover, to account for the spatial spread of the virus, I use spatial econometric models that allow for exogenous and endogenous spillover effects between states. I find that only the dynamic Spatial Durbin Model with state-fixed effects and cross-sectional averages can fully capture the strong cross-sectional dependence found in the data. While the overall effect is largely positive across the different models, the direct and indirect effects differ in sign. The direct effects show that an increase in infections within a given state leads to an increase in the number of reported death cases after about 12 days. Contrarily, the indirect effects (spillovers) are significantly negative, indicating that an increase in infections in neighboring states (and via the global spillovers also in all other states) reduces the number of death cases in that given state. This could be explained by preventive measures taken by the state government as soon as they observe rising infections in other states. While this is well line with the findings of earlier studies (Krisztin et al. 2020; Ehlert 2021), future studies will be needed to confirm this hypothesis and to investigate the causal underlying reasons in more detail. Attention needs also to be paid to the fact that the quasi-maximum likelihood estimator applied in this study, which is widely used in the micro- and macro-econometric spatial literature, suffers from the shortcoming of being inconsistent in case of underlying heteroskedasticity (Bai and Li 2021). A comparison of the results with the heteroskedasticity-robust estimator proposed by Bai and Li (2021) revealed that the effect sizes of the increase in death cases after an increase in infections presented and discussed in this article need to be cautiously interpreted as upper-bound estimates. The discussion of the consequences of assuming homoskedasticity in case of underlying heteroskedasticity should therefore receive broader attention in future studies concerning dynamic spatial panel data.