1 Introduction

In the ecological and biological literature so-called predator-prey (PP) models have become an established framework to describe patterns of predation, i.e., the killing and consumption of one species (the prey) by another (the predator). The instantaneous rate of prey consumption per predator—the so-called functional response—is both an essential feature of PP interaction and an important dimension for structuring a broad variety of continuous time PP models (see Jost and Ellner 2000, for an overview).Footnote 1 The debate on the functional response (see Jost and Ellner 2000, for a comprehensive discussion) largely fluctuates around the functional specification and subsequent estimation of the so-called predation function in continuous time. A few studies have provided empirical assessments of PP models in discrete time.Footnote 2 Unlike continuous patterns of PP interaction, discrete time models can capture multiperiod dynamics in form of regressive or autoregressive patterns without explicit reference to an underlying theoretical model. Adding to this merit, it is worth to note that empirical sampling might take place at low frequencies such that important intergenerational dynamics could be subject to (implicit or unspecified) aggregation. In this context, Ewing et al. (2007) (henceforth, ERE07) are among the first to argue convincingly in favour of the informational content of vector autoregressive (VAR) models for the dynamic analysis in PP systems.Footnote 3 To materialize their claim, ERE07 provide an in-depth analysis of the semi-daily population data from the classic ciliate experiments of Veilleux (1979) in their digitalized form of Jost and Ellner (2000). In this work, we reconsider the analysis of ERE07. Apart from replication exercises, we complement their analysis with the quantification of instantaneous responses among prey and predator population growth rates in either direction.

ERE07 can be considered as the first scholars who illustrate dynamic model implications by means of impulse response analysis (or so-called innovation accounting). In particular, they adopt the generalized impulse response functions (generalized IRFs, GIRFs) of Pesaran and Shin (1998). Since GIRFs lack a strictly structural (or causal) interpretation (see, e.g. Kim 2013), however, ERE07 remain ultimately silent about a steady state assessment of the functional response. For example, GIRFs displayed in Fig. 2 of ERE07 suffer from the counter intuitive interpretation that positive surprises to the prey population invoke a significant deterioration of predator population growth. The structural model approach that we undertake in this work largely benefits from recent contributions to structural VAR (SVAR) analysis (see Kilian and Lütkepohl 2017, for an up to date textbook treatment of identification in SVARs). Specifically, we trace the stochastic variations in the PP system back to unique (i.e., identified) non-Gaussian independent components (Lancaster 1954; Comon 1994). The model implied impact effects of the independent components on population growth rates accord with the axiomatic assumptions of the so-called Kolmogorov model as the theoretical and intuitive foundation of many representatives of PP models (Freedman 1980). Conditional on the process history, by implication, an increase of the predator population by one percentage point reduces prey population growth by about 1.2 percentage points, on average.

The remainder of this work is organized as follows: The next Section sketches the data, highlights the (structural) identification problem and puts the independent component analysis (ICA) adopted in this work into the perspective of alternative approaches to identification in SVARs. The main empirical results are provided and discussed in Sect. 3. Section 4 concludes. The Appendix provides an explicit representation of the dependence coefficient of Bakirov et al. (2006) which we employ for the detection of independent components.

2 A structural model for predator-prey interaction

This Section develops the structural model of interest. On the one hand, we highlight stylized characteristics of the population growth rates under scrutiny and point to the structural VAR as a means to translate correlation estimates into causal linkages. On the other hand, we review briefly alternative approaches (theory- vs. data-based) to model identification, and argue in favour of ICA for the subsequent analysis.

2.1 The data and conditional correlations

The population data analysed in ERE07 have been determined in mixed ciliate experiments of B. Veilleux with two freshwater organisms, the prey Paramecium aurelia and the predator Didinium nasutum (Veilleux 1979). Specifically, with initial densities of 45 (Paramecium) and 15 (Didinium) individuals per milliliter (ml) ciliates were cultured in Petri dishes containing six ml of culture medium maintained at \(27^\circ \text {C}\). To achieve sustained patterns of PP interaction, B. Veilleux elaborated on diverse scenarios combining controlled amounts of a bacterial nutrient for feeding the prey species (Cerophyl) with additions of Methyl Cellulose which governed thickness of the medium and thereby the movements of the ciliates. Non-destructive population counts of individuals per ml were made at the semi-daily frequency that accounts for intergenerational dynamics, since the number of fissions per day is larger than two for both species. While the fission rates might depend on the actual laboratory experiment (see Figures 2 and 3 of Veilleux 1979) further complexity of intergenerational dynamics arises from the fact that fission rates of the predator (Didinium) are by about 30% larger than those of the prey (Paramecium). To achieve sufficient accuracy of the population counts each quote is the average of eight independent countings. For intermediate choices of nutrition available to the prey species (i.e., cerophyl concentration) the experiments resulted in stable population patterns that are likely to reflect ‘inherent properties of the system and not an artifact due to the experimental method’  (Veilleux 1979, p. 796).Footnote 4

Fig. 1
figure 1

Analysed semi-daily population growth rates of the predator Didinium nasutum and the prey Paramecium aurelia. The x-axis indicates the days of the experiment. The dashed horizontal line indicates the beginning of the reduced sample (RS)

Table 1 Descriptive statistics: mean and standard deviations (sd) of prey and predator population growth rates in the full sample (51 observations) and the reduced sample (49 observations)

As in ERE07 the population growth rates of interest are defined as

$$\begin{aligned} n_t = \frac{N_{t}-N_{t-1}}{N_{t-1}} \text{ and } p_t = \frac{P_{t}-P_{t-1}}{P_{t-1}} , \end{aligned}$$
(1)

where \(N_{t}\) and \(P_{t}\) are individuals per ml of Paramecium (the prey) and Didinium (the predator), respectively. The observation index t indicates measurement intervals of 12 h. The total number of observations (\(N_t,P_t\)) is 52. Hence, the number of available growth rates (\(n_t,p_t\)) is 51. ERE07 provide ADF statistics in favour of stationarity of both growth rates.

Table 2 Conditional correlations

The graphical display of the growth rates in Fig. 1 shows two markedly outlying observations which might reflect the initialisation of the experimental design. Therefore, this work complements the analysis of full sample data in ERE07 (51 observations) with a robustness analysis for a subsample that excludes the first two recorded growth rates (49 observations). Table 1 documents mean growth rates and standard deviation (sd) statistics for the two samples. Moreover, Table 2 documents some conditional correlations obtained from ad-hoc regression models. Stylized bivariate regressions as well as richer dynamic structures obtain a common effect direction for both alternative choices of the dependent variable (\(n_t\) or \(p_t\)). Nevertheless, the documented regression outcomes allow for an interesting insight into conditional correlations among both growth rates \(n_t\) and \(p_t\). While the stylized bivariate regression with full sample information is characterized by positive coefficient estimates, these conditional correlations turn negative after conditioning on the set \(\{n_{t-i},p_{t-i},\,i=1,2,3\}\) as additional explanatory variables. Apparently, the model estimation by means of stylized bivariate regressions suffers from omitted dynamics. Put differently, dynamic effects appear as an important building block for the understanding of the PP system at hand. Corroborating established features of linear projections, however, the (dynamic) regression models are apparently not suitable to discover causal relationships by means of conditioning \(n_t\) on \(p_t\) (or \(p_t\) on \(n_t\)). It is likely that with semi-daily sampling the populations of the species (and, hence, population growth rates) are subject to joint endogeneity. As a result, using contemporaneous population or growth rate data as explanatory variables will induce correlation with regression residuals and, hence, estimation bias. Bias free estimation approaches would deserve instrumental information or the a-priori knowledge of the causal structure. Both preconditions hardly apply in the present context of the analysis of PP interaction among freshwater organisms.

Seeing that cross equation linkages hide causal structures in ad-hoc (dynamic) regressions, the subsequent steps do not only take account of dynamic profiles but also aim at a regression design composed of unrelated equations. For this purpose we first develop the VAR model of ERE07 further towards a structural representation tracing model residuals back to orthogonal (and identified) shocks. Subsequently, the isolation of these shocks provides a view at PP linkages in a form of unrelated equations.

2.2 The VAR model and its structural form

Let the vector \(y_t\) collect the growth rates defined in (1) as \(y_t=(n_t,p_t)'\) such that quotes on prey (predator) population growth are ordered first (second) in \(y_t\). To analyse competitive struggle in a PP model, the bivariate VAR model of order q (VAR(q)) reads as

$$\begin{aligned} y_t= & \nu + A_1 y_{t-1}+A_2 y_{t-2}+\ldots +A_q y_{t-q} + \varepsilon _t,\,t=1,2,\ldots ,T \end{aligned}$$
(2)

(see Lütkepohl 2007, for a textbook treatment of VARs). Taking account of unconditional effects \(\nu \) is a vector of intercepts, and \(\varepsilon _t\) is an uninformative model residual with mean zero and covariance \(\Sigma \), i.e., \(\varepsilon _t \sim (0,\Sigma )\). The model parameters (\(A_i,\,i=1,\ldots ,p,\) and \(\Sigma \)) and residuals (\(\varepsilon _t\)) in (2) can be consistently estimated by means of OLS estimation. Conditioning on predetermined variables, however, the model in (2) does not provide an explicit representation of the contemporaneous linkages that operate among the two species.

To address structural or contemporaneous relations, it has become a widespread strategy to regard the residuals in \(\varepsilon _t\) as the outcome of a linear and non-singular weighting scheme that applies to latent and orthogonal shocks collected in a vector \(\xi _t\). Rewriting the model residuals accordingly as \(\varepsilon _t = D \xi _t\) obtains the SVAR counterpart of (2),

$$\begin{aligned} y_t= & \nu + A_1 y_{t-1}+A_2 y_{t-2}+\ldots +A_q y_{t-q} + D \xi _t. \end{aligned}$$
(3)

Without loss of generality, the orthogonal shocks in \(\xi _t\) are often assumed to have (marginal) variances of unity, i.e., \(\text{ Cov }[\xi _t]=I\) with I denoting the identity matrix. The residual covariance \(\Sigma \) from (2) and the weighting parameters in D (see (3)) are related such that

$$\begin{aligned} \Sigma = \text{ Cov }[\varepsilon _t] = \text{ Cov }[D\xi _t] = E[D\xi _t\xi _t'D'] = DD'. \end{aligned}$$

Highlighting the core identification problem, it is easy to see that the space of potential covariance factors \(\mathcal{D}=\{D|DD' = \Sigma \} \) is infinite. For instance, a parametrized space of covariance decompositions obtains from \(\Sigma =D_{\theta }D_{\theta }'=GR_{\theta }R_{\theta }'G'\), where G is a lower triangular Cholesky factor (\(\Sigma =GG'\)) and

$$\begin{aligned} R_{\theta } = \left( \begin{array}{cc} \cos (\theta ) & -\sin (\theta ) \\ \sin (\theta ) & \cos (\theta ) \\ \end{array} \right) ,\; 0 \le \theta \le {\pi }, \end{aligned}$$

is a rotation matrix with rotation angle \(\theta \). Accordingly, the covariance decomposition space could be defined as \(\mathcal{D} = \{D_{\theta }|D_{\theta }=GR_{\theta },0 \le \theta \le \pi \}\), and implied orthogonalized residuals are \({\xi }^{(\theta )}_t = D_{\theta }^{-1} {\varepsilon }_t\).

2.3 Identification

The identification problem in structural VAR analysis consists of the selection of one specific D matrix from \(\mathcal{D}\). Since all members of \(\mathcal{D}\) align with the model outlines in (2) and (3), it is immediate that the identification problem in structural VARs cannot be solved without further external assumptions. In the case of a conditional Gaussian model, for instance, all rotations of \(\xi _t\) (or equivalently of D) are informationally equivalent. Identifying external information might stem from two sources, (i) theoretical considerations about the variables in \(y_t\) (i.e., about PP responses to the elements in \(\xi _t\)), or (ii) from stylized features of the distributions of \(\varepsilon _t\) or \(\xi _t\).

We next encounter four alternative identification schemes in a stylized manner. For a detailed and up-to-date textbook treatment of identification in SVARs we refer the reader to Kilian and Lütkepohl (2017). Throughout we consider the case of bivariate SVARs. Extending the arguments to higher model dimensions is straightforward. Subsequently, we briefly discuss particular merits of theory- vs. data-based identification, and argue for ICA as a means to identify the PP SVAR model. Finally, we provide a short outline of the extraction of independent components.

2.3.1 Identification schemes

  1. 1.

    Recursive systems: The estimate of the covariance \(\Sigma \) comprises three single (co)variances. To identify the four structural parameters in D, Sims (1980) has suggested to impose a (lower) triangular structure on this matrix. With such a restriction, the unrestricted parameters in D can be quantified straightforwardly. However, this identification scheme comes with the imposition of a hierarchical structure of the model. By implication, the structural properties depend on the ordering of the variables in \(y_t\).

  2. 2.

    Identification by means of sign restrictions: As formalized in (3) typical elements of D quantify the impact effects of a unit shock \(\xi _{it}=1,\,i=1,2,\) on the observables in the system. While this effect is unknown in its exact metric form, it is often the case that theoretical considerations imply specific effect directions. In the econometric literature Faust (1998), Canova and De Nicolo (2002) and Uhlig (2005) have pioneered and advocated the identification of SVARs by means of such consensual effect directions. In the framework of PP models a theory guided shock labelling could benefit from theoretical knowledge of how the organisms under scrutiny react to surprising changes of living conditions (i.e., shocks). For instance, in real life scenarios such shocks might materialize in the form of the invasion of new species or changing environmental conditions. In this context it is worth observing that many modern representatives of PP models are based upon the Kolmogorov model - a minimal set of two differential equations coupled with axiomatic assumptions (Freedman 1980). Population changes as implied by the Kolmogorov model are

    $$\begin{aligned} \frac{dN}{dt}=g(N,P)N \text{ and } \frac{dP}{dt}=h(N,P)P, \end{aligned}$$

    and the axiomatic assumptions concern the so-called predation function g (\(\frac{\partial g}{\partial P} < 0\)) and, moreover, \(\frac{\partial h}{\partial N} > 0\). Approximating the continuous time model in discrete time with \(dt = 1\) obtains \(n_{t+1} \approx g(N_t,P_t)\) and \(p_{t+1} \approx h(N_t,P_t)\). Hence, a positive shock which primarily impacts on preys implies \(n_{t+1}>0\) (direct effect) and \(p_{t+1}>0\) (indirect effect channelled through h). Moreover, shocks which are directly beneficial to growth rates of the predator population are likely to impact adversely on the living conditions of the preys (indirect effect channelled through g).Footnote 5 Casting these considerations in the framework of the discrete time SVAR model obtains the following effect directions for typical elements of D, denoted \(d_{ij}\): \(d_{ii}>0,i=1,2\), \(d_{12}<0\) and \(d_{21}>0\). Hence, a theory supported sign pattern for the structural matrix D is

    $$\begin{aligned} {D}_{theo}=\left( \begin{array}{cc} + & - \\ + & + \end{array} \right) . \end{aligned}$$
    (4)

    Providing a clear and unique effect pattern, the sign restrictions in \({D}_{theo}\) qualify for theory based model identification. Practically, a set of models identified by means of sign restrictions

    $$\begin{aligned} \widetilde{\mathcal{D}} = \{D = GR_{\theta }|d_{ii}>0,\,i=1,2,\,d_{21}>0,\,d_{12}<0 \} \end{aligned}$$

    obtains from random sampling of rotation angles \(\theta ,\, 0 \le \theta \le \pi \), and storing all implied covariance decompositions that align with the sign pattern provided in (4). Once an analyst has sampled a sufficient number of admitted structural matrices, the set of identified models \(\widetilde{\mathcal{D}}\) might be described by common statistical measures (e.g., mean, standard deviation, quantiles).

  3. 3.

    Identification through heteroskedasticity: The identification through heteroskedasticity as pioneered by Rigobon (2003) and developed further by, amongst others, Lanne and Lütkepohl (2008) and Bacchiocchi and Fanelli (2015) exploits information on observable heteroskedasticity in the residuals \(\varepsilon _t\). Assume that an analyst has access to two distinct covariance estimates, denoted \(\Sigma ^{(1)}\) and \(\Sigma ^{(2)}\), that she obtains from disjoint subsamples of the data. In the bivariate case this yields six single (co)variance estimates to solve the identification problem. The two covariances allow for a reparameterization as \(\Sigma ^{(1)}=DD'\) and \(\Sigma ^{(2)}=DWD'\), where W is a diagonal matrix. Hence, the six estimated (co)variances can be mapped in a one-to-one manner to the six unknown parameters in D and W, thereby identifying D. It is evident, however, that unique identification by means of heteroskedasticity requires that the parameters in W must be different. In other words, the switch from the first to the second covariance regime must not affect the variances of the shocks proportionally.

  4. 4.

    Identification by means of independent components: Early results on the properties of linear combinations of (non)Gaussian random variables have motivated to obtain structural shocks as independent components. While Lancaster (1954) has stated that rotation invariance is unique to independent Gaussian random variables, results of Comon (1994) imply that the weighting scheme \(\varepsilon _t = D \xi _t\) allows a unique recovery of the independent shocks in \(\xi _t\) from \(\varepsilon _t\) if and only if at most one element of \(\xi _t\) exhibits a Gaussian distribution. Taking advantage of these results, amongst others, Moneta et al. (2013), Gouriéroux et al. (2017) and Lanne et al. (2017) have developed identification schemes for SVARs which differ with respect to the space of admitted D matrices or the distributional assumptions made for the structural shocks in \(\xi _t\). Non-parametric approaches to ICA have been suggested or applied, for instance, by Matteson and Tsay (2017) and Herwartz (2019). Following their lines of reasoning an estimator of D could be obtained from solving the minimization problem

    $$\begin{aligned} \widehat{D} = \text{ argmin}_{D \in \mathcal{D}}\{\text{ Joint } \text{ dependence } \text{ of } \text{ elements } \text{ in } \xi _t\text{, } \text{ with }\; \xi _t = D^{-1} \varepsilon _t\} \end{aligned}$$
    (5)

    by means of a general non-parametric test of the null hypothesis of joint independence.

2.3.2 A comparative discussion of identification schemes

To support the choice for a particular identification scheme in empirical practice it is helpful to first contrast theory- vs. data based identification. While theory-based identification ensures that obtained structural shocks have meaningful impact properties, such schemes are also prone to intermingling model assumptions and conclusions (Uhlig 2005). In contrast, data-based identification typically provides sufficient external information such that otherwise overidentifying restrictions could be subjected to testing. At the downside, however, structural shocks obtained from these identification schemes do not necessarily allow for a sound theoretical interpretation. In this work we opt for a data-based approach to identification, and investigate subsequently if the retrieved shocks allow for a theory-conform interpretation in the spirit of the theoretical sign pattern of \(D_{theo}\) in (4). For this purpose it is important to notice that an identified matrix D is unique only up to column signs and the column ordering. With \(d_i\) denoting the i-th column of D it holds that \(\Sigma = DD' = \sum _{i=1}^2 d_{i}d_{i}'\). Hence, the matrices \([d_1,d_2]\), \([-d_1,d_2]\) or \([-d_2,d_1]\) are all in line with the covariance restriction \(\Sigma = DD'\).

The application of data-based identification schemes deserves either that the VAR residuals are subject to informative covariance changes or non-Gaussian. While we obtain some evidence in favour of downward shifting variances of VAR residuals (see also Fig. 1), the null hypothesis of proportional variance changes cannot be rejected with 5% significance. In contrast, the evidence against normally distributed VAR residuals is very strong such that we choose ICA for identification. The following paragraph is explicit on the independence statistic used to select D from \(\mathcal{D}\).

2.3.3 Independent component analysis

Solving the minimization problem in (5) requires a statistical quantification of joint dependence. To detect unique independent components we follow the approach of Herwartz (2019) and employ the dependence coefficient introduced by Bakirov et al. (2006). The dependence coefficient, denoted \(\mathcal{C}\), provides a \(\sqrt{T}\) consistent estimator of the population distance of the joint characteristic function of two random variables (in our case \(\xi _{1t}\) and \(\xi _{2t}\)) and the product of both marginal characteristic functions (see the Appendix for an explicit exposition). Similar to absolute estimates of linear correlations, \(\mathcal{C}\) is bounded between zero and unity which indicate the limiting scenarios of independence and complete dependence, respectively. For testing the null hypotheses of independence, \(T\mathcal{C}^2\) is a bounded random variable. Since critical values of \(T\mathcal{C}^2\) are difficult to obtain analytically, Bakirov et al. (2006) suggest a permutation bootstrap approach for the assessment of significance of the test statistic.Footnote 6 To determine independent components practically, one might either minimize \(\mathcal{C}\) or maximize the p-value of \(T\mathcal{C}^2\). In light of simulation results of Herwartz (2019) we opt for the latter and determine the structural matrix \(\widehat{D}=GR_{\hat{\theta }}\) such that

$$\begin{aligned} \hat{\theta } = \text{ argmax}_{\theta }\; p\text{-value }(T\mathcal{C}^2_{\theta }), \end{aligned}$$

where \(\mathcal{C}_{\theta }\) is determined from the sample \(\{ {\xi }^{({\theta })}_t = D_{{\theta }}^{-1} {\varepsilon }_t \}_{t=1}^T\).Footnote 7 Practically, the estimates \(\hat{\theta }\) obtain from a grid search over alternative rotation angles \(\theta =\frac{j\pi }{2}/50,\,j=1,2,3,\ldots ,50\). Extending the grid to \(\frac{\pi }{2} < \theta \le \pi \) results in an informationally equivalent second solution for \(D_{\theta }\) with exchanged column positions.

2.4 Marginal effects

The model in (3) is structural in the sense that observable regression residuals are traced back to orthogonal shocks. As such the model is not explicit on the implied causal relations between the jointly endogenous variables in \(y_t=(n_t,p_t)'\). Multiplying (3) from the left with \(D^{-1}\) obtains a structural model with unrelated residuals and a parametric representation of the conditional linkages among the elements in \(y_t\),

$$\begin{aligned} D^{-1}y_t = D^{-1}\nu + A_1^* y_{t-1}+A_2^* y_{t-2}+\cdots +A_q^* y_{t-q} + \xi _t, \end{aligned}$$
(6)

where \(A_i^*=D^{-1}A_i,\,i=1,2,\ldots ,q\). Let \(\Omega _{t-1}\) denote an information set that comprises the process information up to time \(t-1\). Moreover, let \(d^{(ij)}\) denote a typical element of \(D^{-1}\). Conditional on \(\Omega _{t-1}\), the elements in \(D^{-1}\) describe the marginal causal effects. Since

$$\begin{aligned} D^{-1}y_t = D^{-1}\left( \begin{array}{c} n_t \\ p_t \end{array} \right) = \left( \begin{array}{cc} d^{(11)} & d^{(12)} \\ d^{(21)} & d^{(22)} \end{array} \right) \left( \begin{array}{c} n_t \\ p_t \end{array} \right) , \end{aligned}$$

we obtain after normalization from (6)

$$\begin{aligned} E[n_{t}|p_{t} \cup \Omega _{t-1}] = \frac{-d^{(12)}}{d^{(11)}}p_{t}+\omega _{t-1}^{(n)} \text{ and } E[p_{t}|n_{t} \cup \Omega _{t-1}] = \frac{-d^{(21)}}{d^{(22)}}n_{t}+\omega _{t-1}^{(p)}, \end{aligned}$$
(7)

where \(\omega _{t-1}^{(n)}\) and \(\omega _{t-1}^{(p)}\) are observable conditional on the VAR parameters and \(\Omega _{t-1}\). It is worth noticing that by virtue of their definition in (7) the marginal effects \(-d^{(21)}/d^{(22)}\) obey a similar interpretation in discrete time as the predation function g(NP) in the continuous time Kolmogorov model.

2.5 Impulse response functions

Owing to the rich and complex dynamic parametrization of VAR models, IRFs have become a common practice to illustrate model implied dynamics. To formally derive IRFs as in Chapter 2 of Lütkepohl (2007) the VAR in (2) can be represented in so-called moving average form as

$$\begin{aligned} y_t= & A^{-1}(L)\nu + A^{-1}(L) \varepsilon _t \nonumber \\= & \mu + (\Psi _0 + \Psi _1 L + \Psi _2 L^2 + \Psi _3 L^3 +\cdots) \varepsilon _t, \end{aligned}$$
(8)

where L is the backshift operator such that \(Ly_t = y_{t-1}\) and \(\mu = A^{-1}(L)\nu \). The moving average parameter matrices \(\Psi _i,\,i=0,1,2,3,\ldots \), can be obtained recursively from nonlinear transformations of the autoregressive parameter matrices \(A_i,\,i=1,\ldots ,q\). Specifically,

$$\begin{aligned} \Psi _0 = I_2, \, \Psi _1= \Psi _0 A_1, \Psi _2=\Psi _1 A_1+\Psi _0 A_2,\ldots \end{aligned}$$

and

$$\begin{aligned} \Psi _i = \sum _{j=1}^i \Psi _{i-j} A_j, \; A_j=0 \text{ for } j > q. \end{aligned}$$

The moving average representation is of core importance for purposes of impulse response analysis. For instance, the typical elements of the matrices

$$\begin{aligned} \Theta _h = \Psi _h D, \end{aligned}$$
(9)

characterize the effect of isolated unit shocks in \(\xi _{jt}\) on \(y_{i,t+h}\), the i-th time series variables at horizon h. Obviously, impulse response schemes are specific to the selection of the structural matrix D.

To avoid strong identifying assumptions on the covariance decomposition, the innovation accounting analysis in ERE07 does not rely on impulse responses to orthogonalized shocks. Rather ERE07 make use of the so-called GIRFs of Pesaran and Shin (1998). Formally, GIRFs read as

$$\begin{aligned} \Xi _h = \Psi _h \Sigma {\varvec{e}}_i/\sigma _{ii}, \end{aligned}$$
(10)

where \({\varvec{e}_i}\) is a zero vector with a unit element in its i-th position, and \(\sigma _{ii}\) is the square root of the i-th diagonal element of the residual covariance \(\Sigma \). The qualification of the profiles in (10) as ‘generalized’  IRFs comes from the fact that these statistics are invariant to the ordering of the variables in the system. As argued in Kim (2013), however, GIRFs could be interpreted as an aggregation of impulse responses from VARs with distinct variable orderings. By implication, GIRFs lack effect consistency under the typical case of a non-diagonal residual covariance \(\Sigma \).

3 Structural analysis of predator-prey interaction

In this Section we first highlight that the VAR model of ERE07 is characterized by significant deviations from conditional normality and, hence, that independence scores might be used for characterizing ad-hoc identification schemes in terms of independence of orthogonalized shocks. Subsequently, we pursue the detection of independent components and provide an interpretation of the identified shocks. Furthermore, the Section includes a comparative description of the GIRFs of ERE07 and identified IRFs. Finally, we employ the identified model to assess the contemporaneous causal linkages among the growth patterns of the two species.

3.1 Non-Gaussianity and recursive model structures

ERE07 argue in favour of a VAR model with \(q=3\), as it minimizes the BIC and obtains serially uncorrelated estimates of \(\varepsilon _t\). After fitting a VAR(3) model to the bivariate vector of population growth rates, the residual covariance matrices estimated from the full and reduced sample read, respectively, as

$$\begin{aligned} \widehat{\Sigma }^{(FS)}=\left( \begin{array}{cc} 1.952 & -1.178 \\ -1.178 & 3.116\end{array} \right) /100 \text{ and } \widehat{\Sigma }^{(RS)}=\left( \begin{array}{cc} 1.664 & -1.035 \\ -1.035& 3.064 \end{array} \right) /100. \end{aligned}$$
(11)
Table 3 Normality tests for VAR residuals (\(\varepsilon _{it},\,i=1,2\))

While the removal of initial outlying observations reduces markedly the residual variances of both variables, the residual correlations are significantly negative, namely −0.477 and −0.458 for the full and the reduced sample, respectively. To uncover the causal relationships behind these empirical correlations by means of unique independent components, it is essential that the data are non-Gaussian. Table 3 provides outcomes from normality tests applied to VAR residuals. Both the Shapiro-Wilk (SW) and the Jarque-Bera (JB) test statistics for the null hypothesis of normally distributed marginal residuals provide strong evidence against the normality assumption. JB summary statistics indicate non-Gaussianity also for the joint distribution.

Given significant deviations from the Gaussian model, it is tempting to order alternative ad-hoc orthogonalization schemes with respect to their potential to align with notions of independent (rather than merely orthogonal) shocks. Cholesky factors of the residual covariance depend on the ordering of the variables in \(y_t\) (see the discussion in Section 2.3.1). Alternatively, for a given variable ordering one might distinguish a lower (denoted \(D_l\)) and an upper triangular Cholesky factor (\(D_u\)). The left hand side and middle panels of Table 4 document estimated alternative covariance decomposition factors. In terms of joint independence the orthogonalized shocks implied by alternative hierarchical model structures allow a clear distinction. Considering residuals \(\varepsilon _{1t}\) (prey growth) to originate in isolation through orthogonalized shocks (as implied by \(D_l\)) is in line with the null hypothesis of independence at conventional significance levels. While one could argue that accepting a null hypothesis as such does not provide a strong inferential result, it is worth to highlight that the data have no possibility to object against an orthogonalization of a model with prey residual growth ordered first. Unlike this hierarchical specification, assuming residuals \(\varepsilon _{2t}\) (predator growth) to originate in isolation through orthogonalized shocks (as implied by \(D_u\)) obtains a rejection of the independence hypothesis with 5% (10%) significance for the full (reduced) sample. In sum, statistical identification is an informative means to establish diagnostic evidence in favour of the actual variable ordering \(y_t=(n_t,p_t)'\) over the alternative \(y_t=(p_t,n_t)'\).Footnote 8

Table 4 Alternative a-priori covariance factors for alternative sample sizes (full sample (FS) vs. reduced sample (RS))

3.2 Identified independent components

The right hand side panel of Table 4 documents structural parameters estimates \(\widehat{D}\) joint with bootstrap-based t-ratios.Footnote 9 For both samples the detection of shocks with minimum dependence results in solutions which are close to the orthogonalized shocks obtained from the Cholesky covariance factor. The maximization of model implied p-values obtains estimates of about 60%. While one should be careful in interpreting supremum p-values from multiple testing in the usual way, it seems that the shocks implied by \(\widehat{D}\) show some weaker dependence in comparison with the shocks retrieved after imposing a hierarchical model structure (\(D_l\)). However, this improvement is marginal and might lack significance.

The estimator \(\widehat{D}\) provides evidence for two well identified shocks, since the sign patterns of estimated effect directions are distinct. While the first shock invokes opposite impacts on the two population growth rates, the second shock exerts a positive impact effect on both the growth rate of the predator and the growth rate of the prey population. Supporting the restriction implied by the Cholesky factor \(D_l\), however, the upper right element in \(\widehat{D}\) lacks positiveness with conventional significance.

The estimates in \(\widehat{D}\) are the solution of a statistical optimization problem. As resulting from a purely data-driven approach, however, it is not clear if the implied structural shocks \(\{\hat{\xi }_t = \widehat{D}^{-1}\varepsilon _t\}_{t=1}^T\) conform with any theoretical interpretation. In data-based SVAR identification the issue of shock labelling has become an important step of the analysis. In the form provided in Table 4 both structural matrices \(D_l\) and \(\widehat{D}\) point to the prominent role of the first shock (i.e., isolated shocks entering the model through prey growth rate residuals). As implied by the documented sign pattern, this shock improves prey growth and diminishes growth rates of the predator population. In a ‘material’  sense, such theoretical properties of ‘positive’  shocks might lack intuition. Therefore, we reorganize the columns and sign patterns of \(\widehat{D}\) to establish features of the shocks that align with \(D_{theo}\) in (4). Specifically, we reverse the column ordering and subsequently multiply the elements of the second column with minus unity. This obtains the structural matrix estimates in labelled form

$$\begin{aligned} \widehat{D}^{(FS)}=\left( \begin{array}{cc} \begin{array}[t]{c} 0.267 \\ (1.06)\end{array} & \begin{array}[t]{c} -1.371 \\ (-18.7)\end{array} \\ \begin{array}[t]{c} 1.361 \\ (6.19)\end{array} & \begin{array}[t]{c} 1.124 \\ (4.77)\end{array} \end{array} \right) /10 \text{ and } \widehat{D}^{(RS)}=\left( \begin{array}{cc} \begin{array}[t]{c} 0.206 \\ (0.82)\end{array} & \begin{array}[t]{c} -1.274 \\ (-21.7)\end{array} \\ \begin{array}[t]{c} 1.408 \\ (7.00)\end{array} & \begin{array}[t]{c} 1.040 \\ (3.66)\end{array} \end{array} \right) /10, \end{aligned}$$
(13)

for the full sample and the reduced sample, respectively.

When it comes to describe actual shocks behind the empirical growth rates analysed in this work, the controlled conditions of the experiments described in Veilleux (1979) leave little room to assign a theoretical interpretation to the labelled structural shocks. For instance, the experiments control the amount of food that was available to the prey species (i.e., the cerophyl concentration). However, potential ‘shocks’  which improve the living conditions of the prey species could occur in the forms of (i) favourable stochastic fluctuations of the nutrient (or its quality) around the controlled mean level, or (ii) an advantageous spatial distribution of the nutrient which minimizes search time/cost to the prey population. Improving growth rates of the Paramecium (prey) population, such shocks also improve the ingestion conditions for Didinium (predator). Therefore such shocks can be seen to invoke impact responses as described in the first columns of the structural matrices in (13) for the identified \(\xi _{1t}\). For arguing in favour of potential positive shocks to the predator population it is worth noticing that the predator Didinium is unable to detect its prey Paramecium visually or in a chemotactic manner. Hence, it cannot actively move towards its prey, and it necessitates a random direct contact to induce the capture and ingestion mechanism. Therefore particular states of species distribution that are favourable for predator growth include the absence of predator clustering or the presence of prey clustering. Establishing favourable conditions for predator growth, such ‘shocks’  are most likely detrimental for prey growth, and point to the impact effects in the second columns of the structural matrices in (13) and the properties of the identified \(\xi _{2t}\).

Fig. 2
figure 2

Generalized impulse response functions (GIRFs, see (10)) of Paramecium (prey, \(n_{t+h}\)) and Didinium (predator) population growth rates (\(p_{t+h}\)) to unit shocks. The time index t operates at the semi-daily frequency. Point estimates are equivalent to those shown in Figure 2 of ERE07. Inferential assessments rely on a fixed-design wild bootstrap scheme in the spirit of Gonçalves and Kilian (2004) (see footnote 9 for details). Dashed curves indicate point-wise 95% bootstrap confidence bands (i.e., the 2.5% and 97.5% quantiles of the bootstrap distribution). Displayed GIRFs are for the full sample (FS). Results for the reduced sample (RS) are almost identical

3.3 Impulse response analysis

We next describe the implications of the VAR model for the dynamic and contemporaneous interplay of the two species. For this purpose we discuss a replication of the GIRF estimates of ERE07 which is displayed in Fig. 2. As a complement to these results, we discuss effects implied by the identified SVAR in the form of IRFs to orthogonalised and independent shocks shown in Fig. 3.

3.3.1 Generalized impulse response functions reconsidered

While point estimates shown in Fig. 2 fully accord with the GIRFs of ERE07, the displayed confidence intervals with 95% coverage differ slightly in terms of shape.Footnote 10 The displayed functional forms indicate the stationarity of the system of population growth rates as the effects of initial impulses die out over an horizon of approximately seven days. As highlighted in ERE07, features of both persistence and cyclicality characterize the dynamic system. Seemingly, prey growth rates are in the lead of predator growth rates.

Positive impulses in the growth rates of the predator or prey population invoke markedly negative impact effects in the growth rates of the other population. This outcome is a direct consequence of the negative contemporaneous correlation characterizing the residual covariances documented in (11). Hence, the GIRFs lack a strictly structural interpretation. In the present case this lack shows up in an important theoretical model deficiency. While it is most intuitive to see that positive shocks to the predator population growth are - on impact - detrimental for the growth rate of the prey population, the reverse transmission channel lacks such an intuition. In fact, the model implication that positive shocks to prey population growth exert a negative impact on the growth rate of the predator population contradicts basic assumptions of a stylized Kolmogorov PP model. This result is supportive for the critical perspective on GIRFs provided by Kim (2013).

Fig. 3
figure 3

Impulse responses of Paramecium (prey) and Didinium (predator) population growth rates to identified orthogonal and independent unit shocks (see (9)). For further notes see Fig. 2

3.3.2 Identified impulse responses to structural shocks

Continuing the structural analysis we discuss the dynamic response patterns of prey and predator population growth rates to stylized (orthogonalised and independent) unit shocks. With regard to the first shock the upper left panel of Fig. 3 indicates that its positive impact on prey growth turns significantly negative within the first 24 hours. Then, cyclical effects imply a return to positive rates of population growth which lack 10% significance. The response of the prey population to this shock leads the response of the predator population growth. The latter achieves its minimum value after 48 hours and the maximum of recovery also follows the maximum prey recovery with a delay of 24 hours.

The second shock displayed in the right hand side panels of Fig. 3 impacts positively on predator growth and shows a markedly diminishing effect on prey growth. Due to the deterioration of the availability of nutrition, the initially positive response of predator growth turns negative within 24 hours to allow an almost significant recovery of the prey population followed again by positive dynamic responses of predator population growth.

3.4 Marginal effects

As argued in Sect. 3.4 the structural model and - in particular - its reformulation in (6) allows to recover the contemporaneous effect of growth rates in either species on the remaining one conditional on the history of the process. Structural model estimates of inverse (and normalized) \(\widehat{D}^{-1}\) matrices are

$$\begin{aligned} D^{-1}_{(FS)} = \left( \begin{array}{cc} 1 & 1.220 \\ -5.095 & 1 \end{array} \right) \text{ and } D_{(RS)}^{-1} = \left( \begin{array}{cc} 1 & 1.225 \\ -6.838 & 1 \end{array} \right) \end{aligned}$$

for the full sample and the reduced sample respectively. The estimated structural model implies the contemporaneous relationships between prey and predator population as documented in Table 5. In the steady state an increase of the predator population growth by one percentage point invokes a reduction of the prey population growth by about 1.2 percentage points. Bootstrapping the structural model obtains an interquartile range of this relation which is between 1.06 and 1.48 (1.02 and 1.39) percentage points for the full (reduced) sample. Regarding the reversed directional impact, an increase in prey growth by one percentage point increases, on average, predator growth by 5.1% (6.8%) percentage points for the full (reduced) sample. Although these estimates are representative for the steady state, it is worth noticing that in directional terms both contemporaneous causal effects are in line with the axiomatic assumptions of the Kolmogorov model.

Table 5 Contemporaneous causal effects implied by the identified model after its reorganization (see (6))

4 Conclusions

In the ecological literature predator-prey (PP) models have become a successful approach to understand species’ interactions. Ewing et al. (2007) have argued convincingly in favour of the informational content of vector autoregressive models for the dynamic analysis in PP systems conditional on samples drawn (with low frequency) at discrete time. In this paper we complement their analysis of a prominent laboratory time series of population counts of two freshwater organisms (Veilleux 1979). Generalised impulse response patterns (Pesaran and Shin 1998) displayed in the benchmark study of Ewing et al. (2007) are not fully in line with the axiomatic foundations of PP models as stated in the Kolmogorov model (Freedman 1980). This caveat can be traced back to the fact that generalised impulse response functions lack a strictly structural interpretation.

We take advantage of recent contributions to the literature on identification in structural VAR models to provide insights into both the dynamic and the contemporaneous interplay of the two freshwater organisms. In specific, our analysis builds upon the uniqueness of independent shocks which exhibit a non-Gaussian distribution (Lancaster 1954; Comon 1994). Independent components give rise to a structural model which is well identified and in line with the basic axiomatic foundations of PP models. The statistical approach to identification allows to test the otherwise just identifying assumption of a hierarchical model structure. If the hierarchical model develops from regarding surprises to prey growth as rescaled structural shocks, the model is in line with an assumption of independent shocks. Estimating the structural parameters unrestrictedly provides further support for the hierarchical structure. With regard to the functional response in PP models our steady state approximation implies that conditional on a history of three half days, the prey population growth ‘instantaneously’  shrinks by about 1.2 percentage points in response to an increase of the predator population by one percentage point.

Adopting statistical identification schemes to the laboratory data of Veilleux (1979) shows the potential of structural perspectives on PP models of competition, coordination and predation. As a matter of fact, the laboratory design of data generation limits the material interpretation of the identified shocks. An interesting avenue for future research is the adaptation of structural VARs to real life data in ecology and other disciplines (e.g., the prominent linkage among ‘snowshoe hare’  and ‘Canada lynx’  Stenseth et al. 1997). In such a context interesting ecological issues could be addressed by means of SVARs that are identified in a statistical and/or theory based manner. For instance, the effects of controlled releases of the predator species into the wild could be assessed by means of structural impulse responses. Similarly, to understand the possible effects of weakened protection rules for preys (and, hence, more hunting) structural impulse responses promise valuable information.