Abstract
In ecology, the concept of predation describes interdependent patterns of having one species (called the predator) killing and consuming another (the prey). Specifying the so-called functional response of prey populations to predation is an important matter of debate which is typically addressed by means of continuous time models. Empirical regression or autoregression models applied to discrete predator-prey population data promise feasible steady state approximations of often complicated dynamic patterns of population growth and interaction. Ewing et al. (Ecol Econ 60:605–612, 2007) argue in favour of the informational content of so-called vector autoregressive models for the dynamic analysis of predator-prey systems. In this work we reconsider their analysis of dynamic interaction of two freshwater organisms, and design a structural model that allows to approximate the functional response in causal form. Results from an unrestricted structural model are in line with core axiomatic assumptions of predator-prey models. Conditional on population growth lagged up to three periods (i.e., 36 h), the semi-daily population growth of the prey Paramecium aurelia diminishes, on average, by 1.2 percentage points in response to an increase of the population growth of the predator Didinium nasutum by one percentage point.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In the ecological and biological literature so-called predator-prey (PP) models have become an established framework to describe patterns of predation, i.e., the killing and consumption of one species (the prey) by another (the predator). The instantaneous rate of prey consumption per predator—the so-called functional response—is both an essential feature of PP interaction and an important dimension for structuring a broad variety of continuous time PP models (see Jost and Ellner 2000, for an overview).Footnote 1 The debate on the functional response (see Jost and Ellner 2000, for a comprehensive discussion) largely fluctuates around the functional specification and subsequent estimation of the so-called predation function in continuous time. A few studies have provided empirical assessments of PP models in discrete time.Footnote 2 Unlike continuous patterns of PP interaction, discrete time models can capture multiperiod dynamics in form of regressive or autoregressive patterns without explicit reference to an underlying theoretical model. Adding to this merit, it is worth to note that empirical sampling might take place at low frequencies such that important intergenerational dynamics could be subject to (implicit or unspecified) aggregation. In this context, Ewing et al. (2007) (henceforth, ERE07) are among the first to argue convincingly in favour of the informational content of vector autoregressive (VAR) models for the dynamic analysis in PP systems.Footnote 3 To materialize their claim, ERE07 provide an in-depth analysis of the semi-daily population data from the classic ciliate experiments of Veilleux (1979) in their digitalized form of Jost and Ellner (2000). In this work, we reconsider the analysis of ERE07. Apart from replication exercises, we complement their analysis with the quantification of instantaneous responses among prey and predator population growth rates in either direction.
ERE07 can be considered as the first scholars who illustrate dynamic model implications by means of impulse response analysis (or so-called innovation accounting). In particular, they adopt the generalized impulse response functions (generalized IRFs, GIRFs) of Pesaran and Shin (1998). Since GIRFs lack a strictly structural (or causal) interpretation (see, e.g. Kim 2013), however, ERE07 remain ultimately silent about a steady state assessment of the functional response. For example, GIRFs displayed in Fig. 2 of ERE07 suffer from the counter intuitive interpretation that positive surprises to the prey population invoke a significant deterioration of predator population growth. The structural model approach that we undertake in this work largely benefits from recent contributions to structural VAR (SVAR) analysis (see Kilian and Lütkepohl 2017, for an up to date textbook treatment of identification in SVARs). Specifically, we trace the stochastic variations in the PP system back to unique (i.e., identified) non-Gaussian independent components (Lancaster 1954; Comon 1994). The model implied impact effects of the independent components on population growth rates accord with the axiomatic assumptions of the so-called Kolmogorov model as the theoretical and intuitive foundation of many representatives of PP models (Freedman 1980). Conditional on the process history, by implication, an increase of the predator population by one percentage point reduces prey population growth by about 1.2 percentage points, on average.
The remainder of this work is organized as follows: The next Section sketches the data, highlights the (structural) identification problem and puts the independent component analysis (ICA) adopted in this work into the perspective of alternative approaches to identification in SVARs. The main empirical results are provided and discussed in Sect. 3. Section 4 concludes. The Appendix provides an explicit representation of the dependence coefficient of Bakirov et al. (2006) which we employ for the detection of independent components.
2 A structural model for predator-prey interaction
This Section develops the structural model of interest. On the one hand, we highlight stylized characteristics of the population growth rates under scrutiny and point to the structural VAR as a means to translate correlation estimates into causal linkages. On the other hand, we review briefly alternative approaches (theory- vs. data-based) to model identification, and argue in favour of ICA for the subsequent analysis.
2.1 The data and conditional correlations
The population data analysed in ERE07 have been determined in mixed ciliate experiments of B. Veilleux with two freshwater organisms, the prey Paramecium aurelia and the predator Didinium nasutum (Veilleux 1979). Specifically, with initial densities of 45 (Paramecium) and 15 (Didinium) individuals per milliliter (ml) ciliates were cultured in Petri dishes containing six ml of culture medium maintained at \(27^\circ \text {C}\). To achieve sustained patterns of PP interaction, B. Veilleux elaborated on diverse scenarios combining controlled amounts of a bacterial nutrient for feeding the prey species (Cerophyl) with additions of Methyl Cellulose which governed thickness of the medium and thereby the movements of the ciliates. Non-destructive population counts of individuals per ml were made at the semi-daily frequency that accounts for intergenerational dynamics, since the number of fissions per day is larger than two for both species. While the fission rates might depend on the actual laboratory experiment (see Figures 2 and 3 of Veilleux 1979) further complexity of intergenerational dynamics arises from the fact that fission rates of the predator (Didinium) are by about 30% larger than those of the prey (Paramecium). To achieve sufficient accuracy of the population counts each quote is the average of eight independent countings. For intermediate choices of nutrition available to the prey species (i.e., cerophyl concentration) the experiments resulted in stable population patterns that are likely to reflect ‘inherent properties of the system and not an artifact due to the experimental method’ (Veilleux 1979, p. 796).Footnote 4
As in ERE07 the population growth rates of interest are defined as
where \(N_{t}\) and \(P_{t}\) are individuals per ml of Paramecium (the prey) and Didinium (the predator), respectively. The observation index t indicates measurement intervals of 12 h. The total number of observations (\(N_t,P_t\)) is 52. Hence, the number of available growth rates (\(n_t,p_t\)) is 51. ERE07 provide ADF statistics in favour of stationarity of both growth rates.
The graphical display of the growth rates in Fig. 1 shows two markedly outlying observations which might reflect the initialisation of the experimental design. Therefore, this work complements the analysis of full sample data in ERE07 (51 observations) with a robustness analysis for a subsample that excludes the first two recorded growth rates (49 observations). Table 1 documents mean growth rates and standard deviation (sd) statistics for the two samples. Moreover, Table 2 documents some conditional correlations obtained from ad-hoc regression models. Stylized bivariate regressions as well as richer dynamic structures obtain a common effect direction for both alternative choices of the dependent variable (\(n_t\) or \(p_t\)). Nevertheless, the documented regression outcomes allow for an interesting insight into conditional correlations among both growth rates \(n_t\) and \(p_t\). While the stylized bivariate regression with full sample information is characterized by positive coefficient estimates, these conditional correlations turn negative after conditioning on the set \(\{n_{t-i},p_{t-i},\,i=1,2,3\}\) as additional explanatory variables. Apparently, the model estimation by means of stylized bivariate regressions suffers from omitted dynamics. Put differently, dynamic effects appear as an important building block for the understanding of the PP system at hand. Corroborating established features of linear projections, however, the (dynamic) regression models are apparently not suitable to discover causal relationships by means of conditioning \(n_t\) on \(p_t\) (or \(p_t\) on \(n_t\)). It is likely that with semi-daily sampling the populations of the species (and, hence, population growth rates) are subject to joint endogeneity. As a result, using contemporaneous population or growth rate data as explanatory variables will induce correlation with regression residuals and, hence, estimation bias. Bias free estimation approaches would deserve instrumental information or the a-priori knowledge of the causal structure. Both preconditions hardly apply in the present context of the analysis of PP interaction among freshwater organisms.
Seeing that cross equation linkages hide causal structures in ad-hoc (dynamic) regressions, the subsequent steps do not only take account of dynamic profiles but also aim at a regression design composed of unrelated equations. For this purpose we first develop the VAR model of ERE07 further towards a structural representation tracing model residuals back to orthogonal (and identified) shocks. Subsequently, the isolation of these shocks provides a view at PP linkages in a form of unrelated equations.
2.2 The VAR model and its structural form
Let the vector \(y_t\) collect the growth rates defined in (1) as \(y_t=(n_t,p_t)'\) such that quotes on prey (predator) population growth are ordered first (second) in \(y_t\). To analyse competitive struggle in a PP model, the bivariate VAR model of order q (VAR(q)) reads as
(see Lütkepohl 2007, for a textbook treatment of VARs). Taking account of unconditional effects \(\nu \) is a vector of intercepts, and \(\varepsilon _t\) is an uninformative model residual with mean zero and covariance \(\Sigma \), i.e., \(\varepsilon _t \sim (0,\Sigma )\). The model parameters (\(A_i,\,i=1,\ldots ,p,\) and \(\Sigma \)) and residuals (\(\varepsilon _t\)) in (2) can be consistently estimated by means of OLS estimation. Conditioning on predetermined variables, however, the model in (2) does not provide an explicit representation of the contemporaneous linkages that operate among the two species.
To address structural or contemporaneous relations, it has become a widespread strategy to regard the residuals in \(\varepsilon _t\) as the outcome of a linear and non-singular weighting scheme that applies to latent and orthogonal shocks collected in a vector \(\xi _t\). Rewriting the model residuals accordingly as \(\varepsilon _t = D \xi _t\) obtains the SVAR counterpart of (2),
Without loss of generality, the orthogonal shocks in \(\xi _t\) are often assumed to have (marginal) variances of unity, i.e., \(\text{ Cov }[\xi _t]=I\) with I denoting the identity matrix. The residual covariance \(\Sigma \) from (2) and the weighting parameters in D (see (3)) are related such that
Highlighting the core identification problem, it is easy to see that the space of potential covariance factors \(\mathcal{D}=\{D|DD' = \Sigma \} \) is infinite. For instance, a parametrized space of covariance decompositions obtains from \(\Sigma =D_{\theta }D_{\theta }'=GR_{\theta }R_{\theta }'G'\), where G is a lower triangular Cholesky factor (\(\Sigma =GG'\)) and
is a rotation matrix with rotation angle \(\theta \). Accordingly, the covariance decomposition space could be defined as \(\mathcal{D} = \{D_{\theta }|D_{\theta }=GR_{\theta },0 \le \theta \le \pi \}\), and implied orthogonalized residuals are \({\xi }^{(\theta )}_t = D_{\theta }^{-1} {\varepsilon }_t\).
2.3 Identification
The identification problem in structural VAR analysis consists of the selection of one specific D matrix from \(\mathcal{D}\). Since all members of \(\mathcal{D}\) align with the model outlines in (2) and (3), it is immediate that the identification problem in structural VARs cannot be solved without further external assumptions. In the case of a conditional Gaussian model, for instance, all rotations of \(\xi _t\) (or equivalently of D) are informationally equivalent. Identifying external information might stem from two sources, (i) theoretical considerations about the variables in \(y_t\) (i.e., about PP responses to the elements in \(\xi _t\)), or (ii) from stylized features of the distributions of \(\varepsilon _t\) or \(\xi _t\).
We next encounter four alternative identification schemes in a stylized manner. For a detailed and up-to-date textbook treatment of identification in SVARs we refer the reader to Kilian and Lütkepohl (2017). Throughout we consider the case of bivariate SVARs. Extending the arguments to higher model dimensions is straightforward. Subsequently, we briefly discuss particular merits of theory- vs. data-based identification, and argue for ICA as a means to identify the PP SVAR model. Finally, we provide a short outline of the extraction of independent components.
2.3.1 Identification schemes
-
1.
Recursive systems: The estimate of the covariance \(\Sigma \) comprises three single (co)variances. To identify the four structural parameters in D, Sims (1980) has suggested to impose a (lower) triangular structure on this matrix. With such a restriction, the unrestricted parameters in D can be quantified straightforwardly. However, this identification scheme comes with the imposition of a hierarchical structure of the model. By implication, the structural properties depend on the ordering of the variables in \(y_t\).
-
2.
Identification by means of sign restrictions: As formalized in (3) typical elements of D quantify the impact effects of a unit shock \(\xi _{it}=1,\,i=1,2,\) on the observables in the system. While this effect is unknown in its exact metric form, it is often the case that theoretical considerations imply specific effect directions. In the econometric literature Faust (1998), Canova and De Nicolo (2002) and Uhlig (2005) have pioneered and advocated the identification of SVARs by means of such consensual effect directions. In the framework of PP models a theory guided shock labelling could benefit from theoretical knowledge of how the organisms under scrutiny react to surprising changes of living conditions (i.e., shocks). For instance, in real life scenarios such shocks might materialize in the form of the invasion of new species or changing environmental conditions. In this context it is worth observing that many modern representatives of PP models are based upon the Kolmogorov model - a minimal set of two differential equations coupled with axiomatic assumptions (Freedman 1980). Population changes as implied by the Kolmogorov model are
$$\begin{aligned} \frac{dN}{dt}=g(N,P)N \text{ and } \frac{dP}{dt}=h(N,P)P, \end{aligned}$$and the axiomatic assumptions concern the so-called predation function g (\(\frac{\partial g}{\partial P} < 0\)) and, moreover, \(\frac{\partial h}{\partial N} > 0\). Approximating the continuous time model in discrete time with \(dt = 1\) obtains \(n_{t+1} \approx g(N_t,P_t)\) and \(p_{t+1} \approx h(N_t,P_t)\). Hence, a positive shock which primarily impacts on preys implies \(n_{t+1}>0\) (direct effect) and \(p_{t+1}>0\) (indirect effect channelled through h). Moreover, shocks which are directly beneficial to growth rates of the predator population are likely to impact adversely on the living conditions of the preys (indirect effect channelled through g).Footnote 5 Casting these considerations in the framework of the discrete time SVAR model obtains the following effect directions for typical elements of D, denoted \(d_{ij}\): \(d_{ii}>0,i=1,2\), \(d_{12}<0\) and \(d_{21}>0\). Hence, a theory supported sign pattern for the structural matrix D is
$$\begin{aligned} {D}_{theo}=\left( \begin{array}{cc} + & - \\ + & + \end{array} \right) . \end{aligned}$$(4)Providing a clear and unique effect pattern, the sign restrictions in \({D}_{theo}\) qualify for theory based model identification. Practically, a set of models identified by means of sign restrictions
$$\begin{aligned} \widetilde{\mathcal{D}} = \{D = GR_{\theta }|d_{ii}>0,\,i=1,2,\,d_{21}>0,\,d_{12}<0 \} \end{aligned}$$obtains from random sampling of rotation angles \(\theta ,\, 0 \le \theta \le \pi \), and storing all implied covariance decompositions that align with the sign pattern provided in (4). Once an analyst has sampled a sufficient number of admitted structural matrices, the set of identified models \(\widetilde{\mathcal{D}}\) might be described by common statistical measures (e.g., mean, standard deviation, quantiles).
-
3.
Identification through heteroskedasticity: The identification through heteroskedasticity as pioneered by Rigobon (2003) and developed further by, amongst others, Lanne and Lütkepohl (2008) and Bacchiocchi and Fanelli (2015) exploits information on observable heteroskedasticity in the residuals \(\varepsilon _t\). Assume that an analyst has access to two distinct covariance estimates, denoted \(\Sigma ^{(1)}\) and \(\Sigma ^{(2)}\), that she obtains from disjoint subsamples of the data. In the bivariate case this yields six single (co)variance estimates to solve the identification problem. The two covariances allow for a reparameterization as \(\Sigma ^{(1)}=DD'\) and \(\Sigma ^{(2)}=DWD'\), where W is a diagonal matrix. Hence, the six estimated (co)variances can be mapped in a one-to-one manner to the six unknown parameters in D and W, thereby identifying D. It is evident, however, that unique identification by means of heteroskedasticity requires that the parameters in W must be different. In other words, the switch from the first to the second covariance regime must not affect the variances of the shocks proportionally.
-
4.
Identification by means of independent components: Early results on the properties of linear combinations of (non)Gaussian random variables have motivated to obtain structural shocks as independent components. While Lancaster (1954) has stated that rotation invariance is unique to independent Gaussian random variables, results of Comon (1994) imply that the weighting scheme \(\varepsilon _t = D \xi _t\) allows a unique recovery of the independent shocks in \(\xi _t\) from \(\varepsilon _t\) if and only if at most one element of \(\xi _t\) exhibits a Gaussian distribution. Taking advantage of these results, amongst others, Moneta et al. (2013), Gouriéroux et al. (2017) and Lanne et al. (2017) have developed identification schemes for SVARs which differ with respect to the space of admitted D matrices or the distributional assumptions made for the structural shocks in \(\xi _t\). Non-parametric approaches to ICA have been suggested or applied, for instance, by Matteson and Tsay (2017) and Herwartz (2019). Following their lines of reasoning an estimator of D could be obtained from solving the minimization problem
$$\begin{aligned} \widehat{D} = \text{ argmin}_{D \in \mathcal{D}}\{\text{ Joint } \text{ dependence } \text{ of } \text{ elements } \text{ in } \xi _t\text{, } \text{ with }\; \xi _t = D^{-1} \varepsilon _t\} \end{aligned}$$(5)by means of a general non-parametric test of the null hypothesis of joint independence.
2.3.2 A comparative discussion of identification schemes
To support the choice for a particular identification scheme in empirical practice it is helpful to first contrast theory- vs. data based identification. While theory-based identification ensures that obtained structural shocks have meaningful impact properties, such schemes are also prone to intermingling model assumptions and conclusions (Uhlig 2005). In contrast, data-based identification typically provides sufficient external information such that otherwise overidentifying restrictions could be subjected to testing. At the downside, however, structural shocks obtained from these identification schemes do not necessarily allow for a sound theoretical interpretation. In this work we opt for a data-based approach to identification, and investigate subsequently if the retrieved shocks allow for a theory-conform interpretation in the spirit of the theoretical sign pattern of \(D_{theo}\) in (4). For this purpose it is important to notice that an identified matrix D is unique only up to column signs and the column ordering. With \(d_i\) denoting the i-th column of D it holds that \(\Sigma = DD' = \sum _{i=1}^2 d_{i}d_{i}'\). Hence, the matrices \([d_1,d_2]\), \([-d_1,d_2]\) or \([-d_2,d_1]\) are all in line with the covariance restriction \(\Sigma = DD'\).
The application of data-based identification schemes deserves either that the VAR residuals are subject to informative covariance changes or non-Gaussian. While we obtain some evidence in favour of downward shifting variances of VAR residuals (see also Fig. 1), the null hypothesis of proportional variance changes cannot be rejected with 5% significance. In contrast, the evidence against normally distributed VAR residuals is very strong such that we choose ICA for identification. The following paragraph is explicit on the independence statistic used to select D from \(\mathcal{D}\).
2.3.3 Independent component analysis
Solving the minimization problem in (5) requires a statistical quantification of joint dependence. To detect unique independent components we follow the approach of Herwartz (2019) and employ the dependence coefficient introduced by Bakirov et al. (2006). The dependence coefficient, denoted \(\mathcal{C}\), provides a \(\sqrt{T}\) consistent estimator of the population distance of the joint characteristic function of two random variables (in our case \(\xi _{1t}\) and \(\xi _{2t}\)) and the product of both marginal characteristic functions (see the Appendix for an explicit exposition). Similar to absolute estimates of linear correlations, \(\mathcal{C}\) is bounded between zero and unity which indicate the limiting scenarios of independence and complete dependence, respectively. For testing the null hypotheses of independence, \(T\mathcal{C}^2\) is a bounded random variable. Since critical values of \(T\mathcal{C}^2\) are difficult to obtain analytically, Bakirov et al. (2006) suggest a permutation bootstrap approach for the assessment of significance of the test statistic.Footnote 6 To determine independent components practically, one might either minimize \(\mathcal{C}\) or maximize the p-value of \(T\mathcal{C}^2\). In light of simulation results of Herwartz (2019) we opt for the latter and determine the structural matrix \(\widehat{D}=GR_{\hat{\theta }}\) such that
where \(\mathcal{C}_{\theta }\) is determined from the sample \(\{ {\xi }^{({\theta })}_t = D_{{\theta }}^{-1} {\varepsilon }_t \}_{t=1}^T\).Footnote 7 Practically, the estimates \(\hat{\theta }\) obtain from a grid search over alternative rotation angles \(\theta =\frac{j\pi }{2}/50,\,j=1,2,3,\ldots ,50\). Extending the grid to \(\frac{\pi }{2} < \theta \le \pi \) results in an informationally equivalent second solution for \(D_{\theta }\) with exchanged column positions.
2.4 Marginal effects
The model in (3) is structural in the sense that observable regression residuals are traced back to orthogonal shocks. As such the model is not explicit on the implied causal relations between the jointly endogenous variables in \(y_t=(n_t,p_t)'\). Multiplying (3) from the left with \(D^{-1}\) obtains a structural model with unrelated residuals and a parametric representation of the conditional linkages among the elements in \(y_t\),
where \(A_i^*=D^{-1}A_i,\,i=1,2,\ldots ,q\). Let \(\Omega _{t-1}\) denote an information set that comprises the process information up to time \(t-1\). Moreover, let \(d^{(ij)}\) denote a typical element of \(D^{-1}\). Conditional on \(\Omega _{t-1}\), the elements in \(D^{-1}\) describe the marginal causal effects. Since
we obtain after normalization from (6)
where \(\omega _{t-1}^{(n)}\) and \(\omega _{t-1}^{(p)}\) are observable conditional on the VAR parameters and \(\Omega _{t-1}\). It is worth noticing that by virtue of their definition in (7) the marginal effects \(-d^{(21)}/d^{(22)}\) obey a similar interpretation in discrete time as the predation function g(N, P) in the continuous time Kolmogorov model.
2.5 Impulse response functions
Owing to the rich and complex dynamic parametrization of VAR models, IRFs have become a common practice to illustrate model implied dynamics. To formally derive IRFs as in Chapter 2 of Lütkepohl (2007) the VAR in (2) can be represented in so-called moving average form as
where L is the backshift operator such that \(Ly_t = y_{t-1}\) and \(\mu = A^{-1}(L)\nu \). The moving average parameter matrices \(\Psi _i,\,i=0,1,2,3,\ldots \), can be obtained recursively from nonlinear transformations of the autoregressive parameter matrices \(A_i,\,i=1,\ldots ,q\). Specifically,
and
The moving average representation is of core importance for purposes of impulse response analysis. For instance, the typical elements of the matrices
characterize the effect of isolated unit shocks in \(\xi _{jt}\) on \(y_{i,t+h}\), the i-th time series variables at horizon h. Obviously, impulse response schemes are specific to the selection of the structural matrix D.
To avoid strong identifying assumptions on the covariance decomposition, the innovation accounting analysis in ERE07 does not rely on impulse responses to orthogonalized shocks. Rather ERE07 make use of the so-called GIRFs of Pesaran and Shin (1998). Formally, GIRFs read as
where \({\varvec{e}_i}\) is a zero vector with a unit element in its i-th position, and \(\sigma _{ii}\) is the square root of the i-th diagonal element of the residual covariance \(\Sigma \). The qualification of the profiles in (10) as ‘generalized’ IRFs comes from the fact that these statistics are invariant to the ordering of the variables in the system. As argued in Kim (2013), however, GIRFs could be interpreted as an aggregation of impulse responses from VARs with distinct variable orderings. By implication, GIRFs lack effect consistency under the typical case of a non-diagonal residual covariance \(\Sigma \).
3 Structural analysis of predator-prey interaction
In this Section we first highlight that the VAR model of ERE07 is characterized by significant deviations from conditional normality and, hence, that independence scores might be used for characterizing ad-hoc identification schemes in terms of independence of orthogonalized shocks. Subsequently, we pursue the detection of independent components and provide an interpretation of the identified shocks. Furthermore, the Section includes a comparative description of the GIRFs of ERE07 and identified IRFs. Finally, we employ the identified model to assess the contemporaneous causal linkages among the growth patterns of the two species.
3.1 Non-Gaussianity and recursive model structures
ERE07 argue in favour of a VAR model with \(q=3\), as it minimizes the BIC and obtains serially uncorrelated estimates of \(\varepsilon _t\). After fitting a VAR(3) model to the bivariate vector of population growth rates, the residual covariance matrices estimated from the full and reduced sample read, respectively, as
While the removal of initial outlying observations reduces markedly the residual variances of both variables, the residual correlations are significantly negative, namely −0.477 and −0.458 for the full and the reduced sample, respectively. To uncover the causal relationships behind these empirical correlations by means of unique independent components, it is essential that the data are non-Gaussian. Table 3 provides outcomes from normality tests applied to VAR residuals. Both the Shapiro-Wilk (SW) and the Jarque-Bera (JB) test statistics for the null hypothesis of normally distributed marginal residuals provide strong evidence against the normality assumption. JB summary statistics indicate non-Gaussianity also for the joint distribution.
Given significant deviations from the Gaussian model, it is tempting to order alternative ad-hoc orthogonalization schemes with respect to their potential to align with notions of independent (rather than merely orthogonal) shocks. Cholesky factors of the residual covariance depend on the ordering of the variables in \(y_t\) (see the discussion in Section 2.3.1). Alternatively, for a given variable ordering one might distinguish a lower (denoted \(D_l\)) and an upper triangular Cholesky factor (\(D_u\)). The left hand side and middle panels of Table 4 document estimated alternative covariance decomposition factors. In terms of joint independence the orthogonalized shocks implied by alternative hierarchical model structures allow a clear distinction. Considering residuals \(\varepsilon _{1t}\) (prey growth) to originate in isolation through orthogonalized shocks (as implied by \(D_l\)) is in line with the null hypothesis of independence at conventional significance levels. While one could argue that accepting a null hypothesis as such does not provide a strong inferential result, it is worth to highlight that the data have no possibility to object against an orthogonalization of a model with prey residual growth ordered first. Unlike this hierarchical specification, assuming residuals \(\varepsilon _{2t}\) (predator growth) to originate in isolation through orthogonalized shocks (as implied by \(D_u\)) obtains a rejection of the independence hypothesis with 5% (10%) significance for the full (reduced) sample. In sum, statistical identification is an informative means to establish diagnostic evidence in favour of the actual variable ordering \(y_t=(n_t,p_t)'\) over the alternative \(y_t=(p_t,n_t)'\).Footnote 8
3.2 Identified independent components
The right hand side panel of Table 4 documents structural parameters estimates \(\widehat{D}\) joint with bootstrap-based t-ratios.Footnote 9 For both samples the detection of shocks with minimum dependence results in solutions which are close to the orthogonalized shocks obtained from the Cholesky covariance factor. The maximization of model implied p-values obtains estimates of about 60%. While one should be careful in interpreting supremum p-values from multiple testing in the usual way, it seems that the shocks implied by \(\widehat{D}\) show some weaker dependence in comparison with the shocks retrieved after imposing a hierarchical model structure (\(D_l\)). However, this improvement is marginal and might lack significance.
The estimator \(\widehat{D}\) provides evidence for two well identified shocks, since the sign patterns of estimated effect directions are distinct. While the first shock invokes opposite impacts on the two population growth rates, the second shock exerts a positive impact effect on both the growth rate of the predator and the growth rate of the prey population. Supporting the restriction implied by the Cholesky factor \(D_l\), however, the upper right element in \(\widehat{D}\) lacks positiveness with conventional significance.
The estimates in \(\widehat{D}\) are the solution of a statistical optimization problem. As resulting from a purely data-driven approach, however, it is not clear if the implied structural shocks \(\{\hat{\xi }_t = \widehat{D}^{-1}\varepsilon _t\}_{t=1}^T\) conform with any theoretical interpretation. In data-based SVAR identification the issue of shock labelling has become an important step of the analysis. In the form provided in Table 4 both structural matrices \(D_l\) and \(\widehat{D}\) point to the prominent role of the first shock (i.e., isolated shocks entering the model through prey growth rate residuals). As implied by the documented sign pattern, this shock improves prey growth and diminishes growth rates of the predator population. In a ‘material’ sense, such theoretical properties of ‘positive’ shocks might lack intuition. Therefore, we reorganize the columns and sign patterns of \(\widehat{D}\) to establish features of the shocks that align with \(D_{theo}\) in (4). Specifically, we reverse the column ordering and subsequently multiply the elements of the second column with minus unity. This obtains the structural matrix estimates in labelled form
for the full sample and the reduced sample, respectively.
When it comes to describe actual shocks behind the empirical growth rates analysed in this work, the controlled conditions of the experiments described in Veilleux (1979) leave little room to assign a theoretical interpretation to the labelled structural shocks. For instance, the experiments control the amount of food that was available to the prey species (i.e., the cerophyl concentration). However, potential ‘shocks’ which improve the living conditions of the prey species could occur in the forms of (i) favourable stochastic fluctuations of the nutrient (or its quality) around the controlled mean level, or (ii) an advantageous spatial distribution of the nutrient which minimizes search time/cost to the prey population. Improving growth rates of the Paramecium (prey) population, such shocks also improve the ingestion conditions for Didinium (predator). Therefore such shocks can be seen to invoke impact responses as described in the first columns of the structural matrices in (13) for the identified \(\xi _{1t}\). For arguing in favour of potential positive shocks to the predator population it is worth noticing that the predator Didinium is unable to detect its prey Paramecium visually or in a chemotactic manner. Hence, it cannot actively move towards its prey, and it necessitates a random direct contact to induce the capture and ingestion mechanism. Therefore particular states of species distribution that are favourable for predator growth include the absence of predator clustering or the presence of prey clustering. Establishing favourable conditions for predator growth, such ‘shocks’ are most likely detrimental for prey growth, and point to the impact effects in the second columns of the structural matrices in (13) and the properties of the identified \(\xi _{2t}\).
3.3 Impulse response analysis
We next describe the implications of the VAR model for the dynamic and contemporaneous interplay of the two species. For this purpose we discuss a replication of the GIRF estimates of ERE07 which is displayed in Fig. 2. As a complement to these results, we discuss effects implied by the identified SVAR in the form of IRFs to orthogonalised and independent shocks shown in Fig. 3.
3.3.1 Generalized impulse response functions reconsidered
While point estimates shown in Fig. 2 fully accord with the GIRFs of ERE07, the displayed confidence intervals with 95% coverage differ slightly in terms of shape.Footnote 10 The displayed functional forms indicate the stationarity of the system of population growth rates as the effects of initial impulses die out over an horizon of approximately seven days. As highlighted in ERE07, features of both persistence and cyclicality characterize the dynamic system. Seemingly, prey growth rates are in the lead of predator growth rates.
Positive impulses in the growth rates of the predator or prey population invoke markedly negative impact effects in the growth rates of the other population. This outcome is a direct consequence of the negative contemporaneous correlation characterizing the residual covariances documented in (11). Hence, the GIRFs lack a strictly structural interpretation. In the present case this lack shows up in an important theoretical model deficiency. While it is most intuitive to see that positive shocks to the predator population growth are - on impact - detrimental for the growth rate of the prey population, the reverse transmission channel lacks such an intuition. In fact, the model implication that positive shocks to prey population growth exert a negative impact on the growth rate of the predator population contradicts basic assumptions of a stylized Kolmogorov PP model. This result is supportive for the critical perspective on GIRFs provided by Kim (2013).
3.3.2 Identified impulse responses to structural shocks
Continuing the structural analysis we discuss the dynamic response patterns of prey and predator population growth rates to stylized (orthogonalised and independent) unit shocks. With regard to the first shock the upper left panel of Fig. 3 indicates that its positive impact on prey growth turns significantly negative within the first 24 hours. Then, cyclical effects imply a return to positive rates of population growth which lack 10% significance. The response of the prey population to this shock leads the response of the predator population growth. The latter achieves its minimum value after 48 hours and the maximum of recovery also follows the maximum prey recovery with a delay of 24 hours.
The second shock displayed in the right hand side panels of Fig. 3 impacts positively on predator growth and shows a markedly diminishing effect on prey growth. Due to the deterioration of the availability of nutrition, the initially positive response of predator growth turns negative within 24 hours to allow an almost significant recovery of the prey population followed again by positive dynamic responses of predator population growth.
3.4 Marginal effects
As argued in Sect. 3.4 the structural model and - in particular - its reformulation in (6) allows to recover the contemporaneous effect of growth rates in either species on the remaining one conditional on the history of the process. Structural model estimates of inverse (and normalized) \(\widehat{D}^{-1}\) matrices are
for the full sample and the reduced sample respectively. The estimated structural model implies the contemporaneous relationships between prey and predator population as documented in Table 5. In the steady state an increase of the predator population growth by one percentage point invokes a reduction of the prey population growth by about 1.2 percentage points. Bootstrapping the structural model obtains an interquartile range of this relation which is between 1.06 and 1.48 (1.02 and 1.39) percentage points for the full (reduced) sample. Regarding the reversed directional impact, an increase in prey growth by one percentage point increases, on average, predator growth by 5.1% (6.8%) percentage points for the full (reduced) sample. Although these estimates are representative for the steady state, it is worth noticing that in directional terms both contemporaneous causal effects are in line with the axiomatic assumptions of the Kolmogorov model.
4 Conclusions
In the ecological literature predator-prey (PP) models have become a successful approach to understand species’ interactions. Ewing et al. (2007) have argued convincingly in favour of the informational content of vector autoregressive models for the dynamic analysis in PP systems conditional on samples drawn (with low frequency) at discrete time. In this paper we complement their analysis of a prominent laboratory time series of population counts of two freshwater organisms (Veilleux 1979). Generalised impulse response patterns (Pesaran and Shin 1998) displayed in the benchmark study of Ewing et al. (2007) are not fully in line with the axiomatic foundations of PP models as stated in the Kolmogorov model (Freedman 1980). This caveat can be traced back to the fact that generalised impulse response functions lack a strictly structural interpretation.
We take advantage of recent contributions to the literature on identification in structural VAR models to provide insights into both the dynamic and the contemporaneous interplay of the two freshwater organisms. In specific, our analysis builds upon the uniqueness of independent shocks which exhibit a non-Gaussian distribution (Lancaster 1954; Comon 1994). Independent components give rise to a structural model which is well identified and in line with the basic axiomatic foundations of PP models. The statistical approach to identification allows to test the otherwise just identifying assumption of a hierarchical model structure. If the hierarchical model develops from regarding surprises to prey growth as rescaled structural shocks, the model is in line with an assumption of independent shocks. Estimating the structural parameters unrestrictedly provides further support for the hierarchical structure. With regard to the functional response in PP models our steady state approximation implies that conditional on a history of three half days, the prey population growth ‘instantaneously’ shrinks by about 1.2 percentage points in response to an increase of the predator population by one percentage point.
Adopting statistical identification schemes to the laboratory data of Veilleux (1979) shows the potential of structural perspectives on PP models of competition, coordination and predation. As a matter of fact, the laboratory design of data generation limits the material interpretation of the identified shocks. An interesting avenue for future research is the adaptation of structural VARs to real life data in ecology and other disciplines (e.g., the prominent linkage among ‘snowshoe hare’ and ‘Canada lynx’ Stenseth et al. 1997). In such a context interesting ecological issues could be addressed by means of SVARs that are identified in a statistical and/or theory based manner. For instance, the effects of controlled releases of the predator species into the wild could be assessed by means of structural impulse responses. Similarly, to understand the possible effects of weakened protection rules for preys (and, hence, more hunting) structural impulse responses promise valuable information.
Notes
Noticing that loss-win interactions of competition, coordination and/or predation are at the heart of these models, it is not surprising that PP specifications have also been considered for purposes beyond ecosystem analysis. For instance, in economics PP linkages have been picked up amongst others (i) by Capello and Faggian (2002) for modelling urban growth patterns in Italy, (ii) by Mehlum et al. (2003) to develop a general equilibrium model of industrialisation and (iii) in Crookes and Blignaut (2016) to describe vertical production structures in global steel and vehicle production.
Along with many further references, Ewing et al. (2007) provide a stylized sketch of the history of PP models and their empirical assessments in discrete time.
See also Beisner et al. (2003) for an example VAR model augmented with exogenous information.
See Table 1 of Veilleux (1979) for a summary of (un)stable dynamic behaviours within Methyl Cellulose cultures. Based on simulated data from a continuous time model of PP interaction Jost and Ellner (2000) provide further support to consider the data as drawn from stable states of a PP model. The population counts are available from the web, http://cbi-toulouse.fr/images/upload/prslbdatasets.txt. As ERE07 we analyse the last series provided (‘cerophyl concentration, CC = 0.5’ ; digitalized from Figure 14c of Veilleux (1979)).
Being formalized in continuous time, however, the effects of such shocks are in general conditional on the state of the system, i.e., on actual population figures. As such, the axiomatic assumptions of the Kolmogorov model are seen here to apply ‘on average’ to the steady state of PP interaction. Conditions for the stability of PP systems in discrete time depend on the functional forms of g(N, P) and h(N, P) (see Huang et al. 2008, for an example case).
For all computational purposes that involve \(\mathcal{C}\) we use the R package ‘energy’ (Rizzo and Székely 2017). Bootstrap p-values for \(T\mathcal{C}^2\) are based on \(R=249\) permutations. It turns out that subjecting a given sample to independence testing obtains some variation in the implied p-values for a given statistic \(T\mathcal{C}^2\). To assess the significance (and p-values) of \(T\mathcal{C}^2\) robustly, all respective resampling exercises build upon average p-values from 49 alternative initializations of the permutation bootstrap. Results obtained from using just one initialization for the permutation bootstrap are qualitatively identical, however.
Solving an estimation problem by maximizing the p-value of a suitable test statistic (i.e., minimizing the evidence against the null hypothesis) has become prominent as so-called Hodges-Lehmann estimation (Hodges and Lehmann 1963).
See also the simulation results in Herwartz (2019) on the informational content of the independence coefficient \(\mathcal{C}\) to rank causal hierarchies implied by alternative variable orderings.
For inferential assessments of estimates in \(\widehat{D}\) (and (G)IRFs) we use a fixed-design wild bootstrap scheme in the spirit of Gonçalves and Kilian (2004). At each bootstrap replication pseudo samples are composed of
$$\begin{aligned} y_{t}^* = \hat{\nu } + \hat{A}_1 y_{t-1} + \hat{A}_2 y_{t-2} + \cdots + \hat{A}_p y_{t-p} + {\varepsilon }_t^*, \; {\varepsilon }_t^* = \hat{\varepsilon }_t \psi _t, \; t=1,2,\ldots ,T. \end{aligned}$$(12)In (12) \(\hat{\nu }\) and \(\hat{A}_j,\,j=1,\ldots ,q\), are OLS parameter estimates retrieved from the data. Independently of the data, the scalar random variable \(\psi _t\) exhibits a Rademacher distribution (\(\text{ Prob }[\psi =1]=\text{ Prob }[\psi =-1]=0.5\)). The inferential analysis relies on 999 bootstrap replications. Brüggemann et al. (2016) have shown that wild bootstrap variants fail to mimic the joint distribution of estimators of slope (\(\nu ,\,{A}_i,i=1,2,\ldots ,q\)) and (co)variance parameters (\(\Sigma =DD'\)) asymptotically. In finite samples, however, a consistent moving block bootstrap and the wild bootstrap likely obtain qualitatively identical results (Brüggemann et al. 2016).
Targeting at a nominal coverage of 95%, ERE07 use symmetric confidence intervals of \(\pm 2\) standard error estimates. In this study, we use quantiles of the bootstrap distribution to bound confidence intervals with given coverage.
References
Bacchiocchi E, Fanelli L (2015) Identification in structural vector autoregressive models with structural changes: an application to US monetary policy. Oxford Bull Econ Stat 66:771–779
Bakirov NK, Rizzo ML, Székely GJ (2006) A multivariate nonparametric test of independence. J Multivar Anal 97:1742–1756
Beisner B, Ives A, Carpenter S (2003) The effects of an exotic fish invasion on the prey communities in two lakes. J Anim Ecol 72:331–342
Brüggemann R, Jentsch C, Trenkler C (2016) Inference in VARs with conditional heteroskedasticity of unknown form. J Econ 191:69–85
Canova F, De Nicolo G (2002) Monetary disturbances matter for business fluctuations in the G-7. J Monet Econ 49:1131–1159
Capello R, Faggian A (2002) An economic-ecological model of urban growth and urban externalities: Empirical evidence from Italy. Ecol Econ 40:181–198
Comon P (1994) Independent component analysis. A new concept? Sig Process 36:287–314
Crookes D, Blignaut J (2016) Predator-prey analysis using system dynamics: an application to the steel industry. S Afr J Econ Manag Sci 19:733–746
Ewing B, Riggs K, Ewing K (2007) Time series analysis of a predator-prey system: application of VAR and generalized impulse response function. Ecol Econ 60:605–612
Faust J (1998) The robustness of identified VAR conclusions about money. Carnegie-Rochester Conf Public Policy 49:207–244
Freedman H (1980) Deterministic mathematical models in population ecology. Marcel Dekker, New York
Gonçalves S, Kilian L (2004) Bootstrapping autoregressions with conditional heteroskedasticity of unknown form. J Econ 123:89–120
Gouriéroux C, Monfort A, Renne J (2017) Statistical inference for independent component analysis: application to structural VAR models. J Econ 196:111–126
Herwartz H (2019) Long-run neutrality of demand shocks: revisiting Blanchard and Quah (1989) with independent structural shocks. J Appl Econ 34:811–819
Hodges J Jr, Lehmann E (1963) Estimates of location based on rank tests. Ann Math Stat 34:598–611
Huang Y, Jiang X, Zou X (2008) Dynamics in numerics: on a discrete predator-prey model. Diff Equ Dyn Syst 16:163–182
Jost C, Ellner SP (2000) Testing for predator dependence in predator-prey dynamics: a non-parametric approach. Proc R Soc Lond B Biol Sci 267:1611–1620
Kilian L, Lütkepohl H (2017) Structural vector autoregressive analysis. Cambridge University Press, Cambridge
Kim H (2013) Generalized impulse response analysis: general or extreme? EconoQuantum 10:135–141
Lancaster H (1954) Traces and cumulants of quadratic forms in normal variables. J R Stat Soc B (Methodol) 16:247–254
Lanne M, Lütkepohl H (2008) Identifying monetary policy shocks via changes in volatility. J Money Credit Bank 40:1131–1149
Lanne M, Meitz M, Saikkonen P (2017) Identification and estimation of non-Gaussian structural vector autoregressions. J Econ 196:288–304
Lütkepohl H (2007) New introduction to multiple time series analysis. Springer, New York
Matteson DS, Tsay RS (2017) Independent component analysis via distance covariance. J Am Stat Assoc 112:623–637
Mehlum H, Moene K, Torvik R (2003) Predator or prey? Parasitic enterprises in economic development. Eur Econ Rev 47:275–294
Moneta A, Entner D, Hoyer PO, Coad A (2013) Causal inference by independent component analysis: theory and applications. Oxford Bull Econ Stat 75:705–730
Pesaran M, Shin Y (1998) Generalized impulse response analysis in linear multivariate models. Econ Lett 58:17–29
Rigobon R (2003) Identification through heteroskedasticity. Rev Econ Stat 85:777–792
Rizzo ML, Székely GJ (2017) Energy: E-statistics: multivariate inference via the energy of data. R package version 1.7-2. https://CRAN.R-project.org/package=energy
Sims CA (1980) Macroeconomics and reality. Econometrica 48:1–48
Stenseth N, Falck W, Bjørnstad O, Krebs C (1997) Population regulation in snowshoe hare and Canadian lynx: asymmetric food web configurations between hare and lynx. Proc Natl Acad Sci USA 94:5147–5152
Uhlig H (2005) What are the effects of monetary policy on output? Results from an agnostic identification procedure. J Monetary Econ 52:381–419
Veilleux B (1979) An analysis of the predatory interaction between Didinium and Paramecium. J Anim Ecol 48:787–803
Acknowledgements
I cordially acknowledge excellent computational assistance from Jan Schick and Tilo Schnabel, and helpful discussions with Markus Fülle, Alexander Lange, Martin Quaas, Till Requate, Jan Schick, Bernd Theilen, Yabibal M. Walle and participants of the research seminar at the University of Leipzig. I am also grateful to Marina Ettl for her most valuable comments on interaction patterns of the prey Paramecium aurelia and the predator Didinium nasutum, and to three anonymous reviewers for their comments on an earlier version of this work.
Funding
Open Access funding enabled and organized by Projekt DEAL
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: The dependence coefficient
Appendix: The dependence coefficient
Let \(\xi _{st} = (\xi _{1s},\xi _{2t})',\,t=1,2,\ldots ,T\) denote a sample of bivariate tuples consisting of two random variables \(\xi _{1t}\) and \(\xi _{2t}\). Then, the dependence coefficient of Bakirov et al. (2006) is
where
and \(|\cdot |\) and \(||\cdot ||\) are the Euclidean norms in \(\mathbb {R}\) and \(\mathbb {R}^2\), respectively. It is worth noticing that in the empirical application pursued here, estimated quantities \(\hat{\xi }_t\) enter \(\mathcal{C}\) and not directly observed random variables. However, from the \(\sqrt{T}\) consistency of estimated VAR residuals \(\hat{\varepsilon }_t\) it is easy to show that observation specific estimation errors of \(\hat{\xi }_t\) vanish on average when determining the sample moments \(\psi _i,\,i=1,2,\ldots ,5\). Unlike rank-based dependence diagnostics (as, e.g., Hoeffding’s D) the dependence coefficient \(\mathcal{C}\) is informationally efficient, since it relies on the original random variables in \(\xi _t\) in their metric form.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Herwartz, H. Modelling interaction patterns in a predator-prey system of two freshwater organisms in discrete time: an identified structural VAR approach. Stat Methods Appl 31, 63–85 (2022). https://doi.org/10.1007/s10260-021-00564-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10260-021-00564-8