1 Introduction

The essence of stable population theory isFootnote 1 the fact that a population subject to time-invariant vital rates will (with a few exceptions not of interest here) converge to a stable structure and grow exponentially at a constant rate (the population growth rate, or intrinsic rate of increase). The calculation of the population growth rate from the vital rates is one of the most important accomplishments of formal demography (Sharpe and Lotka 1911).Footnote 2 Ecologists recognized early on that, by integrating survival and fertility over the life course, the population growth rate provided a powerful tool for describing the population consequences of environmental conditions (e.g., Birch 1953). For the same reason, evolutionary biologists recognized it as a measure of fitness (Fisher 1930), although that concept requires careful consideration of both demographic and genetic processes (Charlesworth 1994; de Vries and Caswell 2018).

This makes the sensitivity analysis of population growth rate an important problem. It has been approached in three ways. The earliest approach (Hamilton 1966) is specific to age-classified models, and relies on differentiation of the characteristic equation. The second (Caswell 1978) applies to stage-classified as well as age-classified models, and uses eigenvalue perturbation theory. The third is based on matrix calculus and is more flexible than its predecessors.

2 Hamilton’s Equation for Age-Classified Populations

Consider an age-classified model, in which age x is a continuous variable, with mortality rate μ(x) and maternity function m(x). The survivorship function is

$$\displaystyle \begin{aligned} \ell(x) = \exp \left( - \int_0^x \mu(a) da \right) {} \end{aligned} $$
(3.1)

and the population growth rate r is the solution to the Euler-Lotka equation

$$\displaystyle \begin{aligned} 1 = \int_0^\infty e^{-ra} \ell(a) m(a) da . {} \end{aligned} $$
(3.2)

The stable age distribution, reproductive value function, birth rate, and generation time (mean age of reproduction in the stable population) are given by

$$\displaystyle \begin{aligned} \begin{array}{rcl} c(x) &\displaystyle =&\displaystyle \frac{e^{-rx} \ell(x) }{\int_0^\infty e^{-ra} \ell(a) da} \qquad \mbox{stable age distribution}{} \end{array} \end{aligned} $$
(3.3)
$$\displaystyle \begin{aligned} \begin{array}{rcl} v(x) &\displaystyle =&\displaystyle \frac{e^{rx}}{\ell(x)} \int_x^\infty e^{-ra} \ell(a) m(a) da \qquad \mbox{reproductive value} {} \end{array} \end{aligned} $$
(3.4)
$$\displaystyle \begin{aligned} \begin{array}{rcl} b &\displaystyle =&\displaystyle \left[ \int_0^\infty e^{-ra} \ell(a) da \right]^{-1} \qquad \mbox{birth rate} {} \end{array} \end{aligned} $$
(3.5)
$$\displaystyle \begin{aligned} \begin{array}{rcl} \bar A &\displaystyle =&\displaystyle \int_0^\infty a e^{-ra} \ell(a) m(a) da \qquad \mbox{generation time} {} \end{array} \end{aligned} $$
(3.6)

Sensitivity of r

Hamilton (1966) derived the sensitivities of r to changes in mortality and fertility at a specified age x. His results are equivalent to

$$\displaystyle \begin{aligned} \begin{array}{rcl} {d r \over d \mu(x)} &\displaystyle =&\displaystyle \frac{- c(x) v(x)}{b \bar{A} \rule{0in}{2ex}} {} \end{array} \end{aligned} $$
(3.7)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {d r \over d m(x)} &\displaystyle =&\displaystyle \frac{c(x)}{b \bar{A}} {} \end{array} \end{aligned} $$
(3.8)

That is, the sensitivity of r to a change in mortality at age x is proportional to the product of the reproductive value at age x and the abundance of age x in the stable age distribution. The sensitivity of r to a change in fertility at age x is proportional to the stable age distribution (and the reproductive value at age 0, which equals 1). The proportionality constant in each case is the inverse of the product of the birth rate and the mean age of reproduction.

Derivation

Hamilton’s results are obtained by implicit differentiation of the Euler-Lotka equation (3.2). We will derive Hamilton’s original formulation and then show how it reduces to the relation between the stable age distribution and the reproductive value distribution in (3.7) and (3.8).

First, introduce a perturbation parameter θ to measure the change in mortality or fertility at the specified age. Writing survival, fertility, and r as functions of θ gives the Euler-Lotka equation

$$\displaystyle \begin{aligned} 1 = \int_0^\infty e^{-r(\theta) a} \ell(\theta,a) m(\theta,a)\; da . {} \end{aligned} $$
(3.9)

Differentiating both sides of (3.9) with respect to θ gives

$$\displaystyle \begin{aligned} \begin{array}{rcl} 0 &\displaystyle =&\displaystyle - {d r(\theta) \over d \theta} \; \int_0^\infty a e^{-r(\theta) a} \ell(\theta,a) m(\theta,a)\; da \\ {} &\displaystyle &\displaystyle + \int_0^\infty e^{-r(\theta) a} {d \ell(\theta,a) \over d \theta} m(\theta,a)\; da \\ {} &\displaystyle &\displaystyle + \int_0^\infty e^{-r(\theta) a} \ell(\theta,a) {d m(\theta,a) \over d \theta} \; da . {} \end{array} \end{aligned} $$
(3.10)

Solving (3.10) for dr gives

$$\displaystyle \begin{aligned} {d r(\theta) \over d \theta} =\frac{1}{\bar A} \left( \underbrace{ \int_0^\infty e^{-r(\theta) a} {d \ell(\theta,a) \over d \theta} m(\theta,a)\; da }_{\mathrm{mortality}} + \underbrace{ \int_0^\infty e^{-r(\theta) a} \ell(\theta,a) {d m(\theta,a) \over d \theta} \; da }_{\mathrm{fertility}} \right) {} \end{aligned} $$
(3.11)

Equation (3.11) has two terms, one capturing effects of θ on mortality and the other capturing effects on fertility.

2.1 Effects of Changes in Mortality

We want to perturb mortality at one exact age x (remember that age and time are continuous), leaving mortality at all other ages unchanged. To do this, we use the unit impulse function, or Dirac delta function. This is a generalized function defined by

$$\displaystyle \begin{aligned} \begin{array}{rcl} \delta(x) &\displaystyle =&\displaystyle 0 \qquad x \neq 0 \end{array} \end{aligned} $$
(3.12)
$$\displaystyle \begin{aligned} \begin{array}{rcl} \int_{-\infty}^\infty \delta(s) ds&\displaystyle =&\displaystyle 1 . \end{array} \end{aligned} $$
(3.13)

The unit impulse is used in signal processing (e.g., Kamen and Heck 1997, p. 7) to represent the limit of a perturbation of unit strength applied over a shorter and shorter time interval. Think of a normal distribution with mean 0, in the limit as the variance goes to 0, while the area under the curve remains at 1. The most useful properties of the unit impulse, for our application, are

$$\displaystyle \begin{aligned} \int_{-\infty}^\infty \delta(a-x) f(a) da = f(x) \end{aligned} $$
(3.14)

and

$$\displaystyle \begin{aligned} \int_{- \infty}^x \delta(s) ds = H(x) \end{aligned} $$
(3.15)

where H(x) is the Heaviside function, or unit step function, which satisfies H(x) = 0 for x < 0 and H(x) = 1 for x > 0.

We write mortality as

$$\displaystyle \begin{aligned} \mu(\theta,a) = \mu(0, a) + \theta \delta(a-x) {} \end{aligned} $$
(3.16)

where δ(x) is the unit impulse function. The sensitivity of r to μ(x) is obtained as the derivative of r with respect to θ, evaluated at θ = 0,

$$\displaystyle \begin{aligned} {d r \over d \mu(x)} = \left. {d r \over d \theta} \right|{}_{\theta=0} . \end{aligned} $$
(3.17)

Because only mortality is affected by θ

$$\displaystyle \begin{aligned} \begin{array}{rcl} {d m(\theta,a) \over d \theta} &\displaystyle =&\displaystyle 0 \end{array} \end{aligned} $$
(3.18)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {d \mu(\theta,a) \over d \theta} &\displaystyle =&\displaystyle \delta(a-x). \end{array} \end{aligned} $$
(3.19)

From (3.1),

$$\displaystyle \begin{aligned} \begin{array}{rcl} {d \ell(\theta,a) \over d \theta} &\displaystyle =&\displaystyle - e^{- \int_0^a \mu(\theta,s) ds } \int_0^a \delta(a-x) da \end{array} \end{aligned} $$
(3.20)
$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle - \ell(\theta,a) H(a-x) . \end{array} \end{aligned} $$
(3.21)

Substituting into (3.11) and evaluating at θ = 0 gives

$$\displaystyle \begin{aligned} {d r \over d \mu(x)} = \frac{-1}{\bar A} \left( \int_x^\infty e^{-r a} \ell(a) m(a) da \right) . {} \end{aligned} $$
(3.22)

The integral in (3.22) is close to the reproductive value v(x) given by (3.4); specifically,

$$\displaystyle \begin{aligned} \int_x^\infty e^{-r a} \ell(a) m(a) da = \ell(x) e^{-rx} v(x) . {} \end{aligned} $$
(3.23)

However, from (3.3) and (3.5), (x)e rx = c(x)∕b. Making these substitutions into (3.22) gives the formal relationship (3.7).

2.2 Effects of Changes in Fertility

Following the same approach, if the perturbation affects fertility at exact age x, we write

$$\displaystyle \begin{aligned} m(\theta,a) = m(0,a) + \theta \delta(a-x) . {} \end{aligned} $$
(3.24)

Because only fertility is affected by θ, (θ, a)∕ = 0 and dm(θ, a)∕ = δ(a − x). Substituting these into (3.11) and evaluating the result at θ = 0 gives

$$\displaystyle \begin{aligned} {d r \over d m(x)}= \frac{1}{\bar A} \; \left( \rule{0in}{2.5ex} e^{-rx} \ell(x) \right) . \end{aligned} $$
(3.25)

From (3.3) and (3.5) it can be seen that the numerator is c(x)∕b, which leads to the formal relationship (3.8).

2.3 History and Perspectives

Hamilton (1966) obtained the relationship (3.22) in his analysis of the evolution of senescence. From (3.22) and (3.8) it is apparent that (provided r ≥ 0) the magnitudes of the sensitivities of r to mortality and fertility decline with age. These sensitivities measure the selection gradients on age-specific mortality and fertility. Thus Hamilton concluded that the strength of selection against deleterious mutations would necessarily decline with their age of action, that small positive effects at early ages could easily compensate for much larger negative effects at later ages, and that the evolution of senescence was therefore inevitable.

In the years that followed Hamilton’s paper, several other authors developed perturbation analysis for r, using related methods. Demetrius (1969) used a discrete age-classified model, and Emlen (1970) used Hamilton’s results to derive the dynamics of gene frequencies resulting from the selection gradients on age-specific survival and fertility.

Keyfitz (1971) in a remarkable paper, used implicit differentiation to obtain the sensitivity of population growth rate, life expectancy, birth rates, death rates, and the stable age distribution, apparently independently of Hamilton. He noted the appearance of reproductive value in the sensitivity of r to mortality. Goodman (1971) was apparently the first to note that the sensitivities of r to mortality and fertility could be expressed in terms of the stable age distribution and reproductive value.

When Hamilton’s paper appeared, it was regarded as difficult and esoteric, but it had a great impact. It provided the analytical machinery for examining trade-offs between opposing demographic traits, known as antagonistic pleiotropy (Williams 1957; Rose 1991). It also describes the accumulation of deleterious mutations due to the balance between mutation and selection (e.g., Steinsaltz et al. 2005). These ideas are fundamental to the analysis of human aging (e.g., Rose 1991; Wachter and Finch 1997; Carey and Tuljapurkar 2003; Baudisch 2008) and, more generally, the analysis of life history evolution in humans and other species (e.g., Charlesworth 1994; Stearns 1992).

3 Stage-Classified Populations: Eigenvalue Perturbations

Implicit in Hamilton’s analysis is the assumption that the vital rates are functions of age. In many cases, they are not. In humans, characteristics such as education, marital status, health status, or spatial location, may provide important information in addition to age. In other species, the vital rates may depend on developmental stage or body size more than on age. Such populations are described by stage-classified demographic models, of which the age-classified theory is a special case.

Stage-classified demography can be analyzed using matrix population models (Leslie 1945; Caswell 2001). The discrete-time population growth rate λ is the dominant eigenvalue of the population projection matrix A (guaranteed to be real and positive by the Perron-Frobenius theorem). Let n(t) be the population vector at time t, and A the population projection matrix, with

$$\displaystyle \begin{aligned} {\mathbf{n}}(t+1) = {\mathbf{A}} {\mathbf{n}}(t) \end{aligned} $$
(3.26)

and the population growth rate is given by the dominant eigenvalue λ of A. The stable stage distribution is given by the corresponding right eigenvector w and the reproductive value function by the left eigenvector v; they satisfy

$$\displaystyle \begin{aligned} \begin{array}{rcl} {\mathbf{A}} {\mathbf{w}} &\displaystyle =&\displaystyle \lambda {\mathbf{w}} {} \end{array} \end{aligned} $$
(3.27)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {\mathbf{v}}^{\mathsf{T}} {\mathbf{A}} &\displaystyle =&\displaystyle \lambda {\mathbf{v}}^{\mathsf{T}} {} \end{array} \end{aligned} $$
(3.28)

Sensitivity of λ

The effects of perturbations on population growth are approached by looking for the sensitivity of an eigenvalue to changes in the entries of a matrix. We will see that the sensitivity of λ to a change in the entry a ij of A is (Caswell 1978)

$$\displaystyle \begin{aligned} {\partial \lambda \over \partial a_{ij}} = \frac{v_i w_j}{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} . {} \end{aligned} $$
(3.29)

The entry a ij measures the per-capita production of stage i by stage j. Thus the effect of a change in a ij is proportional to the reproductive value of the destination stage and to the abundance of the origin stage in the stable population. This is a generalization of the relationships (3.7) and (3.8) obtained from Hamilton’s analysis.

Derivation

The eigenvalue λ is a solution to the characteristic equation of A, which generalizes the Euler-Lotka equation (3.2). Except in special cases, however, the characteristic equation cannot be written down explicitly, making the implicit differentiation approach used by Hamilton impossible. Instead, the relationship (3.29) is obtained by a perturbation expansion. Suppose that A is perturbed to A +  ΔA. This will result in perturbations of λ and of w, which must satisfy

$$\displaystyle \begin{aligned} \left( {\mathbf{A}} + \Delta {\mathbf{A}} \right) \left( {\mathbf{w}} + \Delta {\mathbf{w}} \right) = (\lambda + \Delta \lambda) ({\mathbf{w}} + \Delta {\mathbf{w}}) . \end{aligned} $$
(3.30)

Expanding the products, setting second order terms to zero, and remembering that Aw = λ w gives

$$\displaystyle \begin{aligned} {\mathbf{A}} (\Delta {\mathbf{w}}) + (\Delta {\mathbf{A}}) {\mathbf{w}} = \lambda (\Delta {\mathbf{w}}) + (\Delta \lambda) {\mathbf{w}}. \end{aligned} $$
(3.31)

Multiply on the left by v T and simplify to obtain

$$\displaystyle \begin{aligned} (\Delta \lambda) {\mathbf{v}}^{\mathsf{T}} {\mathbf{w}} = {\mathbf{v}}^{\mathsf{T}} (\Delta {\mathbf{A}}) {\mathbf{w}}. \end{aligned} $$
(3.32)

If the perturbation affects only one entry, say a ij, of A, then

$$\displaystyle \begin{aligned} \Delta \lambda = \frac{v_i w_j \left( \Delta a_{ij} \right)}{{\mathbf{v}} ^{\mathsf{T}} {\mathbf{w}}}. \end{aligned} $$
(3.33)

Dividing both sides by Δa ij and taking the limit as Δa ij → 0 gives the sensitivity result (3.29).

3.1 Age-Classified Models as a Special Case

To compare (3.29) with Hamilton’s results (3.7) and (3.8), consider an age-classified matrix (a Leslie matrix) with fertilities F i in the first row, survival probabilities P i on the subdiagonal, and zeros elsewhere (Leslie 1945; Keyfitz 1968). In this case (3.29) simplifies to

$$\displaystyle \begin{aligned} \begin{array}{rcl} {\partial \lambda \over \partial P_i} &\displaystyle =&\displaystyle \frac{v_{i+1} w_i}{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} {} \end{array} \end{aligned} $$
(3.34)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {\partial \lambda \over \partial F_i} &\displaystyle =&\displaystyle \frac{v_{1} w_i}{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} . {} \end{array} \end{aligned} $$
(3.35)

Equation (3.34) corresponds to (3.7); the sensitivity is proportional to the product of the reproductive value and the stable stage distribution. Equation (3.35) corresponds to (3.8), and shows why reproductive value is apparently missing from (3.8): reproductive value at birth [v(0) in Hamilton’s notation] is scaled to equal 1.

3.2 Sensitivity to Lower-Level Demographic Parameters

The entries of A are often functions of other, lower-level parameters. The sensitivity of λ to these parameters is obtained by the chain rule. For example, suppose that stage 1 may contribute individuals to stages 2 or 3 (Fig. 3.1). Write the transition probabilities as

$$\displaystyle \begin{aligned} \begin{array}{rcl} a_{21} &\displaystyle =&\displaystyle \gamma \sigma \end{array} \end{aligned} $$
(3.36)
$$\displaystyle \begin{aligned} \begin{array}{rcl} a_{31} &\displaystyle =&\displaystyle (1-\gamma) \sigma \end{array} \end{aligned} $$
(3.37)

where σ is the survival probability and γ the probability that the individual moves to stage 2, conditional on survival. Then the sensitivities of λ to γ and to σ are given by

$$\displaystyle \begin{aligned} \begin{array}{rcl} {d \lambda \over d \sigma} &\displaystyle =&\displaystyle {\partial \lambda \over \partial a_{21}} {d a_{21} \over d \sigma} + {\partial \lambda \over \partial a_{31}} {d a_{31} \over d \sigma} \end{array} \end{aligned} $$
(3.38)
$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle \frac{w_1 \left[ \gamma v_2 + (1-\gamma) v_3 \right]}{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} \end{array} \end{aligned} $$
(3.39)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {d \lambda \over d \gamma} &\displaystyle =&\displaystyle {\partial \lambda \over \partial a_{21}} {d a_{21} \over d \gamma} + {\partial \lambda \over \partial a_{31}} {d a_{31} \over d \gamma} \end{array} \end{aligned} $$
(3.40)
$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle \frac{\sigma w_1 \left( v_2 - v_3 \right) }{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} . {} \end{array} \end{aligned} $$
(3.41)

The sensitivity to survival is proportional to the weighted average of the reproductive values of the destination stages, and the sensitivity to the transition probability γ is proportional to the difference in reproductive value between the destination stages.

Fig. 3.1
figure 1

An example of lower-level parameters appearing in a portion of a life cycle. Individuals in stage 1 survive with probability σ, and, conditional on survival, move to stage 2 with probability γ and to stage 3 with probability 1 − γ

3.3 History

I first encountered the basis for this perturbation expansion in a paper by C.A. Desoer in the proceedings of an engineering conference (Desoer 1967).Footnote 3 Eigenvalue perturbations were of particular interest to engineers in the 1960s as part of a shift from frequency-domain methods to state-space methods in the study of linear systems (Zadeh and Desoer 1963). However, the result dates back to Jacobi (1846), and has been independently rediscovered many times (e.g., Faddeev 1959; Papoulis 1966; Franklin 1968). In population biology, this perturbation approach has been extended to many other sensitivity problems, including the sensitivity of subdominant eigenvalues and transient behavior, of growth rates in periodic and stochastic environments, of the eigenvectors, and of the spreading speed in biological or demographic invasions (see Caswell (2001) for reviews and references).

4 Growth Rate Sensitivity via Matrix Calculus

Matrix calculus provides a still more general approach to the sensitivity analysis of the population growth rate. Equation (3.29) perturbs only a single entry of A; derivatives with respect to other parameters are assembled by summing their effects over all the entries of A, as in (3.41). Using matrix calculus, we now consider λ as a scalar function of A and A as a matrix-valued function of a parameter vector θ.

Sensitivity of λ

We will show that the derivative of λ with respect to θ is

$$\displaystyle \begin{aligned} {d \lambda \over d \boldsymbol{\theta}^{\mathsf{T}}} = \left( \frac{ {\mathbf{w}}^{\mathsf{T}} \otimes {\mathbf{v}}^{\mathsf{T}} }{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}} \right) \left( {d \mbox{vec} \, {\mathbf{A}} \over d \boldsymbol{\theta}^{\mathsf{T}}} \right) , {} \end{aligned} $$
(3.42)

where ⊗ denotes the Kronecker product. If θ is a p × 1 vector of parameters, then d θ T is a 1 × p matrix whose ith entry is i.

Derivation

Following the steps in Chap. 2, begin by taking the differential of both sides of (3.27) to give

$$\displaystyle \begin{aligned} (d {\mathbf{A}}) {\mathbf{w}} + {\mathbf{A}} (d {\mathbf{w}}) = (d \lambda) {\mathbf{w}} + \lambda (d {\mathbf{w}}) . \end{aligned} $$
(3.43)

Multiply both sides on the left by v T and simplify to obtain

$$\displaystyle \begin{aligned} (d \lambda) {\mathbf{v}}^{\mathsf{T}} {\mathbf{w}} = {\mathbf{v}}^{\mathsf{T}} (d {\mathbf{A}}) {\mathbf{w}} {} \end{aligned} $$
(3.44)

Next, apply the vec operator to both sides of (3.44). Since the left side is a scalar, the vec operator has no effect. The right side is a product of three quantities, so Roth’s theorem implies that

$$\displaystyle \begin{aligned} (d \lambda) {\mathbf{v}}^{\mathsf{T}} {\mathbf{w}} = \left( {\mathbf{w}}^{\mathsf{T}} \otimes {\mathbf{v}}^{\mathsf{T}} \right) d \mbox{vec} \, {\mathbf{A}} . {} \end{aligned} $$
(3.45)

The First Identification Theorem then gives

$$\displaystyle \begin{aligned} {d \lambda \over d \mbox{vec} \,^{\mathsf{T}} {\mathbf{A}}} = \frac{ {\mathbf{w}}^{\mathsf{T}} \otimes {\mathbf{v}}^{\mathsf{T}} }{{\mathbf{v}}^{\mathsf{T}} {\mathbf{w}}}. {} \end{aligned} $$
(3.46)

Finally, the chain rule (2.18) gives us

$$\displaystyle \begin{aligned} {d \lambda \over d \boldsymbol{\theta}^{\mathsf{T}}} = {d \lambda \over d \mbox{vec} \,^{\mathsf{T}} {\mathbf{A}}} {d \mbox{vec} \, {\mathbf{A}} \over d \boldsymbol{\theta}^{\mathsf{T}}} . \end{aligned} $$
(3.47)

The matrix calculus approach is particularly powerful because of the flexibility in specifying the effect of θ on the vital rates. Suppose that A depends on a vector σ of survival probabilities, which are a function of the concentration X of a pollutant, which in turn is changing as a function of time t. The rate of change of λ over time is

$$\displaystyle \begin{aligned} {d \lambda \over d t} = \left( {d \lambda \over d \mbox{vec} \,^{\mathsf{T}} {\mathbf{A}}} \right)\left( {d \mbox{vec} \, {\mathbf{A}} \over d \boldsymbol{\sigma}^{\mathsf{T}}} \right) \left( {d \boldsymbol{\sigma} \over d X} \right) \left( {d X \over d t} \right) {} \end{aligned} $$
(3.48)

Each of the terms in (3.48) can be evaluated separately; the matrix product gives the correct dimension for the final sensitivity result (a 1 × 1 scalar in this case).

5 Second Derivatives of Population Growth Rate

The second derivatives of λ measure the curvature of the response to changes in parameters. They have important applications in evolutionary demography, where they indicate the action of stabilizing, disruptive, or correlational selection on fitness-related traits (e.g., Phillips and Arnold 1989; Caswell 2001), in adaptive dynamics, where they help determine the stability of evolutionary singular strategies (e.g., Diekmann 2004), and in extending sensitivity analysis to second-order effects.

Since the first derivatives of λ are written, in Eqs. (3.29) and (3.46), in terms of the right and left eigenvectors of A, the second derivatives of λ require the first derivatives of those eigenvectors. Caswell (1996) derived the second derivatives of λ to entries of A by an extension of the method in Sect. 3.3. However, a more general and rigorous method is available using matrix calculus.

Consider a (scalar) variable ξ which is a function of a vector θ of parameters. The complete set of second derivatives of ξ are given by the Hessian matrix

$$\displaystyle \begin{aligned} {\mathbf{H}} = \left(\begin{array}{c} \displaystyle {\partial^2 \xi \over \partial \theta_i \partial \theta_j} \end{array}\right) \end{aligned} $$
(3.49)

Magnus and Neudecker (1988) proved (their Second Identification Theorem) that if the second differential of ξ can be written as

$$\displaystyle \begin{aligned} d^2 \xi = d \boldsymbol{\theta}^{\mathsf{T}} {\mathbf{B}} d \boldsymbol{\theta} \end{aligned} $$
(3.50)

for some matrix B, then

$$\displaystyle \begin{aligned} {\mathbf{H}} = \frac{1}{2} \left( {\mathbf{B}} + {\mathbf{B}}^{\mathsf{T}} \right) . \end{aligned} $$
(3.51)

Shyu and Caswell (2014) used this approach to derive the second derivatives of the population growth rate λ, the continuous-time population growth rate \(r = \log \lambda \), and the net reproductive rate R 0, to changes in either the entries of A or to arbitrary lower-level parameters of which A is a function. We will not explore second derivatives in this book, but Shyu’s other work (Shyu and Caswell 2016a,b) applies them to analyze the evolutionary demography of sex ratios, and Caswell and Shyu (2017) use them to analyze the effects of mortality on the selection gradients on senescence.

6 Conclusion

Each of the three approaches to growth rate sensitivity, leading to Eqs. (3.7), (3.8), (3.29), and (3.42), uses its own analytical methods. They agree, however, in showing how the sensitivity of population growth rate can be written in terms of the stable stage distribution and the reproductive value. In general, the effect of a change in the rate at which individuals move from stage j to stage i is proportional to the abundance of the origin stage (j) and the reproductive value of the destination stage (i). If a transition yields individuals with low reproductive value, or if few individuals are available to experience the change in the rate of transition, the effect on population growth will be small.