1 Introduction

1.1 Indicators

Biodiversity indicators are one of the most direct ways in which biodiversity data inform governmental policy and action (Butchart et al. 2010; Costelloe et al. 2016), particularly in response to international targets of reducing the rate of biodiversity loss. Most multispecies indicators (MSIs) are based on small numbers of species (<50) within a narrowly defined scope (e.g. birds, butterflies or bats), although some combine species across a much wider taxonomic range (Eaton et al. 2015; van Strien et al. 2016). The transition to a new UN ‘Decade on Biodiversity’ is expected to stimulate the creation of new multispecies biodiversity indicators.

The UK has a long-established set of biodiversity indicators. The UK government reports annually on 11 MSIs, each spanning several decades. All report trend statistics which are in some sense ‘averages’ of time series for individual species. Most are based on count data and thus assumed to index changes in national abundance, although two are derived from occupancy detection models for several hundreds of species (Isaac et al. 2016). Generally, source data are gathered annually for each species at a large number of sampling sites and converted into time trends via statistical models containing ‘year’ effects that serve as indices of annual status. The precise way in which this is done varies between taxonomic groups for reasons of species ecology and sampling design. The methods of this paper do not require standardisation in this respect, and we shall not elaborate further on the many variations currently adopted for estimating individual species’ trends.

We note four points of inconsistency in contemporary indicators, both in the UK and internationally:

  • Indicators are not consistent in the sources of uncertainty that are propagated through to estimates of precision in the headline MSI, and there are differences in the way in which precision is calculated.

  • How, or indeed whether, the indicator line is smoothed.

  • Indicators differ in how they deal with missing values, i.e. species:year combinations where there were insufficient surveys to produce reliable data.

  • Whether the indicator measures changes in species abundance, species distributions (occupancy), or both.

1.2 Desirable Features of Multispecies Indicators

Buckland et al. (2005) discuss six useful properties a multispecies indicator should possess. We note two further specific qualities in addition to the issues raised above.

  • Where practical, MSIs should follow a consistent methodology in order to ensure comparability across indicators and increase transparency. Harmonisation of indicator methods does not mean that we should also harmonise the methods used to calculate trends for individual species. Harmonising indicator methods but not the input data implies a separation of indicator production from the models used to produce species indices. We can think of species index values derived from raw data as Essential Biodiversity Variables (Kissling et al. 2018), from which the indicator is a derived summary.

  • The indicator should be interpretable in terms of changes in some meaningful biological unit, e.g. the ‘average’ change in abundance across species.

Differences across MSIs in statistical methods make it hard to compare across taxa. There is also a transparency deficit: MSIs are described in varying levels of detail, often in hard-to-find technical reports. These issues will become exacerbated as new indicators are developed that span multiple taxonomic groups with different time spans, methodologies and currencies (Eaton et al. 2015; van Strien et al. 2016).

1.3 Paper Outline

Our approach is based on a stochastic model for population growth rates, a convenient way to address the problem of missing species in particular years. In Sect. 2, we provide a hierarchical model for the calculation of multispecies biodiversity indicators within a state-space formulation. Classical and Bayesian implementations are given, based on the likelihood for MCMC and also the Kalman filter form, which is efficient for analysis using classical inference. We also provide smoothing within the indicator, rather than post hoc as is frequently done, with implications for the estimation of precision. Performance is demonstrated in the simulation study of Sect. 3. Section 4 provides a Bayesian application to British bat, butterfly and dragonfly data, and Sect. 5 illustrates the use of the classical approach for model selection. The paper ends with discussion, emphasising the wide applicability of the methods of the paper, and how they may be extended, in Sect. 6.

2 Inference

2.1 Model Formulation

State-space models provide a convenient way to produce a multispecies indicator. Species index values can be expressed as a realisation of some unknown latent state, with a hierarchical structure that permits the full propagation of uncertainty. Sampling variances at the various levels are estimated, leaving trends that account for both sampling/observation error and population stochasticity.

Let \(Y_{s,t}\) be the observed index of status for species s in year t, measured on the logarithmic scale, for \(s=1, \ldots , n\) and \(t=1, \ldots ,T\). For example, Y could be the fitted log abundance derived from a Poisson linear model (Dennis et al. 2016b). However, working on the unbounded scale means that Y could equally represent the log odds of a site being occupied, derived from a probability-based site occupancy model (Dennis et al. 2017; MacKenzie et al. 2017). We assume these observations are normally distributed around a true, unknown, value \(S_{s,t}\) :

$$\begin{aligned} Y_{s,t} \sim \mathrm{N} (S_{s,t}, \sigma ^2_{s,t}), \quad t=1,2, \ldots , T. \end{aligned}$$
(1)

Initially, we assume that \(\sigma ^2_{s,t} = \sigma ^2\), a constant to be estimated. In practice, this variation may change between species or years, and the model is readily extended in this fashion. Species-specific annual growth rates, \(R_{s,t}\) follow from the relationships:

$$\begin{aligned} R_{s,t}= S_{s,t+1}- S_{s,t}, \end{aligned}$$

so that

$$\begin{aligned} S_{s,t}=S_{s,1}+\sum _{j=1}^{t-1}R_{s,j}, \quad t=2, 3, \ldots , T, \quad s=1, \ldots , n. \end{aligned}$$
(2)

The indices in the first year could be regarded as unknown parameters or more parsimoniously taken as, for example

$$\begin{aligned} S_{s,1} \sim \mathrm{N}(M_1, \tau _0^2), \quad s=1, \ldots , n, \end{aligned}$$
(3)

with parameters \(M_1\) and \(\tau _0\) to be estimated. Alternatively, species indices are often presented as, and always convertible to, trends relative to some value (e.g. 0, on this logarithmic scale) in the first year or, for some species, a subsequent year \(F_s\). In this case, the option exists then to constrain \(S_{s,1}\) or \(S_{s,F_s}\) also to zero. We assume the \(\{R_{s,t}\}\) are normally distributed around multispecies average rates of growth, \(\{G_t\}\), such that:

$$\begin{aligned} R_{s,t} \sim \mathrm{N} (G_t, \tau _t^2), \quad t=1,2, \ldots , T-1, \quad s=1, \ldots , n. \end{aligned}$$
(4)

This model recognises that growth rates vary among species each year, but that due to shared environmental conditions they may be generally higher or lower in one year than another. While their variance may also vary between years, throughout this paper we shall again assume for simplicity a constant value \(\tau ^2\).

Within this framework, we have two options for calculating a multispecies indicator. One metric, \(M_t\), analogous to Eq. 2 can be derived by accumulating values of \(\{G_t\}\):

$$\begin{aligned} M_t = M_1+\sum _{j=1}^{t-1} G_j, \end{aligned}$$
(5)

so that, with \(M_1=0\), or estimated from the data, for example

$$\begin{aligned} G_t=M_{t+1}-M_t. \end{aligned}$$

Where complete series are available for all species, common practice is to estimate the geometric mean abundance across them (Buckland et al. 2011), and following the same logic, an alternative multispecies indicator may be derived as:

$$\begin{aligned} M'_t=\frac{1}{n} \sum _{s=1}^nS_{s,t}, \end{aligned}$$
(6)

where n is the number of species. This is therefore the average of the individual species trends as estimated, on the log scale, within the model via Eq. 2. Presentation in the form exp(\(M'_t\)) gives geometric means of the fitted species indices, if these are similarly back-transformed. These model-derived species indices, \(S_{s,t}\), will always exist, even for years in which an observed index, \(Y_{s,t}\), is not available.

The unbounded scale, with Gaussian errors, is a convenient one in which to work and general enough to encompass either count or presence/absence data. Inverting the log transform produces an alternative presentation such as \(e^M\), bounded only by zero, in which percentage change represents proportional change in abundance (count data) or the odds \(\{p/(1-p)\}\) of species presence, consistent with standard practice, e.g. the Farmland Bird Indicator (Freeman et al. 2001). Further, indicators are often presented relative to zero (log-scale) or 100% in a base year, usually the first in the series, requiring just a final straightforward rescaling.

2.2 Inference Based on the Kalman Filter

The formulation of Eqs. 14 can alternatively be summarised by the equations:

$$\begin{aligned} S_{s, {t+1}} | S_{s,t} \sim&\mathrm{N}(S_{s,t}+G_t, \tau _t^2) \end{aligned}$$
(7)
$$\begin{aligned} S_{s,1} \sim&\mathrm{N}(M_1, \tau _0^2) \end{aligned}$$
(8)
$$\begin{aligned} Y_{s,t} | S_{s,t} \sim&\mathrm{N}(S_{s,t}, \sigma _{s,t}^2). \end{aligned}$$
(9)

The model can be seen to be a stochastic exponential growth model (Holmes 2001) and also a form of Gompertz state-space model on the log scale (Besbeas and Morgan 2020). Due to the normal distributions assumed, the model specified is a linear Gaussian state-space model. This is efficiently fitted using the Kalman filter, a recursive process with filtering and smoothing described in, for example, Newman et al. (2014, p.64), and we provide details in Sect. 2 of the Supplementary material. In the notation outlined there, the prediction error decomposition form of the log likelihood for a single species is given in Eq. 10. The likelihood is easily formed and maximum-likelihood estimation is fast. Simplicity of computation is clear from the short Matlab program given in Besbeas and Morgan (2012). For species s, we denote the likelihood by \(\mathrm{L}_s\), and write

$$\begin{aligned} \log \mathrm{L}_s(\varvec{\theta };Y_{s,1}, \ldots , Y_{s,T}) = -\frac{T}{2}\log (2\pi ) - \frac{1}{2}\sum _{t=1}^{T}\left( \log |\mathbf {F}_t|+\mathbf {v}_t'\mathbf {F}_t^{-1}\mathbf {v}_t \right) , \end{aligned}$$
(10)

where \(\varvec{\theta }\) denotes the model parameters. The terms \(\mathbf {v}_t\) and \(\mathbf {F}_t\) are, respectively, the one-step forecast error of \(Y_{s,t}\) given all earlier observations, and its variance. They are calculated through the implementation of the Kalman filter; see Equation 1 of the Supplementary material. The overall log likelihood is then formed by summing the terms in Eq. 10 over s.

A more general modelling framework is provided using hidden Markov modelling. Both approaches deal easily with missing data, for instance, for the Kalman filter this is explained in Harvey (1996, p.144). See also Sect. 2 of the Supplementary material.

2.3 Bayesian Inference

For Bayesian analysis, the likelihood for \(\varvec{Y}\) with the parameters \(\varvec{\theta }\) is given formally by

$$\begin{aligned} \mathrm{L}(\varvec{\theta };\varvec{Y}) = \prod _{s=1}^n\prod _{t=1}^T \phi (Y_{s,t};S_{s,t}, \sigma ^2), \end{aligned}$$

where \(\phi (.)\) denotes the standard Normal probability density function, and the \(\{S_{s,t}\}\) result from the recursion of Eq. 2. While we have specified the model in terms of normal distributions, other options might be considered.

The model of Eqs. 14 is readily specified in a Bayesian context in the BUGS language and has been implemented in the R package ‘BRCindicators’ (August et al. 2020). Prior distributions were set for the error standard deviations, and all \(M_t\) (or alternatively \(G_t\)), respectively, gamma distributions or uniform distributions with a lower bound at zero for the former, and normal distributions with mean zero and large variance for the latter. Results have not been found sensitive to these choices. Sampling error, if it cannot be assumed as known, is estimated and any values of \(\{Y_{s,t}\}\) missed at random (at least with respect to the true rates of growth) are also automatically accounted for.

2.4 Smoothing

Conventionally, indices in single-species (Fewster et al. 2000) or compound (Freeman et al. 2001) form are generally smoothed for presentation. This is done to retain the signal of long-term changes and remove ephemeral influences, such as the effect of severe weather. Our approach makes the adoption of smoothing in situ straightforward. Many approaches to smoothing are available. The {\(G_t\)}, for example, are parameters to be estimated, and can be smoothed within the model. In a classical context, we use P-splines, with a linear function and, for most examples, 12 knots (Gimenez et al. 2006). Estimated variances of the MSI terms can be obtained using the delta method. In the paper, we demonstrate the behaviour of two alternatives, P-splines, as mentioned above, and the JAGS-based penalised spline formulation of Crainiceanu et al. (2005); growth is constrained such that, retaining the notation of the original paper:

$$\begin{aligned} G_t=m(t,\beta _1, \beta _2, \tau _\beta ) \end{aligned}$$
(11)

where the parameters of the function m, along with time t, represent those inducing the smoothing. (Note that we have omitted the additional variance term \(\sigma ^2_{eps}\) from the formula adopted by Crainiceanu et al. (2005), as we account for stochasticity via the other error terms in Eqs. 14). In the Bayesian approach, we need to select priors for these additional parameters, rather than M or G, and the number of knots that control the extent of smoothing imposed. Here too we follow Crainiceanu et al. (2005) and embed the code they provide directly into our own. Smoothing on the growth rates, rather than the indicator provides a convenient way to accommodate missing data, which are merely imputed. The indicators \(M_t\) and \(M'_t\) are naturally smoothed as a consequence.

Fewster et al. (2000) further adopted a classical method of estimating both standard errors and the second derivatives of their smoothed trends, via bootstrapping. Freeman et al. (2001) used this method for multispecies indicators; for each species in turn sites were resampled, with replacement, to produce a replicate bootstrapped trend for each species. These were then combined into a multispecies indicator, and the entire process repeated multiple times to produce a number of bootstrapped indicators, each containing one (and only one, different) trend for each species. Years in which the bootstrapped 95% CI of the second derivatives excluded zero were taken to indicate a significant change in direction or a significantly increased or reduced rate of change—acceleration, as distinct from velocity (Buckland et al. 2005; Fewster et al. 2000).

The smoothing of \(M_t\) directly, or alternatively growth rates \(G_t\) in Eq. 5, allows us to produce analogous estimates of second derivatives via a difference-equation approximation, e.g. using sixth differences where possible:

$$\begin{aligned} \frac{\mathrm{d}^2M_t}{\mathrm{d}t^2}\approx \frac{1}{180}\left( 2M_{t+3} - 27M_{t+2} + 270M_{t+1}-490M_t+270M_{t-1}-27M_{t-2}+2M_{t-3}\right) , \end{aligned}$$
(12)

and lower-order alternative difference approximations at the beginning and end of the series, where the calculation of Eq. 12 cannot be executed (see Fewster et al. 2000). In a Bayesian context, credible intervals immediately arise, and again by analogy we might take those years in which zero lies outside the 95% credible interval to indicate years in which the data suggest a notable increase/decrease or change in direction.

3 Simulation Study

To investigate the performance of the method, we simulated a set of data \(\{Y_{s,t}\}\) for 8 species over an assumed 30-year period, consistent with Eqs. 14 and using a randomly chosen sequence \(M_1 \ldots M_{30}\) and specified variance parameters. For simplicity, we assume a time-invariant constant \(\sigma \), both in generating data and in fitting the model, within which it is assumed unknown and estimated from the data. The model of Eqs. 14, implemented here in the Bayesian framework, provides an accurate reconstruction of the true indicator, \(M_t\) (Fig. 1a,b), and with complete data is virtually indistinguishable from the traditional geometric mean method.

Fig. 1
figure 1

Simulation results — multispecies indicators: a M\(_t\) (solid line), based on all data and shown on original log scale, with 95% credible intervals; b exp(\(M_t\)), relative to baseline value of 1.0 in year 1 after transformation; c exp(\(M_t\)) with lowest 20% of data omitted geometric mean of remaining data plotted in closed circles; d Smoothed version of exp(\(M_t\)), all data. True values used in simulation are plotted as open circles throughout

However, it will be clear that if particular species indices are unavailable in certain years, bias will ensue if the absent data are not missing at random but, for example, predominantly rare (or rarely encountered) species, as may be the case in practice. Figure 1c shows a repeat of the analysis of Fig. 1a,b but with those data with the lowest values omitted (20% of the whole) and regarded as missing.

It is clear that, as most missed values are late in the series for this set of largely declining species, the geometric mean of the remaining data underestimates the strength of the decline, while the growth-based alternative remains robust. Parameter values used in the simulations and a summary from a larger number of simulations are shown in Supplementary Figures 1 and 2.

We then applied the smoother of Eq. 11 to the entire dataset, with missing values reinstated. Priors for smoothing parameters were exactly as in Crainiceanu et al. (2005), and twelve knots were used throughout. The smoother accurately captures the changes in the indicator, and the species-specific components, whose growth rates are assumed normally distributed about {\(G_t\)} are also estimated accurately (Figs. 1d, 2).

Fig. 2
figure 2

Simulation results. True indices of abundance for eight species (open circles) and fitted values (solid line). For ease of comparison Figure 1d is repeated in the bottom right-hand corner

4 Application: UK Multispecies Indicators

We present four case studies to illustrate the flexibility of our method. In each case, we present results from the Bayesian implementation, but the results from the Kalman filter were indistinguishable.

4.1 UK Bats

There are 17 species of bat breeding in the UK. Annual indices for eight of these species, produced by the National Bat Monitoring Program, contribute to UK biodiversity indicators (Supplementary Table 1). The rest have limited data or are, in the case of several Myotis species, difficult to identify to species level. The data are based on counts from a variety of surveys, including direct observations at hibernation roosts and field surveys using bat detectors, which are combined into a single index for each species Barlow et al. (2015). Six of the eight species have index values for every year (1998–2014): the other two enter the time series in 2001 (Brown Long-eared) and 2002 (Natterer’s).

We fitted the model of Eqs. 14 to these data using JAGS. The indicator shows an increase over the duration of this relatively short series, and changes smoothly, with no significant second derivatives, as a consequence of the limited ability for rapid increase in species with low annual productivity (Fig. 3).

Fig. 3
figure 3

Observed (closed circles) and fitted (solid line) species indices for 8 British bat species. Corresponding (smoothed) multispecies indicators based on \(M_t\) and \(M^{\prime }_t\) are shown in bottom right-hand corner — indicator exp(\(M_t\)) is shown in black, geometric means exp(\(M^{\prime }_{t}\)) in grey (blue in online version), both with 95% credible intervals and each back-transformed and scaled to 1.0 in 1998, or individual species’ year of first encounter

4.2 UK Butterflies

There are 59 regularly occurring species of butterfly in the UK, and data from the UK Butterfly Monitoring Scheme (UKBMS) produces annual indices for 57 of these (Brereton et al. 2018). The data are primarily based on counts collected from standardised transects, as well as reduced effort methods including timed counts, larval web counts and egg counts. Since 2009, additional data have been contributed from the Wider Countryside Butterfly Survey (Brereton et al. 2011). Species-specific indices are derived using the approach of Dennis et al. (2016a), with an additional modification—see Brereton et al. (2018).

Indices from UKBMS data are typically separated into habitat specialists (26 species) and wider countryside species (24 species). The latter has only one species (Scotch Argus) that joins the series late, whereas the habitat specialists dataset is far more incomplete, with only ten species possessing an index value in every year. Another 15 species were added to the series between 1978 and 1995, and one species (Swallowtail) has data for every year except 1978. Full species lists for both datasets are listed in Supplementary Table 2.

We used the model of Eqs. 14 to produce both Bayesian smoothed indicators for both datasets; see Fig. 4. The fitted species indices (Supplementary Figures 3 and 4) demonstrate how the model smooths the population fluctuations of individual species’ time series.

Fig. 4
figure 4

Smoothed butterfly indicators, a habitat specialist species and b Wider countryside species. Indicator exp(\(M^{\prime }_t\)) is shown in black, geometric means exp(\(M^{\prime }_{t}\)) in grey (blue in online version), both with 95% credible intervals and each transformed to 1.0 in 1976. Filled circles indicate years in which the credible intervals of the second derivative of \(M_t\) do not include zero

For both butterfly models in Fig. 4, years are indicated in which the second derivative of the growth rates are markedly nonzero. Clearly, the trend for wider countryside species is considerably more variable over short-term periods, even after smoothing, although both have stabilised over the last decade. Over the duration of the series however, the decrease has been greater for the specialist species. For both datasets, there is little difference between \(M_t\) and \(M'_t\), although the geometric mean method \((M'_t)\) is very much more precise.

4.3 UK Dragonflies

Dragonfly distribution records have been collected by the Dragonfly Recording Network and coordinated by the British Dragonfly Society. Dragonflies are among the best-recorded insect taxa in the UK, after butterflies (Isaac and Pocock 2015; Powney et al. 2015). Although these data constitute presence-only records, species trends can be estimated using occupancy-detection models (MacKenzie et al. 2017). We took advantage of existing occupancy-detection models for UK dragonflies (Outhwaite et al. 2019), which had been fitted using methods described in Isaac et al. (2014) and Outhwaite et al. (2018). From these, we extracted occupancy estimates for each year 1970–2015, on the unbounded logit scale before calculation of the Bayesian MSI, which are shown in Supplementary Figure 5. For the multispecies indicator, we use the 38 species (listed in Supplementary Table 3) for which trends were estimated in the most recent Atlas of Dragonflies (Cham et al. 2014), with the exception of Irish Damselfly, which does not occur in Great Britain. The time series are complete except Small red-eyed Damselfly, for which the first estimate is in 2001, the year in which it was first recorded in Great Britain.

Fig. 5
figure 5

Dragonfly indicator, based upon occupancy data. Lines and symbols as Fig. 4

We fitted the smoothed multispecies model (Fig. 5, with fitted species trends in Supplementary Figure 5). A general increase in the multispecies indicators from the late 1980s is revealed, with several spells of rapid average expansion, but with a reversal of direction towards the very end of the series. Note that the indicators, based on changes in range, are smooth and very much more so, with a similar number of knots, than those in the abundance-based wider countryside species butterfly graph, though more similar to that for the habitat specialists. Fitted species trends, however, are noticeably less smooth than their counterparts in the analogous butterfly models.

5 Including Estimated Standard Errors

Individual species indices may arise with associated measures of uncertainty, for example in the form of standard errors \(\{se_{s,t}\}\), which may be obtained in different ways. These have been regarded as difficult to incorporate directly (Soldaat et al. 2017) but, in obvious notation, the proposed methodology allows us to model observation error variance \(\{\sigma ^2_{s,t}\}\) in terms of estimated standard error using various formulations, along a spectrum ranging from values fixed a priori to the estimation of entirely free parameters, e.g.:

$$\begin{aligned} \mathrm{[i]}\, \sigma ^2_{s,t}= & {} 0\\ \mathrm{[ii]}\, \sigma ^2_{s,t}= & {} se_{s,t}^2\\ \mathrm{[iii]}\, \sigma ^2_{s,t}= & {} c^2 se_{s,t}^2\\ \mathrm{[iv]}\, \sigma ^2_{s,t}= & {} \theta ^2, \end{aligned}$$

where \(\theta \) and c are unknown non-negative parameters to be estimated. Model iv, the most general, is our basic model of Eqs. 14.

Table 1 A comparison of the four alternative models for the observation error applied to the full set of UKBMS data, for the two subsets: wider countryside and habitat specialists, the dragonfly data and also the bat data

We can compare these alternative models relying solely on information in the data using classical inference (i.e. the Kalman filter). For example, in Table 1 we compare the above four models fitted to the UKBMS data, and the dragonfly data and models (i) and (iv) for the bat data. UKBMS and dragonfly data were smoothed via a P-spline with 12 knots, and bats with 5 knots (reflecting a shorter time series for this taxon). It is interesting that the best model for the UKBMS data is the model with a constant observation variance, suggesting that the estimated standard errors are in conflict with the model and the indices in this case, despite the use of a scaling factor to account for standardisation. This may be because the estimates of error at the species level are derived from one stage of a statistical model and are therefore subject to additional uncertainty that we have not accounted for; see Dennis et al. (2016a) for details. The results for bats are limited, as estimates of standard error at the species level are not available. For the dragonfly data, models (i) and (iv) give the same parameter estimates, as under model (iv) \(\theta \) is estimated on the boundary \(\theta =0\). This is because there is little variation between the indices in this application, and presumably whatever variation there is is being taken up through the system part of the model, rather than the observation process. Plugging in the estimated index standard errors directly, as in model (ii), therefore results in a large drop in maximised log likelihood. The best model is model (iii) as the scaling constant c can greatly reduce the estimated standard errors, while maintaining variation in the estimation precision of the indices. An interesting feature of the dragonfly data is that one species, Small Red-eyed Damselfly, was a late entrant, with data first recorded in 2001. However, the indices that we used were the result of the model of Outhwaite et al. (2019), which estimates missing data. If we use the model of this paper so that for that species we only use records from 2001, then there is no difference between models (iii) and (iv), as in each case \(\sigma _{s,t}\) is estimated on the boundary \(\sigma _{s,t}=0\).

6 Discussion

6.1 Advantages of the New Model

Use of a state-space model for an MSI meets the prevailing need for transparent, consistent and flexible summaries of data. It is based upon reasonable assumptions about variation in annual change between species. A key advantage is that it allows a degree of standardisation in the production of multispecies indicators, while permitting flexibility in the derivation of the single-species trends contributing to them. Notably, it is applicable either to trends derived from count data or to trends in occupancy. Note that combining trends in occupancy and abundance into a single MSI may not be appropriate, as changes in species’ abundance do not generally equate directly to changes in distribution size (Dennis et al. 2019).

The stochastic, state-space form allows separation of sampling errors and population stochasticity, with flexibility in the way that both are handled. The method is robust to values missed not at random and, specifically, it readily accommodates species for which appropriate data arrive only late in the series, either because these are recent, rapid colonists like the Small Red-eyed Damselfly or because data from the early period is considered inadequate (as in the UKBMS). The model improves upon the use of the geometric mean when data are missing, as seen in Fig. 1. Applied to three very different datasets, the model provides both a good fit to the observed species’ indices and an all-species indicator smooth enough to capture major changes and identify points of transition.

Smoothing in situ is a particular advantage where the available species indices are not smoothed (as in the UK butterflies) or where they contain a mixture of smoothed and unsmoothed data (as in the UK Priority Species Indicator, Eaton et al. 2015). Credible/confidence limits are immediately tractable, with or without smoothing, without the often prohibitive need for bootstrapping. Such limits for derivatives of the smoothed trend are equally straightforward, at least in a Bayesian context. Note that smoothing requires the user to define the number of knots, which is effectively a free parameter. We followed Crainiceanu et al. (2005) and Fewster et al. (2000) in selecting the number of knots in our examples, but the resulting MSIs were very similar unless the number of knots was changed radically (results not shown).

The approach also permits the direct incorporation of existing estimates of observation error. However, our results suggest that while this can be an improvement over disregarding such errors altogether, it is not always optimal. An important finding is that plugging in estimated standard errors should be carefully evaluated, as shown in Table 1.

Our approach is made possible by assuming that species’ growth rates are normally distributed around some global average (Eq. 4). It is possible to envisage circumstances in which this assumption might be violated, leading to directional bias in the multispecies indicator. As with any complex model, we advocate post hoc diagnosis of the assumptions, e.g. by plotting the distribution of species-specific growth rates \(\{R_{s,t}\}\). Violations of the normality assumption could be addressed by the addition of extra terms (e.g. via a mixture model).

Of the two alternative forms considered, that based on averages of the fitted species trends (\(M'_t\)) is more precise than that based on growth rates (\(M_t\)). Imprecision of (\(M'_t\)) reflects only the uncertainty in individual species’ status, whereas (\(M_t\)) additionally reflects the fact that species in the dataset are assumed to be indicative of a wider set of species. The reality for most indicators is that species within are a highly non-random selection, included solely on the basis of data availability. This is exemplified by the case studies, which include the vast majority of bat, butterfly and dragonfly species in GB: the omissions are either rare, localised, cryptic or otherwise lacking in data. In these cases, \(M'_t\) seems an appropriate choice. The alternative \(M_t\) might be preferred where the species selected are a small fraction of the eligible set, and where extrapolation to the wider set of species is desirable. Illustrations based upon smaller subsets of data are given in Supplementary Figures 6-8.

6.2 Further Model Extensions and Applications

Standardising to a baseline, standard practice among biodiversity indicators, may have undesirable consequences. First, it conveys the impression that precision is greatest near the start of the series, typically the period with fewest data and therefore the lowest precision (e.g. the UKBMS contained just 30 sites in 1976, but >1000 in the last two decades). Second, it can lead to uncertainty growing over time; hence, counter-intuitively it becomes progressively harder to detect ‘significant’ change even as data increase (Gregory et al. 2019). Although we have followed this convention, minor modification would permit an indicator to be presented with any alternative baseline year, and/or uncertainty in the baseline year, at little additional computational cost.

We note, too, that the multispecies model of Eqs. 16 extends naturally to the initial estimation of growth rates and indices for a single species from data at multiple sites. Such data have format identical to that of the indices modelled in Eqs. 14, the subscript s merely indicates the site rather than species. If therefore \(Y_{s,t}\) were redefined in such a situation as the count (say) of a particular species at site s in year t, application of the method produces \(M_t\) and \(M'_t\) that serve as indicators of average change in growth over the sites at which the species was observed. As the count in a raw data set will be a non-negative integer, often small or zero in practice, it is rational to amend Eq. 1 such that either

$$\begin{aligned} \mathrm{log}(1+Y_{s,t} )\sim N(S_{s,t}, \sigma ^2_{s,t}), \end{aligned}$$

or

$$\begin{aligned} Y_{s,t} \sim \mathrm{Poisson} \{\mathrm{exp}(S_{s,t})\}. \end{aligned}$$

As the \(\varvec{Y}\) now represent absolute counts (rather than relative indices) the method would proceed without any standardisation to an arbitrary value in a base year, and the alternative indicator in Eq. 6 is perhaps more rationally replaced by the conventional arithmetic mean (or sum) of the fitted counts. In this context, the value is that the method does not make the assumption that the trend is the same at all sites, as frequently done to produce species indices used for UK indicators, or determined via a spatial covariate (Freeman and Newson 2008; Harrison et al. 2014; Dennis et al. 2016b). The model is thereby more flexible, and estimated trends for different sites can be meaningfully compared. These estimated species-specific \(M_t\) or \(M'_t\) could then be naturally combined into a multispecies indicator by treating them now as data and applying Eqs. 16. It remains, however, fundamental to the value of the approach that it is not necessary for all species indices combined into an indicator to have been initially calculated the same way, only that each is considered proportional, as appropriate, to abundance or odds of occupancy.

The model of the paper has been fitted using both MCMC and maximum-likelihood. Results show excellent agreement, providing a check of the methods. General advice on selecting a model-fitting procedure is provided by Newman et al. (2014, p. 80). The MCMC implementation has been incorporated into the R package ‘BRCindicators’ (August et al. 2020). Example code for both implementations is available as supplementary material.