Skip to main content

Advertisement

Log in

Disease dynamics in wild populations: modeling and estimation: a review

  • Review
  • Published:
Journal of Ornithology Aims and scope Submit manuscript

Abstract

Models of infectious disease dynamics focus on describing the temporal and spatial variations in disease prevalence, and on understanding the factors that affect how many cases will occur in each time period and which individuals are likely to become infected. Classical methods for selecting and fitting models, mostly motivated by human diseases, are almost always based solely on raw counts of infected and uninfected individuals. We begin by reviewing the main classical approaches to parameter estimation, and some of their applications. We then review recently developed methods which enable representation of component processes such as infection and recovery, with observation models that acknowledge the complexities of the sampling and detection processes. We demonstrate the need to account for detectability in modeling disease dynamics, and explore a number of mark–recapture and occupancy study designs for estimating disease parameters while simultaneously accounting for variation in detectability. We highlight the utility of different modeling approaches and also consider the typically strong assumptions that may actually serve to limit their utility in general application to the study of disease dynamics (e.g., assignment of individuals to discrete disease states when underlying state space is more generally continuous; transitions assumed to be simple first-order Markov; temporal separation of hazard and transition events).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. or in structured population models by the contact matrix \((\varvec{\beta}_{ij})\), but for clarity we limit the exposition to unstructured models.

  2. In fact, the two approaches are conceptually equivalent, with the primary differences being (1) the handling of losses on capture, and (2) whether the estimation conditions on first encounter (e.g., Link and Barker 2005) or not (e.g., Schwarz and Arnason 1996).

  3. Most studies that have used MSMR models have referred to finite state transition probabilities from susceptible to infected states as force of infection, either explicitly (Lachish et al. 2007; Ozgul et al. 2009) or implicitly (Faustino et al. 2004). However, Arnason–Schwarz MSMR models estimate the probability that an individual will be infected at time t + 1 given that it was not infected (susceptible) at time t and survived from \(t \rightarrow t+1\). This interpretation differs from the usual definition of FOI as an instantaneous rate (vs. "Appendix A"). It is possible to write MSMR models in terms of instantaneous rates (see the paper by Conn et al., this issue)

  4. see Dupuis (1995) for a similar analysis of a multi-stare mark–reencounter model with a specified number of hidden states.

  5. However, as rapid evolution and coevolution in host and pathogen populations become more widely recognized, e.g., Altizer et al. 2003, these assumptions are increasingly questionable in many systems. In particular, for pathogens with short generation times and high mutation rates, changes can occur very rapidly.

  6. Despite the apparent efforts of Emmanuelle Cam and her colleagues to do nearly that with Kittiwakes (Rissa tridactyla) breeding on the Brittany coast.

References

  • Altizer S, Nunn C, Thrall P, Gittleman J, Ezenwa V, Pulliam J, Pedersen A, Dobson A, Poss M, Cunningham A, Antonovics J, Jones K (2003) Social organization and disease risk in mammals: integrating theory and empirical studies. Annu Rev Ecol Evol S 34:517–547

    Google Scholar 

  • Altizer S, Hochachka W, Dhondt AA (2004) Seasonal dynamics of mycoplasmal conjunctivitis in eastern north american house finches. J Anim Ecol 73:309–322

    Google Scholar 

  • Altizer S, Dobson AP, Hosseini P, Hudson P, Pascual M, Rohani P (2006) Seasonality and the dynamics of infectious diseases. Ecol Lett 9:467–484

    PubMed  Google Scholar 

  • Anderson RM, May RM (1992) Infectious diseases of humans, dynamics and control. Oxford University Press, Oxford

    Google Scholar 

  • Andersson H, Britton T (2000) Stochastic epidemic models and their statistical analysis, lecture notes in statistics, vol 151. Springer, New York

    Google Scholar 

  • Arnason AN (1973) The estimation of population size, migration rates, and survival in a stratified population. Res Popul Ecol 15:1–8

    Google Scholar 

  • Barker RJ, White GC (2004) Towards the ‘mother-of-all-models’: customized construction of the mark-recapture likelihood function. Anim Biodivers Conserv 27:177–185

    Google Scholar 

  • Barker RJ, Cooch EG, Schwarz CJ (2002) Discussion comments on: approaches for the direct estimation of λ and demographic contributions to λ using capture-recapture data. J Appl Stat 29:569–572

    Google Scholar 

  • Beal MJ, Ghahramani Z, Rasmussen CE (2002) The infinite hidden Markov model. In: Dietteric T, Becker S, Ghahramani Z (eds) Advances in neural information processing systems 14. MIT Press, Cambridge

    Google Scholar 

  • Becker NG (1989) Analysis of infectious disease data. Monographs on statistics and applied probability. Chapman and Hall, London

    Google Scholar 

  • Benyoussef L, Carincotte C, Derrod S (2008) Extension of higher-order HMC modeling with application to image segmentation. Digit Signal Process 18:849–860

    Google Scholar 

  • Besbeas P, Freeman SN, Morgan BJT, Catchpole EA (2002) Integrating mark-recapture-recovery and census data to estimate animal abundance and demographic parameters. Biometrics 58:540–547

    CAS  PubMed  Google Scholar 

  • Besbeas P, Freeman SN, Morgan BJT (2005) The potential of integrated population modelling. Aust NZ J Stat 47:35–48

    Google Scholar 

  • Bjørnstad O, Grenfell BT (2005) Hazards, spatial transmission and timing of outbreaks in epidemic metapopulations. Environ Ecol Stat 15:265–277

    Google Scholar 

  • Borchers DL, Buckland ST, Zucchini W (2002) Estimating animal abundance—closed populations. Statistics for biology and health. Springer, Berlin

    Google Scholar 

  • Borysiewicz RS, Morgan BJT, Hénaux V, Bregnballe T, Lebreton JD, Gimenez O (2009) An integrated analysis of multisite recruitment, mark-recapture-recovery and multisite census data. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, UK, pp 579–591

    Google Scholar 

  • Bradshaw CJA, Brook BW (2005) Disease and the devil: density-dependent epidemiological processes explain historical population fluctuations in the Tasmanian devil. Ecography 28:181–190

    Google Scholar 

  • Bretó C, He D, Ionides EL, King AA (2008) Time series analysis via mechanistic models. Ann Stat 3:319–348

    Google Scholar 

  • Brooks SP, King R, Morgan BJT (2004) A Bayesian approach to combining animal abundance and demographic data. Anim Biodivers Conserv 21:515–529

    Google Scholar 

  • Brown BW, Monfort A, van Dijk H (eds) (1993) Econometric inference using simulation techniques. Wiley, New York

    Google Scholar 

  • Brownie C, Hines JE, Nichols JD, Pollock KH, Hestbeck JB (1993) Capture-recapture studies for multiple strata including non-Markovian transitions. Biometrics 49:1173–1187

    Google Scholar 

  • Bruno JF, Ellner SP, Vu I, Kim K, Harvell CD (2010) Impacts of aspergillosis on sea fan coral demography: modeling a moving target. Ecol Monogr (in press)

  • Buckland ST, Newman KB, Thomas L, Koesters NB (2004) State-space models for the dynamics of wild animal populations. Ecol Model 171:157–175

    Google Scholar 

  • Buckland ST, Newman KB, Fernández C, Thomas L, Harwood J (2007) Embedding population dynamics models in inference. Stat Sci 22:44–58

    Google Scholar 

  • Burnham KP (1993) A theory for combined analysis of ring recovery and recapture data. In: Lebreton JD, North P (eds) Marked individuals in the study of bird population. Birkhauser, Basel, pp 199–213

    Google Scholar 

  • Burthe S, Telfer S, Begon M, Bennett M, Smith A, Lambin X (2008) Cowpox virus infection in natural field vole Microtus agrestis populations: significant negative impacts on survival. J Anim Ecol 77:110–119

    PubMed  PubMed Central  Google Scholar 

  • Buzby M, Neckels D, Antolin MF, Estep D (2008) Analysis of the sensitivity properties of a model of vector-borne bubonic plague. J R Soc Interface 5:1099–1107

    PubMed  PubMed Central  Google Scholar 

  • Caley P, Marion G, Hutchings MR (2009) Assessment of transmission rates and routes, and the implications for management. In: Delahey RJ, Smith GC, Hutchings MR (eds) Management of disease in wild mammals. Springer, New York, pp 31–51

    Google Scholar 

  • Cam E (2009) Contribution of capture-mark-recapture modeling to studies of evolution by natural selection. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 83–129

    Google Scholar 

  • Cam E, Link WA, Cooch EG, Monnat JY, Danchin E (2002) Individual covariation in life-history traits: seeing the trees despite the forest. Am Nat 159:96–105

    PubMed  Google Scholar 

  • Cappé O, Moulines E, Rydén T (2005) inference in hidden Markov models. Springer series in statistics. Springer, New York

    Google Scholar 

  • Caswell H (2001) Matrix population models—construction, analysis, and interpretation, 2nd edn. Sinauer, Sunderland

    Google Scholar 

  • Cauchemez S, Ferguson NM (2008) Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in london. J R Soc Interface 5:885–889

    PubMed  PubMed Central  Google Scholar 

  • Cauchemez S, Valleron AJ, Boelle PY, Flahault A, Ferguson NM (2008) Estimating the impact of school closure on influenza transmission from sentinel data. Nat Biotechnol 452:750–756

    CAS  Google Scholar 

  • Challenger WO, Schwarz CJ (2009) Mark-recapture Jolly-Seber abundance estimation with classification uncertainty. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 827–844

    Google Scholar 

  • Chao A, Huggins RM (2005) Modern closed-population capture-recapture models. In: Amstrup SC, McDonald TL, Manly BFJ (eds) Handbook of capture-recapture analysis. Princeton University Press, Princeton, pp 58–86

    Google Scholar 

  • Chao A, Tsay PK, Lin SH, Shau WY, Chao DY (2004) The applications of capture-recapture models to epidemiological data. In: D’Agostino RB (eds) Statistical methods in clinical studies, tutorials in biostatistics, vol 1. Wiley, New York, pp 31–66

    Google Scholar 

  • Childs DZ, Rees M, Rose KE, Grubb PJ, Ellner SP (2004) Evolution of size-dependent flowering in a variable environment: construction and analysis of a stochastic integral projection model. Proc R Soc Lond B 271:425–434

    Google Scholar 

  • Ching WK, Fung ES, Ng MK (2003) Higher-order hidden Markov models with applications to DNA sequences. In: Intelligent data engineerings and automated learning, lecture notes in computer science, Springer, Berlin, pp 535–539

  • Choisy M, Rohani P (2006) Harvesting can increase the severity of wildlife disease epidemics. Philos Trans R Soc Lond B 273:2025–2034

    Google Scholar 

  • Choquet R, Rouan L, Pradel R (2009) Program E-SURGE: a software application for fitting multievent models. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 845–866

    Google Scholar 

  • Chowell G, Rivas AL, Hengartner NW, Hyman JM, Castillo-Chavez C (2006) The role of spatial mixing in the spread of foot-and-mouth disease. Prev Vet Med 73:297–314

    CAS  PubMed  Google Scholar 

  • Clark JS, Bjørnstad O (2004) Population time series: process variability, observation errors, missing values, lags, and hidden states. Ecol Lett 85:3140–3150

    Google Scholar 

  • Clark JS, Gelfland AE (2006) A future for models and data in environmental science. Trends Ecol Evol 21:375–380

    PubMed  Google Scholar 

  • Cochran ME, Ellner SP (1992) Simple methods for calculating age-based life history parameters for stage-structured populations. Ecol Monogr 62:345–364

    Google Scholar 

  • Conn PB, Cooch E (2009) Multistate capture-recapture analysis under imperfect state observation: an application to disease models. J Appl Ecol 46:486–492

    Google Scholar 

  • Conner MM, McCarty CW, Miller MMW (2000) Detection of bias in harvest-based estimates of chronic wasting disease prevalence in mule deer. J Wildl Dis 36:691–699

    CAS  PubMed  Google Scholar 

  • Craft ME, Hawthorne PL, Packer C, Dobson AP (2008) Dynamics of a multihost pathogen in a carnivore community. J Anim Ecol 77:1257–1264

    CAS  PubMed  Google Scholar 

  • Daley DJ, Gani J (2001) Epidemic modelling: an introduction. Cambridge University Press, Cambridge

    Google Scholar 

  • Delahay, R, Smith, G, Hutchings, M (eds) (2009) Management of disease in wild mammals. Springer, New York

    Google Scholar 

  • de Castro F, Bolker BM (2005) Mechanisms of disease-induced extinction. Ecol Lett 8:117–126

    Google Scholar 

  • Dethlefsen C, Lundbye-Christensen S, Christensen AL (2009) SSPIR: State Space Models in R. http://CRAN.R-project.org/package=sspir, R package version 0.2.7

  • de Jong M, Diekmann O, Heesterbeek H (1995) How does transmission of infection depend on population size?. In: Mollison D (eds) Epidemic models: their structure and relation to data. Cambridge University Press, Cambridge, pp 84–94

    Google Scholar 

  • de Jong M, Bouma A, Diekmann O, Heesterbeek H (2002) Modeling transmission: mass action and beyond. Trends Ecol Evol 17:64

    Google Scholar 

  • Diekmann O, Heesterbeek JAP (2000) Mathematical epidemiology of infectious diseases: model buidling, analysis, and interpretation. Wiley series in mathematical and computational biology. Wiley, New York

    Google Scholar 

  • du Preez JA (1998) Efficient training of high-order hidden Markov models using first-order representations. Comput Speech Lang 12:23–29

    Google Scholar 

  • du Preez JA, Weber DM (1999) The integration and training of arbitrary-order HMMs. Austr J Intel Inf Proc Sys (ICSLP98 Special Issue) 5:261–268

    Google Scholar 

  • Dupuis J (1995) Bayesian estimation of movement and survival probabilities from capture-recapture data. Biometrics 82:761–772

    Google Scholar 

  • Dupuis JA, Schwarz CJ (2007) A Bayesian approach to the multistate Jolly-Seber capture-recapture model. Biometrics 63:1015–1022

    PubMed  Google Scholar 

  • de Valpine P (2004) Better inferences from population-dynamics experiments using Monte Carlo state-space likelihood methods. Ecol Lett 84:3064–3077

    Google Scholar 

  • de Valpine P, Hilborn R (2005) State-space likelihoods for nonlinear fisheries time-series. Can J Fish Aquat Sci 62:1937–1952

    Google Scholar 

  • Easterling MR, Ellner SP, Dixon PM (2000) Size-specific sensitivity: applying a new structured population model. Ecol Lett 81:694–708

    Google Scholar 

  • Efford MG (2004) Density estimation in live-trapping studies. Oikos 106:598–610

    Google Scholar 

  • Efford MG, Borchers DL, Byrom AE (2009) Density estimation by spatially explicit capture-recpature: likelihood-based methods. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 255–269

    Google Scholar 

  • Ellner SP, Rees M (2006) Integral projection models for species with complex demography. Am Nat 167:410–428

    PubMed  Google Scholar 

  • Ellner SP, Rees M (2007) Stochastic stable population growth in integral projection models: theory and application. J Math Biol 54:227–256

    PubMed  Google Scholar 

  • Ellner SP, Bailey BA, Bobashev GV, Gallant AR, Grenfell BT, Nychka DW (1998) Noise and nonlinearity in measles epidemics: combining mechanistic and statistical approaches to population modeling. Am Nat 151:425–440

    CAS  PubMed  Google Scholar 

  • Engelbrecht H, du Preez JA (2009) Efficient backward decoding of high-order hidden Markov models. Pattern Recogn Lett 43:99–112

    Google Scholar 

  • Faustino CR, Jennelle CS, Connolly V, Davis AK, Swarthout EC, Dhondt AA, Cooch EG (2004) Mycoplasma gallisepticum infection dynamics in a house finch population: seasonal variation in survival, encounter and transmission rate. J Anim Ecol 73:651–669

    Google Scholar 

  • Fenichel EP, Horan RD (2007) Jointly-determined ecological thresholds and economic trade-offs in wildlife disease management. Nat Resour Model 20:511–547

    Google Scholar 

  • Fine P, Clarkson J (1982) Measles in england and wales-1: an analysis of factors underlying seasonal patterns. Int J Epidemiol 11:5–14

    CAS  PubMed  Google Scholar 

  • Finkenstädt BF, Grenfell BT (2000) Time series modelling of childhood diseases: a dynamical systems approach. J R Stat Soc C 49:187–205

    Google Scholar 

  • Fletcher D, Efford MG (2009) Effect of senescence on estimation of survival probability when age is unknown. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 1037–1053

    Google Scholar 

  • Gallant AR, Tauchen G (1996) Which moments to match?. Econ Theory 12:657–681

    Google Scholar 

  • Gibson GJ, Renshaw E (1998) Estimating parameters in stochastic compartmental models using Markov chain methods. IMA J Math Appl Med 15:19–40

    Google Scholar 

  • Gimenez O, Rossi V, Choquet R, Dehais C, Doris B, Varella H, Villa JP, Pradel R (2007) State-space modelling of data on marked individuals. Ecol Model 206:431–438

    Google Scholar 

  • Gollery M (2008) Handbook of hidden Markov models in bioinformatics. Chapman and Hall, London

    Google Scholar 

  • Gouriéroux C, Monfort A (1996) Simulation-based econometric methods. CORE lectures. Oxford University Press, Oxford

    Google Scholar 

  • Grassly NC, Fraser C (2006) Seasonal infectious disease epidemiology. Proc R Soc Lond B 273:2541–2550

    Google Scholar 

  • Grenfell B, Bjørnstad O, Kappey J (2001) Travelling waves and spatial hierarchies in measles epidemics. Nat Biotechnol 414:716–723

    CAS  Google Scholar 

  • Grenfell BT, Dobson AP (1995) Ecology of infectious diseases in natural populations. Cambridge University Press, Cambridge

    Google Scholar 

  • Hansen L, Scheinkman J (1995) Back to the future: generating moment implications for continuous-time Markov models. Econometrika 63:767–804

    Google Scholar 

  • Hawley DM, Davis AK, Dhondt AA (2007) Transmission-relevant behaviours shift with pathogen infection in wild house finches (Carpodacus mexicanus). Can J Zool 85:752–757

    Google Scholar 

  • Hawley DM, Jennelle CS, Sydenstricker KV, Dhondt AA (2007) Pathogen resistance and immunocompetence covary with social status in house finches. Funct Ecol 21:520–527

    Google Scholar 

  • Hedrich AW (1933) Monthly estimates of the child population ’susceptible’ to measles, 1900–1931, baltimore, md. Am J Hyg 17:613–636

    Google Scholar 

  • Hestbeck JB, Nichols JD, Malecki RA (1991) Estimates of movement and site fidelity using mark-resight data of wintering canada geese. Ecol Lett 72:523–533

    Google Scholar 

  • Hochachka WM, Dhondt AA (2006) House finch (Carpodacus mexicanus) population- and group-level responses to bacterial disease. Ornithol Monogr 60:30–43

    Google Scholar 

  • Hooker G, Xiao L (2009) CollocInfer: collocation inference for continuous-time dynamic systems. http://www.bscb.cornell.edu/hooker/CollocInfer, R package version 0.0.2

  • Hosseini PR, Dhondt AA, Dobson AP (2004) Seasonality and wildlife disease: how season birth, aggregation and variation in immunity affect the dynamics of Mycoplasma gallisepticum in house finches. Proc R Soc Lond B 271:2569–2577

    Google Scholar 

  • Hudson PJ, Rizzoli A, Grenfell BT, Hesterbeek H, Dobson AP (2002) The ecology of wildlife diseases. Oxford University Press, Oxford

    Google Scholar 

  • Ionides EI, Breto C, King AA (2006) Inference for nonlinear dynamical systems. Proc Natl Acad Sci USA 103:18,438–18,443

    CAS  Google Scholar 

  • Jackson C (2009) msm: multi-state Markov and hidden Markov models in continuous time. http://CRAN.R-project.org/package=msm, R package version 0.9.2

  • Jennelle CS, Cooch EG, Conroy MJ, Senar JC (2007) State-specific detection probabilities and disease prevalence. Ecol Appl 17:204–167

    Google Scholar 

  • Joe M, Pollock KH (2002) Separation of survival and movement rates in multi-state tag-return and capture-recapture models. J Appl Stat 29:373–384

    Google Scholar 

  • Keeling M, Bjørnstad O, Grenfell B (2004) Metapopulation dynamics of infectious diseases. In: Hanski I, Gaggiotti O (eds) Ecology, genetics, and evolution of metapopulations. Elsevier, Amsterdam, pp 415–445

    Google Scholar 

  • Keeling MJ, Gilligan CA (2000) Bubonic plague: a metapopulation model of a zoonosis. Proc R Soc Lond B 267:2219–2230

    CAS  Google Scholar 

  • Keeling MJ, Rohani P (2007) Modeling infectious diseases in humans and animals. Princeton University Press, Princeton

    Google Scholar 

  • Kendall WL (2009) One size does not fit all: adapting mark-recapture and occupancy models for state uncertainty. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 765–780

    Google Scholar 

  • Kendall WL, Pollock KH, Brownie C (1995) A likelihood-based approach to capture-recapture estimation of demographic parameters under the robust design. Biometrics 51:293–308

    CAS  PubMed  Google Scholar 

  • Kendall WL, Langtimm CA, Beck CA, Runge MC (2004) Capture-recapture analysis for estimating manatee reproductive rates. Mar Mammal Sci 20:424–437

    Google Scholar 

  • King AA, Ionides EL, Pascual M, Bouma MJ (2008) Inapparent infections and cholera dynamics. Nat Biotechnol 454:877–U29

    CAS  Google Scholar 

  • King AA, Ionides EL, Bretó CM, Ellner S, Kendall B (2009) pomp: statistical inference for partially observed Markov processes (R package). http://pomp.r-forge.r-rproject.org

  • Korobeinikov A, Maini PK, Walker WJ (2008) Non-linear transmission, mortality and stability of discrete-time infectious disease models. In: Wilson LB (eds) Mathematical biology research trends, mathematical biology research trends. Nova, Hauppauge, pp 237–259

    Google Scholar 

  • Koski T (2002) Hidden Markov models for bioinformatics computational biology, vol 2. Springer, New York

    Google Scholar 

  • Lachish S, Jones M, McCallum H (2007) The impact of disease on the survival and population growth rate of the Tasmanian devil. J Anim Ecol 76:926–936

    PubMed  Google Scholar 

  • Lande R (1987) Extinction thresholds in demographic models of territorial populations. Am Nat 130:624–635

    Google Scholar 

  • Lawson AB (2009) Bayesian disease mapping—hierarchical modeling in spatial epidemiology. Interdisciplinary statistics. CRC, Boca Raton

    Google Scholar 

  • Lebreton JD (2009) Assessing density-dependence: where are we left?. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 19–41

    Google Scholar 

  • Lebreton JD, Pradel R (2002) Multistate recapture models: modeling incomplete individual histories. J Appl Stat 29:353–369

    Google Scholar 

  • Link WA, Barker RJ (1994) Density estimation using the trapping web design: a geometric analysis. Biometrics 50:733–745

    Google Scholar 

  • Link WA, Barker RJ (2005) Modeling association among demographic parameters in analysis of open population capture-recapture data. Biometrics 61:46–54

    PubMed  Google Scholar 

  • Link WA, Cooch EG, Cam E (2002) Model-based estimation of individual fitness. J Appl Stat 29:207–224

    Google Scholar 

  • MacKenzie DI (2009) Getting the biggest bang for our conservation buck. Trends Ecol Evol 24:175–177

    PubMed  Google Scholar 

  • MacKenzie DI, Nichols JD, Hines JE, Knutson MG, Franklin AB (2003) Estimating site occupancy, colonization, and local extinction when a species is detected imperfectly. Ecol Lett 84:2200–2207

    Google Scholar 

  • MacKenzie DI, Bailey LL, Nichols JD (2004) Investigating species co-occurrence patterns when species are detected imperfectly. J Anim Ecol 73:546–555

    Google Scholar 

  • MacKenzie DI, Nichols JD, Royle JA, Pollock KH, Hines JE, Bailey LL (2005) Occupancy estimation and modeling: inferring patterns and dynamics of species occurrence. Elsevier, Amsterdam

    Google Scholar 

  • MacKenzie DI, Nichols JD, Seamans ME, Gutierrez RJ (2009) Modeling species occurrence dynamics with multiple states and imperfect detection. Ecol Lett 90:823–835

    Google Scholar 

  • Magle SB, McClintock BT, Tripp DW, White GC, Antolin MF, Crooks KR (2007) Mark-resight methodology for estimating population densities for praire dogs. J Wildl Manage 71:2067–2073

    Google Scholar 

  • Marcos MM, Almaraz P, J A Sánchez-Zapata JA (2008) An emerging infectious disease triggering large-scale hyperpredation. PLoS One 3:e2307

    Google Scholar 

  • Maunder MN, Skaug HJ, Fournier DA, Hoyle SD (2009) Comparison of fixed effect, random effect and hierachical Bayes estimators for mark-recapture data using AD model builder. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 917–946

    Google Scholar 

  • Mausner JS, Bahn AK (1974) Epidemiology: an introductory text. Saunders, Philadeiphia

    Google Scholar 

  • McCallum H, Dobson A (1995) Detecting disease and parasite threats to endangered species and ecosystems. Trends Ecol Evol 10:190–194

    CAS  PubMed  Google Scholar 

  • McCallum H, Barlow N, Hone J (2001) How should pathogen transmission be modelled?. Trends Ecol Evol 16:295–300

    CAS  PubMed  Google Scholar 

  • McCallum H, Barlow N, Hone J (2002) Modelling transmission: mass action and beyond—response from McCallum, Barlow and Hone. Trends Ecol Evol 17:64–65

    Google Scholar 

  • McCallum H, Tompkins DM, Jones M, Lachish S, Marvanek S, Lazenby B, Hocking G, Wiersma J, Hawkins CE (2007) Distribution and impacts of Tasmanian devil facial tumor disease. Econ Health 4:318–325

    Google Scholar 

  • McClintock BT, White GC (2009) A less field-intensive robust design for estimating demographic parameters with mark-resight data. Ecol Lett 90:313–320

    Google Scholar 

  • McClintock BT, White GC, Antolin MF, Tripp DW (2009) Estimating abundance using mark-resight when sampling is with replacement or the number of marked individuals is unknown. Biometrics 65:237–246

    PubMed  Google Scholar 

  • McClintock BT, White GC, Burnham KP, Pryde MA (2009) A generalized mixed effects model of abundance for mark-resight data when sampling is without replacement. In: DL Thomson EC, Conroy M (eds) Modeling demographic processes in marked populations. Springer, New York, pp 271–289

    Google Scholar 

  • McClintock BT, Nichols JD, Bailey L, MacKenzie DI, Kendall W, Fran (2010) Seeking a second opinion: uncertainty in disease ecology. Ecol Lett 13:659–674

  • Miller MW, Hobbs NT, Tavener SJ (2006) Dynamics of prion disease transmission in mule deer. Ecol Appl 16:2208–2214

    PubMed  Google Scholar 

  • Mochihashi D, Sumita E (2008) The infinite Markov model. In: Platt J, Koller D, Singer Y, Roweis S (eds) Advances in neural information processing systems 20. MIT Press, Cambridge, pp 1017–1024

    Google Scholar 

  • Morgan BJT (2007) A Kalman filter primer. J R Stat Soc A 170:506

    Google Scholar 

  • Morvai G, Weiss B (2005) Order estimation of Markov chains. IEEE T Inform Theory 51:1496–1497

    Google Scholar 

  • Murray KA, Skerratt LF, Spear R, McCallum H (2009) Impact and dynamics of disease in species threatened by the amphibian chytrid fungus, Batrachochytrium dendrobatidis. Conserv Biol 23:1242–1252

    PubMed  Google Scholar 

  • New LF, Matthiopoulos J, Redpath S, Buckland ST (2009) Fitting models of multiple hypotheses to partial population data: investigating the causes of cycles in red grouse. Am Nat 174:399–412

    PubMed  Google Scholar 

  • Newman KB, Buckland ST, Lindley ST, Thomas L, Fernández C (2006) Hidden process models for animal population dynamics. Ecol Appl 16:74–86

    CAS  PubMed  Google Scholar 

  • Newman KB, Fernandez C, Thomas L, Buckland ST (2009) Monte Carlo inference for state-space models of wild animal populations. Biometrics 65(2):572–583

    PubMed  Google Scholar 

  • Nichols JD, Hines JE (2002) Approaches for the direct estimation of λ, and demographic contributions to λ, using capture-recapture data. J Appl Stat 29:539–568

    Google Scholar 

  • Nichols JD, Hines JE, Lebreton JD, Pradel R (2000) Estimation of contributions to population growth: a reverse-time capture-recapture approach. Ecol Lett 81:3362–3376

    Google Scholar 

  • Nichols JD, Hines JE, Mackenzie DI, Seamans ME, Gutierrez RJ (2007) Occupancy estimation and modeling with multiple states and state uncertainty. Ecol Lett 88:1395–1400

    Google Scholar 

  • Oli MK, Venkataraman M, Klein PA, Wendland LD, Brown MB (2006) Population dynamics of infectious diseases: a discrete time model. Ecol Model 198:183–194

    Google Scholar 

  • Ozgul A, Oli MK, Boker BJ, Perez-Hyedrich C (2009) Upper respiratory tract disease, force of infection, and effects on survival of gopher tortoises. Ecol Appl 19:786–798

    PubMed  Google Scholar 

  • Patterson TA, Thomas L, Wilcox C, Ovaskainen O, Matthiopoulos J (2008) State-space models of individual animal movement. Trends Ecol Evol 23:87–94

    PubMed  Google Scholar 

  • Pledger S, Efford M, Pollock K, Collazo J, Lyons J (2009) Stopover duration analysis with departure probability dependent on unkown time since arrival. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 349–363

    Google Scholar 

  • Pradel R (1996) Utilization of capture-mark-recapture for the study of recruitment and population growth rate. Biometrics 52:703–709

    Google Scholar 

  • Pradel R (2005) Multievent: an extension of multistate capture-recapture models to uncertain states. Biometrics 61:442–447

    PubMed  Google Scholar 

  • Pradel R (2009) The stakes of capture-recapture models with state uncertainty. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 781–795

    Google Scholar 

  • Pradel R, Maurin-Bernier L, Gimenez O, Genovart M, Choquet R, Oro D (2008) Estimation of sex-specific survival with uncertainty in sex assessment. Can J Stat 36:29–42

    Google Scholar 

  • Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. P IEEE 77:257–286

    Google Scholar 

  • Ramsey D, Spencer N, Caley P, Efford M, Hansen K, Lam M, Cooper D (2002) The effects of reducing population density on contact rates between brushtail possums: implications for transmission of bovine tuberculosis. J Appl Ecol 39:806–818

    Google Scholar 

  • Roberts MG (1996) The dynamics of bovine tuberculosis in possum populations, and its eradication or control by culling or vaccination. J Anim Ecol 64:451–464

    Google Scholar 

  • Rosychuk RJ, Islam S (2009) Parameter estimation in a model for misclassified Markov data—a Bayesian approach. Comput Stat Data An 53(11):3805–3816

    Google Scholar 

  • Roweis S, Ghahramani Z (1999) A unifying review of linear Gaussian models. Neural Comput Appl 11:305–345

    CAS  Google Scholar 

  • Royle J, Dorazio R (2008) Hierarchical modeling and inference in ecology. Academic, New York

    Google Scholar 

  • Royle JA (2008) Modeling individual effects in the Cormack-Jolly-Seber model: a state-space formulation. Biometrics 64:364–370

    PubMed  Google Scholar 

  • Samuel MD, Takekawa JY, Baranyuk VV, Orthmeyer DL (1999) Effects of avian cholera on survival of Lesser Snow Geese Anser caerulescens: an experimental approach. Bird Study 46:239–247

    Google Scholar 

  • Schauber EM, Woolf A (2003) Chronic wasting disease in deer and elk: a critique of current models and their application. Wildl Soc B 31:610–616

    Google Scholar 

  • Schmidtmann I (2008) Estimating completeness in cancer registries—comparing capture-recapture methods in a simulation study. Biometrical J 50:1077–1092

    Google Scholar 

  • Schofield MR, Barker RJ (2008) A unified capture-recapture framework. J Agric Biol Environ S 13:458–477

    Google Scholar 

  • Schofield MR, Barker RJ (2009) A further step toward the ‘mother-of-all-models’: flexilibity and functionality in the modeling of capture-recapture data. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol .3. Springer, New York, pp 677–692

    Google Scholar 

  • Schofield MR, Barker RJ, MacKenzie DI (2009) Flexible hierarchical mark-recapture modeling for open populations using WinBUGS. Environ Ecol Stat 16:369–387

    Google Scholar 

  • Schwartz CC, Haroldson MA, White GC, Harris RB, Cherry S, Keating KA, Moody D, Servheen C (2006) Temporal, spatial, and environmental influences on the demographics of grizzly bears in the Greater Yellowstone ecosystem. Wildl Monogr 162:1–68

    Google Scholar 

  • Schwarz CJ (2002) Real and quasi-experiments in capture-recapture studies. J Appl Stat 29:459–473

    Google Scholar 

  • Schwarz CJ (2009) Migration and movement—the next stage. In: Thomson DL, Cooch EG, Conroy MJ (eds) Modeling demographic processes in marked populations, environmental and ecological statistics series, vol 3. Springer, New York, pp 323–348

    Google Scholar 

  • Schwarz CJ, Arnason AN (1996) A general methodology for the analysis of capture-recapture experiments in open populations. Biometrics 52:860–873

    Google Scholar 

  • Schwarz CJ, Schweigert JF, Arnason AN (1993) Estimating migration rates using tag-recovery data. Biometrics 49:177–193

    Google Scholar 

  • Senar JC, Conroy MJ (2004) Multi-state analysis of the impacts of avian pox on a population of serins (Serinus serinus): the importance of estimating recapture rates. Anim Biodivers Conserv 27:133–146

    Google Scholar 

  • Sisson SA, Fan Y, Tanaka MM (2007) Sequential Monte Carlo without likelihoods. Proc Natl Acad Sci USA 104:1760–1765

    CAS  PubMed  PubMed Central  Google Scholar 

  • Slattery SM, Alisauskas RT (2002) Use of the Barker model in an experiment examining covariate effects on first-year survival in Ross’s geese (Chen rossii): a case study. J Appl Stat 29:497–508

    Google Scholar 

  • Smith AA (1993) Estimating nonlinear time-series models using simulated vector autoregressions. J Appl Econ 8:S63–S84

    Google Scholar 

  • Smith GC, Cheeseman CL (2002) A mathematical model for the control of disease in wildlife populations: culling, vaccination and fertility control. Ecol Model 150:45–53

    Google Scholar 

  • Smith GC, Cheeseman CL, Clifton-Hadley RS (2001) A model of bovine tuberculosis in the badger Meles meles: an evaluation of control strategies. J Appl Ecol 38:509–519

    Google Scholar 

  • Smith T, Vounatsou P (2003) Estimation of infection and recovery rates for highly polymorphic parasites when detectability is imperfect, using hidden Markov models. Stat Med 22:1709–1724

    PubMed  Google Scholar 

  • Streftaris G, Gibson GJ (2004) Bayesian analysis of experimental epidemics of foot-and-mouth disease. Proc R Soc Lond B 271:1111–1117

    Google Scholar 

  • Tavecchia G, Besbeas P, Coulson T, Morgan BJT, Clutton-Brock TH (2009) Estimating population size and hidden demographic parameters with state-space modeling. Am Nat 173:722–733

    PubMed  Google Scholar 

  • Telfer S, Bennett M, Bown K, Cavanagh R, Crespin L, Hazel S, Jones T, Begon M (2002) The effects of cowpox virus on survival in natural rodent populations: increases and decreases. J Anim Ecol 71:558–568

    Google Scholar 

  • Thomas L, Buckland ST, Newman KB, Harwood J (2005) A unified framework for modelling wildlife population dynamics. Aust NZ J Stat 47:19–34

    Google Scholar 

  • Thompson K (2007) Use of site occupancy models to estimate prevalence of of myxobolus cerebralis infection in trout. J Aquat Anim Health 19:8–13

    PubMed  Google Scholar 

  • Titman AC, Sharples LD (2008) A general goodness-of-fit test for Markov and hidden Markov models. Stat Med 27:2177–2195

    PubMed  Google Scholar 

  • Titman AC, Sharples LD (2009) Model diagnostics for multi-state models. Stat Meth Med Res :1–31, doi:https://doi.org/10.1177/0962280209105541

  • Viallefont A, Auget JL (1999) Using capture-recapture models to estimate transition rates between states in interval-censored data. Rev Epidemiol Sante Publ 47:627–634

    CAS  Google Scholar 

  • van Boven M, Weissing FJ (2004) The evolutionary economics of immunity. Am Nat 163:277–294

    PubMed  Google Scholar 

  • Wang G (2007) On the latent state estimation of nonlinear population dynamics using Bayesian and non-Bayesian state-space models. Ecol Model 200:521–528

    Google Scholar 

  • Wasserberg G, Osnas EE, Rolley RE, Samuel MD (2009) Host culling as an adaptive management tool for chronic wasting disease in white-tailed deer: a modelling study. J Appl Ecol 46:457–466

    PubMed  Google Scholar 

  • Webb CT, Brooks CP, Gage KL, Antolin MF (2006) Classic flea-borne transmission does not drive plague epizootics in prairie dogs. Proc Natl Acad Sci USA 103:6236–6241

    CAS  PubMed  PubMed Central  Google Scholar 

  • Williams BK, Nichols JD, Conroy MJ (2002) Analysis and management of animal populations. Academic, New York

    Google Scholar 

  • Yip P, Bruno G, Tajima N, Seber G, Buckland S, Cormack R, Unwin N, Chang Y, Fienberg S, Junker B, Laporte R, Libman I, Mccarty D (1995) Capture-recapture and multiple-record systems estimation. 1. History and theoretical development. Am J Epidemiol 142:1047–1058

    Google Scholar 

  • Yip P, Bruno G, Tajima N, Seber G, Buckland S, Cormack R, Unwin N, Chang Y, Fienberg S, Junker B, Laporte R, Libman I, Mccarty D (1995) Capture-recapture and multiple-record systems estimation. 2. Applications in human diseases. Am J Epidemiol 142:1059–1068

    Google Scholar 

  • Zucchini W, Raubenheimer D, MacDonald IL (2008) Modeling time series of animal behavior by means of a latent-state model with feedback. Biometrics 64:807–815

    PubMed  Google Scholar 

Download references

Acknowledgments

We thank Jim Nichols and Stephen Baillie for inviting this contribution, and Bill Kendall and Michael Schaub for accommodating our intent to provide a broad overview of a large and complex topic (resulting in a clearly failed attempt at brevity). We thank Jim Nichols and Darryl MacKenzie for discussions of the applications of occupancy modeling to disease dynamics. Additional comments from Mary Conner, Andre Dhondt, Bill Kendall, Darryl MacKenzie, Brett McClintock, and Anne Viallefont greatly improved the paper. Funding for the first author was partially provided by NIH/NSF grant EF0622705.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Evan G. Cooch.

Additional information

Communicated by M. Schaub.

Appendices

Appendices

Appendix A. Classical disease models: a (very) short review

It is utterly implausible that a mathematical formula should make the future known to us, and those who think it can would once have believed in witchcraft…”—B. de Jouvenel

The literature on the mathematical modeling of disease dynamics is enormous—here, we provide only a brief introduction (very brief) to provide some of the necessary terminology and classical model structures. The reader is referred to Daley and Gani (2001) for a general introduction; see Diekmann and Heesterbeek (2000) and Keeling and Rohani (2007) for more mathematically advanced treatments.

Most disease models are constructed by subdividing the population into discrete divisions (‘compartments’) reflecting the underlying disease ‘state’ of the individual. Frequently, these states include individuals that may be susceptible to infection (S), those that are infected (I), and those who are ‘removed’ (R), either by virtue of having died, or potentially recovered with some degree of immunity. The dynamics of such an SIR-type system are governed by the rate of transition between states (Fig. 4).

Fig. 4
figure 4

Classical SIR compartmental model. Nodes refer to susceptible (S), infected (or infectious) (I), and ‘removed’ or recovered (R) disease states. Transition parameters β and γ reflect rate of movement between successive disease states over some time scale

The classical Ross–Kermack–McKendrick SIR model represented in Fig. 5 is a sequential model with a single absorbing state (implying permanent removal from the susceptible class, typically by death, or permanent immunity). Although there are a very large number of elaborations of this simple model (e.g., the SIRS model, where removal from the susceptible class is not permanent, as might be expected if immunity to the pathogen is temporary), we focus here on the classical deterministic SIR model since it is very general.

Fig. 5
figure 5

Life cycle graph for a SIR-type disease (modified following Oli et al. 2006), assuming a post-breeding census. Nodes represent susceptible (S), infective (I) and removed (R) disease states. Model parameters are p k = survival probability of individuals in disease state k (k = SI or R), β is the probability that a susceptible individual becomes infective between time (t) and (t + 1), γ = the probability that an infective individual recovers between time (t) and (t + 1), B k is the fertility rate of individuals in disease state k

Classical SIR models: deterministic, continuous time

The SIR model (Fig. 4) in continuous time is traditionally represented by a set of coupled ordinary differential equations:

$$ \begin{aligned} \frac{dS}{dt}&=-\beta{IS}\\ \frac{dI}{dt}&=\beta{IS}-\gamma{I}\\ \frac{dR}{dt}&=\gamma{I} \\ \end{aligned} $$
(11)

Analysis of such models generally focuses on two things: the conditions under which the system described by these equations is at equilibrium, and the sensitivity of this equilibrium (or any other point in plausible state space) to perturbation of one or more parameters. Classical sensitivity analysis uses partial derivatives of model output with respect to parameters in the model—the most common approach is based on finalization analysis, where equations for the partial derivatives of the solution of a particular model with respect to parameters are found by differentiating the model

$$ \frac{\partial}{\partial{t}}\left(\frac{\partial{x}} {\partial{\theta}}\right)=D_xf(x(t);\theta)\frac{\partial{x}} {\partial{\theta}}+D_{\theta}f(x(t);\theta) $$

where ∂x/∂θ is the gradient of x with respect to parameter θ, and D x f and D θ f are the Jacobians of f (model functions) with respect to x and θ, respectively. The stability and local transient dynamics of equilibria (and other fixed points) defined by the system of equations is determined by the eigenspectrum of the Jacobian matrix. See Keeling and Gilligan (2000) and Buzby et al. (2008) for a general discussion, with application to analysis of a vector-transmitted disease system.

Analysis of the equilibrium condition(s) often provide qualitative insights to the conditions necessary for an emerging pathogen to increase in a population of susceptible hosts. For example, if we apply the constraint that N = S + I + R is a constant (which implies that dS/dt + dI/dt + dR/dt = 0), we can look at the dynamics of the system simply by focusing on two of the three variables. If we consider S and I, we see that the invasion of a population that initially consists entirely of susceptible individuals requires that dI/dt > 0, which is clearly possible only if (βS − γ > 0); unless the rate of increase in the number of infective individuals is greater than the rate at which they are removed, then dI/dt cannot be greater than 0.

We can re-arrange this condition as

$$ R_o=\frac{\beta{S}}{\gamma}>1 $$
(12)

Traditionally, R o is known as the reproductive number for the disease (occasionally also referred to as the basic reproduction ratio). We see that βS is the rate at which an infective causes new infections. Since 1/γ is the mean time an individual is infective, then R o is the mean number of new infections caused by a single infective individual. The incidence of the disease will increase if R o > 1, and decline if R o < 1.

The reproductive number R o is fundamental for analysis of disease dynamics. For example, if S max = N (which is the null initial condition where all individuals in the population are susceptible), then dI/dt > 0 if and only if N > γ/β (which is equivalent to R o > 1), indicating there is a minimum population size necessary for the pathogen to invade (i.e., for dI/dt > 0).

In the classical SIR model, the function F = βI models the transition from the susceptible compartment to the infectious compartment. Generally, this function F = βI is referred to as the force of infection.

Classical SIR models: deterministic, discrete time

Compared to continuous time models, discrete time forms of classic epidemiological models have received little attention (see van Boven and Weissing 2004; Oli et al. 2006; Korobeinikov et al. 2008 as notable exceptions). One possible discrete parameterization of the SIR equation, based on a post-breeding census is shown in Fig. 5

Here, we allow for state-specific survival and fertility (production of neonates). This model can be reduced to the equivalent constant N model discussed above, by fixing survival at 1 and fertility to 0.

Following Oli et al. (2006), we can decompose the projection matrix A corresponding to life cycle graph as the sum of a transition matrix T, where the elements (t ij) represent the probability than an individual in state j at time (t) will be in state i at time (t + 1), and a fecundity matrix F, where the elements f ij are the expected number of type i offspring produced at (t + 1) by an individual in state j at time (t); A = T+F (Cochran and Ellner 1992; Caswell 2001). The fundamental matrix N is defined as \({\bf I}+{\bf T}+{\bf T}^2+\dots=({\bf I}-{\bf T})^{-1}\equiv {\bf N}\) where I is an identity matrix with dimension equal to the number of disease states. The fundamental matrix N gives the expected number of time steps in each state. Since the fertility matrix F gives the expected number of offspring produced by each stage per time step, then the matrix R = FN has elements r ij which quantify the expected lifetime production of offspring of type i by an individual in state j. The dominant eigenvalue of R is the net reproduction ratio R o. Further, evaluation of the sensitivity of key population parameters (e.g., λ,R 0) is straightforward in the projection matrix framework.

However, while discrete-time analogues of many disease model parameters are available (see Oli et al. 2006 for a general review, and discussion of important differences in calculating and interpreting several key disease paramors), we note that the discretization of both disease state and time have important implications for the modeling of disease dynamics, and the estimation of model parameters. Most interactions in disease systems are truly continuous (at least temporally)—discrete-time models generally represent approximations to continuous systems, necessitated in many instances by the discrete timing of sampling (or, equivalently, the aggregation of disease data into discrete time frames). The statistical challenge in the estimation of disease parameters concerns the need for methods to accommodate discretized systems, which are typically only partially observable, non-stationary and (generally) nonlinear.

Appendix B. HMM models: a (very) short introduction

A system can be modeled by a HMM if the sequence of hidden states is Markov, and if the sequence of observations are either independent, or Markov, given the hidden state.

A HMM is specified by:

  • Q, the set of possible states \(\{q_1,q_2,\ldots,q_n\}\)

  • O, the output ‘alphabet’ (often referred to as ‘emission’ distributions’); the set of observed ‘event’ states (sensu Pradel 2005); \(\{o_1,o_2,\ldots,o_m\}\)

  • πi, the probability of being in state q i at time t = 0 (i.e., the initial states)

  • \(\varvec{\Upphi}\), the matrix of transition probabilities ϕij, where ϕij = Pr(entering state q j at time \(t+1 \mid \) being in state q i at time t). Assuming the system is Markov, then the state transitions do not depend on the previous states at earlier times.

  • B, the matrix of output probabilities {b j(k)} (i.e., the ‘observed event’ array), where b j(k) = Pr(producing ‘event’ v k at time \(t \mid\) being in state q j at time t)

Formally, we define a HMM model M as

$$ M=(\Upphi,{\bf B},\pi) $$

The key point is that the data consist of observations of the events V, which are mapped to the underlying Markovian transitions between unobserved states Q by the transition matrices \(\varvec{\Upphi}\) and B. A Markov chain has a strict one-to-one mapping between observations and underlying states. This is not a requirement for HMM, where an observation can typically be generated by several different states, and the probabilities of generating an observation given a state differ.

From above, we define Q to be an unobserved fixed state sequence (path through possible states Q) of length T (i.e., T sampling events), and corresponding observations O made at discrete time intervals:

$$ \begin{aligned} Q &= q_1, q_2, \ldots , q_T \\ O &= o_1, o_2, \ldots , o_T \\ \end{aligned} $$

Given O, there are 3 primary questions of interest:

  1. 1.

    What is the probability of the observed ‘event history’ O, given the model M? In other words, we wish to calculate Pr(O|M). This is usually referred to as evaluation.

  2. 2.

    At each time step, what state is most likely? In other words, what is the hidden state sequence Q that was most likely to have produced a given observation sequence O? It is important to note that the sequence of states computed by this criterion might be impossible. Thus more often we are interested in what single sequence of states has the largest probability of occurrence. That is, find the state sequence \(q_1,q_2,\ldots,q_T\) such that \(Pr(o_1,o_2,\ldots,o_T | M)\) is maximized. This is usually referred to as decoding.

  3. 3.

    Given some data, how do we “learn” a good hidden Markov model to describe the data? That is, given the structure of a HMM, and observed event data, how do we parameterize the model which maximizes Pr(O|M)? This is referred to as learning.

Due to the large number of possible sample paths, even for low dimension HMM with small time horizon T, the need for an efficient algorithm to satisfy the objectives (noted above) should be apparent. Consider, for example, the evaluation problem—trying to find the probability of observations \(O=\{o_1,o_2,\ldots,o_T\}\) by means of considering all possible hidden state sequences is clearly impractical (for even simple problems, the number of possible state sequences R is astronomically large). Generally, dynamic programming approaches are used to minimize the computational burden. Forward–backward and Viterbi algorithms are frequently applied to the evaluation and decoding problems, respectively. For the learning problem (estimation of parameters for the HMM), the Baum–Welch algorithm is frequently used (the Baum–Welch algorithm is a generalized expectation–maximization routine, and can compute maximum likelihood estimates and posterior mode estimates for the parameters—both transition and event probabilities—of a HMM, when given only ‘events’ as training data), but analysis with complete data (sensu Schofield and Barker 2008) is certainly also possible. The Baum–Welch and related estimation approaches do not estimate the number of states—that must be specified and fixed. For a more comprehensive introduction to general inference from HMM, see Cappé et al. (2005).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cooch, E.G., Conn, P.B., Ellner, S.P. et al. Disease dynamics in wild populations: modeling and estimation: a review. J Ornithol 152 (Suppl 2), 485–509 (2012). https://doi.org/10.1007/s10336-010-0636-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10336-010-0636-3

Keywords

Navigation