Abstract
I argue for a principled approach to model fitting in mathematical biology that combines statistical and mechanistic insights.
Similar content being viewed by others
References
Andersson H, Britton T (2000) Stochastic epidemic models and their statistical analysis. In: Springer lectures notes in statistics, vol 151. Springer, Berlin
Bailey NTJ (1957) The mathematical theory of epidemics. Griffin, London
Black AJ, McKane AJ (2012) Stochastic formulation of ecological models and their applications. Trends Ecol Evol 27(6):337–345
Brooks S, Gelman A, Jones GL, Meng X-L (eds) (2011) Handbook of Markov chain Monte Carlo. CRC Press, London
Communicable Disease Surveillance Centre (Public Health Laboratory Service) (1978) Influenza in a boarding school. BMJ 1(6112):586–590 (3)
Gilks WR, Richardson S, Spiegelhalter DJ (1995) Markov chain Monte Carlo in practice. Chapman and Hall/CRC, London
Jewell CP, Kypraios T, Neal P, Roberts GO (2009) Bayesian analysis for emerging infectious diseases. Bayesian Anal 4(4):465–496
Murray JD (2002) Mathematical biology I, 3rd edn. Springer, New York
Murray JD (2003) Mathematical biology II, 3rd edn. Springer, New York
O’Neill PD, Roberts GO (1999) Bayesian inference for partially observed stochastic epidemics. J R Stat Soc A 162:121–129
Author information
Authors and Affiliations
Corresponding author
Appendix A: Technical appendix
Appendix A: Technical appendix
1.1 A.1: Individual-based model and likelihood
The underlying stochastic process is the general stochastic epidemic model (Bailey 1957), which consists of two integer-valued non-independent random variables in continuous time \(S(t)\) and \(I(t)\), such that \(S(t) + I(t) \le N\), where the integer \(N\) is the population size. This model has two real-valued parameters, \(\lambda \) and \(\gamma \), which have dimensions of inverse time and determine the rates of processes of the Markov chain:
And so if the current state is \((S,I)\) then the probability densities for the next event being an infection or recovery after time \(t\) are respectively
We will consider the case where observations are a set of times \(T\) and events \( \{e(t) \; | \; t \in T \; \& \; e(t) \in \{1,2\}\}\), so that a likelihood function can be defined as
1.2 A.2: Early diffusion limit
For a population with large size \(N\), the early epidemic prevalence \(I(t) \ll N\) converges \(I(t) \rightarrow Y(t)\), where \(Y(t)\) obeys the stochastic differential equation
where \(\beta := N\lambda \). If we make a series of observations \(\{y_m\}\) of infectious prevalence at times \(\{t_m\}\), then we can approximate the likelihood using a Gaussian process:
1.3 A.3: Deterministic limit
In the limit of large \(N\) (or more strictly \(I(t) \gg 1\)) the stochastic process (1) converges on the well-known SIR equations
where
If one approached data of the kind discussed in Sect. A.1 with the Eq. (6), then an alternative to likelihood-based fitting would be to employ a ‘least squares’ approach, and choose parameters to minimise the function
Rights and permissions
About this article
Cite this article
House, T. For principled model fitting in mathematical biology. J. Math. Biol. 70, 1007–1013 (2015). https://doi.org/10.1007/s00285-014-0787-6
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00285-014-0787-6