For principled model fitting in mathematical biology

House, Thomas

doi:10.1007/s00285-014-0787-6

For principled model fitting in mathematical biology

Published: 04 May 2014

Volume 70, pages 1007–1013, (2015)
Cite this article

Journal of Mathematical Biology Aims and scope Submit manuscript

Thomas House¹

614 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

I argue for a principled approach to model fitting in mathematical biology that combines statistical and mechanistic insights.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

Evaluating significance in linear mixed-effects models in R

Article 12 September 2016

The math is not the territory: navigating the free energy principle

Article 02 June 2021

References

Andersson H, Britton T (2000) Stochastic epidemic models and their statistical analysis. In: Springer lectures notes in statistics, vol 151. Springer, Berlin
Bailey NTJ (1957) The mathematical theory of epidemics. Griffin, London
Google Scholar
Black AJ, McKane AJ (2012) Stochastic formulation of ecological models and their applications. Trends Ecol Evol 27(6):337–345
Article Google Scholar
Brooks S, Gelman A, Jones GL, Meng X-L (eds) (2011) Handbook of Markov chain Monte Carlo. CRC Press, London
MATH Google Scholar
Communicable Disease Surveillance Centre (Public Health Laboratory Service) (1978) Influenza in a boarding school. BMJ 1(6112):586–590 (3)
Gilks WR, Richardson S, Spiegelhalter DJ (1995) Markov chain Monte Carlo in practice. Chapman and Hall/CRC, London
Google Scholar
Jewell CP, Kypraios T, Neal P, Roberts GO (2009) Bayesian analysis for emerging infectious diseases. Bayesian Anal 4(4):465–496
Article MathSciNet Google Scholar
Murray JD (2002) Mathematical biology I, 3rd edn. Springer, New York
Google Scholar
Murray JD (2003) Mathematical biology II, 3rd edn. Springer, New York
Google Scholar
O’Neill PD, Roberts GO (1999) Bayesian inference for partially observed stochastic epidemics. J R Stat Soc A 162:121–129
Article Google Scholar

Download references

Author information

Authors and Affiliations

Warwick Mathematics Institute, University of Warwick, Coventry, CV4 7AL, UK
Thomas House

Authors

Thomas House
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thomas House.

Appendix A: Technical appendix

1.1 A.1: Individual-based model and likelihood

The underlying stochastic process is the general stochastic epidemic model (Bailey 1957), which consists of two integer-valued non-independent random variables in continuous time $S(t)$ and $I(t)$, such that $S(t) + I(t) \le N$, where the integer $N$ is the population size. This model has two real-valued parameters, $\lambda $ and $\gamma $, which have dimensions of inverse time and determine the rates of processes of the Markov chain:

$$\begin{aligned} (S,I) \rightarrow (S-1,I+1)\,\, \text { at rate }\,\, \lambda S I,\quad (S,I) \rightarrow (S,I-1) \,\,\text { at rate }\,\, \gamma I. \end{aligned}$$

(1)

And so if the current state is $(S,I)$ then the probability densities for the next event being an infection or recovery after time $t$ are respectively

$$\begin{aligned} \rho _1(t) = \lambda S I \mathrm{e}^{-(\lambda S + \gamma )I t},\quad \rho _2(t) = \gamma I \mathrm{e}^{-(\lambda S + \gamma )I t}. \end{aligned}$$

(2)

We will consider the case where observations are a set of times $T$ and events $ \{e(t) \; | \; t \in T \; \& \; e(t) \in \{1,2\}\}$, so that a likelihood function can be defined as

$$\begin{aligned} \mathcal {L}(\beta ,\gamma ) = \prod _{t\in T} \rho _{e(t)}(t). \end{aligned}$$

(3)

1.2 A.2: Early diffusion limit

For a population with large size $N$, the early epidemic prevalence $I(t) \ll N$ converges $I(t) \rightarrow Y(t)$, where $Y(t)$ obeys the stochastic differential equation

$$\begin{aligned} \frac{\mathrm {d}Y}{\mathrm {d}t} = (\beta - \gamma ) Y + \left( \beta ^2 + \gamma ^2\right) ^{1/2} Y \xi , \end{aligned}$$

(4)

where $\beta := N\lambda $. If we make a series of observations $\{y_m\}$ of infectious prevalence at times $\{t_m\}$, then we can approximate the likelihood using a Gaussian process:

$$\begin{aligned}&\displaystyle \mathcal {L}(\beta ,\gamma ) = \prod _m \mathbb {P}[Y(t_{m+1}) =y_{m+1} | Y(t_m)=y_m], \nonumber \\&\displaystyle \mathbb {P}[Y(t+\delta ) =y' | Y(t)=y] \approx \mathcal {N}\bigg (y' \bigg | \mu = y \mathrm{e}^{(\beta - \gamma ) \delta } , \; \sigma = \left( \beta ^2 + \gamma ^2\right) ^{1/2}\mu \delta \bigg ), \nonumber \\&\displaystyle \mathcal {N}(x | \mu , \sigma ) := \frac{1}{(2\pi )^{1/2} \sigma } \mathrm{e}^{-(x-\mu )^2/(2\sigma ^2)}. \end{aligned}$$

(5)

1.3 A.3: Deterministic limit

In the limit of large $N$ (or more strictly $I(t) \gg 1$) the stochastic process (1) converges on the well-known SIR equations

$$\begin{aligned} \frac{\mathrm {d}s}{\mathrm {d}t} = -\beta s i, \quad \frac{\mathrm {d}i}{\mathrm {d}t} = \beta s i - \gamma i, \end{aligned}$$

(6)

where

$$\begin{aligned} s(t):= \frac{1}{N} \mathbb {E}[S(t)],\quad i(t):= \frac{1}{N} \mathbb {E}[I(t)]. \end{aligned}$$

(7)

If one approached data of the kind discussed in Sect. A.1 with the Eq. (6), then an alternative to likelihood-based fitting would be to employ a ‘least squares’ approach, and choose parameters to minimise the function

$$\begin{aligned} \mathcal {E}(\beta , \gamma , i_0) = \sum _{t\in T} \left( I(t) - i(t; \beta , \gamma , i_0)\right) ^2. \end{aligned}$$

(8)

Rights and permissions

Reprints and permissions

About this article

Cite this article

House, T. For principled model fitting in mathematical biology. J. Math. Biol. 70, 1007–1013 (2015). https://doi.org/10.1007/s00285-014-0787-6

Download citation

Received: 28 February 2013
Revised: 14 April 2014
Published: 04 May 2014
Issue Date: April 2015
DOI: https://doi.org/10.1007/s00285-014-0787-6

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

For principled model fitting in mathematical biology

Abstract

Access this article

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Evaluating significance in linear mixed-effects models in R

The math is not the territory: navigating the free energy principle

References

Author information

Authors and Affiliations

Corresponding author

Appendix A: Technical appendix

1.1 A.1: Individual-based model and likelihood

1.2 A.2: Early diffusion limit

1.3 A.3: Deterministic limit

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

For principled model fitting in mathematical biology

Abstract

Access this article

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Evaluating significance in linear mixed-effects models in R

The math is not the territory: navigating the free energy principle

References

Author information

Authors and Affiliations

Corresponding author

Appendix A: Technical appendix

Appendix A: Technical appendix

1.1 A.1: Individual-based model and likelihood

1.2 A.2: Early diffusion limit

1.3 A.3: Deterministic limit

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation