Abstract
Two major forces shaping evolution are drift and selection. The standard models of neutral drift—the Wright–Fisher (WF) and Moran processes—can be extended to include selection. However, these standard models are not always applicable in practice, and—even without selection—many other drift models make very different predictions. For example, “generalised Wright–Fisher” models (so-called because their first two conditional moments agree with those of the WF process) can yield wildly different absorption times from WF. Additionally, evolutionary stability in finite populations depends only on fixation probabilities, which can be evaluated under less restrictive assumptions than those required to estimate fixation times or more complex population-genetic quantities. We therefore distill the notion of a selection process into a broad class of finite-population, mutationless models of drift and selection (including the WF and Moran processes). We characterize when selection favours fixation of one strategy over another, for any selection process, which allows us to derive finite-population conditions for evolutionary stability independent of the selection process. In applications, the precise details of the selection process are seldom known, yet by exploiting these new theoretical results it is now possible to make rigorously justifiable inferences about fixation of traits.
Similar content being viewed by others
Notes
Fitnesses need not be defined for \(i=0\) or N, as in these extremes the population is homogeneous and there is no variability in fitness.
An invading strategy may appear in the population by immigration, mutation, or (in the case of cultural traits) innovation.
We assume here that the fitnesses \(\overline{W}_{A}\left( {j}\right) \) and \(\overline{W}_{B}\left( {j}\right) \) are positive for \(1\le j\le N-1\).
In the original version of the EW process (Eldon and Wakeley 2006), the fitnesses of both types were equal, and the parent agent was guaranteed to survive to the next generation. Der et al. (2012) generalized the original model to types with different fitnesses, but in their version of the EW process, it is possible for the parent to be chosen for replacement. Here, we reformulate Der et al’s extended model while retaining the original condition that the reproducing agent cannot be chosen for replacement. In contrast to our version of the EW process, setting \(U=2\) in Der et al’s version yields the Moran process exactly.
Taking \(\{\nu _{k}\}_{k=1}^{N}\) to be exchangeable multinomial variables yields the neutral WF process (Cannings 1974).
This occurs when the two types effectively make up two sub-populations that reproduce and evolve independently.
Importantly, note that assuming equal mean numbers of offspring in Lessard and Ladret’s model, \(\mu _{A}\left( i\right) \,\)=\(\,\mu _{B}\left( i\right) =1\), does not recover Cannings’ original (neutral) model.
References
Acerbi A, Bentley RA (2014) Biases in cultural transmission shape the turnover of popular traits. Evol Hum Behav 35(3):228–236. doi:10.1016/j.evolhumbehav.2014.02.003
Alexander H, Wahl L (2008) Fixation probabilities depend on life history: fecundity, generation time and survival in a burst-death model. Evolution 62(7):1600–1609. doi:10.1111/j.1558-5646.2008.00396.x
Allen B, Nowak MA, Dieckmann U (2013) Adaptive dynamics with interaction structure. Am Nat 181(6):E139–E163
Aoki K, Lehmann L, Feldman MW (2011) Rates of cultural change and patterns of cultural accumulation in stochastic models of social transmission. Theor Popul Biol 79(4):192–202. doi:10.1016/j.tpb.2011.02.001
Bentley RA, Hahn MW, Shennan SJ (2004) Random drift and culture change. Proc R Soc Lond B: Biol Sci 271(1547):1443–1450
Cannings C (1974) The latent roots of certain markov chains arising in genetics: a new approach. I. Haploid models. Adv Appl Probab 6:260–290
Charlesworth B (2009) Effective population size and patterns of molecular evolution and variation. Nat Rev Genet 10(3):195–205
Chia A, Watterson G (1969) Demographic effects on the rate of genetic evolution: I. Constant size populations with two genotypes. J Appl Probab 6:231–248
Chung KL (1967) Markov chains with stationary transition probabilities. Springer, Berlin
Der R (2010) A theory of generalised population processes. ProQuest, Philadelphia
Der R, Epstein CL, Plotkin JB (2012) Dynamics of neutral and selected alleles when the offspring distribution is skewed. Genetics 191(4):1331–1344
Der R, Epstein CL, Plotkin JB (2011) Generalized population models and the nature of genetic drift. Theor Popul Biol 80(2):80–99
Durrett R (2008) Probability models for DNA sequence evolution. Springer, Berlin
Eldon B, Wakeley J (2006) Coalescent processes when the distribution of offspring number among individuals is highly skewed. Genetics 172(4):2621–2633
Eldon B, Wakeley J (2008) Linkage disequilibrium under skewed offspring distribution among individuals in a population. Genetics 178(3):1517–1532
Eldon B, Wakeley J (2009) Coalescence times and fst under a skewed offspring distribution among individuals in a population. Genetics 181(2):615–629
Ewens WJ (2012) Mathematical population genetics 1: theoretical introduction, volume 27 of interdisciplinary applied mathematics. Springer, Berlin
Feller W (1968) An introduction to probability theory and its applications: vol 3. Wiley, New York
Fisher RA (1930) The genetical theory of natural selection: a complete, variorum edn. Oxford University Press, Oxford
Haldane JBS (1932) A mathematical theory of natural and artificial selection. Part ix. Rapid selection. In: Mathematical Proceedings of the Cambridge Philosophical Society, vol 28, pp 244–248. Cambridge University Press, Cambridge
Hartl DL, Clark AG (2007) Principles of population genetics, 4th edn. Sinauer associates, Sunderland. ISBN 9780878933082
Hedgecock D (1994) Does variance in reproductive success limit effective population sizes of marine organisms. In: Beaumont AR (ed) Genetics and evolution of aquatic organisms. Chapman and Hall, London, pp 122–134
Hofbauer J, Sigmund K (1998) Evolutionary games and population dynamics. Cambridge University Press, Cambridge
Huillet T, Möhle M (2011) Population genetics models with skewed fertilities: a forward and backward analysis. Stoch Model 27(3):521–554
Imhof LA, Nowak MA (2006) Evolutionary game dynamics in a Wright-Fisher process. J Math Biol 52(5):667–681
Karlin S, McGregor J (1964) Direct product branching processes and related markov chains. PNAS 51(4):598
Karlin S, Taylor HM (1975) A first course in stochastic processes, 2nd edn. Academic Press, New York
Kurokawa S, Ihara Y (2009) Emergence of cooperation in public goods games. Proc R Soc Lond B: Biol Sci 276(1660):1379–1384
Lessard S (2005) Long-term stability from fixation probabilities in finite populations: new perspectives for ESS theory. Theor Popul Biol 68(1):19–27
Lessard S, Ladret V (2007) The probability of fixation of a single mutant in an exchangeable selection model. J Math Biol 54(5):721–744
Maynard Smith J (1982) Evolution of theory games. Cambridge University Press, Cambridge
Metz JA, Geritz SA, Meszéna G, Jacobs FJ, Van Heerwaarden JS et al (1996) Adaptive dynamics, a geometrical study of the consequences of nearly faithful reproduction. Stoch Spat Struct Dyn Syst 45:183–231
Molina C, Earn DJD (in prep.) Evolutionarily stability in symmetric games in finite populations
Moran PAP (1962) The statistical processes of evolutionary theory. Clarendon Press, Oxford
Nowak MA (2006) Evolutionary dynamics: exploring the equations of life. Harvard University Press, Cambridge
Nowak MA, Sasaki A, Taylor C, Fudenberg D (2004) Emergence of cooperation and evolutionary stability in finite populations. Nature 428(6983):646–650
Ohtsuki H (2010) Stochastic evolutionary dynamics of bimatrix games. J Theor Biol 264(1):136–142
Ohtsuki H, Hauert C, Lieberman E, Nowak MA (2006) A simple rule for the evolution of cooperation on graphs and social networks. Nature 441(7092):502–505
Patwa Z, Wahl LM (2008) The fixation probability of beneficial mutations. J R Soc Interface 5(28):1279–1289
Pitman J (1999) Coalescents with multiple collisions. Ann Probab 27(4):1870–1902
Proulx S, Day T (2002) What can invasion analyses tell us about evolution under stochasticity in finite populations? Selection 2(1–2):2–15
Proulx SR (2000) The ESS under spatial variation with applications to sex allocation. Theor Popul Biol 58(1):33–47
Ridley M (2003) Evolution. Wiley, New York. ISBN 9781405103459
Sagitov S et al (1999) The general coalescent with asynchronous mergers of ancestral lines. J Appl Probab 36(4):1116–1125
Sargsyan O, Wakeley J (2008) A coalescent process with simultaneous multiple mergers for approximating the gene genealogies of many marine organisms. Theor Popul Biol 74(1):104–114
Schweinsberg J (2003) Coalescent processes obtained from supercritical Galton-Watson processes. Stoch Process Appl 106(1):107–139
Stewart AJ, Plotkin JB (2013) From extortion to generosity, evolution in the iterated prisoners dilemma. PNAS 110(38):15348–15353
Tarnita CE, Ohtsuki H, Antal T, Fu F, Nowak MA (2009) Strategy selection in structured populations. J Theor Biol 259(3):570–581
Tarnita CE, Wage N, Nowak MA (2011) Multiple strategies in structured populations. PNAS 108(6):2334–2337
Wild G, Taylor PD (2004) Fitness and evolutionary stability in game theoretic models of finite populations. Proc R Soc Lond B: Biol Sci 271(1555):2345–2349
Wright S (1931) Evolution in mendelian populations. Genetics 16(2):97
Wu B, Gokhale CS, Veelen M, Wang L, Traulsen A (2013) Interpretations arising from Wrightian and Malthusian fitness under strong frequency dependent selection. Ecol Evol 3(5):1276–1280
Acknowledgements
DE was supported by NSERC. CM was supported by an Ontario Trillium Scholarship and the McMaster University Department of Mathematics and Statistics. We are grateful to Erol Akçay, Ben Bolker, Sarah Otto and Joshua Plotkin for valuable discussions and comments.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendices
Appendix
A Proofs
1.1 A.1 Proof of Proposition 2.7
If \(X(0) = 0\) or \(X(0) = N\), nothing remains to be shown.
Let \(C=\{1,2,\ldots ,N-1\}\) and consider \(i\in C\). Suppose, in order to derive a contradiction, that the absorption probability starting from state i is
Then,
If X(t) takes values in C for all times \(t\ge 0\), then since C is finite, at least one index j, \(1\le j\le N-1\) is visited infinitely often, that is, for some j, \(1\le j\le N-1\),
Now note that H1 implies that C is a set of inessential, and therefore nonrecurrent states (see Appendix D.2 in the ESM and Theorem I.4.4 of Chung 1967), which cannot be visited infinitely often (Theorem I.4.3 of Chung 1967), contradicting inequality (26). \(\square \)
1.2 A.2 Proof of Proposition 4.1
Define the random variable \(T_A=\left. \min \{t\,|\, X(t) = N\}\right. \), that is, \(T_A\) is the fixation time of A (\(T_A=\infty \) if A never fixes). Similarly, let \(T_B=\left. \min \{t\,|\, X(t) = 0\}\right. \) be the fixation time of B. Both \(T_A\) and \(T_B\) are stopping times (see Definition S3), and hence the absorption time \(T_\mathrm{abs}= \min \{T_A,T_B\}\) is also a stopping time (Karlin and Taylor 1975, p. 256).
Since either A or B must fix (Proposition 2.7),
so
and
For any t, we have \(0\le X(t)\le N\), so it follows that for any stopping time T,
Thus, since X(t) is a martingale and \(T_\mathrm{abs}\) is a stopping time satisfying Eqs. (27) and (30) the optional stopping theorem (theorem S5) implies that
and hence
\(\square \)
Remark A.1
Feller (1968, p.399) gives an alternative proof of Proposition 4.1 that does not rely on the optional stopping theorem.
1.3 A.3 Proof of Lemma 4.4
Observe that X(t) is a non-negative supermartingale (Definition S2). Thus, for any stopping time S, with \({{\mathrm{\mathrm Pr}}}(S<\infty ) = 1\), a version of the optional stopping theorem for supermartingales (theorem S6) states that
Using a constant stopping time \(S=\tau \ge 0\), inequality (33) gives
Letting \(T_\mathrm{abs}\) be the absorption time for the system, by Proposition 2.7 we can apply inequality (33) and Eq. (29) to show that for any initial state \(X(0) = i\) for (\(0\le i\le N\)) the fixation probability of A satisfies
so \(p_\mathrm{fix}\left( {i}\right) \le i/N\), and the fixation probability of B is \(1-p_\mathrm{fix}\left( {i}\right) \ge (N-i)/N\).
Similarly, if we use H1 as well then
so \(p_\mathrm{fix}\left( {\hat{\imath }}\right) < \hat{\imath }/N\).
Denoting the probability of reaching state j at time \(\tau \ge 0\) starting from state \(X(0) = i\) by
we have \(P_{{i},{j}}^{(\tau )}= (P^\tau )_{i,j}\).
If i leads to \(\hat{\imath }\), then for some time \(\tau \ge 0\), the probability of reaching state \(\hat{\imath }\) from state i is nonzero, \(P_{{i},{\hat{\imath }}}^{(\tau )} >0\). Conditioning on the state arrived at in the \(\tau \)-th time-step, we have
Using Eq. (34), we obtain
and the probability of B fixing is \(1-p_\mathrm{fix}\left( {i}\right) >\frac{N-i}{N}\). \(\square \)
1.4 A.4 Proof of Lemma 4.6
Suppose, in order to derive a contradiction, that inequality (16) does not hold: for all states \(1\le j\le N-1\)
Then for any selection process, from Lemma 4.4 (with the roles of A and B reversed), \(p_\mathrm{fix}\left( {i}\right) \ge i/N\), contradicting inequality (14). Thus inequality (16) holds.
Now suppose, in order to derive a contradiction, that inequality (15) does not hold: there exists a state \(\hat{\jmath }\) for which
We will construct a transition matrix P for a selection process \({\mathscr {P}}\) (consistent with the fitnesses \(\overline{W}_{A}\left( {j}\right) \) and \(\overline{W}_{B}\left( {j}\right) \), \(1\le j\le N-1\)) such that \(p_\mathrm{fix}\left( {i}\right) \ge i/N\), which contradicts inequality (14) holding for all selection processes.
To find such a selection process \({\mathscr {P}}\), we can restrict attention to processes with the property that at any time and any mixed-type state, the number of individuals of type A must change by exactly 1. Thus, for any mixed-type state k, \(P_{{j},{k}}\ne 0\) if and only if \(j=k\pm 1\). The matrix P then defines a “birth-death” process, for which the fixation probabilities starting from state \(X(0) =i\) satisfy (see Appendix C):
Let \({\mathcal A_{>}}\), \({\mathcal A_{<}}\) and \({\mathcal A_{=}}\) be the sets of states in which the expected fitness of individuals of type A is higher than, lower than or equal to that of B individuals (respectively). Note that \(\hat{\jmath }\in {\mathcal A_{>}}\) and \(\hat{\imath }\in {\mathcal A_{<}}\). We then specify the ratios of the non-vanishing transition probabilities by
where \(r_+\) and \(r_-\) are constants—independent of j—that satisfy \(0<r_+<1<r_-\).
Observe that P defines a mixed-irreducible selection process \({\mathscr {P}}\):
-
If \(X(t)=j\), then
$$\begin{aligned} \left. {{\mathrm{\mathbb {E}}}}\big (X(t+1)\,|\,X(t) = j\big )\right. = (j+1)P_{{j},{j+1}} + (j-1)P_{{j},{j-1}} = j +P_{{j},{j+1}} - P_{{j},{j-1}},\nonumber \\ \end{aligned}$$(44)so H1 is satisfied.
-
As for the Moran process [see Eq. (54a)], for any j and k such that \(1\le j \le N-1\), \(0\le k \le N\) and \(j\ne k\), there is a positive probability of transitioning from state j to state k in \(d=|j-k|\) steps: setting \(\sigma = {{\mathrm{\mathrm sign}}}(k-j)\), we have
$$\begin{aligned} \left. {{\mathrm{\mathrm Pr}}}\big (X(t+d) = k\,|\, X(t) = j\big )\right.&= \prod _{m=1}^{d}P_{{(j+\sigma (m-1))},{(j+\sigma m)}}>0,\end{aligned}$$(45a)$$\begin{aligned} \left. {{\mathrm{\mathrm Pr}}}\big (X(t+2) = j\,|\, X(t)=j \big )\right.&= P_{{j},{j+1}}P_{{j+1},{j}}+ P_{{j},{j-1}}P_{{j-1},{j}}>0, \end{aligned}$$(45b)so all states can be reached from state \(X(t)=i\) in finite time. Thus, P is mixed-irreducible. Moreover, the probability of B fixing at a future time \(t+\tau \) (\(\tau \ge 0\)) is positive, so H2 is satisfied.
-
The states 0 and N are absorbing, so H3 is trivially satisfied.
For \(1\le j\le N-1\), we define the number of states k (\(1\le k\le j\)) in which the expected fitness of A individuals is higher than that of B individuals,
and similarly,
Lastly, let \(a_+\) be the smallest number of individuals of type A in the population for which type A’s expected fitness is higher than type B’s, that is,
Note that \(a_+\le \hat{\jmath }<N\), and that \({\alpha _+}(j) =0\) for all \(j<a_+\).
From Eq. (42), the fixation probability \(p_\mathrm{fix}\left( {i}\right) \) is a rational function of \(r_+\) and \(r_-\),
and is continuous because the denominator is positive for any \(r_-,r_+>0\).
If \(i\ge a_+\), then \(p_\mathrm{fix}\left( {i}\right) \rightarrow 1\) as \(r_+\rightarrow 0\). If \(i<a_+\), then
It is thus possible to choose \(r_-\) sufficiently close to 1 and \(r_+\) sufficiently close to 0 to ensure that \(p_\mathrm{fix}\left( {i}\right) >i/N\), which completes the proof. \(\square \)
B Examples
1.1 B.1 The Moran process
If the population evolves according to the Moran process (Moran 1962; Hartl and Clark 2007; Ewens 2012), then exactly one agent is replaced at each time step. In detail, at each time step:
-
An agent is chosen for death, with equal probability for all agents;
-
An agent is chosen for reproduction, with probability proportional to its fitnessFootnote 4;
-
The agent chosen for death is replaced with a clone of the agent chosen for reproduction.
Note that sampling of agents is done with replacement, so that an agent can be chosen for both death and reproduction (in which case the population remains unchanged).
When the population consists of i mutants (individuals of type A) and \(N-i\) residents (individuals of type B), the probabilities of choosing a mutant or a resident for death are i / N and \((N-i)/N\), respectively. The probabilities of choosing a mutant or a resident for reproduction are
and
Because the death and reproduction events are independent, the transition probabilities are simply
and (since at each time step at most one individual is replaced)
Lastly, \(P_{{0},{0}} = P_{{N},{N}} = 1\) and \(P_{{0},{i}} = P_{{N},{N-i}} =0\) for all \(1\le i\le N\) (the states where the resident or mutant have fixed are absorbing, so H3 is trivially satisfied).
For any \(1\le i \le N-1\), if \(X(t) = i\), we have
The expected number of individuals of type A (respectively B) in the next time-step is larger than in the current time-step, if and only if \(\overline{W}_{A}\left( {i}\right) >\overline{W}_{B}\left( {i}\right) \) (respectively \(\overline{W}_{B}\left( {i}\right) >\overline{W}_{A}\left( {i}\right) \)), so H1 is satisfied.
To see that H2 is satisfied, and moreover, that P defines a mixed-irreducible selection process, consider i and j such that \(1\le i \le N-1\), \(0\le j \le N\) and \(j\ne i\), and observe that there is a positive probability of changing from state i to state j in \(d=|j-i|\) steps: setting \(\sigma = {{\mathrm{\mathrm sign}}}(j-i)\), we have
so all states can be reached from state \(X(t)=i\) in finite time, and in particular, the probability of B fixing at a future time \(t+\tau \) (\(\tau \ge 0\)) is positive.
If neither type has a selective advantage over the other, regardless of their frequencies in the population, then for all \(1\le i \le N-1\), \(\overline{W}_{A}\left( {i}\right) = \overline{W}_{B}\left( {i}\right) \), so from Eq. (53), \({{\mathrm{\mathbb {E}}}}\big (X(t+1)\big ) = X(t)\), and P defines a neutral drift process.
1.2 B.2 The Wright–Fisher process
If the population evolves according to the Wright–Fisher process (Hartl and Clark 2007; Ewens 2012) then all individuals are replaced at each time step (generations do not overlap). At each time step, the entire population of N individuals is replaced by a new generation constructed using binomial sampling: in each of the N Bernoulli trials, the probability of drawing any type represented in the current generation is proportional to its present mean fitness and to the present number of individuals of that type. Thus, the probability that an individual in the next generation will be of type A is
and
where \(P_{{0},{0}}=P_{{N},{N}}=1\) (so the states \(X=0\) and \(X=N\) are absorbing and H3 is satisfied). Note that if A is not present at some time \(\tau \), B has fixed and the population remains in state \(X(t)=0\) for all \(t\ge \tau \), and similarly if B is not present at some time \(\tau \), then \(X(t)=N\) for all \(t\ge \tau \).
The mean of a binomial random variable defined by n trials with success probability p is np, so for any \(0\le i \le N\), we have
so H1 is satisfied. H2 is trivially satisfied because for any \(1\le i \le N-1\), \(P_{{i},{0}}>0\). Thus, P defines a selection process, which is, moreover, mixed-irreducible, because for any \(1\le i \le N-1\), \(P_{{i},{j}}>0\) also for any \(1\le j\le N-1\).
If neither type has a selective advantage over the other, \(\overline{W}_{A}\left( {i}\right) = \overline{W}_{B}\left( {i}\right) \) for all \(1\le i \le N-1\), and Eq. (57) becomes \(\left. {{\mathrm{\mathbb {E}}}}\big (X(t+1)\,|\, X(t)=i\big )\right. = i=X(t)\), so X(t) is a neutral drift process.
1.3 B.3 The Eldon–Wakeley process with viability selection
The Eldon–Wakeley (EW) process (Eldon and Wakeley 2006; Der et al. 2012) is a variation on the neutral Moran process that allows for a skewed (rather than uniform) offspring distribution. It has been used to interpret genetic data from Pacific Oysters (Eldon and Wakeley 2006; Der et al. 2012).
The EW process describes neutral drift in a population of constant size N, consisting of two types, A and B. At each time step, a single agent is randomly drawn from the population with uniform probability, and produces a random number of offspring \(U-1\). The parent agent survives to the next generation and its \(U-1\) offspring replace \(U-1\) randomly chosen members of the remainder of the population. In the special case that exactly one offspring is always produced, i.e. \({{\mathrm{\mathrm Pr}}}(U=2)=1\), the EW process is similar (but not identical) to the classical Moran process (Ewens 2012; Moran 1962): in both processes, the parent always produces one offspring, which increases the number of individuals of the parent’s type in the next generation iff the agent chosen to be replaced is not of the parent’s type. In the EW process, the parent is guaranteed to survive, and one additional offspring replaces another randomly chosen member of the population, so if there are i agents of type A, the probability that the population state remains the same is
By contrast, in the Moran process, this probability is given by \(\frac{i^2 +(N-i)^2}{N^2}\) [see Eq. (52c)]. Thus, whenever the population is in a mixed-type state (i.e. \(1\le i\le N-1\)), the probability that the population state remains unchanged is larger for the Moran model than for the EW model. However, for both models, the probability of increase in type A is the same as the probability of increase in type B (this probability does depend on the population composition). Thus, in effect, the neutral (i.e. selectionless) EW process with \(U=2\) is a slightly “sped up” version of the neutral Moran process, with fewer time-steps in which the population state is unchanged.Footnote 5
Letting \(X(t)=i\) be the number of individuals of type A at some time \(t\ge 0\), then the probabilities that an agent of type A and B are chosen for reproduction are i / N and \((N-i)/N\), respectively. If an agent of type A is chosen for reproduction and produces \(U-1=u-1\) offspring, then the number of B agents chosen for replacement is hypergeometrically distributed with sample size \(N-1\), initial configuration \(N-i\) and \(u-1\) draws (Der et al. 2012), so the probability of k agents of type B (\(0\le k\le u-1\)) being replaced by agents of type A is
which has mean \((u-1)\frac{N-i}{N-1}\). Similarly, the mean number of agents of type A to be replaced, given that a B agent is chosen for reproduction and produces \(u-1\) offspring is \((u-1)\frac{i}{N-1}\). Thus, by the law of total expectation (theorem S1, conditioning on the type of agent chosen for reproduction), the expected number of individuals of type A in the next generation, given their present number, is:
Der et al. (2012) have generalized the neutral EW process (Eldon and Wakeley 2006) by adding a deterministic “viability selection” step: for \(s\in \mathbb {R}\), given the population state X(t) at time t, an intermediate, pre-selection offspring population state at time \(t+1\) is generated according to the EW model without selection (described above). The population state \(X(t+1)\) at time \(t+1\) is then obtained by transforming the pre-selection offspring state according to standard (deterministic) logistic growth:
where \(\lfloor {x}\rfloor \) is the largest integer smaller than x. This corresponds to selection acting on the offspring before reaching reproductive age (X(t) represents the state of the reproductively-mature population).
Now observe that for any \(1\le i\le N-1\), if \(s>0\) then
if \(s<0\)
and if \(s=0\), \(v\left( {i}\right) = i\) (so the original EW process is recovered). Note also that because \(\frac{(1+s/N)i}{(1+s/N)i+ (N-i)}N<N\), fixation cannot occur in the selection step.
For any s, the selection step and neutral EW process above define a Markov process. Equations (60) and (62) imply that H1 is satisfied for this Markov process.
To verify H2 for any \(s\ge 0\), choose any i (\(1\le i\le N-1\)) and \(u\ge 2\) such that \({{\mathrm{\mathrm Pr}}}(U=u)=p_u>0\) (such u must exist because otherwise no offspring are ever created). The probability of an individual of type A reproducing is \(\frac{i}{N}\). Using Eq. (59), the probability of increasing the number of As in the population given that an individual of type A reproduces and that \(U=u\) is
and \(p_+(i)>0\) because \(i<N\). Hence, the probability of increasing the number of agents of type A in the population in the next generation is no less than
[recall that the selection step cannot decrease the number of As in the population; see inequality (62)]. Now, starting from state i, if the number of agents of type A in the population is increased at each step, fixation of A is attained in at most \(N-i\) steps. Since the probability of increasing the number of A’s in the population is positive for \(1\le i<N\), the probability of A fixing in i steps is positive,
Verifying H2 for \(s<0\) is similar.
As in Appendices B.1 and B.2, H3 is satisfied because there is no mutation, and consequently the EW process with viability selection defines a selection process.
Note that Eq. (60) implies that in the absence of selection, the EW process is a neutral drift process. Moreover, a similar method to that used in Appendix B.1 shows that the EW process without selection is mixed-irreducible.
1.4 B.4 Generalized Wright–Fisher models
Generalized Wright–Fisher (GWF) models are “a broad class of forward-time population models that share the same mean and variance of the Wright–Fisher model, but may otherwise differ”(Der et al. 2011). GWF models can allow for selection and mutation, but the general construction builds on pure-drift GWF models.
Mathematically, a pure-drift GWF model is Markov processes X(t) such that
If \(\sigma ^2= 0\) then \( {{\mathrm{\mathrm Var}}}\left( X(t+1)\,|\,X(t) = i\right) = 0\) and the transition matrix for the corresponding Markov process is the identity matrix; this case, in which the population state never changes, is biologically absurd, so \(\sigma ^2> 0\) is assumed hereafter.
Pure-drift GWF models are neutral drift processes (Definition 2.9):
-
Neutrality: Equation (8) is satisfied by assumption [Eq. (67a)], i.e. neither type is expected to increase in frequency from one time-step to the next.
- H2: :
-
This hypothesis stipulates that starting from a mixed-type state i (\(0<i<N\)), the fixation of at least one the types (A or B) must be possible.
Equation (67b) implies that for any mixed-type state \(X(t)=i\notin \{0,N\}\),
$$\begin{aligned} {{\mathrm{\mathrm Var}}}\left( X(t+1)\,|\,X(t) = i\right) >0. \end{aligned}$$(68)From Eq. (67a), it follows that
$$\begin{aligned} {{\mathrm{\mathrm Pr}}}\left( X(t+1)<i\,|\,X(t) = i)\right) >0, \end{aligned}$$(69)Thus, starting at any mixed-type state i (\(0<i<N\)), it is possible to reach the state 0 in i or fewer steps in which A decreases in frequency, each of which occurs with positive probability, so A can fix with positive probability, and H2 holds. A similar argument shows that B can also fix with positive probability.
- H3: :
-
This hypothesis stipulates that the states at which the population is composed only of one type (A or B) are absorbing. Pure-drift GWF processes satisfy H3, because if \(X(t) = 0\) then from Eq. (67b), \({{\mathrm{\mathrm Var}}}\left( X(t+1)\,|\,X(t) =0\right) = 0\) so from Eq. (67a), if \(X(t)=0\), then
$$\begin{aligned} X(t+1)= {{\mathrm{\mathbb {E}}}}\left( X(t+1)\,|\,X(t) = 0\right) =X(t)= 0, \end{aligned}$$(70)and similarly, if \(X(t)=N\), then \(X(t+1) = N\).
For a population of size N, pure selection (i.e. mutationless) GWF models are constructed by modifying a pure-drift GWF modelFootnote 6: starting with a pure-drift process with transition matrix \(Q^{(N)}\) selection is represented by a second \((N+1)\times (N+1)\) row-stochastic matrix \(S^{(N)}\), and the transition matrix for the pure selection process is defined by \(S^{(N)} Q^{(N)}\). When choosing how to construct selection matrices \(S^{(N)}\), the only requirement is that in the limit \(N\rightarrow \infty \), if \(Q^{(N)}\rightarrow I\) (which means that the offspring variance approaches 0 as \(N\rightarrow \infty \)), the dynamics converge to Haldane’s classical theory of deterministic evolution (Haldane 1932). This amounts to requiring that
where u is any smooth function and
and \(\gamma \) is type B’s selective advantage (Der 2010; Der et al. 2011).
Because of the generality of the method in which Der et al. allow for selection, GWF models with selection are not necessarily selection processes acorrding to Definition 2.1. To see that this is possible, observe that there is no restriction on the selection matrix, \(S^{(N)}\) for any specific population size N (other than it being row-stochastic); only the infinite-population limit of a sequence of such selection matrices is restricted. Thus, let Q be the transition matrix for a pure-drift GWF model. Let the selection matrix S be any row-stochastic matrix with first and last columns composed of zeros other than the top and bottom (respectively) entries, which are taken to be 1. The transition matrix \(P=SQ\) defines a Markov process for which fixation from any mixed-type state is impossible (violating H2).
1.5 B.5 Neutral and non-neutral Cannings (exchangeable) models
1.5.1 B.5.1 Neutral Cannings models
An important class of models arising in population genetics are due to Cannings (1974). In the most basic formulation, a population of N individuals is considered, each of which can be of either type A or B. The reproduction of each of these individuals (regardless of its type) is assumed to be equivalent in the sense that the numbers of offspring left by each individual are exchangeable random variables.
Mathematically, for any time \(t\ge 0\) and population state, \(X(t) =i\) (that is, i is the number of individuals of type A), let \(\nu _{k}\) \((1\le k\le N)\) be a random variable describing the number of offspring of the \(k\mathrm{th}\) individual in the population (that is, its contribution to the next generation, at time \(t+1\)). Without loss of generality, we label the individuals of type A as \(1,\dots ,i\) and the individuals of type B as \(i+1,\dots ,N\) (where one of these sets of indices is empty if \(i=0\) or \(i=N\)). The population state at time \(t+1\) given that \(X(t) = i\) is then
The assumption of exchangeability of the offspring variables is then that \(\{\nu _{k}\}_{k=1}^{N}\) is a set of exchangeable random variables and independent of the population state, i, that is, the joint probability distribution of \(\{\nu _{k}\}_{k=1}^{N}\) is invariant to the order of these random variables: for any permutation \(\sigma \) of the indices \(1,\dots ,N\), and numbers of offspring \(\left( \xi _{1},\dots ,\xi _{N}\right) \in \{0,\dots ,N\}^N\),
Because the population size is constant, the offspring variables \(\{\nu _{k}\}_{k=1}^{N}\) must also satisfy
so the variables \(\{\nu _{k}\left( i\right) \}_{k=1}^{N}\) are in general not independently distributed.Footnote 7
Any Cannings process is a pure-drift GWF process (Appendix B.4), and thus a neutral drift process (Definition 2.9). To prove this, we must show that the first and second conditional moments of a Cannings process conform to Eq. (67). The exchangeability of the offspring variables implies that the expected number of offspring of all individuals are equal (regardless of their type),
Using Eq. (75),
so \({{\mathrm{\mathbb {E}}}}(\nu _{k})={{\mathrm{\mathbb {E}}}}(\nu _{1})=1\) for any k such that \(1\le k\le N\). It follows that,
so X(t) is a martingale and Eq. (67a) is satisfied. The following Lemma shows that Eq. (67b) also holds.
Lemma B.1
The conditional variance of a Cannings process with offspring variance \(\sigma ^2\) is
Proof
Our derivation follows that of Ewens (2012). Using Eq. (75),
By symmetry, for any \(k\ne j\) such that \(1\le j\le N\) and \(1\le k\le N\), we have
and hence,
It follows that
\(\square \)
1.5.2 B.5.2 Cannings models with selection
Lessard and Ladret (2007) introduced an extension of Cannings models that includes selection. Here, we show that although not all of the models in the class defined by Lessard and Ladret (2007) are selection processes, this is due to some biologically absurd models belonging to this class; under minimal biologically reasonable assumptions, such models are selection processes.
Following Lessard and Ladret (2007, with slightly modified notation), we consider a population of N individuals, each of which can be of either type A or B. In contrast to neutral Cannings models, in which all individuals are exchangeable, we now suppose individuals can be exchanged only with others of their own type (so an A can be exchanged with any other A but not a B).
Mathematically, for any time \(t\ge 0\) and population state, \(X(t) =i\) (that is, i is the number of individuals of type A), let \(\nu _{k}\left( i\right) \) \((1\le k\le N)\) be a random variable describing the number of offspring of the \(k\mathrm{th}\) individual in the population. Without loss of generality, we label the individuals of type A as \(1,\dots ,i\) and the individuals of type B as \(i+1,\dots ,N\). The population state at time \(t+1\) given that \(X(t) = i\) is then
We assume that the offspring variables for each type, \(\{\nu _{k}\left( i\right) \}_{k=1}^{i}\) and \(\{\nu _{k}\left( i\right) \}_{k=i+1}^{N}\) are both sets of exchangeable random variables, that is the joint probability distributions of \(\{\nu _{k}\left( i\right) \}_{k=1}^{i}\) and \(\{\nu _{k}\left( i\right) \}_{k=i+1}^{N}\) are invariant to the order of these random variables: for any permutations \(\sigma _A\) and \(\sigma _B\) of the indices \(1,\dots ,i\) and \(i+1,\dots ,N\), respectively, and numbers of offspring \(\left( \xi _{1},\dots ,\xi _{N}\right) \in \{0,\dots ,N\}^N\),
and
Because the population size is constant, the offspring variables \(\{\nu _{k}\left( i\right) \}_{k=1}^{N}\) must also satisfy
Let the expected number of offspring of individuals of type A be
and the expected number of offspring of individuals of type B be
Equation (86) then implies
Differential fitnesses for the two types can then be introduced by allowing \(\mu _{A}\left( i\right) \) and \(\mu _{B}\left( i\right) \) to differ, and defining the fitness of each type in a manner consistent with hypothesis H1 of Definition 2.1 (for example, Wrightian fitness can be used, i.e. define the expected fitness of each type \(t=A\) or B as \(\overline{W}_{t}\left( {i}\right) \triangleq \mu _{t}\left( i\right) \); see for example Wu et al. 2013). In particular, frequency dependent selection is obtained by allowing the expected numbers of offspring to depend on the population state i.
Lessard and Ladret’s extension of Cannings’ model (described above) defines a discrete time Markov chain with transition matrix
Not all models in the class defined by Lessard and Ladret are selection processes. For example, suppose that \(\{\nu _{k}\left( i\right) \}_{k=1}^{i}\) and \(\{\nu _{k}\left( i\right) \}_{k=i+1}^{N}\) are sets of exchangeable random variables with means \(\mu _{A}\left( i\right) \)=\(\mu _{B}\left( i\right) =1\). Then,
and
that, is, each type evolves independently according to a (neutral) Cannings model.Footnote 8 The joint probability distribution of \(\{\nu _{k}\left( i\right) \}_{k=1}^{i}\) is then
with an analogous expression for \(\{\nu _{k}\left( i\right) \}_{k=i+1}^{N}\) (where we have used Kronecker’s delta notation: \(\delta _{mn}=1\) if \(m=n\) and \(\delta _{mn}=0\) otherwise). The resulting transition matrix \(P=I\) defines a neutral process in the sense that \(\left. {{\mathrm{\mathbb {E}}}}\left( X(t+1)\,|\, X(t)\right) \right. = X(t)\). However, any state is an absorbing state of this Markov process, and in particular, hypothesis H2 of Definition 2.1 is violated.Footnote 9
It is thus natural to ask which of Lessard and Ladret’s models are selection processes? To answer this question, we consider the three hypotheses of Definition 2.1:
- H1: :
-
This hypothesis asserts that the type that has higher fitness at time t is expected to increase in frequency in the next time step. While fitness as such is not part of the definition of Lessard and Ladret’s models, one may define the expected fitness of each type \(t=A\) or B as the expected number of offspring of individuals of that type (as suggested above),
$$\begin{aligned} \overline{W}_{t}\left( {i}\right) \triangleq \mu _{t}\left( i\right) . \end{aligned}$$(93)Under this definition of fitness,
$$\begin{aligned} \left. {{\mathrm{\mathbb {E}}}}(X(t+1)\,|\,X(t) = i)\right.&= {{\mathrm{\mathbb {E}}}}\left( \sum _{k=1}^{i}\nu _{k}\left( i\right) \right) = i\mu _{A}\left( i\right) = i + \frac{i}{N}(N\mu _{A}\left( i\right) -N)\nonumber \\&= i + \frac{i}{N}\big [N\mu _{A}\left( i\right) -(i\mu _{A}\left( i\right) + (N-i)\mu _{B}\left( i\right) \big ] \nonumber \\&=i + i\frac{N-i}{N}(\mu _{A}\left( i\right) -\mu _{B}\left( i\right) ), \end{aligned}$$(94)so H1 is satisfied.
- H2: :
-
This hypothesis stipulates that starting from a mixed-type state i (where both types are present in the population), the fixation of at least one the types (A or B) must be possible. We are not aware of a simple sufficient condition on the exchangeable sets \(\{\nu _{k}\left( i\right) \}_{k=1}^{i}\) and \(\{\nu _{k}\left( i\right) \}_{k=i+1}^{N}\) ensuring that H2 holds. However, models violating this hypothesis seem to us biologically unreasonable.
- H3::
-
This hypothesis stipulates that the states at which the population is composed only of one type (A or B) are absorbing. Any process in the class defined by Lessard and Ladret satisfies H3, because if \(X(t) = 0\), then from Eq. (84)
$$\begin{aligned} X(t+1) = \sum _{k=1}^{0}\nu _{k}\left( i\right) =0, \end{aligned}$$(95)and similarly, if \(X(t)=N\), then \(X(t+1) = N\).
C Fixation probabilities for birth–death processes
Suppose that individuals in a population of constant size N can possess one of two traits, A and B. Let the state of the population (i.e. the number of individuals of type A) evolve according to a discrete-time birth–death process in which a trait that has disappeared cannot re-emerge. That is, the population state may change by at most one at any given time-step (individuals change their type one at a time), and the states 0 and N are absorbing. In this Appendix, we find \(p_\mathrm{fix}\left( {i}\right) \), the fixation probability of the trait A, when there are initially i individuals of type A in the population. We do this following the method presented by Nowak (2006).
Mathematically, the time evolution of the population composition follows a Markov process with transition matrix P satisfying
and \(P_{{k},{j}}=0\) for all \(0\le j<k-1\) and \(k+1<j\le N\), where \(P_{{k},{k+1}}\) and \(P_{{k},{k-1}}\) are the transition probabilities from the state in which there are k individuals of type A, to the ones in which the population contains \(k+1\) or \(k-1\) individuals of type A, respectively. Note also that \(P_{{0},{0}} = P_{{1},{1}} = 1\) and \(P_{{0},{k}} = P_{{N},{N-k}} =0\) for all \(1\le k\le N\) (the states corresponding to homogeneous populations are absorbing).
Let \(p_\mathrm{fix}\left( {i}\right) \) be the probability of reaching state N (fixation of A) when starting from state i. It follows that \(p_\mathrm{fix}\left( {0}\right) = 0\), \(p_\mathrm{fix}\left( {N}\right) =1\) and for \(1\le i \le N-1\),
Consequently,
so
or, defining \(y_i = p_\mathrm{fix}\left( {i}\right) - p_\mathrm{fix}\left( {i-1}\right) \) for \(1\le i \le N\),
Thus,
for \(2\le i\le N-1\).
Summing \(y_k\) for \(1\le k\le i\le N\) gives
Since \(p_\mathrm{fix}\left( {N}\right) =1\), substituting \(i=N\) in Eq. (100) gives
Thus, from Eqs. (101) and (100), the fixation probability of A when there are initially i individuals of type A in the population is
Rights and permissions
About this article
Cite this article
Molina, C., Earn, D.J.D. On selection in finite populations. J. Math. Biol. 76, 645–678 (2018). https://doi.org/10.1007/s00285-017-1151-4
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00285-017-1151-4