Skip to main content
Log in

Population Growth Combined with Wide Offspring Distributions can Increase Fixation Rate and Reduce Genetic Diversity

  • Original Article
  • Published:
Bulletin of Mathematical Biology Aims and scope Submit manuscript

Abstract

Genetic diversity grows with the population size in most neutral evolution models. Empirical evidence of large populations with limited diversity has been proposed to be due either to genetic bottlenecks or to selection. An alternative explanation is that the limited diversity is a result of rare reproduction events. Indeed, recent estimates of the offspring number distribution highlight the role of large reproduction events. We here show that in a large class of models containing such rare events, genetic diversity decreases as the population size increases, in neutral evolution models. For many realistic offspring number distributions, the contribution of rare events to the dynamics grows with the population size. In the context of genetic diversity, these rare events induce a decrease in the time to the most recent common ancestor and in the genetic heterogeneity as the population grows. This phenomenon may explain the observed rapid fixation of genes in large populations, in the absence of observable selection or bottlenecks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Arnason E (2004) Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy. Genetics 166:1871–1885

    Article  Google Scholar 

  • Athreya K (2012) Coalescence in the recent past in rapidly growing populations. Stoch Process Appl 122:3757–3766

    Article  MathSciNet  MATH  Google Scholar 

  • Eldon B, Wakeley J (2006) Coalescent processes when the distribution of offspring number among individuals is highly skewed. Genetics 172:2621–2633

    Article  Google Scholar 

  • Ewens WJ (2004) Mathematical population genetics 1: I. Theoretical introduction. Springer, Berlin

    Book  MATH  Google Scholar 

  • Fields B et al (2007) Fields’ virology, 5th edn. Wolters Kluwer Health/Lippincott Williams & Wilkins, Philadelphia

    Google Scholar 

  • Haas M, Pigorsch C (2009) Financial economics, fat-tailed distributions. Encycl Complex Syst Sci 4:3404–3435

    Article  Google Scholar 

  • Hedgecock D, Pudovkin AI (2011) Sweepstakes reproductive success in highly fecund marine fish and shellfish: a review and commentary. Bull Mar Sci 87:971–1002

    Article  Google Scholar 

  • Hedgecock D, Tracey M, Nelson K (1982) Genetics, the biology of crustacea. Academic Press, New York

    Google Scholar 

  • Hoban SM, Mezzavilla M, Gaggiotti OE, Benazzo A, Van Oosterhout C, Bertorelle G (2013) High variance in reproductive success generates a false signature of a genetic bottleneck in populations of constant size: a simulation study. BMC Bioinformatics 14(1):1–10

  • Huillet T, Möhle M (2013) On the extended Moran model and its relation to coalescents with multiple collisions. Theor Popul Biol 87:5–14

  • Kingman JF (1982a) On the genealogy of large populations. J Appl Probab 27–43

  • Kingman JFC (1982b) The coalescent. Stoch Process Appl 13:235–248

    Article  MathSciNet  MATH  Google Scholar 

  • Lambert A (2008) Population dynamics and random genealogies. Stoch Models 24:45–163

    Article  MathSciNet  MATH  Google Scholar 

  • Leonarduzzi C et al (2012) Towards an optimal sampling effort for paternity analysis in forest trees: what do the raw numbers tell us? iFor-Biogeosci For 5:18–25

  • Möhle M, Sagitov S (2001) A classification of coalescent processes for haploid exchangeable population models. Ann Probab 29:1547–1562

    Article  MathSciNet  MATH  Google Scholar 

  • Nei M et al (1975) The bottleneck effect and genetic variability in populations. Evolution 29(1):1–10

  • Pitman J (1999) Coalescents with multiple collisions. Ann Probab 27(4):1870–1902

  • Resnick SI (2007) Heavy-tail phenomena: probabilistic and statistical modeling. Springer, Berlin

    MATH  Google Scholar 

  • Schweinsberg J (2003) Coalescent processes obtained from supercritical Galton–Watson processes. Stoch Proc Appl 106(1):107–139

  • Turner TF et al (1999) Temporal genetic variation of mitochondrial DNA and the female effective population size of red drum (Sciaenops ocellatus) in the northern Gulf of Mexico. Molec Ecol 8:1223–1229

    Article  Google Scholar 

  • Wakeley J (2009) Coalescent theory: an introduction. Roberts & Company Publishers, Englewood

    Google Scholar 

  • Wakeley J, Sargsyan O (2009) Extensions of the coalescent effective population size. Genetics 181:341–345

    Article  Google Scholar 

  • Zerjal T et al (2003) The genetic legacy of the Mongols. Am J Hum Genet 72:717–721

    Article  Google Scholar 

Download references

Acknowledgments

We thank Gur Yaari, Reuven Cohen, Naama Brenner and Marc Feldman for their comments and Miriam Beller for the English editing.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoram Louzoun.

Appendices

Appendix 1: Asymptotic Analysis of Truncated Power Laws

Let \(U_N\) be a r.v. with truncated power law distribution:

$$\begin{aligned} P\left( {U_N =k} \right) =D_N k^{-\alpha } \quad \hbox {for} \quad k=1,2,\ldots ,N, \end{aligned}$$
(10)

where \(D_N\) is the normalizing constant. Note that

$$\begin{aligned} E\left[ {U_N^m } \right] =\sum _{k=1}^N {D_N k^{m-\alpha }} , \quad \frac{ 1}{D_N }=\sum _{k=1}^N {k^{-\alpha }} . \end{aligned}$$
(11)

The asymptotic approximations of these sums are well known (we write \(f\sim g\) for \(\lim \limits _{N\rightarrow \infty } f/g=1)\):

$$\begin{aligned} \sum _{k=1}^N {k^{-\alpha }} \sim \left\{ {{\begin{array}{lll} {N^{1-\alpha }/\left( {1-\alpha } \right) }&{} \quad \hbox {if} &{} -\infty<\alpha<1 \\ {\log N}&{} \quad \hbox {if} &{} \alpha =1 \\ {\zeta \left( \alpha \right) }&{} \quad \hbox {if} &{} 1<\alpha <\infty \\ \end{array} } } \right. , \end{aligned}$$

where \(\zeta \) denotes the Riemann zeta function. From this, it immediately follows that

$$\begin{aligned} E\left[ {U_N} \right] =D_N \sum _{k=1}^N {k^{1-\alpha }} \sim \left\{ {{\begin{array}{lll} {\left( {1-\alpha } \right) N/\left( {2-\alpha } \right) }&{} \quad \hbox {if} &{} -\infty<\alpha<1 \\ {N/\log N}&{} \quad \hbox {if} &{} \alpha =1 \\ {N^{2-\alpha }/\left( {\left( {2-\alpha } \right) \zeta \left( \alpha \right) } \right) }&{}\quad \hbox {if} &{} 1<\alpha<2 \\ {\left( {\log N} \right) /\zeta \left( 2 \right) }&{} \quad \hbox {if} &{} \alpha =2 \\ {\zeta \left( {\alpha -1} \right) /\zeta \left( \alpha \right) }&{} \quad \hbox {if} &{} 2<\alpha <\infty \\ \end{array}}} \right. \end{aligned}$$

and

$$\begin{aligned} \hbox {var}\left[ {U_N} \right] \sim \left\{ {{ \begin{array}{lll} {\left( {1-\alpha } \right) N^{2}/\left( {\left( {3-\alpha } \right) \left( {2-\alpha } \right) ^{2}} \right) }&{} \quad \hbox {if} &{} -\infty<\alpha<1 \\ {N^{2}/\left( {2\log N} \right) }&{} \quad \hbox {if} &{} \alpha =1 \\ {N^{3-\alpha }/\left( {\left( {3-\alpha } \right) \zeta \left( \alpha \right) } \right) } &{}\quad \hbox {if} &{} 1<\alpha<3 \\ {\left( {\log N} \right) /\zeta \left( 3 \right) } &{} \quad \hbox {if} &{} \alpha =3 \\ {\zeta \left( {\alpha -2} \right) /\zeta \left( \alpha \right) -\left( {\zeta \left( {\alpha -1} \right) /\zeta \left( \alpha \right) } \right) ^{2}} &{}\quad \hbox {if} &{} 3<\alpha <\infty \\ \end{array} } } \right. . \end{aligned}$$

Thus, we can conclude that in the notation of Theorem 1, the choice of MC as a modified Moran model with \(U_N\) distributed as above will lead to MG having a decreasing TMRCA of a sample as time passes if and only if \(\alpha <2\).

Appendix 2: Simulation Methods

For the growing population model, we simulate the entire process forward in time. First we generate the times of reproduction events as exponentials with rate equal to N, the current population size. Then we generate the embedded jump chain by drawing at each transition a (uniformly) random member of the population to reproduce, drawing his number of offspring from the law of \(U_N +c\) and drawing a random sample of size \(N-U_N\) from the previous generation who will survive to the next generation. We iterate this (with different values of N), maintain the entire history of the embedded chain in memory and average out the waiting times to recover a realization of the continuous process.

To calculate the TMRCA of a sample, we simply follow its ancestral line in the generated lineage tree up to the first occurrence of a single ancestor.

To calculate the heterozigosity and mean pairwise difference of the population, we superimposed a mutation model independent of reproduction events, with exponentially distributed mutation events.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alter, I., Louzoun, Y. Population Growth Combined with Wide Offspring Distributions can Increase Fixation Rate and Reduce Genetic Diversity. Bull Math Biol 78, 1477–1492 (2016). https://doi.org/10.1007/s11538-016-0191-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11538-016-0191-z

Keywords

Navigation