Journal of Mathematical Biology, Volume 57, Issue 3, pp 435–465

Computing likelihoods for coalescents with multiple collisions in the infinitely many sites model

Article

DOI: 10.1007/s00285-008-0170-6

Cite this article as:
Birkner, M. & Blath, J. J. Math. Biol. (2008) 57: 435. doi:10.1007/s00285-008-0170-6

Abstract

One of the central problems in mathematical genetics is the inference of evolutionary parameters of a population (such as the mutation rate) based on the observed genetic types in a finite DNA sample. If the population model under consideration is in the domain of attraction of the classical Fleming–Viot process, such as the Wright–Fisher or the Moran model, then the standard means to describe its genealogy is Kingman’s coalescent. For this coalescent process, powerful inference methods are well established. An important feature of this class of models is, roughly speaking, that the number of offspring of each individual is small compared to the total population size, and hence all ancestral collisions are binary. Recently, more general population models have been studied, in particular those in the domain of attraction of so-called generalised Λ-Fleming–Viot processes, together with their (dual) genealogies, given by the so-called Λ-coalescents, which allow multiple collisions. Moreover, Eldon and Wakeley (Genetics 172:2621–2633, 2006) provide evidence that such more general coalescents might actually be more adequate to describe real populations with extreme reproductive behaviour, in particular many marine species. In this paper, we extend methods of Ethier and Griffiths (Ann Probab 15(2):515–545, 1987) and Griffiths and Tavaré (Theor Pop Biol 46:131–159, 1994a, Stat Sci 9:307–319, 1994b, Philos Trans Roy Soc Lond Ser B 344:403–410, 1994c, Math Biosci 12:77–98, 1995) to obtain a likelihood-based inference method for general Λ-coalescents. In particular, we obtain a method to compute (approximate) likelihood surfaces for the observed type probabilities of a given sample. We argue that within the (vast) family of Λ-coalescents, the parametrisable sub-family of Beta(2 − α, α)-coalescents, where α ∈ (1, 2], is of particular relevance. We illustrate our method using simulated datasets, thus obtaining maximum-likelihood estimators of mutation and demographic parameters.
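
To make the role of the parameter α concrete, the following is a minimal sketch (not the authors' implementation) of the merger rates of a Beta(2 − α, α)-coalescent. It assumes the standard Λ-coalescent rate formula, under which any given k of b lineages merge at rate λ(b, k) = ∫ x^(k−2) (1 − x)^(b−k) Λ(dx); for Λ = Beta(2 − α, α) this evaluates to B(k − α, b − k + α) / B(2 − α, α). The function names are hypothetical, and the sketch is restricted to α ∈ (1, 2), since α = 2 corresponds to the Kingman limit, where the Beta density degenerates.

```python
from math import comb, exp, lgamma


def log_beta(a: float, b: float) -> float:
    """Logarithm of the Beta function B(a, b) for a, b > 0."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_coalescent_rate(b: int, k: int, alpha: float) -> float:
    """Rate at which one specific group of k out of b lineages merges.

    Assumes Lambda = Beta(2 - alpha, alpha) with 1 < alpha < 2.
    """
    if not 1.0 < alpha < 2.0:
        raise ValueError("this sketch requires 1 < alpha < 2")
    if not 2 <= k <= b:
        raise ValueError("need 2 <= k <= b")
    return exp(log_beta(k - alpha, b - k + alpha) - log_beta(2.0 - alpha, alpha))


def total_merger_rate(b: int, alpha: float) -> float:
    """Total rate of any merger event while b lineages are present."""
    return sum(comb(b, k) * beta_coalescent_rate(b, k, alpha) for k in range(2, b + 1))
```

As α approaches 2, the rates for k ≥ 3 vanish and only binary mergers remain, which recovers Kingman's coalescent; smaller α puts more weight on multiple collisions.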

Keywords

Λ-coalescent · Likelihood-based inference · Infinitely-many-sites model · Population genetics · Fleming–Viot process · Multiple collisions · Monte-Carlo method

Mathematics Subject Classification (2000)

92D15 60G09 60G52 60J75 60J85 

Copyright information

© Springer-Verlag 2008

Authors and Affiliations

  1. Weierstraß-Institut für Angewandte Analysis und Stochastik, Berlin, Germany
  2. Institut für Mathematik, Technische Universität Berlin, Berlin, Germany