Exclusion probabilities and likelihood ratios with applications to kinship problems

Slooten, Klaas-Jan; Egeland, Thore

doi:10.1007/s00414-013-0938-0

Exclusion probabilities and likelihood ratios with applications to kinship problems

Original Article
Published: 27 November 2013

Volume 128, pages 415–425, (2014)
Cite this article

International Journal of Legal Medicine Aims and scope Submit manuscript

Klaas-Jan Slooten^1,3 &
Thore Egeland²

655 Accesses
14 Citations
Explore all metrics

Abstract

In forensic genetics, DNA profiles are compared in order to make inferences, paternity cases being a standard example. The statistical evidence can be summarized and reported in several ways. For example, in a paternity case, the likelihood ratio (LR) and the probability of not excluding a random man as father (RMNE) are two common summary statistics. There has been a long debate on the merits of the two statistics, also in the context of DNA mixture interpretation, and no general consensus has been reached. In this paper, we show that the RMNE is a certain weighted average of inverse likelihood ratios. This is true in any forensic context. We show that the likelihood ratio in favor of the correct hypothesis is, in expectation, bigger than the reciprocal of the RMNE probability. However, with the exception of pathological cases, it is also possible to obtain smaller likelihood ratios. We illustrate this result for paternity cases. Moreover, some theoretical properties of the likelihood ratio for a large class of general pairwise kinship cases, including expected value and variance, are derived. The practical implications of the findings are discussed and exemplified.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Relationship inference based on DNA mixtures

Article 05 November 2015

Pedigree-based relationship inference from complex DNA mixtures

Article 19 January 2017

Mixtures with relatives and linked markers

Article 27 November 2015

References

Buckleton J, Curran J (2008) A discussion of the merits of random man not excluded and likelihood ratios. Forensic Sci Int Genet 2(4):343–348
Article PubMed Google Scholar
Li CC, Chakravarti A (1988) An expository review of two methods of calculating the paternity probability. Am J Hum Genet 43(2):197
CAS PubMed Central PubMed Google Scholar
Slooten K, Meester R (2013) Probabilistic strategies for familial DNA searching. J Roy Statist Soc Ser C. doi:10.1111/rssc.12035
Thompson EA (1986) Likelihood inference of paternity. Am J Hum Genet 39(2):285
CAS PubMed Central PubMed Google Scholar
Thompson EA (2000) Statistical inference from genetic data on pedigrees. In: NSF-CBMS regional conference series in probability and statistics. JSTOR
Egeland T, Pinto N, Vigeland MD (2013) A general approach to power calculation for relationship testing. Forensic Sci Int Genet. doi:10.1016/j.fsigen.2013.05.001
Jacquard A (1972) Genetic information given by a relative. Biometrics 28(4):1101
Article CAS PubMed Google Scholar
Gjertson DW, Brenner CH, Baur MP et al (2007) ISFG: recommendations on biostatistics in paternity testing. Forensic Sci Int Genet 1(3):223–231
Article PubMed Google Scholar
Nothnagel M, Schmidtke J, Krawczak M. (2010) Potentials and limits of pairwise kinship analysis using autosomal short tandem repeat loci. Int J Legal Med 124(3):205–215
Article PubMed Google Scholar

Download references

Acknowledgments

The work of TE leading to these results was financially supported by the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 285487 (EUROFORGEN-NoE).

Author information

Authors and Affiliations

Netherlands Forensic Institute, P.O. Box 24044, 2490 AA, The Hague, The Netherlands
Klaas-Jan Slooten
Norwegian University of Life Sciences, 1432, Aas, Norway
Thore Egeland
Department of Mathematics, VU University, De Boelelaan 1081a, 1081 HV, Amsterdam, The Netherlands
Klaas-Jan Slooten

Authors

Klaas-Jan Slooten
View author publications
You can also search for this author in PubMed Google Scholar
Thore Egeland
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thore Egeland.

Appendices

Appendix 1: Mathematical derivations

We first derive Eq. 15. The probability for individuals with genotypes g and g ^′ can be written as follows:

$$ P(g,g'|\kappa)=\kappa_{0} P(g,g'|0)+\kappa_{1}P(g,g'|1)+\kappa_{2}P(g,g'|2). $$

(17)

where P(g, g ^′ | i) denotes the joint distribution conditioned on the individuals sharing i alleles IBD. For i = 0, 1, 2, this corresponds to the probability of two DNA profiles of unrelated individuals, a parent–child pair and identical twins, respectively. Our goal is to derive a closed formula for the following:

$$ a_{P}(\kappa_{0},\kappa_{1},\kappa_{2})=E[{\text{LR}}(\mathcal{H}_{\mathrm{P}})]=\sum_{g,g'} \frac{P(g,g'|\kappa)^{2}}{P(g,g'|0)}. $$

Consider first

$$\begin{array}{@{}rcl@{}} a_{P}(1-\kappa_{1},\kappa_{1},0)&=&{(1-\kappa_{1})}^{2}+2(1-\kappa_{1})\kappa_{1}\notag\\ &&+\kappa_{1}^{2} \sum_{g,g'}\frac{{ P(g,g'|1)}^{2}}{f_{g}f_{g'}}\notag\\ &=&{(1-\kappa_{1})}^{2}+2(1-\kappa_{1})\kappa_{1}+\kappa_{1}^{2}\frac{L+3}{4}\notag\\ \end{array} $$

(18)

The last sum is (L + 3) / 4 as shown previously for the paternity case and, for instance, the previous half-sib case follows.

Consider next the general case. We show below that

$$\begin{array}{@{}rcl@{}} a_{P}(\kappa_{0},\kappa_{1},\kappa_{2})&=&S_{1}+S_{2}+S_{3}\\ &=& \kappa_{0}^{2}+2\kappa_{0}\kappa_{1}+\kappa_{1}^{2}\frac{L+3}{4}\\ &+&2\kappa_{0}\kappa_{2}+\kappa_{1}\kappa_{2}(L+1)\\&+&\kappa_{2}^{2}\frac{L(L+1)}{2}. \end{array} $$

(19)

This is seen as follows:

The term
$$S_{1}=\sum_{g,g'} \frac{{(\kappa_{0} P(g,g'|0)+\kappa_{1}{P(g,g'|1))}}^{2}}{P(g,g'|0)} $$
follows from Eq. 18.
Consider next S ₂ = A + B, where
$$ A=2\kappa_{0}\kappa_{2}\sum_{g,g'} \frac{P(g,g'|0)P(g,g'|2)}{P(g,g'|0)}=2\kappa_{0}\kappa_{2}. $$
Next,
$$\begin{array}{@{}rcl@{}} B&=&2\kappa_{1}\kappa_{2}\sum_{g,g'} \frac{P(g,g'|1)P(g,g'|2)}{P(g,g'|0)}\\ &=&2\kappa_{1}\kappa_{2}\sum_{g} \frac{P(g,g|1)}{f_{g}}\\ &=&2\kappa_{1}\kappa_{2} \left(\sum_{a=1}^{L}p_{a}+\sum_{a\neq b}\frac{\frac{1}{4}p_{a}p_{b}(p_{a}+p_{b})}{p_{a}p_{b}}\right)\\ &=&\kappa_{1}\kappa_{2}(L+1). \end{array} $$
The expression for S ₂ = A + B = 2κ ₂ κ ₀ + κ ₂ κ ₁(L + 1) follows.
For S ₃, g = g ^′ and so
$$S_{3}=\kappa_{2}^{2} \sum_{g} \frac{{P(g,g|2)}^{2}}{P(g,g|0)}=\kappa_{2}^{2} \sum_{g} \frac{f_{g}}{f_{g}} =\kappa_{2}^{2}\frac{L(L+1)}{2}. $$
completing the derivation of Eq. 15.

Sibling index. Variance

The variance of the sibling index for true siblings is derived along similar lines as has been done for the paternity index. First, consider a homozygous first sibling with genotype (a a). For such a person, the expected squared sibling index is equal to

$$p_{a}^{2}\left( \frac{(1+p_{a})^{2}}{4p_{a}^{2}}\right)^{3}+2p_{a}(1-p_{a})\left(\frac{1+p_{a}}{4p_{a}}\right)^{3}+(1-p_{a})^{2}\frac{1}{64},$$

and summing this quantity, multiplied by $p_{a}^{2}$ gives the contribution of the homozygotes to $E\left [{\text {LR}}^{2}(\mathcal {H}_{\mathrm {P}})\right ]$, which turns out to be equal to

$$\left(\frac{1+3p_{a}+4p_{a}^{2}}{8p_{a}}\right)^{2}.$$

Similarly, the contribution of heterozygotes to $E[{\text {LR}}^{2}(\mathcal {H}_{\mathrm {P}})]$ is

$$\sum_{a<b}\frac{1+3(p_{a}+p_{b})+4\left(p_{a}^{2}+p_{b}^{2}\right)+12p_{a}p_{b}+24p_{a}p_{b}(p_{a}+p_{b})+64p_{a}^{2}p_{b}^{2}}{128p_{a}p_{b}}. $$

Together, this leads to

$$\begin{array}{@{}rcl@{}} E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]&=&\frac{1}{64}\left(28+26L+3L^{2}\right)+\frac{1}{64}\sum_{a}\left(\frac{1}{p_{a}^{2}}+\frac{6}{p_{a}}\right)\\&&+\frac{1}{128}\sum_{a<b}\frac{3(p_{a}+p_{b})+4\left(p_{a}^{2}+p_{b}^{2}\right)+1}{p_{a}p_{b}}. \end{array} $$

and this completes the derivation as

$$ {\text{Var}}\left[{\text{LR}}(\mathcal{H}_{\mathrm{P}})\right]=E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]-a_{P}^{2}\left(\frac{1}{4},\frac{1}{2},\frac{1}{4}\right) $$

(20)

where the a _P-function is given in Eq. 15. The variance is minimal for equal allele frequencies in which case it is given by

$$\frac{(L-1)\left(3L^{3}+25L^{2}+80L+128\right)}{1024}.$$

Half siblings. Variance

From Eq. 15,

$$ E\left[{\text{LR}}(\mathcal{H}_{\mathrm{P}})\right]=\frac{L+15}{16}, $$

(21)

which is again independent of the allele frequencies, and only involves the number of alleles on the locus that we consider. It is clear from this expression that the expected likelihood ratio in favor of being half-siblings is rather low.

A derivation analogous to the one for parents and children gives

$$E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]=\frac{11}{64}L+\frac{53}{64}+\frac{1}{128}\sum_{a<b}\frac{p_{a}}{p_{b}}+\frac{p_{b}}{p_{a}},$$

leading to a variance equal to

$${\text{Var}}[{\text{LR}}(\mathcal{H}_{\mathrm{P}})]=\frac{1}{256}\left(-L^{2}+14L-13\right)+\frac{1}{128}\sum_{a<b}\frac{p_{a}}{p_{b}}+\frac{p_{b}}{p_{a}},$$

which has a minimum equal to

$$\frac{1}{256}(L-1)(L+13),$$

obtained if and only if all allele frequencies are equal.

Appendix 2: Numerical calculations

For numerical calculations we have used the freely available R package ( http://cran.r-project.org/) and also functions (coded by TE) in the R package euroMix freely available from arken.umb.no/∼theg/euroMix_1.1.zip.. The database used for the examples is available as the data set db as shown below.

For the parent–offspring relationship, we have the explicit formula given in Eq. 10. Below follows calculations confirming the standard deviation of ${\text {LR}}(\mathcal {H}_{\mathrm {P}})$ for VWA in Table 7:

Alternatively, the calculations can be based on Eq. 15. See documentation of euroMix for further documentation. Below, the above answer is reproduced.

The below output shows that ${\text {LR}}(\mathcal {H}_{\mathrm {P}})$ can be skewed to the left as stated in “Skewness” section:

The mean, standard deviation and skewness of ${\text {LR}}(\mathcal {H}_{\mathrm {P}})$ are, respectively, 2.3281, 1.1344 and − 0. 3441. There are other examples, some with κ ₀ > 0, leading to negative skewness. However, whether any of these examples correspond to possible (inbred or not) pedigrees remains unknown.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Slooten, KJ., Egeland, T. Exclusion probabilities and likelihood ratios with applications to kinship problems. Int J Legal Med 128, 415–425 (2014). https://doi.org/10.1007/s00414-013-0938-0

Download citation

Received: 04 August 2013
Accepted: 29 October 2013
Published: 27 November 2013
Issue Date: May 2014
DOI: https://doi.org/10.1007/s00414-013-0938-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Exclusion probabilities and likelihood ratios with applications to kinship problems

Abstract

Access this article

Similar content being viewed by others

Relationship inference based on DNA mixtures

Pedigree-based relationship inference from complex DNA mixtures

Mixtures with relatives and linked markers

References

Acknowledgments