Skip to main content
Log in

Exclusion probabilities and likelihood ratios with applications to kinship problems

  • Original Article
  • Published:
International Journal of Legal Medicine Aims and scope Submit manuscript

Abstract

In forensic genetics, DNA profiles are compared in order to make inferences, paternity cases being a standard example. The statistical evidence can be summarized and reported in several ways. For example, in a paternity case, the likelihood ratio (LR) and the probability of not excluding a random man as father (RMNE) are two common summary statistics. There has been a long debate on the merits of the two statistics, also in the context of DNA mixture interpretation, and no general consensus has been reached. In this paper, we show that the RMNE is a certain weighted average of inverse likelihood ratios. This is true in any forensic context. We show that the likelihood ratio in favor of the correct hypothesis is, in expectation, bigger than the reciprocal of the RMNE probability. However, with the exception of pathological cases, it is also possible to obtain smaller likelihood ratios. We illustrate this result for paternity cases. Moreover, some theoretical properties of the likelihood ratio for a large class of general pairwise kinship cases, including expected value and variance, are derived. The practical implications of the findings are discussed and exemplified.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Buckleton J, Curran J (2008) A discussion of the merits of random man not excluded and likelihood ratios. Forensic Sci Int Genet 2(4):343–348

    Article  PubMed  Google Scholar 

  2. Li CC, Chakravarti A (1988) An expository review of two methods of calculating the paternity probability. Am J Hum Genet 43(2):197

    CAS  PubMed Central  PubMed  Google Scholar 

  3. Slooten K, Meester R (2013) Probabilistic strategies for familial DNA searching. J Roy Statist Soc Ser C. doi:10.1111/rssc.12035

  4. Thompson EA (1986) Likelihood inference of paternity. Am J Hum Genet 39(2):285

    CAS  PubMed Central  PubMed  Google Scholar 

  5. Thompson EA (2000) Statistical inference from genetic data on pedigrees. In: NSF-CBMS regional conference series in probability and statistics. JSTOR

  6. Egeland T, Pinto N, Vigeland MD (2013) A general approach to power calculation for relationship testing. Forensic Sci Int Genet. doi:10.1016/j.fsigen.2013.05.001

  7. Jacquard A (1972) Genetic information given by a relative. Biometrics 28(4):1101

    Article  CAS  PubMed  Google Scholar 

  8. Gjertson DW, Brenner CH, Baur MP et al (2007) ISFG: recommendations on biostatistics in paternity testing. Forensic Sci Int Genet 1(3):223–231

    Article  PubMed  Google Scholar 

  9. Nothnagel M, Schmidtke J, Krawczak M. (2010) Potentials and limits of pairwise kinship analysis using autosomal short tandem repeat loci. Int J Legal Med 124(3):205–215

    Article  PubMed  Google Scholar 

Download references

Acknowledgments

The work of TE leading to these results was financially supported by the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 285487 (EUROFORGEN-NoE).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thore Egeland.

Appendices

Appendix 1: Mathematical derivations

We first derive Eq. 15. The probability for individuals with genotypes g and g can be written as follows:

$$ P(g,g'|\kappa)=\kappa_{0} P(g,g'|0)+\kappa_{1}P(g,g'|1)+\kappa_{2}P(g,g'|2). $$
(17)

where P(g, g | i) denotes the joint distribution conditioned on the individuals sharing i alleles IBD. For i = 0, 1, 2, this corresponds to the probability of two DNA profiles of unrelated individuals, a parent–child pair and identical twins, respectively. Our goal is to derive a closed formula for the following:

$$ a_{P}(\kappa_{0},\kappa_{1},\kappa_{2})=E[{\text{LR}}(\mathcal{H}_{\mathrm{P}})]=\sum_{g,g'} \frac{P(g,g'|\kappa)^{2}}{P(g,g'|0)}. $$

Consider first

$$\begin{array}{@{}rcl@{}} a_{P}(1-\kappa_{1},\kappa_{1},0)&=&{(1-\kappa_{1})}^{2}+2(1-\kappa_{1})\kappa_{1}\notag\\ &&+\kappa_{1}^{2} \sum_{g,g'}\frac{{ P(g,g'|1)}^{2}}{f_{g}f_{g'}}\notag\\ &=&{(1-\kappa_{1})}^{2}+2(1-\kappa_{1})\kappa_{1}+\kappa_{1}^{2}\frac{L+3}{4}\notag\\ \end{array} $$
(18)

The last sum is (L + 3) / 4 as shown previously for the paternity case and, for instance, the previous half-sib case follows.

Consider next the general case. We show below that

$$\begin{array}{@{}rcl@{}} a_{P}(\kappa_{0},\kappa_{1},\kappa_{2})&=&S_{1}+S_{2}+S_{3}\\ &=& \kappa_{0}^{2}+2\kappa_{0}\kappa_{1}+\kappa_{1}^{2}\frac{L+3}{4}\\ &+&2\kappa_{0}\kappa_{2}+\kappa_{1}\kappa_{2}(L+1)\\&+&\kappa_{2}^{2}\frac{L(L+1)}{2}. \end{array} $$
(19)

This is seen as follows:

  • The term

    $$S_{1}=\sum_{g,g'} \frac{{(\kappa_{0} P(g,g'|0)+\kappa_{1}{P(g,g'|1))}}^{2}}{P(g,g'|0)} $$

    follows from Eq. 18.

  • Consider next S 2 = A + B, where

    $$ A=2\kappa_{0}\kappa_{2}\sum_{g,g'} \frac{P(g,g'|0)P(g,g'|2)}{P(g,g'|0)}=2\kappa_{0}\kappa_{2}. $$

    Next,

    $$\begin{array}{@{}rcl@{}} B&=&2\kappa_{1}\kappa_{2}\sum_{g,g'} \frac{P(g,g'|1)P(g,g'|2)}{P(g,g'|0)}\\ &=&2\kappa_{1}\kappa_{2}\sum_{g} \frac{P(g,g|1)}{f_{g}}\\ &=&2\kappa_{1}\kappa_{2} \left(\sum_{a=1}^{L}p_{a}+\sum_{a\neq b}\frac{\frac{1}{4}p_{a}p_{b}(p_{a}+p_{b})}{p_{a}p_{b}}\right)\\ &=&\kappa_{1}\kappa_{2}(L+1). \end{array} $$

    The expression for S 2 = A + B = 2κ 2 κ 0 + κ 2 κ 1(L + 1) follows.

  • For S 3, g = g and so

    $$S_{3}=\kappa_{2}^{2} \sum_{g} \frac{{P(g,g|2)}^{2}}{P(g,g|0)}=\kappa_{2}^{2} \sum_{g} \frac{f_{g}}{f_{g}} =\kappa_{2}^{2}\frac{L(L+1)}{2}. $$

    completing the derivation of Eq. 15.

Sibling index. Variance

The variance of the sibling index for true siblings is derived along similar lines as has been done for the paternity index. First, consider a homozygous first sibling with genotype (a a). For such a person, the expected squared sibling index is equal to

$$p_{a}^{2}\left( \frac{(1+p_{a})^{2}}{4p_{a}^{2}}\right)^{3}+2p_{a}(1-p_{a})\left(\frac{1+p_{a}}{4p_{a}}\right)^{3}+(1-p_{a})^{2}\frac{1}{64},$$

and summing this quantity, multiplied by \(p_{a}^{2}\) gives the contribution of the homozygotes to \(E\left [{\text {LR}}^{2}(\mathcal {H}_{\mathrm {P}})\right ]\), which turns out to be equal to

$$\left(\frac{1+3p_{a}+4p_{a}^{2}}{8p_{a}}\right)^{2}.$$

Similarly, the contribution of heterozygotes to \(E[{\text {LR}}^{2}(\mathcal {H}_{\mathrm {P}})]\) is

$$\sum_{a<b}\frac{1+3(p_{a}+p_{b})+4\left(p_{a}^{2}+p_{b}^{2}\right)+12p_{a}p_{b}+24p_{a}p_{b}(p_{a}+p_{b})+64p_{a}^{2}p_{b}^{2}}{128p_{a}p_{b}}. $$

Together, this leads to

$$\begin{array}{@{}rcl@{}} E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]&=&\frac{1}{64}\left(28+26L+3L^{2}\right)+\frac{1}{64}\sum_{a}\left(\frac{1}{p_{a}^{2}}+\frac{6}{p_{a}}\right)\\&&+\frac{1}{128}\sum_{a<b}\frac{3(p_{a}+p_{b})+4\left(p_{a}^{2}+p_{b}^{2}\right)+1}{p_{a}p_{b}}. \end{array} $$

and this completes the derivation as

$$ {\text{Var}}\left[{\text{LR}}(\mathcal{H}_{\mathrm{P}})\right]=E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]-a_{P}^{2}\left(\frac{1}{4},\frac{1}{2},\frac{1}{4}\right) $$
(20)

where the a P -function is given in Eq. 15. The variance is minimal for equal allele frequencies in which case it is given by

$$\frac{(L-1)\left(3L^{3}+25L^{2}+80L+128\right)}{1024}.$$

Half siblings. Variance

From Eq. 15,

$$ E\left[{\text{LR}}(\mathcal{H}_{\mathrm{P}})\right]=\frac{L+15}{16}, $$
(21)

which is again independent of the allele frequencies, and only involves the number of alleles on the locus that we consider. It is clear from this expression that the expected likelihood ratio in favor of being half-siblings is rather low.

A derivation analogous to the one for parents and children gives

$$E\left[{\text{LR}}^{2}(\mathcal{H}_{\mathrm{P}})\right]=\frac{11}{64}L+\frac{53}{64}+\frac{1}{128}\sum_{a<b}\frac{p_{a}}{p_{b}}+\frac{p_{b}}{p_{a}},$$

leading to a variance equal to

$${\text{Var}}[{\text{LR}}(\mathcal{H}_{\mathrm{P}})]=\frac{1}{256}\left(-L^{2}+14L-13\right)+\frac{1}{128}\sum_{a<b}\frac{p_{a}}{p_{b}}+\frac{p_{b}}{p_{a}},$$

which has a minimum equal to

$$\frac{1}{256}(L-1)(L+13),$$

obtained if and only if all allele frequencies are equal.

Appendix 2: Numerical calculations

For numerical calculations we have used the freely available R package ( http://cran.r-project.org/) and also functions (coded by TE) in the R package euroMix freely available from arken.umb.no/∼theg/euroMix_1.1.zip.. The database used for the examples is available as the data set db as shown below.

For the parent–offspring relationship, we have the explicit formula given in Eq. 10. Below follows calculations confirming the standard deviation of \({\text {LR}}(\mathcal {H}_{\mathrm {P}})\) for VWA in Table 7:

Alternatively, the calculations can be based on Eq. 15. See documentation of euroMix for further documentation. Below, the above answer is reproduced.

The below output shows that \({\text {LR}}(\mathcal {H}_{\mathrm {P}})\) can be skewed to the left as stated in “Skewness” section:

The mean, standard deviation and skewness of \({\text {LR}}(\mathcal {H}_{\mathrm {P}})\) are, respectively, 2.3281, 1.1344 and − 0. 3441. There are other examples, some with κ 0 > 0, leading to negative skewness. However, whether any of these examples correspond to possible (inbred or not) pedigrees remains unknown.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Slooten, KJ., Egeland, T. Exclusion probabilities and likelihood ratios with applications to kinship problems. Int J Legal Med 128, 415–425 (2014). https://doi.org/10.1007/s00414-013-0938-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00414-013-0938-0

Keywords

Navigation