Most large genotype datasets contain some errors, and an error rate of 1–2% is enough to distort map distances and to produce a false conclusion of linkage (Abecasis et al., Eur J Hum Genet 9:130–134, 2001). The data should therefore be as clean as possible. Data cleaning, however, is tedious and demands effort and experience. O’Connell and Weeks implemented four error-checking algorithms in a computer program called PedCheck. This chapter discusses those four algorithms, with a focus on the genotype-elimination method, and provides an example of the four levels of error checking permitted by PedCheck, together with the required input files. Alternative algorithms implemented in other statistical computing programs are also briefly discussed.
Ott J (1974) Estimation of the recombination fraction in human pedigrees: efficient computation of the likelihood for human linkage studies. Am J Hum Genet 26(5):588–597
Lange K, Weeks DE (1989) Efficient computation of lod scores: genotype elimination, genotype redefinition, and hybrid maximum likelihood algorithms. Ann Hum Genet 53(Pt 1):67–83
Terwilliger JD, Ott J (1994) Handbook of human genetic linkage, 1st edn. The Johns Hopkins University Press, Baltimore
Sobel E, Lange K (1996) Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. Am J Hum Genet 58(6):1323–1337
Abecasis GR, Cherny SS, Cookson WO, Cardon LR (2002) Merlin--rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30(1):97–101
Sobel E, Sengul H, Weeks DE (2001) Multipoint estimation of identity-by-descent probabilities at arbitrary positions among marker loci on general pedigrees. Hum Hered 52(3):121–131
Abecasis GR, Cherny SS, Cardon LR (2001) The impact of genotyping error on family-based analysis of quantitative traits. Eur J Hum Genet 9(2):130–134