Skip to main content
Log in

Inferences for genotyping error rate in ancestry identification from simple sequence repeat marker profiles

  • Published:
Journal of Agricultural, Biological, and Environmental Statistics Aims and scope Submit manuscript

Abstract

Genotyping errors can cause difficulties in a variety of scientific analyses including enetic mapping of diseases, ancestry identification, and gene environment interaction etection. This work develops maximum likelihood approaches to estimating genotyping rror rates in the context of calculating ancestry probabilities from simple sequence epeat (SSR) marker profiles. The likelihood function is based on the probabilities for bserved alleles at each marker locus given ancestral genetic profiles. Simulations are sed to demonstrate and evaluate the likelihood approach to inferences. Simulation results ndicate that the quality of inferences is impacted by the relationship (parents or randparents) of the closest ancestors. We apply the methods to a data set of maize SR marker profiles provided by Pioneer Hi-Bred International, Inc. where the results ndicate a low error rate.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Berry, A. D., Seltzer, D. J., Xie, C., Wright, L. D., and Smith, C. S. (2002), “Assessing Probability of Ancestry Using Simple Sequence Repeat Profiles: Applications to Maize Hybrids and Inbreds,” Genetics, 161, 813–824.

    Google Scholar 

  • Boehnke, M., and Cox, N. J. (1997), “Accurate Inference of Relationships in Sib-Pair Linkage Studies,” American Journal of Human Genetics, 61, 423–429.

    Article  Google Scholar 

  • Broman, K. W., and Weber, J. L. (1998), “Estimation of Pairwise Relationships in the Presence of Genotyping Errors,” American Journal of Human Genetics, 63, 1563–1564.

    Article  Google Scholar 

  • Buetow, K. H. (1991), “Influence of Aberrant Observations on High-Resolution Linkage Analysis Outcomes,” American Journal of Human Genetics, 49, 985–994.

    Google Scholar 

  • Duchesne, P., Godbout, M. H., and Bernatchez, L. (2002), “PAPA (Package for the Analysis of Parental Allocation): A Computer Program for Simulated and Real Parental Allocation,” Molecular Ecology Notes, 2, 191–193.

    Article  Google Scholar 

  • Ewen, K., Bahlo, M., Treloar, S., Levinson, D., Mowry, B., Barlow, J., and Foote, S. (2000), “Identification and Analysis of Error Types in High-Throughput Genotyping,” American Journal of Human Genetics, 67, 727–736.

    Article  Google Scholar 

  • Gerber, S., Mariette, S., Streiff, R., Bodenes, C., and Kremer, A. (2000), “Comparison of Microsatellites and Amplified Fragment Length Polymorphism Markers for Parentage Analysis,” Molecular Ecololgy, 9, 1037–1048.

    Article  Google Scholar 

  • Goldstein, D. R., Zhao, H., and Speed, T. P. (1997), “The Effects of Genotyping Errors and Interference on Estimation of Genetic Distance,” Human Heredity, 47, 86–100.

    Article  Google Scholar 

  • Heaton, M. P., Harhay, G. P., Bennett, G. L., Stone, R. T., Grosse, W. M., Casas, E., Keele, J. W., Smith, T. P. L., Chitko-McKown, C. G., and Laegreid, W. W. (2002), “Selection and Use of SNP Markers for Animal Identification and Paternity Analysis in U.S. Beef Cattle,” Mammalian Genome, 13, 272–281.

    Article  Google Scholar 

  • Huebner, C., Petermann, I., Browning, L. B., Shelling, A. N., and Ferguson, L. R. (2007), “Triallelic Single Nucleotide Polymorphisms and Genotyping Error in Genetic Epidemiology Studies: MDR1 (ABCB1) G2677/T/A as an Example,” Cancer Epidemiology Biomarkers and Prevention, 16, 1185–1192.

    Article  Google Scholar 

  • Jones, A. G., and Ardren, W. R. (2003), “Methods of Parentage Analysis in Natural Populations,” Molecular Ecology, 12, 2511–2523.

    Article  Google Scholar 

  • Jones, A. G., and Avise, J. C. (1997), “Polygynandry in the Dusky Pipefish Syngnathus Floridae Revealed by Microsatellite DNA Markers,” Evolution, 51, 1611–1622.

    Article  Google Scholar 

  • Jones, A. G., Ostlund-Nilsson, S., and Avise, J. C. (1998), “A Microsatellite Assessment of Sneaked Fertilizations and Egg Thievery in the Fifteenspine Stickleback,” Evolution, 52, 848–858.

    Article  Google Scholar 

  • Jones, E. S., Sullivan, H., Bhattramakki, D., and Smith, J. S. C. (2007), “A Comparison of Simple Sequence Repeat and Single Nucleotide Polymorphism Marker Technologies for the Genotypic Analysis of Maize,” Theoretical and Applied Genetics, 115, 361–371.

    Article  Google Scholar 

  • Lao, O., Duijn, K. V., Kersbergen, P., Knijff, P. D., and Kayser, M. (2006), “Proportioning Whole-Genome Single-Nucleotid-Polymorphism Diversity for the Identification of Geographic Population Structure and Genetic Ancestry,” American Journal of Human Genetics, 78, 680–690.

    Article  Google Scholar 

  • Lincoln, S. E., and Lander, E. S. (1992), “Systematic Detection of Errors in Genetic Linkage Data,” Genomics, 14, 604–610.

    Article  Google Scholar 

  • Marshall, T. C., Slate, J., Kruuk, L. E., and Pemberton, J. M. (1998), “Statistical Confidence for Likelihood-Based Paternity Inference in Natural Populations,” Molecular Ecology, 7, 639–655.

    Article  Google Scholar 

  • Maudet, C., Luikart, G., Dubray, D., von Hardenberg, A., and Taberlet, P. (2004), “Low Genotyping Error Rates in Wild Ungulate Faeces Sampled in Winter,” Molecular Ecology Notes, 4, 772–775.

    Article  Google Scholar 

  • Meagher, T. R., and Thompson, E. (1986), “The Relationship Between Single and Parent Pair Genetic Likelihoods in Genealogy Reconstruction,” Theoretical Population Biology, 29, 87–106.

    Article  MATH  MathSciNet  Google Scholar 

  • Quade, S. R. E., Elston, R. C., and Goddard, K. A. B. (2005), “Estimating Haplotype Frequencies in Pooled DNA Samples When There Is Genotyping Error,” BMC Genetics, 6, 25.

    Article  Google Scholar 

  • Slate, J., Marshall, T., and Pemberton, J. (2000), “A Retrospective Assessment of the Accuracy of the Paternity Inference Program CERVUS,” Molecular Ecology, 9, 801–808.

    Article  Google Scholar 

  • Tung, L., Gordonb, D., and Fincha, S. J. (2007), “The Impact of Genotype Misclassification Errors on the Power to Detect a Gene-Environment Interaction Using Cox Proportional Hazards Modeling,” Human Heredity, 63, 101–110.

    Article  Google Scholar 

  • Zhang, H., and Stern, H. (2006), “Assessment of Ancestry Probabilities in the Presence of Genotyping Errors,” Theoretical and Applied Genetics, 112, 472–482.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongmei Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, H., Stern, H. Inferences for genotyping error rate in ancestry identification from simple sequence repeat marker profiles. JABES 14, 170–187 (2009). https://doi.org/10.1198/jabes.2009.0011

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1198/jabes.2009.0011

Key Words

Navigation