Statistical Human Genetics pp 539-558 | Cite as
Comparison of Requirements and Capabilities of Major Multipurpose Software Packages
- 1 Citations
- 1 Mentions
- 4k Downloads
Abstract
The aim of this chapter is to introduce the reader to commonly used software packages and illustrate their input requirements, analysis options, strengths, and limitations. We focus on packages that perform more than one function and include a program for quality control, linkage, and association analyses. Additional inclusion criteria were (1) programs that are free to academic users and (2) currently supported, maintained, and developed. Using those criteria, we chose to review three programs: Statistical Analysis for Genetic Epidemiology (S.A.G.E.), PLINK, and Merlin. We will describe the required input format and analysis options. We will not go into detail about every possible program in the packages, but we will give an overview of the packages requirements and capabilities.
Key words
Software S.A.G.E Merlin PLINK Statistical analysis Linkage analysis Association analysis Quality control GUI Data managementReferences
- 1.Epstein MP, Duren WL, Boehnke M (2000) Improved inference of relationships for pairs of individuals. Am J Hum Genet 67: 1219–1231PubMedGoogle Scholar
- 2.Keen KJ, Elston RC (2003) Robust asymptotic sampling theory for correlations in pedigrees. Stat Med 22: 3229–3247PubMedCrossRefGoogle Scholar
- 3.Bonney GE (1984) On the statistical determination of major gene mechanisms in continuous human traits: regressive models. Am J Med Genet 18: 731–749PubMedCrossRefGoogle Scholar
- 4.George VT, Elston RC (1988) Generalized modulus power transformations. Commun Statist Theory Meth 17: 2933–2952CrossRefGoogle Scholar
- 5.Fernando RL, Stricker C, Elston RC (1994) The finite polygenic mixed model: an alternative formulation for the mixed model of inheritance. Theor Appl Genet 88: 573–580CrossRefGoogle Scholar
- 6.Lange K (1997) An approximate model of polygenic inheritance. Genetics 147: 1423–1430PubMedGoogle Scholar
- 7.Karunaratne PM, Elston RC (1998) A multivariate logistic model (MLM) for analyzing binary family data. Am J Med Genet 76: 428–437PubMedCrossRefGoogle Scholar
- 8.Elston RC, Stewart J (1971) A general model for the analysis of pedigree data. Hum Hered 21: 523–542PubMedCrossRefGoogle Scholar
- 9.Lander E, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84: 2363–2367PubMedCrossRefGoogle Scholar
- 10.Kruglyak L, Lander ES (1995) Complete multipoint sib-pair analysis of qualitative and quantitative traits. Am J Hum Genet 57: 439–454PubMedGoogle Scholar
- 11.Sobel E, Lange K (1996) Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. Am J Hum Genet 58: 1323–1337PubMedGoogle Scholar
- 12.Haseman J, Elston R (1972) The investigation of linkage between a quantitative trait and a marker locus. Behav Genet 2: 3–19PubMedCrossRefGoogle Scholar
- 13.Elston R, Buxbaum S, Jacobs K, Olson J (2000) Haseman and Elston revisited. Genet Epidemiol 19: 1–17PubMedCrossRefGoogle Scholar
- 14.Shete S, Jacobs K, Elston R (2003) Adding further power to the Haseman and Elston method for detecting linkage in larger sibships: weighting sums and differences. Hum Hered 55: 79–85PubMedCrossRefGoogle Scholar
- 15.Goddard KAB, Witte JS, Suarez BK, Catalona WJ, Olson JM (2001) Model-free linkage analysis with covariates confirms linkage of prostate cancer to chromosomes 1 and 4. Am J Hum Genet 68: 1197–1206PubMedCrossRefGoogle Scholar
- 16.Olson JM (1999) A general conditional-logistic model for affected-relative-pair linkage studies. Am J Hum Genet 65: 1760–1769PubMedCrossRefGoogle Scholar
- 17.Wang T, Elston RC (2005) Two-level Haseman-Elston regression for general pedigree data analysis. Genet Epidemiol 29: 12–22PubMedCrossRefGoogle Scholar
- 18.Wang T, Elston RC (2007) Regression-based multivariate linkage analysis with an application to blood pressure and body mass index. Ann Hum Genet 71: 96–106PubMedCrossRefGoogle Scholar
- 19.Laird NM, Horvath S, Xu X (2000) Implementing a unified approach to family-based tests of association. Genet Epidemiol 19, S36–S42PubMedCrossRefGoogle Scholar
- 20.Kopplin LJ, Igo RP Jr, Wang Y et al (2010) Genome-wide association identifies SKIV2L and MYRIP as protective factors for age-related macular degeneration. Genes Immun 11: 609–621PubMedCrossRefGoogle Scholar
- 21.Abecasis GR, Cherny SS, Cookson WO, Cardon LR (2002) Merlin—rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30: 97–101PubMedCrossRefGoogle Scholar
- 22.Markianos K, Daly MJ, Kruglyak L (2001) Efficient multipoint linkage analysis through reduction of inheritance space. Am J Hum Genet 68: 963–977PubMedCrossRefGoogle Scholar
- 23.Lathrop GM, Lalouel JM, Julier C, Ott J (1984) Strategies for multilocus linkage analysis in humans. Proc Natl Acad Sci USA 81: 3443–3446PubMedCrossRefGoogle Scholar
- 24.Sobel E, Sengul H, Weeks DE (2001) Multipoint estimation of identity-by-descent probabilities at arbitrary positions among marker loci on general pedigrees. Hum Hered 52: 121–131PubMedCrossRefGoogle Scholar
- 25.Whittemore AS, Halpern J (1994) A class of tests for linkage using affected pedigree members. Biometrics 50: 118–127PubMedCrossRefGoogle Scholar
- 26.Sham PC, Purcell S, Cherny SS, Abecasis G (2002) Powerful regression-based quantitative-trait linkage analysis of general pedigrees. Am J Hum Genet 71: 238–253PubMedCrossRefGoogle Scholar
- 27.Kong A, Cox NJ (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61: 1179–1188PubMedCrossRefGoogle Scholar
- 28.Sham PC, Purcell S (2001) Equivalence between Haseman-Elston and variance-components linkage analyses for sib pairs. Am J Hum Genet 68: 1527–1532PubMedCrossRefGoogle Scholar
- 29.Chen W-M, Abecasis GR (2007) Family-based association tests for genomewide association scans. Am J Hum Genet 81: 913–926PubMedCrossRefGoogle Scholar
- 30.Zhang K, Zhao H (2006) A comparison of several methods for haplotype frequency estimation and haplotype reconstruction for tightly linked markers from general pedigrees. Genet Epidemiol 30: 423–437PubMedCrossRefGoogle Scholar
- 31.Li Y, Willer C, Sanna S, Abecasis GR (2009) Genotype imputation. Annu Rev Genomics Hum Genet 10: 387–406PubMedCrossRefGoogle Scholar
- 32.Stephens M, Donnelly P (2003) A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet 73: 1162–1169PubMedCrossRefGoogle Scholar
- 33.Purcell S, Neale B, Todd-Brow K et al (2007) PLINK: a toolset for whole-genome association and population-based linkage analysis. Am J Hum Genet 81: 559–575PubMedCrossRefGoogle Scholar
- 34.Milligan BG (2003) Maximum-likelihood estimation of relatedness. Genetics 163: 1153–1167PubMedGoogle Scholar
- 35.Spielman RS, Ewens WJ (1998) A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test. Am J Hum Genet 62: 450–458PubMedCrossRefGoogle Scholar
- 36.Abecasis GR, Cookson WO, Cardon LR (2000) Pedigree tests of transmission disequilibrium. Eur J Hum Genet 8: 545–551PubMedCrossRefGoogle Scholar
- 37.Nothnagel M, Ellinghaus D, Schreiber S, Krawczak M, Franke A (2009) A comprehensive evaluation of SNP genotype imputation. Hum Genet 125: 163–171PubMedCrossRefGoogle Scholar
- 38.Almasy L, Blangero J (1998) Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet 62: 1198–1211PubMedCrossRefGoogle Scholar
- 39.Amos CI (1994) Robust variance-components approach for assessing genetic linkage in pedigrees. Am J Hum Genet 54: 535–543PubMedGoogle Scholar
- 40.Kruglyak L, Daly MJ, Reeve-Daly MP, Lander ES (1996) Parametric and nonparametric linkage analysis, a unified multipoint approach. Am J Hum Genet 58: 1347–1363PubMedGoogle Scholar
- 41.Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21: 263–265PubMedCrossRefGoogle Scholar
- 42.Mukhopadhyay N, Almasy L, Schroeder M, Mulvihilll WP, Weeks DE (1999) Mega2, a data-handling program for facilitating genetic linkage and association analyses. Am J Hum Genet 65: A436Google Scholar