cnF2freq: Efficient Determination of Genotype and Haplotype Probabilities in Outbred Populations Using Markov Models
We have applied and implemented HMM (Hidden Markov Model) algorithms to calculate QTL genotype probabilities from marker and pedigree data in general population structures. These algorithms have a linear complexity in memory. In nearly all experimental pedigrees they result in more precise genotype estimates than the most commonly used approaches for determining genotypes at non-marker positions in QTL analysis in outbred F2 line intercrosses , which include an exponential complexity factor as well as a data-reducing sampling step . With a proper choice of parameters, the results from the existing methods can also be reproduced exactly. We show how the relative run times differ by a factor of 50 when 24 SNP markers are used, with our run time practically independent of marker count. The new method can also provide multi-generational probability estimates and perform haplotype inference from unphased data, which further improves accuracy and flexibility. An important future application of this method is for computationally efficient QTL genotype estimation in maps based on data from SNP chips containing 1000s of markers with mixed information content, for which there are no other suitable methods available at present.
Unable to display preview. Download preview PDF.
- 13.Broman, K.W.: Use of Hidden Markov Models for QTL mapping. Working Paper 125, John Hopkins University, Dept. of Biostatistics (2006)Google Scholar
- 18.Crooks, L., Sahana, G., de Koning, D.J., Sando Lund, M., Carlborg, O.: Comparison of analyses of the QTLMAS XII common data set II: genome-wide association and fine mapping (submitted) (2008)Google Scholar