Abstract
Molecular markers have been widely used to map quantitative trait loci (QTL). The QTL mapping partly relies on accurate linkage maps. The non-Mendelian segregation of markers, which affects not only the estimation of genetic distance between two markers but also the order of markers on a same linkage group, is usually observed in QTL analysis. However, these distorted markers are often ignored in the real data analysis of QTL mapping so that some important information may be lost. In this paper, we developed a multipoint approach via Hidden Markov chain model to reconstruct the linkage maps given a specified gene order while simultaneously making use of distorted, dominant and missing markers in an F2 population. The new method was compared with the methods in the MapManager and Mapmaker programs, respectively, and verified by a series of Monte Carlo simulation experiments along with a working example. Results showed that the adjusted linkage maps can be used for further QTL or segregation distortion locus (SDL) analysis unless there are strong evidences to prove that all markers show normal Mendelian segregation.
Similar content being viewed by others
References
Ansari N, Hou E (1999) The design and analysis of computer algorithm. Addison-Wesley, Reading
Carr DE, Dudash MR (2003) Recent approaches into the genetic basis of inbreeding depression in plants. Phil Trans R Soc Lond 358:1071–1084
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via EM algorithm. J R Stat Soc B 39:1–38
Edwards AWF (1972) Likelihood. The John Hopkins University Press, Baltimore
Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics. Longman, London
Faure S, Noyer JL, Horry JP, Bakry F, Lanaud C, Gonzalez de Leon (1993) A molecular marker-based linkage map of diploid bananas. Theor Appl Genet 87:517–526
Garcia-Dorado A, Gallego A (1992) On the use of the classical tests for detecting linkage. J Heredity 83:143–146
Jansen RC, Stam P (1994) High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136:1447–1455
Jiang CJ, Zeng ZB (1997) Mapping quantitative trait loci with dominant and missing markers in various crosses from two inbred lines. Genetica 101:47–56
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220:671–680
Lander E, Botstein D (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199
Lander ES, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84:2363–2367
Lander E, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L (1987) MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and nature populations. Genomics 1:174–181
Lorieux MB, Perrier GX, Gonzalez de Leon D, Lanaud C (1995a) Maximum likelihood models for mapping genetic markers showing segregation distortion. 1. Backcross population. Theor Appl Genet 90:73–80
Lorieux M, Perrier X, Goffinet B, Lanaud C, Gonzalez de Leon D (1995b) Maximum likelihood models for mapping genetic markers showing segregation distortion. 2. F2 population. Theor Appl Genet 90:81–89
Luo L, Zhang YM, Xu S (2005) A quantitative genetics model for viability selection. Heredity 94: 347–355
Lyttle TW (1991) Segregation distortion. Ann Rev Genetics 25:511–557
Manly KF, Cudmore RH, Meer JM (2001) Map Manager QTX cross-platform software for genetic mapping. Mammal Genome 12:930–932
Pham JL, Glaszmann JC, Sano R, Barbier P, Ghesquiere A, Second G (1990) Isozyme markers in rice: genetic analysis and linkage relationships. Genome 33:348–359
Press WH, Flanner BP, Teukolsky SA, Vellerting WT (2001) Numerical recipes in C++: the art of scientific computing, 2nd version. Cambridge University Press, New York
Whitkus R (1998) Genetics of adaptive radiation in Hawaiian and cook island species of Tetramolopium II. Genetic linkage map and its implications for interspecific breeding barriers. Genetics 150:1209–1216
Wu RL, Ma CX, Casella G (2005) Statistical genomics of complex traits. Springer, Berlin Heidelberg New York
Acknowledgments
We are grateful to Dr Charcosset, Prof Melchinger and two anonymous reviewers for their thoughtful criticisms, comments and suggestions, which have been helpful in improving the presentation of the paper and in removing several ambiguities. The research was supported in part by: (1) the National Natural Science Foundation of China (No. 30470998, No. 30671333), Jiangsu Natural Science Foundation (No. BK2005087), NCET (NCET-05-0489), 973 program (2006CB101708) and the Talent Foundation of Nanjing Agricultural Univiversity to Dr. Zhang; (2) China and Jiangsu Postdoctoral Science Foundation to Dr. Zhu (No. 2005038246); and (3) the Program for Changjiang Scholars and Innovative Research Team in University, the Ministry of Education.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by A. Charcosset.
Appendix
Appendix
We use here the F2 design as an example to infer the maximum likelihood estimate of recombinant fraction between two markers.
One-gene model
Provided that only one marker, M 1, displays zygotic viability selection, the viabilities of genotypes M 1 m 1 and m 1 m 1 relative to M 1 M 1 are s 1 and s 2, respectively. Thus, the frequencies for the three genotypes among the survival individuals after selection are 1/D for M 1 M 1, 2s 1/D for M 1 m 1, and s 2/D for m 1 m 1, respectively, where D = 1 + 2s 1 + s 2. If another marker, M 2, is linked to the marker M 1 with recombinant fraction r. The expected frequencies of nine F2 genotypes are a function of the viability coefficients and the recombination fraction, arrayed by
The MLE of r is obtained by the EM algorithm for a normal F2. This is because the first derivative of the likelihood function with respect to recombination fraction contains no information about the viability coefficients s 1 and s 2. It indicates that the estimate of r is not affected by the viability coefficients. Thus we can estimate r directly using the familiar formula in the M step of Jansen and Stam (1994)
where n = ∑ 3 i=1 ∑ 3 j=1 n ij , these n ij were the number of the nine genotypes above in matrix A1. Parameters s 1 and s 2 are obtained by
Based on the Fisher’s information matrix, the sample variance of MLE of the recombination fraction can be indicated by
It is obvious that the sample variance of the recombination fraction is affected by the viability coefficients.
Two-gene model
Provided that two linked markers with recombinant fraction r, say M 1 and M 2, display zygotic viability selection, so the viabilities of genotypes M 1 m 1 and m 1 m 1 relative to M 1 M 1 are s 1,1 and s 1,2, respectively; for the marker M 2, similarly, they are s 2,1 and s 2,2, respectively. Thus, the expected frequencies of nine F2 genotypes are a function of the viability coefficients and the recombination fraction, arrayed by
where \({ \circ }\) stands for the component-wise product between the two matrices, the first (F r ) only associated with r and the second with \({s_{{i,j}} (i, j=1,2)\;\hbox{and}\;D = (1 + s_{{1,2}} s_{{2,2}})(1 - r)^{2} + 2r(1 - r)[s_{{1,1}} (1 + s_{{2,2}}) + s_{{2,1}} (1 + s_{{1,2}})] + r^{2} (s_{{1,2}} + s_{{2,2}}) + 2 (1 - 2r + r^{2})s_{{1,1}} s_{{2,1}}.}\) The EM algorithm can be used to obtain the MLE of r based on matrix B1, but this will be difficult to derive because the coefficients within each cell of this matrix contain r. By dividing matrix B1 into two component matrices in B2, however, we can simplify this derivation process. Based on the results of Wu et al. (2005), similarly, the MLE of r can be expressed by
where
The four viability coefficients \({\ifmmode\expandafter\hat\else\expandafter\^\fi{s}_{{i,j}} } (i, j =1, 2)\) can be estimated simultaneously
As for multiple viability loci, the viability coefficients are affected by its adjacent marker loci (i.e. the left and the right viability loci). Thus the coefficients of viability can be expressed as presented above.
Rights and permissions
About this article
Cite this article
Zhu, C., Wang, C. & Zhang, YM. Modeling segregation distortion for viability selection I. Reconstruction of linkage maps with distorted markers. Theor Appl Genet 114, 295–305 (2007). https://doi.org/10.1007/s00122-006-0432-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00122-006-0432-x