Skip to main content

Advertisement

Log in

Modeling segregation distortion for viability selection I. Reconstruction of linkage maps with distorted markers

  • Original Paper
  • Published:
Theoretical and Applied Genetics Aims and scope Submit manuscript

Abstract

Molecular markers have been widely used to map quantitative trait loci (QTL). The QTL mapping partly relies on accurate linkage maps. The non-Mendelian segregation of markers, which affects not only the estimation of genetic distance between two markers but also the order of markers on a same linkage group, is usually observed in QTL analysis. However, these distorted markers are often ignored in the real data analysis of QTL mapping so that some important information may be lost. In this paper, we developed a multipoint approach via Hidden Markov chain model to reconstruct the linkage maps given a specified gene order while simultaneously making use of distorted, dominant and missing markers in an F2 population. The new method was compared with the methods in the MapManager and Mapmaker programs, respectively, and verified by a series of Monte Carlo simulation experiments along with a working example. Results showed that the adjusted linkage maps can be used for further QTL or segregation distortion locus (SDL) analysis unless there are strong evidences to prove that all markers show normal Mendelian segregation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ansari N, Hou E (1999) The design and analysis of computer algorithm. Addison-Wesley, Reading

  • Carr DE, Dudash MR (2003) Recent approaches into the genetic basis of inbreeding depression in plants. Phil Trans R Soc Lond 358:1071–1084

    Article  CAS  Google Scholar 

  • Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via EM algorithm. J R Stat Soc B 39:1–38

    Google Scholar 

  • Edwards AWF (1972) Likelihood. The John Hopkins University Press, Baltimore

  • Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics. Longman, London

    Google Scholar 

  • Faure S, Noyer JL, Horry JP, Bakry F, Lanaud C, Gonzalez de Leon (1993) A molecular marker-based linkage map of diploid bananas. Theor Appl Genet 87:517–526

    Article  CAS  Google Scholar 

  • Garcia-Dorado A, Gallego A (1992) On the use of the classical tests for detecting linkage. J Heredity 83:143–146

    CAS  Google Scholar 

  • Jansen RC, Stam P (1994) High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136:1447–1455

    PubMed  CAS  Google Scholar 

  • Jiang CJ, Zeng ZB (1997) Mapping quantitative trait loci with dominant and missing markers in various crosses from two inbred lines. Genetica 101:47–56

    Article  PubMed  CAS  Google Scholar 

  • Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220:671–680

    Article  Google Scholar 

  • Lander E, Botstein D (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199

    PubMed  CAS  Google Scholar 

  • Lander ES, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84:2363–2367

    Article  PubMed  CAS  Google Scholar 

  • Lander E, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L (1987) MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and nature populations. Genomics 1:174–181

    Article  PubMed  CAS  Google Scholar 

  • Lorieux MB, Perrier GX, Gonzalez de Leon D, Lanaud C (1995a) Maximum likelihood models for mapping genetic markers showing segregation distortion. 1. Backcross population. Theor Appl Genet 90:73–80

    Google Scholar 

  • Lorieux M, Perrier X, Goffinet B, Lanaud C, Gonzalez de Leon D (1995b) Maximum likelihood models for mapping genetic markers showing segregation distortion. 2. F2 population. Theor Appl Genet 90:81–89

    Google Scholar 

  • Luo L, Zhang YM, Xu S (2005) A quantitative genetics model for viability selection. Heredity 94: 347–355

    Article  PubMed  CAS  Google Scholar 

  • Lyttle TW (1991) Segregation distortion. Ann Rev Genetics 25:511–557

    Article  CAS  Google Scholar 

  • Manly KF, Cudmore RH, Meer JM (2001) Map Manager QTX cross-platform software for genetic mapping. Mammal Genome 12:930–932

    Article  CAS  Google Scholar 

  • Pham JL, Glaszmann JC, Sano R, Barbier P, Ghesquiere A, Second G (1990) Isozyme markers in rice: genetic analysis and linkage relationships. Genome 33:348–359

    CAS  Google Scholar 

  • Press WH, Flanner BP, Teukolsky SA, Vellerting WT (2001) Numerical recipes in C++: the art of scientific computing, 2nd version. Cambridge University Press, New York

  • Whitkus R (1998) Genetics of adaptive radiation in Hawaiian and cook island species of Tetramolopium II. Genetic linkage map and its implications for interspecific breeding barriers. Genetics 150:1209–1216

    PubMed  CAS  Google Scholar 

  • Wu RL, Ma CX, Casella G (2005) Statistical genomics of complex traits. Springer, Berlin Heidelberg New York

Download references

Acknowledgments

We are grateful to Dr Charcosset, Prof Melchinger and two anonymous reviewers for their thoughtful criticisms, comments and suggestions, which have been helpful in improving the presentation of the paper and in removing several ambiguities. The research was supported in part by: (1) the National Natural Science Foundation of China (No. 30470998, No. 30671333), Jiangsu Natural Science Foundation (No. BK2005087), NCET (NCET-05-0489), 973 program (2006CB101708) and the Talent Foundation of Nanjing Agricultural Univiversity to Dr. Zhang; (2) China and Jiangsu Postdoctoral Science Foundation to Dr. Zhu (No. 2005038246); and (3) the Program for Changjiang Scholars and Innovative Research Team in University, the Ministry of Education.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan-Ming Zhang.

Additional information

Communicated by A. Charcosset.

Appendix

Appendix

We use here the F2 design as an example to infer the maximum likelihood estimate of recombinant fraction between two markers.

One-gene model

Provided that only one marker, M 1, displays zygotic viability selection, the viabilities of genotypes M 1 m 1 and m 1 m 1 relative to M 1 M 1 are s 1 and s 2, respectively. Thus, the frequencies for the three genotypes among the survival individuals after selection are 1/D for M 1 M 1, 2s 1/D for M 1 m 1, and s 2/D for m 1 m 1, respectively, where D =  1 +  2s 1s 2. If another marker, M 2, is linked to the marker M 1 with recombinant fraction r. The expected frequencies of nine F2 genotypes are a function of the viability coefficients and the recombination fraction, arrayed by

figure a

The MLE of r is obtained by the EM algorithm for a normal F2. This is because the first derivative of the likelihood function with respect to recombination fraction contains no information about the viability coefficients s 1 and s 2. It indicates that the estimate of r is not affected by the viability coefficients. Thus we can estimate r directly using the familiar formula in the M step of Jansen and Stam (1994)

$$\hat{r}= \frac{1}{{2n}}\left[n_{{12}} + 2n_{{13}} + n_{{21}} + \frac{{2r^{2} }}{{r^{2} + (1 - r)^{2}}}n_{{22}} + n_{{23}} + 2n_{{31}} + n_{{32}} \right]$$
(A2)

where n =  ∑ 3 i=1 3 j=1 n ij , these n ij were the number of the nine genotypes above in matrix A1. Parameters s 1 and s 2 are obtained by

$$\hat{s}_{1} = \frac{{n_{{21}} + n_{{22}} + n_{{23}}}}{{2(n_{{11}} + n_{{12}} + n_{{13}})}}\quad\hat{s}_{2} = \frac{{n_{{31}} + n_{{32}} + n_{{33}}}}{{n_{{11}} + n_{{12}} + n_{{13}}}}$$
(A3)

Based on the Fisher’s information matrix, the sample variance of MLE of the recombination fraction can be indicated by

$$V(\ifmmode\expandafter\hat\else\expandafter\^\fi{ r}) = \frac{{Dr(1 - r)(1 - 2r + 2r^{2})}}{{2n\left[D(1 - 2r)^{2} (1 - 2r + 2r^{2}) + 4(1 + s_{2})r(1 - r)(1 - 2r + 2r^{2}) + 4s_{1} r(1 - r)(1 - 2r)^{2} \right]}}$$
(A4)

It is obvious that the sample variance of the recombination fraction is affected by the viability coefficients.

Two-gene model

Provided that two linked markers with recombinant fraction r, say M 1 and M 2, display zygotic viability selection, so the viabilities of genotypes M 1 m 1 and m 1 m 1 relative to M 1 M 1 are s 1,1 and s 1,2, respectively; for the marker M 2, similarly, they are s 2,1 and s 2,2, respectively. Thus, the expected frequencies of nine F2 genotypes are a function of the viability coefficients and the recombination fraction, arrayed by

figure b

where \({ \circ }\) stands for the component-wise product between the two matrices, the first (F r ) only associated with r and the second with \({s_{{i,j}} (i, j=1,2)\;\hbox{and}\;D = (1 + s_{{1,2}} s_{{2,2}})(1 - r)^{2} + 2r(1 - r)[s_{{1,1}} (1 + s_{{2,2}}) + s_{{2,1}} (1 + s_{{1,2}})] + r^{2} (s_{{1,2}} + s_{{2,2}}) + 2 (1 - 2r + r^{2})s_{{1,1}} s_{{2,1}}.}\) The EM algorithm can be used to obtain the MLE of r based on matrix B1, but this will be difficult to derive because the coefficients within each cell of this matrix contain r. By dividing matrix B1 into two component matrices in B2, however, we can simplify this derivation process. Based on the results of Wu et al. (2005), similarly, the MLE of r can be expressed by

$$\ifmmode\expandafter\hat\else\expandafter\^\fi{r} = \frac{1}{{2n}}\left[n_{{12}} + 2n_{{13}} + n_{{21}} + \frac{{2r^{2} }}{{r^{2} + (1 - r)^{2}}}n_{{22}} + n_{{23}} + 2n_{{31}} + n_{{32}} \right] - \frac{{r(1 - r)}}{{2D}}\frac{{\partial D}}{{\partial r}} $$
(B3)

where

$$\frac{{\partial D}}{{\partial r}}=2\left\{ r(s_{{1,2}} + s_{{2,2}}) - (1 - r)(1 + s_{{1,2}} s_{{2,2}}) + (1 - 2r)\left[s_{{1,1}} (1 - s_{{2,1}} + s_{{2,2}}) + s_{{2,1}} (1 - s_{{1,1}} + s_{{1,2}})\right]\right\} $$

The four viability coefficients \({\ifmmode\expandafter\hat\else\expandafter\^\fi{s}_{{i,j}} } (i, j =1, 2)\) can be estimated simultaneously

$$\begin{aligned} \hat{s}_{{1,1}} &= \frac{{D(n_{{21}} + n_{{22}} + n_{{23}})}}{{2n\left[(1 + s_{{2,2}})r(1 - r) + s_{{2,1}} (1 - 2r + 2r^{2})\right]}}\;\hat{s}_{{1,2}} = \frac{{D(n_{{31}} + n_{{32}} + n_{{33}})}}{{n\left[r^{2} + 2r(1 - r)s_{{2,2}} + s_{{2,1}} (1 - r^{2})\right]}} \\ \hat{s}_{{2,1}} &= \frac{{D(n_{{12}} + n_{{22}} + n_{{32}})}}{{2n\left[(1 + s_{{1,2}})r(1 - r) + s_{{1,1}} (1 - 2r + 2r^{2})\right]}}\; \hat{s}_{{2,2}} = \frac{{D(n_{{13}} + n_{{23}} + n_{{33}})}}{{n\left[r^{2} + 2r(1 - r)s_{{1,2}} + s_{{1,1}} (1 - r^{2})\right]}} \\ \end{aligned} $$
(B4)

As for multiple viability loci, the viability coefficients are affected by its adjacent marker loci (i.e. the left and the right viability loci). Thus the coefficients of viability can be expressed as presented above.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhu, C., Wang, C. & Zhang, YM. Modeling segregation distortion for viability selection I. Reconstruction of linkage maps with distorted markers. Theor Appl Genet 114, 295–305 (2007). https://doi.org/10.1007/s00122-006-0432-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00122-006-0432-x

Keywords

Navigation