Modeling segregation distortion for viability selection I. Reconstruction of linkage maps with distorted markers

Zhu, Chengsong; Wang, Chunming; Zhang, Yuan-Ming

doi:10.1007/s00122-006-0432-x

Original Paper
Published: 22 November 2006

Volume 114, pages 295–305, (2007)
Theoretical and Applied Genetics

Chengsong Zhu¹,
Chunming Wang² &
Yuan-Ming Zhang¹

Abstract

Molecular markers have been widely used to map quantitative trait loci (QTL), and QTL mapping relies in part on accurate linkage maps. Non-Mendelian segregation of markers, which affects not only the estimated genetic distance between two markers but also the order of markers on the same linkage group, is commonly observed in QTL analysis. However, distorted markers are often ignored in real-data QTL mapping, so some important information may be lost. In this paper, we developed a multipoint approach based on a hidden Markov chain model to reconstruct linkage maps for a specified gene order while simultaneously making use of distorted, dominant and missing markers in an F₂ population. The new method was compared with the methods in the MapManager and Mapmaker programs and verified by a series of Monte Carlo simulation experiments along with a working example. The results showed that the adjusted linkage maps can be used for further QTL or segregation distortion locus (SDL) analysis unless there is strong evidence that all markers show normal Mendelian segregation.

References

Ansari N, Hou E (1999) The design and analysis of computer algorithm. Addison-Wesley, Reading
Carr DE, Dudash MR (2003) Recent approaches into the genetic basis of inbreeding depression in plants. Phil Trans R Soc Lond 358:1071–1084
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38
Edwards AWF (1972) Likelihood. The Johns Hopkins University Press, Baltimore
Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics. Longman, London
Faure S, Noyer JL, Horry JP, Bakry F, Lanaud C, Gonzalez de Leon D (1993) A molecular marker-based linkage map of diploid bananas. Theor Appl Genet 87:517–526
Garcia-Dorado A, Gallego A (1992) On the use of the classical tests for detecting linkage. J Heredity 83:143–146
Jansen RC, Stam P (1994) High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136:1447–1455
Jiang CJ, Zeng ZB (1997) Mapping quantitative trait loci with dominant and missing markers in various crosses from two inbred lines. Genetica 101:47–56
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220:671–680
Lander E, Botstein D (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199
Lander ES, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84:2363–2367
Lander E, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L (1987) MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics 1:174–181
Lorieux MB, Perrier GX, Gonzalez de Leon D, Lanaud C (1995a) Maximum likelihood models for mapping genetic markers showing segregation distortion. 1. Backcross population. Theor Appl Genet 90:73–80
Lorieux M, Perrier X, Goffinet B, Lanaud C, Gonzalez de Leon D (1995b) Maximum likelihood models for mapping genetic markers showing segregation distortion. 2. F₂ population. Theor Appl Genet 90:81–89
Luo L, Zhang YM, Xu S (2005) A quantitative genetics model for viability selection. Heredity 94:347–355
Lyttle TW (1991) Segregation distortion. Ann Rev Genetics 25:511–557
Manly KF, Cudmore RH, Meer JM (2001) Map Manager QTX cross-platform software for genetic mapping. Mamm Genome 12:930–932
Pham JL, Glaszmann JC, Sano R, Barbier P, Ghesquiere A, Second G (1990) Isozyme markers in rice: genetic analysis and linkage relationships. Genome 33:348–359
Press WH, Flannery BP, Teukolsky SA, Vetterling WT (2001) Numerical recipes in C++: the art of scientific computing, 2nd edn. Cambridge University Press, New York
Whitkus R (1998) Genetics of adaptive radiation in Hawaiian and Cook Island species of Tetramolopium II. Genetic linkage map and its implications for interspecific breeding barriers. Genetics 150:1209–1216
Wu RL, Ma CX, Casella G (2005) Statistical genomics of complex traits. Springer, Berlin Heidelberg New York

Acknowledgments

We are grateful to Dr Charcosset, Prof Melchinger and two anonymous reviewers for their thoughtful criticisms, comments and suggestions, which have been helpful in improving the presentation of the paper and in removing several ambiguities. The research was supported in part by: (1) the National Natural Science Foundation of China (No. 30470998, No. 30671333), Jiangsu Natural Science Foundation (No. BK2005087), NCET (NCET-05-0489), 973 program (2006CB101708) and the Talent Foundation of Nanjing Agricultural University to Dr. Zhang; (2) China and Jiangsu Postdoctoral Science Foundation to Dr. Zhu (No. 2005038246); and (3) the Program for Changjiang Scholars and Innovative Research Team in University, the Ministry of Education.

Author information

Authors and Affiliations

Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement/National Center for Soybean Improvement, College of Agriculture, Nanjing Agricultural University, 1 Weigang Road, Nanjing, 210095, People’s Republic of China
Chengsong Zhu & Yuan-Ming Zhang
Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore, 117604, Singapore
Chunming Wang

Corresponding author

Correspondence to Yuan-Ming Zhang.

Additional information

Communicated by A. Charcosset.

Appendix

We use the F₂ design as an example to derive the maximum likelihood estimate (MLE) of the recombination fraction between two markers.

One-gene model

Suppose that only one marker, M₁, is subject to zygotic viability selection, and that the viabilities of genotypes M₁m₁ and m₁m₁ relative to M₁M₁ are s₁ and s₂, respectively. The frequencies of the three genotypes among the surviving individuals are then 1/D for M₁M₁, 2s₁/D for M₁m₁ and s₂/D for m₁m₁, where D = 1 + 2s₁ + s₂. If another marker, M₂, is linked to M₁ with recombination fraction r, the expected frequencies of the nine F₂ genotypes are functions of the viability coefficients and the recombination fraction, arrayed in matrix (A1).
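
The array itself is not reproduced in this text. As a sketch, assuming that rows index the M₁ genotype (M₁M₁, M₁m₁, m₁m₁) and columns the M₂ genotype in the same order, which is the indexing implied by the counts n_ij in (A2) and (A3), scaling each row of the normal F₂ frequencies by its relative viability gives

$$\frac{1}{D}\begin{pmatrix} (1-r)^{2} & 2r(1-r) & r^{2} \\ 2s_{1}r(1-r) & 2s_{1}(1-2r+2r^{2}) & 2s_{1}r(1-r) \\ s_{2}r^{2} & 2s_{2}r(1-r) & s_{2}(1-r)^{2} \end{pmatrix},\qquad D = 1 + 2s_{1} + s_{2}$$

The row sums are 1/D, 2s₁/D and s₂/D, which matches the marginal genotype frequencies stated for M₁.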

The MLE of r can be obtained with the same EM algorithm as for a normal F₂ population, because the first derivative of the likelihood function with respect to the recombination fraction contains no information about the viability coefficients s₁ and s₂. The estimate of r is therefore not affected by the viability coefficients, and r can be estimated directly using the familiar M-step formula of Jansen and Stam (1994):

$$\hat{r}= \frac{1}{{2n}}\left[n_{{12}} + 2n_{{13}} + n_{{21}} + \frac{{2r^{2} }}{{r^{2} + (1 - r)^{2}}}n_{{22}} + n_{{23}} + 2n_{{31}} + n_{{32}} \right]$$

(A2)

where $n = \sum_{i=1}^{3}\sum_{j=1}^{3} n_{ij}$ and the $n_{ij}$ are the observed numbers of the nine genotypes in matrix A1. The parameters s₁ and s₂ are estimated by

$$\hat{s}_{1} = \frac{{n_{{21}} + n_{{22}} + n_{{23}}}}{{2(n_{{11}} + n_{{12}} + n_{{13}})}}\quad\hat{s}_{2} = \frac{{n_{{31}} + n_{{32}} + n_{{33}}}}{{n_{{11}} + n_{{12}} + n_{{13}}}}$$

(A3)
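
Equations (A2) and (A3) can be tried out numerically. The following is a minimal sketch, not the authors' program: the function names, the starting value, the tolerance and the row/column layout of the count table (rows for the M₁ genotype, columns for M₂) are assumptions made for illustration.

```python
def one_gene_freqs(r, s1, s2):
    """Expected frequencies of the nine F2 genotypes (rows: M1, columns: M2)."""
    u = r * (1 - r)
    normal_f2 = [
        [(1 - r) ** 2 / 4, u / 2, r ** 2 / 4],
        [u / 2, (1 - 2 * u) / 2, u / 2],
        [r ** 2 / 4, u / 2, (1 - r) ** 2 / 4],
    ]
    viability = [1.0, s1, s2]        # selection acts on the M1 genotype only
    scale = (1 + 2 * s1 + s2) / 4    # D / 4 makes the nine cells sum to one
    return [[v * f / scale for f in row] for v, row in zip(viability, normal_f2)]


def estimate_one_gene(counts, r_start=0.25, tol=1e-12, max_iter=10000):
    """Iterate (A2) until r stabilises, then apply (A3) to the row totals."""
    n = sum(sum(row) for row in counts)
    r = r_start
    for _ in range(max_iter):
        # E-step: expected recombinant gametes per double heterozygote
        w22 = 2 * r ** 2 / (r ** 2 + (1 - r) ** 2)
        recombinants = (counts[0][1] + 2 * counts[0][2] + counts[1][0]
                        + w22 * counts[1][1] + counts[1][2]
                        + 2 * counts[2][0] + counts[2][1])
        r_new = recombinants / (2 * n)   # M-step, equation (A2)
        converged = abs(r_new - r) < tol
        r = r_new
        if converged:
            break
    rows = [sum(row) for row in counts]
    s1 = rows[1] / (2 * rows[0])         # equation (A3)
    s2 = rows[2] / rows[0]
    return r, s1, s2


# Expected counts generated from known parameters should be recovered.
counts = [[1000 * p for p in row] for row in one_gene_freqs(0.2, 0.6, 0.3)]
r_hat, s1_hat, s2_hat = estimate_one_gene(counts)
```

Feeding the estimator expected rather than sampled counts is only a consistency check of the formulas; with real data the estimates carry the sampling error described by (A4).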

Based on Fisher's information matrix, the sampling variance of the MLE of the recombination fraction is

$$V(\hat{r}) = \frac{Dr(1 - r)(1 - 2r + 2r^{2})}{2n\left[D(1 - 2r)^{2}(1 - 2r + 2r^{2}) + 4(1 + s_{2})r(1 - r)(1 - 2r + 2r^{2}) + 4s_{1}r(1 - r)(1 - 2r)^{2}\right]}$$

(A4)

Unlike the estimate itself, the sampling variance of the estimated recombination fraction therefore depends on the viability coefficients.
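
As a quick check of (A4), the sketch below evaluates the variance and compares it with the usual codominant F₂ expression when there is no selection (s₁ = s₂ = 1). The function names are hypothetical, and the no-selection formula is the standard textbook form, quoted here as an assumption rather than taken from this paper.

```python
def var_r_one_gene(r, s1, s2, n):
    """Sampling variance of the estimated r under the one-gene model, equation (A4)."""
    d = 1 + 2 * s1 + s2
    u = r * (1 - r)
    a = 1 - 2 * r + 2 * r ** 2
    b = (1 - 2 * r) ** 2
    denominator = 2 * n * (d * b * a + 4 * (1 + s2) * u * a + 4 * s1 * u * b)
    return d * u * a / denominator


def var_r_normal_f2(r, n):
    """Standard variance for two codominant markers in an undistorted F2."""
    return r * (1 - r) * (1 - 2 * r + 2 * r ** 2) / (2 * n * (1 - 3 * r + 3 * r ** 2))


no_selection = var_r_one_gene(0.2, 1.0, 1.0, 200)   # reduces to the normal F2 case
with_selection = var_r_one_gene(0.2, 0.5, 0.2, 200)  # differs once s1, s2 depart from 1
```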

Two-gene model

Suppose that two linked markers, M₁ and M₂, with recombination fraction r both display zygotic viability selection. The viabilities of genotypes M₁m₁ and m₁m₁ relative to M₁M₁ are $s_{1,1}$ and $s_{1,2}$, respectively; similarly, those of M₂m₂ and m₂m₂ relative to M₂M₂ are $s_{2,1}$ and $s_{2,2}$. The expected frequencies of the nine F₂ genotypes are then functions of the viability coefficients and the recombination fraction, arrayed in matrices (B1) and (B2).
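
The arrays (B1) and (B2) are not reproduced in this text. A sketch of the factored form is given here under three assumptions: the two loci act multiplicatively on viability (which is what the expression for D in this section implies), rows index the M₁ genotype and columns the M₂ genotype, and $F_r$ is written without the common factor 1/4 of the normal F₂ frequencies:

$$\frac{1}{D}\,F_{r}\circ S,\qquad F_{r}=\begin{pmatrix} (1-r)^{2} & 2r(1-r) & r^{2} \\ 2r(1-r) & 2(1-2r+2r^{2}) & 2r(1-r) \\ r^{2} & 2r(1-r) & (1-r)^{2} \end{pmatrix},\qquad S=\begin{pmatrix} 1 & s_{2,1} & s_{2,2} \\ s_{1,1} & s_{1,1}s_{2,1} & s_{1,1}s_{2,2} \\ s_{1,2} & s_{1,2}s_{2,1} & s_{1,2}s_{2,2} \end{pmatrix}$$

With this scaling, D is the sum of the nine entries of $F_r \circ S$.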

Here $\circ$ stands for the component-wise product of the two matrices in B2, the first ($F_r$) associated only with $r$ and the second only with the $s_{i,j}$ $(i, j = 1, 2)$, and

$$D = (1 + s_{1,2}s_{2,2})(1 - r)^{2} + 2r(1 - r)\left[s_{1,1}(1 + s_{2,2}) + s_{2,1}(1 + s_{1,2})\right] + r^{2}(s_{1,2} + s_{2,2}) + 2(1 - 2r + 2r^{2})s_{1,1}s_{2,1}$$

The EM algorithm can be used to obtain the MLE of r from matrix B1, but the derivation is difficult because the coefficient in every cell of this matrix contains r. Splitting matrix B1 into the two component matrices in B2 simplifies the derivation. Following the results of Wu et al. (2005), the MLE of r can be expressed as

$$\hat{r} = \frac{1}{2n}\left[n_{12} + 2n_{13} + n_{21} + \frac{2r^{2}}{r^{2} + (1 - r)^{2}}n_{22} + n_{23} + 2n_{31} + n_{32}\right] - \frac{r(1 - r)}{2D}\frac{\partial D}{\partial r}$$

(B3)

where

$$\frac{{\partial D}}{{\partial r}}=2\left\{ r(s_{{1,2}} + s_{{2,2}}) - (1 - r)(1 + s_{{1,2}} s_{{2,2}}) + (1 - 2r)\left[s_{{1,1}} (1 - s_{{2,1}} + s_{{2,2}}) + s_{{2,1}} (1 - s_{{1,1}} + s_{{1,2}})\right]\right\} $$
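
The expressions for D and its derivative can be checked against each other numerically. The sketch below is illustrative only: it assumes the multiplicative two-locus viability model with rows indexing the M₁ genotype, it takes the coefficient of $s_{1,1}s_{2,1}$ in D as $2(1 - 2r + 2r^{2})$ (the form that agrees with the derivative above), and all function names are hypothetical.

```python
def two_gene_cells(r, s11, s12, s21, s22):
    """Unnormalised cells F_r o S (rows: M1 genotype, columns: M2 genotype)."""
    u = r * (1 - r)
    f_r = [
        [(1 - r) ** 2, 2 * u, r ** 2],
        [2 * u, 2 * (1 - 2 * u), 2 * u],
        [r ** 2, 2 * u, (1 - r) ** 2],
    ]
    v1 = [1.0, s11, s12]   # viabilities of M1M1, M1m1, m1m1
    v2 = [1.0, s21, s22]   # viabilities of M2M2, M2m2, m2m2
    return [[f_r[i][j] * v1[i] * v2[j] for j in range(3)] for i in range(3)]


def d_closed_form(r, s11, s12, s21, s22):
    """Normalising constant D written out term by term."""
    return ((1 + s12 * s22) * (1 - r) ** 2
            + 2 * r * (1 - r) * (s11 * (1 + s22) + s21 * (1 + s12))
            + r ** 2 * (s12 + s22)
            + 2 * (1 - 2 * r + 2 * r ** 2) * s11 * s21)


def d_derivative(r, s11, s12, s21, s22):
    """Partial derivative of D with respect to r."""
    return 2 * (r * (s12 + s22) - (1 - r) * (1 + s12 * s22)
                + (1 - 2 * r) * (s11 * (1 - s21 + s22) + s21 * (1 - s11 + s12)))


params = (0.7, 0.4, 0.9, 0.5)   # arbitrary example values for s11, s12, s21, s22
cell_total = sum(sum(row) for row in two_gene_cells(0.15, *params))
h = 1e-6
numeric_slope = (d_closed_form(0.15 + h, *params) - d_closed_form(0.15 - h, *params)) / (2 * h)
```

In this sketch `cell_total` should agree with `d_closed_form(0.15, *params)`, and `numeric_slope` with `d_derivative(0.15, *params)`, up to rounding error.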

The four viability coefficients $\hat{s}_{i,j}$ $(i, j = 1, 2)$ can be estimated simultaneously by

$$\begin{aligned} \hat{s}_{1,1} &= \frac{D(n_{21} + n_{22} + n_{23})}{2n\left[(1 + s_{2,2})r(1 - r) + s_{2,1}(1 - 2r + 2r^{2})\right]}, & \hat{s}_{1,2} &= \frac{D(n_{31} + n_{32} + n_{33})}{n\left[r^{2} + 2r(1 - r)s_{2,1} + s_{2,2}(1 - r)^{2}\right]} \\ \hat{s}_{2,1} &= \frac{D(n_{12} + n_{22} + n_{32})}{2n\left[(1 + s_{1,2})r(1 - r) + s_{1,1}(1 - 2r + 2r^{2})\right]}, & \hat{s}_{2,2} &= \frac{D(n_{13} + n_{23} + n_{33})}{n\left[r^{2} + 2r(1 - r)s_{1,1} + s_{1,2}(1 - r)^{2}\right]} \end{aligned}$$

(B4)
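
One pass of the updates (B3) and (B4) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the multiplicative two-locus model is assumed, the denominators for the homozygote coefficients are written as the row and column sums of the expected-frequency matrix under that model, and the function names are hypothetical. No claim is made here about how quickly repeated passes converge.

```python
def two_gene_freqs(r, s11, s12, s21, s22):
    """Expected frequencies of the nine genotypes and the normaliser D."""
    u = r * (1 - r)
    f_r = [[(1 - r) ** 2, 2 * u, r ** 2],
           [2 * u, 2 * (1 - 2 * u), 2 * u],
           [r ** 2, 2 * u, (1 - r) ** 2]]
    v1, v2 = [1.0, s11, s12], [1.0, s21, s22]
    cells = [[f_r[i][j] * v1[i] * v2[j] for j in range(3)] for i in range(3)]
    d = sum(sum(row) for row in cells)
    return [[c / d for c in row] for row in cells], d


def update_step(counts, r, s11, s12, s21, s22):
    """Apply (B3) and (B4) once, using the current values on the right-hand sides."""
    n = sum(sum(row) for row in counts)
    u, a = r * (1 - r), 1 - 2 * r + 2 * r ** 2
    _, d = two_gene_freqs(r, s11, s12, s21, s22)
    dd = 2 * (r * (s12 + s22) - (1 - r) * (1 + s12 * s22)
              + (1 - 2 * r) * (s11 * (1 - s21 + s22) + s21 * (1 - s11 + s12)))
    w22 = 2 * r ** 2 / (r ** 2 + (1 - r) ** 2)
    recombinants = (counts[0][1] + 2 * counts[0][2] + counts[1][0]
                    + w22 * counts[1][1] + counts[1][2]
                    + 2 * counts[2][0] + counts[2][1])
    r_new = recombinants / (2 * n) - u * dd / (2 * d)             # (B3)
    rows = [sum(row) for row in counts]
    cols = [sum(counts[i][j] for i in range(3)) for j in range(3)]
    s11_new = d * rows[1] / (2 * n * ((1 + s22) * u + s21 * a))   # (B4)
    s12_new = d * rows[2] / (n * (r ** 2 + 2 * u * s21 + s22 * (1 - r) ** 2))
    s21_new = d * cols[1] / (2 * n * ((1 + s12) * u + s11 * a))
    s22_new = d * cols[2] / (n * (r ** 2 + 2 * u * s11 + s12 * (1 - r) ** 2))
    return r_new, s11_new, s12_new, s21_new, s22_new


# With expected counts, the generating parameters are a fixed point of the update.
truth = (0.15, 0.7, 0.4, 0.9, 0.5)
freqs, _ = two_gene_freqs(*truth)
counts = [[500 * p for p in row] for row in freqs]
after_one_pass = update_step(counts, *truth)
```

The fixed-point property shown at the end is a consistency check of the update equations against the model; in practice the pass would be repeated from starting values until the parameters stop changing.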

With multiple viability loci, the viability coefficients at a locus are affected by its adjacent viability loci (i.e. those to its left and right). The viability coefficients can then be expressed in the same way as presented above.

About this article

Cite this article

Zhu, C., Wang, C. & Zhang, YM. Modeling segregation distortion for viability selection I. Reconstruction of linkage maps with distorted markers. Theor Appl Genet 114, 295–305 (2007). https://doi.org/10.1007/s00122-006-0432-x

Received: 28 December 2005
Accepted: 14 October 2006
Published: 22 November 2006
Issue Date: January 2007
DOI: https://doi.org/10.1007/s00122-006-0432-x
