Abstract
One of the goals of the Human Genome Project is to produce libraries of largely contiguous, ordered sets of molecular clones for use in sequencing and gene mapping projects. This is planned to be done for human and many model organisms. Theory and practice have shown that long-range contiguity and the degree to which the entire genome is covered by ordered clones can be affected by many biological variables. Many laboratories are currently experimenting with different experimental strategies and theoretical models to help plan strategies for accomplishing longrange molecular mapping of genomes. Here we describe a new mathematical model and formulas for helping to plan genome mapping projects, using various single-copy landmark (SCL) detection, or “anchoring”, strategies. We derive formulas that allow us to examine the effects of interactions among the following variables: average insert size of the cloning vector, average size of SCL, the number of SCL, and the redundancy in coverage of the clone library. We also examine and compare three different ways in which anchoring can be implemented: (1) anchors are selected independently of the library to be ordered (random anchoring); (2) anchors are made from end probes from both ends of clones in the library to be ordered (nonrandom anchoring); and (3) anchors are made from one end or the other, randomly, from clones in the library to be ordered (nonrandom anchoring). Our results show that, for biologically realistic conditions, nonrandom anchoring is always more effective than random anchoring for contig building, and there is little to be gained from making SCL from both ends of clones vs. only one end of clones. We compare and contrast our results with other similar mathematical models.
Similar content being viewed by others
References
Arratia, R., Lander, E.S., Tavaré, S., and Waterman, M.S.: Genomic mapping by anchoring random clones: a mathematical analysis. Genomics 11: 806–825, 1991.
Barillot, E., Dausset, J., and Cohen, D.: Theoretical analysis of a physical mapping strategy using random single-copy landmarks. Proc Natl Acad Sci USA 88: 3917–3921, 1991.
Branscomb, E., Slezak, T., Pae, R., Galas, D., Carrano, A.V., and Waterman, M.S.: Optimizing restriction fragment fingerprinting methods for ordering large genomic libraries. Genomics 8: 351–366, 1990.
Carrano, A.V., Lamerdin, J., Ashworth, L.K., Watkins, B., Branscomb, E., Slezak, T., Raff, M., DeJong, P.J., Keith, D., McBride, L., Meister, S., Kronick, M.: A high-resolution, fluorescence-based, semiautomated method for DNA fingerprinting. Genomics 4: 129–136, 1989.
Coulson, A., Sulston, J., Brenner, S., and Karn, J.: Towards a physical map for the genome of the nematode Caenorhabditis elegans. Proc Natl Acad Sci USA 83: 7821–7825, 1986.
Ewens, W.J., Bell, C.J., Donnelly, P.J., Dunn, P., Matallana, E., Ecker, J.R.: Genome mapping with anchored clones: theoretical aspects. Genomics 11: 799–805, 1991.
Green, E.D. and Olson, M.V.: Systematic screening of yeast artificial-chromosome libraries using the polymerase chain reaction. Proc Natl Acad Sci USA 87: 1213–1217, 1990.
Kohara, Y., Akiyama, K., and Isono, K.: The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library. Cell 50: 495–580, 1987.
Lander, E.S. and Waterman, W.S.: Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics 2: 231–239, 1988.
Marr, T.G., Chang, W.I., and Salit, J.: Computer methods for contig assembly and minimal path determination for contigs generated by SCL content analysis. In preparation.
Olson, M.V., Dutchik, J.E., Graham, M.Y., Brodeur, G.M., Helms, C., Frank, M., Maccollin, M., Scheinman, R., and Frank, T.: Random-clone strategy for genomic restriction mapping in yeast. Proc Natl Acad Sci USA 83: 7826–7830, 1986.
Olson, M.V., Hood, L., Cantor, C., and Botstein, D.: A common language for physical mapping of the human genome. Science 245: 1434–1435, 1989.
Palazzolo, M.J., Sawyer, S.A., Martin, C.H., Smoller, D.A., and Hartl, D.L.: Optimized strategies for sequence-tagged-site selection in genome mapping. Proc Natl Acad Sci USA 88: 8034–8038, 1991.
Stallings, R.L., Torney, D.C., Hildebrand, C.E., Longmire, J.L., Deaven, L.L., Jett, J.H., Doggett, N.A., and Moyzis, R.K.: Physical mapping of human chromosomes by repetitive sequence fingerprinting. Proc Natl Acad Sci USA 87: 6218–6222, 1990.
Torney, D.C.: Mapping using unique sequences. J Mol Biol 217: 001–006, 1991.
Yan, X.: Technical report on some average character of quantities in the genomic mapping by anchoring method. State University of New York at Stony Brook, Stony Brook, N.Y., 1992.
Zhang, M.Q. and Marr, T.G.: Accelerated genome mapping with anchored clones: a discrete theoretical analysis. In preparation.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Marr, T.G., Yan, X. & Yu, Q. Genomic mapping by single copy landmark detection: a predictive model with a discrete mathematical approach. Mammalian Genome 3, 644–649 (1992). https://doi.org/10.1007/BF00352482
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00352482