Skip to main content
Log in

Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing

  • Published:
Theoretical and Applied Genetics Aims and scope Submit manuscript

Abstract

Rice is an important crop and a model system for monocot genomics, and is a target for whole genome sequencing by the International Rice Genome Sequencing Project (IRGSP). The IRGSP is using a clone by clone approach to sequence rice based on minimum tiles of BAC or PAC clones. For chromosomes 10 and 3 we are using an integrated physical map based on two fingerprinted and end-sequenced BAC libraries to identifying a minimum tiling path of clones. In this study we constructed and tested two rice genomic libraries with an average insert size of 10 kb (10-kb library) to support the gap closure and finishing phases of the rice genome sequencing project. The HaeIII library contains 166,752 clones covering approximately 4.6× rice genome equivalents with an average insert size of 10.5 kb. The Sau3AI library contains 138,960 clones covering 4.2× genome equivalents with an average insert size of 11.6 kb. Both libraries were gridded in duplicate onto 11 high-density filters in a 5 × 5 pattern to facilitate screening by hybridization. The libraries contain an unbiased coverage of the rice genome with less than 5% contamination by clones containing organelle DNA or no insert. An efficient method was developed, consisting of pooled overgo hybridization, the selection of 10-kb gap spanning clones using end sequences, transposon sequencing and utilization of in silico draft sequence, to close relatively small gaps between sequenced BAC clones. Using this method we were able to close a majority of the gaps (up to approximately 50 kb) identified during the finishing phase of chromosome-10 sequencing. This method represents a useful way to close clone gaps and thus to complete the entire rice genome.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.

Similar content being viewed by others

References

  • Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815

    PubMed  Google Scholar 

  • Atrazhev AM, Elliott JF (1996) Simplified de-salting of ligation reactions immediately prior to electroporation into E. coli. Biotechniques 21:1024

    CAS  Google Scholar 

  • Barry G (2001) The use of the Monsanto draft rice genome sequence in research. Plant Physiol 125:1164–1165

    CAS  PubMed  Google Scholar 

  • Batzoglou S, Berger B, Mesirov J, Lander ES (1999) Sequencing a genome by walking with clone-end sequences: a mathematical analysis. Genome Res 9:1163–1174

    Article  CAS  PubMed  Google Scholar 

  • Budiman MA, Mao L, Wood T, Wing RA (2000) A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing. Genome Res 10:129–136

    CAS  PubMed  Google Scholar 

  • Chen M, Presting G, Barbazuk WB, Goicoechea JL, Blackmon B, Fang G, Kim H, Frisch D, Yu Y, Sun S, et al. (2002) An integrated physical and genetic map of the rice genome. Plant Cell 14:1–10

    Article  CAS  PubMed  Google Scholar 

  • Devine SE, Chissoe SL, Eby Y, Wilson RK, Boeke JD (1997) A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes. Genome Res 7:551–563

    CAS  PubMed  Google Scholar 

  • Ewing B, Green P (1998) Base-calling of automated sequencer traces using PhredII. Error probabilities. Genome Res 8:186–194

    CAS  PubMed  Google Scholar 

  • Ewing B, Hillier L, Wendl MC, Green P (1998) Base-calling of automated sequencer traces using PHRED. I. Accuracy assessment. Genome Res 8:175–185

    CAS  PubMed  Google Scholar 

  • Feng Q, Zhang Y, Hao P, Wang S, Fu G, Huang Y, Li Y, Zhu J, Liu Y, Hu X, et al. (2002) Sequence and analysis of rice chromosome 4. Nature 420:316–320

    Article  CAS  PubMed  Google Scholar 

  • Frohme M, Camargo AA, Czink C, Matsukuma AY, Simpson AJG, Hoheisel JD, Verjovski-Almeida S (2001) Directed gap closure in large-scale sequencing projects. Genome Res 11:901–903

    Article  CAS  PubMed  Google Scholar 

  • Goff SG, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296:92–100

    CAS  PubMed  Google Scholar 

  • Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202

    CAS  PubMed  Google Scholar 

  • Green P (1997) Against a whole-genome shotgun. Genome Res 7:410–417

    CAS  PubMed  Google Scholar 

  • International Human Genome Sequencing Consortium (2001) Initial sequencing and analysis of the human genome. Nature 409:860–941

    CAS  PubMed  Google Scholar 

  • Luo M, Wang YH, Frisch D, Joobeur T, Wing RA, Ralph AD (2001) Melon BAC library construction using improved methods and identification of clones linked to the locus conferring resistance to melon Fusarium Wilt (Fom-2). Genome 44:154–162

    Article  CAS  PubMed  Google Scholar 

  • McMurray AA, Sulston, JE, Quail MA (1998) Short-insert libraries as a method of problem solving in genome sequencing. Genome Res 8:562–566

    CAS  PubMed  Google Scholar 

  • Ohmido N, Kijima K, Akiyama Y, de Jong JH, Fukui K (2000) Quantification of total genomic DNA and selected repetitive sequences reveals concurrent changes in different DNA families in indica and jopanica rice. Mol Gen Genet 263:388–394

    CAS  PubMed  Google Scholar 

  • Roach JC, Thorsson V, Siegel AF (2000) Parking strategies for genome sequencing. Genome Res 10:1020–1030

    Article  CAS  PubMed  Google Scholar 

  • Sasaki T, Burr B (2000) International rice genome sequencing project: the effort to completely sequence the rice genome. Curr Opin Plant Biol 3:138–141

    CAS  PubMed  Google Scholar 

  • Sasaki T, Matsumoto T, Yamamoto K, Sakata K, Baba T, Katayose Y, Wu J, Niimura Y, Cheng Z, Nagamura Y, et al. (2002) The genome sequence and structure of rice chromosome 1. Nature 420:312–316

    Article  CAS  PubMed  Google Scholar 

  • Siegel AF, Trask B, Roach JD, Mahairas GG, Hood L, van den Engh G (1999) Analysis of sequence-tagged-connector strategies for DNA sequencing. Genome Res 9:297–307

    CAS  PubMed  Google Scholar 

  • Soderlund C, Humphray S, Dunham A, French L (2000) Contigs built with fingerprints, markers and FPC V4.7. Genome Res 10:1772–1787

    Article  CAS  PubMed  Google Scholar 

  • Tomkins JP, Peterson DG, Yang TJ, Main D, Wilkins TA, Paterson AH, Wing RA (2001) Development of genomic resources for cotton (Gosypium hirsutuum): BAC library construction, preliminary STC analysis, and identification of clones associated with fiber development. Mol Breed 8:255–261

    Article  CAS  Google Scholar 

  • Venter JD, Smith HO, Hood L (1996) A new strategy for genome sequencing. Nature 381:364–366

    CAS  PubMed  Google Scholar 

  • Venter JD, Adams MD, Myers EW, Li PW, Mural RJ, Sutton SS, Smith HO, Yandell M, Evans CA, Holt RA, et al. (2001) The sequence of the human genome. Science 291:1304–1351

    CAS  PubMed  Google Scholar 

  • Wendl MC, Marra MA, Hillier LW, Chinwalla AT, Wilson RK, Waterston RH (2001) Theories and applications for sequencing randomly selected clones. Genome Res 11:274–280

    Article  CAS  PubMed  Google Scholar 

  • Yang TJ, Yu Y, Frisch D, Wing RA (2002) Two series of plasmid vectors for the shotgun library. Plant and Animal Genome Conference X, p 24

  • Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, et al. (2002) A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296:79–91

    CAS  PubMed  Google Scholar 

  • Yu Y, Tomkins JP, Waugh R, Frisch D, Kudrna D, Kleinhofs A, Brueggeman RS, Muehlbauer GJ, Wise RP, Wing RA (2000) A bacterial artificial chromosome library for barley (Hordeum vulgare L.) and the identification of clones containing putative resistance genes. Theor Appl Genet 101:1093–1099

    Article  CAS  Google Scholar 

  • Yu Y, Rambo T, Currie J, Sasaki C, Kim HR, Collura K, Thompson S, Simmons J, Yang TJ, Park GN, Patel AJ, Thurmond S, Henry D, Oates R, Palmer M, Pries G, Gibson J, Anderson H, Paradkar M, Crane L, Dale J, Carver MB, Wood T, Frisch D, Engler F, Soderlund C, Palmer LE, Tetylman L, Nascimento L, Bastide M de la, Spiegel L, Ware D, O'Shaughnessy A, Dike S, Dedhia N, Preston R, Huang E, Ferraro K, Kuit K, Miller B, Zutavern T, Katzenberger F, Muller S, Balija V, Martienssen RA, Stein L, Minx P, Johnson D, Cordum H, Mardis E, Cheng Z, Jiang J, Wilson R, McCombie WR, Wing RA, Yuan Q, Ouyang S, Liu J, Jones KM, Gansberger K, Moffat K, Hill J, Tsitrin T, Overton L, Bera J, Kim M, Jin S, Tallon L, Ciecko A, Pai G, Aken SV, Utterback T, Reidmuller S, Bormann J, Feldblyum T, Hsiao J, Zismann V, Blunt S, Vazeilles A de, Shaffer T, Koo H, Suh B, Yang Q, Haas B, Peterson J, Pertea M, Volfovsky N, Worman J, White O, Salzberg SL, Fraser CV, Buell CR, Messing J, Song R, Fuks G, Llaca V, Kovchak S, Young S, Bowers JE, Paterson AH, Johns MA, Mao L, Pan H, Dean RA (2003) In-depth view of structure, activity, and evolution of rice chromosome 10. Science (in press)

  • Zhang HB, Zhao X, Ding X, Paterson AH, Wing RA (1995) Preparation of megabase-size DNA from plant nuclei. Plant J 7:175–184

    CAS  Google Scholar 

Download references

Acknowledgements

This work was supported by grants from the U.S. Department of Agriculture CREES, the National Science Foundation, and the Department of Energy and postdoctoral fellowship program from Korea Science and Engineering Foundation (KOSEF). Any opinions, findings, and conclusions or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the U.S. Department of Agriculture, the National Science Foundation or the Department of Energy. We thank Monsanto for providing access to their draft sequence data.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rod A. Wing.

Additional information

Communicated by Q. Zhang

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, TJ., Yu, Y., Nah, G. et al. Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing. Theor Appl Genet 107, 652–660 (2003). https://doi.org/10.1007/s00122-003-1302-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00122-003-1302-4

Keywords

Navigation