
Genome-wide discovery of DNA polymorphism in Brassica rapa

  • Original Paper
  • Published in Molecular Genetics and Genomics

Abstract

Single nucleotide polymorphisms (SNPs) and insertions/deletions (InDels) are frequent sequence variations in plant genomes and can be developed into molecular markers for genetic studies aimed at crop improvement. The ongoing Brassica rapa genome sequencing project has generated vast amounts of sequence data useful in genetic research. Here, we report a genome-wide survey of DNA polymorphisms in the B. rapa genome based on 557 bacterial artificial chromosome (BAC) clone sequences of B. rapa ssp. pekinensis cv. Chiifu. We identified and characterized 21,311 SNPs and 6,753 InDels in the gene space of the B. rapa genome by re-sequencing 1,398 sequence-tagged sites (STSs) in eight genotypes. Comparison of our findings with a B. rapa genetic linkage map confirmed that the STS loci were distributed randomly across the whole B. rapa genome. In the 1.4 Mb of aligned sequences, mean nucleotide polymorphism and diversity were θ = 0.00890 and π = 0.00917, respectively. Additionally, nucleotide diversity in introns was almost three times greater than that in exons, and the frequency of observed InDels was almost 17 times higher in introns than in exons. The SNP/InDel information obtained here will provide an important resource for genetic studies and breeding programs of B. rapa.
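
For readers unfamiliar with the two summary statistics quoted above, the following is a minimal sketch of how per-site θ (Watterson's estimator, based on the number of segregating sites) and π (the mean number of pairwise differences) can be computed from a set of aligned sequences. It is not the authors' pipeline: the function names, the toy alignment, and the choice to drop every alignment column containing a gap are assumptions made purely for illustration.

```python
from itertools import combinations


def ungapped_columns(seqs):
    """Return the alignment columns in which no sequence has a gap ('-').

    Dropping gapped columns is an illustrative assumption, not the
    treatment used in the study.
    """
    return [col for col in zip(*seqs) if "-" not in col]


def nucleotide_diversity(seqs):
    """pi: mean number of pairwise differences, divided by sites compared."""
    cols = ungapped_columns(seqs)
    pairs = list(combinations(range(len(seqs)), 2))
    diffs = sum(col[i] != col[j] for col in cols for i, j in pairs)
    return diffs / len(pairs) / len(cols)


def watterson_theta(seqs):
    """theta: segregating sites S divided by a_n = sum_{i=1}^{n-1} 1/i, per site."""
    cols = ungapped_columns(seqs)
    segregating = sum(len(set(col)) > 1 for col in cols)
    a_n = sum(1.0 / i for i in range(1, len(seqs)))
    return segregating / a_n / len(cols)


# Hypothetical toy alignment: four sequences of eight sites each.
toy_alignment = [
    "ACGTACGT",
    "ACGTACGA",
    "ACGAACGT",
    "ACGTACGT",
]

pi = nucleotide_diversity(toy_alignment)
theta = watterson_theta(toy_alignment)
print(f"pi = {pi:.5f}, theta = {theta:.5f}")
```

With eight genotypes, as in the survey, the normalising term a_n would be the sum 1 + 1/2 + ... + 1/7. Values of θ and π that lie close together, as reported in the abstract, indicate that the two estimators give similar pictures of the variation in the sampled sequences.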



Acknowledgments

We thank Dr. Ik-Young Choi of NICEM, Dr. Venkatesan Sundaresan of UC Davis, and Dr. Dae-Geun Oh of KNCAF for helpful comments on this paper. We also thank Man-Ki Kim of NIHHS, MACROGEN Co., and NICEM for technical assistance, Beom-Seok Park of NAAS for genetic mapping information, and Dr. Soo-Seong Lee of BBI, Dr. Yong Pyo Lim of CNU, and Dr. Su Hyoung Park of NIHHS for providing plant materials. This work was supported by NIHHS grants 2007139057600000502 and 200901FHT020508395, and by the Post Doctoral Course Program of NIHHS for Soomin Park, Rural Development Administration, Republic of Korea.

Author information

Corresponding author

Correspondence to Hee-Ju Yu.

Additional information

Communicated by Y. Van de Peer.

S. Park and H.-J. Yu contributed equally to this work.

Electronic supplementary material

Below is the link to the electronic supplementary material.

438_2009_504_MOESM1_ESM.xls

Summary of DNA polymorphisms discovered in this study with a list of source BAC accessions used for this polymorphism survey (XLS 9078 kb)

About this article

Cite this article

Park, S., Yu, HJ., Mun, JH. et al. Genome-wide discovery of DNA polymorphism in Brassica rapa. Mol Genet Genomics 283, 135–145 (2010). https://doi.org/10.1007/s00438-009-0504-0


  • DOI: https://doi.org/10.1007/s00438-009-0504-0
