Abstract
Single nucleotide polymorphisms (SNPs) and/or insertion/deletions (InDels) are frequent sequence variations in the plant genome, which can be developed as molecular markers for genetic studies on crop improvement. The ongoing Brassica rapa genome sequencing project has generated vast amounts of sequence data useful in genetic research. Here, we report a genome-wide survey of DNA polymorphisms in the B. rapa genome based on the 557 bacterial artificial clone sequences of B. rapa ssp. pekinensis cv. Chiifu. We identified and characterized 21,311 SNPs and 6,753 InDels in the gene space of the B. rapa genome by re-sequencing 1,398 sequence-tagged sites (STSs) in eight genotypes. Comparison of our findings with a B. rapa genetic linkage map confirmed that STS loci were distributed randomly over the B. rapa whole genome. In the 1.4 Mb of aligned sequences, mean nucleotide polymorphism and diversity were θ = 0.00890 and π = 0.00917, respectively. Additionally, the nucleotide diversity in introns was almost three times greater than that in exons, and the frequency of observed InDel was almost 17 times higher in introns than in exons. Information regarding SNPs/InDels obtained here will provide an important resource for genetic studies and breeding programs of B. rapa.
Similar content being viewed by others
References
Absalan F, Ronaghi M (2008) Molecular inversion probe assay. Comp Genomics 396:315–330
Baudry E, Kerdelhue C, Innan H, Stephan W (2001) Species and recombination effects on DNA variability in the tomato Genus. Genetics 158:1725–1735
Beilstein MA, Al-Shehbaz IA, Kellogg EA (2006) Brassicaceae phylogeny and trichome evolution. Am J Bot 93:607–619
Brumfield RT, Beerli P, Nickerson DA, Edwards SV (2003) The utility of single nucleotide polymorphisms in inferences of population history. Trends Ecol Evol 18:249–256
Cantor CR, Nelson MR (2005) Haplotyping in biomedicine-practical challenges. Nat Biotechnol 23:21–22
Cargill M, Altshuler D, Ireland J, Sklar P, Ardlie K, Patil N, Shaw N, Lane CR, Lim EP, Kalyanaraman N, Nemesh J, Ziaugra L, Friedland L, Rolfe A, Warrington J, Lipshutz R, Daley GQ, Lander ES (1999) Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat Genet 22:231–238
Chen D, Ahlford A, Schnorrer F, Kalchhauser I, Fellner M, Viragh E, Kiss I, Syvanen A-C, Dickson BJ (2008) High-resolution, high-throughput SNP mapping in Drosophila melanogaster. Nat Methods 5:323–329
Ching A, Caldwell KS, Jung M, Dolan M, Smith OS, Tingey S, Morgante M, Rafalski AJ (2002) SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genet 3:19
Choi W (2003) Genetic analysis on disease resistance to Turnip mosaic virus, Clubroot and Soft rot, and rapid development of multiple resistant inbreds in Chinese cabbage (Brassica rapa ssp. pekinensis). PhD thesis, Horticultural science, Chung-Ang University, Anseong, pp 1–96
Choi IY, Hyten DL, Matukumalli LK, Song Q, Chaky JM, Quigley CV, Chase K, Lark KG, Reiter RS, Yoon MS, Hwang EY, Yi SI, Young ND, Shoemaker RC, van Tassell CP, Specht JE, Cregan PB (2007a) A soybean transcript map: gene distribution, haplotype and single-nucleotide polymorphism analysis. Genetics 176:685–696
Choi S, Teakle G, Plaha P, Kim J, Allender C, Beynon E, Piao Z, Soengas P, Han T, King G, Barker G, Hand P, Lydiate D, Batley J, Edwards D, Koo D, Bang J, Park B-S, Lim Y (2007b) The reference genetic linkage map for the multinational Brassica rapa genome sequencing project. Theor Appl Genet 115:777–792
Cooper DN, Smith BA, Cooke HJ, Niemann S, Schmidtke J (1985) An estimate of unique DNA sequence heterozygosity in the human genome. Hum Genet 69:201–205
Côrte-Real HBSM, Dixon DR, Holland PWH (1994) Intron-targeted PCR: a new approach to survey neutral DNA polymorphism in bivalve populations. Marine Biol 120:407–413
Dantec LL, Chagné D, Pot D, Cantin O, Garnier-Géré P, Bedon F, Frigerio J-M, Chaumeil P, Léger P, Garcia V, Laigret F, de Daruvar A, Plomion C (2004) Automated SNP detection in expressed sequence tags: statistical considerations and application to maritime pine sequences. Plant Mol Biol 54:461–470
Ewing B, Green P (1998) Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 8:186–194
Goldman N, Yang Z (1994) A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol 11:725–736
Gordon D, Abajian C, Green P (1998) Consed: a graphical tool for sequence finishing. Genome Res 8:195–202
Groenen MAM, Wahlberg P, Foglio M, Cheng HH, Megens H-J, Crooijmans RPMA, Besnier F, Lathrop M, Muir WM, Wong GK-S, Gut I, Andersson L (2009) A high-density SNP-based linkage map of the chicken genome reveals sequence features correlated with recombination rate. Genome Res 19:510–519
Gupta PK, Rustgi S, Mir RR (2008) Array-based high-throughput DNA markers for crop improvement. Heredity 101:5–18
Hong CP, Kwon SJ, Kim JS, Yang TJ, Park BS, Lim YP (2008) Progress in understanding and sequencing the genome of Brassica rapa. Int J Plant Genomics 9 (Article ID 582837)
Jander G, Norris SR, Rounsley SD, Bush DF, Levin IM, Last RL (2002) Arabidopsis map-based cloning in the post-genome era. Plant Physiol 129:440–450
Johston JS, Pepper AE, Hall AE, Chen ZJ, Hodnett G, Drabek J, Lopez R, Price HJ (2005) Evolution of genome size in Brassicaceae. Ann Bot 95:229–235
Jorgenson E, Witte JS (2006) A gene-centric approach to genome-wide association studies. Nat Rev Genet 7:885–891
Kanazin V, Talbert H, See D, DeCamp P, Nevo E, Blake T (2002) Discovery and assay of single-nucleotide polymorphisms in barley (Hordeum vulgare). Plant Mol Biol 48:529–537
Kim JS, Chung TY, King GJ, Jin M, Yang TJ, Jin YM, Kim HI, Park BS (2006) A sequence-tagged linkage map of Brassica rapa. Genetics 174:29–39
Lessa EP (1992) Rapid surveying of DNA sequence variation in natural populations. Mol Biol Evol 9:323–330
Lijavetzky D, Cabezas JA, Ibanez A, Rodriguez V, Martinez-Zapater JM (2007) High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology. BMC Genomics 8:424
Liu F, Charlesworth D, Kreitman M (1999) The effect of mating system differences on nucleotide diversity at the phosphoglucose isomerase Locus in the plant genus Leavenworthia. Genetics 151:343–357
Lukens L, Zou F, Lydiate D, Parkin I, Osborn T (2003) Comparison of a Brassica oleracea genetic map with the genome of Arabidopsis thaliana. Genetics 164:359–372
Lysak MA, Koch MA, Pecinka A, Schubert I (2005) Chromosome triplication found across the tribe Brassiceae. Genome Res 15:516–525
Marth G, Korf I, Yandell M, Yeh R, Gu Z, Zakeri H, Stitziel N, Hillier L, Kwok P, Gish W (1999) A general approach to single-nucleotide polymorphism discovery. Nat Genet 23:452–456
Matukumalli L, Grefenstette J, Hyten D, Choi I-Y, Cregan P, Van Tassell C (2006) SNP-PHAGE—high throughput SNP discovery pipeline. BMC Bioinform 7:468
Moriyama E, Powell J (1996) Intraspecific nuclear DNA variation in Drosophila. Mol Biol Evol 13:261–277
Mun J-H, Kwon S-J, Yang T-J, Seol Y-J, Jin M, Kim J-A, Lim M-H, Kim JS, Baek S, Choi B, Yu H-J, Kim D-S, Kim N, Lim K, Lee S-I, Lim Y, Bancroft I, Hahn J-H, Park B (2009) Genome-wide comparative analysis of the Brassica rapa gene space reveals genome shrinkage and differential loss of duplicated genes after whole genome triplication. Genome Biol 10:R111
O’neill CM, Bancroft I (2000) Comparative physical mapping of segments of the genome of Brassica oleracea var. alboglabra that are homoeologous to sequenced regions of chromosomes 4 and 5 of Arabidopsis thaliana. Plant J 23:233–243
Panjabi P, Jagannath A, Bisht N, Padmaja KL, Sharma S, Gupta V, Pradhan A, Pental D (2008) Comparative mapping of Brassica juncea and Arabidopsis thaliana using Intron Polymorphism (IP) markers: homoeologous relationships, diversification and evolution of the A, B and C Brassica genomes. BMC Genomics 9:113
Pavy N, Parsons L, Paule C, MacKay J, Bousquet J (2006) Automated SNP detection from a large collection of white spruce expressed sequences: contributing factors and approaches for the categorization of SNPs. BMC Genomics 7:174
Piquemal J, Cinquin E, Couton F, Rondeau C, Seignoret E, Doucet I, Perret D, Villeger MJ, Vincourt P, Blanchard P (2005) Construction of an oilseed rape (Brassica napus L.) genetic map with SSR markers. Theor Appl Genet 111:1514–1523
Pollak E (1987) On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117:353–360
Rafalski A (2002) Applications of single nucleotide polymorphisms in crop genetics. Curr Opin Plant Biol 5:94–100
Rozen S, Skaletsky H (1999) Primer3 on the WWW for general users and for biologist programmers. In: Bioinformatics methods and protocols, pp 365–386
Schmid KJ, Sorensen TR, Stracke R, Torjek O, Altmann T, Mitchell-Olds T, Weisshaar B (2003) Large-scale identification and analysis of genome-wide single-nucleotide polymorphisms for mapping in Arabidopsis thaliana. Genome Res 13:1250–1257
Schmid KJ, Ramos-Onsins S, Ringys-Beckstein H, Weisshaar B, Mitchell-Olds T (2005) A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 169:1601–1615
Schneider K, Weisshaar B, Borchardt DC, Salamini F (2001) SNP frequency and allelic haplotype structure of Beta vulgaris expressed genes. Mol Breed 8:63–74
Soengas P, Hand P, Vicente J, Pole J, Pink D (2007) Identification of quantitative trait loci for resistance to Xanthomonas campestris pv. campestris in Brassica rapa. Theor Appl Genet 114:637–645
Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, Ingle CE, Dunning M, Flicek P, Koller D, Montgomery S, Tavare S, Deloukas P, Dermitzakis ET (2007) Population genomics of human gene expression. Nat Genet 39:1217–1224
Suwabe K, Tsukazaki H, Iketani H, Hatakeyama K, Kondo M, Fujimura M, Nunome T, Fukuoka H, Hirai M, Matsumoto S (2006) Simple sequence repeat-based comparative genomics between Brassica rapa and Arabidopsis thaliana: the genetic origin of clubroot resistance. Genetics 173:309–319
Syvanen AC (2001) Accessing genetic variation: genotyping single nucleotide polymorphisms. Nat Rev Genet 2:930–942
Syvanen AC (2005) Toward genome-wide SNP genotyping. Nat rev genet 37:S5–S10
Tajima F (1983) Evolutionary relationship of DNA sequences in finite populations. Genetics 105:437–460
Town CD, Cheung F, Maiti R, Crabtree J, Haas BJ, Wortman JR, Hine EE, Althoff R, Arbogast TS, Tallon LJ, Vigouroux M, Trick M, Bancroft I (2006) Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy. Plant Cell 18:1348–1359
UN (1935) Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn J Bot 7:389–452
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ et al (2001) The sequence of the human genome. Science 291:1304–1351
Wang DG, Fan J-B, Siao C-J, Berno A, Young P, Sapolsky R, Ghandour G, Perkins N, Winchester E, Spencer J, Kruglyak L, Stein L, Hsie L, Topaloglou T, Hubbell E, Robinson E, Mittmann M, Morris MS, Shen N, Kilburn D, Rioux J, Nusbaum C, Rozen S, Hudson TJ, Lipshutz R, Chee M, Lander ES (1998) Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280:1077–1082
Wang X, Zhao X, Zhu J, Wu W (2005) Genome-wide investigation of intron length polymorphisms and their potential as molecular markers in rice (Oryza sativa L.). DNA Res 12:417–427
Watterson GA (1975) On the number of segregating sites in genetical models without recombination. Theor Popul Biol 7:256–276
Wright SI, Lauga B, Charlesworth D (2002) Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol Biol Evol 19:1407–1420
Wright SI, Bi IV, Schroeder SG, Yamasaki M, Doebley JF, McMullen MD, Gaut BS (2005) The effects of artificial selection on the maize genome. Science 308:1310–1314
Wydner KS, Passmore HC, Sechler JL, Boyd CD (1994) Use of an intron length polymorphism to localize the tropoelastin gene to mouse Chromosome 5 in a region of linkage conservation with human Chromosome 7. Genomics 23:125–131
Yang Z (2007) PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24:1586–1591
Yang T-J, Kim JS, Kwon S-J, Lim K-B, Choi B-S, Kim J-A, Jin M, Park JY, Lim M-H, Kim H-I, Lim YP, Kang JJ, Hong J-H, Kim C-B, Bhak J, Bancroft I, Park B-S (2006) Sequence-level analysis of the diploidization process in the triplicated FLOWERING LOCUS C region of Brassica rapa. Plant Cell 18:1339–1347
Zhang L, Vision TJ, Gaut BS (2002) Patterns of nucleotide substitution among simultaneously duplicated gene pairs in Arabidopsis thaliana. Mol Biol Evol 19:1464–1473
Zhu Y, Song Q, Hyten D, Van Tassell C, Matukumalli L, Grimm D, Hyatt S, Fickus E, Young N, Cregan P (2003) Single-nucleotide polymorphisms in soybean. Genetics 163:1123–1134
Acknowledgments
We thank Dr. Ik-Young Choi of NICEM, Dr. Venkatesan Sundaresan of UC Davis, and Dr. Dae-Geun Oh of KNCAF for helpful comments on this paper. We also thank Man-Ki Kim of NIHHS, MACROGEN Co., and NICEM for technical assistance, Beom-Seok Park of NAAS for genetic mapping information, and Dr. Soo-Seong Lee of BBI, Dr. Yong Pyo Lim of CNU, and Dr. Su Hyoung Park of NIHHS for providing plant materials. This work was supported by NIHHS grants 2007139057600000502 and 200901FHT020508395, and by Post Doctoral Course Program of NIHHS for Soomin Park, Rural Development Administration, Republic of Korea.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Y. Van de Peer.
S. Park and H.-J. Yu contributed equally to this work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
438_2009_504_MOESM1_ESM.xls
Summary of DNA polymorphisms discovered in this study with a list of source BAC accessions used for this polymorphism survey (XLS 9078 kb)
Rights and permissions
About this article
Cite this article
Park, S., Yu, HJ., Mun, JH. et al. Genome-wide discovery of DNA polymorphism in Brassica rapa . Mol Genet Genomics 283, 135–145 (2010). https://doi.org/10.1007/s00438-009-0504-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-009-0504-0