Abstract
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker–trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Similar content being viewed by others
Abbreviations
- SNP:
-
Single-nucleotide polymorphism
- EST:
-
Expressed sequence tag
- RIN:
-
RNA integrity number
- TSA:
-
Transcriptome shotgun assembly
- O/L:
-
Oleic/linoleic
- RFLP:
-
Restriction fragment length polymorphism
- AFLP:
-
Amplified fragment length polymorphism
- SSR:
-
Simple sequence repeat
References
1000 Genomes Project Consortium (2012) An integrated map of genetic variation from 1,092 human genomes. Nature 491:56–65
Allen AM, Barker GL, Berry ST, Coghill JA, Gwilliam R, Kirby S, Robinson P, Brenchley RC, D’Amore R, McKenzie N, Waite D, Hall A, Bevan M, Hall N, Edwards KJ (2011) Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.). Plant Biotechnol J 9:1086–1099
Baring MR, Simpson CE, Burow MD, Black MC, Cason JM, Ayers J, Lopez Y, Melouk HA (2006) Registration of ‘Tamrun OL07’ peanut. Crop Sci 46:2721–2722
Belamkar V, Selvaraj MG, Ayers JL, Payton PR, Puppala N, Burow MD (2011) A first insight into population structure and linkage disequilibrium in the US peanut minicore collection. Genetica 139:411–429
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2009) GenBank. Nucleic Acids Res 37:D26–D31
Bertioli DJ, Ozias-Akins P, Chu Y, Dantas KM, Santos SP, Gouvea E, Guimarães PM, Leal-Bertioli SCM, Knapp SJ, Moretzsohn MC (2014) The use of SNP markers for linkage mapping in diploid and tetraploid peanuts. G3 (Bethesda) 10:89–96
Bhattramakki D, Dolan M, Hanafey M, Wineland R, Vaske D, Register JC, Tingey SV, Rafalski A (2002) Insertion–deletion polymorphisms in 3’ regions of maize genes occur frequently and can be used as highly informative genetic markers. Plant Mol Biol 48:539–547
Burow MD, Baring, Ayers JL, Schubert AM, López Y, Simpson CE (2012) Registration of ‘Tamrun OL12’ peanut. J Plant Regist 8:117–121
Ching A, Rafalski A (2002) Rapid genetic mapping of ESTs using SNP pyrosequencing and indel analysis. Cell Mol Biol Lett 7:803–810
Chopra R, Burow G, Farmer A, Mudge J, Simpson CE, Burow MD (2014) Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-seq data. PLoS ONE 9(12):e115055
Chu Y, Ramos L, Holbrook CC, Ozias-Akins P (2007) Frequency of a loss-of-function mutation in oleoyl-PC desaturase (ahFAD2A) in the minicore of the US peanut germplasm collection. Crop Sci 47:2372–2378
Cloonan N, Forrest A, Kolle G, Gardiner B, Faulkner G, Brown M, Taylor D, Steptoe A, Wani S, Bethel G (2008) Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods 5:613–619
Close TJ, Bhat PR, Lonardi S, Wu Y, Rostoks N, Ramsay L, Druka A, Stein N, Svensson JT, Wanamaker S, Bozdag S, Roose ML, Moscou MJ, Chao S, Varshney RK, Szucs P, Sato K, Hayes PM, Matthews DE, Kleinhofs A, Muehlbauer GJ, DeYoung J, Marshall DF, Madishetty K, Fenton RD, Condamine P, Graner A, Waugh R (2009) Development and implementation of high-throughput SNP genotyping in barley. BMC Genomics 10:582
Collard BCY, Mackill DJ (2008) Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Phil Trans Royal Soc B: Biol Sci 363:557–572
Davey JW, Blaxter ML (2010) RADSeq: next-generation population genetics. Brief Funct Genomics. 9:416–423
Deschamps S, Llaca V, May GD (2012) Genotyping-by-sequencing in plants. Biology 1:460–483
Doyle JJ (2012) Polyploidy in Legumes. In: Soltis PS, Soltis DE (eds) Polyploidy and genome evolution. Springer, Berlin
Duan J, Xia C, Zhao G, Jia J, Kong X (2012) Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data. BMC Genomics 13:392
Ekblom R, Galindo J (2011) Applications of next generation sequencing in molecular ecology of non-model organisms. Heredity (Edinb) 107:1–15
FAO, Food and Agriculture Organization of the United Nations (2012) FAOSTAT. Groundnuts (in Shell). http://faostat.fao.org/site/339/default.aspx. Accessed 28 Jan 2015
Gautami B, Fonceka D, Pandey MK, Moretzsohn MC, Sujay V, Qin H, Hong Y, Faye I, Chen X, BhanuPrakash A, Shah TM, Gowda MVC, Nigam SN, Liang X, Hoisington DA, Guo B, Bertioli DJ, Rami J-F, Varshney RK (2012) An international reference consensus genetic map with 897 marker loci based on 11 mapping populations for tetraploid groundnut (Arachis hypogaea L.). PLoS ONE 7:e41213
Guo B, Beavis WD (2011) In silico genotyping of the maize nested association mapping population. Mol Breed 27:107–113
Holbrook CC, Culbreath AK (2007) Registration of ‘Tifrunner’ peanut. J Plant Reg 1:124
Holbrook CC, Dong W (2005) Development and evaluation of a mini core collection for the US peanut germplasm collection. Crop Sci 45:1540–1544
Hong Y, Chen X, Liang X, Liu H, Zhou G, Li S, Wen S, Holbrook C, Guo B (2010) A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L) genome. BMC Plant Biol 10:17
Horn M, Eikenberry E, Romero-Lanuza J, Sutton J (2001) High stability peanut oil. US. Patent 6,214,405 A
Hsi DC (1980) Registration of ‘New Mexico Valencia C’ peanut (Reg No 24). Crop Sci 20:113–114
Iida A, Ohnishi Y, Ozaki K, Ariji Y, Nakamura Y, Tanaka T (2001) High-density single-nucleotide polymorphism (SNP) map in the 96-kb region containing the entire human DiGeorge syndrome critical region 2 (DGCR2) gene at 22q112. J Hum Genet 46:604–608
Kaur S, Francki MG, Forster JW (2012) Identification, characterization and interpretation of single-nucleotide sequence variation in allopolyploid crop species. Plant Biotechnol J 10:125–138
Khera P, Upadhyaya HD, Pandey MK, Roorkiwal M, Sriswathi M, Janila P, Guo Y, McKain MR, Nagy ED, Knapp SJ, Leebens-Mack J, Conner JA, Ozias-Akins P, Varshney RK (2013) Single nucleotide polymorphism–based genetic diversity in the reference set of peanut (spp.) by developing and applying cost-effective kompetitive allele specific polymerase chain reaction genotyping assays. Plant Genome 6:1–11
Knauft D, Ozias-Akins P (1995) Recent methods for germplasm enhancement and breeding. In: Pattee HE, Stalker HT (eds) Advances in peanut science. APRES, Stillwater, pp 54–94
Knauft DA, Gorbet DW, Norden AJ, Norden CK (1997) Peanut oil from enhanced peanut products. US patent no 5,922,390 A. Accessed 27 Jan 2015
Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER (2013) The next-generation sequencing revolution and its impact on genomics. Cell 155:27–38
Kottapalli KR, Burow MD, Burow G, Burke J, Puppala N (2007) Molecular characterization of the US peanut mini core collection using microsatellite markers. Crop Sci 47:1718–1727
Külheim C, Yeoh SH, Maintz J, Foley WJ, Moran GF (2009) Comparative SNP diversity among four Eucalyptus species for genes from secondary metabolite biosynthetic pathways. BMC Genomics 10:452
Lister R, O’Malley R, Tonti-Filippini J, Gregory B, Berry C, Millar A, Ecker J (2008) Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133:523–536
Lopez Y, Nadaf HL, Smith OD, Connell JP, Reddy AS, Fritz AK (2000) Isolation and characterization of the Δ12-fatty acid desaturase in peanut (Arachis hypogaea L.) and search for polymorphisms for the high oleate trait in spanish market-type lines. Theor Appl Genet 101:1131–1138
Lopez Y, Nadaf HL, Smith OD, Simpson CE, Fritz AK (2002) Expressed variants of Δ12-fatty acid desaturase for the high oleate trait in spanish market-type peanut lines. Mol Breed 9:183–190
Miller NA, Kingsmore SF, Farmer A, Langley RJ, Mudge J, Crow JA, Gonzalez AJ, Schilkey FD, Kim RJ, van Velkinburgh J, May GD, Black CF, Myers MK, Utsey JP, Frost NS, Sugarbaker DJ, Bueno R, Gullans SR, Baxter SM, Day SW, Retzel EF (2008) Management of high-throughput DNA sequencing projects: Alpheus. J Comput Sci Syst Biol 1:132
Mortazavi A, Williams B, McCue K, Schaeffer L, Wold B (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5:621–628
Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M (2008) The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320:1344–1349
Nagy ED, Guo Y, Tang S, Bowers JE, Okashah RA, Taylor CA, Zhang D, Khanal S, Heesacker AF, Khalilian N, Farmer AD, Carrasquilla-Garcia N, Penmetsa RV, Cook D, Stalker HT, Nielsen N, Ozias-Akins P, Knapp SJ (2012) A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut. BMC Genomics 13:469
Norden AJ, Lipscomb RW, Carver WA (1969) Registration of ‘Florunner’ peanuts (Reg No 2). Crop Sci 9:850
Nwokolo E, Smartt J (1996) Food and feed from legumes and oilseeds. Chapman and Hall, London
Patel M, Jung S, Moore K, Powell G, Ainsworth C, Abbott A (2004) High-oleate peanut mutants result from a MITE insertion into the FAD2 gene. Theoral Appl Genet 108:1492–1502
Pimratch S, Jogloy S, Toomsan B, Jaisil P, Sikhinarum J, Kesmala T, Patanothai P (2004) Evaluation of seven peanut genotypes for nitrogen fixation and agronomic traits Songklanakarin. J Sci Technol 26:295–304
Ratan A, Miller W, Guillory J, Stinson J, Seshagiri S, Schuster SC (2013) Comparison of sequencing platforms for single nucleotide variant calls in a human sample. PLoS One 8:e55089
Salem M, Vallejo RL, Leeds TD, Palti Y, Liu S, Sabbagh A, Rexroad CE III, Yao J (2012) RNA-seq identifies SNP markers for growth traits in rainbow trout. PLoS ONE 7:e36264
Severin AJ, Woody JL, Bolon YT, Joseph B, Diers BW, Farmer AD, Muehlbauer GJ, Nelson RT, Grant D, Specht JE, Graham MA, Cannon SB, May GD, Vance CP, Shoemaker RC (2010) RNA-seq atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol 10:160
Shirasawa K, Koilkonda P, Aoki K, Hirakawa H, Tabata S, Watanabe M, Hasegawa M, Kiyoshima H, Suzuki S, Kuwata C (2012) In silico polymorphism analysis for the development of simple sequence repeat and transposon markers and construction of linkage map in cultivated peanut. BMC Plant Biol 12:80
Shirasawa K, Bertioli DJ, Varshney RK, Moretzsohn MC, Leal-Bertioli SCM, Thudi M, Pandey MK, Rami J-F, Foncéka D, Gowda MVC, Qin H, Guo B, Hong Y, Liang X, Hirakawa H, Tabata S, Isobe S (2013) Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume. Genomes DNA Res 20:173–184
Simpson CE, Baring MR, Schubert AM, Melouk HA, Lopez Y, Kirby JS (2003) Registration of ‘OLin’ peanut. Crop Sci 43:1880–1881
Simpson CE, Nelson SC, Starr JL, Woodard KE, Smith OD (1993) Registration of ‘TxAG-6’ and ‘TxAG-7’ peanut germplasm lines. Crop Sci 33:1418
Soltis DE, Albert VA, Leebens-Mack J, Bell CD, Paterson AH, Zheng C, Sankoff D, dePamphilis CW, Wall PK, Soltis PS (2009) Polyploidy and angiosperm diversification. Am J Bot 96:336–348
Soltis DE, Soltis PS (1999) Polyploidy: recurrent formation and genome evolution. Trends Ecol Evol 14:348–352
Thimm O, Blasing O, Gibon Y, Nagel A, Meyer S, Kruger P, Selbig J, Muller LA, Rhee SY, Stitt M (2004) MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J 37:914–939
Varshney RK, Bertioli DJ, Moretzsohn MC, Vadez V, Krishnamurthy L, Aruna R, Nigam SN, Moss BJ, Seetha K, Ravi K, He G, Knapp SJ, Hoisington DA (2009) The first SSR-based genetic linkage map for cultivated groundnut (Arachis hypogaea L.). Theor Appl Genet 118:729–739
Wang M, Barkley N, Chen Z, Pittman R (2011) FAD2 gene mutations significantly alter fatty acid profiles in cultivated peanuts (Arachis hypogaea). Biochem Genet 49:748–759
Wang X, Luan J, Li J, Bao Y, Zhang C, Liu S (2010) De novo characterization of a whitefly transcriptome and analysis of its gene expression during development. BMC Genomics 11:400
Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, Klemm A, Flicek P, Manolio T, Hindorff L, Parkinson H (2013) The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42:D1001–D1006
Williams EJ, Drexler JS (1981) A non-destructive method for determining peanut pod maturity. Peanut Sci 8:134–141
Wu T, Watanabe C (2005) GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics (Oxford) 21:1859–1875
Wu X, Ren C, Joshi T, Vuong T, Xu D, Nguyen H (2010) SNP discovery by high-throughput sequencing in soybean. BMC Genomics 11:469
Wynne JC, Mozingo RW, Emery DA (1979) Registration of NC 7 peanut. Crop Sci 19:563
Xiao-Ping R, Hui-Fang J (2010) Comparison of genetic diversity between peanut mini core collections from China and ICRISAT by SSR Markers. Acta Agronomica Sinica 36:1084
Zhao Y, Prakash C, He G (2012) Characterization and compilation of polymorphic simple sequence repeat (SSR) markers of peanut from public database. BMC Res Notes C7–362(5):1–7
Zhou X, Xia Y, Ren X, Chen Y, Huang L, Huang S, Liao B, Lei Y, Yan L, Jiang H (2014) Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genomics 15:351
Acknowledgments
The authors wish to thank Jennifer Chagoya at Texas A&M AgriLife Research, and Halee Hughes and Nancy Layland at the USDA-ARS, Lubbock for technical support. This work was funded by grants from the Texas Peanut Producers Board award CY2008-Burow-TTU-Development to MDB and CES, and 2009-TTU-Burow-Genotyping to MDB, National Peanut Board grant #332/TX-99/1139 to MDB, and #332/TX-99/1213 to MDB and CES, Peanut Foundation grant 04-810-08 to MDB, Ogallala Aquifer Initiative award IPM12.06 to MDB, and United States Department of Agriculture/National Institute of Food and Agriculture Hatch Act award TEX08835 to MDB.
Conflict of interest
The authors declare that they have no conflict of interest. All experiments were performed in accordance with current biomedical research ethical standards in the US.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by S. Hohmann.
Electronic supplementary material
Below is the link to the electronic supplementary material.
438_2014_976_MOESM2_ESM.tif
Figure 2: Histogram of the number of SNPs across 36,102 contigs which were aligned to the custom reference (TIFF 251 kb)
438_2014_976_MOESM3_ESM.tif
Figure 3: FAD2 gene sequences, showing similarity of homoeologous copies in the region resulting in functional differences between alleles, and interference from homoeologous copies in the KASP assay. Differences between genes and alleles for each gene in this region are highlighted. KASP primers designed to amplify each gene are shown above and below each gene as the sequence that they are intended to amplify; SNP1_ASP is the pair of allele-specific primer designed to selectively amplify each allele of SNP1; SNP 1_C is the SNP1 common primer; designations are similar for SNP 2. When genomic DNA is amplified, SNP 1 primers will distinguish the two alleles of sequence 1 (FAD2B), and will amplify sequence 2 (FAD2A) giving the sequence 1 reference allele score for sequence 2. Likewise, SNP 2 primers will distinguish the two alleles of sequence 2 (FAD2A), but will also amplify sequence 1 (FAD2B), giving the sequence 2 (FAD2A) reference allele score (TIFF 247 kb)
Rights and permissions
About this article
Cite this article
Chopra, R., Burow, G., Farmer, A. et al. Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.. Mol Genet Genomics 290, 1169–1180 (2015). https://doi.org/10.1007/s00438-014-0976-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-014-0976-4