Skip to main content
Log in

Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.

  • Original Paper
  • Published:
Molecular Genetics and Genomics Aims and scope Submit manuscript

Abstract

Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker–trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Abbreviations

SNP:

Single-nucleotide polymorphism

EST:

Expressed sequence tag

RIN:

RNA integrity number

TSA:

Transcriptome shotgun assembly

O/L:

Oleic/linoleic

RFLP:

Restriction fragment length polymorphism

AFLP:

Amplified fragment length polymorphism

SSR:

Simple sequence repeat

References

  • 1000 Genomes Project Consortium (2012) An integrated map of genetic variation from 1,092 human genomes. Nature 491:56–65

    Article  Google Scholar 

  • Allen AM, Barker GL, Berry ST, Coghill JA, Gwilliam R, Kirby S, Robinson P, Brenchley RC, D’Amore R, McKenzie N, Waite D, Hall A, Bevan M, Hall N, Edwards KJ (2011) Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.). Plant Biotechnol J 9:1086–1099

    Article  CAS  PubMed  Google Scholar 

  • Baring MR, Simpson CE, Burow MD, Black MC, Cason JM, Ayers J, Lopez Y, Melouk HA (2006) Registration of ‘Tamrun OL07’ peanut. Crop Sci 46:2721–2722

    Article  Google Scholar 

  • Belamkar V, Selvaraj MG, Ayers JL, Payton PR, Puppala N, Burow MD (2011) A first insight into population structure and linkage disequilibrium in the US peanut minicore collection. Genetica 139:411–429

    Article  PubMed  Google Scholar 

  • Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2009) GenBank. Nucleic Acids Res 37:D26–D31

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Bertioli DJ, Ozias-Akins P, Chu Y, Dantas KM, Santos SP, Gouvea E, Guimarães PM, Leal-Bertioli SCM, Knapp SJ, Moretzsohn MC (2014) The use of SNP markers for linkage mapping in diploid and tetraploid peanuts. G3 (Bethesda) 10:89–96

    Article  Google Scholar 

  • Bhattramakki D, Dolan M, Hanafey M, Wineland R, Vaske D, Register JC, Tingey SV, Rafalski A (2002) Insertion–deletion polymorphisms in 3’ regions of maize genes occur frequently and can be used as highly informative genetic markers. Plant Mol Biol 48:539–547

    Article  CAS  PubMed  Google Scholar 

  • Burow MD, Baring, Ayers JL, Schubert AM, López Y, Simpson CE (2012) Registration of ‘Tamrun OL12’ peanut. J Plant Regist 8:117–121

    Article  Google Scholar 

  • Ching A, Rafalski A (2002) Rapid genetic mapping of ESTs using SNP pyrosequencing and indel analysis. Cell Mol Biol Lett 7:803–810

    CAS  PubMed  Google Scholar 

  • Chopra R, Burow G, Farmer A, Mudge J, Simpson CE, Burow MD (2014) Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-seq data. PLoS ONE 9(12):e115055

  • Chu Y, Ramos L, Holbrook CC, Ozias-Akins P (2007) Frequency of a loss-of-function mutation in oleoyl-PC desaturase (ahFAD2A) in the minicore of the US peanut germplasm collection. Crop Sci 47:2372–2378

    Article  CAS  Google Scholar 

  • Cloonan N, Forrest A, Kolle G, Gardiner B, Faulkner G, Brown M, Taylor D, Steptoe A, Wani S, Bethel G (2008) Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods 5:613–619

    Article  CAS  PubMed  Google Scholar 

  • Close TJ, Bhat PR, Lonardi S, Wu Y, Rostoks N, Ramsay L, Druka A, Stein N, Svensson JT, Wanamaker S, Bozdag S, Roose ML, Moscou MJ, Chao S, Varshney RK, Szucs P, Sato K, Hayes PM, Matthews DE, Kleinhofs A, Muehlbauer GJ, DeYoung J, Marshall DF, Madishetty K, Fenton RD, Condamine P, Graner A, Waugh R (2009) Development and implementation of high-throughput SNP genotyping in barley. BMC Genomics 10:582

    Article  PubMed Central  PubMed  Google Scholar 

  • Collard BCY, Mackill DJ (2008) Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Phil Trans Royal Soc B: Biol Sci 363:557–572

    Article  CAS  Google Scholar 

  • Davey JW, Blaxter ML (2010) RADSeq: next-generation population genetics. Brief Funct Genomics. 9:416–423

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Deschamps S, Llaca V, May GD (2012) Genotyping-by-sequencing in plants. Biology 1:460–483

    Article  PubMed Central  PubMed  Google Scholar 

  • Doyle JJ (2012) Polyploidy in Legumes. In: Soltis PS, Soltis DE (eds) Polyploidy and genome evolution. Springer, Berlin

    Google Scholar 

  • Duan J, Xia C, Zhao G, Jia J, Kong X (2012) Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data. BMC Genomics 13:392

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Ekblom R, Galindo J (2011) Applications of next generation sequencing in molecular ecology of non-model organisms. Heredity (Edinb) 107:1–15

    Article  CAS  Google Scholar 

  • FAO, Food and Agriculture Organization of the United Nations (2012) FAOSTAT. Groundnuts (in Shell). http://faostat.fao.org/site/339/default.aspx. Accessed 28 Jan 2015

  • Gautami B, Fonceka D, Pandey MK, Moretzsohn MC, Sujay V, Qin H, Hong Y, Faye I, Chen X, BhanuPrakash A, Shah TM, Gowda MVC, Nigam SN, Liang X, Hoisington DA, Guo B, Bertioli DJ, Rami J-F, Varshney RK (2012) An international reference consensus genetic map with 897 marker loci based on 11 mapping populations for tetraploid groundnut (Arachis hypogaea L.). PLoS ONE 7:e41213

    Article  PubMed Central  PubMed  Google Scholar 

  • Guo B, Beavis WD (2011) In silico genotyping of the maize nested association mapping population. Mol Breed 27:107–113

    Article  PubMed Central  PubMed  Google Scholar 

  • Holbrook CC, Culbreath AK (2007) Registration of ‘Tifrunner’ peanut. J Plant Reg 1:124

    Article  Google Scholar 

  • Holbrook CC, Dong W (2005) Development and evaluation of a mini core collection for the US peanut germplasm collection. Crop Sci 45:1540–1544

    Article  Google Scholar 

  • Hong Y, Chen X, Liang X, Liu H, Zhou G, Li S, Wen S, Holbrook C, Guo B (2010) A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L) genome. BMC Plant Biol 10:17

    Article  PubMed Central  PubMed  Google Scholar 

  • Horn M, Eikenberry E, Romero-Lanuza J, Sutton J (2001) High stability peanut oil. US. Patent 6,214,405 A

  • Hsi DC (1980) Registration of ‘New Mexico Valencia C’ peanut (Reg No 24). Crop Sci 20:113–114

    Article  Google Scholar 

  • Iida A, Ohnishi Y, Ozaki K, Ariji Y, Nakamura Y, Tanaka T (2001) High-density single-nucleotide polymorphism (SNP) map in the 96-kb region containing the entire human DiGeorge syndrome critical region 2 (DGCR2) gene at 22q112. J Hum Genet 46:604–608

    Article  CAS  PubMed  Google Scholar 

  • Kaur S, Francki MG, Forster JW (2012) Identification, characterization and interpretation of single-nucleotide sequence variation in allopolyploid crop species. Plant Biotechnol J 10:125–138

    Article  CAS  PubMed  Google Scholar 

  • Khera P, Upadhyaya HD, Pandey MK, Roorkiwal M, Sriswathi M, Janila P, Guo Y, McKain MR, Nagy ED, Knapp SJ, Leebens-Mack J, Conner JA, Ozias-Akins P, Varshney RK (2013) Single nucleotide polymorphism–based genetic diversity in the reference set of peanut (spp.) by developing and applying cost-effective kompetitive allele specific polymerase chain reaction genotyping assays. Plant Genome 6:1–11

    Article  CAS  Google Scholar 

  • Knauft D, Ozias-Akins P (1995) Recent methods for germplasm enhancement and breeding. In: Pattee HE, Stalker HT (eds) Advances in peanut science. APRES, Stillwater, pp 54–94

    Google Scholar 

  • Knauft DA, Gorbet DW, Norden AJ, Norden CK (1997) Peanut oil from enhanced peanut products. US patent no 5,922,390 A. Accessed 27 Jan 2015

  • Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER (2013) The next-generation sequencing revolution and its impact on genomics. Cell 155:27–38

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Kottapalli KR, Burow MD, Burow G, Burke J, Puppala N (2007) Molecular characterization of the US peanut mini core collection using microsatellite markers. Crop Sci 47:1718–1727

    Article  CAS  Google Scholar 

  • Külheim C, Yeoh SH, Maintz J, Foley WJ, Moran GF (2009) Comparative SNP diversity among four Eucalyptus species for genes from secondary metabolite biosynthetic pathways. BMC Genomics 10:452

    Article  PubMed Central  PubMed  Google Scholar 

  • Lister R, O’Malley R, Tonti-Filippini J, Gregory B, Berry C, Millar A, Ecker J (2008) Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133:523–536

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Lopez Y, Nadaf HL, Smith OD, Connell JP, Reddy AS, Fritz AK (2000) Isolation and characterization of the Δ12-fatty acid desaturase in peanut (Arachis hypogaea L.) and search for polymorphisms for the high oleate trait in spanish market-type lines. Theor Appl Genet 101:1131–1138

    Article  CAS  Google Scholar 

  • Lopez Y, Nadaf HL, Smith OD, Simpson CE, Fritz AK (2002) Expressed variants of Δ12-fatty acid desaturase for the high oleate trait in spanish market-type peanut lines. Mol Breed 9:183–190

    Article  CAS  Google Scholar 

  • Miller NA, Kingsmore SF, Farmer A, Langley RJ, Mudge J, Crow JA, Gonzalez AJ, Schilkey FD, Kim RJ, van Velkinburgh J, May GD, Black CF, Myers MK, Utsey JP, Frost NS, Sugarbaker DJ, Bueno R, Gullans SR, Baxter SM, Day SW, Retzel EF (2008) Management of high-throughput DNA sequencing projects: Alpheus. J Comput Sci Syst Biol 1:132

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Mortazavi A, Williams B, McCue K, Schaeffer L, Wold B (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5:621–628

    Article  CAS  PubMed  Google Scholar 

  • Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M (2008) The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320:1344–1349

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Nagy ED, Guo Y, Tang S, Bowers JE, Okashah RA, Taylor CA, Zhang D, Khanal S, Heesacker AF, Khalilian N, Farmer AD, Carrasquilla-Garcia N, Penmetsa RV, Cook D, Stalker HT, Nielsen N, Ozias-Akins P, Knapp SJ (2012) A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut. BMC Genomics 13:469

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Norden AJ, Lipscomb RW, Carver WA (1969) Registration of ‘Florunner’ peanuts (Reg No 2). Crop Sci 9:850

    Article  Google Scholar 

  • Nwokolo E, Smartt J (1996) Food and feed from legumes and oilseeds. Chapman and Hall, London

    Book  Google Scholar 

  • Patel M, Jung S, Moore K, Powell G, Ainsworth C, Abbott A (2004) High-oleate peanut mutants result from a MITE insertion into the FAD2 gene. Theoral Appl Genet 108:1492–1502

    Article  CAS  Google Scholar 

  • Pimratch S, Jogloy S, Toomsan B, Jaisil P, Sikhinarum J, Kesmala T, Patanothai P (2004) Evaluation of seven peanut genotypes for nitrogen fixation and agronomic traits Songklanakarin. J Sci Technol 26:295–304

    CAS  Google Scholar 

  • Ratan A, Miller W, Guillory J, Stinson J, Seshagiri S, Schuster SC (2013) Comparison of sequencing platforms for single nucleotide variant calls in a human sample. PLoS One 8:e55089

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Salem M, Vallejo RL, Leeds TD, Palti Y, Liu S, Sabbagh A, Rexroad CE III, Yao J (2012) RNA-seq identifies SNP markers for growth traits in rainbow trout. PLoS ONE 7:e36264

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Severin AJ, Woody JL, Bolon YT, Joseph B, Diers BW, Farmer AD, Muehlbauer GJ, Nelson RT, Grant D, Specht JE, Graham MA, Cannon SB, May GD, Vance CP, Shoemaker RC (2010) RNA-seq atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol 10:160

    Article  PubMed Central  PubMed  Google Scholar 

  • Shirasawa K, Koilkonda P, Aoki K, Hirakawa H, Tabata S, Watanabe M, Hasegawa M, Kiyoshima H, Suzuki S, Kuwata C (2012) In silico polymorphism analysis for the development of simple sequence repeat and transposon markers and construction of linkage map in cultivated peanut. BMC Plant Biol 12:80

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  • Shirasawa K, Bertioli DJ, Varshney RK, Moretzsohn MC, Leal-Bertioli SCM, Thudi M, Pandey MK, Rami J-F, Foncéka D, Gowda MVC, Qin H, Guo B, Hong Y, Liang X, Hirakawa H, Tabata S, Isobe S (2013) Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume. Genomes DNA Res 20:173–184

    Article  CAS  Google Scholar 

  • Simpson CE, Baring MR, Schubert AM, Melouk HA, Lopez Y, Kirby JS (2003) Registration of ‘OLin’ peanut. Crop Sci 43:1880–1881

    Article  Google Scholar 

  • Simpson CE, Nelson SC, Starr JL, Woodard KE, Smith OD (1993) Registration of ‘TxAG-6’ and ‘TxAG-7’ peanut germplasm lines. Crop Sci 33:1418

    Article  Google Scholar 

  • Soltis DE, Albert VA, Leebens-Mack J, Bell CD, Paterson AH, Zheng C, Sankoff D, dePamphilis CW, Wall PK, Soltis PS (2009) Polyploidy and angiosperm diversification. Am J Bot 96:336–348

    Article  PubMed  Google Scholar 

  • Soltis DE, Soltis PS (1999) Polyploidy: recurrent formation and genome evolution. Trends Ecol Evol 14:348–352

    Article  PubMed  Google Scholar 

  • Thimm O, Blasing O, Gibon Y, Nagel A, Meyer S, Kruger P, Selbig J, Muller LA, Rhee SY, Stitt M (2004) MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J 37:914–939

    Article  CAS  PubMed  Google Scholar 

  • Varshney RK, Bertioli DJ, Moretzsohn MC, Vadez V, Krishnamurthy L, Aruna R, Nigam SN, Moss BJ, Seetha K, Ravi K, He G, Knapp SJ, Hoisington DA (2009) The first SSR-based genetic linkage map for cultivated groundnut (Arachis hypogaea L.). Theor Appl Genet 118:729–739

    Article  CAS  PubMed  Google Scholar 

  • Wang M, Barkley N, Chen Z, Pittman R (2011) FAD2 gene mutations significantly alter fatty acid profiles in cultivated peanuts (Arachis hypogaea). Biochem Genet 49:748–759

    Article  CAS  PubMed  Google Scholar 

  • Wang X, Luan J, Li J, Bao Y, Zhang C, Liu S (2010) De novo characterization of a whitefly transcriptome and analysis of its gene expression during development. BMC Genomics 11:400

    Article  PubMed Central  PubMed  Google Scholar 

  • Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, Klemm A, Flicek P, Manolio T, Hindorff L, Parkinson H (2013) The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42:D1001–D1006

    Article  PubMed Central  PubMed  Google Scholar 

  • Williams EJ, Drexler JS (1981) A non-destructive method for determining peanut pod maturity. Peanut Sci 8:134–141

    Article  Google Scholar 

  • Wu T, Watanabe C (2005) GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics (Oxford) 21:1859–1875

    Article  CAS  Google Scholar 

  • Wu X, Ren C, Joshi T, Vuong T, Xu D, Nguyen H (2010) SNP discovery by high-throughput sequencing in soybean. BMC Genomics 11:469

    Article  PubMed Central  PubMed  Google Scholar 

  • Wynne JC, Mozingo RW, Emery DA (1979) Registration of NC 7 peanut. Crop Sci 19:563

    Article  Google Scholar 

  • Xiao-Ping R, Hui-Fang J (2010) Comparison of genetic diversity between peanut mini core collections from China and ICRISAT by SSR Markers. Acta Agronomica Sinica 36:1084

    Google Scholar 

  • Zhao Y, Prakash C, He G (2012) Characterization and compilation of polymorphic simple sequence repeat (SSR) markers of peanut from public database. BMC Res Notes C7–362(5):1–7

    Google Scholar 

  • Zhou X, Xia Y, Ren X, Chen Y, Huang L, Huang S, Liao B, Lei Y, Yan L, Jiang H (2014) Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genomics 15:351

    Article  PubMed Central  PubMed  Google Scholar 

Download references

Acknowledgments

The authors wish to thank Jennifer Chagoya at Texas A&M AgriLife Research, and Halee Hughes and Nancy Layland at the USDA-ARS, Lubbock for technical support. This work was funded by grants from the Texas Peanut Producers Board award CY2008-Burow-TTU-Development to MDB and CES, and 2009-TTU-Burow-Genotyping to MDB, National Peanut Board grant #332/TX-99/1139 to MDB, and #332/TX-99/1213 to MDB and CES, Peanut Foundation grant 04-810-08 to MDB, Ogallala Aquifer Initiative award IPM12.06 to MDB, and United States Department of Agriculture/National Institute of Food and Agriculture Hatch Act award TEX08835 to MDB.

Conflict of interest

The authors declare that they have no conflict of interest. All experiments were performed in accordance with current biomedical research ethical standards in the US.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mark D. Burow.

Additional information

Communicated by S. Hohmann.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Figure 1: Contig distribution of the custom reference assembly, for contigs with length ≥ 200 bp (TIFF 194 kb)

438_2014_976_MOESM2_ESM.tif

Figure 2: Histogram of the number of SNPs across 36,102 contigs which were aligned to the custom reference (TIFF 251 kb)

438_2014_976_MOESM3_ESM.tif

Figure 3: FAD2 gene sequences, showing similarity of homoeologous copies in the region resulting in functional differences between alleles, and interference from homoeologous copies in the KASP assay. Differences between genes and alleles for each gene in this region are highlighted. KASP primers designed to amplify each gene are shown above and below each gene as the sequence that they are intended to amplify; SNP1_ASP is the pair of allele-specific primer designed to selectively amplify each allele of SNP1; SNP 1_C is the SNP1 common primer; designations are similar for SNP 2. When genomic DNA is amplified, SNP 1 primers will distinguish the two alleles of sequence 1 (FAD2B), and will amplify sequence 2 (FAD2A) giving the sequence 1 reference allele score for sequence 2. Likewise, SNP 2 primers will distinguish the two alleles of sequence 2 (FAD2A), but will also amplify sequence 1 (FAD2B), giving the sequence 2 (FAD2A) reference allele score (TIFF 247 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chopra, R., Burow, G., Farmer, A. et al. Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.. Mol Genet Genomics 290, 1169–1180 (2015). https://doi.org/10.1007/s00438-014-0976-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00438-014-0976-4

Keywords

Navigation