Abstract
High-throughput RNA sequencing was performed to comprehensively analyze the transcriptome of the purple sweet potato. A total of 58,800 unigenes were obtained, ranging from 200 nt to 10,380 nt with an average length of 476 nt. The average expression level of a unigene was 34 reads per kb per million reads (RPKM), with a maximum of 1,935 RPKM. At least 40,280 (68.5%) unigenes were identified as protein-coding genes, of which 11,978 and 5,184 were homologous to Arabidopsis and rice proteins, respectively. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) analyses showed that 19,707 (33.5%) unigenes were classified into 1,807 GO terms covering molecular functions, biological processes, and cellular components, and that 9,970 (17.0%) unigenes were enriched in 11,119 KEGG pathways. At least 3,553 genes may be involved in the biosynthesis pathways of starch, alkaloids, anthocyanin pigments, and vitamins. Additionally, 851 potential simple sequence repeats (SSRs) were identified across all unigenes. Transcriptome sequencing of the tuberous roots of the sweet potato thus yielded substantial transcript sequences and potentially useful SSR markers, which provide an important data source for sweet potato research. Comparison of two RNA-seq datasets from the purple and the yellow sweet potato showed that UDP-glucose-flavonoid 3-O-glucosyltransferase was one of the key enzymes in the anthocyanin biosynthesis pathway and that anthocyanin-3-glucoside might be one of the major components of the anthocyanin pigments in the purple sweet potato. This study contributes to the understanding of the molecular mechanisms of sweet potato development and metabolism and thereby increases the potential utilization of the sweet potato in food nutrition and pharmacy.
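The RPKM unit used for the expression values above can be illustrated with a short calculation. The sketch below assumes the standard definition (reads mapped to a transcript, divided by the transcript length in kilobases and by the total mapped reads in millions). The function name `rpkm` and the input numbers are hypothetical and are not taken from the study.

```python
def rpkm(read_count: int, length_nt: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of transcript per million mapped reads.

    Equivalent to read_count / (length_nt / 1e3) / (total_mapped_reads / 1e6).
    """
    if length_nt <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (length_nt * total_mapped_reads)


# Hypothetical numbers for illustration only (not from the study):
# 500 reads on a 1,000 nt unigene in a library of 10 million mapped reads.
example = rpkm(read_count=500, length_nt=1000, total_mapped_reads=10_000_000)
print(f"{example:.1f} RPKM")  # prints "50.0 RPKM"
```

Normalizing by both length and library size is what lets a 200 nt unigene and a 10,380 nt unigene be compared on the same expression scale.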
Abbreviations
- AFLP: Amplified fragment-length polymorphism
- bHLH: Basic helix-loop-helix
- CHI: Chalcone isomerase
- CHS: Chalcone synthase
- COG: Clusters of orthologous groups
- EST: Expressed sequence tag
- F3′5′H: Flavonoid 3′,5′-hydroxylase
- F3H: Flavonoid 3′ hydroxylase
- GO: Gene ontology
- KEGG: Kyoto encyclopedia of genes and genomes
- RNA-seq: RNA sequencing
- RPKM: Reads per kb per million reads
- SRA: Short read archive
- SSR: Simple sequence repeat
- UFGT: UDP-glucose-flavonoid 3-O-glucosyltransferase
- VNTR: Variable number of tandem repeats
Cite this article
Xie, F., Burklew, C.E., Yang, Y. et al. De novo sequencing and a comprehensive analysis of purple sweet potato (Impomoea batatas L.) transcriptome. Planta 236, 101–113 (2012). https://doi.org/10.1007/s00425-012-1591-4
DOI: https://doi.org/10.1007/s00425-012-1591-4