Gene Annotation and Database of the Radish

Chapter
Part of the Compendium of Plant Genomes book series (CPG)

Abstract

The genome sequences of Raphanus sativus var. hortensis were determined for the Japanese cultivars ‘Aokubi S-h’ (Kitashiba et al. in DNA Res 21:481–490, 2014), ‘Aokubi doubled haploid (DH)’ (Mitsui et al. in Sci Rep 5:10835, 2015), and the Korean cultivar ‘WK10039’ (Jeong et al. in Theor Appl Genet 129:1357–1372, 2016). After the genome sequencing, gene annotations, such as gene finding, domain searches, and similarity searches against several databases, were conducted. The genomic information obtained by these analyses was then released from a database. In this section, the gene annotation methods conducted in each of the three genome sequencing projects is described. In addition, several databases related to the genome sequencing, EST sequencing, and DNA markers in radish are introduced.

Keywords

Database Gene annotation Genome sequences DNA markers Pseudomolecule Raphanus 

References

  1. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402CrossRefPubMedPubMedCentralGoogle Scholar
  2. Asamizu E, Ichihara H, Nakaya A, Nakamura Y, Hirakawa H, Ishii T, Tamura T, Fukami-Kobayashi K, Nakajima Y, Tabata S (2014) Plant Genome DataBase Japan (PGDBj): a portal website for the integration of plant genome-related databases. Plant Cell Physiol 55:e8CrossRefPubMedPubMedCentralGoogle Scholar
  3. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21:3674–3676CrossRefPubMedGoogle Scholar
  4. Eddy SR (2011) Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195CrossRefPubMedPubMedCentralGoogle Scholar
  5. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652CrossRefPubMedPubMedCentralGoogle Scholar
  6. Haas B, Salzberg S, Zhu W, Pertea M, Allen J, Orvis J, White O, Buell C, Wortman J (2008) Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol 9:R7CrossRefPubMedPubMedCentralGoogle Scholar
  7. Jeong YM, Kim N, Ahn BO, Oh M, Chung WH, Chung H, Jeong S, Lim KB, Hwang YJ, Kim GB, Baek S, Choi SB, Hyung DJ, Lee SW, Sohn SH, Kwon SJ, Jin M, Seol YJ, Chae WB, Choi KJ, Park BS, Yu HJ, Mun JH (2016) Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes. Theor Appl Genet 129:1357–1372CrossRefPubMedGoogle Scholar
  8. Kitashiba H, Li F, Hirakawa H, Kawanabe T, Zou Z, Hasegawa Y, Tonosaki K, Shirasawa S, Fukushima A, Yokoi S, Takahata Y, Kakizaki T, Ishida M, Okamoto S, Sakamoto K, Shirasawa K, Tabata S, Nishio T (2014) Draft sequences of the radish (Raphanus sativus L.) genome. DNA Res 21:481–490CrossRefPubMedPubMedCentralGoogle Scholar
  9. Korf I (2004) Gene finding in novel genomes. BMC Bioinformat 5:59CrossRefGoogle Scholar
  10. Li L, Stoeckert CJ Jr, Roos DS (2003) OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res 13:2178–2189CrossRefPubMedPubMedCentralGoogle Scholar
  11. Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658–1659CrossRefPubMedGoogle Scholar
  12. Llorens C, Futami R, Covelli L, Domínguez-Escribá L, Viu JM, Tamarit D, Aguilar-Rodríguez J, Vicente-Ripolles M, Fuster G, Bernet GP, Maumus F, Munoz-Pomer A, Sempere JM, Latorre A, Moya A (2011) The Gypsy Database (GyDB) of mobile genetic elements: release 2.0. Nucleic Acids Res 39(Database Issue):D70–D74CrossRefPubMedGoogle Scholar
  13. Lowe TM, Eddy SR (1997) tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964CrossRefPubMedPubMedCentralGoogle Scholar
  14. Mitsui Y, Shimomura M, Komatsu K, Namiki N, Shibata-Hatta M, Imai M, Katayose Y, Mukai Y, Kanamori H, Kurita K, Kagami T, Wakatsuki A, Ohyanagi H, Ikawa H, Minaka N, Nakagawa K, Shiwa Y, Sasaki T (2015) The radish genome and comprehensive gene expression profile of tuberous root formation and development. Sci Rep 5:10835CrossRefPubMedPubMedCentralGoogle Scholar
  15. Moghe GD, Hufnagel DE, Tang H, Xiao Y, Dworkin I, Town CD, Conner JK, Shiu SH (2014) Consequences of whole-genome triplication as revealed by comparative genomic analyses of the wild radish Raphanus raphanistrum and three other Brassicaceae species. Plant Cell 26:1925–1937CrossRefPubMedPubMedCentralGoogle Scholar
  16. Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJ, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C (2007) New developments in the InterPro database. Nucleic Acids Res 35(Database Issue):D224–D228CrossRefPubMedPubMedCentralGoogle Scholar
  17. Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M (1999) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 27:29–34CrossRefPubMedPubMedCentralGoogle Scholar
  18. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005) InterProScan: protein domains identifier. Nucleic Acids Res 33(Web Server Issue):W116–W120CrossRefPubMedPubMedCentralGoogle Scholar
  19. Rice P, Longden I, Bleasby A (2000) EMBOSS: the European molecular biology open software suite. Trends Genet 16:276–277CrossRefPubMedGoogle Scholar
  20. Shen D, Sun H, Huang M, Zheng Y, Li X, Fei Z (2013) RadishBase: a database for genomics and genetics of radish. Plant Cell Physiol 54:e3CrossRefPubMedGoogle Scholar
  21. Shirasawa K, Isobe S, Tabata S, Hirakawa H (2014) Kazusa marker DataBase: a database for genomics, genetics, and molecular breeding in plants. Breed Sci 64:264–271CrossRefPubMedPubMedCentralGoogle Scholar
  22. Stanke M, Waack S (2003) Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19:ii215–ii225CrossRefPubMedGoogle Scholar
  23. Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformat 4:41CrossRefGoogle Scholar
  24. Wu T, Nacu N (2010) Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics 26:873–881CrossRefPubMedPubMedCentralGoogle Scholar
  25. Zhang X, Yue Z, Mei S, Qiu Y, Yang X, Chen X, Cheng F, Wu Z, Sun Y, Jing Y, Liu B, Shen D, Wang H, Cui N, Duan Y, Wu J, Wang J, Gan C, Wang J, Wang X, Li X (2015) A de novo genome of a Chinese radish cultivar. Hortic Plant J 1:155–164Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Kazusa DNA Research InstituteKisarazuJapan

Personalised recommendations