Abstract
For Salvia miltiorrhiza, bioinformatic tools have been used to analyze its DNA sequences, protein sequences as well as next-generation DNA sequencing (NGS) data studying the transcriptome, circular RNAs, etc. Here, we first described the basic file formats used in studying S. miltiorrhiza. FASTA is the standard format describing nucleotide and amino acid sequences. FASTQ format adds the quality scores to the FASTA file. GFF/GTF is used to describe the genome and gene structures, such as genes, CDS, proteins, exons, and introns. SAM format is widely used to describe the mapping of NGS reads to the reference sequences. For various analysis tasks, many commercial software tools have been developed. However, for users who prefer to use free software tools or develop computational pipelines, the basic function of EMBOSS software is introduced. Mapping of reads to reference sequences is the most widely used method for analyzing NGS data, and we introduced several frequently used tools. Bowtie2 is one of the most widely used tools for read mapping. SAMtools can be used to analyze the mapping data. As an example, several software packages identifying circRNAs are described. This chapter thus provides an introduction to the most fundamental bioinformatic tools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Altschul SF, Gish W, Miller W, Myers EW, Lipman JD (1990) Basic local alignment search tool. J Mol Biol 215:403–410
Gao Y, Zhang J, Zhao F (2018) Circular RNA identification based on multiple seed matching. Brief Bioinform 19:803–810
Huang X, Chen Y, Xiao J, Huang Z, Peng J (2018) Identification of differentially expressed circular RNAs during TGF-ß1-induced endothelial-to-mesenchymal transition in rat coronary artery endothelial cells. Anat J Cardiol 19:192
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Meth 9:357–359
Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup (2009) The sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079
Rice P, Longden I, Bleasby A (2000) EMBOSS: the European molecular biology open software suite. Trends Genet 16:276–277
Westholm JO, Miura P, Olson S, Shenker S, Joseph B, Sanfilippo P, Celniker SE, Graveley BR, Lai EC (2014) Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Rep 9:1966–1980
Zhang XO, Dong R, Zhang Y, Zhang JL, Luo Z, Zhang J, Chen LL, Yang L (2016) Diverse alternative back-splicing and alternative splicing landscape of circular RNAs. Genome Res 26:1277–1287
Acknowledgements
This work has been supported by Chinese Academy of Medical Sciences, Innovation Funds for Medical Sciences (CIFMS) (2016-I2M-3-016, 2017-I2M-1-013) and National Natural Science Foundation of China (81872966).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Wang, L., Liu, C. (2019). Bioinformatic Tools for Salvia miltiorrhiza Functional Genomics. In: Lu, S. (eds) The Salvia miltiorrhiza Genome. Compendium of Plant Genomes. Springer, Cham. https://doi.org/10.1007/978-3-030-24716-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-24716-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24715-7
Online ISBN: 978-3-030-24716-4
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)