Abstract
Over the last 15Â years, the costs of DNA sequencing have sharply fallen, effectively shifting the costs of DNA analysis from sequencing to bioinformatic curation and storage. A huge number of available DNA sequences (including genomes and metagenomes) resulted in the development of various tools for sequence annotation. While much effort has been invested into the development of automatic annotation pipelines, manual curation of their results is still necessary in order to obtain a reliable and strictly validated data. Unfortunately, due to its time-consuming nature, manual annotation is now rarely used.
In this chapter, a protocol for efficient manual annotation of prokaryotic DNA sequences using a novel bioinformatic tool—MAISEN (http://maisen.ddlemb.com), is presented. MAISEN is a free, web-based tool designed to accelerate manual annotation, by providing the user with simple interface and precomputed alignments for each predicted feature. It was designed to be available for every scientist, regardless of their bioinformatic proficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Brettin T, Davis JJ, Disz T et al (2015) RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep 5:8365. https://doi.org/10.1038/srep08365
Seemann T (2014) Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. https://doi.org/10.1093/bioinformatics/btu153
Aziz RK, Bartels D, Best AA et al (2008) The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. https://doi.org/10.1186/1471-2164-9-75
Wattam AR, Brettin T, Davis JJ et al (2018) Assembly, annotation, and comparative genomics in PATRIC, the all bacterial bioinformatics resource center. Methods Mol Biol 1704:79–101. https://doi.org/10.1007/978-1-4939-7463-4_4
McNeil LK, Reich C, Aziz RK et al (2007) The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation. Nucleic Acids Res 35:D347–D353. https://doi.org/10.1093/nar/gkl947
Allen B, Drake M, Harris N, Sullivan T (2017) Using KBase to assemble and annotate prokaryotic genomes. Curr Protoc Microbiol 46:1E.13.1–1E.13.18. https://doi.org/10.1002/cpmc.37
Dunn NA, Unni DR, Diesh C et al (2019) Apollo: Democratizing genome annotation. PLoS Comput Biol 15:1–14. https://doi.org/10.1371/journal.pcbi.1006790
Carver T, Berriman M, Tivey A et al (2008) Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics 24:2672–2676
Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. https://doi.org/10.1038/nmeth.3176
Eddy SR (2011) Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. https://doi.org/10.1371/journal.pcbi.1002195
Apweiler R, Bairoch A, Wu CH et al (2004) UniProt: the universal protein knowledgebase. Nucleic Acids Res 32:D115–D119
Agarwala R, Barrett T, Beck J et al (2018) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 46:D8–D13. https://doi.org/10.1093/nar/gkx1095
Wee Y, Bhyan SB, Liu Y et al (2019) The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing. Brief Funct Genomics 18:1–12. https://doi.org/10.1093/bfgp/ely037
Sohn J, Nam J-W (2016) The present and future of de novo whole-genome assembly. Brief Bioinform 19:23–40. https://doi.org/10.1093/bib/bbw096
Hyatt D, Chen G-L, LoCascio PF et al (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. https://doi.org/10.1186/1471-2105-11-119
Lowe TM, Chan PP (2016) tRNAscan-SE on-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res 44:W54–W57. https://doi.org/10.1093/nar/gkw413
Laslett D, Canback B (2004) ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. https://doi.org/10.1093/nar/gkh152
McNair K, Zhou C, Dinsdale EA et al (2019) PHANOTATE: a novel approach to gene identification in phage genomes. Bioinformatics 35:4537–4542. https://doi.org/10.1093/bioinformatics/btz265
Acknowledgments
This work was supported by the National Science Centre, Poland (grant number 2016/22/E/NZ8/00340).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Dziurzynski, M., Decewicz, P., Ciuchcinski, K., Gorecki, A., Dziewit, L. (2021). Simple, Reliable, and Time-Efficient Manual Annotation of Bacterial Genomes with MAISEN. In: Mengoni, A., Bacci, G., Fondi, M. (eds) Bacterial Pangenomics. Methods in Molecular Biology, vol 2242. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1099-2_14
Download citation
DOI: https://doi.org/10.1007/978-1-0716-1099-2_14
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1098-5
Online ISBN: 978-1-0716-1099-2
eBook Packages: Springer Protocols