Skip to main content

Simple, Reliable, and Time-Efficient Manual Annotation of Bacterial Genomes with MAISEN

  • Protocol
  • First Online:
Bacterial Pangenomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 2242))

Abstract

Over the last 15 years, the costs of DNA sequencing have sharply fallen, effectively shifting the costs of DNA analysis from sequencing to bioinformatic curation and storage. A huge number of available DNA sequences (including genomes and metagenomes) resulted in the development of various tools for sequence annotation. While much effort has been invested into the development of automatic annotation pipelines, manual curation of their results is still necessary in order to obtain a reliable and strictly validated data. Unfortunately, due to its time-consuming nature, manual annotation is now rarely used.

In this chapter, a protocol for efficient manual annotation of prokaryotic DNA sequences using a novel bioinformatic tool—MAISEN (http://maisen.ddlemb.com), is presented. MAISEN is a free, web-based tool designed to accelerate manual annotation, by providing the user with simple interface and precomputed alignments for each predicted feature. It was designed to be available for every scientist, regardless of their bioinformatic proficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Brettin T, Davis JJ, Disz T et al (2015) RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep 5:8365. https://doi.org/10.1038/srep08365

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Seemann T (2014) Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. https://doi.org/10.1093/bioinformatics/btu153

    Article  CAS  PubMed  Google Scholar 

  3. Aziz RK, Bartels D, Best AA et al (2008) The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. https://doi.org/10.1186/1471-2164-9-75

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Wattam AR, Brettin T, Davis JJ et al (2018) Assembly, annotation, and comparative genomics in PATRIC, the all bacterial bioinformatics resource center. Methods Mol Biol 1704:79–101. https://doi.org/10.1007/978-1-4939-7463-4_4

    Article  CAS  PubMed  Google Scholar 

  5. McNeil LK, Reich C, Aziz RK et al (2007) The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation. Nucleic Acids Res 35:D347–D353. https://doi.org/10.1093/nar/gkl947

    Article  CAS  PubMed  Google Scholar 

  6. Allen B, Drake M, Harris N, Sullivan T (2017) Using KBase to assemble and annotate prokaryotic genomes. Curr Protoc Microbiol 46:1E.13.1–1E.13.18. https://doi.org/10.1002/cpmc.37

    Article  Google Scholar 

  7. Dunn NA, Unni DR, Diesh C et al (2019) Apollo: Democratizing genome annotation. PLoS Comput Biol 15:1–14. https://doi.org/10.1371/journal.pcbi.1006790

    Article  CAS  Google Scholar 

  8. Carver T, Berriman M, Tivey A et al (2008) Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics 24:2672–2676

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. https://doi.org/10.1038/nmeth.3176

    Article  CAS  PubMed  Google Scholar 

  10. Eddy SR (2011) Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. https://doi.org/10.1371/journal.pcbi.1002195

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Apweiler R, Bairoch A, Wu CH et al (2004) UniProt: the universal protein knowledgebase. Nucleic Acids Res 32:D115–D119

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Agarwala R, Barrett T, Beck J et al (2018) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 46:D8–D13. https://doi.org/10.1093/nar/gkx1095

    Article  CAS  Google Scholar 

  13. Wee Y, Bhyan SB, Liu Y et al (2019) The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing. Brief Funct Genomics 18:1–12. https://doi.org/10.1093/bfgp/ely037

    Article  CAS  PubMed  Google Scholar 

  14. Sohn J, Nam J-W (2016) The present and future of de novo whole-genome assembly. Brief Bioinform 19:23–40. https://doi.org/10.1093/bib/bbw096

    Article  CAS  Google Scholar 

  15. Hyatt D, Chen G-L, LoCascio PF et al (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. https://doi.org/10.1186/1471-2105-11-119

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Lowe TM, Chan PP (2016) tRNAscan-SE on-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res 44:W54–W57. https://doi.org/10.1093/nar/gkw413

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Laslett D, Canback B (2004) ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. https://doi.org/10.1093/nar/gkh152

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. McNair K, Zhou C, Dinsdale EA et al (2019) PHANOTATE: a novel approach to gene identification in phage genomes. Bioinformatics 35:4537–4542. https://doi.org/10.1093/bioinformatics/btz265

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgments

This work was supported by the National Science Centre, Poland (grant number 2016/22/E/NZ8/00340).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lukasz Dziewit .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Dziurzynski, M., Decewicz, P., Ciuchcinski, K., Gorecki, A., Dziewit, L. (2021). Simple, Reliable, and Time-Efficient Manual Annotation of Bacterial Genomes with MAISEN. In: Mengoni, A., Bacci, G., Fondi, M. (eds) Bacterial Pangenomics. Methods in Molecular Biology, vol 2242. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1099-2_14

Download citation

  • DOI: https://doi.org/10.1007/978-1-0716-1099-2_14

  • Published:

  • Publisher Name: Humana, New York, NY

  • Print ISBN: 978-1-0716-1098-5

  • Online ISBN: 978-1-0716-1099-2

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics