An Agile Functional Analysis of Metagenomic Data Using SUPER-FOCUS

Silva, Genivaldo Gueiros Z.; Lopes, Fabyano A. C.; Edwards, Robert A.

doi:10.1007/978-1-4939-7015-5_4

Genivaldo Gueiros Z. Silva³,
Fabyano A. C. Lopes⁴ &
Robert A. Edwards^3,5,6

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1611))

3034 Accesses
2 Citations
2 Altmetric

Abstract

One of the main goals in metagenomics is to identify the functional profile of a microbial community from unannotated shotgun sequencing reads. Functional annotation is important in biological research because it enables researchers to identify the abundance of functional genes of the organisms present in the sample, answering the question, “What can the organisms in the sample do?” Most currently available approaches do not scale with increasing data volumes, which is important because both the number and lengths of the reads provided by sequencing platforms keep increasing. Here, we present SUPER-FOCUS, SUbsystems Profile by databasE Reduction using FOCUS, an agile homology-based approach using a reduced reference database to report the subsystems present in metagenomic datasets and profile their abundances. SUPER-FOCUS was tested with real metagenomes, and the results show that it accurately predicts the subsystems present in the profiled microbial communities, is computationally efficient, and up to 1000 times faster than other tools. SUPER-FOCUS is freely available at http://edwards.sdsu.edu/SUPERFOCUS.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

HirBin: high-resolution identification of differentially abundant functions in metagenomes

Article Open access 21 April 2017

MetaLAFFA: a flexible, end-to-end, distributed computing-compatible metagenomic functional annotation pipeline

Article Open access 21 October 2020

Metagenomics Bioinformatic Pipeline

Notes

1.
This option can be one of rapsearch, blast, or diamond.
2.
Aligner choice [rapsearch (default), blast, or diamond]

References

Handelsman J (2004) Metagenomics: application of genomics to uncultured microorganisms. Microbiol Mol Biol Rev 68:669–685
Article CAS PubMed PubMed Central Google Scholar
Zhang J, Chiodini R, Badr A et al (2011) The impact of next-generation sequencing on genomics. J Genet Genomics 38:95–109
Article PubMed PubMed Central Google Scholar
T.H.M.P. Consortium (2012) Structure, function and diversity of the healthy human microbiome. Nature 486:207–214
Article Google Scholar
Sunagawa S, Coelho LP, Chaffron S et al (2015) Structure and function of the global ocean microbiome. Science 348:1261359
Article PubMed Google Scholar
Mendoza MLZ, Sicheritz-Pontén T, Gilbert MTP (2015) Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses. Brief Bioinform 16(5):745–758. doi:10.1093/bib/bbv001
Article Google Scholar
Overbeek R, Begley T, Butler RM et al (2005) The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res 33:5691–5702
Article CAS PubMed PubMed Central Google Scholar
Kanehisa M, Goto S (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30
Article CAS PubMed PubMed Central Google Scholar
Caspi R, Altman T, Dale JM et al (2010) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 38:D473–D479
Article CAS PubMed Google Scholar
Altschul SF, Madden TL, Schäffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Article CAS PubMed PubMed Central Google Scholar
Kent WJ (2002) BLAT—The BLAST-like alignment tool. Genome Res 12:656–664
Article CAS PubMed PubMed Central Google Scholar
Zhao Y, Tang H, Ye Y (2012) RAPSearch2: a fast and memory-efficient protein similarity search tool for next-generation sequencing data. Bioinformatics 28:125–126
Article CAS PubMed Google Scholar
Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, et al. MEGAN Community Edition—Interactive Exploration and Analysis of Large-Scale Microbiome Sequencing Data. PLOS Comput. Biol. 2016;12:e1004957
Google Scholar
Mitra S, Rupek P, Richter DC et al (2011) Functional analysis of metagenomes and metatranscriptomes using SEED and KEGG. BMC Bioinformatics 12:S21
Article PubMed PubMed Central Google Scholar
Meyer F, Paarmann D, D’Souza M et al (2008) The metagenomics RAST server—a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics. 9:386
Article CAS PubMed PubMed Central Google Scholar
Edwards RA, Olson R, Disz T et al (2012) Real Time Metagenomics: Using k-mers to annotate metagenomes. Bioinformatics 28:3316–3317
Article CAS PubMed PubMed Central Google Scholar
G.G.Z. Silva, K.T. Green, B.E. Dutilh, et al. (2015) SUPER-FOCUS: a tool for agile functional analysis of shotgun metagenomic data, Bioinformatics. btv584
Google Scholar
Silva GGZ, Cuevas DA, Dutilh BE et al (2014) FOCUS: an alignment-free model to identify organisms in metagenomes using non-negative least squares. PeerJ 2:e425
Article PubMed PubMed Central Google Scholar
Berendzen J, Bruno WJ, Cohn JD et al (2012) Rapid phylogenetic and functional classification of short genomic fragments with signature peptides. BMC Res Notes 5:460
Article CAS PubMed PubMed Central Google Scholar
Rho M, Tang H, Ye Y (2010) FragGeneScan: predicting genes in short and error-prone reads. Nucleic Acids Res 38:e191–e191
Article PubMed PubMed Central Google Scholar
Zhang J, Kobert K, Flouri T et al (2014) PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics 30:614–620
Article CAS PubMed Google Scholar
Magoč T, Salzberg SL (2011) FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics 27:2957–2963
Article PubMed PubMed Central Google Scholar
Parks DH, Tyson GW, Hugenholtz P et al (2014) STAMP: statistical analysis of taxonomic and functional profiles. Bioinformatics 30:3123–3124
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

We thank the SEED curators Dr Ross Overbeek, Dr Veronika Vonstein, and Dr Ramy Aziz for the amazing work on the annotation of subsystems since 2004. GGZS was supported by NSF Grants (CNS-1305112, MCB-1330800, and DUE-132809 to RAE and), and FACL was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES/Brazil) fellowship.

Author information

Authors and Affiliations

Computational Science Research Center, San Diego State University, 5500 Campanile Drive, San Diego, CA, 92182, USA
Genivaldo Gueiros Z. Silva & Robert A. Edwards
Cellular Biology Department, Universidade de Brasília (UnB), 700910-900, Brasília, DF, Brazil
Fabyano A. C. Lopes
Department of Biology, San Diego State University, 5500 Campanile Drive, San Diego, CA, 92182, USA
Robert A. Edwards
Department of Computer Science, San Diego State University, 5500 Campanile Drive, San Diego, CA, 92182, USA
Robert A. Edwards

Authors

Genivaldo Gueiros Z. Silva
View author publications
You can also search for this author in PubMed Google Scholar
Fabyano A. C. Lopes
View author publications
You can also search for this author in PubMed Google Scholar
Robert A. Edwards
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Robert A. Edwards .

Editor information

Editors and Affiliations

Department of Biological Sciences and Computer Science, Purdue University, West Lafayette, Indiana, USA
Daisuke Kihara

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Silva, G.G.Z., Lopes, F.A.C., Edwards, R.A. (2017). An Agile Functional Analysis of Metagenomic Data Using SUPER-FOCUS. In: Kihara, D. (eds) Protein Function Prediction. Methods in Molecular Biology, vol 1611. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7015-5_4

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7015-5_4
Published: 28 April 2017
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7013-1
Online ISBN: 978-1-4939-7015-5
eBook Packages: Springer Protocols

Publish with us

Policies and ethics

An Agile Functional Analysis of Metagenomic Data Using SUPER-FOCUS

Abstract

Access this chapter

Similar content being viewed by others

HirBin: high-resolution identification of differentially abundant functions in metagenomes

MetaLAFFA: a flexible, end-to-end, distributed computing-compatible metagenomic functional annotation pipeline

Metagenomics Bioinformatic Pipeline

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Navigation

An Agile Functional Analysis of Metagenomic Data Using SUPER-FOCUS

Abstract

Access this chapter

Similar content being viewed by others

HirBin: high-resolution identification of differentially abundant functions in metagenomes

MetaLAFFA: a flexible, end-to-end, distributed computing-compatible metagenomic functional annotation pipeline

Metagenomics Bioinformatic Pipeline

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Search

Navigation