Abstract
Since most microorganisms in natural environments cannot be cultured in laboratory media, due to low growth rates or their dependency on specific conditions, culture-based systems are unable to estimate the full microbial diversity of an environment. However, molecular techniques can provide this ability by analysing the diversity of macromolecules, such as proteins, RNA or DNA, present in the environment. Metagenomic methods employ sequencing procedures for the determination of the microbial diversity of a community (sequence-driven metagenomic analysis) or for examining a particular functional ability of microorganisms in the environment (function-driven metagenomic gene identification), using genomic DNA obtained directly from environmental samples. Application of metagenomic methods provides a huge amount of data that can be analysed only by using powerful computational bioinformatics tools. Currently, these bioinformatic tools are adequate to allow the ecological structure of a community and the possible functions of its members to be determined. The resulting data can be useful for phylogenetic and biotechnological studies. This paper reviews the emergence of new technologies based on sequencing environmental DNA and on bioinformatics. We assess its potential for application in environmental studies, particularly for developing new biomolecules that can be applied to degrading recalcitrant pollutants in terrestrial and aquatic systems.
Similar content being viewed by others
References
Aagaard K, Petrosino J, Keitel W, Watson M, Katancik J, Garcia N, Patel S, Cutting M, Madden T, Hamilton H (2013) The Human Microbiome Project strategy for comprehensive sampling of the human microbiome and why it matters. FASEB J 27(3):1012–1022
Abarenkov K, Nilsson RH, Larsson KH, Alexander IJ, Eberhardt U, Erland S, Hoiland K, Kjoller R, Larsson E, Pennanen T, Sen R, Taylor SAF, Tedersoo L, Ursing BM, Vralstad T, Liimatainen K, Peinter U, Koljalg U (2010) The UNITE database for molecular identification of fungi–recent updates and future perspectives. New Phytol 186:281–285
Abbai NS, Govender A, Shaik R, Pillay B (2012) Pyrosequence analysis of unamplified and whole genome amplified DNA from hydrocarbon-contaminated groundwater. Mol Biotechnol 50(1):39–48
Abbasian F, Saberbaghi T (2013) Metagenomic study of human gastrointestinal tracts in health and diseases. J Gastroenterol Hepatol Res 2(12):885–896
Acland A, Agarwala R, Barrett T, Beck J, Benson DA, Bollin C, Bolton E, Bryant SH, Canese K, Church DM (2013) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 41(D1):D8–D20
Alioto T, Picardi E, Guigó R, Pesole G (2013) ASPic-GeneID: a lightweight pipeline for gene prediction and alternative isoforms detection. BioMed Res Int. doi:10.1155/2013/502827
Allen HK, Moe LA, Rodbumrer J, Gaarder A, Handelsman J (2009) Functional metagenomics reveals diverse β-lactamases in a remote Alaskan soil. ISME J 3(2):243–251
Andoh A, Imaeda H, Aomatsu T, Inatomi O, Bamba S, Sasaki M, Saito Y, Tsujikawa T, Fujiyama Y (2011) Comparison of the fecal microbiota profiles between ulcerative colitis and Crohn’s disease using terminal restriction fragment length polymorphism analysis. J Gastroenterol 46(4):479–486
Ansorge WJ (2009) Next-generation DNA sequencing techniques. New Biotechnol 25(4):195–203
Ariyaratne PN, Sung W-K (2011) PE-Assembler: de novo assembler using short paired-end reads. Bioinformatics 27(2):167–174
Au KF, Underwood JG, Lee L, Wong WH (2012) Improving PacBio long read accuracy by short read alignment. PLoS ONE 7:e46679
Bacci G, Bazzicalupo M, Benedetti A, Mengoni A (2013) StreamingTrim 1.0: a Java software for dynamic trimming of 16S rRNA sequence data from metagenetic studies. Mol Ecol Resour 14:426–434
Bairoch A, Apweiler R (2000) The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res 28:45–48
Baker BJ, Hugenholtz P, Dawson SC, Banfield JF (2003) Extremely acidophilic protists from acid mine drainage host Rickettsiales-lineage endosymbionts that have intervening sequences in their 16S rRNA genes. Appl Environ Microbiol 69(9):5512–5518
Bartlett D (2008) Bacterial adaptation to extremes of low temperature and elevated pressure. Paper presented at the 37th COSPAR Scientific Assembly
Bazinet AL, Cummings MP (2012) A comparative evaluation of sequence classification programs. BMC Bioinformatics 13(1):92
Béja O, Aravind L, Koonin EV, Suzuki MT, Hadd A, Nguyen LP, Jovanovich SB, Gates CM, Feldman RA, Spudich JL (2000) Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science 289(5486):1902–1906
Birney E, Clamp M, Durbin R (2004) GeneWise and genomewise. Genome Res 14(5):988–995
Bonham-Carter O, Ali H, Bastola D (2013) A base composition analysis of natural patterns for the preprocessing of metagenome sequences. BMC Bioinformatics 14(Suppl 11):S5
Brady A, Salzberg SL (2009) Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models. Nat Methods 6(9):673–676
Brulc JM, Antonopoulos DA, Miller MEB, Wilson MK, Yannarell AC, Dinsdale EA, Edwards RE, Frank ED, Emerson JB, Wacklin P (2009) Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proc Natl Acad Sci 106(6):1948–1953
Burge SW, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki EP, Eddy SR, Gardner PP, Bateman A (2013) Rfam 11.0: 10 years of RNA families. Nucleic Acids Res 41:D226–D232
Butler P, Tomczyk M, Smith S, Berger J, Nasarabadi S, Park C, Smith M, Quail M (2012) Universal NGS library preparation on the Apollo 324 TM System: automated ion torrent personal genome machine library preparation. J Biomol Tech 23:S33
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Lozupone CA, Turnbaugh PJ, Fierer N, Knight R (2011) Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci 108(Supplement 1):4516–4522
Cardoso AM, Coutinho FH (2012) Metagenomics in polluted aquatic environments
Cartegni L, Wang J, Zhu Z, Zhang MQ, Krainer AR (2003) ESEfinder: a web resource to identify exonic splicing enhancers. Nucleic Acids Res 31(13):3568–3571
Caspi R, Altman T, Billington R, Dreher K, Foerster H, Fulcher CA, Holland TA, Keseler IM, Kothari A, Kubo A, Krummenacker M, Latendresse M, Mueller LA, Ong Q, Paley S, Subhraveti P, Weaver DS, Weerasinghe D, Zhang P, Karp PD (2014) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 42:D459–D471
Cawley S (2000) The Genscan HMM
Chai B, Cole JR (2014) 169. RDP: data and tools for studying structure and function of microbial communities. In: 2014 genomic science contractor-grantee meeting XII, p 295
Chang JI, Lin C-C (2006) A study of storage tank accidents. J Loss Prev Process Ind 19(1):51–59
Chang JY, Antonopoulos DA, Kalra A, Tonelli A, Khalife WT, Schmidt TM, Young VB (2008) Decreased diversity of the fecal microbiome in recurrent clostridium difficile: associated diarrhea. J Infect Dis 197(3):435–438
Chattopadhyay MK (2006) Mechanism of bacterial adaptation to low temperature. J Biosci 31(1):157–165
Chen C, Khaleel SS, Huang H, Wu CH (2013) ngsShoRT: a software for pre-processing illumina short read sequences for de novo genome assembly. Paper presented at the proceedings of the international conference on bioinformatics, computational biology and biomedical informatics
Church GM, Vigneault F, Sismour AM (2011) Enzymatic oligonucleotide pre-adenylation: google patents
Cong Q, Grishin NV (2012) MESSA: MEta-server for protein sequence analysis. BMC Biol 10(1):82
Consortium Human Microbiome Project (2012) Structure, function and diversity of the healthy human microbiome. Nature 486(7402):207–214
Cox MP, Peterson DA, Biggs PJ (2010) SolexaQA: at-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11(1):485
Criscuolo A, Brisse S (2013) AlienTrimmer: a tool to quickly and accurately trim off multiple short contaminant sequences from high-throughput sequencing reads. Genomics 102(5):500–506
Cryan JF, Dinan TG (2012) Mind-altering microorganisms: the impact of the gut microbiota on brain and behaviour. Nat Rev Neurosci 13(10):701–712
Dalhus B, Saarinen M, Sauer UH, Eklund P, Johansson K, Karlsson A, Ramaswamy S, Bjork A, Synstad B, Naterstad K (2002) Structural basis for thermophilic protein stability: structures of thermophilic and mesophilic malate dehydrogenases. J Mol Biol 318(3):707–721
Davenport CF, Tümmler B (2013) Advances in computational analysis of metagenome sequences. Environ Microbiol 15(1):1–5
de Lima Morais DA, Fang H, Rackham OJ, Wilson D, Pethica R, Chothia C, Gough J (2011) SUPERFAMILY 1.75 including a domain-centric gene ontology method. Nucleic Acids Res 39:D427–D434
de Marvao A, Dawes T, Keenan NG, Bai W, Shi W, Durighel G, Diamond T, Monje-Garcia L, Cook SA, O’Regan DP (2013) The UK GenScan study-population-based imaging genetics research using 3D Cardiac Magnetic Resonance. J Cardiovasc Magn Reson 15(Suppl 1):E2
Del Fabbro C, Scalabrin S, Morgante M, Giorgi FM (2013) An extensive evaluation of read trimming effects on illumina NGS data analysis. PLoS ONE 8(12):e85024
dela Torre R, Larkin J, Singer A, Meller A (2012) Fabrication and characterization of solid-state nanopore arrays for high-throughput DNA sequencing. Nanotechnology 23(38):385308
Delcher AL, Harmon D, Kasif S, White O, Salzberg SL (1999) Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27(23):4636–4641
Della Sala G, Hochmuth T, Costantino V, Teta R, Gerwick W, Gerwick L, Piel J, Mangoni A (2013) Polyketide genes in the marine sponge Plakortis simplex: a new group of mono-modular type I polyketide synthases from sponge symbionts. Environ Microbiol Rep 5(6):809–818
Delseny M, Han B, Hsing YI (2010) High throughput DNA sequencing: the new sequencing revolution. Plant Sci 179(5):407–422
Denef VJ, Kalnejais LH, Mueller RS, Wilmes P, Baker BJ, Thomas BC, Ver Berkmoes NC, Hettich RL, Banfield JF (2010) Proteogenomic basis for ecological divergence of closely related bacteria in natural acidophilic microbial communities. Proc Natl Acad Sci 107(6):2383–2390
Desai N, Antonopoulos D, Gilbert JA, Glass EM, Meyer F (2012) From genomics to metagenomics. Curr Opin Biotechnol 23(1):72–76
Desmet F-O, Hamroun D, Lalande M, Collod-Béroud G, Claustres M, Béroud C (2009) Human splicing finder: an online bioinformatics tool to predict splicing signals. Nucleic Acids Res 37(9):e67–e67
Diaz NN, Krause L, Goesmann A, Niehaus K, Nattkemper TW (2009) TACOA–Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach. BMC Bioinformatics 10(1):56
Dionisi HM, Lozada M, Olivera NL (2012) Bioprospection of marine microorganisms: biotechnological applications and methods. Rev Argent Microbiol 44(1):49–60
Dohm JC, Lottaz C, Borodina T, Himmelbauer H (2007) SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing. Genome Res 17(11):1697–1706
Edwards KJ, Bond PL, Druschel GK, McGuire MM, Hamers RJ, Banfield JF (2000) Geochemical and biological aspects of sulfide mineral dissolution: lessons from Iron Mountain, California. Chem Geol 169(3):383–397
Eisenstein M (2012) Oxford Nanopore announcement sets sequencing sector abuzz. Nat Biotechnol 30:295–296
Eschenfeldt WH, Stols L, Rosenbaum H, Khambatta ZS, Quaite-Randall E, Wu S, Kilgore DC, Trent JD, Donnelly MI (2001) DNA from uncultured organisms as a source of 2, 5-diketo-D-gluconic acid reductases. Appl Environ Microbiol 67(9):4206–4214
Faheem M, Vidal JFD, Fernandes JPC, Oliveira GM, Caroline CF, Souto BM, Parachin NS, Quirino BF, Barbosa JARG (2013) Cloning and initial characterization of cellulolytic enzymes selected in a metagenomic approach. Paper presented at the Embrapa Agroenergia-Resumo em anais de congresso (ALICE)
Fakruddin RMM, Chowdhury A, Hossain N, Mahajan S, Islam S (2013) Pyrosequencing-a next generation sequencing technology. World Appl Sci J 24(12):1558–1571
Favier CF, de Vos Willem M, Akkermans ADL (2003) Development of bacterial and bifidobacterial communities in feces of newborn babies. Anaerobe 9(5):219–229
Felsenstein J (2006) PHYLIP (Phylogeny Inference Package) documentation files, version 3.66. Seattle: Department of Genome Sciences, University of Washington
Ferrarini M, Moretto M, Ward JA, Šurbanovski N, Stevanović V, Giongo L, Viola R, Cavalieri D, Velasco R, Cestaro A, Sargent DJ (2013) An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome. BMC Genomics 14:670
Finegold SM (2008) Therapy and epidemiology of autism–clostridial spores as key elements. Med Hypotheses 70(3):508–511
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer EL, Tate J, Punta M (2014) Pfam: the protein families database. Nucleic Acids Res 42:D222–D230
Garcia-Armisen T, Vercammen K, Passerat J, Triest D, Servais P, Cornelis P (2011) Antimicrobial resistance of heterotrophic bacteria in sewage-contaminated rivers. Water Res 45(2):788–796
Garmendia L, Hernandez A, Sanchez MB, Martinez JL (2012) Metagenomics and antibiotics. Clin Microbiol Infect 18(s4):27–31
Glass EM, Wilkening J, Wilke A, Antonopoulos D, Meyer F (2010) Using the metagenomics RAST server (MG-RAST) for analyzing shotgun metagenomes. Cold Spring Harbor Protocols, 2010(1), pdb. prot5368
Gołębiewski M, Deja-Sikora E, Cichosz M, Tretyn A, Wróbel B (2014) 16S rDNA pyrosequencing analysis of bacterial community in heavy metals polluted soils. Microb Ecol 67(3):635–647
Gong J-S, Lu Z-M, Li H, Zhou Z-M, Shi J-S, Xu Z-H (2013) Metagenomic technology and genome mining: emerging areas for exploring novel nitrilases. Appl Microbiol Biotechnol 97(15):6603–6611
Gosalbes MJ, Durbán A, Pignatelli M, Abellan JJ, Jiménez-Hernández N, Pérez-Cobas AE, Latorre A, Moya A (2011) Metatranscriptomic approach to analyze the functional human gut microbiota. PLoS ONE 6(3):e17447
Green SJ, Leigh MB, Neufeld JD (2010) Denaturing gradient gel electrophoresis (DGGE) for microbial community analysis handbook of hydrocarbon and lipid. Microbiology, pp 4137–4158
Grigoriev IV, Nordberg H, Shabalov I, Aerts A, Cantor M, Goodstein D, Kuo A, Minovitsky S, Nikitin R, Ohm RA, Otillar R, Poliakov A, Ratnere I, Riley R, Smirnova T, Rokhsar D, Dubchak I (2012) The genome portal of the department of energy joint genome institute. Nucleic Acids Res 40:D26–D32
Guo X, Ding X, Meng Y, Pan Y (2013) Cloud computing for de novo metagenomic sequence assembly. Bioinformatics Research and Applications. Springer, pp 185–198
Gutierrez T (2011) Identifying polycyclic aromatic hydrocarbon-degrading bacteria in oil-contaminated surface waters at Deepwater Horizon by cultivation, stable isotope probing and pyrosequencing. Rev Environ Sci BioTechnol 10(4):301–305
Gutmanas A, Alhroub Y, Battle GM, Berrisford JM, Bochet E, Conroy MJ, Dana JM, Fernandez Montecelo MA, van Ginkel G, Gore SP, Haslam P, Hatherley R, Hendrickx PM, Hirshberg M, Lagerstedt I, Mir S, Mukhopadhyay A, Oldfield TJ, Patwardhan A, Rinaldi L, Sahni G, Sanz-García E, Sen S, Slowley RA, Velankar S, Wainwright ME, Kleywegt GJ (2014) PDBe: protein data bank in Europe. Nucleic Acids Res 42:D285–D291
Haiser HJ, Turnbaugh PJ (2013) Developing a metagenomic view of xenobiotic metabolism. Pharmacol Res 69(1):21–31
Hamady M, Knight R (2009) Microbial community profiling for human microbiome projects: tools, techniques, and challenges. Genome Res 19(7):1141–1152
Han D, Pal S, Yang Y, Jiang S, Nangreave J, Liu Y, Yan H (2013) DNA gridiron nanostructures based on four-arm junctions. Science 339(6126):1412–1415
Haque MM, Ghosh TS, Komanduri D, Mande SS (2009) SOrt-ITEMS: sequence orthology based approach for improved taxonomic estimation of metagenomic sequences. Bioinformatics 25(14):1722–1730
Hebsgaard SM, Korning PG, Tolstrup N, Engelbrecht J, Rouzé P, Brunak S (1996) Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information. Nucleic Acids Res 24(17):3439–3452
Heijtz RD, Wang S, Anuar F, Qian Y, Björkholm B, Samuelsson A, Hibberd ML, Forssberg H, Pettersson S (2011) Normal gut microbiota modulates brain development and behavior. Proc Natl Acad Sci 108(7):3047–3052
Henson J, Tischler G, Ning Z (2012) Next-generation sequencing and large genome assemblies. Pharmacogenomics 13(8):901–915
Higashi S, Barreto AMS, Cantão ME, de Vasconcelos ATR (2012) Analysis of composition-based metagenomic classification. BMC Genom 13(Suppl 5):S1
Hoff KJ, Tech M, Lingner T, Daniel R, Morgenstern B, Meinicke P (2008) Gene prediction in metagenomic fragments: a large scale machine learning approach. BMC Bioinformatics 9(1):217
Hoff KJ, Lingner T, Meinicke P, Tech M (2009) Orphelia: predicting genes in metagenomic sequencing reads. Nucleic Acids Res 37(suppl 2):W101–W105
Hoffman JI, Clark MS, Amos W, Peck LS (2012) Widespread amplification of amplified fragment length polymorphisms (AFLPs) in marine Antarctic animals. Polar Biol 35(6):919–929
Hooper LV, Gordon JI (2001) Commensal host-bacterial relationships in the gut. Science 292(5519):1115–1118
Huang T, He Z-S, Cui W-R, Cai Y-D, Shi X-H, Hu L-L, Chou K-C (2013) A sequence-based approach for predicting protein disordered regions. Protein Pept Lett 20(3):243–248
Hugenholtz P, Tyson GW (2008) Metagenomics. Nature 455(25):481–483
Husi H, Skipworth RJ, Fearon KCH, Ross JA (2013) LSCluster, a large-scale sequence clustering and aligning software for use in partial identity mapping and splice-variant analysis. J Proteom 84:185–189
Inskeep WP, Rusch DB, Jay ZJ, Herrgard MJ, Kozubal MA, Richardson TH, Macur RE, Hamamura N, deM Jennings R, Fouke BW (2010) Metagenomes from high-temperature chemotrophic systems reveal geochemical controls on microbial community structure and function. PLoS ONE 5(3):e9773
Jabbour D, Sorger A, Sahm K, Antranikian G (2013) A highly thermoactive and salt-tolerant α-amylase isolated from a pilot-plant biogas reactor. Appl Microbiol Biotechnol 97(7):2971–2978
Jianping XU (2006) Microbial ecology in the age of genomics and metagenomics: concepts, tools, and recent advances. Mol Ecol 15:1713–1731
Jones DS, Albrecht HL, Dawson KS, Schaperdoth I, Freeman KH, Pi Y, Pearson A, Macalady JL (2011) Community genomic analysis of an extremely acidophilic sulfur-oxidizing biofilm. ISME J 6(1):158–170
Joshi GK, Jugran J, Bhatt JP (2014) Metagenomics: the exploration of unculturable microbial world advances in biotechnology, Springer, pp 105–115
Julian TR, Schwab KJ (2012) Challenges in environmental detection of human viral pathogens. Curr Opin Virol 2(1):78–83
Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M (2012) KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res 40:D109–D114
Kankainen M, Ojala T, Holm L (2012) BLANNOTATOR: enhanced homology-based function prediction of bacterial proteins. BMC Bioinformatics 13(1):33
Katara J, Deshmukh R, Singh NK, Kaur S (2013) Diversity analysis of Bacillus thuringiensis isolates recovered from diverse habitats in india using random amplified polymorphic DNA (RAPD) markers. J Biol Sci 13(5):514–520
Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30(4):772–780
Kelley DR, Liu B, Delcher AL, Pop M, Salzberg SL (2012) Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering. Nucleic Acids Res 40(1):e9–e9
Keseler IM, Mackie A, Peralta-Gil M, Santos-Zavaleta A, Gama-Castro S, Bonavides-Martínez C, Fulcher C, Huerta AM, Kothari A, Krummenacker M, Latendresse M, Muñiz-Rascado L, Ong Q, Paley S, Schröder I, Shearer AG, Subhraveti P, Travers M, Weerasinghe D, Weiss V, Collado-Vides J, Gunsalus RP, Paulsen I, Karp PD (2013) EcoCyc: fusing model organism databases with systems biology. Nucleic Acids Res 41:D605–D612
Kim HS, Mittenthal JE, Caetano-Anollés G (2006) MANET: tracing evolution of protein architecture in metabolic networks. BMC Bioinform 7:351
Kim EY, Oh KH, Lee M-H, Kang CH, Oh TK, Yoon JH (2009) Novel cold-adapted alkaline lipase from an intertidal flat metagenome and proposal for a new family of bacterial lipases. Appl Environ Microbiol 75(1):257–260
King RW, Bauer JD, Brady SF (2009) An environmental DNA-derived type ii polyketide biosynthetic pathway encodes the biosynthesis of the pentacyclic polyketide erdacin. Angew Chem Int Ed 48(34):6257–6261
Kinjo AR, Suzuki H, Yamashita R, Ikegawa Y, Kudou T, Igarashi R, Kengaku Y, Cho H, Standley DM, Nakagawa A, Nakamura H (2012) Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format. Nucleic Acids Res 40:D453–D460
Kircher M, Kelso J (2010) High-throughput DNA sequencing–concepts and limitations. BioEssays 32(6):524–536
Klenk HP, Göker M (2010) En route to a genome-based classification of Archaea and Bacteria. Syst Appl Microbiol 33(4):175–182
Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER (2013) The next-generation sequencing revolution and its impact on genomics. Cell 155(1):27–38
Kocot KM, Citarella MR, Moroz LL, Halanych KM (2013) PhyloTreePruner: a phylogenetic tree-based approach for selection of orthologous sequences for phylogenomics. Evol Bioinform Online 9:429
Kong Y (2011) Btrim: a fast, lightweight adapter and quality trimming program for next-generation sequencing technologies. Genomics 98(2):152–153
Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, Wang Z, Rasko DA, McCombie WR, Jarvis ED, Phillippy AM (2012) Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat Biotechnol 30:693–700
Köser CU, Holden MTG, Ellington MJ, Cartwright EJP, Brown NM, Ogilvy-Stuart AL, Hsu Li Y, Chewapreecha C, Croucher NJ, Harris SR (2012) Rapid whole-genome sequencing for investigation of a neonatal MRSA outbreak. N Engl J Med 366(24):2267–2275
Kosuge T, Mashima J, Kodama Y, Fujisawa T, Kaminuma E, Ogasawara O, Okubo K, Takagi T, Nakamura Y (2014) DDBJ progress report: a new submission system for leading to a correct annotation. Nucleic Acids Res 42:D44–D49
Kotlar HK, Lewin A, Johansen J, Throne-Holst M, Haverkamp T, Markussen S, Winnberg A, Ringrose P, Aakvik T, Ryeng E (2011) High coverage sequencing of DNA from microorganisms living in an oil reservoir 25 kilometres subsurface. Environ Microbiol Rep 3(6):674–681
Ku CS, Roukos DH (2013) From next-generation sequencing to nanopore sequencing technology: paving the way to personalized genomic medicine. Expert Rev Med Devices 10(1):1–6
Kuang X, Han JG, Zhao N, Pang B, Shyu C-R, Korkin D (2012) DOMMINO: a database of macromolecular interactions. Nucleic Acids Res 40:D501–D506
LeBlanc JG, Milani C, de Giori GS, Sesma F, van Sinderen D, Ventura M (2013) Bacteria as vitamin suppliers to their host: a gut microbiota perspective. Curr Opin Biotechnol 24(2):160–168
Lees JG, Lee D, Studer RA, Dawson NL, Sillitoe I, Das S, Yeats C, Dessailly BH, Rentzsch R, Orengo CA (2014) Gene3D: multi-domain annotations for protein sequence and comparative genome analysis. Nucleic Acids Res 42(D1):D240–D245
Leggett RM, Clavijo BJ, Clissold L, Clark MD, Caccamo M (2013) NextClip: an analysis and read preparation tool for Nextera long mate pair libraries. Bioinformatics btt 702:1–3
Lepage P, Leclerc MC, Joossens M, Mondot S, Blottière HM, Raes J, Ehrlich D, Doré J (2013) A metagenomic insight into our gut’s microbiome. Gut 62(1):146–158
Lewin A, Wentzel A, Valla S (2012) Metagenomics of microbial life in extreme temperature environments. Curr Opin Biotechnol 24:1–10
Lewin A, Wentzel A, Valla S (2013) Metagenomics of microbial life in extreme temperature environments. Curr Opin Biotechnol 24(3):516–525
Lewis TE, Sillitoe I, Andreeva A, Blundell TL, Buchan DWA, Chothia C, Cuff A, Dana JM, Filippis I, Gough J (2013) Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains. Nucleic Acids Res 41(D1):D499–D507
Ley RE, Peterson DA, Gordon JI (2006) Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell 124(4):837–848
Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20(2):265–272
Licata L, Briganti L, Peluso D, Perfetto L, Iannuccelli M, Galeota E, Sacco F, Palma A, Nardozza AP, Santonico E, Castagnoli L, Cesareni G (2012) MINT, the molecular interaction database: 2012 update. Nucleic Acids Res 40:D857–D861
Liles MR, Manske BF, Bintrim SB, Handelsman J, Goodman RM (2003) A census of rRNA genes and linked genomic sequences within a soil metagenomic library. Appl Environ Microbiol 69(5):2684–2691
Liu Y, Schmidt B, Maskell DL (2011a) Parallelized short read assembly of large genomes using de Bruijn graphs. BMC Bioinformatics 12(1):354
Liu Z, Klatt CG, Wood JM, Rusch DB, Ludwig M, Wittekindt N, Tomsho Lynn P, Schuster SC, Ward DM, Bryant DA (2011b) Metatranscriptomic analyses of chlorophototrophs of a hot-spring microbial mat. ISME J 5(8):1279–1290
Liu L, Li Y, Li S, Hu N, He Y, Pong R, Lin D, Lu L, Law M (2012) Comparison of next-generation sequencing systems. BioMed Res Int 2012:11
Liu J, Wang H, Yang H, Zhang Y, Wang J, Zhao F, Qi J (2013) Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms. Nucleic Acids Res 41(1):e3
Logan AC, Katzman M (2005) Major depressive disorder: probiotics may be an adjuvant therapy. Med Hypotheses 64(3):533–538
Lozupone C, Knight R (2005) UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol 71(12):8228–8235
Luciani F, Bull RA, Lloyd AR (2012) Next generation deep sequencing and vaccine design: today and tomorrow. Trends Biotechnol 30(9):443–452
Lukashin AV, Borodovsky M (1998) GeneMark. hmm: new solutions for gene finding. Nucleic Acids Res 26(4):1107–1115
Luo J (2013) Applied bioinformatics tools. In: Rui J, Xuegong Z, Michael QZ (eds) Basics of bioinformatics. Springer, pp 271–301
Maitra RD, Kim J, Dunbar WB (2012) Recent advances in nanopore sequencing. Electrophoresis 33(23):3418–3428
Mani D, Kumar C (2014) Biotechnological advances in bioremediation of heavy metals contaminated ecosystems: an overview with special reference to phytoremediation. Int J Environ Sci Technol 11(3):843–872
Manrao EA, Derrington IM, Laszlo AH, Langford KW, Hopper MK, Gillgren N, Pavlenok M, Niederweis M, Gundlach JH (2012) Reading DNA at single-nucleotide resolution with a mutant MspA nanopore and phi29 DNA polymerase. Nat Biotechnol 30(4):349–353
Margesin R, Schinner F (2001) Biodegradation and bioremediation of hydrocarbons in extreme environments. Appl Microbiol Biotechnol 56:650–663
Martin M, Rahmann S (2012) From cutadapt to sequencetools (sqt): a versatile toolset for sequencing projects. EMBnet J 17(B):35
Masoudi-Nejad A, Narimani Z, Hosseinkhan N (2013a) Next generation sequencing and sequence assembly: methodologies and algorithms. Springer
Masoudi-Nejad A, Narimani Z, Hosseinkhan N (2013b) Emergence of next-generation sequencing next generation sequencing and sequence assembly. Springer, pp 11–39
McWilliam H, Li W, Uludag M, Squizzato S, Park YM, Buso N, Cowley AP, Lopez R (2013) Analysis tool web services from the EMBL-EBI. Nucleic Acids Res 41:W597–W600
Merriman B, Torrent I, Rothberg JM, Team D (2012) Progress in ion torrent semiconductor chip based sequencing. Electrophoresis 33(23):3397–3417
Metzker ML (2009) Sequencing technologies—the next generation. Nat Rev Genet 11(1):31–46
Milanesi L, D’Angelo D, Rogozin IB (1999) GeneBuilder: interactive in silico prediction of gene structure. Bioinformatics 15(7):612–621
Minneci F, Piovesan D, Cozzetto D, Jones DT (2013) FFPred 2.0: improved homology-independent prediction of gene ontology terms for eukaryotic protein sequences. PLoS ONE 8(5):e63754
Mizrachi I (2013) Genbank
Mocali S, Benedetti A (2010) Exploring research frontiers in microbiology: the challenge of metagenomics in soil microbiology. Res Microbiol 161(6):497–505
Mohammed MH, Ghosh TS, Singh NK, Mande SS (2011) SPHINX: an algorithm for taxonomic binning of metagenomic sequences. Bioinformatics 27(1):22–30
Mokili JL, Rohwer F, Dutilh BE (2012) Metagenomics and future perspectives in virus discovery. Curr Opin Virol 2(1):63–77
Nakamura S, Maeda N, Miron IM, Yoh M, Izutsu K, Kataoka C, Honda T, Yasunaga T, Nakaya T, Kawai J (2008) Metagenomic diagnosis of bacterial infections. Emerg Infect Dis 14(11):1784
Nakamura S, Yang C-S, Sakon N, Ueda M, Tougan T, Yamashita A, Goto N, Takahashi K, Yasunaga T, Ikuta K (2009) Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach. PLoS ONE 4(1):e4219
Nelson WC, Wollerman L, Bhaya D, Heidelberg JF (2011) Analysis of insertion sequences in thermophilic cyanobacteria: exploring the mechanisms of establishing, maintaining, and withstanding high insertion sequence abundance. Appl Environ Microbiol 77(15):5458–5466
Ng C, DeMaere MZ, Williams TJ, Lauro FM, Raftery M, Gibson JAE, Andrews-Pfannkoch C, Lewis M, Hoffman JM, Thomas T (2010) Metaproteogenomic analysis of a dominant green sulfur bacterium from Ace Lake, Antarctica. ISME J 4(8):1002–1019
Nimchua T, Uengwetwanit T, Eurwilaichitr L (2012) Metagenomic analysis of novel lignocellulose-degrading enzymes from higher termite guts inhabiting microbes. J Microbiol Biotechnol 22(4):462–469
Noguchi H, Park J, Takagi T (2006) MetaGene: prokaryotic gene finding from environmental genome shotgun sequences. Nucleic Acids Res 34(19):5623–5630
Ono A, Miyazaki R, Sota M (2007) Isolation and characterization of naphthalene-catabolic genes and plasmids from oil-contaminated soil by using two cultivation-independent approaches. Appl Microbiol Biotechnol 74:501–510
Ozer S, Doshi KJ, Xu W, Gutell RR (2011) rCAD: a novel database schema for the comparative analysis of RNA. In: E-Science (e-Science), 2011 IEEE 7th International Conference on, 2011. IEEE, pp 15–22
Pagani I, Liolios K, Jansson J, Chen IM, Smirnova T, Nosrat B, Markowitz VM, Kyrpides NC (2012) The Genomes OnLine Database (GOLD) v.4: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 40:D571–D579
Pannarale P, Catalano D, De Caro G, Grillo G, Leo P, Pappadà G, Rubino F, Scioscia G, Licciulli F (2012) GIDL: a rule based expert system for GenBank intelligent data loading into the molecular biodiversity database. BMC Bioinformatics 13(Suppl 4):S4
Park N, Shirley L, Gu Y, Keane TM, Swerdlow H, Quail MA (2013) An improved approach to mate-paired library preparation for Illumina sequencing. Methods Next Gener Seq 1:10–20
Pearson WR (2014) BLAST and FASTA similarity searching for multiple sequence alignment multiple sequence alignment methods. Springer, pp 75–101
Pertea M, Lin X, Salzberg SL (2001) GeneSplicer: a new computational method for splice site prediction. Nucleic Acids Res 29(5):1185–1190
Pickrell WO, Rees MI, Chung S-K (2012) 1 Next generation sequencing methodologies: an overview. Adv Protein Chem Struct Biol 89:1
Polz MF, Alm EJ, Hanage WP (2013) Horizontal gene transfer and the evolution of bacterial and archaeal population structure. Trends Genet 29(3):170–175
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO (2013) The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 41:D590–D596
Rabausch U, Juergensen J, Ilmberger N, Böhnke S, Fischer S, Schubach B, Schulte M, Streit WR (2013) Functional screening of metagenome and genome libraries for detection of novel flavonoid-modifying enzymes. Appl Environ Microbiol 79(15):4551–4563
Rachamalla MR, Monzoorul HM, Mande SS (2012) TWARIT: an extremely rapid and efficient approach for phylogenetic classification of metagenomic sequences. Gene 505(2):259–265
Radford AD, Chapman D, Dixon L, Chantrey J, Darby AC, Hall N (2012) Application of next-generation sequencing technologies in virology. J Gen Virol 93(Pt 9):1853–1868
Rajendhran J, Gunasekaran P (2011) Microbial phylogeny and diversity: small subunit ribosomal RNA sequence analysis and beyond. Microbiol Res 166(2):99–110
Ramos RTJ, Carneiro AR, Caracciolo PH, Azevedo V, Schneider MPC, Barh D, Silva A (2013) Graphical contig analyzer for all sequencing platforms (G4ALL): a new stand-alone tool for finishing and draft generation of bacterial genomes. Bioinformation 9(11):599
Rankin LM, Gobson JAE, Franzmann PD, Burton Harry R (1999) The chemical stratification and microbial communities of Ace Lake, Antarctica: a review of the characteristics of a marine-derived meromictic lake. Polarforschung 66(1/2):33–52
Rasheed Z, Rangwala H, Barbara D (2012) LSH-Div: species diversity estimation using locality sensitive hashing. Paper presented at the Bioinformatics and Biomedicine (BIBM), 2012 IEEE international conference on
Rasheed Z, Rangwala H, Barbará D (2013) 16S rRNA metagenome clustering and diversity estimation using locality sensitive hashing. BMC Syst Biol 7(4):1–13
Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, Scheutz F, Paxinos EE, Sebra R, Chin CS, Iliopoulos D (2011) Origins of the E. coli strain causing an outbreak of hemolytic–uremic syndrome in Germany. N Engl J Med 365(8):709–717
Reese MG, Eeckman FH, Kulp D, Haussler D (1997) Improved splice site detection in Genie. J Comput Biol 4(3):311–323
Ronquist F, Teslenko M, Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP (2012) MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol 61(3):539–542
Rosen GL, Reichenberger ER, Rosenfeld AM (2011) NBC: the Naive Bayes Classification tool webserver for taxonomic classification of metagenomic reads. Bioinformatics 27(1):127–129
Royo JL, Hidalgo M, Ruiz A (2007) Pyrosequencing protocol using a universal biotinylated primer for mutation detection and SNP genotyping. Nat Protoc 2(7):1734–1739
Ruby JG, Bellare P, DeRisi JL (2013) PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. G3: Genes|Genomes|. Genetics 3(5):865–880
Sabehi G, Massana R, Bielawski JP, Rosenberg M, Delong EF, Béjà O (2003) Novel proteorhodopsin variants from the Mediterranean and Red Seas. Environ Microbiol 5(10):842–849
Sait M, Hugenholtz P, Janssen PH (2002) Cultivation of globally distributed soil bacteria from phylogenetic lineages previously only detected in cultivation-independent surveys. Environ Microbiol 4(11):654–666
Salgado H, Peralta-Gil M, Gama-Castro S, Santos-Zavaleta A, Muñiz-Rascado L, García-Sotelo JS, Weiss V, Solano-Lira H, Martínez-Flores I, Medina-Rivera A, Salgado-Osorio G, Alquicira-Hernández S, Alquicira-Hernández K, López-Fuentes A, Porrón-Sotelo L, Huerta AM, Bonavides-Martínez C, Balderas-Martínez YI, Pannier L, Olvera M, Labastida A, Jiménez-Jacinto V, Vega-Alvarado L, Del Moral-Chávez V, Hernández-Alvarez A, Morett E, Collado-Vides J (2013) RegulonDB v8.0: omics data sets, evolutionary conservation, regulatory phrases, cross-validated gold standards and more. Nucleic Acids Res 41:D203–D213
Sathya TA, Jacob Ani Methew, Khan Mahejibin (2014) Cloning and molecular modelling of pectin degrading glycosyl hydrolase of family 28 from soil metagenomic library. Molecul Biol Rep 41(4):2645–2656
Schimel J, Balser TC, Wallenstein M (2007) Microbial stress-response physiology and its implications for ecosystem function. Ecology 88(6):1386–1394
Schloss PD, Handelsman J (2003) Biotechnological prospects from metagenomics. Curr Opin Biotechnol 14:303–310
Schloss PD, Handelsman J (2005) Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness. Appl Environ Microbiol 71(3):1501–1506
Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ (2009) Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol 75(23):7537–7541
Schmalenberger A, Tebbe CC (2014) Profiling the diversity of microbial communities with single-strand conformation polymorphism (SSCP). Environ Microbiol 1096:71–83
Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W (2000) PipMaker: a web server for aligning two genomic DNA sequences. Genome Res 10(4):577–586
Seifert J, Taubert M, Jehmlich N, Schmidt F, Völker U, Vogt C, Richnow HH, von Bergen M (2012) Protein-based stable isotope probing (protein-SIP) in functional metaproteomics. Mass Spectrom Rev 31(6):683–697
Seitz EM, Brockman JP, Sandler SJ, Clark AJ, Kowalczykowski SC (1998) RadA protein is an archaeal RecA protein homolog that catalyzes DNA strand exchange. Genes Dev 12(9):1248–1253
Shah V, Zakrzewski M, Wibberg D, Eikmeyer F, Schlüter A, Madamwar D (2013) Taxonomic profiling and metagenome analysis of a microbial community from a habitat contaminated with industrial discharges. Microb Ecol 66(3):533–550
Sharma S, Khan FG, Qazi GN (2010) Molecular cloning and characterization of amylase from soil metagenomic library derived from Northwestern Himalayas. Appl Microbiol Biotechnol 86(6):1821–1828
She R, Chu JSC, Wang K, Pei J, Chen N (2009) GenBlastA: enabling BLAST to identify homologous gene sequences. Genome Res 19(1):143–149
Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotechnol 26(10):1135–1145
Shendure JA, Porreca GJ, Church GM, Gardner AF, Hendrickson CL, Kieleczawa J, Slatko BE (2011) Overview of DNA sequencing strategies. Curr Protocol Mole Biol pp 7.1.1–7.1.23
Shokralla S, Spall JL, Gibson JF, Hajibabaei M (2012) Next-generation sequencing technologies for environmental DNA research. Mol Ecol 21(8):1794–1805
Simon C, Daniel R (2011) Metagenomic analyses: past and future trends. Appl Environ Microbiol 77(4):1153–1161
Singh J, Behal A, Singla N, Joshi A, Birbian N, Singh S, Bali V, Batra N (2009) Metagenomics: concept, methodology, ecological inference and recent advances. Biotechnol J 4(4):480–494
Sjöberg F, Nowrouzian F, Rangel I, Hannoun C, Moore E, Adlerberth I, Wold AE (2013) Comparison between terminal-restriction fragment length polymorphism (T-RFLP) and quantitative culture for analysis of infants’ gut microbiota. J Microbiol Methods 94(1):37–46
Smeds L, Künstner A (2011) ConDeTri-a content dependent read trimmer for Illumina data. PLoS ONE 6(10):e26314
Soergel DAW, Dey N, Knight R, Brenner SE (2012) Selection of primers for optimal taxonomic classification of environmental 16S rRNA gene sequences. ISME J 6(7):1440–1444
Solomon KV, Haitjema CH, Thompson DA, O’Malley MA (2014) Extracting data from the muck: deriving biological insight from complex microbial communities and non-model organisms with next generation sequencing. Curr Opin Biotechnol 28:103–110
Stachler E, Bibby K (2014) Metagenomic evaluation of the highly abundant human gut bacteriophage CrAssphage for source tracking of human fecal pollution. Environ Sci Technol Lett 1(10):405–409
Stajich JE, Harris T, Brunk BP, Brestelli J, Fischer S, Harb OS, Kissinger JC, Li W, Nayak V, Pinney DF, Stoeckert CJ Jr, Roos DS (2012) FungiDB: an integrated functional genomics database for fungi. Nucleic Acids Res 40:D675–D681
Steele HL, Streit WR (2005) Metagenomics: advances in ecology and biotechnology. FEMS Microbiol Lett 247(2):105–111
Stefanis C, Alexopoulos A, Voidarou C, Vavias S, Bezirtzoglou E (2013) Principal methods for isolation and identification of soil microbial communities. Folia Microbiol 58(1):61–68
Stranneheim H, Lundeberg J (2012) Stepping stones in DNA sequencing. Biotechnol J 7(9):1063–1073
Su C, Lei L, Duan Y, Zhang K-Q, Yang J (2012) Culture-independent methods for studying environmental microorganisms: methods, application, and perspective. Appl Microbiol Biotechnol 93(3):993–1003
Subramanian AR, Kaufmann M, Morgenstern B (2008) DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment. Algorithms Mol Biol 3(6):1–11
Sunagawa S, Mende DR, Zeller G, Izquierdo-Carrasco F, Berger SA, Kultima JR, Coelho LP, Arumugam M, Tap J, Nielsen HB (2013) Metagenomic species profiling using universal phylogenetic marker genes. Nature Methods 10(12):1196–1199
Szilágyi A, Závodszky P (2000) Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein subunits: results of a comprehensive survey. Structure 8(5):493–504
Taher L, Rinner O, Garg S, Sczyrba A, Brudno M, Batzoglou S, Morgenstern B (2003) AGenDA: homology-based gene prediction. Bioinformatics 19(12):1575–1577
Tamaki H, Wright CL, Li X, Lin Q, Hwang C, Wang S, Thimmapuram J, Kamagata Y, Liu WT (2011) Analysis of 16S rRNA amplicon sequencing options on the Roche/454 next-generation Titanium sequencing platform. PLoS ONE 6(9):e25263
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28(10):2731–2739
Tang S, Antonov I, Borodovsky M (2013) MetaGeneTack: ab initio detection of frameshifts in metagenomic sequences. Bioinformatics 29(1):114–116
Teeling H, Waldmann J, Lombardot T, Bauer M, Glöckner FO (2004) TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences. BMC Bioinformatics 5(1):163
Tehei M, Franzetti B, Madern D, Ginzburg M, Ginzburg BZ, Giudici-Orticoni MT, Bruschi M, Zaccai G (2003) Adaptation to extreme environments: macromolecular dynamics in bacteria compared in vivo by neutron scattering. EMBO Rep 5(1):66–70
Thomas T, Gilbert J, Meyer F (2012) Metagenomics-a guide from sampling to data analysis. Microb Inform Exp 2(3):1–12
Timp W, Mirsaidov UM, Wang D, Comer J, Aksimentiev A, Timp G (2010) Nanopore sequencing: electrical measurements of the code of life. IEEE Trans Nanotechnol 9(3):281–294
Trindade-Silva AE, Rua CPJ, Andrade BGN, Vicente ACP, Silva GGZ, Berlinck RGS, Thompson FL (2013) Polyketide synthase gene diversity within the microbiome of the sponge Arenosclera brasiliensis, endemic to the Southern Atlantic Ocean. Appl Environ Microbiol 79(5):1598–1605
Trivedi S, Rao SR, Gehlot HS (2005) Nucleic acid stability in thermophilic prokaryotes: a review. J Cell Mol Biol 4:61–69
Turnbaugh PJ, Micah Hamady, Tanya Yatsunenko, Cantarel Brandi L, Alexis Duncan, Ley Ruth E, Sogin Mitchell L, Jones William J, Roe Bruce A, Affourtit JasonP (2009) A core gut microbiome in obese and lean twins. Nature 457(7228):480–484
Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, Richardson PM, Solovyev VV, Rubin EM, Rokhsar DS, Banfield JF (2004) Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428(6978):37–43
Ueno A, Ito Y, Yumoto I, Okuyama H (2007) Isolation and characterization of bacteria from soil contaminated with diesel oil and the possible use of these in autochthonous bioaugmentation. World J Microbiol Biotechnol 23(12):1739–1745
Uribe A, Alam M, Johansson O, Midtvedt T, Theodorsson E (1994) Microflora modulates endocrine cells in the gastrointestinal mucosa of the rat. Gastroenterology 107(5):1259–1269
Varin T, Lovejoy C, Jungblut AD, Vincent WF, Corbeil J (2012) Metagenomic analysis of stress genes in microbial mat communities from Antarctica and the High Arctic. Appl Environ Microbiol 78(2):549–559
Verner-Jeffreys DW, Welch TJ, Schwarz T, Pond MJ, Woodward MJ, Haig SJ, Rimmer Georgina SE, Roberts E, Morrison V, Baker-Austin C (2009) High prevalence of multidrug-tolerant bacteria and associated antimicrobial resistance genes isolated from ornamental fish and their carriage water. PLoS ONE 4(12):e8388
Vieites JM, Guazzaroni M-E, Beloqui A, Golyshin PN, Ferrer M (2009) Metagenomics approaches in systems microbiology. FEMS Microbiol Rev 33(1):236–255
von Bergen M, Jehmlich N, Taubert M, Vogt C, Bastida F, Herbst F-A, Schmidt F, Richnow H-H, Seifert J (2013) Insights from quantitative metaproteomics and protein-stable isotope probing into microbial ecology. ISME J 7(10):1877–1885
Wang Q, Qian C, Zhang X-Z, Liu N, Yan X, Zhou Z (2012) Characterization of a novel thermostable β-glucosidase from a metagenomic library of termite gut. Enzyme Microbial Technol 51(6–7):319–324
Wang H, Li X, Ma Y, Song J (2014) Characterization and high-level expression of a metagenome-derived alkaline pectate lyase in recombinant Escherichia coli. Process Biochem 49(1):69–76
Warnecke F, Luginbühl P, Ivanova N, Ghassemian M, Richardson TH, Stege JT, Cayouette M, McHardy AC, Djordjevic G, Aboushadi N (2007) Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite. Nature 450(7169):560–565
Warren RL, Sutton GG, Jones Steven JM, Holt RA (2007) Assembling millions of short DNA sequences using SSAKE. Bioinformatics 23(4):500–501
Wéry N, Monteil C, Pourcher AM, Godon JJ (2010) Human-specific fecal bacteria in wastewater treatment plant effluents. Water Res 44(6):1873–1883
Wiehe T, Gebauer-Jung S, Mitchell-Olds T, Guigó R (2001) SGP-1: prediction and validation of homologous genes based on sequence alignments. Genome Res 11(9):1574–1583
Wikoff WR, Anfora AT, Liu J, Schultz PG, Lesley SA, Peters EC, Siuzdak G (2009) Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proc Natl Acad Sci 106(10):3698–3703
Wilke A, Glass EM, Bischof J, Braithwaite D, D’Souza M, Gerlach W, Harrison T, Keegan K, Matthews H, Paczian T (2013) MG-RAST technical report and manual for version 3.3. 6, revision 2
Williams GJ (2013) Engineering polyketide synthases and nonribosomal peptide synthetases. Curr Opin Struct Biol 23(4):603–612
Wingender E, Dietze P, Karas H, Knüppel R (1996) TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res 24:238–241
Woodhouse JN, Fan L, Brown MV, Thomas T, Neilan BA (2013) Deep sequencing of non-ribosomal peptide synthetases and polyketide synthases from the microbiomes of Australian marine sponges. ISME J 7(9):1842–1851
Wooley JC, Ye Y (2010) Metagenomics: facts and artifacts, and computational challenges. J Comput Sci Technol 25(1):71–81
Wu CH, Apweiler R, Bairoch A, Natale DA, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Mazumder R, O'Donovan C, Redaschi N, Suzek B (2006) The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res 34:D187–D191
Wu D, Doroud L, Eisen JA (2013) TreeOTU: operational taxonomic unit classification based on phylogenetic trees. arXiv preprint arXiv:1308.6333
Xia X (2013) DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution. Mol Biol Evol 30(7):1720–1728
Xie W, Wang F, Guo L, Chen Z, Sievert SM, Meng J, Huang G, Li Y, Yan Q, Wu S (2010) Comparative metagenomics of microbial communities inhabiting deep-sea hydrothermal vent chimneys with contrasting chemistries. ISME J 5(3):414–426
Xiong ZQ (2013) Metagenomic-guided antibiotics discovery. Clin Microbial 2:101
Xiong Z-Q, Wang J-F, Hao Y-Y, Wang Y (2013) Recent advances in the discovery and development of marine microbial natural products. Mar Drugs 11(3):700–717
Yamada M, Nakasone K, Tamegai H, Kato C, Usami R, Horikoshi K (2000) Pressure regulation of soluble cytochromesc in a deep-sea piezophilic bacterium, Shewanella violacea. J Bacteriol 182(10):2945–2952
Yang Y, Yooseph S (2013) SPA: a short peptide assembler for metagenomic data. Nucleic Acids Res 41(8):e91–e91
Yang B, Dai Z, Ding S-Y, Wyman CE (2011) Enzymatic hydrolysis of cellulosic biomass. Biofuels 2(4):421–450
Ye L, Zhang T (2013) Bacterial communities in different sections of a municipal wastewater treatment plant revealed by 16S rDNA 454 pyrosequencing. Appl Microbiol Biotechnol 97(6):2681–2690
Ye Y, Wei B, Wen L, Rayner S (2013) BlastGraph: a comparative genomics tool based on BLAST and graph algorithms. Bioinformatics 29(24):3222–3224
Yok NG, Rosen GL (2011) Combining gene prediction methods to improve metagenomic gene annotation. BMC Bioinformatics 12(1):20
Zakrzewski M, Bekel T, Ander C, Pühler A, Rupp O, Stoye J, Schlüter A, Goesmann A (2012) MetaSAMS–a novel software platform for taxonomic classification, functional annotation and comparative analysis of metagenome datasets. J Biotechnol 167:156–165
Zerbino DR, Birney E (2008) Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18(5):821–829
Zhang A, Zhao R, Jin P, Ma L, Xiong X, Xie T, Pei X, Yu L, Yin X, Wang Q (2013) Discovery of a novel esterase subfamily sharing an identified arm sequence (ArmEst) by gene-specific metagenomic PCR. Biotechnol Lett 35(11):1937–1944
Zhu L, Wu Q, Dai J, Zhang S, Wei F (2011) Evidence of cellulose metabolism by the giant panda gut microbiome. Proc Natl Acad Sci 108(43):17714–17719
Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA (2013) The MaSuRCA genome assembler. Bioinformatics 29(21):2669–2677
Acknowledgments
The authors would like to acknowledge CERAR (Centre for environmental risk assessment and remediation) and CRCCARE (Co-operative Research Centre for Contamination Assessment and Remediation of the Environment) for all their support.
Conflicts of interest
Authors declare no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Abbasian, F., Lockington, R., Megharaj, M. et al. The integration of sequencing and bioinformatics in metagenomics. Rev Environ Sci Biotechnol 14, 357–383 (2015). https://doi.org/10.1007/s11157-015-9365-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11157-015-9365-7