Background

Methanogenic archaea are key players in anaerobic communities and as such contribute largely to global warming and energy production, with interesting potential roles in human and animal health [1, 2]. These extraordinary microbes perform a type of anaerobic respiration known as methanogenesis by reduction or dismutation of carbon dioxide, methyl compounds, or acetate to methane or methane and carbon dioxide in several ecosystems and consortia [3]. Methanogenesis operates at a very low reducing potential compared to other forms of aerobic and anaerobic respiration [4]. As such, methanogens can be found in such diverse environments as the deep ocean, rice paddies, wetlands, landfills, and the gastrointestinal tracts of termites, ruminants, and humans, where more favorable electron acceptors like oxygen or inorganic ions are unavailable [1]. Because they constitute a major source of atmospheric methane, there has been a drive to increase understanding of methanogen metabolism and their interactions with other organisms [5]. This understanding of methanogen metabolism is necessary for optimal design of biochemical processes aimed at generating or removing methane [6, 7].

Many studies seek to mitigate negative effects contributed by methanogens. Livestock, as a component of agriculture, are the largest anthropogenic source of atmospheric methane [8]. Certain methanogens thrive in the rumen compartment of cattle, where they partner with cellulose-degrading gut microbes, consuming the hydrogen and carbon dioxide these microbes produce as waste [9]. This phenomenon, known as enteric emission, produces approximately 30% of global methane emissions, a greenhouse gas 25 times more potent than carbon dioxide [5, 8]. The effect of methanogens on human and animal health is less clear. Although there are conflicting reports, many studies describe a direct correlation between the presence of methanogens in human guts and the occurrence of diseases such as periodontitis, obesity, and inflammatory bowel disease [2].

The high energy content of methane has recently called attention to methanogens as a tool in waste-to-energy technologies [7, 10]. As in the rumen and other natural anaerobic environments, methanogens can partner with other microbes in specialized anaerobic reactors to convert organic biomass into methane [11]. This process, termed anaerobic digestion, results in generation of potentially-useful biogas [11]. Recently, anaerobic microbes have received attention for their ability to thrive on crude biomass [12,13,14]. Methanogens enable the growth of other microbes by removing self-limiting waste products, which methanogens assimilate or use as electron donors/acceptors in methanogenesis [3, 15]. For this reason, the substrate utilization capabilities of methanogenic species are of great interest, as is elucidating the details of ATP generation via carbon dioxide reduction. Comparing and contrasting the metabolic pathways in methanogenic species of diverse evolutionary origins will be vital to progress in this area.

Currently, methanogens are grouped in three ways: by energy requirements, based on phylogeny, or by catalytic abilities. Metabolic grouping depends on energy source, which can be i) hydrogen/formate and carbon dioxide, ii) methyl compounds, or iii) acetate [1]. Phylogenetic grouping based on 16S rRNA sequencing comprises only two classes: Class I and Class II [16]. This classification also reflects metabolic differences, with basal Class I lineages requiring hydrogen and carbon dioxide, and sometimes also formate, and the more recently evolved Class II lineages requiring either methyl compounds or acetate [17, 18]. Recently, Anderson et al. conducted a new type of phylogenetic analysis and proposed that Class II be defined to include only species of the Methanomicrobiales order, while the Methanosarcinales would form a new class: Class III [18]. Recently, new genomes and metagenomes sequenced have identified new clades of methanogens outside of the early-established class systems, such as the Methanomassiliicoccales [19] or the newly discovered Methanonatronarchaeia [20].

We sought to better understand metabolic capabilities of methanogens and the biochemical pathways behind them. To this end, we obtained four methanogenic species from diverse environments and sequenced their genomes. Under the three class system [18], Methanobacterium bryantii and Methanosphaera cuniculi are Class I methanogens requiring H2/CO2 and H2/methanol, respectively. Methanocorpusculum parvum is a member of Class II and requires H2/CO2, formate, or 2-propanol/CO2. Finally, Methanosarcina spelaei is a member of Class III and utilizes H2/CO2, acetate, or methyl compounds for growth and methanogenesis. By sequencing the genomes of each member and reconstructing their metabolisms, we seek to elucidate the metabolic pathways and mechanisms implemented by each of the three classes and their contributions to energy, health, and agriculture.

Methods

Methanogen culture and DNA isolation

Methanobacterium bryantii strain M.o.H. (DSM 863), Methanosarcina spelaei strain MC-15 (DSM 26047), Methanosphaera cuniculi strain 1R-7 (DSM 4103), and Methanocorpusculum parvum strain XII (DSM 3823) were ordered from the Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Culture (https://www.dsmz.de/). Cultures were grown in a modified version of M2 medium adapted from Teunissen et al. [21]. The base medium contained 150 mL of Solution A (0.45 g/L KH2PO4, 0.45 g/L (NH4)2SO4, 0.9 g/L NaCl, 0.09 g/L MgSO4·7 H2O, 0.09 g/L CaCl2·2 H2O), 150 mL of Solution B (0.45 g/L K2HP04), 12 g/L NaHCO3, 1 mL of Vitamin Supplement (ATCC MD-VS) supplemented with 0.9 μg cyanocobalamin and 40 mg CoM, 10 mL trace element solution (2.5 mg/mL MnCl2·4H2O, 2.5 mg/mL NiCl2·6H2O, 2.5 mg/mL NaMoO4·2H2O, 2.5 mg/mL H3BO3, 2.0 mg/mL FeSO4·7H2O, 0.5 mg/mL CoCl2·6H2O, 0.5 mg/mL SeO2, 0.5 mg/mL NaVO3·4H2O, 0.25 mg/mL ZnCl2, 0.25 mg/mL CuCl2·2H2O, all dissolved in 0.2 M-HCl), 10 mL hemin solution (1 mg/L hemin, 0.01% (v/v) ethanol, dissolved in 0.05 M NaOH), 1 mL of resazurin solution (0.1% (w/v) resazurin), 1 g L-cysteine-HCl [22]. The total volume of the medium was adjusted to 1 L with MilliQ H2O, boiled vigorously to drive out the oxygen, cooled under CO2, and aliquoted under an 80/20 mixture of H2/CO2 before autoclaving. Vitamin solution and methanol were filter sterilized and added after autoclaving. Mbac. bryantii medium was supplemented with 2 g/L sodium acetate, 4 g/L sodium formate, 2 g/L yeast extract, and 4 g/L bacto casitone. Mcor. parvum medium was supplemented with 2 g/L sodium acetate, 4 g/L sodium formate, 2 g/L yeast extract, 4 g/L bacto casitone, and 10 μM sodium tungstate. Msar. spelaei medium was supplemented with 4.5 g/L sodium chloride, 2 g/L sodium acetate, 2 g/L yeast extract, 2 g/L bacto casitone, and 0.5% (v/v) methanol. Msph. cuniculi medium was supplemented with 3.6 g/L sodium acetate, 2 g/L yeast extract, 2 g/L bacto casitone, and 1% (v/v) methanol. All cultures were grown at 39 °C without shaking. Experiments testing growth on substrates alternative to H2 were conducted in 100% CO2 headspace.

To isolate genomic DNA, 1 mL of methanogens was inoculated into 40 mL of media in 60 mL Wheaton serum bottles until stationary phase (OD600 ~ 0.2–0.5) and then harvested by centrifugation for 30 min at 10,000×g at 4 °C. Cell pellets were resuspended in 0.5 mL TE Buffer (10 mM Tris, 1 mM EDTA, pH 8.0). Sodium dodecyl sulfate was added to a final concentration of 0.5%, proteinase K (New England BioLabs, Ipswitch, MA) was added to 100 μg/mL, and RNaseA (MoBio Laboratories, Carlsbad, CA) was added to 100 μg/mL. The mixture was incubated at 37 °C for 1 h. NaCl was added to 0.5 M, and 0.5 mL of phenol:chloroform:isoamyl alcohol (25:24:1) was added. The solution was mixed and then centrifuged at 13,000×g for 10 min at 4 °C. The aqueous phase was transferred to a new tube and 0.6 mL of isopropyl alcohol was added. The mixture was incubated at −20 °C for ~16 h and then centrifuged at 13,000×g for 5 min at 4 °C. The pellet was washed with 70% ethanol, centrifuged at 13,000×g for 5 min at 4 °C, and finally resuspended in 10 mM Tris buffer pH 8.0 and stored at −20 °C [17].

Library preparation and sequencing

Genomic DNA (gDNA) was prepared for high throughput sequencing (HTS) using the TruSeq DNA PCR-Free library prep kit supplied by Illumina, Inc. (San Diego, CA). Briefly, purified gDNA was first fragmented using a Covaris (Woburn, Massachusetts) M220 Focused Ultrasonicator, followed by end repairs, size selection (~330 bp), end adenylation and paired-end adapters ligation using the kit. Prepped libraries were then quantified using Qubit (Life Technologies, Carlsbad, CA) and TapeStation (Agilent, Santa Clara, CA), before pooling. HTS was performed with an Illumina NextSeq500 sequencer using a 150 cycle, mid output kit (2 × 75 paired-end). Resequencing of Methanosarcina spelaei was completed using a 75 cycle mid output kit (75 bp single-end).

Genome assembly and annotation

Genomes were assembled from 75 bp paired end Illumina NextSeq reads (Illumina, San Diego, CA) at a minimum of 49× coverage and annotated with the Department of Energy Systems Biology Knowledgebase (KBase, http://kbase.us) automated pipeline. Briefly, reads were preprocessed with BayesHammer [23] before being assembled de novo into contigs with the SPAdes and Velvet genome assembly algorithms [24, 25]. Genomic features including ORFs, large repeat regions, rRNAs, CRISPRs, and tRNAs were then identified and annotated with the Rapid Annotations using Subsystems Technology toolkit (RASTtk) [26]. These gene annotations were combined with biochemical information from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [27] to reconstruct the metabolism of each methanogen. The assembly for Methanosphaera cuniculi contained a number of low coverage (~1X) contigs, which were removed from the draft assembly and analysis. Scaffolding of assembled contigs was performed using SSPACE [28].

COG analysis

Coding domains were assigned to Clusters of Orthologous Groups (COG) classes by using the RPSBLAST program, version 2.2.26 against the CDD database [29], which was downloaded from the NCBI website on April 24th, 2015. The COG annotations were used to generate genome maps using a combination of BLAST Ring Image Generator (BRIG) [30] and CGview [31]. A random sample of 100 genomes from the RefSeq database [32] was downloaded on July 7th, 2016. The random genomes were assigned to COG classes using the previously described method. To estimate error in the sampling, the 100 genomes were sampled with replacement 1000 times. Significance was determined through Fisher’s exact test, and verified through difference of medians analysis [33]. For comparative purposes, the fraction of genes assigned to a given COG class was defined as the number of genes assigned to the class divided by the total number of genes assigned to any COG class, which accounted for differing genome sizes and unknown gene content.

The evolutionary history of the methanogen isolates was inferred using the Neighbor-Joining method [34]. A sum of branch length of 1.91983843 was used. A bootstrap test with 1000 replicates was used to compute the final tree [35]. The evolutionary distances were computed using the Maximum Composite Likelihood method [36] and are in the units of the number of base substitutions per site. The analysis involved 91 nucleotide sequences. All positions with less than 95% site coverage were eliminated. That is, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 1246 positions in the final dataset. Evolutionary analyses were conducted in MEGA7 [37]. Redundant and unnecessary branches were trimmed for a more easily viewed tree and formatted using the Interactive Tree of Life [38].

Additional annotation and analyses

Possible modes of methanogenesis as described on metacyc.com [39] and in Blaut [40] were investigated for each genome. Using the R programming language, an algorithm was developed to search the functional annotations provided by the KBase pipeline for genes that encode enzymes involved in methanogenesis. Methanogenic genes were identified by searching for either Enzyme Commission (EC) numbers [41] or specific enzyme names and functional descriptions. Hydrogenases were identified by searching the KBase annotations specifically for Eha, Ehb, Mvh, Frh, Ech, Vho, and cytochrome b enzymes. Since these enzymes are multisubunit complexes, the components of each hydrogenase were identified. BLAST-based comparisons were made using BLASTP version 2.2.30+ against the genome indicated, each downloaded from the RefSeq database [32] on August 1st, 2016.

Transmembrane domains were predicted using a Transmembrane Hidden Markov Model (TMHMM) [42]. Transporter analysis was conducted by running BLASTP version 2.2.30+ against the transporter classification database (TCDB) [43], downloaded on January 13th, 2015. A custom python script was used to select the blast hit with the lowest e-value, with the strict requirement that the blast hit matched at least 70% of the length of the subject and query. Membrane proteins and transporters were assigned classes based on TCDB number (transporters) or by manual curation of KBase annotations (membrane proteins).

Nucleotide sequence accession numbers

The final genome assembly and annotation information is available in the GenBank database with accession numbers PRJNA300714, PRJNA300715, PRJNA300716, and PRJNA300717 for Mbac. bryantii, Msar. spelaei, Msph. cuniculi, and Mcor. parvum, respectively.

Results and discussion

Genome sequencing and assembly

In order to compare methanogen metabolism and methanogenesis across archaeal genera, four methanogens (Mbac. bryantii, Msar. spelaei, Msph. cuniculi, and Mcor. parvum) were selected for genomic sequencing and analysis. Collectively, the methanogens chosen represent a diverse set of isolates from four different genera and all three classes [18], capable of methylotrophic, acetoclastic, and hydrogenotrophic methanogenesis, as shown in Table 1 [44,45,46,47,48]. Draft genomes for Mbac. bryantii, Msar. spelaei, Msph. cuniculi, and Mcor. parvum were assembled and annotated through the Department of Energy Systems Biology Knowledgebase platform (KBase), a summary of which can be seen in Table 2. The size of each genome was as expected for methanogens of their respective genera, with Msar. spelaei the largest (5.1 Mb), Mcor. parvum and Msph. cuniculi the smallest (1.7 and 1.9 Mb, respectively), and Mbac. bryantii in between (3.5 Mb). Mcor. parvum had the highest GC content (51%), while the other three were more AT-rich (~30% GC content). The GC bias is genome-wide, extending to both coding and intergenic regions. An analysis of codon frequency (Additional file 1: Table S1) reflected the GC bias, where Mcor. parvum had the lowest stop codon frequency of TAA (42%), compared to 67%, 73%, and 54% for Mbac. bryantii, Msph. cuniculi, and Msar. spelaei, respectively [49].

Table 1 Methanogens Characterized in this Study. Displays a summary of the four methanogens sequenced: Mbac. bryantii isolation data from [46, 47], Msar. spelaei isolation data from [48], Msph. cuniculi isolation data from [45], Mcor. parvum isolation data from [44]
Table 2 Genome Sequencing Statistics for Strains in this Study

Analysis of the distribution of COG proteins reveals a shift in genomic composition towards energy conservation

The predicted genes were assigned to COG classes using RPSBLAST. A summary of the number of genes assigned to each COG class can be seen in Fig. 1 and Additional file 1: Table S2. The largest classes of COGs represented across the methanogens include Energy Production and Conversion [C], Coenzyme Transport and Metabolism [H], and Inorganic Ion Transport and Metabolism [P]. Analysis of the predicted membrane proteins further demonstrated the trend towards energy conservation (Additional file 1: Figure S1). The three largest classes of membrane proteins and transporters belonged to energy conservation, ion transport, and metal or cofactor acquisition.

Fig. 1
figure 1

Annotated genome maps highlight key methanogenesis genes. Circular genome maps for Msar. spelaei, Mbac. bryantii, Mcor. parvum, and Msph. cuniculi. Concentric rings represent the following from outermost to innermost: 1) Assembled Scaffold boundaries. 2) Genes responsible for methanogenesis on the forward strand. 3) Predicted ORFs, colored by COG class, and predicted tRNA and rRNA on the forward strand. 4) Predicted ORFs, colored by COG class, and predicted tRNA and rRNA on the reverse strand. 5) Genes responsible for methanogenesis on the reverse strand. 6) GC Content. 7) GC Skew

In order to quantitatively compare the COG categories of the sequenced methanogens, a random sample of 100 genomes from the NCBI RefSeq Database was annotated in a similar method. As shown in Additional file 1: Figure S2, the relative distribution of COG proteins from the RefSeq database achieved stability by 100 genomes sampled. Additional file 1: Figure S3 shows a bootstrap analysis of the sampling, which indicates a standard error of less than 0.3% for each COG category distribution. As shown in Fig. 2, all methanogens sequenced were significantly enriched for genes in Energy Production and Conversion (C), Coenzyme Transport and Metabolism (H), and Translation, Ribosomal Structure and Biogenesis (J). Furthermore, they had significantly fewer genes assigned to Carbohydrate Transport and Metabolism (G), Lipid Transport and Metabolism (I), and Secondary Metabolites Biosynthesis, Transport, and Catabolism (Q). The abnormal genome composition, and in particular the enrichment in Energy Production and Conversion genes, has been noted previously [19, 50,51,52]. As such, a focus on the energy conservation mechanisms of the methanogens studied here was crucial to accurate reconstructions of metabolism.

Fig. 2
figure 2

Distribution of COG genes in categories significantly different than the RefSeq Sample. The fraction of proteins in each methanogen belonging to COG categories shown differ significantly compared to the RefSeq sample. All four sequenced methanogens have significantly more genes categorized as energy production and conversion and coenzyme transport and metabolism, which confirms the observation that the methanogens have a confirmed shift towards energy conservation in order to occupy their ecological niche. Significance was determined through Fisher’s exact test, and verified through difference of medians analysis [24], * represents p < 0.01

BLAST-based comparison to closely related species reflects similarities within methanogen genera

The evolutionary relationship of the methanogens was determined by placing them into a phylogenetic tree as shown in Fig. 3. Each methanogen clustered closely with other methanogens from its assigned genus, with the Class I methanogens (Mbac. bryantii and Msph. cuniculi) distinguished from the Class II methanogen (Mcor. parvum) and the Class III methanogen (Msar. spelaei). Methanosphaera and Methanocorpusculum are both underrepresented in number of species sequenced to date. As such, Msph. cuniculi and Mcor. parvum were both compared to the other sequenced species in their genera. A BLAST-based comparison was performed to determine the proteins found in Msph. cuniculi or Mcor. parvum not found in other sequenced species. The results of the BLAST-based comparison are shown in Additional files 2 and 3 for Msph. cuniculi and Mcor. parvum, respectively. The most striking observation from this comparison is the number of genes annotated as either “hypothetical protein” or with no annotation. Two hundred fifty three of the 277 proteins (91.3%) found in Msph. cuniculi, but not in Methanosphaera stadtmanae, lacked a descriptive annotation, and the same was true for 230 of the 293 proteins (78.5%) found in Mcor. parvum, but not Methanocorpusculum bavaricum or Methanocorpusculum labreanum. Of the few proteins containing a descriptive annotation, none of them play a significant role in methanogenesis in either isolate. The largest group of proteins annotated in the datasets belonged to either Type I or Type II restriction-modification system, suggesting slight differences in specificity against foreign DNA. Taken together, these observations suggest that Msph. cuniculi and Mcor. parvum are metabolically similar to other Methanosphaera and Methanocorpusculum, respectively, and those other sequenced isolates represent good templates for metabolic reconstruction.

Fig. 3
figure 3

Sequenced methanogens cluster with other methanogens in their respective classes. The phylogenetic tree represents the evolutionary relationship of the sequenced methanogens (highlighted in color by Class) compared to other closely related sequenced archaea. As shown, methanogens in this study displayed a close evolutionary relationship to other methanogens within their genera. Within this tree, the Methanomicrobiales (represented by Mcor. parvum) are phylogenetically close to the Methanosarcinales (Msar. spelaei), but metabolically more similar to the Methanobacteriales (Msph. cuniculi and Mbac. bryantii). Bootstrap values are indicated for each node

Metabolic reconstruction of methanogenesis pathways

In order to further investigate proteins involved in energy conservation mechanisms, the specific pathways of methanogenesis were reconstructed for each sequenced isolate (Fig. 4). Of particular interest were the distinct energy conservation mechanisms employed by each isolate and the predicted substrate utilization routes. Together, these two aspects provide the backbone for metabolic models, which can be used for predicting behavior in microbial communities.

Fig. 4
figure 4

Metabolic reconstruction of methanogenesis reveals two mechanisms of energy conservation. Energy conservation through methanogenesis is detailed for electron bifurcation (a) and chemiosmotic coupling (b). Electron bifurcation conserves energy in methanogenesis by taking two pairs of electrons from two separate hydrogen molecules and splitting them into a high energy state (CO2 or ferredoxin reduction) and a low energy state (CoB-S-S-CoM heterodisulfide reduction). Genes for this mechanism were found in Msph. cuniculi, Mbac. bryantii, and Mcor. parvum, although the coupled hydrogenase was not identified in Mcor. parvum. Msar. spelaei utilizes chemiosmotic coupling for energy conservation, where Na+ or H+ transport into the cell is linked to H2 oxidation, and transport out of the cell is linked to methyltransferase and heterodisulfide reductase activity, establishing a net outward gradient for ATP production. The hydrogenase depicted in (a) represents the Mvh hydrogenase (Msph. cuniculi and Mbac. bryantii), and the hydrogenase in (b) represents the Eha (Mbac. bryantii and Mcor. parvum), Ehb (Mbac. bryantii and Msph. cuniculi), or Ech hydrogenase (Msar. spelaei and Mcor. parvum). Methylotrophic methanogenesis pathways are displayed in (c). Methanol utilization pathways are found in Msph. cuniculi, Msar. spelaei, and Mbac. bryantii. Acetate and methylamine utilization pathways are found only in Msar. spelaei. The acetyl-CoA synthase complex is found in Mbac. bryantii and Msar. spelaei, allowing them to fix CO2

Methanobacterium bryantii

As a member of the Class I methanogens, Mbac. bryantii was expected to utilize a cytoplasmic heterodisulfide reductase complex (Hdr, CDS.4429–30), coupled to a non-F420 reducing hydrogenase (Mvh, CDS.4299–302) [53]. These two enzymes couple with the formylmethanofuran dehydrogenase complex (Fwd, CDS.4483–89), which allows for electron bifurcation between the Mvh and Hdr, where two pairs of electrons are split between a thermodynamically unfavorable reaction (CO2 reduction to formylmethanofuran or reduction of ferredoxin) and a thermodynamically favorable reaction (heterodisulfide reduction) [51]. As shown in Fig. 4 and Table 3, Mbac. bryantii contains these components and likely catalyzes methanogenesis through electron bifurcation. Mbac. bryantii has all the genes necessary for its previously reported phenotypic growth on H2/CO2, and contains genes for an acetyl-CoA synthase complex (CDS.3915–3920), allowing it to fix CO2 for central metabolism. The Fwd complex, which requires either tungsten or molybdenum as a cofactor [54], was found in close proximity to genes related to molybdenum transport (CDS.4477–80), molybdopterin synthesis (CDS.4490), and tungsten transport (CDS.4475), as seen in other methanogens [55]. The clustering of genes suggests that they may be co-regulated to aid in complex formation. Two sets of energy-conserving hydrogenases, Eha (CDS.5584–90) and Ehb (CDS.6156–70), were found in the genome of Methanobacterium bryantii. Eha and Ehb have been associated with anaplerosis [56] and CO2 assimilation [57], respectively, during hydrogenotrophic methanogenesis in Methanococcus maripaludis.

Table 3 Hydrogenase Components of the Methanogens in this Study

Mbac. bryantii possesses several genes for metabolic substrate utilization not previously observed from Mbac. bryantii in isolation. It contains several copies of the formate dehydrogenase genes (CDS.3844–7, CDS.333–4, CDS.6040–4) along with a formate transporter (CDS.6039), suggesting the possibility of growth on formate, although this phenotype in Mbac. bryantii has not been reported previously [58]. It also contains the complete set of genes necessary for methanogenesis from methanol/H2, including a methyl transferase (CDS.5844), corrinoid protein (CDS.5843), and corrinoid activating protein (CDS.5842). A sulfite reductase gene is also present (CDS.5502), which likely explains the previously reported observation that it resists sulfite inhibition [59]. Finally, Mbac. bryantii’s genome possesses an alcohol dehydrogenase (CDS.3886) and NADP-dependent F420 reductase (CDS.3694), suggesting the possibility of growth on isopropanol or isobutanol, sometimes seen in other Methanobacterium species [46]. However, attempts at growth on formate, methanol, and isopropanol were unsuccessful under conditions tested in this study, suggesting that the genes responsible are inactive or the proper conditions for gene activation were unmet. The presence of the noted genes are important, however, when considering how Mbac. bryantii might act in co-culture or in its native environment. For example, both formate and hydrogen are commonly used to transfer electrons in microbial consortia [60], so it is possible that formate metabolism is triggered by low partial pressures of H2.

Methanosphaera cuniculi

A genomic analysis of Msph. cuniculi revealed that it is metabolically similar to the other Methanosphaera sequenced, Msph. stadtmanae. Msph. stadtmanae is known to have a restrictive metabolism, requiring both methanol and H2 for growth [61]. Like Msph. stadtmanae, Msph. cuniculi contains the genes encoding the non-F420-reducing hydrogenase, Mvh (CDS.1980–2) [61]. As with Mbac. bryantii, this hydrogenase likely couples with the heterodisulfide reductase complex, Hdr (CDS.2562–4) [51]. The heterodisulfide reduction is likely coupled to oxidation of two H2 molecules and ferredoxin reduction. The ferredoxin is then either used for anabolic processes or oxidized by Ehb (CDS.2129–49) in an energy-conserving mechanism producing H2 and transporting ions out of the cell, as proposed recently by Thauer et al. [5].

The utilization of methanol likely occurs through the action of three proteins: a corrinoid protein (CDS.2507), a methyltransferase corrinoid activation protein (CDS.2506), and a methanol methyltransferase protein (CDS.2508) [62]. These proteins transfer a methyl group from methanol, to the corrinoid protein, and finally to methyl-coenzyme M. Beyond methanol metabolism, Msph. cuniculi contains all the genes necessary for reduction of CO2 for methanogenesis, as shown in Fig. 4a. However, previous characterization has concluded that it requires methanol for growth [45]. The lack of growth is most likely explained by the requirement of molybdopterin by the formylmethanofuran dehydrogenase, and the lack of enzymes capable of producing molybdopterin in Msph. cuniculi. However, it is also possible that genes are not functional in vivo. Like Mbac. bryantii, it is important to note the potentially novel substrate routes, as Msph. cuniculi may be capable of utilizing CO2 and H2 in a native environment or in conditions not seen in isolation.

Methanosarcina spelaei

Instead of utilizing electron bifurcation to couple the unfavorable reduction of CO2 to the favorable reduction of the CoM-S-S-CoB heterodisulfide, H2-utilizing Methanosarcina couple each step to ion transport, termed chemiosmotic coupling, shown in Fig. 4b [5]. The favorable step of heterodisulfide reduction is coupled to transport of H+ or Na+ out of the cell, establishing a gradient [5]. This step is performed by a methanophenazine reducing hydrogenase (Vho, CDS.8117–25) / heterodisulfide reductase (Hdr, CDS.5944–6) complex [5]. Part of this gradient is then used by the energy conserving hydrogenase (Ech, CDS.6169–74) to reduce ferredoxin with H2 [5]. Finally, the ferredoxin is utilized to reduce CO2 by the formylmethanofuran dehydrogenase complex (Fmd, CDS.9330–4) [5]. As shown in Fig. 4b and Table 3, Msar. spelaei’s genome contains all of the genes required for this process. Additionally, the membrane protein and transporter analysis (Additional files 4 and 5) revealed that the Ech and Hdr complexes are predicted to be membrane-bound and catalytically active transporters, which has been experimentally validated in other isolates [63, 64]. The genome for Msar. spelaei contains the F420-reducing hydrogenase (FrhABG, CDS.9055–8), used for CO2 reduction. It also contains a methanophenazine-dependent F420H2 dehydrogenase complex (Fpo, CDS.7883–95), which reduces methanophenazine with F420H2 while transporting two protons across the membrane. This allows for the interconversion of reduced cofactors (methanophenazine and F420) while conserving energy, which is important for methylotrophic methanogenesis [65].

Genome sequencing of Msar. spelaei presents an explanation for its observed methanogenesis on methanol, acetate, and methylamines, as shown in Fig. 4c. Methyltransferases specific to methanol (CDS.8578–80), monomethylamine (CDS.10223–4), dimethylamine (CDS.10167–8), and trimethylamine (CDS.9576,9578) were all identified within the genome. Each methylamine methyltransferase includes an amber stop codon that putatively encodes a pyrrolysine residue, which has been shown to be critical for activity in Methanosarcina acetivorans [66]. Supporting the synthesis of this non-canonical amino acid are proline reductase, pyrrolysine synthetase, proline 2-methylase, and pyrrolysyl-tRNA synthetase, all of which were found in the same region of the genome as the monomethylamine methyltransferase complex (CDS.7548–51) [67]. Finally, an acetate kinase (CDS.6562) and phosphotransacetylase (CDS.6561) were found, explaining growth on acetate [68].

Methanocorpusculum parvum

As shown in Fig. 4, Mcor. parvum has all the genes necessary for methanogenesis with H2/CO2 and formate as substrates. Mcor. parvum lacks the acetyl-CoA synthase (Fig. 4c), uncoupling methanogenesis from CO2 fixation into acetyl-CoA, which explains its phenotypic requirement for acetate or yeast extract for carbon assimilation [44]. It contains the most diverse set of hydrogenases of the methanogens studied here, as shown in Table 3. Two separate gene clusters were found within the genome containing the formylmethanofuran dehydrogenase complex, Fwd. In one case, the Fwd genes (CDS.2339–40) are clustered with an energy conserving hydrogenase, Eha (CDS.2325–37). Several of the Eha components are predicted to be membrane associated (Additional file 4) with a predicted function of coupling Fwd activity to transmembrane sodium or proton transport according to the transporter analysis (Additional file 5) [5]. A second cluster contains genes for the Fwd complex (CDS.3326–9) along with the heterodisulfide reductase (HdrABC, CDS.3322–4), including two HdrA subunits and one each of the HdrB and HdrC subunits. None of the associated subunits contain predicted transmembrane domains, suggesting that the Hdr complex is cytosolic, as previously determined [5]. Unlike members of the Class I methanogens or the Methanocellales species [69], the complementary [NiFe] hydrogenase (MvhAG) was not identified in the Mcor. parvum genome, but the MvhD subunit was present. The FrhABG hydrogenase (CDS.2912–15) was identified as well. The presence of two separate gene clusters coding for the Fwd complex suggest that they may be expressed in response to different growth conditions, or play different roles in methanogenesis. The Eha hydrogenase has been shown to be critical for replenishing intermediates of methanogenesis lost to leaky electron bifurcation, dilution due to growth, or biosynthesis [56]. It was also proposed that a separate ferredoxin was utilized in order to separate electron pools for anabolism and replenishment of intermediates [56]. The weak similarity of the two Fwd complexes in Mcor. parvum (~36–44% amino acid identity of each component), could represent differences in specificity of the ferredoxin used – one specific for ferredoxin reduced by Eha, and one specific for ferredoxin reduced by a putative electron bifurcating Hdr complex. Separate ferredoxin proteins are encoded in each of the Fwd gene clusters, further supporting this hypothesis. The exact hydrogenase utilized in the Hdr complex has yet to be experimentally determined, however the most likely hypothesis is that the FrhAG components also serve to replace the MvhAG used in other methanogens [54], resulting in a FrhAG/MvhD/HdrABC complex, as shown in Fig. 5. Interestingly, a recent study reported the heterologous expression in Escherichia coli of the cytoplasmic HdrA2B2C2 complex from Methanosarcina acetivorans [70]. The HdrA subunit was a fusion of the HdrA/MvhD subunits and was capable of directly utilizing F420H2. While it shares only 42% identity with the concatenated HdrA/MvhD found separately in Mcor. parvum¸ it provides biochemical evidence for an alternative model in which the Frh complex generates F420H2 separately to be utilized by the MvhD/HdrABC complex, thus alleviating the need for MvhAG subunits.

Fig. 5
figure 5

The proposed role for each of the two copies of formylmethanofuran dehydrogenase in Methanocorpusculum parvum. Mcor. parvum contains two copies of the formylmethanofuran dehydrogenase complex, one clustered with the Eha energy conserving hydrogenase and one clustered with the heterodisulfide reductase (HdrABC/MvhD) complex. We propose that the two complexes are specific to different ferredoxins, helping to separate the electron pool utilized for anabolism from that utilized to replenish methanogenesis intermediates as previously proposed [49]. The Eha-mediated electron transfer results in no net energy gain through methanogenesis, but replenishes intermediates lost to leaky electron bifurcation or biosynthesis and anabolism. The Hdr mediated electron transfer functions similarly to Class 1 methanogens through electron bifurcation, resulting in a net gain of energy from methanogenesis. Since MvhAG are not present, the best explanation is through use of FrhAG, however that association still needs to be experimentally verified.

Genes for formate dehydrogenase were found in Mcor. parvum (CDS.1907–8, CDS.2816–17), but were located outside of the Fwd/Hdr cluster. Formate dehydrogenase has been shown to form a complex with the Hdr enzyme complex, allowing the electrons to flow directly from formate to the heterodisulfide bond [71]. This observation links directly to the observed phenotype of growth on formate by Mcor. parvum. Finally, this cluster also contains a gene annotated as either a sulfite reductase or nitrite reductase (CDS.3330). These enzymes have previously been reported in methanogens, but never specifically clustered with methanogenesis genes [72]. Similar clusters are seen in sulfate reducing organisms, where a sulfite reductase is coupled to an Hdr-like complex [73]. The sulfite reductase located in close proximity to the Hdr complex suggests that it may have been acquired at the same time as the Hdr genes, or perhaps that the protein closely associates with the Fdh/Hdr complex in order to accept electrons from formate.

In addition to H2/CO2 and formate, Mcor. parvum is capable of growth and methanogenesis from isopropanol [74]. The genome of Mcor. parvum lacked proteins annotated as alcohol dehydrogenases. Previously, an alcohol dehydrogenase was found in Mcor. parvum, and the protein’s N-terminus was sequenced [75]. By searching the genome for this N-terminal sequence, a gene (CDS.2878) annotated as a threonine dehydrogenase was the only match to this sequence. The threonine dehydrogenase aligned with other alcohol dehydrogenases with >70% identity, suggesting that this gene likely functions as the NADP-dependent alcohol dehydrogenase, converting 2-propanol and NADP to acetone and NADPH, respectively.

Conclusions

The diverse group of methanogens sequenced in this study and the careful reconstruction of their metabolisms have brought forth several new hypotheses as to how methanogens may behave in communities compared to how they behave in isolation. The genome of Msph. cuniculi represents only the 2nd sequenced genome from Methanosphaera, and the genome of Mcor. parvum represents the 3rd from Methanocorpusculum, so their genomes fill in gaps of underrepresented genera. Collectively, methanogens occupy a unique ecological niche, acting as the terminal electron acceptors in anaerobic environments. As demonstrated by the COG analysis, significantly more proteins are used for energy conservation and for coenzyme metabolism than a representative sample from Genbank, enabling the unique metabolism of the methanogens. Many of the hydrogenases and other proteins require coenzymes and ion cofactors, which require protein resources to attain. The vast majority of membrane proteins belong to the general classes of catalysis and energy conservation, ion and metabolite transport, and metal and cofactor acquisition. Methanogens have evolved to occupy the niche of utilizing CO2 or acetate as the primary terminal electron acceptor [76], and their genomes reflect this adaptation.

Energy conservation is predicted to occur through chemiosmotic coupling in Msar. spelaei and through electron bifurcation linked to chemiosmotic coupling in Mbac. bryantii and Msph. cuniculi. Mcor. parvum, however, lacks the cytoplasmic non-F420 reducing hydrogenase typically utilized for electron bifurcation and lacks the membrane-associated heterodisulfide reductase utilized for chemiosmotic coupling. The lack of a membrane-associated Hdr complex in Mcor. parvum suggests that electron bifurcation must take place, with the most likely explanation being that the FrhAG subunits replace the MvhAG found in Class I methanogens. The exact mechanism for conservation in Mcor. parvum and other members of the Methanomicrobiales, however, needs to be determined biochemically.

Metabolic analysis, supported by previous biochemical research, has led to new predicted growth phenotypes in the sequenced methanogens. We have assembled draft metabolic models for the four isolates, which serve as the starting point for construction of models of other methanogens, or further studies to refine the metabolic map of these strains with techniques like Flux Balance Analysis. While none of the newly predicted growth phenotypes were demonstrated in isolation here, their presence is important to understanding how methanogens coexist in microbial communities. For example, Mbac. bryantii’s genome contained the most predicted new phenotypes, and it is a known syntrophic organism. Its activity in a native community could yield phenotypes that are different from those displayed in isolation. Several studies have demonstrated the effect of pH2 on gene expression in methanogens [77, 78]. It is possible that the low H2 threshold exhibited by Mbac. bryantii [79] triggers expression of genes for this phenotype, and that this constant level of low P H2 can only be achieved in co-culture. Additionally, direct electron transfer has been demonstrated for methanogen-containing co-cultures [80], which could explain the presence of the formylmethanofuran dehydrogenase in Msph. cuniculi. Similarly, Methanosaeta harundinacea was unable to reduce CO2 in isolation, but through the action of direct electron transfer, it could utilize the formylmethanofuran dehydrogenase in its genome [80].