Introduction

Legumes engage in nitrogen-fixation symbioses with bacterial partners from at least 13 genera of Proteobacteria [1-4]. Despite the high phylogenetic diversity of root nodule bacteria, the very broad distribution of one genus (Bradyrhizobium) across host legume clades suggests that bacteria in this genus may have been the first legume symbionts [5]. Bradyrhizobium interacts with the widest diversity of legume clades (at least 24 of ca. 33 nodule-forming legume tribes; [6]) and is associated with nodulating groups that represent early-branching lineages [7] in all three legume subfamilies [8,9]. Analysis of basal Bradyrhizobium lineages that are associated with early-diverging legume groups may thus shed light on the origins of this symbiosis.

Here we report the genome sequence of one such organism, Bradyrhizobium strain Tv2a.2. Strain Tv2a.2 was sampled in 1997 from the tree Tachigali versicolor on Barro Colorado Island, Panama, a biological preserve with an old-growth moist tropical forest [10]. Tachigali is one of just a handful of nodule-forming genera in the legume subfamily Caesalpinioideae [11], which comprises the earliest-branching lineages in the legume family [7]. Tachigali versicolor is a large canopy tree with an unusual monocarpic life history: trees grow for decades without flowering, produce a single crop of seeds, and then die [12].

Strain Tv2a.2 is a typical representative of the nodule symbionts that are associated with Tachigali in this tropical forest habitat [13], and appears to represent a unique early-diverging lineage of Bradyrhizobium. Phylogenetic analyses have placed Tv2a.2 near the early split in the genus between two large superclades represented by B. diazoefficiens USDA 110 and B. elkanii USDA 76. However, its exact position near the base of the Bradyrhizobium tree varies among analyses, depending on the loci, the strains included, and the tree-building method [5,13]. For example, a Bayesian analysis of 16S rRNA sequences from the type strains of 21 Bradyrhizobium species and strain ORS278 placed Tv2a.2 as the earliest-diverging Bradyrhizobium lineage [14].

Here we provide an analysis of the complete genome sequence of Tv2a.2, one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal [15]. The properties of this genome should help to clarify early events in the diversification of the genus Bradyrhizobium as a whole.

Organism information

Classification and features

Bradyrhizobium sp. Tv2a.2 is a motile, non-sporulating, non-encapsulated, Gram-negative strain in the order Rhizobiales of the class Alphaproteobacteria. The rod-shaped cells (Figure 1 Left, Center) are approximately 0.5 μm wide and 1.5-2.0 μm long. The strain is relatively slow growing, forming colonies after 6–7 days when grown on half-strength Lupin Agar (½LA) [16], tryptone-yeast extract agar (TY) [17] or a modified yeast-mannitol agar (YMA) [18] at 28°C. Colonies on ½LA are opaque, slightly domed and moderately mucoid with smooth margins (Figure 1 Right).

Figure 1

Images of Bradyrhizobium sp. Tv2a.2 using scanning (Left) and transmission (Center) electron microscopy as well as light microscopy to visualize colony morphology on solid media (Right).

Figure 2 shows the phylogenetic relationship of Bradyrhizobium sp. Tv2a.2 in a 16S rRNA gene sequence based tree. This strain is most closely related to Bradyrhizobium sp. EC3.3, with a 16S rRNA gene sequence identity of 99.31% as determined by BLAST analysis [19]. Tv2a.2 is also related to the type strains Bradyrhizobium ingae BR 10250T and Bradyrhizobium iriomotense EK05T, with 16S rRNA gene sequence identities of 99.16% and 99.08%, respectively, based on results from the EzTaxon-e server [20,21].
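To make the identity percentages above concrete, the following is a minimal sketch of how a pairwise percent identity can be computed from two already-aligned sequences. It is not the BLAST or EzTaxon-e procedure used for the reported values. The function name `percent_identity`, the toy sequences, and the choice to skip gapped columns are all assumptions made for illustration; tools differ in how they treat gaps, so values from this sketch need not match theirs.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: gapped columns ('-') are skipped entirely. Other tools may
    count them in the denominator instead.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment (not real 16S rRNA data).
query = "ACGTACGTAC-GT"
subject = "ACGTACGAACCGT"
print(f"{percent_identity(query, subject):.2f}%")
```

Under this convention, an identity of about 99.3% over roughly 1,300 compared positions corresponds to only about nine differing bases, which illustrates how closely related the strains in this part of the tree are.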

Figure 2

Phylogenetic tree highlighting the position of Bradyrhizobium sp. Tv2a.2 (shown in blue print) relative to other type and non-type strains in the Bradyrhizobium genus using a 1,310 bp intragenic sequence of the 16S rRNA gene. The sequence of Azorhizobium caulinodans ORS 571T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [41]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [22] are shown in bold with the GOLD ID after the strain number; for all other strains the NCBI accession number is provided.

Minimum Information about the Genome Sequence (MIGS) of Tv2a.2 is provided in Table 1 and Additional file 1: Table S1.

Table 1 Classification and general features of Bradyrhizobium sp. Tv2a.2 in accordance with the MIGS recommendations [42] published by the Genome Standards Consortium [43]

Symbiotaxonomy

Bradyrhizobium strain Tv2a.2 was isolated from nodules of Tachigali versicolor found in a tropical forest on Barro Colorado Island, Panama [10]. Because seed production by this host is highly erratic, no seeds were available to authenticate the symbiotic proficiency of strain Tv2a.2 on T. versicolor. Nodulation and nitrogen fixation were therefore tested on two promiscuous legumes (Vigna unguiculata, Macroptilium atropurpureum); nodules developed only on M. atropurpureum, and acetylene reduction assays showed that these nodules lacked nitrogenase activity [13]. A further indication that Tv2a.2 may be relatively host-specific is that extensive sampling of other legume hosts in Panama (and elsewhere in the Neotropics) has never recovered strains belonging to the Tv2a.2 lineage from any legume taxon other than T. versicolor [9].

Genome sequencing and annotation information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, Root Nodulating Bacteria (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute (JGI). The genome project is deposited in the Genomes OnLine Database [22] and a high-quality permanent draft genome sequence is deposited in IMG [23]. Sequencing, finishing and annotation were performed by the JGI using state-of-the-art sequencing technology [24]. A summary of the project information is shown in Table 2.

Table 2 Project information

Growth conditions and genomic DNA preparation

Bradyrhizobium sp. Tv2a.2 was cultured to mid-logarithmic phase in 60 ml of TY rich medium on a gyratory shaker at 28°C [25]. DNA was isolated from the cells using a CTAB (cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [26].

Genome sequencing and assembly

The draft genome of Bradyrhizobium sp. Tv2a.2 was generated at the DOE Joint Genome Institute (JGI) using Illumina technology [27]. An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform, which generated 8,336,316 reads totaling 1250.45 Mbp. All general aspects of library construction and sequencing were performed at the JGI and details can be found on the JGI website [28]. All raw Illumina sequence data were passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J, Unpublished). The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [29]; (2) 1–3 Kbp simulated paired-end reads were created from the Velvet contigs using wgsim [30]; (3) Illumina reads were assembled with the simulated read pairs using Allpaths-LG (version r39750) [31]. Parameters for the assembly steps were (1) velveth: --v --s 51 --e 71 --i 2 --t 1 --f "-shortPaired -fastq $FASTQ" --o "-ins_length 250 -min_contig_lgth 500" for Velvet and (2) wgsim: -e 0 -1 76 -2 76 -r 0 -R 0 -X 0. The final draft assembly contained 87 contigs in 87 scaffolds. The total size of the genome is 8.5 Mb, with an average genome coverage of 109.04x.
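As a quick sanity check on the sequencing figures above, the sketch below derives the mean read length and the raw sequencing depth from the reported read count, total yield, and the genome size of 8,496,279 nucleotides given under Genome properties. The variable names are illustrative. How the reported 109.04x average coverage was computed is not stated in the text, so the raw depth calculated here (total yield divided by assembly size, before any filtering) should be read only as an upper bound and is not expected to reproduce that figure.

```python
# Values taken from the text.
read_count = 8_336_316          # Illumina reads generated
total_yield_bp = 1250.45e6      # 1250.45 Mbp of raw sequence
genome_size_bp = 8_496_279      # draft assembly size

# Mean read length: total bases divided by number of reads.
mean_read_length = total_yield_bp / read_count

# Raw depth: total bases divided by genome size. This ignores read
# filtering and unassembled reads, so it overestimates usable coverage.
raw_depth = total_yield_bp / genome_size_bp

print(f"mean read length: {mean_read_length:.1f} bp")
print(f"raw depth before filtering: {raw_depth:.1f}x")
```

The mean read length comes out at about 150 bp. The raw depth is higher than the reported coverage, which is what one would expect if some reads were removed by filtering or not incorporated into the assembly.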

Genome annotation

Genes were identified using Prodigal [32], as part of the DOE-JGI genome annotation pipeline [33,34]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database and the UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [35] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [36]. Other non-coding RNAs, such as the RNA components of the protein secretion complex and RNase P, were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [37]. Additional gene prediction analysis and manual functional annotation were performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) system [38] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 8,496,279 nucleotides long with 62.20% GC content (Table 3) and comprises 87 scaffolds. Of a total of 8,181 genes, 8,109 were protein-encoding and 72 encoded only RNA. The majority of genes (72.94%) were assigned a putative function, whilst the remaining genes were annotated as hypothetical. The distribution of genes into COG functional categories is presented in Table 4.
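The gene counts above can be cross-checked with a few lines of arithmetic. The sketch below confirms that the protein-coding and RNA gene counts sum to the reported total and derives the corresponding percentages and gene density. The variable names are illustrative, and the estimate of genes with a putative function assumes that 72.94% is expressed relative to all 8,181 genes, which the text does not state explicitly.

```python
# Values taken from the text.
genome_size_bp = 8_496_279
total_genes = 8_181
protein_coding_genes = 8_109
rna_genes = 72
pct_with_function = 72.94  # assumed to be a percentage of total_genes

# The two gene classes should account for every predicted gene.
assert protein_coding_genes + rna_genes == total_genes

pct_protein_coding = 100 * protein_coding_genes / total_genes
pct_rna = 100 * rna_genes / total_genes

# Approximate count of genes with a putative function (derived, not reported).
approx_genes_with_function = round(total_genes * pct_with_function / 100)

# Gene density in genes per megabase of assembled sequence.
genes_per_mb = total_genes / (genome_size_bp / 1e6)

print(f"protein-coding: {pct_protein_coding:.2f}%  RNA: {pct_rna:.2f}%")
print(f"genes with putative function (approx.): {approx_genes_with_function}")
print(f"gene density: {genes_per_mb:.0f} genes per Mb")
```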

Table 3 Genome statistics for Bradyrhizobium sp. Tv2a.2
Table 4 Number of genes associated with the general COG functional categories

Conclusions

Bradyrhizobium sp. Tv2a.2 was collected in 1997 from a nodule of the tree Tachigali versicolor on Barro Colorado Island, Panama. Based on 16S rRNA gene analyses, Tv2a.2 is most closely related to Bradyrhizobium sp. EC3.3 (a strain isolated from a nodule of Erythrina costaricensis collected from Barro Colorado Island, Panama) and to the type strains Bradyrhizobium ingae BR 10250T and Bradyrhizobium iriomotense EK05T, which were isolated from Inga laurina (Sw.) Willd. growing in the Cerrado Amazon region, State of Roraima, Brazil [39] and from Entada koshunensis, a legume found in Okinawa, Japan [40], respectively. Strain Tv2a.2 is one of 25 Bradyrhizobium genomes that were sequenced within the GEBA-RNB project [15]; among these, the Tv2a.2 genome has the fifth-lowest genome size (8.5 Mbp), gene count (8,181) and Pfam percentage (74.32%). The specific attributes of the Bradyrhizobium sp. Tv2a.2 genome, compared with those of the other Bradyrhizobium genomes, will be important for understanding the interactions required for the successful establishment of an effective symbiosis with the host Tachigali versicolor.