Introduction

Strain BL2T (= DSM 18169 = CCUG 54636 = JCM 13652) is the type strain of Bacteroides barnesiae which belongs to the genus Bacteroides [1]. The species epithet is derived from the name of Ella M. Barnes, a British microbiologist, who has contributed much to our knowledge of intestinal bacteriology and anaerobic bacteriology in general. B. barnesiae strain BL2T was isolated from caecum of a healthy chicken. Four other strains belonging to the same species have been isolated from the same source [1]. The genus Bacteroides represents one of the predominant anaerobic genera found in chicken caecum [24]. Bacteroides species are thought to play a fundamental role in the breakdown of complex molecules (such as polysaccharides) into simpler compounds that are used by the animal host as well as the microorganisms themselves [5, 6], in the utilization of nitrogenous substances and in the biotransformation of bile acids and other steroids [7]. They also play a role as beneficent protectors of the gut against pathogenic microorganisms [8]. Here we present a summary classification and set of features for B. barnesiae strain BL2T, together with the description of the complete genomic sequencing and annotation.

Organism information

Classification and features

A 1301 bp long contig contained the most complete 16S rRNA gene copy in the draft genome. This partial gene differed by 7 nucleotides (0.5 %) from the 16S rRNA reference sequence (AB253726) generated for the original description of B. barnesiae [1]. Such a difference is not unusual when comparing original sequences from the time organisms were initially described with sequences of type strain genomes sequenced in the KMG project [9], a problem that was only partially resolved in the sequencing orphan species initiative (SOS) [10]. A representative 16S rRNA gene sequence of strain BL2T was compared with GenBank using NCBI BLAST. The single most frequent genus found was Bacteroides . The highest-scoring environmental sequences (up to 99.8 % sequence identity), including HQ784912 (‘gastrointestinal specimens clone ELU0102-T240-S-NI_000093’), were all from a study on gastrointestinal specimens linked to inflammatory bowel diseases phenotype in human ileum [11] and indicate that close relatives of strain BL2T and representatives of B. barnesiae are also relevant to human health. Fig. 1 shows the phylogenetic position of B. barnesiae in a 16S rRNA gene sequence-based tree.

Fig. 1
figure 1

Phylogenetic tree based on the 16S rRNA gene sequences showing the relationship of Bacteroides barnesiae strain BL2T among the genus Bacteroides . The tree was constructed by the neighbor-joining method. Numbers at nodes indicate the percentage bootstrap values of 1000 replicates. Bars, 0.01 substitutions per nucleotide position. Accession numbers are given for each strain

The cells of B. barnesiae are pleomorphic rods (0.5-1.4 × 0.8-10.6 μm) (Fig. 2). The cells are usually arranged singly or in pairs [1]. B. barnesiae is a Gram-negative, non-sporeforming bacterium (Table 1) that is described as non-motile, with only seven genes associated with motility having been found in the genome (see below). The optimum temperature for growth of strain BL2T is 37 °C. B. barnesiae is a strictly anaerobic chemoorganotroph and is able to ferment glucose, lactose, sucrose, maltose, salicin, xylose, cellobiose, mannose and raffinose [1]. The organism hydrolyzes esculin but does not liquefy gelatin, and neither reduces nitrate nor produces indole from tryptophan [1]. B. barnesiae does not utilize mannitol, arabinose, glycerol, melezitose, sorbitol, rhamnose or trehalose [1]. Growth is possible in the presence of bile [1]. Major fermentation products from broth (1 % peptone, 1 % yeast extract, and 1 % glucose each (w/v)) are acetic acid and succinic acid, whereas isovaleric acid is produced in small amounts [1]. B. barnesiae shows activity for α-galactosidase, β-galactosidase, α-glucosidase, β-glucosidase, α-arabinosidase, N-acetyl-β-glucosaminidase, α-fucosidase, alkaline phosphatase, leucyl glycine arylamidase, alanine arylamidase and glutamyl glutamic acid arylamidase but no activity urease, catalase, arginine dihydrolase, β-galactosidase 6-phosphate, β-glucuronidase, glutamic acid decarboxylase and arginine, proline, phenylalanine, leucine, pyroglutamic acid, tyrosine, glycine, histidine and serine arylamidase [1].

Fig. 2
figure 2

Light microscope image of strain BL2T

Table 1 Classification and general features of Bacteroides barnesiae strain BL2T in accordance with the MIGS recommendations [33] published by the Genome Standards Consortium [34] and the NamesforLife database [35]

B. barnesiae strain BL2T contains menaquinones MK-10 (58 %) and MK-11 (34 %) as principal respiratory quinones, small amounts of MK-8, MK-9 and MK-12 (2 % each) are found as minor components [1]. The major fatty acids found were anteiso-C15:0 (32 %), iso-C15:0 (15 %), 3-hydroxy C16:0 (10 %) and C16:0 (10 %). Fatty acids C14:0 (4 %), C15:0 (2 %), C18:1 ω9c (4 %), C18:2 ω6,9c (2 %) and 3-hydroxy iso-C17:0 (7 %) were found in minor amounts [1]. Chemotaxonomic features are in line with known features from other representatives of the genus [1].

Genome sequencing information

Genome project history

The organism was selected for sequencing on the basis of its phylogenetic position [1214]. Sequencing of B. barnesiae strain BL2T is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [9] which aims not only to increase the sequencing coverage of key reference microbial genomes [15], but also to generate a large genomic basis for the discovery of genes encoding novel enzymes [16]. The genome project is deposited in the Genomes OnLine Database [17] and the permanent draft genome sequence is deposited in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute using state of the art sequencing technology [18]. A summary of the project information is shown in Table 2.

Table 2 Genome sequencing project information

Growth conditions and genomic DNA preparation

B. barnesiae strain BL2T, DSM 18169, was grown anaerobically in DSMZ medium 429 (Columbia Blood Agar) at 37 °C [19]. DNA was isolated from 0.5-1 g of cell paste using JetFlex genomic DNA purification (GENOMED) following the standard protocol as recommended by the manufacturer with and additional protease K (50 μl; 21 mg/ml) digest for 60 min. at 58 °C followed by addition of 200 μl Protein Precipitation Buffer after protein precipitation and overnight incubation on ice. DNA is available through the DNA Bank Network [20].

Genome sequencing and assembly

The permanent draft genome of B. barnesiae strain BL2T was generated using Illumina technology [18, 21]. An Illumina Standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 11,109,700 reads totaling 1,666.5 Mb. All general aspects of library construction and sequencing performed at the DOE-JGI can be found at [22]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts [23]. Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet [24], (2) 1–3 kb simulated paired end reads were created from Velvet Contigs using wgsim [25], (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [26]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very clean yes –export- Filtered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 47 contigs in 43 scaffolds. The total size of the genome is 3.6 Mb and the final assembly is based on 443.6 Mb of Illumina data, which provides an average 122.7 × coverage of the genome.

Genome annotation

Genes were identified using Prodigal [27] as part of the DOE-JGI genome annotation pipeline [28, 29], following by a round of manual curation using the JGI GenePRIMP pipeline [30]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database, UniProt, TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro database. These data sources were combined to assert a product description for each predicted protein. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes-Expert Review platform [31].

Genome properties

The assembly of the draft genome sequence consists of 43 scaffolds amounting to 3,621,509 bp, and the G + C content is 46.8 % (Table 3). Of the 3,156 genes predicted, 3,059 were protein-coding genes, and 97 RNAs. The majority of the protein-coding genes (71.7 %) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.

Table 3 Genome statistics
Table 4 Number of genes associated with the general COG functional categories

Insights from the genome sequence

B. barnesiae strain BL2T, Bacteroides salanitronis strain BL78T and Bacteroides gallinarum strain C35T were isolated from the cecum of the same healthy chicken [1]. The GGDC (Genome-to-Genome Distance Calculator) web server (GGDC 2.0) [32] was used for the estimation of the overall similarity between the three Bacteroides genomes. The comparison of B. barnesiae with B. salanitronis and B. gallinarum revealed that 11.1 % and 5.2 %, respectively, of the average of the genome lengths are covered with HSPs (high-scoring segment pairs). The identity within the HSPs was 83.6 % and 84.6 %, respectively, whereas the identity over the whole genome was 9.3 % and 4.4 %, respectively. The comparison of B. gallinarum with B. salanitronis revealed that 5.4 % of the genome is covered with HSPs, with an identity within in the HSPs of 84.1 % and an identity over the whole genome of 4.6 %. According to these calculations the similarity between B. barnesiae and B. salanitronis is higher than the similarity between B. barnesiae and B. gallinarum as well as the similarity between B. gallinarum and B. salanitronis .

The genome size of B. barnesiae (3.6 Mb) is significantly smaller than those of B. salanitronis (4.3 Mb) and B. gallinarum (4.9 Mb).

Conclusions

B. barnesiae strain BL2T genome consists of a single chromosome of 3.6 Mb predicted to encode 3,156 genes. Strain BL2T has a relatively small genome in comparison to other sequenced Bacteroides species isolated from the same chicken (4.3-4.9 Mb). These differences of genome size may be the results of adaptation in this niche. Further study will be necessary for elucidation of this idea.