Introduction

The genus Halopiger , which belongs to the family Halobacteriaceae , was originally established in 2007 by Gutiérrez et al. [1]. The type species of the genus Halopiger is Halopiger xanaduensis SH-6T. To date, the genus is comprised of three validly published species and two effectively but not validly published species: H. xanaduensis [1], Halopiger aswanensis [2], Halopiger salifodinae [3], Halopiger djelfamassiliensis [4] and Halopiger goleamassiliensis [5]. The species of the genus were reported to be isolated from hypersaline environments such as salt lake sediment [1, 4, 5], hypersaline soil [2] and salt mine [3]. All are Gram-negative, strictly aerobic and extremely halophilic [15]. In this genus, three genome sequences, including one finished genome sequence H. xanaduensis SH-6T, and two draft genome sequences H. djelfamassiliensis IIH2T and H. goleamassiliensis IIH3T, are available in Standards in Genomic Sciences [46], except H. aswanensis 56T which showed highest 16S rRNA gene similarity to H. xanaduensis SH-6T (99.1 %). Here we present a summary of the classification and a set of features of strain H. salifodinae KCY07-B2T, together with a description of the non-contiguous finished genomic sequencing and annotation.

Organism Information

Classification and features

A representative genomic 16S rRNA gene sequence of H. salifodinae KCY07-B2T was compared with sequences deposited in the GenBank database using BLASTN [7]. The 16S rRNA gene sequence analysis showed that H. salifodinae KCY07-B2T shared the highest sequence identities to H. xanaduensis SH-6T (95.8 %), followed by H. aswanensis 56T (95.5 %), H. djelfamassiliensis IIH2T (94.9 %) and H. goleamassiliensis IIH3T (94.8 %), and shared low sequence similarities (<94.8 %) to species of other genera. The phylogenetic tree was reconstructed by the neighbor-joining method using MEGA 5 and Kimura’s 2-parameter model for distance calculation [8, 9]. The phylogenetic tree was assessed by boot-strapping for 1000 replications, and the consensus tree was shown in Fig. 1.

Fig. 1
figure 1

Neighbor-joining phylogenetic tree based on 16S rRNA gene sequences showed the relationship of H. salifodinae KCY07-B2T and other related haloarchaeal species. GenBank accession numbers were indicated in parentheses. Bootstrap values based on 1000 replicates were shown for branches with more than 60 % support. Bar, 0.02 substitutions per nucleotide positions. Methanospirillum hungatei JF-1T [30] was used as outgroup

H. salifodinae KCY07-B2T can tolerant high salinity (5.4 M NaCl ) and high temperature (50 °C) [3]. Cells lyse in distilled water. The optimal growth condition of strain KCY07-B2T occured in medium NOM-3 with 2.9–3.4 M NaCl [3]. The optimum temperature was 37–45 °C. The optimum pH was 7.0, with a growth range of pH 6.0–8.0 [3]. Cells of strain KCY07-B2T are strictly aerobic, non-motile and pleomorphic rod-shaped (Fig. 2). Several sugars, organic acids and amino acids can serve as sole carbon and energy sources, and amino acids are not required in the growth medium [3]. The features of H. salifodinae KCY07-B2T are listed in Table 1.

Fig. 2
figure 2

Electron micrographs of cells of strain KCY07-B2T grown in liquid medium under optimum condition. a Transmission electron micrographs of strain KCY07-B2T revealing rod-shaped, bar, 0.2 μm, b Showing a mixture of pleomorphic cells including short and long rod-shaped, bar, 2 μm

Table 1 Classification and general features of H. salifodinae KCY07-B2T according to the MIGS recommendations [10]

Genome sequencing information

Genome project history

This genome was selected for sequencing on the basis of its phylogenetic position and 16S rRNA sequence similarity to other members of the genus Halopiger . This whole genome shotgun project of strain H. salifodinae KCY07-B2T was deposited at DDBJ/EMBL/GenBank under the accession number JROF00000000 and the sequence consisted of 83 contigs (further assembling constructed these contigs into 81 scaffolds). Table 2 shows the project information and its association with MIGS version 2.0 compliance [10].

Table 2 Project information

Growth conditions and genomic DNA preparation

H. salifodinae KCY07-B2T was cultivated aerobically on 37 °C for 4 days in NOM-3 medium, which contains (per liter distilled water) 5.4 g KCl, 0.3 g K2HPO4, 0.25 g CaCl2, 0.25 g NH4Cl, 26.8 g MgSO4 · 7H2O, 23.0 g MgCl2 · 6H2O, 184.0 g NaCl, 1.0 g yeast extract, 0.25 g fish peptone, 0.25 g sodium formate, 0.25 g sodium acetate, 0.25 g sodium lactate and 0.25 g sodium pyruvate (adjusted to pH 7.0 with 1 M NaOH) [3]. Genomic DNA was extracted using the method described by Marmur [11]. The purity, quality and the concentration of genomic DNA preparation were analyzed by 0.7 % agarose gel electrophoresis with λ-Hind III digest DNA Marker (TaKaRa, Dalian, China) and measured using a NanoDrop 1000 Spectrophotometer (Thermo Fisher Scientific Inc., USA).

Genome sequencing and assembly

The genome of H. salifodinae KCY07-B2T was sequenced using Solexa paired-end sequencing technology (HiSeq2000 system, Illumina, Inc., USA) [12]. A shotgun library was constructed with a 500 bp-span paired-end library (~500 Mb available reads, ~130-fold genome coverage) and a 2000 bp-span paired-end library (~250 Mb available reads, ~65-fold genome coverage). The sequence data from an Illumina HiSeq 2000 were assembled with SOAPdenovo v.1.05 [1315]. The final assembly identified 83 contigs and 81 scaffolds (the minimum length is 523 bp) generating a genome size of 4.35 Mb. The quality of the sequencing reads data was estimated by G + C content and sequencing depth correlation analysis.

Genome annotation

The tRNAs and rRNAs were identified using tRNAscan-SE [16], RNAmmer [17] and Rfam database [18]; The open reading frames and the functional annotation of translated ORFs were predicted and achieved by using the RAST server online [19, 20]. Classification of some predicted genes and pathways were analyzed using COGs [21, 22] and KEGG [2325] databases. Meanwhile, we used CRISPRs web server [26] to predict CRISPRs and InterPro [27, 28] to obtain the GO annotation with the database of Pfam [29].

To estimate the mean level of nucleotide sequence similarity at the genome level between e KCY07-B2T and the genus Halopier genomes available to date ( H. xanaduensis SH-6T, H. djelfamassiliensis IIH2T and H. goleamassiliensis IIH3T), we compared the ORFs only using comparison sequence based in the server RAST [19] at a query coverage of ≥60 % and a minimum nucleotide length of 100 bp.

Genome properties

The draft genome sequence of H. salifodinae KCY07-B2T revealed a genome size of 4,350,718 bp (scaffold length) with a 65.41 % G + C content. Of the 4254 predicted genes, 4204 were protein-coding genes, and 50 were rRNA genes. There were one 16S rRNA gene, two 23S rRNA genes and two 5S rRNA genes. A total of 2887 genes (68.67 %) were assigned a putative function (Table 3). Table 4 showed the distribution of genes into COG functional categories.

Table 3 Genome statistics of Halopiger salifodinae KCY07-B2T, including nucleotide content and gene count levels
Table 4 Number of genes associated with the 25 general COG functional categories

Insights from the genome sequence

Strain H. salifodinae KCY07-B2T was isolated from a salt mine sample. The experiments showed this strain could grow at 2.9–3.4 M NaCl for optimal growth, and the cells lysed in distilled water. So the analysis of the genome sequence focused on the adaption mechanism of the halophilic archaea in hypersaline-environments. Strain H. salifodinae KCY07-B2T mainly utilized “the salt-in strategy” to maintain osmotic balance. According to the annotation of genome sequence, Trk system potassium uptake protein were found, which were responsible for K+ uptake and transport, including 9 copies TrkH genes and 5 copies TrkA genes. Five copies of Kef-type K+ transport proteins, one copy glutathione-regulated potassium-efflux protein KefB and 8 pH adaptation potassium efflux system proteins were found that were related to K+ efflux. And there also existed 8 copies of potassium channel proteins. In addition, the genome contains 13 copies of Na+/ H+ antiporter proteins related to Na+ efflux. The genome of strain H. salifodinae KCY07-B2T contains 12 genes related to the synthesis and transport of the compatible-solute glycine betaine for resistance to osmotic stress including: 7 choline-sulfatases, 2 high-affinity choline uptake protein BetTs, 2 glucose-methanol-choline oxidoreductase and 1 glycine betaine transporter OpuD coding genes. These proteins were also related to the metabolic pathway converting choline sulfate to glycine betaine. All these proteins and systems mentioned played an important role in the adaption of osmotic stress in high salt environment.

Currently, three genomes from Halopiger species are available. Here, we compare the genome of strain H. salifodinae KCY07-B2T with strains H. xanaduensis SH-6T, H. djelfamassiliensis IIH2T and H. goleamassiliensis IIH3T (Table 5). The size of genome of H. salifodinae KCY07-B2T (4.35 Mb) is similar to H. xanaduensis SH-6T (4.35 Mb) but larger than that of H. djelfamassiliensis IIH2T (3.77 Mb) and H. goleamassiliensis IIH3T (3.90 Mb). The G + C content of H. salifodinae KCY07-B2T (65.41 %) is similar to H. xanaduensis SH-6T (65.18 %) and higher than that of H. djelfamassiliensis IIH2T (64.30 %) but lower than that of H. goleamassiliensis IIH3T (66.06 %). In addition, H. salifodinae KCY07-B2T shares a mean genomic sequence similarity of 79.74 %, 80.16 % and 79.17 % with strains H. xanaduensis SH-6T, H. djelfamassiliensis IIH2T and H. goleamassiliensis IIH3T, respectively.

Table 5 Genomic comparison of H. salifodinae KCY07-B2T with three other Halopiger speciesa

Conclusions

Strain KCY07-B2T is the third member of the genus Halopiger to be described and the fourth whose genome sequence report is available. These data will provide a new perspective of how microorganisms adapt to halophilic environments, and may also provide a pool of functional enzymes that work at higher salty.