Draft genome sequence of the cellulolytic endophyte Chitinophaga costaii A37T2^T

Proença, Diogo N.; Whitman, William B.; Shapiro, Nicole; Woyke, Tanja; Kyrpides, Nikos C.; Morais, Paula V.

doi:10.1186/s40793-017-0262-2

Short genome report
Open access
Published: 06 September 2017

Volume 12, article number 53 (2017)

Standards in Genomic Sciences

Diogo N. Proença¹,
William B. Whitman²,
Nicole Shapiro³,
Tanja Woyke³,
Nikos C. Kyrpides³ &
Paula V. Morais¹,⁴ (ORCID: orcid.org/0000-0002-1939-6389)

Abstract

Here we report the draft genome sequence of Chitinophaga costaii A37T2^T (=CIP 110584^T, =LMG 27458^T), which was isolated from the endophytic community of a Pinus pinaster tree. The total genome size of C. costaii A37T2^T is 5.07 Mbp, containing 4204 coding sequences. Strain A37T2^T encoded multiple genes likely involved in cellulolytic, chitinolytic and lipolytic activities. In comparison with the complete genome of C. pinensis DSM 2588^T, this genome showed 1145 unique genes, assigned to 109 Clusters of Orthologous Groups. The genomic information suggests that strain A37T2^T has the potential to interact with plant metabolism. As only a few bacterial genomes related to Pine Wilt Disease are available, this work contributes to the field.

Introduction

The genus Chitinophaga belongs to the family Chitinophagaceae (phylum Bacteroidetes), alongside the genera Arachidicoccus, Asinibacterium, Balneola, Cnuella, Crenotalea, Ferruginibacter, Filimonas, Flaviaesturariibacter, Flavihumibacter, Flavisolibacter, Flavitalea, Gracilimonas, Heliimonas, Hydrotalea, Lacibacter, Niabella, Niastella, Parasediminibacterium, Parasegetibacter, Sediminibacterium, Segetibacter, Taibaiella, Terrimonas, Thermoflavifilum and Vibriomonas. The genus Chitinophaga is widely distributed in the environment, and strains of this genus have been isolated from pine trees, soil, rhizosphere soil, roots, vermicompost and weathered rock [1]. Twenty-four species belonging to the genus Chitinophaga have been described [2], and only the type species of the genus, C. pinensis, has a completely sequenced genome [3].

Pinus pinaster trees from Central Portugal present a diverse endophytic microbial community. Strain A37T2^T was isolated as part of the endophytic microbiome of pine trees affected by Pine Wilt Disease (PWD), a devastating disease worldwide that results from the colonization of pine trees by Bursaphelenchus xylophilus [4]. Here, we report the second genome of the genus Chitinophaga, a draft genome of Chitinophaga costaii A37T2^T, previously isolated as an endophyte of Pinus pinaster affected by PWD [1].

Organism information

Classification and features

The type strain A37T2^T (=CIP 110584^T =LMG 27458^T) was isolated from the trunk of a Pinus pinaster tree affected by PWD and was described as Chitinophaga costaii (family Chitinophagaceae, phylum Bacteroidetes) [1]. It was Gram-stain-negative, facultatively anaerobic and non-motile, and formed rod-shaped cells 0.5-1 μm in diameter and 1-8 μm in length after 48 h on R2A agar medium (Fig. 1). The strain grew on R2A agar medium at 15-45 °C (optimum, 26-30 °C), at pH 5.5-8.0 (optimum, pH 7) and with up to 1% (w/v) NaCl (optimum without NaCl). The major fatty acids (>25%) of strain A37T2^T were the saturated iso-C₁₅:₀ and the unsaturated C₁₆:₁ ω5c. The major polar lipids were identified as phosphatidylethanolamine, two unidentified aminophospholipids and one unidentified lipid. No glycolipid was detected. Menaquinone 7 (MK-7) was the major respiratory lipoquinone. The DNA G + C content determined for C. costaii A37T2^T was 46.6 mol%. Key features of this microorganism are summarized in Table 1.

A phylogenetic tree based on the 16S rRNA gene sequences of this strain and its closest relatives is given in Fig. 2. The sequences were aligned with SINA (v1.2.9) using the SILVA SEED as the reference alignment [5]. Sequences were added to the 16S rRNA-based Living Tree Project (LTP) release 115 database [6] by parsimony, as implemented in the ARB software package version 5.5 [7]. Evolutionary distances were calculated [8], and phylogenetic dendrograms were constructed using the neighbor-joining method [9] and the Randomized Axelerated Maximum Likelihood (RAxML) method with the GTRGAMMA model [10], both included in the ARB software [7]. Tree topologies were evaluated by bootstrap analysis [11] of 1000 data sets using the ARB software package.
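As an illustration of the distance step, the following minimal Python sketch assumes that reference [8] denotes the Jukes-Cantor one-parameter correction, d = -(3/4) ln(1 - 4p/3), where p is the proportion of differing sites between two aligned sequences. The text does not spell out the formula, the function names are hypothetical, and this is not the ARB implementation.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns in which either sequence has a gap ('-') are skipped
    (an assumption made for this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance for an observed proportion p."""
    if not 0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for the correction to be defined")
    return -0.75 * math.log(1 - 4 * p / 3)


if __name__ == "__main__":
    # Toy aligned fragments, not real 16S rRNA gene sequences.
    p = p_distance("ACGTACGT", "ACGTACGA")
    print(p, jukes_cantor(p))
```

A matrix of such pairwise distances is the kind of input on which a neighbor-joining tree is built.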

Table 1 Classification and general features of Chitinophaga costaii A37T2^T according to the MIGS recommendations [26]

Genome sequencing information

Genome project history

This Whole Genome Shotgun project has been deposited at ENA under the accession numbers FMAR01000001-FMAR01000056 and in the Integrated Microbial Genomes database (IMG) with Biosample ID SAMN05216457 [12]. The genome sequencing of this organism is part of the Genomic Encyclopedia of Bacteria and Archaea [13], 1000 Microbial Genomes project, phase III (KMG-III) [14], at the U.S. Department of Energy, Joint Genome Institute (JGI). The project information and its association with MIGS are summarized in Table 2.

Table 2 Project information

Growth conditions and genomic DNA preparation

Strain A37T2^T was grown on R2A agar medium at 30 °C for 48 h, and its genomic DNA was extracted using the E.Z.N.A. Bacterial DNA Kit (Omega Bio-Tek, Norcross, GA, USA) according to the manufacturer’s instructions.

Genome sequencing and assembly

The draft genome of C. costaii A37T2^T was generated at the DOE Joint Genome Institute (JGI) using Illumina technology [15]. An Illumina standard shotgun library with a 300 bp insert size was constructed and sequenced on the Illumina HiSeq-2500 1TB platform, generating 9,965,394 reads totaling 1494.8 Mbp. General aspects of library construction and sequencing performed at the JGI are described at [16]. All raw Illumina sequence data were filtered using BBDuk [17], which removes known Illumina artifacts and PhiX. Reads with more than one “N”, with quality scores (before trimming) averaging less than 8, or shorter than 51 bp (after trimming) were discarded. The remaining reads were mapped to masked versions of human, cat and dog references using BBMap [17] and discarded if identity exceeded 95%. Sequence masking was performed with BBMask [17]. The following steps were then performed for assembly: (1) artifact-filtered Illumina reads were assembled using SPAdes (version 3.6.2) [18]; (2) assembled contigs were discarded if their length was <1 kbp. Parameters for the SPAdes assembly were --cov-cutoff auto --phred-offset 33 -t 8 -m 40 --careful -k 25,55,95 --12.
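The read and contig discard rules described above can be sketched in Python as follows. This only illustrates the stated thresholds and is not the BBDuk or SPAdes implementation. The function names are hypothetical, and counting “N” on the trimmed sequence is an assumption, since the text does not say at which stage this criterion is evaluated.

```python
from statistics import mean

MAX_N = 1             # reads with more than one "N" are discarded
MIN_MEAN_QUALITY = 8  # threshold on the mean quality score before trimming
MIN_LENGTH = 51       # minimum read length (bp) after trimming
MIN_CONTIG_LENGTH = 1000  # contigs shorter than 1 kbp are discarded


def passes_filter(trimmed_seq: str, raw_qualities: list[int]) -> bool:
    """Return True if a read survives the three discard rules.

    trimmed_seq   -- read sequence after trimming
    raw_qualities -- per-base quality scores before trimming (non-empty)
    """
    if trimmed_seq.upper().count("N") > MAX_N:
        return False
    if mean(raw_qualities) < MIN_MEAN_QUALITY:
        return False
    if len(trimmed_seq) < MIN_LENGTH:
        return False
    return True


def filter_contigs(contigs: dict[str, str],
                   min_length: int = MIN_CONTIG_LENGTH) -> dict[str, str]:
    """Keep only contigs whose length is at least min_length."""
    return {name: seq for name, seq in contigs.items() if len(seq) >= min_length}


if __name__ == "__main__":
    # Toy data for illustration only.
    print(passes_filter("A" * 60, [30] * 100))
    print(list(filter_contigs({"short": "A" * 999, "long": "A" * 1000})))
```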

Genome annotation

Protein-coding genes were identified using Prodigal [19] as part of the DOE-JGI genome annotation pipeline [20]. Additional gene prediction analysis and manual functional annotation were performed within the Integrated Microbial Genomes Expert Review system (IMG-ER), which provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context [12, 21]. Genome annotation procedures are detailed in Markowitz et al. [12] and references therein. Briefly, the predicted CDSs were translated and used to search the NCBI nonredundant, UniProt, TIGRFam, Pfam, KEGG, COG and InterPro databases. Transfer RNA genes were identified using the tRNAscan-SE tool, and other non-coding RNAs were found using INFERNAL. Ribosomal RNA genes were predicted using hmmsearch against custom models generated for each type of rRNA.

Genome properties

The draft genome sequence of C. costaii strain A37T2^T comprised 5,074,440 bp, based on 1494.8 Mbp of Illumina data with a mapped coverage of 297.2-fold. The final draft assembly contained 56 contigs in 56 scaffolds of more than 1052 bp. The G + C content was 47.6%. The genome encoded 4204 putative coding sequences (CDSs) (Table 3). Fifty-four percent of the CDSs, corresponding to 2284 proteins, could be assigned to Clusters of Orthologous Groups (COG) families [22] (Table 4). The draft genome sequence contained four ribosomal RNA loci and 50 tRNA loci (Table 3).
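The figures quoted above can be cross-checked with simple arithmetic. The Python sketch below uses only numbers stated in the text, and the function names are hypothetical. Note that dividing the raw yield by the assembly size gives a raw sequencing depth; the reported mapped coverage of 297.2-fold is derived from read mapping and need not equal this ratio.

```python
# Figures quoted in the text
READS = 9_965_394        # number of Illumina reads
TOTAL_BASES = 1494.8e6   # total yield, 1494.8 Mbp
GENOME_SIZE = 5_074_440  # draft assembly size in bp
CDS_TOTAL = 4204         # putative coding sequences
CDS_WITH_COG = 2284      # CDSs assigned to COG families


def mean_read_length(total_bases: float, reads: int) -> float:
    """Average read length in bp."""
    return total_bases / reads


def raw_depth(total_bases: float, genome_size: int) -> float:
    """Raw sequencing depth: total yield divided by assembly size."""
    return total_bases / genome_size


def percent(part: int, whole: int) -> float:
    """Share of `part` in `whole`, as a percentage."""
    return 100 * part / whole


if __name__ == "__main__":
    print(f"mean read length: {mean_read_length(TOTAL_BASES, READS):.1f} bp")
    print(f"raw depth: {raw_depth(TOTAL_BASES, GENOME_SIZE):.1f}-fold")
    print(f"CDSs with COG assignment: {percent(CDS_WITH_COG, CDS_TOTAL):.1f}%")
```

The mean read length of about 150 bp and the COG share of about 54% agree with the values stated in the text.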

Table 3 General genome features of Chitinophaga costaii A37T2^T

Table 4 Number of genes associated with general COG functional categories

The Average Nucleotide Identity between C. costaii A37T2^T and C. pinensis DSM 2588^T was 70.9%, based on 1593 bidirectional best hits and calculated using MiSI [23]. Figure 3 shows a circular map of the genome of C. costaii A37T2^T used as query against the only available complete genome of the genus Chitinophaga, that of C. pinensis DSM 2588^T [2].

The comparison between the draft genome of C. costaii A37T2^T and the complete genome of C. pinensis DSM 2588^T showed 1145 unique genes present only in the genome of C. costaii A37T2^T and 3493 unique genes present only in the genome of C. pinensis DSM 2588^T. Focusing on the unique genes in the genome of strain A37T2^T, it was possible to assign 109 COGs, which are summarized in Table 5.
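The text does not state how unique genes were determined. Purely as an illustration of the bookkeeping involved, the Python sketch below assumes that homology between the two genomes is available as a set of gene pairs and that some genes carry a COG identifier; all names and identifiers are hypothetical toy values, not data from this study.

```python
def unique_genes(genes_a: set[str],
                 homolog_pairs: set[tuple[str, str]]) -> set[str]:
    """Genes of genome A with no recorded homolog in genome B.

    homolog_pairs holds (gene_in_A, gene_in_B) tuples.
    """
    matched_in_a = {gene_a for gene_a, _ in homolog_pairs}
    return genes_a - matched_in_a


def cogs_of(genes: set[str], gene_to_cog: dict[str, str]) -> set[str]:
    """Distinct COG identifiers among the genes that have an assignment."""
    return {gene_to_cog[g] for g in genes if g in gene_to_cog}


if __name__ == "__main__":
    # Toy example: four genes in genome A, one of which has a homolog in B.
    genes_a = {"a1", "a2", "a3", "a4"}
    pairs = {("a1", "b1")}
    gene_to_cog = {"a1": "COG0002", "a2": "COG0001", "a3": "COG0001"}
    unique = unique_genes(genes_a, pairs)
    print(sorted(unique), sorted(cogs_of(unique, gene_to_cog)))
```

Because several unique genes can share one COG and others may have no COG assignment at all, the number of distinct COGs can be much smaller than the number of unique genes.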

Table 5 Unique Clusters of Orthologous Groups present in the genome of C. costaii A37T2^T

Insights from the genome sequence

The draft genome sequence of C. costaii A37T2^T carries multiple genes involved in cellulolytic activity, including one gene encoding the enzyme cellulase (SCC15587) and six genes encoding β-glucosidase (SCB82491, SCB92249, SCB95191, SCC15475, SCC57293, SCC61957), which might be involved in cellulose degradation in the environment and in biotechnological processes [24]. As expected for this genus, four genes encoding chitinases (SCC19468, SCC19522, SCC23114, SCC34676) were found. Six genes encoded lysophospholipase L1, including representatives of both size groups, i.e. less than 300 aa (SCB77875, SCC28514, SCC37316, SCC54197) and less than 500 aa (SCB98645, SCC50813). Moreover, the genome of strain A37T2^T encoded 1-aminocyclopropane-1-carboxylate deaminase (SCB80758), a hydrolase that might be involved in lowering ethylene levels in the plant [25]. In summary, the genome sequence suggests several ways in which the strain could interact with plant metabolism.

Conclusions

This work contributes the genome sequence of the type strain of C. costaii, A37T2^T (=CIP 110584^T, =LMG 27458^T), an endophyte of P. pinaster affected by PWD. The genome encoded multiple genes involved in cellulolytic activity, and the sequence provided insights into the role of bacteria in PWD. As only a few bacterial genomes related to PWD are available, this work contributes to the field.

Abbreviations

PWD: Pine wilt disease
PWN: Pinewood nematode

References

1. Proença DN, Nobre MF, Morais PV. Chitinophaga costaii sp. nov., an endophyte of Pinus pinaster, and emended description of Chitinophaga niabensis. Int J Syst Evol Microbiol. 2014;64:1237–43.
2. List of prokaryotic names with standing in nomenclature. http://www.bacterio.net. Accessed 2 Dec 2016.
3. Glavina Del Rio T, Abt B, Spring S, Lapidus A, Nolan M, Tice H, et al. Complete genome sequence of Chitinophaga pinensis type strain (UQM 2034^T). Stand Genomic Sci. 2010;2:87–95.
4. Proença DN, Grass G, Morais PV. Understanding pine wilt disease: roles of the pine endophytic bacteria and of the bacteria carried by the disease-causing pinewood nematode. Microbiology. 2016;0:1–20.
5. Pruesse E, Peplies J, Glöckner FO. SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28:1823–9.
6. “The All-Species Living Tree” Project. http://www.arb-silva.de/projects/living-tree. Accessed 15 Jan 2016.
7. Ludwig W, Strunk O, Westram R, Richter L, Meier H, Yadhukumar, et al. ARB: a software environment for sequence data. Nucleic Acids Res. 2004;32:1363–71.
8. Jukes TH, Cantor CR. Evolution of protein molecules. In: Munro HN, editor. Mamm. Protein Metab. New York: Academic Press; 1969. p. 21–132.
9. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406–25.
10. Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90.
11. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution (N Y). 1985;39:783–91.
12. Markowitz VM, Chen IMA, Chu K, Szeto E, Palaniappan K, Pillay M, et al. IMG/M 4 version of the integrated metagenome comparative analysis system. Nucleic Acids Res. 2014;42:568–73.
13. Kyrpides NC, Hugenholtz P, Eisen JA, Woyke T, Göker M, Parker CT, et al. Genomic encyclopedia of bacteria and Archaea: sequencing a myriad of type strains. PLoS Biol. 2014;12:1–7.
14. Whitman WB, Woyke T, Klenk H-P, Zhou Y, Lilburn TG, Beck BJ, et al. Genomic encyclopedia of bacterial and Archaeal type strains, phase III: the genomes of soil and plant-associated and newly described type strains. Stand Genomic Sci. 2015;10:26.
15. Bennett S. Solexa Ltd. Pharmacogenomics. 2004;5:433–8.
16. Joint Genome Institute. http://www.jgi.doe.gov. Accessed 2 Jan 2017.
17. BBMap short read aligner, and other bioinformatic tools. http://sourceforge.net/projects/bbmap. Accessed 15 Apr 2016.
18. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.
19. Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119.
20. Huntemann M, Ivanova NN, Mavromatis K, Tripp HJ, Paez-Espino D, Palaniappan K, et al. The standard operating procedure of the DOE-JGI microbial genome annotation pipeline (MGAP v.4). Stand Genomic Sci. 2015;10:86.
21. Chen I-MA, Markowitz VM, Palaniappan K, Szeto E, Chu K, Huang J, et al. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system. BMC Genomics. 2016;17:307.
22. Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28:33–6.
23. Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, et al. Microbial species delineation using whole genome sequences. Nucleic Acids Res. 2015;43:6761–71.
24. Adrio JL, Demain AL. Microbial enzymes: tools for biotechnological processes. Biomol Ther. 2014;4:117–39.
25. Glick BR. Plant growth-promoting bacteria: mechanisms and applications. Scientifica (Cairo). 2012;2012:1–15.
26. Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7.
27. Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87:4576–9.
28. Editor L. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol. 2012;62:1–4.
29. Krieg N, Ludwig W, Euzéby J, Whitman W. Phylum XIV. Bacteroidetes phyl. nov. In: Krieg N, Staley J, Brown D, Hedlund B, Paster B, Ward N, et al., editors. Bergey’s Man. Syst. Bacteriol. Second ed. New York: Springer; 2011. p. 25.
30. Kämpfer P. Class III. Sphingobacteriia class. nov. In: Krieg NR, Staley J, Brown D, Hedlund B, Paster B, Ward N, et al., editors. Bergey’s Man. Syst. Bacteriol. Second ed. New York: Springer; 2011. p. 330.
31. Kämpfer P. Order I. Sphingobacteriales ord. nov. In: Krieg N, Staley J, Brown D, Hedlund B, Paster B, Ward N, et al., editors. Bergey’s Man. Syst. Bacteriol. Second ed. New York: Springer; 2011. p. 330.
32. Kämpfer P, Lodders N, Falsen E. Hydrotalea flava gen. nov., sp. nov., a new member of the phylum Bacteroidetes and allocation of the genera Chitinophaga, Sediminibacterium, Lacibacter, Flavihumibacter, Flavisolibacter, Niabella, Niastella, Segetibacter, Parasegetibacter, Terrimonas, Fer. Int J Syst Evol Microbiol. 2011;61:518–23.
33. Sangkhobol V, Skerman VBD. Chitinophaga, a new genus of chitinolytic myxobacteria. Int J Syst Bacteriol. 1981;31:285–93.
34. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25:25–9.

Funding

We thank Ana Paula Piedade for SEM analysis. This work was supported by CEMMPRE and by Fundação para a Ciência e a Tecnologia (FCT) under the project UID/EMS/00285/2013. D.N.P. was supported by FCT, postdoctoral fellowship SFRH/BPD/100721/2014. The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

Author information

Authors and Affiliations

CEMMPRE, University of Coimbra, 3030-788, Coimbra, Portugal
Diogo N. Proença & Paula V. Morais
Department of Microbiology, 527 Biological Sciences Building, University of Georgia, Athens, GA, 30602-2605, USA
William B. Whitman
DOE Joint Genome Institute 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
Nicole Shapiro, Tanja Woyke & Nikos C. Kyrpides
Department of Life Sciences, FCTUC, Faculty of Sciences and Technology, University of Coimbra, Calçada Martim de Freitas, 3001-401, Coimbra, Portugal
Paula V. Morais

Contributions

DNP isolated the strain, extracted the DNA, performed laboratory experiments, analyzed all the data and, with PVM, wrote the manuscript. WBW, NS, TW and NCK did the genome sequencing, assembly and annotation. WBW, NS, TW and NCK revised the manuscript. All the authors read and approved the final manuscript.

Corresponding author

Correspondence to Paula V. Morais.

Ethics declarations

Competing interests

The authors have no competing interests to declare.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Proença, D.N., Whitman, W.B., Shapiro, N. et al. Draft genome sequence of the cellulolytic endophyte Chitinophaga costaii A37T2^T. Stand in Genomic Sci 12, 53 (2017). https://doi.org/10.1186/s40793-017-0262-2

Received: 06 January 2017
Accepted: 22 August 2017
Published: 06 September 2017
DOI: https://doi.org/10.1186/s40793-017-0262-2
