Abstract
Streptococcus agalactiae (Lancefield group B; GBS) is the causative agent of meningoencephalitis in fish, mastitis in cows, and neonatal sepsis in humans. Meningoencephalitis is a major health problem for tilapia farming and is responsible for high economic losses worldwide. Despite its importance, the genomic characteristics and the main molecular mechanisms involved in virulence of S. agalactiae isolated from fish are still poorly understood. Here, we present the genomic features of the 1,820,886 bp long complete genome sequence of S. agalactiae SA20-06 isolated from a meningoencephalitis outbreak in Nile tilapia (Oreochromis niloticus) from Brazil, and its annotation, consisting of 1,710 protein-coding genes (excluding pseudogenes), 7 rRNA operons, 79 tRNA genes and 62 pseudogenes.
Introduction
Streptococcus agalactiae, also referred to as Group B Streptococcus (GBS), is a Gram-positive pathogen with a broad host range. GBS is the most common cause of life-threatening bacterial infections in human newborns [1] and is an important etiological agent of clinical and sub-clinical bovine mastitis [2]. In fish, S. agalactiae infection causes septicemia and meningoencephalitis, mainly in warm water species from freshwater, marine, or estuarine environments [3]. Currently, S. agalactiae is an emerging pathogen associated with severe economic losses due to high mortality rates in fish farms worldwide [4,5].
The pangenome of the species (estimated from only eight genomes of human strains) is considered open, and approximately 33 new strain-specific genes are expected to be identified for every new GBS genome sequenced [6]. Since then, the first genome of an S. agalactiae strain isolated from bovine mastitis was published, and 183 strain-specific genes were described. About 85% of these genes clustered into eight genomic islands, strongly suggesting that they were acquired through lateral gene transfer from other bacteria of the genus Streptococcus that are also etiologic agents of bovine mastitis [2]. However, the molecular mechanisms of virulence and other genomic features of strains isolated from fish remain unclear; thus, genome sequencing of strains isolated from other hosts is still required to better understand the global complexity of this bacterial species.
Classification and Features
The genus Streptococcus comprises a heterogeneous group of bacteria that have an important role in medicine and industry. These microorganisms are Gram-positive cocci, 0.6–1.2 µm in diameter, that are non-motile, do not form spores, are catalase-negative and grow in pairs or chains [7]. Rebecca C. Lancefield, in her work in the early 1930s, systematized the classification of streptococci based on the presence and type of surface antigen: cell wall polysaccharide or lipoteichoic acid [8]. S. agalactiae is classified as Lancefield group B (GBS) based on the presence of a polysaccharide in the cell wall. This polysaccharide is composed of galactose, N-acetylglucosamine, rhamnose and glucitol phosphate [7]. Currently, ten serotypes are described for this species (Ia, Ib, II-IX), and occasionally some strains can be non-serotypeable [9].
Major human and animal streptococcal pathogens belong to the pyogenic group of β-hemolytic streptococci [10]. In this context, the β-hemolytic bacterium S. agalactiae deserves attention for causing diseases in a broad range of homeothermic and heterothermic hosts [4], although this bacterium is also a common member of the gastrointestinal tract microbiota [11].
At the end of the 19th century, GBS was initially described as an etiological agent of mastitis in cows, being reported as causing disease in humans only 50 years later [12]. In fish, S. agalactiae was recognized as a pathogen in 1966 [13]. Sporadically, this pathogen has also been associated with illness in many other hosts, such as chickens, camels, dogs, horses, cats, frogs, hamsters, mice, monkeys, and nutria [14].
S. agalactiae is a facultatively anaerobic bacterium that uses glucose as an energy source and is also able to use other carbon sources such as cellobiose, beta-glucoside, trehalose, mannose, lactose, fructose, mannitol and N-acetylgalactosamine (Table 1). This pathogen is limited in the synthesis of most amino acid precursors. Only the biosynthetic pathways for alanine, serine, glycine, glutamine, aspartate, asparagine and threonine are present [31]. The adaptation of this pathogen to oxygen radical stress is related to superoxide dismutase (sodA gene), which converts superoxide anions to molecular oxygen and hydrogen peroxide, which, in turn, is metabolized by catalases and/or peroxidases [34]. Although GBS does not synthesize catalase to remove toxic H2O2, it is 10-fold more resistant to oxygen metabolites than the catalase-producing S. aureus. This resistance is attributed to several enzymes identified in the S. agalactiae genome that might detoxify H2O2, such as NADH peroxidase, NADH oxidase and thiol peroxidase [31]. This diversity of metabolic and adaptive mechanisms reflects the ability of GBS to survive in various environments and hosts.
The phylogenetic tree was constructed using 16S rRNA sequences of available S. agalactiae genomes and other species from the same genus (Figure 1). The tree shows that all S. agalactiae strains are grouped together, and the SA20-06 strain is more similar to the A909 human isolate and to the GD201008-001 fish isolate from China.
Genome sequencing and annotation
Genome project history
This strain was selected for sequencing because of the high mortality rates caused by this pathogen in fish farms worldwide and the lack of information on the genomic characteristics of S. agalactiae isolated from fish and on the molecular mechanisms involved in virulence in this host. The genome project is deposited in the Genomes On Line Database [37], and the complete genome sequence and annotation data of Streptococcus agalactiae SA20-06 were deposited in DDBJ/EMBL/GenBank under the accession number CP003919 (RefSeq NC_019048). Sequencing, assembly, finishing and annotation were performed by teams from the Laboratory of Cellular and Molecular Genetics (LGCM), Minas Gerais, Brazil; Genomics and Proteomics Network of the State of Pará (RPGP), Pará, Brazil and Center for Excellence in Bioinformatics (CEBio-FIOCRUZ-MG), Minas Gerais, Brazil. A summary of the project information is shown in Table 2.
Growth conditions and DNA isolation
Streptococcus agalactiae SA20-06 was obtained from the AQUAVET (Laboratory of Aquatic Animal Diseases) bacterial collection, streaked onto 5% sheep blood agar and incubated at 28°C for 48 h. Cells were then grown in 150 mL of brain-heart-infusion broth (BHI-HiMedia Laboratories Pvt. Ltda, India) under agitation (150 rpm) at 28°C. Genomic DNA was obtained using a phenol-chloroform-isoamyl alcohol extraction protocol with a microwave oven [38].
Genome sequencing and assembly
The genome of S. agalactiae SA20-06 was sequenced using the SOLiD v3 Plus and SOLiD 5500 platforms (Applied Biosystems) with two mate-paired libraries (both with 1–2 kb insert size), which generated 50,223,637 and 283,953,694 reads of 50 bp and 60 bp in length, respectively. After sequencing, the reads were subjected to quality filtering using the qualityFilter.pl script (an in-house script), in which reads with an average Phred quality of less than 20 were removed, and sequence error correction was performed with SAET software (Life Technologies).
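The filtering rule described above (discard reads whose average Phred quality is below 20) can be illustrated with a minimal Python sketch. This is not the qualityFilter.pl script, which is not reproduced here; the function names, the input layout (a read identifier paired with a list of per-base Phred scores) and the treatment of empty reads are assumptions made only for illustration.

```python
from statistics import mean

MIN_MEAN_PHRED = 20  # threshold stated in the text


def passes_quality_filter(quals, min_mean=MIN_MEAN_PHRED):
    """Return True if the read's average Phred score is at least min_mean."""
    # Assumption: a read with no quality values is discarded.
    return bool(quals) and mean(quals) >= min_mean


def filter_reads(reads, min_mean=MIN_MEAN_PHRED):
    """Keep the (read_id, quals) pairs whose mean Phred score passes the filter."""
    return [(rid, q) for rid, q in reads if passes_quality_filter(q, min_mean)]


# Toy reads (hypothetical values, not data from the study).
example_reads = [
    ("read_1", [30, 28, 25, 22]),  # mean above the threshold
    ("read_2", [10, 12, 18, 20]),  # mean below the threshold
    ("read_3", [20, 20, 20, 20]),  # mean exactly at the threshold, kept
]

kept = filter_reads(example_reads)
kept_ids = [rid for rid, _ in kept]
print(kept_ids)
```

A read whose mean equals the threshold is kept here, because the text only states that reads with an average below 20 were removed.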
After quality analysis, 210,004,694 reads were used in the assembly, corresponding to ∼5,700× genome coverage based on the 2,127,839 bp reference genome of S. agalactiae strain A909 (NC_007432). The genome sequence of SA20-06 was assembled with a hybrid strategy using CLC Genomics Workbench 4.9, Velvet [39] and Edena [40]. A total of 872 contigs were generated, with an N50 of 5,221 bp and a smallest contig of 201 bp. Because the hybrid assembly methodology produces redundant contigs, these were removed using the Simplifier software [41]. The contigs were mapped against the reference genome (strain A909) using BLASTn, and the results were analyzed using G4ALL software [42] to extend the contigs and identify overlaps of at least 30 bp between contig ends, thus yielding larger contigs.
These contigs were later subjected to a finishing process using CLC Genomics Workbench software. At this step, the contigs were ordered and oriented by mapping against the reference genome, yielding a preliminary scaffold with gaps that were closed by recursive rounds of short read mapping against the scaffold [43].
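As a rough check on the assembly statistics quoted above, the following Python sketch shows how fold coverage and N50 are commonly computed. It is illustrative only: the function names are hypothetical, and because the proportion of 50 bp and 60 bp reads remaining after filtering is not given, coverage is bracketed by assuming all reads are 50 bp (lower bound) or all 60 bp (upper bound). The contig lengths are toy values, not the 872 contigs of the actual assembly.

```python
def estimate_coverage(n_reads, read_len_bp, genome_size_bp):
    """Approximate fold coverage as total sequenced bases / genome size."""
    return n_reads * read_len_bp / genome_size_bp


def n50(contig_lengths):
    """Largest length L such that contigs of length >= L hold at least
    half of all assembled bases."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:  # integer comparison avoids float rounding
            return length
    raise ValueError("contig_lengths must not be empty")


READS_USED = 210_004_694    # reads retained after quality analysis (from the text)
REFERENCE_SIZE = 2_127_839  # strain A909 reference genome size in bp (from the text)

# Bounds under the assumption that every read is 50 bp or 60 bp long.
low = estimate_coverage(READS_USED, 50, REFERENCE_SIZE)
high = estimate_coverage(READS_USED, 60, REFERENCE_SIZE)
print(f"coverage between {low:.0f}x and {high:.0f}x")

# Toy contig lengths for illustration only.
toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_contigs))
```

The reported value of about 5,700× falls between the two bounds, which is consistent with a mixture of 50 bp and 60 bp reads.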
Genome annotation
For structural annotation, the following software was employed: Glimmer 3, to predict genes [44]; RNAmmer, to predict rRNAs [45]; and tRNAscan-SE, to predict tRNAs [46]. Functional annotation was performed by similarity analyses against the National Center for Biotechnology Information (NCBI) non-redundant and Swiss-Prot public databases, together with InterProScan analysis [47]. Genome visualization and manual annotation were carried out using Artemis [48].
Genome properties
The complete genome of S. agalactiae strain SA20-06 comprises a single circular chromosome of 1,820,886 bp in length with 1,710 predicted protein-coding genes (excluding pseudogenes), 35.56% G+C content, 7 rRNA operons, 79 tRNA genes and 62 pseudogenes (Figure 2 and Table 3). The distribution of genes into the COG functional categories is presented in Table 4.
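For readers who want to reproduce summary statistics of this kind from a sequence file, the sketch below shows one common way to compute G+C content and gene density in Python. The function names are hypothetical, the choice to exclude ambiguous bases (such as N) from the denominator is an assumption, and the sequence is a toy string rather than the SA20-06 chromosome.

```python
def gc_content(sequence):
    """Percentage of G and C among unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no unambiguous bases")
    # Assumption: ambiguous symbols such as N are left out of the denominator.
    return 100.0 * gc / (gc + at)


def genes_per_mb(n_genes, genome_size_bp):
    """Gene density expressed as genes per megabase."""
    return n_genes / (genome_size_bp / 1_000_000)


# Toy sequence for illustration only.
toy_sequence = "ATGCGCNN"
toy_gc = gc_content(toy_sequence)
print(round(toy_gc, 2))

# Gene density from the figures reported in the text.
density = genes_per_mb(1_710, 1_820_886)
print(round(density))
```

Applied to the full chromosome sequence, a function like `gc_content` would be expected to return a value close to the reported 35.56%, although small differences can arise from how ambiguous bases are handled.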
Conclusions
Further analysis of the SA20-06 genome is now under way, with the objective of identifying specific factors that might explain the differences in disease pathogenesis, mainly in heterothermic hosts.
References
Rajagopal L. Understanding the regulation of Group B Streptococcal virulence factors. Future Microbiol 2009; 4:201–221. PubMed http://dx.doi.org/10.2217/17460913.4.2.201
Richards VP, Lang P, Bitar PD, Lefébure T, Schukken YH, Zadoks RN, Stanhope MJ. Comparative genomics and the role of lateral gene transfer in the evolution of bovine adapted Streptococcus agalactiae. Infect Genet Evol 2011; 11:1263–1275. PubMed http://dx.doi.org/10.1016/j.meegid.2011.04.019
Evans JJ, Klesius PH, Gilbert PM, Shoemaker CA, Al Sarawi MA, Landsberg J, Duremdez R, Al Marzouk A, Al Zenki S. Characterization of β-haemolytic Group B Streptococcus agalactiae in cultured seabream, Sparus auratus L., and wild mullet, Liza klunzingeri (Day), in Kuwait. J Fish Dis 2002; 25:505–513. http://dx.doi.org/10.1046/j.1365-2761.2002.00392.x
Mian GF, Godoy DT, Leal CAG, Yuhara TY, Costa GM, Figueiredo HCP. Aspects of the natural history and virulence of S. agalactiae infection in Nile tilapia. Vet Microbiol 2009; 136:180–183. PubMed http://dx.doi.org/10.1016/j.vetmic.2008.10.016
Duremdez R, Al-Marzouk A, Qasem JA, Al-Harbi A, Gharabally H. Isolation of Streptococcus agalactiae from cultured silver pomfret, Pampus argenteus (Euphrasen), in Kuwait. J Fish Dis 2004; 27:307–310. PubMed http://dx.doi.org/10.1111/j.1365-2761.2004.00538.x
Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, Angiuoli SV, Crabtree J, Jones AL, Durkin AS. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pangenome”. Proc Natl Acad Sci USA 2005; 102:13950–13955. PubMed http://dx.doi.org/10.1073/pnas.0506758102
Schuchat A. Epidemiology of group B streptococcal disease in the United States: shifting paradigms. Clin Microbiol Rev 1998; 11:497–513. PubMed
Lancefield RC. A serological differentiation of specific types of bovine hemolytic streptococci (Group B). J Exp Med 1934; 59:441–458. PubMed http://dx.doi.org/10.1084/jem.59.4.441
Slotved HC, Kong F, Lambertsen L, Sauer S, Gilbert GL. Serotype IX, a Proposed New Streptococcus agalactiae Serotype. J Clin Microbiol 2007; 45:2929–2936. PubMed http://dx.doi.org/10.1128/JCM.00117-07
Carvalho MG, Facklam R, Jackson D, Beall B, McGee L. Evaluation of three commercial broth media for pigment detection and identification of a group B Streptococcus (Streptococcus agalactiae). J Clin Microbiol 2009; 47:4161–4163. PubMed http://dx.doi.org/10.1128/JCM.01374-09
Brochet M, Couvé E, Zouine M, Vallaeys T, Rusniok C, Lamy M, Buchrieser C, Trieu-Cuot P, Kunst F, Poyart C, Glaser P. Genomic diversity and evolution within the species Streptococcus agalactiae. Microbes Infect 2006; 8:1227–1243. PubMed http://dx.doi.org/10.1016/j.micinf.2005.11.010
Bisharat N, Crook DW, Leigh J, Harding RM, Ward PN, Coffey TJ, Maiden MC, Peto T, Jones N. Hyperinvasive Neonatal Group B Streptococcus Has Arisen from a Bovine Ancestor. J Clin Microbiol 2004; 42:2161–2167. PubMed http://dx.doi.org/10.1128/JCM.42.5.2161-2167.2004
Robinson JA, Meyer FP. Streptococcal Fish Pathogen. J Bacteriol 1966; 92:512. PubMed
Pereira UP, Mian GF, Oliveira ICM, Benchetrit LC, Costa GM, Figueiredo HCP. Genotyping of Streptococcus agalactiae strains isolated from fish, human and cattle and their virulence potential in Nile tilapia. Vet Microbiol 2010; 140:186–192. PubMed http://dx.doi.org/10.1016/j.vetmic.2009.07.025
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol 2008; 26:541–547. PubMed http://dx.doi.org/10.1038/nbt1360
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. PubMed http://dx.doi.org/10.1073/pnas.87.12.4576
Gibbons NE, Murray RGE. Proposals Concerning the Higher Taxa of Bacteria. Int J Syst Bacteriol 1978; 28:1–6. http://dx.doi.org/10.1099/00207713-28-1-1
Garrity GM, Holt JG. The Road Map to the Manual. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 1, Springer, New York, 2001, p. 119–169.
Murray RGE. The Higher Taxa, or, a Place for Everything…? In: Holt JG (ed), Bergey’s Manual of Systematic Bacteriology, First Edition, Volume 1, The Williams and Wilkins Co., Baltimore, 1984, p. 31–34.
Euzéby J. List of new names and new combinations previously effectively, but not validly, published. List no. 132. Int J Syst Evol Microbiol 2010; 60:469–472. http://dx.doi.org/10.1099/ijs.0.022855-0
Ludwig W, Schleifer KH, Whitman WB. Class I. Bacilli class nov. In: De Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, Rainey FA, Schleifer KH, Whitman WB (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer-Verlag, New York, 2009, p. 19–20.
Ludwig W, Schleifer KH, Whitman WB. Order II. Lactobacillales ord. nov. In: De Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, Rainey FA, Schleifer KH, Whitman WB (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer-Verlag, New York, 2009, p. 464.
Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol 1980; 30:225–420. http://dx.doi.org/10.1099/00207713-30-1-225
Deibel RH, Seeley HW. Family II. Streptococcaceae. In: Buchanan RE, Gibbons NE (eds), Bergey’s Manual of Determinative Bacteriology, Eighth Edition, The Williams and Wilkins Co., Baltimore, 1974, p. 490–515.
Rosenbach FJ. In: Bergmann JF (ed), Microorganismen bei den Wund-Infections-Krankheiten des Menschen., Wiesbaden, 1884, p. 1–122.
Deibel RH, Seeley HW. Genus I. Streptococcus Rosenbach 1884, 22. In: Buchanan RE, Gibbons NE (eds), Bergey’s Manual of Determinative Bacteriology, Eighth Edition, The Williams and Wilkins Co., Baltimore, 1974, p. 490–509.
Lehmann KB, Neumann R. Atlas und Grundriss der Bakteriologie und Lehrbuch der speziellen bakteriologischen Diagnostik, First Edition, J.F. Lehmann, München, 1896, p. 1–448.
Judicial Commission. Opinions 4, 6, 7, 8, 9, 10, 11, 12, 13, 14 Opinion 8. Int Bull Bacteriol Nomencl Taxon 1954; 4:141–158.
Judicial Commission. Opinion 27: Designation of the neotype strain of Streptococcus agalactiae Lehmann and Neumann. Int Bull Bacteriol Nomencl Taxon 1963; 13:37.
Whiley RA, Hardie JM. The Firmicutes. In: Garrity GM, Boone DR, Castenholz RW (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 3, Springer, New York, 2001, p. 655–735.
Glaser P, Rusniok C, Buchrieser C, Chevalier F, Frangeul L, Msadek T, Zouine M, Couvé E, Lalioui L, Poyart C. Genome sequence of Streptococcus agalactiae, a pathogen causing invasive neonatal disease. Mol Microbiol 2002; 45:1499–1513. PubMed http://dx.doi.org/10.1046/J.1365-2958.2002.03126.x
Moura H, Woolfitt AR, Carvalho MG, Pavlopoulos A, Teixeira LM, Satten GA, Barr JR. MALDI-TOF mass spectrometry as a tool for differentiation of invasive and noninvasive Streptococcus pyogenes isolates. FEMS Immunol Med Microbiol 2008; 53:333–342. PubMed http://dx.doi.org/10.1111/J.1574-695X.2008.00428.X
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. PubMed http://dx.doi.org/10.1038/75556
Poyart C, Pellegrini E, Gaillot O, Boumaila C, Baptista M, Trieu-Cuot P. Contribution of Mn-cofactored superoxide dismutase (SodA) to the virulence of Streptococcus agalactiae. Infect Immun 2001; 69:5098–5106. PubMed http://dx.doi.org/10.1128/IAI.69.8.5098-5106.2001
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007; 23:2947–2948. PubMed http://dx.doi.org/10.1093/bioinformatics/btm404
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 2011; 28:2731–2739. PubMed http://dx.doi.org/10.1093/molbev/msr121
Liolios K, Chen IM, Mavromatis K, Tavernarakis N, Hugenholtz P, Markowitz VM, Kyrpides NC. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 2010; 38:D346–D354. PubMed http://dx.doi.org/10.1093/nar/gkp848
Bollet C, Gevaudan MJ, de Lamballerie X, Zandotti C, de Micco P. A simple method for the isolation of chromosomal DNA from Gram positive or acid-fast bacteria. Nucleic Acids Res 1991; 19:1955. PubMed http://dx.doi.org/10.1093/nar/19.8.1955
Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 2008; 18:821–829. PubMed http://dx.doi.org/10.1101/gr.074492.107
Hernandez D, François P, Farinelli L, Osterås M, Schrenzel J. De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. Genome Res 2008; 18:802–809. PubMed http://dx.doi.org/10.1101/gr.072033.107
Tsai IJ, Otto TD, Berriman M. Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps. Genome Biol 2010; 11:R41. PubMed http://dx.doi.org/10.1186/gb-2010-11-4-r41
G4ALL. http://g4all.sourceforge.net
Ramos RTJ, Carneiro AR, Azevedo V, Schneider MP, Barh D, Silva A. Simplifier: a web tool to eliminate redundant NGS contigs. Bioinformation 2012; 8:996–999. PubMed http://dx.doi.org/10.6026/97320630008996
Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 1999; 27:4636–4641. PubMed http://dx.doi.org/10.1093/nar/27.23.4636
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 2007; 35:3100–3108. PubMed http://dx.doi.org/10.1093/nar/gkm160
Lowe TM, Eddy SR. tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Res 1997; 25:955–964. PubMed
Zdobnov EM, Apweiler R. InterProScan — an integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001; 17:847–848. PubMed http://dx.doi.org/10.1093/bioinformatics/17.9.847
Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B. Artemis: sequence visualization and annotation. Bioinformatics 2000; 16:944–945. PubMed http://dx.doi.org/10.1093/bioinformatics/16.10.944
Grant JR, Arantes AS, Stothard P. Comparing thousands of circular genomes using the CGView Comparison Tool. BMC Genomics 2012; 13:202. PubMed http://dx.doi.org/10.1186/1471-2164-13-202
Acknowledgements
This work was supported by Ministério da Pesca e Aquicultura, Furnas Centrais Elétricas, Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG). We also acknowledge support from the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) and Rede Paraense de Genômica e Proteômica.
Cite this article
de Pádua Pereira, U., dos Santos, A.R., Hassan, S.S. et al. Complete genome sequence of Streptococcus agalactiae strain SA20-06, a fish pathogen associated to meningoencephalitis outbreaks. Stand in Genomic Sci 8, 188–197 (2013). https://doi.org/10.4056/sigs.3687314
DOI: https://doi.org/10.4056/sigs.3687314