Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group

Nowicki, Grzegorz; Walkowiak-Nowicka, Karolina; Zemleduch-Barylska, Agata; Mleczko, Anna; Frąckowiak, Patryk; Nowaczyk, Natalia; Kozdrowska, Emilia; Barylski, Jakub

doi:10.1007/s00705-017-3419-9

Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group

Annotated Sequence Record
Open access
Published: 27 May 2017

Volume 162, pages 2907–2911, (2017)
Cite this article

Download PDF

You have full access to this open access article

Archives of Virology Aims and scope Submit manuscript

Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group

Download PDF

Grzegorz Nowicki¹^na1,
Karolina Walkowiak-Nowicka²,
Agata Zemleduch-Barylska¹,
Anna Mleczko¹,
Patryk Frąckowiak¹,
Natalia Nowaczyk¹,
Emilia Kozdrowska¹ &
…
Jakub Barylski¹^na1

2232 Accesses
8 Citations
2 Altmetric
Explore all metrics

Abstract

In this paper, we describe two independent isolates of a new member of the subfamily Autographivirinae, Pseudomonas phage KNP. The type strain (KNP) has a linear, 40,491-bp-long genome with GC content of 57.3%, and 50 coding DNA sequences (CDSs). The genome of the second strain (WRT) contains one CDS less, encodes a significantly different tail fiber protein and is shorter (40,214 bp; GC content, 57.4%). Phylogenetic analysis indicates that both KNP and WRT belong to the genus T7virus. Together with genetically similar Pseudomonas phages (gh-1, phiPSA2, phiPsa17, PPPL-1, shl2, phi15, PPpW-4, UNO-SLW4, phiIBB-PF7A, Pf-10, and Phi-S1), they form a divergent yet coherent group that stands apart from the T7-like viruses (sensu lato). Analysis of the diversity of this group and its relatedness to other members of the subfamily Autographivirinae led us to the conclusion that this group might be considered as a candidate for a new genus.

The Family Enterobacteriaceae

Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the ICTV bacterial viruses subcommittee

Article Open access 23 January 2023

An overview of influenza A virus genes, protein functions, and replication cycle highlighting important updates

Article 26 April 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The Pseudomonas fluorescens group includes bacteria commonly found in soil, fresh water, and seawater. Its members can be used to control plant diseases and are well known for their growth-promoting properties [1]. On the other hand, these microorganisms are also involved in food spoilage [2, 3].

To date, there are at least 293 sequenced phages infecting members of the genus Pseudomonas, eight of which infect bacteria from the P. fluorescens group [4].

In this paper, we describe two novel phages infecting the Pseudomonas strain GL3, which was isolated during earlier studies from Lake Góreckie, located in Wielkopolska National Park (Western Poland) [5, 6]. Based on the sequence of marker genes (16S rRNA, gyrB, and rpoB), we unambiguously assigned this bacterial strain to belong to the above-mentioned group but were unable to classify it at the species level.

Phages infecting strain GL3 were isolated independently from the same region (Wielkopolska Province, Poland): the first one from sediments of a park pond in Śrem, and the other from silt of the Warta River, collected in Poznań (near the influx of treated sewage from the city’s left-bank treatment plant) in the summer of 2014. Phage isolation was a part of a student scientific project; therefore, the name of the first phage (KNP) is an acronym for the Student Scientific Society in Polish. The second name (WRT) is an abbreviation of the sampling site where the phage was found. Phage particles were purified from infected lysates according to “Protocol: CsCl phage prep” by the Center for Phage Technology, Texas A&M University (available at https://cpt.tamu.edu/wordpress/wp-content/uploads/2011/12/CsCl-phage-prep-08-17-2011.pdf). Phage genomic DNA was extracted from the purified phage particles using a QIAamp DNA Mini Kit according to the manufacturer’s instructions.

Genomes of both phage isolates were sequenced using an Illumina MiSeq at Genomed SA (Warsaw, Poland). After removal of the adapter sequences, reads were quality trimmed and randomly subsampled with Trimmomatic GPL v3 [7] and BBDuk v35.82 (http://jgi.doe.gov/data-and-tools/bbtools/) to obtain libraries with sizes corresponding to ~300 times the expected genome size. Prepared libraries were assembled using Geneious 9.1.6 (the software reported coverages of 289.8× for KNP and 296.8× for WRT) [8], MIRA 4.0 (261.0× KNP, 264.6× WRT) [9], Velvet Optimiser 1.2.10 (181.4× KNP ×128.8 WRT) [10], Edena v3_131028 (284.3× KNP, 290.8× WRT) [11] and SPAdes v3.9.0 (39,1× KNP, 41.6× WRT) [12]. The combination of the five different tools allowed us to cross-validate different assemblies. Additional verification was performed by mapping the raw reads back to each genome (we obtained mean coverages of 1209.3× for KNP and 2547.1× for WRT using the Geneious read mapper with medium settings). The obtained mapping was also used to determine the physical termini of both genomes (based on the read arrangement analysis with the Pause pipeline, available at https://cpt.tamu.edu/computer-resources/pause).

Protein-coding genes were predicted using GeneMarkS v4.32 [13], Glimmer 3.02 (iterative training) [14], PRODIGAL v2.6.3 [15], MetaGeneAnnotator v2008/8/19 [16], and ZCURVE_V (ZCURVE package 3.0) [17]. tRNA genes were predicted using tRNAscan-SE [18] (though none were found). Again, predictions generated with the different programs were compared, and CDSs identified by only a single tool with no BLAST hits against the RefSeq database were disregarded. Conflicting start codons were resolved based on majority voting of the prediction algorithms (which included BLAST hits). BLASTx alignments, together with conserved domains detected by InterProScan [19], were used to assign functions to protein products of the predicted genes. Both gene arrangements and functional annotations were subjected to detailed manual curations that included BLASTp searches against multiple databases (nr, RefSeq, UniProtKB), domain localisation (InterProScan, CD-Search [20]), ribosome binding site inspection, and literature review. Finally, PHIRE ver.1.00 [21] was employed to detect conserved phage regulatory elements.

The KNP and WRT genomes are composed of single linear DNA molecules with direct terminal repeats of 219 bp. Their lengths are 40,491 bp (KNP) and 40,214 bp (WRT), with GC contents of 57.3% and 57.4%, respectively. The two sequences are very similar (they share 97.1% identity in MAFFT [22] comparison; algorithm FFT-NS-2, scoring matrix 200PAM/k = 2, gap open penalty 1.53 and offset value 0.123), and their organisation is typical for genomes of autographiviruses. The left arm of each genome encodes predominantly proteins involved in DNA replication, while the right arm harbours genes involved in particle assembly. We predicted 50 CDSs in the KNP genome, all on the same strand. Putative functions were assigned to 28 of the CDSs (56%), while 22 (44%) had no known function. The gene arrangement in the WRT genome is virtually the same, although CDS 14, encoding a hypothetical protein with unknown function, is missing (Fig. 1). Almost all predicted genes have homologues in other viruses, mainly autographiviruses infecting Pseudomonas spp. CDS 3, present in both genomes, is the only gene that has no similarity to any known viral or bacterial sequences. Most other differences between these genomes can be described as single-nucleotide polymorphisms. However, the region encoding the C-terminus of the tail fibre and the following gene with unknown function (Fig. 1) vary significantly between analysed phages (the similarity drops below 60% over the 1-kb stretch of DNA).

Both phages are genetically similar to Pseudomonas phages phiPsa17 (KNP 93.1% and WRT 93.9% identity) and gh-1 (both 87.0% identity), which belong to the genus T7virus [4]. International Committee on Taxonomy of Viruses (ICTV) guidelines recommend DNA sequence identity of 95% as a threshold for species delineation. Thus, we report phages KNP and WRT as two isolates of the same candidate species, which we would like to name “Pseudomonas virus KNP”. Our phylogenetic analysis (see Fig. 2 and Supplementary file S1) supports the inclusion of this species in the genus T7virus. Careful examination of the results (discussed in detail in supplementary file S1 [23,24,25,26,27]) indicate that this genus is much more diverse than other currently approved genera in the subfamily Autographivirinae. Thus, we would like to suggest that this taxon could be split and that the formation of a new genus, “Gh1virus” (clustering T7-like Pseudomonas phages), might be considered.

Nucleotide sequence accession numbers

The complete genomes of the phage KNP and WRT have been deposited in the NCBI database under the GenBank accession numbers KY798121 and KY798120, respectively.

References

Compant S, Clément C, Sessitsch A (2010) Plant growth-promoting bacteria in the rhizo- and endosphere of plants: their role, colonization, mechanisms involved and prospects for utilization. Soil Biol Biochem 42:669–678
Article CAS Google Scholar
Rajmohan S, Dodd CER, Waites WM (2002) Enzymes from isolates of Pseudomonas fluorescens involved in food spoilage. J Appl Microbiol 93:205–213
Article CAS PubMed Google Scholar
Nogarol C, Acutis PL, Bianchi DM et al (2013) Molecular characterization of Pseudomonas fluorescens isolates involved in the Italian “Blue Mozzarella” event. J Food Prot 76:500–504
Article CAS PubMed Google Scholar
NCBI Resource Coordinators (2017) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 45:D12–D17
Article Google Scholar
Barylski J, Nowicki G, Goździcka-Józefiak A (2014) The discovery of phiAGATE, a novel phage infecting Bacillus pumilus, leads to new insights into the phylogeny of the subfamily Spounavirinae. PLoS One 9:e86632
Article PubMed PubMed Central Google Scholar
Nowicki G, Barylski J, Kujawa N, Goździcka-Józefiak A (2014) Complete genome sequence of Lelliottia Podophage phD2B. Genome Announc. doi:10.1128/genomeA.01046-14
PubMed PubMed Central Google Scholar
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120
Article CAS PubMed PubMed Central Google Scholar
Kearse M, Moir R, Wilson A et al (2012) Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649
Article PubMed PubMed Central Google Scholar
Chevreux B, Wetter T, Suhai S (1999) Genome sequence assembly using trace signals and additional sequence information. Comput Sci Biol Proc German Conf Bioinform 99:45–56
Google Scholar
Zerbino DR (2010) Using the Velvet de novo assembler for short-read sequencing technologies. Curr Protoc Bioinformatics Chapter 11:Unit 11.5
Hernandez D, François P, Farinelli L et al (2008) De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. Genome Res 18:802–809
Article CAS PubMed PubMed Central Google Scholar
Bankevich A, Nurk S, Antipov D et al (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477
Article CAS PubMed PubMed Central Google Scholar
Borodovsky M, Lomsadze A (2014) Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite. In: Current protocols in microbiology, pp 1E.7.1–1E.7.17
Delcher AL, Bratke KA, Powers EC, Salzberg SL (2007) Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679
Article CAS PubMed PubMed Central Google Scholar
Hyatt D, Chen G-L, Locascio PF et al (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform 11:119
Article Google Scholar
Noguchi H, Taniguchi T, Itoh T (2008) MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes. DNA Res 15:387–396
Article CAS PubMed PubMed Central Google Scholar
Hua Z-G, Lin Y, Yuan Y-Z et al (2015) ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes. Nucleic Acids Res 43:W85–W90
Article CAS PubMed PubMed Central Google Scholar
Lowe TM, Chan PP (2016) tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res 44:W54–W57
Article CAS PubMed PubMed Central Google Scholar
Jones P, Binns D, Chang H-Y et al (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240
Article CAS PubMed PubMed Central Google Scholar
Marchler-Bauer A, Bryant SH (2004) CD-Search: protein domain annotations on the fly. Nucleic Acids Res 32:W327–W331
Article CAS PubMed PubMed Central Google Scholar
Lavigne R, Sun WD, Volckaert G (2004) PHIRE, a deterministic approach to reveal regulatory elements in bacteriophage genomes. Bioinformatics 20:629–635
Article CAS PubMed Google Scholar
Katoh K, Misawa K, Kuma K-I, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30:3059–3066
Article CAS PubMed PubMed Central Google Scholar
Larkin MA, Blackshields G, Brown NP et al (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948
Article CAS PubMed Google Scholar
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797
Article CAS PubMed PubMed Central Google Scholar
Darriba D, Taboada GL, Doallo R, Posada D (2011) ProtTest-HPC: fast selection of best-fit models of protein evolution. In: Lecture Notes in Computer Science, pp 177–184
Price MN, Dehal PS, Arkin AP (2010) FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 5:e9490
Article PubMed PubMed Central Google Scholar
Adriaenssens EM, Ceyssens P-J, Dunon V et al (2011) Bacteriophages LIMElight and LIMEzero of Pantoea agglomerans, belonging to the “phiKMV-like viruses”. Appl Environ Microbiol 77:3443–3450
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Grzegorz Nowicki and Jakub Barylski contributed equally to this work.

Authors and Affiliations

Department of Molecular Virology, Institute of Experimental Biology, Faculty of Biology, Adam Mickiewicz University, Umultowska 89, 61-614, Poznań, Poland
Grzegorz Nowicki, Agata Zemleduch-Barylska, Anna Mleczko, Patryk Frąckowiak, Natalia Nowaczyk, Emilia Kozdrowska & Jakub Barylski
Department of Animal Physiology and Development, Institute of Experimental Biology, Faculty of Biology, Adam Mickiewicz University, Umultowska 89, 61-614, Poznań, Poland
Karolina Walkowiak-Nowicka

Authors

Grzegorz Nowicki
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Walkowiak-Nowicka
View author publications
You can also search for this author in PubMed Google Scholar
Agata Zemleduch-Barylska
View author publications
You can also search for this author in PubMed Google Scholar
Anna Mleczko
View author publications
You can also search for this author in PubMed Google Scholar
Patryk Frąckowiak
View author publications
You can also search for this author in PubMed Google Scholar
Natalia Nowaczyk
View author publications
You can also search for this author in PubMed Google Scholar
Emilia Kozdrowska
View author publications
You can also search for this author in PubMed Google Scholar
Jakub Barylski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jakub Barylski.

Ethics declarations

Funding

The work presented in this paper was funded by internal funds (RMNiD donation) of the Department of Molecular Virology, Adam Mickiewicz University in Poznań.

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 1210 kb)

Supplementary material 2 (XLS 56 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Nowicki, G., Walkowiak-Nowicka, K., Zemleduch-Barylska, A. et al. Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group. Arch Virol 162, 2907–2911 (2017). https://doi.org/10.1007/s00705-017-3419-9

Download citation

Received: 28 March 2017
Accepted: 15 May 2017
Published: 27 May 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s00705-017-3419-9

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group

Abstract