Abstract
In the arms race between bacteria and bacteriophages (phages), some large-genome jumbo phages have evolved a protein shell that encloses their replicating genome to protect it against host immune factors. By segregating the genome from the host cytoplasm, however, the ‘phage nucleus’ introduces the need to specifically translocate messenger RNA and proteins through the nuclear shell and to dock capsids on the shell for genome packaging. Here, we use proximity labeling and localization mapping to systematically identify proteins associated with the major nuclear shell protein chimallin (ChmA) and other distinctive structures assembled by these phages. We identify six uncharacterized nuclear-shell-associated proteins, one of which directly interacts with self-assembled ChmA. The structure and protein–protein interaction network of this protein, which we term ChmB, suggest that it forms pores in the ChmA lattice that serve as docking sites for capsid genome packaging and may also participate in messenger RNA and/or protein translocation.
Similar content being viewed by others
Main
Since the discovery of phages more than a century ago, research on these remarkable entities has yielded fundamental insights into a broad range of pathways across biology1. Historically, most phage studies have focused on small-genome phages (~30–140 kb), leaving larger ‘jumbo phages’ with genomes over ~200 kb much less well understood despite their abundance in nature2,3,4. We previously showed that one family of jumbo phages forms distinctive structures in infected cells, including a nucleus-like compartment bounded by a proteinaceous shell and a spindle-like structure that centers and rotates this compartment within the host cell5,6,7. The phage nucleus encloses the replicating phage genome and excludes most host proteins, including CRISPR effectors and restriction enzymes, rendering this family of phages broadly resistant to DNA-targeting bacterial immune systems8,9.
The jumbo phage nuclear shell provides an important selective advantage by protecting the replicating phage genome from host-encoded defense nucleases, but that protection comes at the cost of substantial added complexity in the phage life cycle. The phage nuclear shell is composed primarily of one protein, termed chimallin (ChmA) or phage nuclear enclosure (PhuN), which forms a single-layer-thick flexible lattice that separates the phage genome from the bacterial cytoplasm10,11. Pores in the ChmA lattice are less than ~2 nm in width, large enough to pass metabolites but too small for passage of most proteins or messenger RNAs (mRNAs)10. As in eukaryotes, mRNAs are transcribed within the phage nucleus but translated in the cytoplasm, meaning that phage mRNAs must be translocated out of the nucleus6. At the same time, phage-encoded proteins necessary for genome replication and mRNA transcription must be specifically translocated into the nucleus6,12. Finally, during virion production, newly assembled capsids are trafficked along filaments of the tubulin-like protein PhuZ to the nuclear shell, where they dock for genome packaging6,13. Following genome packaging, capsids are assembled with virion tails at a pair of structures termed the ‘phage bouquets’ before cell lysis and virion release14,15.
The diverse functions of the jumbo phage nuclear shell—including mRNA and protein translocation through the shell and capsid docking on the shell—imply that this structure incorporates multiple components in addition to ChmA that mediate these functions. Here, we use proximity labeling (miniTurboID16) in Pseudomonas aeruginosa cells infected by the jumbo phage ΦPA3 (ref. 17) to identify proteins that localize both within the phage nucleus and specifically to the nuclear shell. We identify six new nuclear-shell-associated proteins, one of which interacts directly with both ChmA and the putative portal protein18. These interactions suggest that this protein, which we term ChmB, forms pores in the ChmA lattice and mediates capsid docking and genome packaging. The overall protein–protein interaction network of ChmB further suggests additional roles in mRNA and/or protein translocation across the phage nuclear shell. More broadly, our data define the protein interaction network of the jumbo phage nuclear shell and reveal the subcellular localization of dozens of previously uncharacterized jumbo phage proteins.
Results
Identification of proteins associated with the jumbo phage nucleus
The genomes of nucleus-forming jumbo phages are poorly characterized: for instance, 290 of the 378 genes encoded by the Pseudomonas jumbo phage ΦPA3 have no annotated function in the NCBI protein database. To overcome this deficit, we used a proximity labeling approach to identify proteins associated with the phage nuclear shell that could endow this structure with additional functionality such as mRNA or protein translocation and capsid docking. We fused the promiscuous biotin ligase miniTurboID16 to the ChmA protein from the jumbo phage ΦPA3 (gp53) and to the phage’s nuclear-localized RecA protein (gp175)7 (Fig. 1a,b and Extended Data Fig. 1a,b). We first verified that fusing either gp53 or gp175 to green fluorescent protein (GFP) and miniTurboID did not alter the localization of these proteins in ΦPA3-infected cells (Extended Data Fig. 1a,b). We then expressed miniTurboID-gp53 or gp175-miniTurboID (lacking GFP to minimize off-target effects) in P. aeruginosa cells infected with ΦPA3, collected samples at 45 min postinfection, which is the earliest time point at which the mature nuclear shell is observed with docked capsids7, and performed streptavidin pulldown and mass spectrometry analysis to identify biotinylated proteins. By focusing on the phage proteins that were biotinylated and normalizing the results to a control miniTurboID-GFP fusion (which remains diffuse in the host cell cytoplasm throughout infection; Extended Data Fig. 1b), we identified candidate proteins that preferentially localize in close proximity to ChmA and/or RecA (Tables 1 and 2 and Supplementary Tables 1 and 2).
To validate our interaction data, we generated GFP fusions of the top 25 ChmA-interacting and RecA-interacting proteins from our miniTurboID datasets (42 proteins plus ChmA and RecA; Supplementary Table 3), expressed each in P. aeruginosa and determined their localization in both uninfected and ΦPA3-infected cells. Proteins that interact with RecA are expected to localize inside the phage nucleus, whereas ChmA-interacting proteins are expected to localize on or near the nuclear shell. Given the architecture of the ChmA lattice, we expected the amino-terminal miniTurboID tag to be localized on the outer surface of the nuclear shell10. Among the 42 proteins tested, we identified eight that localized within the nucleus-like compartment, six that localized to the nuclear shell itself and five that localized to the phage bouquets (Fig. 1c–e). Most proteins that localized within the phage nucleus were annotated in the NCBI database as putative nucleic-acid-interacting proteins, including a subunit of the phage-encoded nonvirion RNA polymerase (nvRNAP; gp62)19, two predicted helicases (gp131 and gp134), a predicted DNA ligase (gp257) and two predicted endonucleases (gp78 and gp210)18 (Fig. 1c and Supplementary Table 3). Two nuclear-localized proteins (gp108 and gp200) had no annotated or predicted function. Of the five proteins that localized to phage bouquets, three (gp12, gp164 and gp166) were predicted phage tail proteins, one (gp233) was a predicted helicase and one (gp355) had no annotated or predicted function18,20,21.
To date, the only known component of the jumbo phage nuclear shell is ChmA6,7,10. Among our list of RecA-interacting and ChmA-interacting proteins, we identified six proteins (gp2, gp61, gp63, gp64, gp148 and gp375) that clearly localized to the nuclear shell upon ΦPA3 infection of P. aeruginosa cells (Fig. 1d and Supplementary Table 3). One of these proteins, gp148, is predicted to be the portal protein of the phage capsid18. In other phages, the portal protein forms a homododecameric complex that orchestrates capsid assembly22 and associates with the terminase to translocate genomic DNA into the capsid23,24,25,26,27. We previously showed that in this family of jumbo phages, capsids are docked on the nuclear shell for genomic DNA packaging6, and our finding that the putative ΦPA3 portal protein associates with the shell suggests that it is directly responsible for capsid docking. Notably, we found that overexpressed GFP-tagged gp148 localized to the phage nuclear shell as early as 30 min postinfection, well before capsid assembly, and docking began at around 45 min postinfection6 (Extended Data Fig. 3). This finding suggests that the portal protein can localize to the nuclear shell on its own, supporting the idea that it directly mediates capsid docking.
Apart from the portal protein, the remaining five nuclear-shell-localized proteins had no predicted function and were not found in previous mass spectrometry studies of mature jumbo phage virions20. Three of these five proteins (gp61, gp63 and gp64) were in a block of genes that is well-conserved across jumbo phages, whereas homologs of the remaining two proteins (gp2 and gp375) could be identified only in Pseudomonas-infecting jumbo phages (Extended Data Fig. 4). None of these five proteins shows detectable sequence homology to any other known protein. Moreover, all five proteins are expressed early in infections, with timing similar to the major nuclear shell protein ChmA6,7. Thus, we speculate that some or all of these proteins are components of the nuclear shell itself, and that they may mediate translocation of mRNA and/or proteins through the nuclear shell and mediate docking of capsids to the shell for genome packaging.
gp2 is an interaction hub at the nuclear shell
Among the identified nuclear-shell-associated proteins, gp2 was among the most highly biotinylated proteins in our miniTurboID-ChmA samples (Table 1). We verified that GFP fusions of both ΦPA3 gp2 and its homolog from the related jumbo phage 201Φ2-1 (also gp2) colocalized with mCherry-fused ChmA in infected cells (Fig. 2a,b). To further define the interaction network of ΦPA3 gp2, we expressed GFP-fused gp2 in ΦPA3-infected P. aeruginosa cells, then purified the protein and interacting partners using GFP affinity chromatography. After purification of gp2 with a carboxy-terminal GFP tag, we identified two strong bands at a molecular weight of ~70 kDa on a silver-stained sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS–PAGE) gel (Fig. 2c). We extracted a gel slice containing these two closely spaced bands and used trypsin mass spectrometry to identify the proteins. In this sample, the strongest signal (149 peptides, 81% sequence coverage) was for ChmA (gp53, 66.7 kDa), and the second strongest (41 peptides, 40% sequence coverage) was for the major capsid protein (gp136, 82.8 kDa; Supplementary Table 4). These data suggest that the observed doublet at ~70 kDa represents these two proteins and that gp2 interacts strongly with both ChmA and capsids.
To further investigate the interactions between nuclear-shell-associated proteins, we next performed mass spectrometry on the full purified samples from GFP-tagged ChmA, gp2 (both N-terminal and C-terminal GFP tags) and the putative nvRNAP subunit gp62 (Fig. 2d, Extended Data Fig. 5a and Supplementary Table 5). We confirmed that this approach could successfully purify functional protein complexes, as we successfully identified all five subunits of the nvRNAP complex in the gp62-GFP pulldown19 (Fig. 2d and Supplementary Table 5). The most enriched protein in GFP-tagged gp2 samples was ChmA, and we also identified the major capsid protein (gp136), the predicted portal protein (gp148), and 14 other proteins annotated as either tail or virion structural proteins (Fig. 2d and Supplementary Table 5). Two predicted phage tail proteins (gp213 and gp11) were also identified in pulldowns with gp62, suggesting that these proteins may simply be highly abundant in cell lysates (Supplementary Table 5). Nonetheless, the strong enrichment of phage structural proteins in GFP-tagged gp2 pulldowns strongly suggests that gp2 interacts directly with capsids, potentially as they dock on the nuclear shell for genomic DNA packaging. Also identified in the GFP-tagged gp2 pulldowns were two other shell-associated proteins, gp63 and gp64 (Fig. 2d and Supplementary Table 5), suggesting that these proteins may interact either directly with gp2 or indirectly through the ChmA lattice. Finally, two additional proteins (gp138 and gp358) were identified in both the ChmA and gp2 GFP pulldowns, as well as having been detected by ChmA miniTurboID labeling (Fig. 2d and Supplementary Tables 1 and 5). These two proteins are conserved across jumbo phages infecting Pseudomonas but have no annotated or predicted function, leaving their potential roles unknown.
gp2 interacts with ChmA and the phage portal protein
We and others have shown that the nuclear shell in diverse jumbo phages including ΦPA3 is composed primarily of ChmA5,6,7, which self-assembles into closed structures in vitro and forms a flexible lattice that surrounds the phage genome in phage-infected cells10,11,28. To confirm that gp2 interacts directly with ChmA, we coexpressed 201Φ2-1 gp2 and ChmA in Escherichia coli and performed Ni2+ pulldowns using a His6-tag on gp2 (ΦPA3 ChmA is poorly expressed in E. coli, precluding analysis in this phage). We found that His6-tagged 201Φ2-1 gp2 robustly interacted with ChmA in this assay, showing that the two proteins directly interact (Fig. 2e and Extended Data Fig. 5b). We next deleted the N- and C-terminal segments of ChmA (NTS and CTS, respectively), which bind to neighboring protomers in the ChmA lattice to mediate nuclear shell assembly. In vitro, deletion of either NTS or CTS led to loss of ChmA self-assembly10, and we found that gp2 was unable to interact with ChmA mutants lacking NTS, CTS, or both NTS and CTS (Fig. 2e). The loss of gp2 binding when deleting either the ChmA NTS or CTS suggests that gp2 does not simply bind one of these tail segments; rather, the data suggest that gp2 interacts specifically with the assembled ChmA lattice.
We next performed a similar coexpression experiment with His6-tagged ΦPA3 gp2 and the portal protein gp148. We found that gp2 could interact directly with gp148 in this assay (Fig. 2f). Combined with our data showing that gp2 interacts directly with self-assembled ChmA, these data suggest that gp2 is an integral component of the phage nuclear shell that is directly involved in the docking and filling of capsids through an interaction with the portal. Based on its localization and probable crucial role in phage nuclear structure and function, we name ΦPA3 gp2 and its homologs in related jumbo phages chimallin B (ChmB).
ChmB forms a homodimer with a novel fold
To determine the structural basis for ChmB interactions with other proteins, we recombinantly purified the protein from several jumbo phages and determined a 2.6 Å resolution crystal structure of ChmB from the related phage PA1C (gp2; 38% identical to ΦPA3 gp2) (Table 3). ChmB formed a homodimer in solution (Fig. 3a and Extended Data Fig. 6a–c), and the structure revealed an intertwined dimeric structure with the N terminus of each protomer forming a short β-strand and an α-helix that pack against the C-terminal globular domain of its dimer mate. Overall, the ChmB dimer adopts a distinctive U shape with dimensions of ~5 × 8 nm (Fig. 3b). Searches with the DALI or FoldSeek protein structure comparison tools29,30 showed no known structural relatives.
ChmB point mutants disrupt phage nucleus formation
To determine the roles of ChmB in phage nucleus formation and function, we first overexpressed GFP-tagged wild-type ΦPA3 ChmB in ΦPA3-infected P. aeruginosa cells (Fig. 4a). In uninfected cells, gp2 overexpression did not cause a significant growth defect, indicating that the protein is not inherently toxic (Extended Data Fig. 7a,b). In infected cells, we observed a striking increase in the average size of the phage nucleus and a concomitant increase in the nuclear DNA content (as measured by total DAPI signal within the nucleus) (Fig. 4b,c and Extended Data Fig. 8a–c). Further, a significant fraction of cells (42%) showed ChmB localization suggestive of multiple juxtaposed phage nuclei or a single phage nucleus with an aberrant shell structure (Fig. 4d).
We next aligned ChmB homologs from jumbo phages that infect Pseudomonas species (Extended Data Fig. 5d) and identified two highly conserved surface residues: Q53 and A159 (ΦPA3 gp2 numbering). We generated point mutations of these residues (Q53A and A159D, respectively) designed to alter the ChmB surface and potentially disrupt specific protein–protein interactions. Whereas ΦPA3-infected P. aeruginosa cells overexpressing wild-type ChmB showed large and/or multiple phage nuclei, infected cells overexpressing either ChmB-Q53A or A159D instead showed a strong disruption in nucleus formation and growth in a large fraction of cells. In around 20% of cells (23% for ChmB-Q53A, 17% for ChmB-A159D), the phage nuclear DNA signal in late infections resembled the puncta usually observed in very early infections, before significant phage nuclear DNA replication and nucleus growth6 (Fig. 4e and Extended Data Fig. 8d,e). In another population of cells (5% for ChmB-Q53A, 7% for ChmB-A159D), nuclear DNA appeared to be entirely absent despite these cells sometimes showing ChmB localization reminiscent of a phage nuclear shell (Fig. 4e). Finally, a third population of cells (14% for ChmB-Q53A, 21% for ChmB-A159D) showed an aberrant nuclear shell structure with nuclear DNA staining that appeared to incompletely fill the nuclear area (Fig. 4e). Overall, nearly half of infected cells expressing mutant ChmB (42% for ChmB-Q53A, 45% for ChmB-A159D) showed abnormal nuclear shell and/or nuclear DNA morphology.
To test whether the ChmB point mutants affected oligomerization or binding to known partner proteins, we first purified wild-type and mutant ΦPA3 ChmB and analyzed the purified proteins by size-exclusion chromatography. All three proteins showed similar elution profiles, indicating that ChmB oligomerization was not affected by the mutations (Extended Data Fig. 7c). Next, we coexpressed His6-tagged 201Φ2-1 ChmB (wild-type and mutants equivalent to ΦPA3 ChmB-Q53A and A159D) with untagged ChmA. Neither mutation affected ChmA association in our pulldown assay (Extended Data Fig. 7d), consistent with our observation that ChmB point mutants localized properly to the phage nuclear shell in infected cells (Fig. 4a). Similarly, neither ChmB mutation affected the ability of His6-tagged ΦPA3 ChmB to associate with the portal protein (gp148) in a coexpression assay (Extended Data Fig. 7e). Thus, the observed effects of ChmB point mutants on phage nuclear shell development are probably not caused by a failure to interact with ChmA or the phage capsids. Rather, these effects may involve other putative interaction partners of ChmB, such as the uncharacterized nuclear-shell-associated proteins gp61, gp63 and gp64.
Discussion
In contrast to well-studied small-genome phages, nucleus-forming jumbo phages build several characteristic structures in infected cells, including the phage nucleus6,7,9,15, PhuZ spindle6,7,13,31 and phage bouquets14,15. Here, we combined proximity labeling with subcellular localization analysis by fluorescence microscopy to establish a subcellular protein localization map comprising 44 phage-encoded proteins (Fig. 5a). Although this map is incomplete, it nonetheless represents a major step in our functional understanding of this distinctive family of phages.
Here, we focus on the phage nucleus, which separates the replicating phage genome from the host cytoplasm and protects the phage from DNA-targeting immune factors encoded by the host. We and others have shown that the jumbo phage nuclear shell is predominantly composed of a single layer of the phage-encoded protein chimallin (ChmA), which assembles into a lattice with pores less than 2 nm in width10,11. As these pores are likely to be too small for the passage of nucleic acids or proteins, we theorized that the ChmA lattice incorporates additional components that mediate mRNA and protein translocation through the phage nuclear shell. Further, based on our previous observation that capsids dock on the nuclear shell for genome packaging5, this structure must also incorporate components that mediate capsid docking and enable the passage of genomic DNA through the nuclear shell.
Our proximity labeling and localization mapping approach identified six proteins in the jumbo phage ΦPA3 that associate with the phage nuclear shell. We found that one of these proteins (gp2) associated directly with self-assembled ChmA in vitro, suggesting that it is an integral phage nuclear shell protein and prompting us to name it ChmB. ChmA self-assembles into a flexible square lattice with extended NTS and CTS binding to neighboring protomers10,11 (Fig. 5b). NTS-mediated interactions define ChmA homotetramers that measure ~11.5 × 11.5 nm, whereas CTS-mediated interactions mediate interactions between neighboring tetramers. The distinctive U shape and overall dimensions of the ChmB dimer, at ~5 × 8 nm, suggest a model in which multiple ChmB dimers could line a hole created by the removal of four or more neighboring ChmA protomers from the lattice (two possibilities shown in Fig. 5b). Indeed, size-exclusion chromatography of ChmB at high concentration suggests a propensity for higher-order self-assembly that may be reinforced by integration into the ChmA lattice (Extended Data Fig. 7c).
In addition to associating directly with ChmA, ChmB is at the center of a large protein–protein interaction network in phage-infected cells. ChmB interacts directly with the phage portal protein, and the portal protein can also localize on its own to the phage nuclear shell in infected cells. Based on these data, we propose that ChmB mediates capsid docking and genome packaging at the nuclear shell through a direct interaction with the portal protein. ChmB may also interact with other nuclear-shell-associated proteins such as gp61, gp63 and gp64 to mediate specific translocation of mRNA and protein through the shell.
Overexpression of wild-type ChmB in infected cells resulted in phage nuclei that were significantly larger than normal, often showing aberrant morphology and appearing as multiple distinct phage nuclei in a single cell. Conversely, overexpression of mutant ChmB proteins (Q53A or A159D) led to the formation of phage nuclei that were significantly smaller or appeared to be completely devoid of DNA based on DAPI staining. Importantly, these point mutations did not disrupt ChmA binding or ChmB localization to the phage nuclear shell, nor did they disrupt binding to the phage portal protein. The strong phenotypic effects of ChmB point-mutant expression in infected cells therefore suggest that ChmB has additional binding partners with fundamental roles in the formation and maturation of the phage nucleus.
ChmB is conserved among all known nucleus-forming jumbo phages that infect Pseudomonas, but it is not found in distantly related phage such as E. coli phage Goslar15 or Serratia phage PCH45 (ref. 9) (Extended Data Fig. 4). Given its apparent role in phage nuclear shell function, why is ChmB not more widely conserved? The most likely explanation is that it is conserved, but that distant homologs of ChmB are too divergent to be recognized by sequence-based searches. This scenario is supported by the high sequence divergence of gp2 homologs relative to other important proteins such as ChmA: across five representative jumbo phages infecting Pseudomonas (ΦPA3, PA1C, Phabio, 201Φ2-1 and ΦKZ), pairwise sequence identities for ChmA homologs average 51%, whereas ChmB proteins average 27% identity (Extended Data Fig. 4). Alternatively, a different protein might perform a role equivalent to that of ChmB in other nucleus-forming jumbo phage families.
In summary, this work provides new insights into the organizational principles of nucleus-forming jumbo phages and the molecular mechanisms of the phage nucleus in particular. Our results identify a protein interaction network centered around the phage nucleus that will form the basis for future research in this area. We identified a key protein, ChmB, at the center of this network that is likely to have multiple roles both as a pore for macromolecular translocation through the nuclear shell and for capsid docking and genomic DNA packaging. Further work will be required to fully understand the composition of the phage nucleus and the myriad proteins that contribute to its remarkable functions.
Methods
Strains, growth condition and phage preparation
Pseudomonas chlororaphis strain 200-B cells were cultured on solid hard agar (HA). P. aeruginosa strains PA01 and K2733 (efflux pump knockout; ΔMexAB-OprMΔMexCD-OprJΔMexEF-OprNΔMexXY-OprM)32 were cultured in Luria-Bertani (LB) media. For amplification of phages, host strains were cultured in liquid media at 30 °C overnight; then, 20 μl of high-titer phage lysate was mixed with 100 μl of cells with optical density at wavelength 600 nm (OD600) of 0.6 (P. chlororaphis for 201Φ2-1, P. aeruginosa for ΦPA3), incubated for 20 min at room temperature, mixed with 5 ml HA (for 201Φ2-1 in P. chlororaphis) or LB top agar (for phage ΦPA3 in P. aeruginosa) and poured over an HA or LB plate. The plates were then incubated upside-down at 30 °C overnight. The next day, the plates were incubated with 5 ml of phage buffer for 5 h at room temperature. Lysates were collected and centrifuged at 21,000g for 10 min. Supernatants were stored at 4 °C with 0.01% chloroform.
Plasmid construction and bacterial transformation
Genes of interest were PCR-amplified from high-titer phage lysates then ligated into linearized plasmid backbones using an NEBuilder HiFi DNA Assembly Cloning Kit (New England Biolabs, catalog number E5520S). For protein expression in P. chlororaphis and P. aeruginosa, the pHERD30T vector was used33. For overexpression in E. coli, UC Berkeley MacroLab vectors 2-BT (ampicillin resistant, His6-TEV tag; Addgene, catalog number 29666) and 13-S (spectinomycin resistant, no tag; Addgene, catalog number 48323) were used. Recombinant plasmids were transformed into E. coli DH5α and plated on LB agar containing appropriate antibiotics (25 μg ml−1 gentamicin sulfate, 100 μg ml−1 ampicillin or 100 μg ml−1 spectinomycin as appropriate). Constructs were confirmed by DNA sequencing and subsequently introduced into indicated organisms of interest and selected on LB supplemented with antibiotics. Selected overnight cultures were stored in 25% glycerol at −80 °C. See Supplementary Table 6 for all plasmids used in this study and Supplementary Table 7 for sequences of all oligonucleotides and synthesized genes.
Fluorescence microscopy of single-cell-infection assay
Agarose pads (1.2%) were prepared on concavity slides. Each pad was supplemented with 0.05–1.00% arabinose to induce protein expression. In certain experiments, FM4-64 (1 μg ml−1) was added to stain cell membranes, and DAPI (1 μg ml−1) was added to stain the nucleoid. P. chlororaphis and P. aeruginosa strains were inoculated on the pads and grown in a humid chamber at 30 °C for 2 h. For phage infections, 10 μl of phages (108 plaque-forming units (PFU) ml−1) were added on the cells, followed by incubation for an additional 40 min at 30 °C to allow infection to proceed. At the desired time points, the pads were sealed with a coverslip, and fluorescence microscopy was performed with a DeltaVision Spectris Deconvolution Microscope (Applied Precision). Cells were imaged using at least eight images in the z axis from the middle focal plane in 0.15 μm increments. For time-lapse imaging, 8 Z points were selected and subsequently imaged using the UltimateFocus mode. Images were further processed using a deconvolution algorithm (DeltaVision SoftWoRx Image Analysis Program) and analyzed in Fiji34.
Phage nucleus area and DAPI measurements from fixed cells
Agarose pads (1.2%) were prepared on concavity slides containing 1% arabinose plus 1 μg ml−1 FM4-64. P. aeruginosa strains were inoculated on pads and grown for 2 h in a humid chamber at 30 °C. Seventy-five minutes postinfection with 10 μl of phage (108 PFU ml−1), 20 μl of fixation mixture (1.5% glutaraldehyde, 13.76% paraformaldehyde and 0.16 M NaPO4 at pH = 7.4) was added to each pad, followed by incubation at room temperature for 20 min. For high levels of nucleoid staining for quantifications, 20 μl of 20 μg ml−1 DAPI was added to the fixed cells, followed by incubation for 20 min at room temperature. Pads were sealed with coverslips, and fluorescence microscopy was performed as above. For quantitation of DAPI area and intensity in nucleoids or phage nuclei, images before deconvolution were analyzed using Fiji with built-in measurement tools. Statistical analysis was performed in GraphPad Prism.
Proximity labeling with miniTurboID
For proximity labeling, overnight cultures of P. aeruginosa were grown in LB media with 25 μg ml−1 gentamicin sulfate. Cultures were then diluted to OD600 = 0.1 and supplemented with 500 μM biotin. When cells reached OD600 = 0.5, they were diluted 1:10 in 50 ml total volume in 250 ml flasks and grown in LB supplemented with 0.1% arabinose, 500 μM biotin, 25 μg ml−1 gentamicin sulfate and 0.2 mM CaCl2. When the cells reached OD600 = 0.3, they were infected with phage ΦPA3 at a multiplicity of infection of 3. At 45 min postinfection, cultures were collected and centrifuged at 3,000g at 4 °C. Cell pellets were stored at −80 °C for mass spectrometry.
Mass spectrometry
To prepare biotinylated samples for immunoprecipitation and mass spectrometry, frozen cell pellets (100 μl) were thawed and resuspended in 100 μl water. Ten microliters of resuspended cells were mixed with 200 μl of 6 M guanidine-HCl, vortexed and then subjected to three cycles of incubation at 100 °C for 5 min, followed by cooling to room temperature. Then, 1.8 ml of pure methanol was added to the boiled cell lysate, followed by vortexing, incubation at −20 °C for 20 min, and centrifugation at 21,000g for 10 min at 4 °C. The tube was inverted and dried to remove any liquid, and the pellet was resuspended in 200 μl of 8 M urea in 0.2 M ammonium bicarbonate. The mixture was incubated for 1 h at 37 °C with constant agitation. Following the incubation, 4 μl of 500 mM Tris(2-carboxyethyl) phosphine and 20 μl of 400 mM chloro-acetamide were added. Protein concentration was measured by BCA assay from a 10 μl sample; then, 600 μl of 200 mM ammonium bicarbonate was added to bring the urea concentration to 2 M. One microgram of sequencing-grade trypsin was added for each 100 μg of protein in the sample, followed by incubation overnight at 42 °C. Following trypsin incubation, 50 μl of 50% formic acid was added (ensuring that the pH dropped to 2 using pH test strips), and then samples were desalted using C18 solid phase extraction (Waters Sep-Pak C18 12 cc Vac Cartridge, catalog number WAT036915) as described by the manufacturer protocol. The peptide concentration of each sample was measured using BCA after resuspension in 1 ml phosphate-buffered saline (PBS) buffer.
For biotin immunoprecipitation, 200 μl of 50% slurry of NeutrAvidin beads (Pierce) was washed three times with PBS; then, 1 mg of resuspended peptide solution in PBS was added, followed by incubation for 1 h at room temperature. Beads were washed three times with 2 ml PBS plus 2.5% acetonitrile (ACN) and once in ultrapure water, and excess liquid was carefully removed with a micropipette. Biotinylated peptides were eluted twice with 300 μl of elution buffer (0.2% trifluoroacetic acid, 0.1% formic acid and 80% ACN in water), with the second elution involving two 5 min incubations at 100 °C. Samples were then dried completely before being subjected to mass spectrometry.
Liquid chromatography coupled with tandem mass spectrometry
Trypsin-digested peptides were analyzed by ultra-high-pressure liquid chromatography coupled with tandem mass spectroscopy using nanospray ionization. The nanospray ionization experiments were performed using a Orbitrap Fusion Lumos hybrid mass spectrometer (Thermo) interfaced with nanoscale reverse-phase ultra-high-pressure liquid chromatography (Thermo Dionex UltiMate 3000 RSLC Nano System) using a 25 cm, 75 µm ID glass capillary packed with 1.7 µm C18 (130) BEH beads (Waters Corporation). Peptides were eluted from the C18 column into the mass spectrometer using a linear gradient (5–80%) of ACN at a flow rate of 375 μl min−1 for 3 h. The buffers used to create the ACN gradient were as follows: buffer A (98% H2O, 2% ACN, 0.1% formic acid); and buffer B (100% ACN, 0.1% formic acid). The mass spectrometer parameters were as follows: an MS1 survey scan using the Orbitrap detector (mass range (m/z): 400–1,500 (using quadrupole isolation), 120,000 resolution setting, spray voltage of 2200 V, ion transfer tube temperature of 275 °C, AGC target of 400,000 and maximum injection time of 50 ms) was followed by data-dependent scans (top speed for the most intense ions, with charge state set to only include +2–5 ions and a 5 s exclusion time, selecting ions with minimal intensities of 50,000) in which the collision event was carried out in the high-energy collision cell (higher-energy collisional dissociation collision energy of 30%), and the fragment masses were analyzed in the ion trap mass analyzer (with an ion trap scan rate of turbo, the first mass m/z was 100, the AGC target was 5,000 and the maximum injection time was 35 ms). Protein identification and label-free quantification were carried out using Peaks Studio 8.5 (Bioinformatics Solutions Inc.). Variable modification at lysine residues of +226.08 atomic mass units was used in the peptide sequencing parameters.
Mass spectrometry analysis
For each sample, biotinylated peptides identified by mass spectrometry were divided into host (P. aeruginosa) and phage (ΦPA3) peptides. Host peptides that were identified in all samples were used to normalize phage peptide signals across the dataset. Biotinylated phage protein peak areas were calculated by summing the peak areas of each peptide assigned to a given protein. The fold changes for proteins from ChmA (gp53)-miniTurboID and RecA (gp175)-miniTurboID were calculated by comparison with protein peak areas from GFP-miniTurboID samples. The proteins were sorted according to their average normalized fold changes.
The genome sequence of ΦPA3 (NCBI RefSeq NC_028999.1) is misannotated between the coding regions for gp64 (NCBI accession YP_009217147.1, nucleotides 51942–53267) and gp68 (NCBI accession YP_009217148.1, nucleotides 58478–60010). We manually annotated this region to identify gp65–66 (which together code for a single protein, separated by an intron spanning nucleotides 55005–55454)19 and gp67 for identification by mass spectrometry:
>gp65–66 (NC_028999.1 nucleotides 53811-55004 and 55455-56447)
MYEEHNLRRAVREIHAKLLGHAALDPYYGTTSAARGAMFLSHIGQAPVVEGNEPRRVMTGMEMRYAEYTFDVRLPTDCTILHKVRKYPTGQGYGAIQHNPVTTLIYENYYDEYKTIGVLHVPEYMSFHQDFGYELVKNKEVWESLQPDQMFAKDTVIAQSSTVKSNGLYGMGVNANVAFMSVPGTIEDGFVVSDEFLERMSPRTYTTAVCGAGKKAFFLNMYGDDKIYKPFPDIGEKIREDGVIFAVRDLDDDLAPAEMTPRALRTLDRTFDRAVIGDPGATVKDIKVYWDERQNPSFTPSGMDGQLRKYYDALCTYYREIIKIYRGLLARRKDKLRISEEFNQLLVEAMIYLPQAEGQRKLTRMYRLEQLDEWRVELTYESIKVPGGAYKLTDFHGGKGVVCEVRPKADMPVDEFGNVVDAIIFGGSTMRRSNYGRIYEHGFGAASRDLAQRLRVEAGLPRHGVVPEQDLNRVCSNREWVTYAFAELQEFYYIIAPTMHEILREHPSPAEYVKTVLRDGFSYIYSPVDDPVDLMSSLNCIMNSRFCPNHTRVTYRGQDGKMVTTKDKVLVGPLYMMLLEKIGEDWSAVASVKVQQFGLPSKLNNSDRSSTPGRESAIRSFGESETRSYNCTVGPEATVELLDQTNNPRAHLAVINSILTADKPSNIERAVDRTKVPFGSSRPVDLLEHLLECRGLKFEYATTDGVQPVHTAVPIRAQQKVKSEAIEE*
>gp67 (NC_028999.1 nucleotides 56450-58420)
MNQYNARDLLNMSYDDLFAIPNEWHKIIFDDGEILTKDRATKLSILLWHPLKQFPNATLSVKYHLGDTRVTSKSLVKLLNSVIWGIHAWSNEQVDPEVLARLAIEAKNVLYNEATSRLGAYVATLSMFEIAEVYNHPKVREANQNIEPTTHGIETIAYGKIKEAFNDPTQFRGNSIIEGLRSGTQKMEQLLQAFGPRGFPTDINSDIFAEPCLTGYIDGIWGLYENMIESRSGTKALLYNKELLRVTEYFNRKSQLIAQYVQRLHKGDCGAGYIEFPVIKAYLKSLRGKFYLNEETGKREILQGNETHLIGKKIKMRSVLGCVHPDPQGICATCYGTLADNIPRGTNIGQVSAVSMGDKITSSVLSTKHTDATSAVEQYKITGVEAKYLREGQAPETLYLKKELANKGYRLMIGRNEAQNLADVLMIDNLSAYPPTSASELTRIGLVRTVDGIDEGDVLTVSLYNRKASLSIELLQHVKRVRWELDNRDNIVIDLNGFDFSLPFLTLPYKHVNMYEVMKRIQSFLHSGSDTEGSKLSSDKVGFTSKTYLKNYNDPIDAVAAFASLVNEKIQLPMPHCEVLVYAMMVRSTQQRDYRLPKPGISGQFEKYNKLMQSRSLAGAMAFEKQHEPLNNPGSFLYTLRNDHPYDLAVKGGKLY*.
GFP pulldowns
For GFP pulldowns, GFP-Trap Magnetic Agarose beads (Chromotek gtma-20) were used. Overnight cultures of P. aeruginosa expressing GFP-tagged proteins of interest were grown in LB plus 25 μg ml−1 gentamicin sulfate. Cells were diluted to an OD600 of 0.1 then grown further to an OD600 of 0.5. Cultures were diluted 1:10 into 50 ml total volume of culture in 250 ml flasks and grown in LB supplemented with 0.1% arabinose, 25 μg ml−1 gentamicin sulfate, and 0.2 mM CaCl2. When the cells reached OD600 = 0.3, they were infected with ΦPA3 at a multiplicity of infection of 3. Cultures were collected at 45 min postinfection and centrifuged at 3,000g at 4 °C. Cell pellets were stored at −80 °C.
For the GFP pulldown, thawed cell pellets were incubated for 1 h with 500 μl lysis buffer (10% glycerol, 25 mM Tris (pH 7.5), 150 mM NaCl, 4 mg ml−1 lysozyme, 20 μg ml−1 DNase I, 2× cOmplete Protease Inhibitor, 0.4 mM phenylmethylsulfonyl fluoride). Cell suspensions were sonicated for 10 rounds with 20 pulses per round (Duty Cycle 40, Output 4). Lysed cells were centrifuged for 30 min at 21,000g at 4 °C. For each sample, 25 μl of bead slurry (prewashed into dilution buffer: 10 mM Tris-HCl pH 7.5, 150 mM NaCl, 0.5 mM EDTA) was used. Then, 500 μl of cell lysate was added to the beads, followed by rotation end-to-end for 1 h at 4 °C. Beads were washed five times with wash buffer (10 mM Tris/Cl pH 7.5, 150 mM NaCl, 0.05% NP-40 substitute, 0.5 mM EDTA). For SDS–PAGE, cells were resuspended with 2× SDS buffer (120 mM Tris-HCl pH 6.8, 20% glycerol, 4% SDS, 0.04% bromophenol blue, 10% β-mercaptoethanol) and boiled at 100 °C for 5 min. Samples (10 μl) of each elution were run on two separate SDS–PAGE gels and visualized by either silver staining or Coomassie blue staining. For tryptic mass spectrometry of gel bands, bands were cut out of Coomassie blue-stained gels. The remaining 80% of each elution was used for mass spectrometry identification of proteins as described above.
Protein purification
Full-length phage gp2 protein sequences were codon-optimized and synthesized (Invitrogen/GeneArt), then cloned into UC Berkeley Macrolab vector 2-BT (Addgene, catalog number 29666) to generate constructs with N-terminal TEV protease-cleavable His6-tags. Proteins were expressed in E. coli strain Rosetta 2 (DE3) pLysS (EMD Millipore). Cultures were grown at 37 °C to OD600 = 0.7 then induced with 0.25 mM IPTG and shifted to 20 °C. After a 16 h incubation, cells were harvested by centrifugation and resuspended in a buffer containing 25 mM Tris pH 7.5, 10% glycerol, 300 mM NaCl, 5 mM imidazole, 5 mM β-mercaptoethanol, and 1 mM NaN3. Proteins were purified by Ni2+-affinity (Ni-NTA agarose, Qiagen) then passed over an anion-exchange column (HiTrap Q HP, Cytiva) in a buffer containing 100 mM NaCl, collecting flow-through fractions. Tags were cleaved with TEV protease35, the mixture was passed over another Ni2+ column, and the flow-through fractions containing protease-cleaved proteins of interest were collected and concentrated. The protein was passed over a size-exclusion column (Superdex 200, Cytiva) in buffer GF (buffer A plus 300 mM NaCl and 1 mM dithiothreitol), then concentrated by ultrafiltration (Amicon Ultra, EMD Millipore) to 10 mg ml−1 and stored at 4 °C.
For characterization of oligomeric state by size-exclusion chromatography coupled to multi-angle light scattering (SEC–MALS), 100 µl of purified proteins at 5 mg ml−1 were injected onto a Superdex 200 Increase 10/300 GL column (Cytiva) in buffer GF. Light scattering and refractive index profiles were collected using miniDAWN TREOS and Optilab T-rEX detectors (Wyatt Technology), respectively, and molecular weight was calculated using ASTRA v.8 software (Wyatt Technology).
Crystallization and structure determination
Purified PA1C gp2 in a buffer containing 20 mM Tris pH 8.5, 1 mM dithiothreitol, and 100 mM NaCl (14 mg ml−1) was mixed 1:1 with well solution containing 0.1 M Tris pH 8.5, and 1.5 M lithium sulfate in hanging drop format. Crystals were cryoprotected by the addition of 24% glycerol and flash-frozen in liquid nitrogen. Diffraction data were collected at the Advanced Photon Source NE-CAT beamline 24ID-C on 11 December 2021, with X-ray wavelength 0.97911 Å and at 100 K. Data were processed with the RAPD data-processing pipeline (https://github.com/RAPD/RAPD), which uses XDS36 for data indexing and reduction, AIMLESS37 for scaling, and TRUNCATE38 for conversion to structure factors. We determined the structure by molecular replacement in PHASER39, using a predicted structure from AlphaFold2 (ref. 40) as a search model. We manually rebuilt the initial model in COOT41 and refined it in phenix.refine42 using positional and individual B factor refinement (Table 3). The final model had good geometry, with 98.75% of residues in favored Ramachandran space, 1.25% allowed and 0% outliers. The overall MolProbity score was 0.92, and the MolProbity clash score was 1.7.
Statistics and reproducibility
All light microscopy experiments were performed once, and each set of conditions that were directly compared (for example, phage-infected versus uninfected) were performed in the same experiment. For figure panels where single cells are shown (Figs. 1a,c–e, 2a,b and 4a and Extended Data Figs. 1b, 2a,b, 3 and 8a,b), these cells are representative of the cell population across at least three full fields of cells recorded in the microscope (in all cases, at least 20 cells were individually inspected). For quantitation of cellular characteristics (Fig. 4b–e and Extended Data Fig. 8c–e), at least 100 cells were examined for each sample. Sample sizes for quantitative characteristics were chosen to ensure at least 95% confidence in observing effect sizes over 10%.
GFP-Trap purifications and mass spectrometry analysis (Fig. 2c and Extended Data Fig. 5a) were performed once per sample. Protein coexpression assays were performed at least three times with consistent results (Fig. 2e,f and Extended Data Figs. 5b and 7d,e).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Mass spectrometry data are available at the PRIDE database (www.ebi.ac.uk/pride) under accession ID PXD041684. Final refined coordinates and reduced diffraction data for the structure of PA1C gp2 are available at the RCSB Protein Data Bank (www.rcsb.org) under accession ID 7UYX. Raw diffraction data for the structure of PA1C gp2 are available at the SBGrid Data Bank (data.sbgrid.org) under accession ID 908. Publicly accessible data used in this study include the genome sequence of phage PhiPA3 (NCBI RefSeq NC_028999.1) and annotated protein sequences for phages PhiPA3 (NCBI RefSeq YP_009217083.1 to YP_009217457.1), 201Phi2-1 gp2 (YP_001956728.1) and gp105 (YP_001956829.1), Psa21 gp2 (YP_010347551.1) and phage PA1C gp3 (QBX32150.1). Source data are provided with this paper.
References
Salmond, G. P. C. & Fineran, P. C. A century of the phage: past, present and future. Nat. Rev. Microbiol. 13, 777–786 (2015).
Serwer, P., Hayes, S. J., Thomas, J. A. & Hardies, S. C. Propagating the missing bacteriophages: a large bacteriophage in a new class. Virol. J. 4, 21 (2007).
Al-Shayeb, B. et al. Clades of huge phages from across Earth’s ecosystems. Nature 578, 425–431 (2020).
M Iyer, L., Anantharaman, V., Krishnan, A., Burroughs, A. M. & Aravind, L. Jumbo phages: a comparative genomic overview of core functions and adaptions for biological conflicts. Viruses 13, 63 (2021).
Chaikeeratisak, V., Birkholz, E. A. & Pogliano, J. The phage nucleus and phuz spindle: defining features of the subcellular organization and speciation of nucleus-forming jumbo phages. Front. Microbiol. 12, 641317 (2021).
Chaikeeratisak, V. et al. Assembly of a nucleus-like structure during viral replication in bacteria. Science 355, 194–197 (2017).
Chaikeeratisak, V. et al. The phage nucleus and tubulin spindle are conserved among large Pseudomonas phages. Cell Rep. 20, 1563–1571 (2017).
Mendoza, S. D. et al. A bacteriophage nucleus-like compartment shields DNA from CRISPR nucleases. Nature 577, 244–248 (2020).
Malone, L. M. et al. A jumbo phage that forms a nucleus-like structure evades CRISPR–Cas DNA targeting but is vulnerable to type III RNA-based immunity. Nat. Microbiol. 5, 48–55 (2020).
Laughlin, T. G. et al. Architecture and self-assembly of the jumbo bacteriophage nuclear shell. Nature 608, 429–435 (2022).
Nieweglowska, E. S. et al. The ϕPA3 phage nucleus is enclosed by a self-assembling 2D crystalline lattice. Nat. Commun. 14, 927 (2023).
Nguyen, K. T. et al. Selective transport of fluorescent proteins into the phage nucleus. PLoS ONE 16, e0251429 (2021).
Chaikeeratisak, V. et al. Viral capsid trafficking along treadmilling tubulin filaments in bacteria. Cell 177, 1771–1780.e12 (2019).
Chaikeeratisak, V. et al. Subcellular organization of viral particles during maturation of nucleus-forming jumbo phage. Sci. Adv. 8, eabj9670 (2022).
Birkholz, E. A. et al. A cytoskeletal vortex drives phage nucleus rotation during jumbo phage replication in E. coli. Cell Rep. 40, 111179 (2022).
Branon, T. C. et al. Efficient proximity labeling in living cells and organisms with TurboID. Nat. Biotechnol. 36, 880–887 (2018).
Monson, R., Foulds, I., Foweraker, J., Welch, M. & Salmond, G. P. C. The Pseudomonas aeruginosa generalized transducing phage phiPA3 is a new member of the phiKZ-like group of ‘jumbo’ phages, and infects model laboratory strains and clinical isolates from cystic fibrosis patients. Microbiology 157, 859–867 (2011).
Thomas, J. A. et al. Extensive proteolysis of head and inner body proteins by a morphogenetic protease in the giant Pseudomonas aeruginosa phage φKZ. Mol. Microbiol. 84, 324–339 (2012).
deYMartín Garrido, N. et al. Structure of the bacteriophage PhiKZ non-virion RNA polymerase. Nucleic Acids Res. 49, 7732–7739 (2021).
Thomas, J. A. et al. Characterization of Pseudomonas chlororaphis myovirus 201φphi2-1 via genomic sequencing, mass spectrometry, and electron microscopy. Virology 376, 330–338 (2008).
Thomas, J. A., Weintraub, S. T., Hakala, K., Serwer, P. & Hardies, S. C. Proteome of the large Pseudomonas myovirus 201φ2-1: delineation of proteolytically processed virion proteins. Mol. Cell. Proteomics 9, 940–951 (2010).
Reilly, E. R. et al. A cut above the rest: characterization of the assembly of a large viral Icosahedral capsid. Viruses 12, eabj9670 (2020).
Sun, L. et al. Cryo-EM structure of the bacteriophage T4 portal protein assembly at near-atomic resolution. Nat. Commun. 6, 7548 (2015).
Mitchell, M. S. & Rao, V. B. Functional analysis of the bacteriophage T4 DNA-packaging ATPase motor. J. Biol. Chem. 281, 518–527 (2006).
Morita, M., Tasaka, M. & Fujisawa, H. Analysis of the fine structure of the prohead binding domain of the packaging protein of bacteriophage T3 using a hexapeptide, an analog of a prohead binding site. Virology 211, 516–524 (1995).
Valpuesta, J. M. & Carrascosa, J. L. Structure of viral connectors and their function in bacteriophage assembly and DNA packaging. Q. Rev. Biophys. 27, 107–155 (1994).
Yeo, A. & Feiss, M. Specific interaction of terminase, the DNA packaging enzyme of bacteriophage γ, with the portal protein of the prohead. J. Mol. Biol. 245, 141–150 (1995).
Heymann, J. B. et al. The mottled capsid of the Salmonella giant phage SPN3US, a likely maturation intermediate with a novel internal shell. Viruses 12, 725 (2020).
Holm, L. Using Dali for protein structure comparison. Methods Mol. Biol. 2112, 29–42 (2020).
van Kempen, M. et al. Fast and accurate protein structure search with Foldseek. Nat. Biotechnol. https://doi.org/10.1038/s41587-023-01773-0 (2023).
Kraemer, J. A. et al. A phage tubulin assembles dynamic filaments by an atypical mechanism to center viral DNA within the host cell. Cell 149, 1488–1499 (2012).
Balemans, W. et al. Novel antibiotics targeting respiratory ATP synthesis in Gram-positive pathogenic bacteria. Antimicrob. Agents Chemother. 56, 4131–4139 (2012).
Qiu, D., Damron, F. H., Mima, T., Schweizer, H. P. & Yu, H. D. PBAD-based shuttle vectors for functional analysis of toxic and highly regulated genes in Pseudomonas and Burkholderia spp. and other bacteria. Appl. Environ. Microbiol. 74, 7422–7426 (2008).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Tropea, J. E., Cherry, S. & Waugh, D. S. in High Throughput Protein Expression and Purification: Methods and Protocols (ed. Doyle, S. A.) 297–307 (Humana Press, 2009).
Kabsch, W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallogr. D Biol. Crystallogr. 66, 133–144 (2010).
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr. D Biol. Crystallogr. 69, 1204–1214 (2013).
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 (2011).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 (2010).
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68, 352–367 (2012).
Acknowledgements
We thank members of the Pogliano and Corbett laboratories for helpful discussions and for critical reading of the manuscript. We acknowledge support from the National Institutes of Health (R01 GM129245 to J.P. and E.V.; R35 GM144121 to K.D.C.; and NIH shared instrumentation grant S10 OD021724) and the Howard Hughes Medical Institute Emerging Pathogens Initiative (to E.V., J.P. and K.D.C.). E.V. is a Howard Hughes Medical Institute Investigator. This work is based on research conducted at the Northeastern Collaborative Access Team beamlines, which are funded by the National Institute of General Medical Sciences from the National Institutes of Health (P30 GM124165). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under contract number DE-AC02-06CH11357.
Author information
Authors and Affiliations
Contributions
E.E. conceived the study and performed all cloning, microscopy, protein purification and preparation of samples for mass spectrometry, as well as data analysis and figure and manuscript preparation. A.D. performed X-ray crystallography data collection and analysis. Y.G. performed SEC–MALS analysis. K.T.N. and V.C. cloned and characterized GFP fusions of phage proteins. E.A. performed data analysis and provided input on manuscript preparation. M.G. performed mass spectrometry experiments and initial mass spectrometry data analysis. E.V. provided input on experimental design and interpretation. J.P. conceived the study, guided experimental design and data interpretation, provided funding, prepared figures and wrote the manuscript. K.D.C. guided experimental design and data interpretation, provided funding, prepared figures and wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Structural & Molecular Biology thanks Konstantin Severinov and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available. Primary Handling Editor: Sara Osman and Carolina Perdigoto, in collaboration with the Nature Structural & Molecular Biology team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 miniTurboID proximity labeling of phage nucleus-associated proteins.
(a) Construct design for localization and proximity labeling of ΦPA3 RecA (gp175) and ChmA (gp53) associated proteins. (b) Localization of GFP control and GFP- and GFP-miniTurboID tagged RecA (gp175) and ChmA (gp53) in ΦPA3-infected P. aeruginosa cells. GFP is shown in green, FM4-64 (to visualize membranes) in red, and DAPI (to visualize nucleic acids) in blue. Scale bar = 2 μm. (c, d) Venn diagrams showing RecA (panel c) and ChmA (panel d) interacting proteins identified by miniTurboID labeling in three independent runs. See Tables 1–2 and Supplementary Tables 1-2.
Extended Data Fig. 2 Localization analysis of RecA- and ChmA-interacting ΦPA3 proteins.
Subcellular localization of selected proteins identified by proximity labeling, with panel (a) showing proteins that localize as cytoplasmic puncta, and (b) showing proteins with diffuse cytoplasmic localization. See Table 3 for a collated list of localizations. GFP is shown in green, FM4-64 (to visualize membranes) in red, and DAPI (to visualize nucleic acids) in blue. Scale bar = 2 μm.
Extended Data Fig. 3 Early localization of gp148 to the phage nuclear shell.
Subcellular localization of GFP-gp148 in P. aeruginosa cells infected with phage ΦPA3, at the indicated times post infection (MPI: minutes post infection). Yellow arrows indicate the position of the phage nucleus. GFP is shown in green, FM4-64 (to visualize membranes) in red, and DAPI (to visualize nucleic acids) in blue. Scale bar = 2 μm.
Extended Data Fig. 4 Sequence analysis of nuclear-localized jumbo phage proteins.
(a) Sequence identity (expressed as percent identity) between selected jumbo phage ChmA proteins: ΦPA3 (NCBI Accession # YP_009217136.1), PA1C (QBX32206.1), Phabio (YP_010348051.1), 201Φ2-1 (YP_001956829.1), ΦKZ (NP_803620.1), Goslar (YP_009820873.1), PCH45 (QFP93061.1), SPN3US (YP_009153316.1). For all pairs of sequences, the ‘Pairwise Alignment’ tool in JalView was used to calculate % identity over the homologous region of each protein. For those identities reported as ‘<15%’, this tool was unable to generate an alignment over a significant portion of the sequences. (b) Sequence identity between selected jumbo phage ChmB proteins: ΦPA3 (NCBI Accession # YP_009217084.1), PA1C (QBX32150.1), Phabio (YP_010347970.1), 201Φ2-1 (YP_001956728.1), ΦKZ (NP_803568.1). Homologs were not identified in Goslar, PCH45, or SPN3US. (c) Sequence identity between selected jumbo phage homologs of ΦPA3 gp61: ΦPA3 (NCBI Accession #YP_009217144.1), PA1C (QBX32215.1), Phabio (YP_010348072.1), 201Φ2-1 (YP_001956847.1), ΦKZ (NP_803633.1), Goslar (YP_009820861.1), PCH45 (QFP93121.1), SPN3US (YP_009153323.1). (d) Sequence identity between selected jumbo phage homologs of ΦPA3 gp63: ΦPA3 (NCBI Accession # YP_009217146.1), PA1C (QBX32217.1), Phabio (YP_010348074.1), 201Φ2-1 (YP_001956849.1), ΦKZ (NP_803635.1), Goslar (YP_009820859.1), PCH45 (QFP93075.1), SPN3US (YP_009153325.1). (e) Sequence identity between selected jumbo phage homologs of ΦPA3 gp64: ΦPA3 (NCBI Accession # YP_009217147.1), PA1C (QBX32218.1), Phabio (YP_010348075.1), 201Φ2-1 (YP_001956850.1), ΦKZ (NP_803636.1), Goslar (YP_009820858.1), PCH45 (QFP93079.1), SPN3US (YP_009153326.1). (f) Sequence identity between selected jumbo phage homologs of ΦPA3 gp375: ΦPA3 (NCBI Accession # YP_009217454.1, 233 residues), PA1C (QBX32544.1, 250 residues), Phabio (YP_010348429.1, 736 residues), 201Φ2-1 (YP_001957174.1, 768 residues), ΦKZ (NP_803869.1, 646 residues). Homologs were not identified in Goslar, PCH45, or SPN3US. Asterisks indicate that these identities represent the homologous regions of dramatically different-length proteins.
Extended Data Fig. 5 Protein-protein interaction analysis.
(a) Silver stained SDS-PAGE gels showing GFP pulldown results from GFP-tagged proteins expressed in ΦPA3-infected P. aeruginosa (45 minutes post infection). White arrowheads indicate the bait protein for each sample. (b) Coomassie blue-stained SDS-PAGE gel showing Ni2+ pulldown results from coexpression of 201Φ2-1 gp2 (His6-tagged) and ChmA (gp105; untagged). ChmA appears as a doublet because of a second start codon at codon 33 of the gp105 gene.
Extended Data Fig. 6 Biochemical and sequence analysis of jumbo phage gp2 proteins.
(a) Size exclusion chromatography coupled to multi-angle light scattering (SEC-MALS) analysis of ΦPA3 gp2. Measured molecular weight = 39.6 kDa; dimer molecular weight = 45 kDa. (b) SEC-MALS analysis of 201Φ201 gp2. Measured molecular weight = 44.9 kDa; dimer molecular weight = 39.8 kDa. (c) SEC-MALS analysis of the gp2 homolog in phage Psa21 (gp3). Measured molecular weight = 51.6 kDa; dimer molecular weight = 50.6 kDa. (d) Sequence alignment of gp2 homologs in jumbo phage infecting Pseudomonas, showing the position of the highly conserved Q53 and A159 (ΦPA3 numbering) residues.
Extended Data Fig. 7 Effects of gp2 Q53A and A159D mutations.
(a) Growth curves of P. aeruginosa cells transformed with pHERD30T empty vector (black) or encoding ΦPA3 gp2 wild-type (violet), Q53A (cyan), or A159D (green). Growth media contained 0.1% arabinose for low-level expression. Datapoints represent the average of three independent replicates, and error bars (black) represent standard deviation. (b) Growth curves of P. aeruginosa cells transformed as in panel (a), with growth media containing 1% arabinose for high-level expression. Datapoints represent the average of three independent replicates, and error bars (black) represent standard deviation. (c) Size exclusion chromatography elution profiles for ΦPA3 ΦPA3 gp2 wild-type (black solid line), Q53A (cyan dotted line), or A159D (green dotted line). (d) Ni2+ pulldown analysis of E. coli-coexpressed His6-tagged 201Φ2-1 gp2 (wild-type (WT), Q53A, or A145D (equivalent to ΦPA3 gp2 A159D)) and full-length ChmA. Doublet bands for ChmA arise from a methionine codon at position 33 of the annotated gene. (e) Ni2+ pulldown analysis of E. coli-coexpressed His6-tagged ΦPA3 gp2 (wild-type (WT), Q53A, or A159D) and portal (gp148). The ~40 kDa band in pulldown in the Ni2+ purification lane for WT gp2 is an unknown contaminant.
Extended Data Fig. 8 Overexpression of mutant gp2 proteins affect nuclear shell formation and morphology.
(a) Microscopy of ΦPA3-infected P. aeruginosa cells transformed with empty vector. Yellow arrow indicates the position of the phage nucleus. The two DAPI staining bodies bracketing the phage nucleus are the phage bouquets. GFP is shown in green, FM4-64 (to visualize membranes) in red, and DAPI (to visualize nucleic acids) in blue. Scale bar = 2 μm. (b) Microscopy of ΦPA3-infected P. aeruginosa cells expressing GFP. Scale bar = 2 μm. (c) Microscopy of ΦPA3-infected P. aeruginosa cells expressing GFP-tagged wild-type gp2. Common phenotypes and their percentages in the analyzed sample (n = 100 cells) are shown. Scale bar = 2 μm. (d) Microscopy of ΦPA3-infected P. aeruginosa cells expressing GFP-tagged gp2 A159D. Common phenotypes and their percentages in the analyzed sample (n = 100 cells) are shown. Scale bar = 2 μm. (e) Microscopy of ΦPA3-infected P. aeruginosa cells expressing GFP-tagged gp2 Q53A. Common phenotypes and their percentages in the analyzed sample (n = 100 cells) are shown. Scale bar = 2 μm.
Supplementary information
Supplementary Tables
Supplementary Tables 1–7 (single workbook with multiple tabs). Supplementary Data Table 1. Phage proteins enriched more than three-fold in PhiPA3 ChmA (gp53) miniTurboID. Supplementary Data Table 2. Phage proteins enriched more than three-fold in PhiPA3 RecA (gp175) miniTurboID. Supplementary Data Table 3. Localization of miniTurboID hits in infected cells. Supplementary Table 4. Mass spectrometry of ChmB-GFP 70 kDa bands. Supplementary Data Table 5. Mass spectrometry of GFP fusion pulldowns. Supplementary Data Table 6. Plasmids used in this study. Supplementary Data Table 7. Oligonucleotides used in this study.
Source data
Source Data Figs. 3 and 4 and Extended Data Figs. 6 and 7
Statistical source data (one .xlsx file with tabs for each noted figure panel).
Source Data Fig. 2 and Extended Data Figs. 5 and 7
Unprocessed gel scans with each figure panel noted.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Enustun, E., Deep, A., Gu, Y. et al. Identification of the bacteriophage nucleus protein interaction network. Nat Struct Mol Biol 30, 1653–1662 (2023). https://doi.org/10.1038/s41594-023-01094-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41594-023-01094-5
- Springer Nature America, Inc.