Biological background

Bacterial conjugation is a major mechanism driving rapid dissemination of genetic information among the bacterial community, leading to the spread of antibiotic resistance and virulence factors. This transfer is important for bacteria to adapt and evolve in variable environments. Transfer of DNA also takes place between the bacterial and eukaryotic communities (Christie et al. 2017; Ilangovan et al. 2015).The most studied conjugation system in Gram-negative bacteria is the F family of plasmids, which includes F, R1 and pED 208. The protein investigated in this work, TraI, is encoded by the R1 plasmid. Gram-negative bacteria utilize type IV secretion systems (T4SS) for a successful transfer of the conjugative plasmid from the donor to the recipient cell. In the donor, conjugation is initiated by three major components known as the relaxosome, transferosome and coupling proteins. The relaxosome is a multi-protein complex, which prepares the conjugative plasmid in the cytoplasm by nicking and unwinding it. The transferosome consists of the proteins, which are involved in the T4SS pilus formation and retraction, to make a close physical contact between donor and recipient cells. Finally, the coupling proteins ‘couple’ components of the relaxosome to the transferosome for a successful substrate transfer (Lawley et al. 2003).

TraI of plasmid R1 is a multi-functional enzyme of the relaxosome that is essential for the conjugative transfer of F like plasmids (Wong et al. 2012; Lang et al. 2010, 2011; Haft et al. 2006). To specifically initiate the process of DNA transfer, R1 TraI first nicks a unique phosphodiester bond at nic site within the plasmid origin of transfer (oriT), binds covalently to the 5′ end of the nicked plasmid stand through a tyrosine residue and finally unwinds the nicked strand of plasmid DNA with the help of its functional helicase domain (Matson and Ragonese 2005). Recently a cryo-EM structure of the three functional domains of R1 TraI: the DNA trans-esterase, vestigial helicase and an active helicase, was published (Fig. 1). These authors found that R1 TraI dimerizes in the presence of DNA including the oriT sequence resulting in a well-coordinated function of trans-esterase and relaxase domains for the substrate translocation (Ilangovan et al. 2017).

Fig. 1
figure 1

Structural motifs of TraI highlighting the far C-terminal domain (R1 TraIC126)

Apart from these three-functional domains there is a fourth poorly characterized C-terminal domain. This domain consists of 295 residues (1461 to 1756) and is mostly flexible. It has been hypothesized to function as a platform for interaction with other relaxosomal proteins. Some studies also emphasize that the far C-terminal domain interacts with ds DNA (Matson and Ragonese 2005; Cheng et al. 2011; Ragonese et al. 2007).

There is no involvement of R1 TraI’s C-terminal domain on the regulation of its own nuclease and relaxase activity. A crystal structure of a 169 residue fold in the C-terminal domain (residues 1461 to 1630) has been reported however, structural and functional details of the far most C-terminal 126 residues (R1 TraIC126, residues 1630 to 1756, 14.3 kDa) are lacking. Due to the fact that this region could never be crystallized or observed in Cryo-EM, it was thought to be highly flexible and disordered (Ilangovan et al. 2017). An analysis of R1 TraIC126 (1630 to 1756) by the program COILS, predicted three coiled-coil domains (Guogas et al. 2009). R1 TraIC126 does not have an orthologue nor does it share sequence homology, thus this novel region appears to play a crucial role exclusively for R1 TraI. In this study, we present the 1H, 13C and 15N NMR resonance assignments and the secondary structure information obtained by Talos-N (Shen and Bax 2013) based on the backbone chemical shift values.

Recombinant protein cloning, expression and purification

R1 TraIC126 was cloned into pET Z2 vector (EMBL Heidelberg) (Bogomolovas et al. 2009) by classical cloning methods where full length R1 TraI was used as a template. The two primers: R1 TraIC126-F 5′-AGTCCATGGGCAATAGCGTCCAGCCG-3′ and R1 TraIC126-R 5′-TTTGGTACCCTAATCGCCGCCCAA-3′, were used to create the R1 TraIC126 encoding insert by PCR amplification. pET Z2 contains a 6xHis, IgG binding domain (Z-domain), TEV cleavage site, followed by a multi-cloning site and carries a kanamycin resistance gene. Restriction digestion was carried out with NcoI and KpnI to create sticky ends for both the amplified R1 TraIC126 gene fragment and the template for subsequent ligation.

This pET Z2 R1 TraIC126 construct was transformed into BL21 (DE3) cells (Thermofisher, Waltham, US) and expressed in M9 media (containing 1 g/l 15N ammonium chloride and 3 g/l 13C glucose) for isotope labelling of proteins for NMR experiments (M9 minimal medium (standard) 2010). In 1 L of culture, the protein expression was induced with 1 mM (final concentration) of Isopropyl-β-D-thiogalactopyranoside (IPTG) at an OD600 nm of 0.8 and growth was continued with shaking at 20 °C for 16 h.

The cells were harvested and re-suspended in lysis buffer (50 mM Na2HPO4, 10 mM imidazole, 300 mM NaCl pH 8.0). Re-suspended cells were disrupted using sonication and centrifugation at 13,500 rpm was carried out to separate the cell pellet from the supernatant. The supernatant was filtered through a 0.45 µm filter and protein purification was performed using Ni–NTA affinity chromatography. An equilibrated 10 ml HisTrap column (GE Healthcare, Merck KGaA, Darmstadt, Germany) (with HisTrap flow through buffer: 50 mM Na2HPO4, 10 mM imidazole, 300 mM Nacl, pH 8.0 and HisTrap elution buffer: 50 mM Na2HPO4, 500 mM imidazole, 300 mM Nacl, pH 8.0) was loaded with 10 ml of filtered supernatant and the protein was eluted with HisTrap elution buffer, using an imidazole gradient. The eluted protein was subjected to an overnight TEV cleavage reaction. For this reaction, TEV protease (1:50 TEV protease to protein ratio), 4 mM β-mercaptoethanol was added to the sample and dialyzed against 2 L of dialysis buffer (50 mM Na2HPO4, 10 mM imidazole, 300 mM NaCl, 4 mM β-mercaptoethanol, pH 8.0) for 16 h. This step removes the 6xHIS and Z-domain. A second Ni–NTA purification separated the cleaved 6xHis and Z domain, eluting the protein during the flow through step. Final purification was done by size exclusion chromatography using a Superdex 75S column (GE Healthcare, Merck KGaA, Darmstadt, Germany) in 50 mM Na2HPO4, 300 mM Nacl, pH 8.0. The purified protein was dialyzed into the NMR buffer (50 mM Na2HPO4, 50 mM Nacl, pH 6.5) for 16 h, before acquiring the NMR experiments.

NMR experiments

NMR measurements were carried out on a Bruker Avance III 700 MHz spectrometer equipped with a cryogenically cooled 5 mm TCI probe head. All the experiments were carried out at 298K. 10% D2O was added to the sample and the following triple resonance experiments were acquired for the assignment of the backbone atoms resonances: 1H–15N HSQC, 1H–13C HSQC, HNCACB, HNCA, HN(CA)CO, HNCO, HN(CO)CA and HNCANNH. For side-chain assignments, CC(CO)NH, H(CCO)NH, HBHA(CO)NH and HCCH-TOCSY were acquired. Complete assignments and confirmation thereof was accomplished using 15N-edited NOESY-HSQC and 13C-edited NOESY-HSQC (John et al. 2006). Standard Bruker pulses were used to acquire all the above mentioned triple resonance experiments. The spectra were processed using NMRPipe (Delaglio et al. 1995) and analyzed by CCPNmr (Vranken et al. 2005).

Assignments and secondary structure information

Figure 2a shows the 1H–15N HSQC spectrum of R1 TraIC126 and the apparent limited peak dispersion along the hydrogen dimension, all the backbone amide peaks between 7.6 and 8.7 ppm, shows that R1 TraIC126 is at least partially disordered. The most overlapped region of the HSQC is shown enlarged in Fig. 2b. With the help of the above mentioned backbone spectra, 94% of all 1H, 15N, 13C′, 13Cα/β, 97% Hα and 93% Hβ resonances were assigned. Asn2 was not found in the spectra. The amide nitrogen signals of the five proline residues along with Ser3 and Glu81 could not be assigned. Due to extensive line broadening or overlapping of peaks Arg30 Cβ, Arg34 Cβ, Arg35 Cβ, Pro6 C′, Pro80 C′ and the side chain amide of asparagine and glutamine residues were not assigned. The unassigned peaks in the 15N HSQC do not show correlations in the triple resonance experiments and therefore probably belong to the 6% of NH groups which could not be assigned.

Fig. 2
figure 2

a1H–15N HSQC spectrum of R1 TraIC126 acquired at 700 MHz and pH 6.5 at 298K with 10% D2O. The number and the respective single letter code of amino acids are indicated at each peak. Residue numbers 2 to127 of R1 TraIC126 correspond to residues 1630 to 1756 in the full length TraI. Met1 is added during the cloning and is not a part of R1 TraIC126. b The zoomed region of the 1H–15N HSQC spectrum, highlighted in (a). BMRB access number 27,596

Secondary structure information was obtained from the backbone chemical shifts processed using Talos-N. The prediction shows major disordered regions, as expected from the 1H–15N HSQC spectrum’s poor peak dispersion along the hydrogen dimension. Talos prediction shows that two stretches of helices, spanning from residues 16 to 41 and 102 to 111 are connected through a loop region. The residues 3 to 15, 42 to 101 and 112 to 127 are predicted to be loop / disordered regions (Fig. 3).

Fig. 3
figure 3

Secondary structure prediction of R1 TraIC126: The predictions from Talos-N are presented as bars colored in red for α helix and black for loop/disordered region. A negative score has been given for the loop region for the ease of comparison