DNA sequencing analysis of the S1 gene in commercial vaccines containing either H120 or Ma5 viral strains
DNA sequencing of the IBV S1 gene was performed in commercial vaccines marketed in Brazil, containing either H120 or Ma5 strains. H120 vaccines are produced in Brazil by four different manufacturers, here called vaccines A, B, C, and D, while the Ma5 vaccine is produced by only one manufacturer. Gene sequencing analyses were performed in two consecutive batches of each vaccine, and DNA sequencing results obtained were deposited in the GenBank with the following access numbers: vaccine A (KU736749), vaccine B (KU736748), vaccine C (KU736750), vaccine D (KU736751), and Ma5 vaccine (KU736747).
Analysis of chromatograms revealed that vaccines A, B, C, and D exhibited secondary nucleotide peaks at the positions 125, 345, and 353 (Fig. 1) while vaccine Ma5 only exhibited a secondary peak at the position 125. In this position, vaccines A, B, C, D, and Ma5 exhibited a main peak of thymine and a secondary peak of guanine, suggesting the presence of two viral subpopulations. At the position 345, vaccines A and B exhibited a main peak of thymine and a secondary peak of cytosine, whereas vaccines C and D exhibited the opposite. Ma5 vaccine did not present a secondary peak at the position 345. At the position 353, vaccines A, B, C, and D exhibited a main peak of guanine and a secondary peak of thymine while the Ma5 vaccine did not present a secondary peak either. Of note, we did not observe any changes in the nucleotide sequence of the S1 gene between two consecutive batches of any vaccine produced by the same manufacturer.
Alignment analyses of the amino acid sequence of the S1 spike protein revealed that vaccines A, B, C, D, and Ma5 had up to 13 amino acid (aa) substitutions (Fig. 2a). We found 12 and 13 aa substitutions on vaccine A compared to vaccines B and C, respectively, while vaccine B exhibited only 1 aa substitution compared to vaccine C. Also, vaccine A only had 2 aa substitutions compared to vaccine D. Furthermore, 10 and 11 amino acid substitutions were found on vaccine D compared to vaccines B and C, respectively. We also found 13, 1, 2, and 11 aa substitutions on vaccine Ma5 compared to vaccines A, B, C, and D, respectively.
To investigate the impact of the polymorphisms found on the S1 protein in each vaccine, their three-dimensional (3D) structures were predicted using an in silico method. The best model generated a protein structure with 88.6% of the amino acid residues in accessible regions while 97.5% of amino acid residues were in energetically allowed regions according to Ramachandran plot analyses, and they exhibited a high-quality stereochemistry based on calculations made using ProSa, ERRAT, and WHAT_CHECK programs. The predicted model for the S1 spike protein in vaccine A showed a protein structure essentially formed of beta sheets and loops (Fig. 2b), similar to what has been observed for other coronaviruses genera [33].
Protein structure analyses revealed amino acid substitutions in regions of the S1 protein that play a role in the virus–host cell interaction. The S1 protein region that interacts with receptors of the respiratory tract in chickens exhibited 3 polymorphisms among vaccines in the residues 19–69 (Fig. 2c), and it has been shown that residues 38 and 69 are essential for this interaction [34]. At the position 38, vaccines B, C, D, and Ma5 showed the acidic polar Asp while vaccine A showed the neutral polar Asn. On the other hand, in the position 69, vaccines A and D showed the neutral polar Thr while vaccines C, D, and Ma5 showed the neutral non-polar Ile. We also found 4 and 2 variations among vaccines in epitope regions for B lymphocytes (residues 24–61, 132–149, 181–196, 196–215, 232–247, 282–300, 291–398, and 409–415) (Fig. 2d) and T lymphocytes (residues 44–51, 181–196, 196–215, 232–247, 282–300, and 406–414) (Fig. 2e), respectively.
S1 gene sequencing analyses of H120 vaccines after viral reisolation from embryonated chicken eggs
We investigated the genetic stability of the commercial vaccines containing H120 strain by comparing the S1 gene sequence found in the original vaccines with the ones obtained after an additional passage in SPF embryonated chicken eggs. Chromatogram analyses of the S1 gene sequences of vaccines before and after viral reisolation showed different genetic variations. Vaccines B and D did not exhibit genetic variations in the S1 gene after viral reisolation. Figure 3 depicts the genetic variations observed in other vaccines; the S1 sequence from the original vaccine C exhibited a main peak of cytosine (shown in blue) at the position 345, which represented the predominant viral subpopulation, and a secondary peak of thymine (shown in red), indicating the presence of a smaller subpopulation. On the contrary, after viral reisolation, this smaller subpopulation stood out, even as the predominant subpopulation in one of the sequencing reactions. On the other hand, the original vaccine A only exhibited one peak of adenine (shown in green) at the position 366, but after the reisolation, a secondary peak of guanine (shown in black) was also identified, indicating that a point mutation happened during that extra passage in embryonated chicken eggs before viral reisolation.
S1 gene sequencing analysis of the Ma5 vaccine after viral reisolation from embryonated chicken eggs
The genetic variation in the commercial Ma5 vaccine was examined by comparing the S1 gene sequence in the original vaccine with the one found in the viral vaccine reisolated from the chorioallantoic liquid (CAL) after a single passage in SPF embryonated chicken eggs. We also isolated the virus from different organs of chicken embryos to identify any genetic variation in the S1 gene compared to the original vaccine.
Chromatogram analyses of the S1 gene showed that the original Ma5 vaccine exhibited two peaks at the nucleotide position 125, indicating the presence of two viral subpopulations. After inoculation of SPF embryonated eggs, two viral subpopulations were also identified at the same nucleotide position in all organs collected from the chicken embryos, i.e., trachea, lungs, kidney, intestine, and bursa of Fabricius. On the contrary, the S1 gene sequence found in the virus reisolated from the chorioallantoic liquid only exhibited the predominant nucleotide peak at the same position (Fig. 3).
Analysis of the Ma5 vaccine by HRM
To further investigate the presence of viral subpopulations in the Ma5 vaccine, we specifically amplified the region containing the nucleotide position 125. PCR products were then cloned in a plasmid and transformed in bacteria to be analyzed by HRM, followed by DNA sequencing. Due to the large number of transformant colonies obtained, we performed a real-time PCR assay to amplify each clone and amplicons were then subjected to High-Resolution Melting analyses (HRM). This method was used to help select clones exhibiting different melting temperatures for the same amplicon and to reduce the need to sequence all clones obtained.
To evaluate all 38 clones in triplicate, we performed three runs in different plates, and after careful analyses, we selected six clones that exhibited different melting profiles (clones 5, 6, 11, 16, 18, and 20). After this first screening, a new run was performed with the selected clones. Clone 5 was used as a control (blue horizontal line), in which clones 6, 11, 16, 18, and 20 as well as the original Ma5 vaccine were compared to. Among the six selected clones, five exhibited a different melting profile compared to the one exhibited by the original Ma5 vaccine and only the clone 20 exhibited a melting profile similar to the original Ma5 vaccine. The original Ma5 vaccine and clone 20 exhibited positive peaks while clone 11 exhibited a negative peak. On the other hand, clones 6, 16, and 18 exhibited a sinusoidal curve (Online Resource 2).
Selected clones were then subjected to DNA sequencing to identify viral subpopulations. Sequencing alignments indicated a common polymorphism in all clones in the nucleotide position T129C when compared to the original Ma5 vaccine. Additional polymorphisms were also found on clones 20 (A140G), 5 (T193C), and 11 (G284A), whereas clones 6, 16, and 18 exhibited identical sequences (Table 1). However, we did not confirm the presence of viral subpopulations based on the nucleotide position 125.
Table 1 Polymorphisms found in the S1 gene sequences of six clones selected by High-Resolution Melting analyses (HRM) compared to the one from the original Ma5 vaccine