Human rotaviruses (RVs) (genus Rotavirus, family Reoviridae) were discovered in 1972 by Bishop and co-workers [4]. Group A RVs are the major etiologic agent of acute gastroenteritis in children during the first years of life throughout the world [26].

Morphologically, RVs are icosahedral particles, containing 11 segments of double-stranded RNA (dsRNA). The outer layer is formed by two capsid proteins: surface protein VP7 (G, glycoprotein) and spike protein VP4 (P, protease-sensitive protein). Based on antigenic and genetic differences of these proteins, RVs are classified into G/P serotypes and G/P genotypes. Currently, 27 G and 35 P genotypes are known to exist in a variety of combinations [20].

Particularly noteworthy are five G/P combinations, G1P[8], G2P[4], G3P[8], G4P[8] and G9P[8], as they are most common in the world among RVs that infect humans [12, 24]. Two external virion proteins, VP7 and VP4, are of particular interest for vaccine development because they are potential targets for neutralising antibodies [15]. The RV G1P[8] type was predominant in Russia up to 2000 [23].

The fourth segment of the genome of RV group A encodes a protease-sensitive VP4, a subunit of which VP8*, is a major determinant of the P serotype (correlated with the P genotype) and an important region, carrying a number of sites for neutralising antibodies [9]. The most widely distributed serotype/genotype in the world, VP4 P1A[8], is included in the developed RV vaccines [6, 8]. This determines the need for monitoring strains of RV, as vaccination and natural immunisation of the population cannot influence the distribution of the same genotype over a long period of time

By the time we started this study, different authors had already found that there are four lineages of P[8] among group A RVs, determined by sequencing the VP8* portion of the VP4 gene, using primers Con2 and Con3 as designed by Gentsch et al. [11]. Allele P[8]-1 was widespread in Bangladesh, Italy, Paraguay, Great Britain, Hungary, Finland, the USA, Taiwan, and Japan [2, 10, 16, 22]. The lineage P[8]-2, which is phylogenetically similar to the reference strains KU and F45, was identified in Paraguay, the USA, Brazil and Korea [10, 18]. The allele P[8]-3 gradually began to spread in the late 1990 s and now is the most common worldwide [2, 10, 16, 18, 22]. The allele P[8]-4 is the least represented and was identified in Finland and Malawi [5].

This paper presents information about the change of alleles P[8] of rotaviruses circulating in the area of Nizhny Novgorod, Russia, over 26 years.

In the years 1984 to 2010, 16,971 faecal samples were studied in children under 6 years of age who were hospitalised in Nizhny Novgorod with acute gastroenteritis. RVs were found in 5,404 (31.8 %) samples by RNA-PAGE and RT-PCR, and of these, the G/P type was identified in 3663 cases.

RV nucleic acid from a 10 % faecal suspension was extracted according to Herring et al. [14]. RNA-PAGE was performed for 18-20 hours in a thin layer of 10 % separating and 3 % PAGE concentrating in Laemmli buffer solution [17]. The resulting gel was stained with silver nitrate by an established method [14].

The dsRNA of RVs was extracted from 10 % suspension of faecal samples by the standard method, using a reagent kit supplied by InterLabService. The VP8* region of the VP4 gene, 877 bp in size, was obtained by RT-PCR using primers Con2 and Con3mod as described previously [11, 21]. The P-genotypes of rotaviruses were identified using primers suggested by Gentsch et al. [11]. The RV G-genotype was determined using RT-PCR based on a set of differentiating primers: GF, G1R, G2R, G4R, G9R, and G3R [7, 13].

The resulting RT-PCR products—fragments of the VP4 gene—were sequenced. Identification of the primary structure of cDNA was performed on two circuits using specific primers, the reagent kit ABI PRISM BigDye Terminator v.3.1, and an ABI Prism 3100 Genetic Analyzer in automatic mode as recommended by the manufacturer.

In this study, we identified 21 nucleotide sequences for the fragment of the VP4 gene and deposited them in the GenBank database under accession numbers GU226766-GU226781 and JX089973-JX089977. The numbers of other sequences in the GenBank database that were used for phylogenetic analysis are listed in the names of isolates presented on the phylogenetic tree. The phylogenetic tree was constructed using the neighbor-joining method and the Kimura two-parameter model in the programme MEGA5 [25].

In the analysis of eletropherotypes (e-types) of RNA isolates of RV genotype P[8], it was established that within its range there were genetic variants that differed radically in the electrophoretic mobility of the fourth segment of the genome. Variants of the virus were detected with slow-migrating (3rd, 4th, 86th e-type RNA) and fast-migrating (42nd, 46th, 52nd, 61st, 72nd, 85th, 88th e-type RNA) RNA segments encoding VP4 (Fig. 1), indicating the existence of at least two alleles that determine the P[8] genotype in the RV strains investigated by us. A strong correlation was established between the electrophoretic mobility of the fourth genomic segment and allele P[8], which confirmed the results of Maunula and von Bonsdorff, obtained in 1998 [21].

Fig. 1
figure 1

Profiles of migration of dsRNA segments of rotavirus in PAGE. Numerals indicate the numbers of e-type RNAs that were assigned in the course of identification of e-type RNA types in our laboratory. SA11 is the monkey group A rotavirus

According to Matthijnssens and co-workers [19], when comparing different strains of RV, more differences are observed in the nucleotide sequences than in the amino acid sequences, and it is therefore more reliable to conduct phylogenetic analysis at the nucleotide level.

We aligned the established partial nucleotide sequences of VP4 with the corresponding nucleotide sequences of the RV strains in the GenBank database using the MEGA5 software (Fig. 2). The figure clearly shows the clustering of strains, revealing at least four clusters within the P[8] genotype, P[8]-1, P[8]-2, P[8]-3, and P[8]-4. As previously reported elsewhere [2, 10], our data show that there is a diversity of VP4 P[8] genotype gene sequences with a high level of variability in the nucleotide sequence encoding the VP8* subunit, which is considered a hypervariable region. In general, the variation in nucleotide sequences of the VP8 P[8] genotype region occurring in these clusters does not exceed 20 % (9.7-12.9 %), which allows them to be assigned to different alleles of the P[8] genotype. Figure 2 shows that RVs detected in Nizhny Novgorod are grouped into clusters, P[8]-1 and P[8]-3. The level of variation in the nucleotide sequences within these clusters reaches 10.2 %.

Fig. 2
figure 2

Phylogenetic trees based on partial nucleotide sequences of the VP4 gene (region VP8*) of rotavirus strains P[8] isolated worldwide. The sequences obtained by us are marked with circles. Numbers of sequences obtained from the GenBank/EMBL/DDBJ database are indicated in the names of the isolates. The phylogenetic tree was made by the neighbour-joining method and the Kimura two-parameter model. Bootstrap values above 50 % are shown at branch nodes

In the P[8]-1 cluster, two subclusters, P[8]-1a and P[8]-1b, are seen, with a variation in the nucleotide sequences of 4.8 %. In the subcluster P[8]-1a, there are Wa-like strains that circulated mainly in the 80-90 s, including in the area of Nizhny Novgorod. In the subcluster P[8]-1b, there are more recent strains that were identified in 2000. In the population of Nizhny Novgorod, only one genetic variant of RV was identified, which was included in this subcluster: RV with 86th e-type RNA, which dominated in the 2003-2004 epidemic season and then disappeared.

Most RV strains circulating in the area of Nizhny Novgorod since the mid-1990 s to the present are characterized by fast-migrating fourth segments of RNA in PAGE. These strains were grouped on the phylogenetic tree in the cluster P[8]-3. Clustering did not exhibit time dependence. RV allele P[8]-3 VP4 was specific to VP7 G1, G3 and G4. On the phylogenetic tree, the cluster P[8]-3 is divided into two groups of sequences with a low level of variation (0.5 %) but a high index of support (89). It is probable that within lineage P[8]-3 VP4 there are at least two geographically independent phylogenetic lines. RV strains in Nizhny Novgorod circulating since 2005 are genetically related to the VP4 gene in strains isolated in European countries (Fig. 2). In the area of Nizhny Novgorod, the alleles P[8]-2 and P[8]-4 of gene VP4 were not detected.

We analyzed the long-term dynamics of circulating genetic variants of RV P[8] with the dominant e-type RNA, which was set with allele P[8]. Figure 3 shows that during 26 years of RV circulation with the VP4 P[8] genotype, a gradual replacement of RV strains P[8]-1 with strains of allele P[8]-3 has occurred. Our research began in 1984. At that time, and until the mid-1990 s, strains of RV P[8]-1, which had a specificity for G1 VP7, had a dominant position.

Fig. 3
figure 3

The change of subgenotypes from P[8]-1 to P[8]-3 during the 26-year-long circulation of group A rotavirus among the children in Nizhny Novgorod. The actual numbers of samples with the genotype P[8] (P[8]-1 – P[8]-3) are shown in brackets after the year. The numbers on the bars indicate the e-types (as in Fig. 1) prevailing for the given epidemiological season

The allele P[8]-3 of the VP4 gene was first detected in the 1985-86 epidemiological season in the RV strains with the 46th e-type RNA, which also showed specificity for G1 VP7. Over time, the RV lineage P[8]-3 increased in circulation, gradually displacing the P[8]-1 strains. In 19 epidemic years, the P[8]-1 allele RV strains were displaced completely (Fig. 3).

Analysis of the evolution of a specific gene is of scientific and practical interest, especially when it comes to genes encoding protective antigens. Of particular importance are studies conducted in one geographical area over a long period of time. Such studies allow the assessment of trends in the changes of genotypes of lineages of viral proteins and their further spread, as well as analysis of the sources of the emergence of new strains. In our study, RVs samples were collected during a 26-year period in a geographically defined population group (Nizhny Novgorod, Russia; children under 6 years) with no vaccine pressure. The first observations carried out in Russia showed that in a population of RV, genotype P[8] VP4 had a dominant position throughout the period of study. We have confirmed that within genotype P[8] VP4, there are several alleles of the gene. In Nizhny Novgorod RV strains, P[8]-1 and P[8]-3 lineages were identified. At the same time, within allele P[8]-1, associated with G1 VP7, the existence of two genetic lines of RV was established: 1a and 1b, which circulated at different periods. Early strains, carrying the allele P[8]-1a associated only with G1, dominated in Nizhny Novgorod until 1995 and now is no longer detected. The RV lineage P[8]-1b in the area of Nizhny Novgorod was detected only in the 2003-04 epidemic season, after which it was also no longer detected. Since 1997 until the present time, the dominant position in the RVs in the Nizhny Novgorod population is held by the RV strains carrying the allele P[8]-3, which is associated with G1, G3 and G4. The shift of the dominant RV allele P[8] VP4 in Nizhny Novgorod reflects the trends occurring in the world. Thus, Wa-like allele VP4 (P[8]-1a), found in RV in the 1970 s in the U.S., Asia and Europe are now not detected. Since 1995, strains G4 and G9 circulating in the world have had the allele VP4 P[8]-3 [13, 10].

Despite the large variety of variants of the P-genotype, P[8] is evolutionary the most successful and widespread in the world. The present article demonstrates the change of alleles within genotype P[8], which is the mechanism of its evolutionary success. These data may help to predict the emergence of a new allele or a cardinal change of genotypes in the circulation of rotaviruses.