Findings

Japanese encephalitis virus (JEV) is the prototype member of the JEV serogroup within the genus Flavivirus, family Flaviviridae. JEV comprises five genotypes (G1-G5) [1-3]. In previous studies, the phylogenetic characteristics of JEV were analyzed and the most recent common ancestor (TMRCA) was estimated. The TMRCA of JEV was estimated to be 1690 years when calculations were based on the complete sequence of four genotypes (G1-G4) [4], whereas, analysis of JEV using a limited number of whole genomic sequences from five genotypes indicated that TMRCA of JEV appeared approximately 460 years [5]. More recently, however, G5 strain XZ0934 isolated in 2009, which had not been included in earlier analyses, was shown to be significantly different from the G5 Muar isolate [6]. Therefore, in order to improve our understanding of the evolutionary progress and population diversity of JEV, a comprehensive dataset was established for evolutionary analysis of JEV in this study. In the dataset, 100 whole genomic sequences of JEV representing all five genotypes of JEV, isolated from various hosts (humans, pigs and bats) and vectors (mosquitoes and midges) were collected and analyzed.

Two G5 JEV full-length genome sequences (Muar and XZ0934) were downloaded from GenBank (GB No. HM596272 and JF915894, respectively) and added to the database established in a previous report [4], forming a new database for analysis (Table 1). The new JEV sequence database was analyzed using Bayesian Markov chain Monte Carlo (MCMC) method. The General Time Reversible (GTR) model + Invariant (I) + Gamma (G) model was selected using MrModelTest [7]. The nucleotide substitution rates and divergence times of the most recent common ancestor (TMRCA) were estimated using the relaxed (uncorrelated lognormal) molecular clock model in the BEAST software package [8]. Demographic histories of JEV were inferred based on Bayesian skyline reconstruction. The analysis was run through 1,000,000,000 generations to ensure sufficient mixing. Finally, the maximum clade credibility (MCC) tree was built using TreeAnnotator with 10% burn-in (http://beast.bio.ed.ac.uk/).

Table 1 Information of JEV isolates analyzed in this study

Based on Bayesian Markov chain Monte Carlo (MCMC) analysis, the maximum clade credibility (MCC) tree for the whole genomic sequences of JEV was established (Figure 1). Representatives of the five distinct lineages were included in the analysis. The posterior probability values for the nodes of each lineage were >0.95, indicating their robustness. JEV was estimated to have emerged 3255 years ago (95% HPD: −978 to −6125 years) and subsequently diverged at least five times to produce the 5 recognized genotypes. In chronological order, they diverged in the order G5, G4, G3, G2 and G1. Thus, G5 represents the most ancestral lineage among genotypes 1–5.

Figure 1
figure 1

Maximum clade credibility (MCC) tree for 100 whole-genome sequences of JEV. Five distinct lineages were identified: G1 (red), G2 (yellow), G3 (blue), G4 (green) and G5 (orange). Estimated TMRCAs of these lineages (with their 95% HPD values in parentheses) are shown, G1: 155(104–315), G2: 530(235–1131), G3: 880(420–1855), G4: 1653(765–3372), and G5: 3255(978–6125).

The mean rate of nucleotide substitution for the whole genomic sequences of 100 JEV strains isolated from a variety of hosts worldwide, estimated using a Bayesian MCMC approach, was 1.01 × 10−4 nucleotide substitutions per site per year (95% HPD values, 4.37 × 10−5, 1.56 × 10−4). This is similar to previous estimates based on analysis of four JEV genotypes [4].

The population dynamics of JEV are shown in Figure 2. The skyline plot showed that the JEV population had experienced complicated changes during the process of evolution. However, the virus population remained relatively stable during the first 2700 years (Figure 2A), followed by a period of rapid decline from the 1700s, reaching a minimum in the 1900s. It then increased rapidly from the 1930s until the 1960s and formed the first peak. The second peak appeared in the 1980-1990s and subsequently the populations of JEV remained high after 2000 (Figure 2B).

Figure 2
figure 2

Bayesian skyline plots for JEV. Highlighted areas correspond to 95% HPD intervals. (A) Populations during the whole evolutionary history; (B) Populations during the later evolutionary history since 1800.

The findings in this study have similarities with previous studies [5]. For example, the divergence pattern of the genotypes occurred in the order G5, G4, G3, G2 and G1, and the mean rate of nucleotide substitution was similar to previous estimates. However, the occurrence time of TMRCA determined in this study (~3255 years ago) was quite different compared with that measured in the report (~460 years ago) of Mohammed et al. [5]. The reason for this discrepancy could be attributed to the dataset used for analysis. In Mohammed’s study, only 35 whole genomic sequences were used and the only G5 representative included was the Muar strain. Therefore, since our new dataset includes two G5 representatives with robust sequences, the occurrence time of TMRCA (~3255 years ago) obtained in this study should reflect more precisely the evolutionary patterns and diversity of JEV.

Two main peak periods in population dynamics were identified in this study, 1930–1960 and 1980-1990s, respectively. These fluctuations were reflections of the virus activity in the sylvatic environment. Since the 1930s, JEV strains belonging to G3 emerged and were isolated from Asian countries. G5 was first identified in 1952 [2,4]. This was a good interpretation of the first peak. Subsequently, G2 and G4 strains were isolated during the 1980s. Importantly, the G1 genotype emerged during the 1980s onwards [4]. Therefore, the virus population diversity peaked in the 1980-1990s. Since 2000, G1 JEV has become the dominant genotype in most endemic regions [4], and although a relatively small decrease was observed, the virus remains the most active and G5 reemerged.

Interestingly, although G5 is estimated to be the most ancestral JEV lineage, this virus showed a highly active dispersal capacity following its reemergence. Indeed, this new G5 strain was isolated from mosquitoes collected in southern region of the Asian continent (Tibet, China) in 2009 [2] and northeast region of Asia (South Korea) during the same year [9]. Thus, G5 now appears to be dispersing widely in Asia. A recent study showed that genotype 1 JEV originated in Southeastern Asia and spread to the entire Asian continent [10]. Based on these observations, it seems likely that G5 will follow a dispersal pattern similar to that of G1 JEV, and has dispersed or will disperse over the entire Asian continent. Clearly, G5 should be monitored closely throughout JEV endemic regions.

Finally, the available inactivated and live attenuated JE vaccines are derived from G3 JEVs [11]. Thus, the level of cross protection of the current vaccines against G5 JEV is likely to be sub-optimal and should therefore be analyzed carefully since the reemergence of G5 and its widespread dispersal, and significant genetic variation could impact on its epidemiology. This possibility is emphasized by the fact that Muar (the first G5 JEV) strain was isolated from a patient with severe viral encephalitis [12]. Thus, there is the realistic possibility that the newly isolated G5 viruses could be highly virulent. Thus, the potential disease burden of viral encephalitis caused by G5 JEV requires careful reassessment.