Cross-border spread, lineage displacement and evolutionary rate estimation of rabies virus in Yunnan Province, China

Zhang, Yuzhen; Vrancken, Bram; Feng, Yun; Dellicour, Simon; Yang, Qiqi; Yang, Weihong; Zhang, Yunzhi; Dong, Lu; Pybus, Oliver G.; Zhang, Hailin; Tian, Huaiyu

doi:10.1186/s12985-017-0769-6

Cross-border spread, lineage displacement and evolutionary rate estimation of rabies virus in Yunnan Province, China

Research
Open access
Published: 03 June 2017

Volume 14, article number 102, (2017)
Cite this article

Download PDF

You have full access to this open access article

Virology Journal Aims and scope Submit manuscript

Cross-border spread, lineage displacement and evolutionary rate estimation of rabies virus in Yunnan Province, China

Download PDF

Yuzhen Zhang¹,
Bram Vrancken²,
Yun Feng¹,
Simon Dellicour²,
Qiqi Yang³,
Weihong Yang¹,
Yunzhi Zhang¹,
Lu Dong⁴,
Oliver G. Pybus⁵,
Hailin Zhang¹ &
…
Huaiyu Tian³

6037 Accesses
22 Citations
1 Altmetric
Explore all metrics

Abstract

Background

Rabies is an important but underestimated threat to public health, with most cases reported in Asia. Since 2000, a new epidemic wave of rabies has emerged in Yunnan Province, southwestern China, which borders three countries in Southeast Asia.

Method

We estimated gene-specific evolutionary rates for rabies virus using available data in GenBank, then used this information to calibrate the timescale of rabies virus (RABV) spread in Asia. We used 452 publicly available geo-referenced complete nucleoprotein (N) gene sequences, including 52 RABV sequences that were recently generated from samples collected in Yunnan between 2008 and 2012.

Results

The RABV N gene evolutionary rate was estimated to be 1.88 × 10⁻⁴ (1.37–2.41 × 10⁻⁴, 95% Bayesian credible interval, BCI) substitutions per site per year. Phylogenetic reconstructions show that the currently circulating RABV lineages in Yunnan result from at least seven independent introductions (95% BCI: 6–9 introductions) and represent each of the three main Asian RABV lineages, SEA-1, -2 and -3. We find that Yunnan is a sink location for the domestic spread of RABV and connects RABV epidemics in North China, South China, and Southeast Asia. Cross-border spread from southeast Asia (SEA) into South China, and intermixing of the North and South China epidemics is also well supported. The influx of RABV into Yunnan from SEA was not well-supported, likely due to the poor sampling of SEA RABV diversity. We found evidence for a lineage displacement of the Yunnan SEA-2 and -3 lineages by Yunnan SEA-1 strains, and considered whether this could be attributed to fitness differences.

Conclusion

Overall, our study contributes to a better understanding of the spread of RABV that could facilitate future rabies virus control and prevention efforts.

Phylodynamic and transmission pattern of rabies virus in China and its neighboring countries

Article 30 May 2019

Evolutionary analysis of rabies virus isolates from Guangxi Province of southern China

Article Open access 18 June 2018

Sequence analysis of nucleoprotein gene reveals the co-circulation of lineages and sublineages of rabies virus in herbivorous in Rio Grande do Sul state, Brazil

Article 17 January 2020

Background

Rabies has one of the highest mortality rates of infectious diseases in humans, and kills almost 60,000 persons each year [1]. It is caused by the rabies virus (RABV), a virus that belongs to the genus Lyssavirus within the family Rhabdoviridae, and which is distributed almost worldwide, in particular in developing countries [2]. Asia has the highest global incidence of rabies, accounting for more than 80% of total cases [3], and China is one of the countries with the highest number of human rabies cases. Dogs are the principal vector of human rabies (98% of human cases result from dog bites) and can spread the virus across national borders and over long distances [4,5,6]. The RABV migration patterns can be reconstructed by analysing virus gene sequences in a phylogeographic framework, which has the potential to inform public health decisions [4].

Several major geographically-distinct clades of RABV species in nonflying mammals have been identified based on phylogenetic analyses of the RABV N gene; these are the African, cosmopolitan, Arctic-related, Asian, and Indian subcontinent clades [7, 8]. Viruses of the latter three clades circulate in Asia: (i) Indian subcontinent clade viruses are found in southern India and Sri Lanka; (ii) Arctic-related clade viruses have been reported across a large region, from Russia and central Asia to eastern Asia [9, 10]; (iii) Asian clade viruses are mainly distributed in Southeast Asia [8]. In a recent study, Guo et al. demonstrated that RABV sequences from Southeast Asia belong to three distinct clades, SEA-1 (China and Indonesia), SEA-2 (South China and the Philippines), and SEA-3 (Malaysia, Vietnam, Laos, Cambodia, and Thailand) [11]. In previous work, we sequenced 52 new RABV isolates sampled between 2008 and 2012 in Yunnan province, China. Two lineages of the clades identified in Yunnan were closely related to strains from North or South China, whilst another lineage was closely related to SEA strains [12]. Consequently, Yunnan Province was identified as a potential focal point in the region for the spread of endemic RABV across Southeast Asia. An increasing incidence of rabies cases has been reported in Yunnan, occurring in only one county in 2000 but in 77 counties in 2012 [12].

In this study, we used a Bayesian phylogenetic molecular clock approach to explore the spatio-temporal history of RABV in Yunnan, an area of China that borders three countries (Vietnam, Laos, and Myanmar) in Southeast Asia. The second aim of this study was to examine possible causes of the apparent RABV lineage displacement event that occurred in Yunnan, in which two of the three RABV lineages disappeared after 2011.

Results

Lineages of the three main Asian clades circulate in Yunnan

Consistent with previous analyses, a phylogenetic analysis of the N gene from nonflying mammals identified six distinct clusters in Asia, which exhibited clear spatial structure: the Indian subcontinent, cosmopolitan, Arctic-related, Southeast Asia-1 (SEA-1), SEA-2, and SEA-3 lineages (Fig. 1) [7, 8, 11, 13]. The geographic locations of the different clusters of Asian RABV, SEA-1–3, are shown in Fig. 1a. Clade SEA-1, the most widespread clade across China, has been isolated in both North and South China and includes a single cluster from Indonesia (yellow highlight in Fig. 1b). Clade SEA-2 is mainly distributed in South China, South of the Yangtze River (Fig. 1a), with a single cluster in the Philippines (green highlight in Fig. 1b). Clade SEA-3 is widely distributed in Southeast Asian countries, including Myanmar, Thailand, Laos, Vietnam, Cambodia, and Yunnan Province in China. Phylogenetic analysis showed that the post-2000 Yunnan epidemic was caused by lineages from all three SEA clades (Fig. 2a).

Temporal and spatial dynamics of RABVs in China and Southeast Asia

Our estimate of the evolutionary rate of the N gene of RABV is 1.88 × 10⁻⁴ (95% highest posterior density, HPD: 1.37 × 10⁻⁴ –2.41 × 10⁻⁴) substitutions per site per year. This is highly consistent with previous estimates [8, 14], and was used to calibrate the timescale of Asian RABV epidemic history.

We find strong Bayes factor support for introductions of RABV into Yunnan from North and South China (Table 1 and Additional file 1: Figure S1). Migrations from North to South China and vice versa, and from Southeast Asia to South China, are also well supported (Table 1). In contrast, the lineage migration link between Yunnan and Southeast Asian countries is less well supported but still significant (Table 1). Combined, this provides significant support for a role of Yunnan as a connector of the RABV epidemics in North and South China with those in Southeast Asia. A visualisation of the well-supported pathways of RABV migration in Asia is provided in Additional file 1: Figure S1.

Table 1 Overview of the significant migration links between locations

Full size table

The Bayesian stochastic search variable selection (BSSVS) analysis provides information on the statistical support for each possible migration link among location pairs, but does not inform on the relative magnitude of viral movement. For this, we quantified the expected number of virus movements between locations (Markov jumps; Table 1). This shows that the bulk of virus flow is between North and South China (69.17%, Table 1). Further, almost equal amounts of virus lineage migration out of North (19.84%) or South (24.58%) China seed the RABV epidemic in Yunnan, and little more than 1% of all migration events are directed from Yunnan towards Southeast Asia. The imbalance between RABV import and export from Yunnan is reflected in the difference between the number of virus lineage movements from and to Yunnan, which is −6.1 (95%BCI: −3 to −8). The only other well-supported migration link (from southeast Asia to south China) accounts for another 10.25% of the Markov jumps. Similar patterns were obtained using a downsampled dataset with a maximum of 20 sequences per area per year (Table 1), which suggests that the results here do not appear to be driven by heterogeneous sampling.

The estimated number of independent RABV introductions into Yunnan is 7 (95% Bayesian credible interval: 6 - 9). This includes five well-supported clusters (>85% posterior support) of at least two lineages sampled in Yunnan (Fig. 2a). The SEA-3 Yunnan lineages (YN-B) are separated from the other SEA-3 strains by a long branch representing almost 140 years of evolution (Fig. 2a and c). Such a long period of undetected circulation in Yunnan is highly unlikely, and indicates that a substantial part of the SEA-3 diversity is not represented among the available data in GenBank and/or escapes detection by current surveillance programs. The t_MRCA estimate of the YN-B clade is 1991.7 (95%HPD: 1969.6–2003.4). Given the increased awareness after 2000 it can be assumed that RABV circulation in Yunnan was rapidly detected after that date. If RABV was detected within 1 year, the finding that 97.9% of the YN-B root node t_MRCA 95% HPD predates 2001 suggests that it is very likely that RABV was introduced into Yunnan from SEA on multiple occasions (at least two or three) (Fig. 2c). However, the 95%HPD interval of this clade’s t_MRCA extends to almost mid-2002 and the epidemic started in 2000, meaning that on the basis of these timings the possibility of one introduction event cannot be dismissed. The t_MRCAs of the four other Yunnan clades are more recent (Fig. 2b). Given that each represents one introduction event, this shows that these lineages entered Yunnan within a rather narrow time window starting from around 2000, which is consistent with previous observations [12].

Different sites evolved under diversifying selection in the three Asian RABV lineages

The overall d_N/d_S ratio for the N gene of RABV in Yunnan was low (0.047), suggesting that strong purifying selection has been the main evolutionary pressure acting on RABV [15]. Interestingly, the branch-specific d_N/d_S values for RABV clades SEA-2 and SEA-3 in Yunnan were higher (0.056) than that for branches in clade SEA-1 (0.009). We applied two likelihood ratio tests (LRTs) that compare codon substitution model M7 (no positive selection) against model M8 (positive selection allowed) in order to detect positive selection at the 5% level. For the SEA-1 clade, the LRT statistic comparing M7 and M8 was 2 InL = 2 × (InL _M8 - InL _M7) = 11.28; thus the null hypothesis of no positive selection could be rejected. For clades SEA-2 and SEA-3, they were 2 InL = 2 × (InL _M8 - InL _M7) = 185.50. Some sites in the N gene (sites 90, 128, 135, 379, 397, and 426) were identified with both the M8 model in CODEML and the FEL method in HYPHY as being positively selected (Table 2). Overall, these results are indicative of positive selection acting at these sites. We then compared the amino acid sites that vary between the SEA-1, SEA-2, and SEA-3 clades (Table 3). The SEA-2 lineages carry six different N-gene AAs and the SEA-3 strains seven AAs, when compared to SEA-1 lineages. Two of these differences with respect to SEA-1 strains are shared between the SEA-2 and SEA-3 lineages: S42T and S397G (Table 3). It is of note that the SEA-3 AA residues at positions within linear epitopes (sites 375 and 379) are different from those in SEA-1 strains, in particular because both sites were subject to diversifying selection according to at least one method (Tables 2 and 3). The SEA-2 lineages, on the other hand, share their AA residues at these epitope positions with SEA-1 strains.

Table 2 Positively selected sites in the N gene in Yunnan

Full size table

Table 3 Key variable sites between the clade SEA-1, clade SEA-2, and clade SEA-3 viruses, Yunnan Province

Full size table

Discussion

Rabies remains one of the most dangerous but neglected diseases in developing countries, especially in Asia and Africa, where >95% of human cases occur [16]. In this study, we used Bayesian phylodynamic methods to investigate the spatiotemporal aspects of RABV evolution in Asia with a focus on the role of the Yunnan province of China in the regional epidemic history of the virus. For this we used publicly available time-annotated near complete viral genomes, as well as N gene sequences whose locations and dates of sampling were known. The former were used to estimate gene-specific evolutionary rates, which allowed us to put a timescale on virus migration history.

Results from the evolutionary reconstructions show that the reemergent RABV lineages in Yunnan since 2000 contain representatives of the three main Southeast Asian clades (Fig. 2a). Interestingly, no SEA-3 strains were detected after 2010 and only one SEA-2 lineage was found in 2011 in Yunnan, even though the surveillance efforts covered the same geographical area throughout the entire sampling period (2008–2012) [12]. Conditioning on the representativeness of our sampling for the Yunnan RABV epidemic, this indicates that the YN-B and YN-C strains were unable to maintain sustained transmission chains. This can either be a stochastic variation or represent differences in adaptation to the epidemiological environment in Yunnan, which might enable SEA-1 to outcompete the SEA-2 and SEA-3 strains. The consensus amino acids of two antibody-binding sites that have been positively selected in the SEA-3 clade differ from those in SEA-1 lineages, which is consistent with the hypothesis that the apparent lineage displacement of SEA-3 by SEA-1 lineages is linked to differences in “pre-adaptation” [17] to the Yunnan epidemiological environment. The AA consensus residues at the antibody-binding sites are shared between SEA-1 and SEA-2 strains, which means that fitness differences linked to immunogenic properties cannot explain the apparent SEA-2 lineage displacement. Thus, a mixture of neutral lineage dynamics (the disappearance of SEA-2) and selection (the disappearance of SEA-3) may have been at play. More research is, however, needed to test and validate the hypothesis of a relationship between lineage displacement and fitness differences. For example, the SEA-2 and SEA-3 clades harbor the same consensus residue difference with respect to the SEA-1 strains at positions 42 and 397, but there is no information about the potential impact of this on the virus’s replicative fitness. Finally, this evaluation is limited to the N gene, and cannot detect an impact of AA substitutions in other genes.

The lineage migration analyses revealed that there is no well-supported RABV migration from China to other countries other than migration via Yunnan. Complementary to its role as a funnel for RABV migration from north and south China to southeast Asia, Yunnan also acts as a sink region for domestic RABVs. Previous work has associated RABV dissemination with patterns of animal trade and dog relocation [18,19,20,21]. However, our results also show that a completely effective screening at best would have prevented only 1 strain from starting a chain of infections every two years (Additional file 1: Figure S3). Even though these estimates represent a lower boundary, the large number of screenings required to detect a positive case indicates that alternative intervention efforts based on early detection and large-scale preventive vaccination campaigns likely are more effective in containing RABV spread.

Import of RABV into south China from southeast Asia was the only well-supported immigration link into China. The temporal dynamics of the YN-B lineage however, indicates that the sample of the SEA-3 genetic diversity in GenBank likely is incomplete. This, together with the frequent clustering of the YN-B clade as an outgroup to the other SEA-3 strains (Figure 2a and c), likely accounts for the failure of the phylogeographic analysis to detect a well-supported link for RABV import into China from SEA via Yunnan. It also follows that the high ratio of expected within-country movements versus international movements (Table 1), represents an upper limit. Nevertheless, it is clear that in China within-country circulation accounts for more infections than virus importation.

Conclusion

This work provides new insights into the spatial diffusion patterns of RABV across large-scale regions in Asia. The main limitation of this work is the lack of a genome-wide perspective on RABV evolution, which is a consequence of the lack of large numbers of publicly available RABV sequence information for genes other than the N gene. However, as recombination in RABV is rare, our N-gene based results may be representative of the whole genome, similar to that seen for hepatitis C viruses [22].

Methods

Sample collection

Between 2008 and 2012, brain tissue was collected from dogs and deceased patients, and saliva and cerebrospinal fluid samples from surviving patients in Yunnan Province, China, as described previously [12]. All samples were tested for RABV antigen, and the viral genes of the rabies-positive samples were sequenced. Viral RNA was extracted by using TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) and used as template for cDNA with Ready-To-Go You-Prime First-Strand Beads (GE Healthcare, Piscataway, NJ, USA). Complete N gene sequences were obtained by using primers specific for this gene as described [19, 23]. PCR products were purified by using the QIAquick PCR Purification Kit (QIAGEN, Hilden, Germany) and sequenced. A total of 52 complete RABV nucleoprotein (N) gene sequences were obtained using primers specific for this gene, as described elsewhere [12, 19, 23].

The nucleotide sequences of the N genes of all available RABVs sampled from nonflying mammals in Asia were extracted from GenBank. The sequences were aligned using the MAFFT program [24]. A total of 452 N sequences (the entire N-coding sequence, 1350 nt long), including our newly acquired sequences, were analyzed in this study (Additional file 1: Table S1). Preliminary maximum likelihood trees for data verification (GTR substitution model with 10,000 bootstrap replicates) were estimated using FASTTREE v2.1.4 [25]. Additional file 1: Table S1 shows the accession numbers, dates of isolation, and origins of the included isolates. Potential recombinant sequences identified by the Recombination Detection Program [26] were excluded from further analyses (GenBank accession numbers FJ712195, JN974825, FJ712196, and EU159363).

Evolutionary rate estimation for N gene sequences

Because preliminary analysis indicated that the N-gene dataset lacked a clear temporal signal we first estimated gene-specific evolutionary rates using the available data in GenBank to specify a suitable prior distribution for the evolutionary rate parameter in subsequent time-scaled phylogenetic analyses. The (near) full genomes (sequence lengths >9550 nt, Additional file 1: Table S2) for which the collection date is specified in GenBank almost all represent the completely N, G, M1, M2 and L genes (see Additional file 1: Figure S4) and were aligned with MAFFT [27]. After removing two 1931 and three 1956 sequences from cell culture propagated lineages, we followed Vrancken et al. [28] to obtain a downsampled dataset with clear temporal signal. Plotting the root-to-tip genetic divergence against sampling time [29] revealed one outlying sequence that was consequently removed. The resulting 93 taxon dataset with samples collected between 1950 and 2015 showed no signal for recombination with the Phi-test [30] nor for confounding between temporal and genetic structure [31]. Reassuringly, the correlation between accrued genetic divergence and sampling time (rho = 0.27) as measured with linear regression was significant (p < 0.01 against a null distribution created from 1000 randomisations) [31]. The regression was performed on a maximum likelihood tree estimated using the default PhyML settings in Seaview [32, 33]. The sampling time distribution of the final dataset is given in Additional file 1: Table S3.

To obtain gene-specific substitution rate estimates we follow the strategy of Al-Qahtani et al. [34]. Rates were estimated with an uncorrelated clock model with the rates drawn from a lognormal distribution [35] and a relative rate approach was used to allow for among gene rate variation [13]. The SkyGrid model was specified as a flexible prior on the tree [36]. A HKY + Gamma substitution model was fitted to each partition and information on the substitution processes was shared across the gene partitions through hierarchical prior distribution specification [37]. As a final check we also conducted a date-randomisation test as implemented in BEAST [38] to compare the rate estimates obtained under the tip-date calibrated clock model with a corresponding null distribution. Following Firth et al [39] we accepted the veracity of the temporal signal because the mean of the substitution rate estimate obtained from the empirical sequence sampling dates falls outside the 95% highest posterior density interval (95% HPD) of the null distribution of rates from the randomised data. The gene-specific evolutionary rate estimate for the N-gene was used to specify a normal prior distribution on the mean clock rate for our RABV dataset. Specifically, the mean of the latter was set to the mean of the N-gene rate estimate, and the standard deviation was set so that the 2.5 and 97.5 percentiles of the normal distribution correspond to the lower and upper boundaries of the N-gene rate estimate’s 95% HPD.

Phylogeography of the RABV N gene

To mitigate possible biases arising from the uneven sample size at each location (Additional file 1: Figure S1) the sequences of the N gene were grouped into six geographic regions: North China (including Hebei, Beijing, Shanxi, Shaanxi, Ningxia, Shandong, Henan, Jiangsu, Anhui, Sichuan, Hubei, and Chongqing, n = 128), South China (including Hunan, Jiangxi, Guizhou, Fujian, Guangxi, Shanghai, and Zhejiang, n = 180), Yunnan Province (n = 63), Island area-1 (China Taiwan and the Philippines, n = 13), Island area-2 (Indonesia, n = 33), and Southeast Asia (Myanmar, Thailand, Laos, Vietnam, and Cambodia, n = 35) (Additional file 1: Figure S2). In order to create a more even spatio-temporal sampling distribution, we also randomly subsampled the large sequence datasets by area and sampling time. At most 20 sequences were sampled per area and per year. After sub-sampling the total number of sequences analyzed here was 401 over a total of 51 years (Additional file 1: Table S4).

Discrete phylogenetic analyses of RABV lineage movement among these six geographic regions were performed under an asymmetric model of among-location transition [40], estimated using the MCMC approach implemented in BEAST v. 1.8.2 (http://beast.bio.ed.ac.uk/). A Bayesian skygrid coalescent model [36], the SRD06 model of nucleotide substitution [41], and a relaxed (uncorrelated log-normal) molecular clock model were used in this analysis. A Bayesian stochastic search variable selection (BSSVS) approach was used to identify the best-supported among-location movement rates and Spread3 [42] was used to calculated Bayes factor (BF) support for the transition rates. We also estimated the number of transitions among locations (Markov jumps) [43]. Three chains of 2.5 × 10⁸ steps, subsampled every 50,000th generation, were combined after the removal of the 10% burn-in. The convergence of MCMC output of all parameters was inspected with Tracer v1.5.

Analysis of selection pressures

We used a phylogenetic codon substitution model approach, implemented in the CODEML program of the PAML package [44], to detect positive selection in RABV in Yunnan Province. The numbers of nonsynonymous (d_N) and synonymous substitutions (d_S) per site and their ratio (d_N/d_S) were estimated for each codon of RABV from nonflying mammals in Yunnan, using the M7 and M8 models [45]. Putative positively selected sites were also identified using the fixed-effect likelihood (FEL) method implemented in HYPHY [46]. An amino acid position was considered to be under positive selection if its d_N/d_S ratio was > 1 and the likelihood ratio test (LRT) had a significance level of P < 0.05. An overall dN/dS ratio was also calculated using all branches (one-ratio model) for clades SEA-1–3. To examine selection pressures in more detail, separate dN/dS values were estimated for branches of the same phylogeny (two-ratio model).

Abbreviations

BSSVS:: Bayesian stochastic search variable selection
FEL:: Fixed-effect likelihood
HPD:: Highest posterior density
LRT:: Likelihood ratio test
RABV:: Rabies viruses

References

Fooks AR, Banyard AC, Horton DL, Johnson N, McElhinney LM, Jackson AC. Current status of rabies and prospects for elimination. Lancet. 2014;384:1389–99.
Article PubMed Google Scholar
Knobel DL, Cleaveland S, Coleman PG, Fèvre EM, Meltzer MI, Miranda MEG, Shaw A, Zinsstag J, Meslin F-X. Re-evaluating the burden of rabies in Africa and Asia. Bull World Health Organ. 2005;83:360–8.
PubMed PubMed Central Google Scholar
Tang X, Luo M, Zhang S, Fooks AR, Hu R, Tu C. Pivotal role of dogs in rabies transmission, China. Emerg Infect Dis. 2005;11:1970–2.
Article PubMed PubMed Central Google Scholar
Talbi C, Lemey P, Suchard MA, Abdelatif E, Elharrak M, Jalal N, Faouzi A, Echevarría JE, Morón SV, Rambaut A. Phylodynamics and human-mediated dispersal of a zoonotic virus. PLoS Pathog. 2010;6:e1001166.
Article PubMed PubMed Central Google Scholar
Bourhy H, Nakouné E, Hall M, Nouvellet P, Lepelletier A, Talbi C, Watier L, Holmes EC, Cauchemez S, Lemey P. Revealing the micro-scale signature of endemic zoonotic disease transmission in an African urban setting. PLoS Pathog. 2016;12:e1005525.
Article PubMed PubMed Central Google Scholar
Pybus OG, Rambaut A. Evolutionary analysis of the dynamics of viral infectious disease. Nat Rev Genet. 2009;10:540–50.
Article CAS PubMed Google Scholar
Meng S, Sun Y, Wu X, Tang J, Xu G, Lei Y, Wu J, Yan J, Yang X, Rupprecht CE. Evolutionary dynamics of rabies viruses highlights the importance of China rabies transmission in Asia. Virology. 2011;410:403–9.
Article CAS PubMed Google Scholar
Bourhy H, Reynes J-M, Dunham EJ, Dacheux L, Larrous F, Huong VTQ, Xu G, Yan J, Miranda MEG, Holmes EC. The origin and phylogeography of dog rabies virus. J Gen Virol. 2008;89:2673–81.
Article CAS PubMed PubMed Central Google Scholar
Hyun B-H, Lee K-K, Kim I-J, Lee K-W, Park H-J, Lee O-S, An S-H, Lee J-B. Molecular epidemiology of rabies virus isolates from South Korea. Virus Res. 2005;114:113–25.
Article CAS PubMed Google Scholar
Nagarajan T, Mohanasubramanian B, Seshagiri E, Nagendrakumar S, Saseendranath M, Satyanarayana M, Thiagarajan D, Rangarajan P, Srinivasan V. Molecular epidemiology of rabies virus isolates in India. J Clin Microbiol. 2006;44:3218–24.
Article CAS PubMed PubMed Central Google Scholar
Guo Z, Tao X, Yin C, Han N, Yu J, Li H, Liu H, Fang W, Adams J, Wang J. National borders effectively halt the spread of rabies: the current rabies epidemic in China is dislocated from cases in neighboring countries. PLoS Negl Trop Dis. 2013;7:e2039.
Article PubMed PubMed Central Google Scholar
Zhang H, Zhang Y, Yang W, Tao X, Li H, Ding J, Feng Y, Yang D, Zhang J, He J. Molecular epidemiology of reemergent rabies in Yunnan Province, southwestern China. Emerg Infect Dis. 2014;20:1433.
Article CAS PubMed PubMed Central Google Scholar
Harrison A, Lemey P, Hurles M, Moyes C, Horn S, Pryor J, Malani J, Supuri M, Masta A, Teriboriki B. Genomic analysis of hepatitis B virus reveals antigen state and genotype as sources of evolutionary rate variation. Viruses. 2011;3:83–101.
Article PubMed PubMed Central Google Scholar
Hughes GJ, Orciari LA, Rupprecht CE. Evolutionary timescale of rabies virus adaptation to North American bats inferred from the substitution rate of the nucleoprotein gene. J Gen Virol. 2005;86:1467–74.
Article CAS PubMed Google Scholar
Holmes EC, Woelk CH, Kassis R, Bourhy H. Genetic constraints and the adaptive evolution of rabies virus in nature. Virology. 2002;292:247–57.
Article CAS PubMed Google Scholar
Bourhy H, Dautry-Varsat A, Hotez PJ, Salomon J. Rabies, still neglected after 125 years of vaccination. PLoS Negl Trop Dis. 2010;4:e839.
Article PubMed PubMed Central Google Scholar
Carlson JM, Du VY, Pfeifer N, Bansal A, Tan VY, Power K, Brumme CJ, Kreimer A, DeZiel CE, Fusi N. Impact of pre-adapted HIV transmission. Nat Med. 2016;22:606–13.
Article PubMed PubMed Central Google Scholar
Nadin-Davis SA, Turner G, Paul JP, Madhusudana SN, Wandeler AI. Emergence of Arctic-like rabies lineage in India. Emerg Infect Dis. 2007;13:111–6.
Article CAS PubMed PubMed Central Google Scholar
Tao X, Tang Q, Li H, Mo Z, Zhang H, Wang D, Zhang Q, Song M, Velasco-Villa A, Wu X. Molecular epidemiology of rabies in Southern People’s Republic of China. Emerg Infect Dis. 2009;15:1192–8.
Article CAS PubMed PubMed Central Google Scholar
Yamagata J, Ahmed K, Khawplod P, Mannen K, Xuyen DK, Loi HH, Van Dung N, Nishizono A. Molecular epidemiology of rabies in Vietnam. Microbiol Immunol. 2007;51:833–40.
Article CAS PubMed Google Scholar
Smith J, McElhinney L, Parsons G, Brink N, Doherty T, Agranoff D, Miranda ME, Fooks AR. Case report: rapid ante‐mortem diagnosis of a human case of rabies imported into the UK from the Philippines. J Med Virol. 2003;69:150–5.
Article PubMed Google Scholar
Cuypers L, Vrancken B, Fabeni L, Marascio N, Cento V, Di Maio VC, Aragri M, Pineda-Peña AC, Schrooten Y, Van Laethem K. Implications of hepatitis C virus subtype 1a migration patterns for virus genetic sequencing policies in Italy. BMC Evol Biol. 2017;17:70.
Article PubMed PubMed Central Google Scholar
Yu J, Li H, Tang Q, Rayner S, Han N, Guo Z, Liu H, Adams J, Fang W, Tao X. The spatial and temporal dynamics of rabies in China. PLoS Negl Trop Dis. 2012;6:e1640.
Article PubMed PubMed Central Google Scholar
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Article CAS PubMed PubMed Central Google Scholar
Price MN, Dehal PS, Arkin AP. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490.
Article PubMed PubMed Central Google Scholar
Martin D, Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000;16:562–3.
Article CAS PubMed Google Scholar
Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–66.
Article CAS PubMed PubMed Central Google Scholar
Vrancken B, Rambaut A, Suchard MA, Drummond A, Baele G, Derdelinckx I, Van Wijngaerden E, Vandamme A-M, Van Laethem K, Lemey P. The genealogical population dynamics of HIV-1 in a large transmission chain: Bridging within and among host evolutionary rates. PLoS Comput Biol. 2014;10:e1003505.
Article PubMed PubMed Central Google Scholar
Rambaut A, Lam TT, Carvalho LM, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016;2:vew007.
Article PubMed PubMed Central Google Scholar
Huson DH. SplitsTree: analyzing and visualizing evolutionary data. Bioinformatics. 1998;14:68–73.
Article CAS PubMed Google Scholar
Murray GG, Wang F, Harrison EM, Paterson GK, Mather AE, Harris SR, Holmes MA, Rambaut A, Welch JJ. The effect of genetic structure on molecular dating and tests for temporal signal. Methods Ecol Evol. 2016;7:80–9.
Article PubMed Google Scholar
Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21.
Article CAS PubMed Google Scholar
Gouy M, Guindon S, Gascuel O. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010;27:221–4.
Article CAS PubMed Google Scholar
Al-Qahtani AA, Baele G, Khalaf N, Suchard MA, Al-Anazi MR, Abdo AA, Sanai FM, Al-Ashgar HI, Khan MQ, Al-Ahdal MN. The epidemic dynamics of hepatitis C virus subtypes 4a and 4d in Saudi Arabia. Sci Rep. 2017;7:44947.
Article CAS PubMed PubMed Central Google Scholar
Drummond AJ, Ho SY, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:e88.
Article PubMed PubMed Central Google Scholar
Gill MS, Lemey P, Faria NR, Rambaut A, Shapiro B, Suchard MA. Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol Biol Evol. 2013;30:713–24.
Article CAS PubMed Google Scholar
Suchard MA, Kitchen CM, Sinsheimer JS, Weiss RE. Hierarchical phylogenetic models for analyzing multipartite sequence data. Syst Biol. 2003;52:649–64.
Article PubMed Google Scholar
Trovão NS, Baele G, Vrancken B, Bielejec F, Suchard MA, Fargette D, Lemey P. Host ecology determines the dispersal patterns of a plant virus. Virus Evol. 2015;1:vev016.
Article PubMed PubMed Central Google Scholar
Firth C, Kitchen A, Shapiro B, Suchard MA, Holmes EC, Rambaut A. Using time-structured data to estimate evolutionary rates of double-stranded DNA viruses. Mol Biol Evol. 2010;27:2038–51.
Article CAS PubMed PubMed Central Google Scholar
Edwards CJ, Suchard MA, Lemey P, Welch JJ, Barnes I, Fulton TL, Barnett R, O’Connell TC, Coxon P, Monaghan N. Ancient hybridization and an Irish origin for the modern polar bear matriline. Curr Biol. 2011;21:1251–8.
Article CAS PubMed PubMed Central Google Scholar
Shapiro B, Rambaut A, Drummond AJ. Choosing appropriate substitution models for the phylogenetic analysis of protein-coding sequences. Mol Biol Evol. 2006;23:7–9.
Article CAS PubMed Google Scholar
Bielejec F, Baele G, Vrancken B, Suchard MA, Rambaut A, Lemey P. Spread3: interactive visualization of spatiotemporal history and trait evolutionary processes. Mol Biol Evol. 2016;33:2167–9.
Article CAS PubMed Google Scholar
Minin VN, Suchard MA. Counting labeled transitions in continuous-time Markov models of evolution. J Math Biol. 2008;56:391–412.
Article PubMed Google Scholar
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.
Article CAS PubMed Google Scholar
Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends Ecol Evol. 2000;15:496–503.
Article CAS PubMed Google Scholar
Pond SL, Frost SD, Muse SV. HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005;21:676–9.
Article CAS PubMed Google Scholar
Kass RE, Raftery AE. Bayes factors. J Am Stat Assoc. 1995;90:773–95.
Article Google Scholar

Download references

Acknowledgements

We thank the hundreds of Yunnan Institute of Endemic Diseases Control and Prevention staff and local health workers in Yunnan Province who collected data during 2008–2012. H.T is supported by the National Natural Science Foundation of China (81673234), the Fundamental Research Funds for the Central Universities, and the National Key Research and Development Program of China (2016YFA0600104). B.V. is supported by the Bijzonder Onderzoeksfonds KU Leuven (BOF) (No. OT/14/115). S.D. is supported by the Fonds Wetenschappelijk Onderzoek (FWO, Flanders, Belgium).

Funding

Funding for this study was provided by the National Natural Science Foundation of China (81673234), the Bijzonder Onderzoeksfonds KU Leuven (BOF) (No. OT/14/115), the VIROGENESIS project (receiving funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 634650), the Fundamental Research Funds for the Central Universities, and the National Key Research and Development Program of China (2016YFA0600104). SD is a post-doctoral research fellow funded by the Fonds Wetenschappelijk Onderzoek (FWO, Flanders, Belgium).

Availability of data and materials

All data analyzed for the purposes of this manuscript are included in this article.

Authors’ contributions

HT, SD, and BV performed analyses, YZ, YF, QY, WY, YZ, LD, and HZ conducted surveillance. YZ, YF, WY, YZ, and HZ conducted sequencing. HT, SD, BV, and OP wrote the manuscript. HT and HZ conceived the study. All authors read and approved the final manuscript.

Competing interest

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Yunnan Institute of Endemic Diseases Control and Prevention, Yunnan Provincial Key Laboratory for Zoonosis Control and Prevention, Dali, China
Yuzhen Zhang, Yun Feng, Weihong Yang, Yunzhi Zhang & Hailin Zhang
Department of Microbiology and Immunology, Division of Clinical and Epidemiological Virology, Rega Institute, KU Leuven - University of Leuven, Leuven, Belgium
Bram Vrancken & Simon Dellicour
State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
Qiqi Yang & Huaiyu Tian
Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
Lu Dong
Department of Zoology, University of Oxford, Oxford, UK
Oliver G. Pybus

Authors

Yuzhen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bram Vrancken
View author publications
You can also search for this author in PubMed Google Scholar
Yun Feng
View author publications
You can also search for this author in PubMed Google Scholar
Simon Dellicour
View author publications
You can also search for this author in PubMed Google Scholar
Qiqi Yang
View author publications
You can also search for this author in PubMed Google Scholar
Weihong Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yunzhi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lu Dong
View author publications
You can also search for this author in PubMed Google Scholar
Oliver G. Pybus
View author publications
You can also search for this author in PubMed Google Scholar
Hailin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Huaiyu Tian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hailin Zhang or Huaiyu Tian.

Additional file

Additional file 1:

Figure S1. Raw data of RABV N gene sequences in each location in each year in Southeast Asia. Figure S2. RABV N gene sequences in each area in each year. The color spectrum shows the number of sequences from each year and area, from green (low numbers of sequences) to dark red (high numbers of sequences). The data set contained 452 RABV sequences. Figure S3. Histogram showing the temporal trend of the expected number of RABV introductions from North and South China into Yunnan. Figure S4. Plot showing the number of sequences that cover each position in the RABV genome. The coverage low around position 4945 corresponds to a string of six guanines that is only 5 guanines long in many strains. Table S1. Sequences used in this study. Table S2. Time-annotated (near) full genome sequences used for estimating evolutionary rates. Table S3. The sampling time distribution of the final dataset. Table S4. Subsampled sequences used in phylogeographical analysis. (DOCX 332 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Zhang, Y., Vrancken, B., Feng, Y. et al. Cross-border spread, lineage displacement and evolutionary rate estimation of rabies virus in Yunnan Province, China. Virol J 14, 102 (2017). https://doi.org/10.1186/s12985-017-0769-6

Download citation

Received: 01 February 2017
Accepted: 25 May 2017
Published: 03 June 2017
DOI: https://doi.org/10.1186/s12985-017-0769-6

Cross-border spread, lineage displacement and evolutionary rate estimation of rabies virus in Yunnan Province, China

Abstract

Background

Method

Results

Conclusion

Similar content being viewed by others

Phylodynamic and transmission pattern of rabies virus in China and its neighboring countries

Evolutionary analysis of rabies virus isolates from Guangxi Province of southern China

Sequence analysis of nucleoprotein gene reveals the co-circulation of lineages and sublineages of rabies virus in herbivorous in Rio Grande do Sul state, Brazil

Background

Results

Lineages of the three main Asian clades circulate in Yunnan

Temporal and spatial dynamics of RABVs in China and Southeast Asia

Different sites evolved under diversifying selection in the three Asian RABV lineages

Discussion

Conclusion

Methods

Sample collection

Evolutionary rate estimation for N gene sequences

Phylogeography of the RABV N gene

Analysis of selection pressures

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Authors’ contributions

Competing interest

Consent for publication

Ethics approval and consent to participate

Publisher’s Note

Author information

Authors and Affiliations

Corresponding authors

Additional file

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation