Abstract
During 2015-2017, chikungunya virus (CHIKV) showed a resurgence in several parts of India with Karnataka, Maharashtra and New Delhi accounting for a majority of the cases. E2-E1 gene based characterization revealed Indian subcontinent sublineage strains possessing Aedes aegypti mosquito-adaptive mutations E1: K211E and E2:V264A, with the 211 site positively selected. Novel mutational sites E1: K16E/Q, E1: K132Q/T, E1: S355T, E2: C19R and E2:S185Y could be associated with epitopes or virulence determining domains. The study examines the role of host, vector and viral factors and fills gaps in our molecular epidemiology data for these regions which are known to possess a dynamic population.
Avoid common mistakes on your manuscript.
Since the resurgence of chikungunya virus (CHIKV) in India in 2005, waves of epidemic outbreaks have followed in subsequent years in diverse parts of this country [1,2,3,4]. Chikungunya (CHIK) fever is a mosquito-transmitted acute viral infection, characterized by high fever, arthralgia, myalgia, headache and rash. The virus is transmitted to humans by several species of mosquitoes, with Aedes aegypti and A. albopictus being the two main vectors in urban settings. CHIKV is a positive-stranded RNA virus with a genome approximately 11.8 kb in size. It consists of a single stranded, positive sense RNA genome with two open reading frames (ORFs). The ORF at the 5’ end encodes four nonstructural proteins (nsP1 - nsP4) while the ORF at the 3’ end encodes the structural proteins (capsid C, envelope glycoproteins E1 and E2 and three small cleavage products E3, 6K and the transframe protein TF).
E1 gene sequences are known to possess adequate information for phylogenetic classification [5] while E2 is known to be immunodominant within the structural polyprotein [6]. Based on this E1 and E2 gene sequences are generally used for phylogeny and related analyses [3, 7]. Phylogenetic analysis classifies all the CHIKV strains into three distinct genotypes [5] with the resurgent epidemic isolates from 2005 onwards falling into the Indian Ocean lineage (IOL) of the East, Central and South African (ECSA) genotype [8,9,10]. Further genetic characterization based on the envelope gene sequences of CHIKV isolates from Indian epidemics between 2009 and 2011 in the southern states of Tamil Nadu and Andhra Pradesh [4] and the eastern state of Odisha [3], revealed strain variants and evidence of adaptive evolution of the virus to both the vector and the host. During 2015, the southern state of Karnataka accounted for around 75% of the 27,553 suspected cases of CHIKV infection in the country. However, during 2016, Karnataka, Maharashtra, and New Delhi together accounted for nearly 55.4% of the 64,057 suspected cases. In 2017, Karnataka and Maharashtra, together accounted for nearly 63.1% of the 62,268 suspected cases [11]. The present study was undertaken to understand the genetic nature of the currently circulating CHIKV in the following cities: Pune and Nasik (Maharashtra), Bengaluru (Karnataka) and New Delhi - analysis based on the E1 and E2 genes. To further understand adaptive evolution of the ECSA genotype and IOL, selection pressure studies were undertaken followed by functional analyses.
Virus isolates stored in the repository of National Institute of Virology, Pune India were used for the study. These virus isolates were obtained from samples of patients infected with CHIKV during the 2015-2017 outbreaks in Bengaluru, Pune, Nasik and New Delhi as described previously [12]. Virus isolates (passage 2) obtained using the mosquito cell line C6/36 were used for RNA extraction. The genomic viral RNA was extracted from 11 representative isolates, Bengaluru (n = 5), Pune (n = 3), Nasik (n = 2), and New Delhi (n = 1), by QIAamp viral RNA mini kit (QIAGEN, Germany) according to the manufacturer’s instructions. The viral RNA was reverse transcribed using random hexamer and AMV reverse transcriptase (Promega Corporation, Madison, WI, USA) at 42 °C for 1 h. The E1/ E2 gene was amplified using 10.0 µl cDNA, Taq polymerase (Invitrogen) and primers as described previously [1]. PCR products were gel-purified using a QIAquick gel extraction kit (Qiagen) and sequenced using the BigDye Terminator cycle sequencing ready reaction kit (Applied Biosystems) on an automatic sequencer (ABI PRISM Genetic Analyzer 3100; Applied Biosystems). The E2-E1 gene region of these representative isolates from Pune, Nasik, Bengaluru and New Delhi were then sequenced (Suppl. Table 1). Overlapping molecular sequences (nucleotide and deduced amino acids) generated by the forward and the reverse primers in this study were manually processed using MEGA v.7.0.
An initial neighbour-joining (NJ) tree using MEGA v6 was used to identify the IOL from a starting dataset of all E2-E1 sequences (n = 863, available in GenBank as of March 2018) and reducing the redundancy on the basis of country and year of isolation, selecting one sequence per year for each country. The nucleotide substitution rate per site and the time of most recent common ancestor (tMRCA), were further estimated from time-stamped sequences of the IOL (n = 233) using the Bayesian Markov chain Monte Carlo (MCMC) method in the BEAST v2.0.2 software package [13]. The best fit nucleotide substitution model determined using Akaike Information Criterion (AIC) as implemented in MODELTEST 3.7 [14], was found to be GTR + G + I model (general time-reversible model with gamma-distributed rates of variation among sites and a proportion of invariable sites). The dataset was then analyzed assuming a relaxed (uncorrelated lognormal) molecular clock and using Bayesian skyline demographic models as coalescent prior. MCMC chain length was chosen to reach effective sample size > 200. The results obtained from MCMC analysis were assessed using Tracer v1.5 and the maximum clade credibility (MCC) tree was inferred using TreeAnnotator v2.0.2. The MCC tree was visualized using FigTree v1.4.0 software.
Selection pressure analysis of the E2-E1 region of the IOL and the ECSA genotype was performed in the Datamonkey server [15] using the single likelihood ancestor counting (SLAC), fixed effects likelihood (FEL), internal fixed effects likelihood (IFEL), and mixed effects model of evolution (MEME) methods. Amino acid sites were identified to be under positive selection pressure based on the statistical significance level (P ≤ 0.1 for SLAC, FEL, IFEL and MEME) by at least two of the methods. Mapping of the positively selected sites and major unique mutational sites of the new strains was done on the X-ray crystallographic structure of the E1-E2 heterodimer (3N41.PDB) available in the Protein Data Bank. To co-relate the mutational sites to known B-cell epitopes, the Immune Epitope Database (IEDB) and Analysis resource [16] was used for obtaining the experimentally known epitopes in the CHIKV structural polyprotein. Association of the mutational sites with computationally predicted CTL epitopes on CHIKV E2 and E1 proteins for important HLA-1 alleles [17] was also carried out.
The MCC tree based on the 11 E2-E1 gene sequences of Indian isolates from 2015-17 representing the four affected areas in the states of Karnataka (Bengaluru, 2015), Maharashtra (Nasik, 2015; Pune, 2015, 2016 and 2017) and New Delhi, 2016 as well as other representative strains is depicted in Fig 1. The rate of nucleotide substitution was estimated to be between 7.55 x 10−4 and 1.02 x 10−3 substitutions/site/year with the mean root age of the IOL being around 2003 (Suppl. Fig. 1). The posterior support for the Indian ocean sublineage comprising strains from Reunion, Comoros, Seychelles, Mauritius was found to be 0.93 (node a) while all the CHIKV isolates in this study belonged to the currently circulating Indian subcontinent sublineage (node b, posterior support 0.89). The 2015-2017 study isolates from different places, except a few Bengaluru 2015 isolates, were found to have ancestral origins between 2013 and mid-2014. Unique mutations, C19R, Y69I, S185Y, N219I, I377V and P409L in the E2 gene and K16E/Q, M27L, V28C, E50Q, D70L, K132Q/T, K181Q, E209K, K319N, S332T, N349K, S355T and S357L in the E1 gene were observed (Suppl. Table 2).
Selection pressure analysis (Table 1) of the ECSA genotype based on 214 unique sequences, revealed codon sites at position 110S/I/T, 164T/A, 375S/T/I of the E2 gene and 211K/E/N/R of the E1 gene showed evidence of being positively selected. The IOL analysis based on 194 unique sequences showed that E2: 69Y/H/I, 252K/Q/H and E1:211E/K/N/R were under positive selection. The unique mutational sites as well as the selection pressure sites identified in E1/E2 were mapped to known functional sites including the experimentally known B-Cell epitopes available from the IEDB (Suppl. Table 3) and the predicted MHC-1 restricted T-cell epitopes.
In this study the E2-E1 region of representative isolates (n = 11) from the four different areas was analyzed. The phylogenetic analysis (Fig. 1, Suppl. Fig. 1) revealed that the Indian subcontinent sublineage that is reported to have widely spread to Southeast Asia and Europe was found to show multiple subgroups. The estimates for the nucleotide substitution rate and the root age estimated in the study is almost comparable to a recent report [18]. Indian strains (Suppl. Table 4, Suppl. Fig. 1) from the State of Kerala (2007-2013) and West Bengal (2007-12) along with strains from Bangladesh and several South East Asian countries (including China, Myanmar, Malaysia, Thailand, Indonesia, Singapore, Cambodia), France and Australia were found to possess the A226V mutation that was associated with in-creased CHIKV transmission in Aedes albopictus mosquitoes [19, 20]. The 2015-17 study isolates were found to be close to strains identified in 2009-2012 from the states of Tamil Nadu, West Bengal, Kerala and Andhra Pradesh. Similar strains were noted from China, Japan, Pakistan, Bangladesh, France, Italy and American Somoa. An Australian strain of an imported CHIK case (unpublished), showed close identity with the Delhi 2016 strain. Overall, the molecular clock analysis revealed several independent evolutions of CHIKV within India. Diverse gene variants were specifically noted in the Bengaluru 2015 viruses (Suppl. Table 2) that may explain the major circulation of the CHIKV in Karnataka, in comparison to the other States of India during 2015 [11].
Within the E1 gene, none of the 2015-17 isolates possessed the A226V mutation. Also, none of the reported A226V-correlated secondary mutations (K233E, R198Q and K252Q) [21], of which K252Q was proven to affect dissemination in Ae. Albopictus [22], were found in our isolates. Notably, recent studies [21, 23], have demonstrated that two adaptive mutations, E1: K211E and E2:V264A, noted since 2010 [4, 24, 25] in the background of E1:226A provide remarkably higher fitness in terms of a significant increase in virus infectivity, dissemination and transmission in Ae. aegypti. The majority of the 2015-17study strains possessed E1: K211E and E2:V264A, implying that these mutations have been established in the CHIKV population in states such as Maharashtra, Tamil Nadu and Andhra Pradesh. Partial sequencing of earlier strains from Pune in 2010, revealed the existence of these mutations at an earlier time point (data not shown). Among the sequences used for the phylogenetic analysis, the strain Yemen 2011 and one Bengaluru 2015 strain (KA_Blore5310) were found to possess E2:V264A but not E1:K211E. On the contrary, a single strain HM159390 possessed E1:K211E and E2:V264. Another Bengaluru 2015 strain (KA_Blore5813), possessed neither E2:V264A or E1:K211E mutations.
Further, sequence analysis of the study strains showed the emergence of novel mutations (K16E/Q, K132Q/T, and S355T) in E1 being observed in multiple strains as well as Bengaluru isolates possessing several other mutations. Residue position 132 has been reported as a neutralizing antibody escape mutant in Sindbis virus [26] (Fig. 2) while residue 355 falls in the C-terminal domain III, which is known to contain virulence determinants of envelope proteins [27]. However, the significance of mutations at these positions needs experimental validation. None of the mutational sites could be associated with the transitional epitopes defined by residue positions 300, 361 and 381 [28]. Mutational sites 16 and 355 in E1 could be mapped on to potential T-cell epitopes presented by major HLA alleles HLA-B7 and HLA-B15 [17]. Site 211, that was found to be under selection pressure in earlier reports based on the ECSA genotype [2, 4], was notably positively selected, with regards to IOL, as well.
Within the E2 gene the unique mutation at position 185 was found to be associated with an experimentally known epitope (res. 186-200) in the IEDB (Suppl. Table 3) which is known to be on the exposed spike portion of the E2 protein (Fig. 2). The mutational site 19 falls within another known epitope (res. 1-19). Within E2, amino acid site 252 was identified to be positively selected in the IOL, as observed in an earlier study [2]. This residue is found to be located within the acid sensitive region that plays a role during the E2-E1 conformational changes occurring during viral entry [27]. The other positively selected site in IOL, at position 69, falls in the region E2:52-82 at the top of the spike in domain A [27] that is exposed and reported to be the point of contact for cellular receptors [29]. Further, site 164, which was seen to be under positive selection pressure, falls within a known epitopic region (res. 162-180) (Suppl. Table 3).
To summarize, the present study aimed to improve our understanding of CHIKV strains circulating in Karnataka, Maharashtra and New Delhi during 2015-17 in the context of their genetic evolution in different parts of the country. The results revealed the circulation of strains possessing Aedes aegypti mosquito-adaptive marker mutations E1: K211E and E2:V264A, in the absence of E1:A226V. Among the unique mutational sites in the isolates studied, K16E/Q, K132Q/T, S355T in E1 and C19R, S185Y in E2, could be associated with epitopes or virulence determining domains. The resurgence of CHIKV outbreaks in these regions may be attributed to indigenous evolution rather than importations. CHIKV outbreaks during the period of this study were notably restricted to mainly urban localities. Moreover, of the four areas investigated here, at least three (New Delhi, Pune and Bengaluru) are known for their dynamic populations, mainly due to rapid growth in their industrial sectors. This may have a significant role to play in influencing the herd immunity of the population, vector biology and virus-host interactions thus shaping the evolutionary dynamics of the virus. The occurrence of mutations in known B-cell epitopes underlines the need for studying cross-neutralization of virus isolates. The persistence of CHIKV and regular occurrences of outbreaks in India, highlight the importance of continuously monitoring the circulating CHIKV for changes in the virus population.
References
Arankalle VA, Shrivastava S, Cherian S, Gunjikar RS, Walimbe AM et al (2007) Genetic divergence of chikungunya viruses in India (1963–2006) with special reference to the 2005–2006 explosive epidemic. J Gen Virol 88:1967–1976
Abraham AR, Manakkadana A, Mudaliar P, Joseph I, Sivakumar KC et al (2016) Correlation of phylogenetic clade diversification and in vitro infectivity differences among Cosmopolitan genotype strains of chikungunya virus. Infect Genet Evol 37:174–184
Sahu A, Das B, Das M, Patra A, Biswal S et al (2013) Genetic characterization of E2 region of chikungunya virus circulating in Odisha, Eastern India from 2010 to 2011. Infect Genet Evol 18:113–124
Sumathy K, Ella KM (2012) Genetic diversity of chikungunya virus, India 2006–2010: evolutionary dynamics and serotype analyses. J Med Virol 84:462–470
Powers AM, Brault AC, Tesh RB, Weaver SC (2000) Re-emergence of chikungunya and O’nyong-nyong viruses: evidence for distinct geographical lineages and distant evolutionary relationships. J Gen Virol 81:471–479
Kam YW, Lee WW, Simarmata D, Le Grand R, Tolou H et al (2014) Unique epitopes recognized by antibodies induced in chikungunya virus-infected non-human primates: implications for the study of immunopathology and vaccine development. PLoS One 9:e95647
Schuffenecker I, Iteman I, Michault A, Murri S, Frangeul L et al (2006) Genome microevolution of chikungunya viruses causing the Indian Ocean outbreak. PLoS Med 3:1058–1070
Cherian SS, Walimbe AM, Jadhav SM, Gandhe SS, Hundekar SL et al (2009) Evolutionary rates and timescale comparison of chikungunya viruses inferred from the whole genome/E1 gene with special reference to the 2005–07 outbreak in the Indian subcontinent. Infect Genet Evol 9:16–23
Naresh Kumar CVM, Johnson AMA, Sai Gopal DVR (2007) Molecular characterization of chikungunya virus from Andhra Pradesh, India and phylogenetic relationship with Central African isolates. Indian J Med Res 126:534–540
Ray P, Ratagiri VH, Kabra SK, Lodha R, Sharma S et al (2012) Chikungunya infection in India: results of a prospective hospital based multi-centric study. PLoS One 7:e30025
National Vector Borne Disease Control Programme. 2010–2018. Chikungunya situation in India. http://www.nvbdcp.gov.in/chik-cd.html. Accessed 11 May 2018
Sudeep AB, Bondre VP, Gurav YK, Gokhale MD, Sapkal GN et al (2014) Isolation of Chandipura virus (Vesiculovirus: Rhabdoviridae) from Sergentomyia species of sandflies from Nagpur, Maharashtra, India. Indian J Med Res 139:769–772
Drummond AJ, Rambaut A (2007) BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol 7:214
Posada D, Crandall KA (1998) MODELTEST: testing the model of DNA substitution. Bioinformatics 14:817–818
Kosakovsky Pond SL, Frost SD (2005) Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol 22:1208–1222
Peters B, Sidney J, Bourne P, Bui HH, Buus S et al (2005) The immune epitope database and analysis resource: from vision to blueprint. PLoS Biol 3:e91
Pratheek BM, Suryawanshi AR, Chattopadhyay S, Chattopadhyay S (2015) In silico analysis of MHC-I restricted epitopes of chikungunya virus 4 proteins: Implication in understanding anti-CHIKV CD8(+) T cell response and advancement of epitope based immunotherapy for CHIKV infection. Infect Genet Evol 31:118–126
Chen R, Puri V, Fedorova N, Lin D et al (2016) Comprehensive genome scale phylogenetic study provides new insights on the global expansion of chikungunya virus. J Virol 90:10600–10611
Vazeille M, Moutailler S, Coudrier D, Rousseaux C, Khun H et al (2007) Two chikungunya isolates from the outbreak of La Reunion (Indian Ocean) exhibit different patterns of infection in the mosquito, Aedes albopictus. PLoS One 14:e1168
Tsetsarkin KA, Vanlandingham DL, McGee CE, Higgs S (2007) A single mutation in chikungunya virus affects vector specificity and epidemic potential. PLoS Pathog 3:e201
Agarwal A, Sharma AK, Sukumaran D, Parida M, Dash PK (2016) Two novel epistatic mutations (E1:K211E and E2:V264A) in structural proteins of chikungunya virus enhance fitness in Aedes aegypti. Virology 497:59–68
Tsetsarkin KA, Chen R, Yun R, Rossi SL, Plante KS et al (2014) Multi-peaked adaptive landscape for chikungunya virus evolution predicts continued fitness optimization in Aedes albopictus mosquitoes. Nat Commun 5:4084
Arias-Goeta C, Mousson L, Rougeon F, Failloux AB (2013) Dissemination and transmission of the E1–226V variant of chikungunya virus in Aedes albopictus are controlled at the midgut barrier level. PLoS One 8:e57548
Shrinet J, Jain S, Sharma A, Singh SS, Mathur K et al (2012) Genetic characterization of chikungunya virus from New Delhi reveal emergence of a new molecular signature in Indian isolates. Virol J 9:100
Taraphdar D, Chatterjee S (2015) Molecular characterization of chikungunya virus circulating in urban and rural areas of West Bengal, India after its re-emergence in 2006. Trans R Soc Trop Med Hyg 109:197–202
Strauss EG, Stec DS, Schmaljohn AL, Strauss JH (1991) Identification of antigenically important domains in the glycoproteins of Sindbis virus by analysis of antibody escape variants. J Virol 65:4654–4664
Voss JE, Vaney MC, Duquerroy S, Vonrhein C, Girard-Blanc C et al (2010) Glycoprotein organization of chikungunya virus particles revealed by X-ray crystallography. Nature 468:709–714
Meyer WJ, Gidwitz S, Ayers VK, Schoepp RJ, Johnston RE (1992) Conformational alteration of Sindbis virion glycoproteins induced by heat, reducing agents, or low pH. J Virol 66:3504–3513
Li L, Jose J, Xiang Y, Kuhn RJ, Rossmann MG (2010) Structural changes of envelope proteins during alphavirus fusion. Nature 468:705–708
Acknowledgements
Authors are thankful to Dr. S.N. Kalashetty, Pune; National Institute of Malaria Research, New Delhi, Nasik Municipal Corporation for referring human clinical samples for the investigation of chikungunya virus etiology. Authors are grateful to Mr. Atul Walimbe for useful input in the Molecular clock studies.
Funding
This study received institutional funding from National Institute of Virology, Pune.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest.
Ethical approval
The research described in the paper does not use any human or animal subjects.
Additional information
Handling Editor: Diego G. Diel.
Electronic supplementary material
Below is the link to the electronic supplementary material.
705_2018_3974_MOESM1_ESM.tif
Supplementary material 1 (TIFF 3756 kb) Suppl. Fig. 1. Phylogenetic tree (Indian Ocean Lineage) based on E2-E1 gene region sequences (n=233) constructed by the Bayesian Markov Chain Monte Carlo (MCMC) method. Posterior supports for all nodes are indicated. The mean root age of IOL and the tMRCAs of the Indian ocean and Indian subcontinent sublineage are also shown. Branch color codes are: Red (Indian isolates of this study); Blue (other Indian isolates); Black (global isolates). Groups highlighted in pink possess E1:A226V mutation while the group in blue possesses E2:V264A and E1:K211E
705_2018_3974_MOESM2_ESM.doc
Supplementary material 2 (DOC 53 kb) Supplementary Table 1: Details of the Chikungunya virus isolates used in this study along with their GenBank accession numbers
705_2018_3974_MOESM3_ESM.doc
Supplementary material 3 (DOC 62 kb) Supplementary Table 2: Novel mutations in the CHIKV E2-E1 genes reported in this study
705_2018_3974_MOESM4_ESM.doc
Supplementary material 4 (DOC 41 kb) Supplementary Table 3: Experimentally known epitopes in the CHIKV structural polyprotein available in Immune Epitope Database (IEDB) and Analysis resource
705_2018_3974_MOESM5_ESM.doc
Supplementary material 5 (DOC 78 kb) Supplementary Table 4: List of publicly available Indian sequences used in sequence analysis
Rights and permissions
About this article
Cite this article
Patil, J., More, A., Patil, P. et al. Genetic characterization of chikungunya viruses isolated during the 2015-2017 outbreaks in different states of India, based on their E1 and E2 genes. Arch Virol 163, 3135–3140 (2018). https://doi.org/10.1007/s00705-018-3974-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00705-018-3974-8