What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach

Martinez-Hernandez, Fernando; Jimenez-Gonzalez, Diego Emiliano; Martinez-Flores, Arony; Villalobos-Castillejos, Guiehdani; Vaughan, Gilberto; Kawa-Karasik, Simon; Flisser, Ana; Maravilla, Pablo; Romero-Valdovinos, Mirza

doi:10.1186/1743-422X-7-196

What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach

Short report
Open access
Published: 20 August 2010

Volume 7, article number 196, (2010)
Cite this article

Download PDF

You have full access to this open access article

Virology Journal Aims and scope Submit manuscript

What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach

Download PDF

Fernando Martinez-Hernandez¹,
Diego Emiliano Jimenez-Gonzalez¹,
Arony Martinez-Flores¹,
Guiehdani Villalobos-Castillejos²,
Gilberto Vaughan³,
Simon Kawa-Karasik¹,
Ana Flisser⁴,
Pablo Maravilla¹ &
…
Mirza Romero-Valdovinos¹

8166 Accesses
9 Citations
Explore all metrics

Abstract

Viral population evolution dynamics of influenza A is crucial for surveillance and control. In this paper we analyzed viral genetic features during the recent pandemic caused by the new influenza human virus A H1N1, using a conventional population genetics approach based on 4689 hemagglutinin (HA) and neuraminidase (NA) sequences available in GenBank submitted between March and December of 2009. This analysis showed several relevant aspects: a) a scarce initial genetic variability within the viral isolates from some countries that increased along 2009 when influenza was dispersed around the world; b) a worldwide virus polarized behavior identified when comparing paired countries, low differentiation and high gene flow were found in some pairs and high differentiation and moderate or scarce gene flow in others, independently of their geographical closeness, c) lack of positive selection in HA and NA due to increase of the population size of virus variants, d) HA and NA variants spread in a few months all over the world being identified in the same countries in different months along 2009, and e) containment of viral variants in Mexico at the beginning of the outbreak, probably due to the control measures applied by the government.

Findings

In April 2009 the Mexican Secretariat of Health reported an outbreak of respiratory disease. A new human influenza virus A H1N1 with molecular features of North American and Eurasian swine, avian, and human influenza viruses was identified [1]. In the same month, the World Health Organization (WHO) classified the global spread of this virus as a public health event of international concern. After documentation of human to human transmission of the virus in at least two WHO regions, the highest pandemic level was declared [2]. As a result of the epidemiological surveillance, large amounts of A H1N1 genetic sequences were accumulated in the GenBank and several molecular epidemiological studies monitoring evolutionary inferences of viral gene flow in time and space were reported [3–6]. In December 2009, A H1N1 was worldwide spread, affecting 208 countries, with at least 12,220 deaths [7]. Thus, more sequences were reported but no overall population genetics studies were performed, and also no comparison of the initial and the viral variants (VV) has been reported. The goal of the present study is to provide an overview with a phylogeographic behavior during the initial spread and subsequent worldwide establishment of influenza pandemic.

Analysis of genetic diversity within and between populations were calculated using DnaSP v4 [8–10] and included nucleotide diversity (π), haplotype polymorphism (θ), genetic differentiation index (G_ST), coancestry coefficient (F_ST) and migration (Nm). These indexes refer to: π, average proportion of nucleotide differences between all possible pairs of sequences in the sample; θ, proportion of nucleotide sites that are expected to be polymorphic in any suitable sample from this region of the genome. Both indexes are used to assess polymorphisms at the DNA level and monitor diversity within or between ecological populations, and examine the genetic variation in related species or their evolutionary relationships [9]. F_ST and G_ST are two equivalent genetic statistics used to measure differentiation between or among populations; F_ST is used when there are only two alleles at a locus, and G_ST with multiple alleles; common used values for genetic differentiation are: 0 to 0.5 small; 0.05 to 0.15 moderate; 0.15 to 0.25, great, and values above 0.25 indicate huge genetic differentiation, while negative values are due to small sample size [8] and thus, when found, zero value was assigned [11, 12]. The gene flow or migration index (Nm) refers to movement of organisms among subpopulations, those strongly differentiated have a Nm < < 1, while Nm > 4 behave as a single panmictic unit [9].

The previously described genetic diversity analyses were performed with A H1N1 Influenza Database [13] with sequences submitted between April and December 2009 (collection dates and sequence origin are found in addition file 1), including three or more sequences per country of 500 continuous base pairs (bp), recorded during the initial four months of the pandemics and, for the global analysis, those having at least 750 continuous bp were used. Multiple alignments were performed by CLUSTAL W program v1.8 [14] and adjusted using MEGA program v4 [15, 16]. A median joining method for constructing networks from recombination-free population data, featuring Kruskal's algorithm for finding minimum spanning trees [17] was used with the program Network 4v.5.1.6 [18].

Up to 3462 sequences (1779 of HA and 1683 of NA) with 2208 VV (1216 of HA and 992 of NA) from 31 countries were used, interestingly 80% were recorded between April and July (Figure 1). Figure 2 shows the number of sequences analyzed (first row), θ values (second row) and π values (third row) for the analysis performed of the sequences obtained in the initial four months (left column) or of the global analysis (right column). As it can be seen few countries provided most variants. Theta and Pi showed a similar high trend in around 50% of the countries in the analysis of the initial four months (average π = 0.0025 for HA and π = 0.0016 for NA)). In contrast, the overall analysis shows that polymorphism increased in all the countries (π = 0.0125 for HA and π = 0.0153 for NA), with higher levels for USA, Russia, Thailand, Philippines and Spain.

Genetic population indexes were compared in the countries with most sequences reported (USA, Spain, Japan, Mexico and China). Figure 3 shows, in five plots, the data of these countries paired against all those countries with HA and NA reported during the initial four months of the pandemic. For example in USA it can be seen that genetic differentiation parameters (F_ST and G_ST) were high when this country was paired with Mexico, France, Greece or New Zealand (seen as full or empty dots or triangles), while the values of genetic flow (Nm) were higher when USA was compared to Chile, Germany, Russia, China, Philippines or Australia (seen as shadowed areas or star peaks). Following the same explanation for the other four countries, it can be seen that some showed high or low degree of differentiation for F_ST and G_ST but opposed for Nm. Thus, the highest flow is seen in USA followed by Japan, China and Spain, and the lowest was found in Mexico. Interestingly, in the image obtained when samples from April-December were used, a different pattern can be seen: USA shows a moderate flow with all countries used for comparison; while Mexico is the country with the highest differentiation. The in-between countries are Japan, China, Spain and Singapore; the latter country appears in figure 4 but not in 3 because there are no data reported for the early months. Additional file 2 includes all data obtained for F_ST, G_ST and Nm. Negative values for F_ST and G_ST indicate no differentiation; in some cases NA showed lower F_ST values that those of HA with a similar trend. Tajima's D provided negative values: -2.619 and -2.380 in the initial four months and -1.802 and -2.358 in the overall analysis, for HA and NA, respectively, indicating arousal of new polymorphisms as a consequence of population size expansion along 2009 [9].

Figure 5 shows the widespread distribution of the main HA and NA VV around the world and along the time; for example, VV57NA was identified in USA and Mexico in April; one month later it was also present in Brazil, France, Poland, Finland, China and Taiwan; in June in Chile, Greece and Japan; and in July also in Italy and Myanmar (see also additional file TS2).

Figures 6 and 7 show the networks obtained for HA and NA during the first and the last four months (A and B respectively), with the Median Joining method that estimates genealogic relationships. Figure 6A shows three major dispersion centers for HA: one that clustered variants from USA and Asia, a second one that grouped VV mainly from USA, Mexico and China and the third with several Spanish variants. Using NA sequences (Figure 7A) two principal dispersion centers were identified: one clustering mainly VV form USA and another one that grouped VV form USA, Mexico and China; similarly to HA, several Spanish VV were dispersed. Networks obtained between July and December showed only one dispersion center, with several VV from Mexico, China and Singapore in the HA tree, as seen in figure 6B and numerous separated Spanish VV in the NA tree (Figure 7B).

Our study shows that a high viral diversity during the 2009 pandemic took place, as compared, for example, to a study of HA performed in 1999-2000 with samples from French infected patients with A/H3N2, which showed an average of π = 0.0034 [19] which is 440 times lower that the one found in our study (π~0.012 for HA), suggesting that the variability of a pandemic virus is higher than that of an epidemic virus. Negative values of Tajima's D for HA and NA imply that no selection force is yet influencing the success of the pandemic virus. Some studies show different extent of changes: a study with 423 complete genomes of human H3N2 influenza A virus collected between 1997 and 2005 in New York, USA, revealed that adaptive evolution occurred only sporadically, rather, a stochastic process of viral migration and clade reassortment played a vital role in shaping short-term evolutionary dynamics [20]. Another study analyzed 357 nucleotide sequences for HA from A H1N1 and found some codons under positive selection, suggesting that these changes may have predictive value for future epidemic variants [21]. Therefore, precaution should be taken because A H1N1 may peak again, since our data show that the variants are still in expansion. Network analysis showed that the major dispersion center was shared by China, Mexico and USA during the initial four months, and probably reflect the fact that there was a greater interest in the scientific community for submitting and reporting viral sequences in GenBank. Also, HA was more variable than NA, which is in accordance with the statement that the HA gene exhibits a rapid mutation rate [22].

When integrating data of F_ST, G_ST and Nm of this new A H1N1 it was observed that the virus had different behaviors along 2009 when comparing paired countries; which was, in general, independent of their geographical proximity. The extremes were found in USA and Mexico; the former showed a high distribution of virus variants to and from several countries in the initial four months of the pandemic, becoming a worldwide dispersion towards the end of the year, while in Mexico minimal influx of variants was seen in the initial four months. This was probably due to the governmental actions taken in April to contain the influenza outbreak in the whole Mexican Republic [23] or to the exclusion of small sequences for the analyses performed. Also, some countries decided to close their borders or send travel alerts recommending their citizens to avoid nonessential travel to Mexico [stated in 2009 in 24]. At the beginning of the pandemic, federal and local health authorities in Mexico established several measures, mainly focused in two lines 1) social spacing that included closing temporally churches, schools, restaurants, cinemas, theaters and other sites of massive human concentration, 2) intensive hygiene campaign that publicized basic aspects of health such as continuous hand washing, avoiding unprotected sneezing, using disposable surgical masks and surveillance of symptoms associated to flu.

Abbreviations

F_ST:: coancestry coefficient statistics
G_ST:: genetic differentiation index
HA:: hemagglutinin
NA:: neuraminidase
NN:: migration index
VV:: viral variants
VV57NA:: viral variant 57 of neuraminidase
WHO:: World Health Organization
π:: nucleotide diversity
θ:: haplotype polymorphism.

References

Perez-Padilla R, de la Rosa-Zamboni D, Ponce de Leon S, Hernandez M, Quiñones-Falconi F, Bautista E, Ramirez-Venegas A, Rojas-Serrano J, Ormsby CE, Corrales A, Higuera A, Mondragon E, Cordova-Villalobos JA, INER Working Group on Influenza: Pneumonia and respiratory failure from swine-origin influenza A (H1N1) in Mexico. N Engl J Med 2009, 361: 680-689. 10.1056/NEJMoa0904252
Article PubMed CAS Google Scholar
WHO influenza update page[http://www.who.int/csr/don/2009_05_04a/en/index.html]
Bansal S, Pourbohloul B, Grenfell B, Meyers LA: The shifting demographic landscape of influenza. PLoS Curr Influenza 2009, 1: RRN1047. 10.1371/currents.RRN1047
Article Google Scholar
Lemey P, Suchard M, Rambaut A: Reconstructing the initial global spread of a human influenza pandemic: a Bayesian spatial-temporal model for the global spread of H1N1pdm. PLoS Curr Influenza 2009, 2: RRN1031. 10.1371/currents.RRN1031
Google Scholar
Nelson M, Spiro D, Wentworth D, Beck E, Fan J, Ghedin E, Halpin R, Bera J, Hine E, Proudfoot K, Stockwell T, Lin X, Griesemer S, Kumar S, Bose M, Viboud C, Holmes E, Henrickson K: The early diversification of influenza A/H1N1pdm. PLoS Curr Influenza 2009, 3: RRN1126. 10.1371/currents.RRN1126
Google Scholar
Rambaut A, Holmes E: The early molecular epidemiology of the swine-origin A/H1N1 human influenza pandemic. PLoS Curr Influenza 2009, 18: RRN1003. 10.1371/currents.RRN1003
Google Scholar
WHO Pandemic (H1N1) 2009 - update 81[http://www.who.int/csr/don/2009_12_30/en/index.html]
Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 2003, 19: 2496-21497. 10.1093/bioinformatics/btg359
Article PubMed CAS Google Scholar
Hartl DL, Clark AG: Principles of Population Genetics. 3rd edition. Sinauer Associates, Inc. Publishers, Sunderland, Massachusetts; 1997.
Google Scholar
Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population structure. Evolution 1984, 38: 1358-1370. 10.2307/2408641
Article Google Scholar
Kullo IJ, Ding K: Patterns of population differentiation of candidate genes for cardiovascular disease. BMC Genet 2007, 8: 48. 10.1186/1471-2156-8-48
Article PubMed PubMed Central Google Scholar
Martinez-Hernandez F, Jimenez-Gonzalez DE, Chenillo P, Alonso-Fernandez C, Maravilla P, Flisser A: Geographical widespread of two lineages of Taenia solium due to human migrations: Can population genetic analysis strengthen this hypothesis? Infect Genet Evol 2009, 9: 1108-1114. 10.1016/j.meegid.2009.09.005
Article PubMed Google Scholar
A H1N1 Influenza Database[http://www.ncbi.nlm.nih.gov/Genbank/index.html]
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22: 4673-4680. 10.1093/nar/22.22.4673
Article PubMed CAS PubMed Central Google Scholar
Bandelt J, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 1999, 16: 37-48.
Article PubMed CAS Google Scholar
Kimura M: A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 1980, 16: 111-120. 10.1007/BF01731581
Article PubMed CAS Google Scholar
Kumar S, Tamura K, Nei M: MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief Bioinform 2004, 5: 150-163. 10.1093/bib/5.2.150
Article PubMed CAS Google Scholar
Fluxus-engieneering, expertise in software for genetics and engineering[http://www.fluxus-engineering.com/sharenet.htm]
Lavenu A, Leruez-Ville M, Chaix ML, Boelle PY, Rogez S, Freymuth F, Hay A, Rouzioux C, Carrat F: Detailed analysis of the genetic evolution of influenza virus during the course of an epidemic. Epidemiol Infect 2006, 134: 514-520. 10.1017/S0950268805005686
Article PubMed CAS PubMed Central Google Scholar
Nelson MI, Simonsen L, Viboud C, Miller MA, Taylor J, George KS, Griesemer SB, Ghedin E, Sengamalay NA, Spiro DJ, Volkov I, Grenfell BT, Lipman DJ, Taubenberger JK, Holmes EC: Stochastic processes are key determinants of short-term evolution in influenza a virus. PLoS Pathog 2006, 2: e125. 10.1371/journal.ppat.0020125
Article PubMed PubMed Central Google Scholar
Bush RM, Fitch WM, Bender CA, Cox NJ: Positive selection on the H3 hemagglutinin gene of human influenza virus A. Mol Biol Evol 1999, 16: 1457-1465.
Article PubMed CAS Google Scholar
Fitch WM, Leiter JM, Li XQ, Palese P: 1991. Positive Darwinian evolution in human influenza A viruses. Proc Natl Acad Sci USA 1991, 88: 4270-4274. 10.1073/pnas.88.10.4270
Article PubMed CAS PubMed Central Google Scholar
Mexico Health secretariat page[http://portal.salud.gob.mx/contenidos/noticias/influenza/lineamientos.html]
A service of the bureau of consular affairs U.S. Department of State[http://www.travel.state.gov/travel]

Download references

Acknowledgements

This work was supported by Grants PICDSI09-228 and PICDSI09

Author information

Authors and Affiliations

Departamento de Ecología de Agentes Patogenos, Hospital General "Dr. Manuel Gea Gonzalez", 4800, Calzada de Tlalpan, Mexico, DF, 14080, USA
Fernando Martinez-Hernandez, Diego Emiliano Jimenez-Gonzalez, Arony Martinez-Flores, Simon Kawa-Karasik, Pablo Maravilla & Mirza Romero-Valdovinos
Departamanto de Parasitologia Escuela Nacional de Ciencias Biologicas, Prolongación Carpio s/n Instituto Politecnico Nacional, DF, 11340, Mexico
Guiehdani Villalobos-Castillejos
Departamento de Investigaciones Inmunologicas, Instituto de Diagnostico y Referencia Epidemiologicos, 470 SSA, Carpio, DF, 11340, Mexico
Gilberto Vaughan
Departamento de Microbiologia y Parasitologia, Facultad de Medicina, Av. Universidad 3000 Universidad Nacional Autonoma de Mexico, DF, 04510, Mexico
Ana Flisser

Authors

Fernando Martinez-Hernandez
View author publications
You can also search for this author in PubMed Google Scholar
Diego Emiliano Jimenez-Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Arony Martinez-Flores
View author publications
You can also search for this author in PubMed Google Scholar
Guiehdani Villalobos-Castillejos
View author publications
You can also search for this author in PubMed Google Scholar
Gilberto Vaughan
View author publications
You can also search for this author in PubMed Google Scholar
Simon Kawa-Karasik
View author publications
You can also search for this author in PubMed Google Scholar
Ana Flisser
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Maravilla
View author publications
You can also search for this author in PubMed Google Scholar
Mirza Romero-Valdovinos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mirza Romero-Valdovinos.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

FMH, DEJG, AMF and GVC collected data and carried out the bioinformatics analysis. GV, SKK and AF participated in biological interpretations of results and in the discussion. PM and MRV formulated the idea. All authors contributed in writing the manuscript.

Electronic supplementary material

12985_2010_941_MOESM1_ESM.DOC

Additional file 1:A H1N1 gene sequences used for the genetic diversity analysis. List of GenBank sequences of A H1N1, number of accession and country of origin. (DOC 3 MB)

12985_2010_941_MOESM2_ESM.DOC

Additional file 2:Population genetic indexes among paired sequences of A H1N1 obtained from different countries. List of values (indexes) obtained for population genetic analysis among paired sequences from different countries after DnaSP v4 analysis. (DOC 529 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 9

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Martinez-Hernandez, F., Jimenez-Gonzalez, D.E., Martinez-Flores, A. et al. What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach. Virol J 7, 196 (2010). https://doi.org/10.1186/1743-422X-7-196

Download citation

Received: 03 June 2010
Accepted: 20 August 2010
Published: 20 August 2010
DOI: https://doi.org/10.1186/1743-422X-7-196

What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach

Abstract

Findings

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

12985_2010_941_MOESM1_ESM.DOC

12985_2010_941_MOESM2_ESM.DOC

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 9

Rights and permissions

About this article

Cite this article

Keywords

Navigation

What happened after the initial global spread of pandemic human influenza virus A (H1N1)? A population genetics approach

Abstract

Findings

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation