Journal of Molecular Evolution

Volume 53, Issue 1, pp 55–62

Substitution Model of Sequence Evolution for the Human Immunodeficiency Virus Type 1 Subtype B gp120 Gene over the C2-V5 Region

  • Jon P. Anderson
  • Allen G. Rodrigo
  • Gerald H. Learn
  • Yang Wang
  • Hillard Weinstock
  • Marcia L. Kalish
  • Kenneth E. Robbins
  • Leroy Hood
  • James I. Mullins

Abstract.

Phylogenetic analyses frequently rely on models of sequence evolution that specify nucleotide substitution rates, nucleotide frequencies, and site-to-site rate heterogeneity. These models can influence hypothesis testing and can affect the accuracy of phylogenetic inferences. Maximum likelihood (ML) methods that simultaneously construct phylogenetic tree topologies and estimate model parameters are computationally intensive, and are not feasible for sample sizes of 25 or greater using personal computers. Techniques that first construct a tree topology and then use this non-maximized topology to estimate ML substitution rates, however, can quickly arrive at a model of sequence evolution. The accuracy of this two-step estimation technique was tested using simulated data sets with known model parameters. The results showed that for a star-like topology, as is often seen in human immunodeficiency virus type 1 (HIV-1) subtype B sequences, a random starting topology could produce nucleotide substitution rates that were not statistically different from the true rates. Samples were isolated from 100 HIV-1 subtype B infected individuals from the United States, and a 620 nt region of the env gene was sequenced for each sample. The sequence data were used to obtain a substitution model of sequence evolution specific for HIV-1 subtype B env by estimating nucleotide substitution rates and site-to-site rate heterogeneity. The method of estimating the model should provide users of large data sets with a way to quickly compute a model of sequence evolution, while the nucleotide substitution model we identified should prove useful in the phylogenetic analysis of HIV-1 subtype B env sequences.

Key words: HIV-1, env, Nucleotide substitution rates, Rate heterogeneity, Maximum likelihood, Evolutionary model, Simulations
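
To illustrate what a model of sequence evolution with substitution rates, nucleotide frequencies, and site-to-site rate heterogeneity can look like in practice, the following is a minimal sketch in Python. It assumes a general time-reversible parameterization, which the abstract does not name, and it uses arbitrary placeholder values (`EXAMPLE_EXCHANGE`, `EXAMPLE_FREQS`, and the category rates passed to `mixed_transition_probs`) rather than the estimates reported in the paper. All function names are hypothetical and chosen only for this example.

```python
BASES = "ACGT"

# Placeholder parameters for illustration only; NOT the paper's estimates.
EXAMPLE_EXCHANGE = {"AC": 1.0, "AG": 3.0, "AT": 1.0,
                    "CG": 1.0, "CT": 3.0, "GT": 1.0}
EXAMPLE_FREQS = {"A": 0.40, "C": 0.15, "G": 0.20, "T": 0.25}


def gtr_rate_matrix(exchange, freqs):
    """Build a reversible rate matrix Q with q_ij = s_ij * pi_j for i != j.

    The matrix is rescaled so the expected rate is one substitution
    per site per unit of branch length.
    """
    q = [[0.0] * 4 for _ in range(4)]
    for i, x in enumerate(BASES):
        for j, y in enumerate(BASES):
            if i != j:
                key = "".join(sorted(x + y))  # unordered pair, e.g. "AG"
                q[i][j] = exchange[key] * freqs[y]
        q[i][i] = -sum(q[i])  # diagonal makes each row sum to zero
    mean_rate = -sum(freqs[b] * q[i][i] for i, b in enumerate(BASES))
    return [[v / mean_rate for v in row] for row in q]


def mat_mul(a, b):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transition_probs(q, t, squarings=10, terms=12):
    """Approximate P(t) = exp(Q t) by a truncated Taylor series
    on Q t / 2**squarings, followed by repeated squaring."""
    scale = t / (2 ** squarings)
    a = [[v * scale for v in row] for row in q]
    result = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    term = [row[:] for row in result]
    for n in range(1, terms + 1):
        term = [[v / n for v in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(4)]
                  for i in range(4)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def mixed_transition_probs(q, t, category_rates):
    """Average P(r * t) over equally weighted rate categories,
    a simple stand-in for site-to-site rate heterogeneity."""
    mats = [transition_probs(q, r * t) for r in category_rates]
    return [[sum(m[i][j] for m in mats) / len(mats) for j in range(4)]
            for i in range(4)]
```

A call such as `transition_probs(gtr_rate_matrix(EXAMPLE_EXCHANGE, EXAMPLE_FREQS), 0.1)` returns a 4x4 matrix of substitution probabilities along a branch of length 0.1. In a two-step approach like the one the abstract describes, the tree topology would be fixed first and parameters of this kind would then be estimated by maximum likelihood on that tree; the estimation step itself is not shown here.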

Copyright information

© Springer-Verlag New York Inc. 2001

Authors and Affiliations

  • Jon P. Anderson (1)
  • Allen G. Rodrigo (2)
  • Gerald H. Learn (2)
  • Yang Wang (2)
  • Hillard Weinstock (3)
  • Marcia L. Kalish (3)
  • Kenneth E. Robbins (3)
  • Leroy Hood (1)
  • James I. Mullins (2)

  1. Department of Molecular Biotechnology, Health Sciences Center, University of Washington, Seattle, WA 98195, USA
  2. Department of Microbiology, Health Sciences Center, University of Washington, Seattle, WA 98195, USA
  3. Centers for Disease Control and Prevention, 1600 Clifton Rd, Atlanta, GA 30333, USA
