Abstract
Although the Guangxi region accounts for 10% of all HIV-1 cases new reported in 2011 in China, the sources of the transmitted HIV-1 strains are virtually unknown. To determine the extent to which recent HIV infections were derived from already circulating local strains as opposed to recently introduced strains, we performed a cross-sectional molecular epidemiological investigation of recent infections across Guangxi during 2012–2013. HIV-1 nucleotide sequences were amplified and sequenced. Phylogenetic analyses of pol gene regions were used to determine HIV-1 transmission source strains. Based on 229 sequences generated, the subtype/CRF distribution was as follows: CRF01_AE (61.1%), CRF07_BC (18.8%), CRF08_BC (16.6%), CRF55_01B (3.1%), and subtype B′ (0.4%). In total, 213 of 229 (93.0%) sequenced transmission strains were derived from already-circulating local strains. Multivariate logistic regression analysis showed that only an age of 18–25 years was significantly associated with transmission from outside Guangxi (compared to >25 years, AOR: 5.15, 95% CI: 1.18–22.48, p < 0.01). This is the first study to use a Bayesian discrete phylogeographic approach to analyze transmission source strains in China. Our results provide useful data for designing evidence-based prevention strategies and methods for combating the rapid spread of sexually transmitted HIV in Guangxi.
Similar content being viewed by others
Introduction
The Guangxi Zhuang Autonomous Region (Guangxi) is located in southwest China. Due to its location along a major heroin trafficking route linking Guangxi with Yunnan and Vietnam and its close proximity to the world’s major heroin-producing area, known as the Golden Triangle, human immunodeficiency virus (HIV) transmission in Guangxi was primarily fueled initially by intravenous drug use1. In 1996, the first case of HIV infection among Guangxi local residents was identified in an intravenous drug user (IDU) in Pingxiang city, close to Vietnam2. HIV prevalence among IDUs has increased rapidly since then3. HIV infection through drug injection accounted for 69% of reported HIV cases in Guangxi in 2003, but the proportion of HIV infections transmitted through sexual intercourse increased to 66% in 20093,4. Among the 31 provinces and municipalities in China, Guangxi has one of the highest rates of reported HIV infection, accounting for approximately 10% of reported HIV cases in 2011 in China4. Since the end of 2011, heterosexual transmission (90%) has been the predominant transmission route4. Because heterosexual transmission is the primary method by which HIV spreads from high-risk groups5 to the general population, the HIV epidemic poses greater challenges than ever before in Guangxi.
In 1996, four HIV subtypes were identified in Guangxi, consisting of subtypes B′ (Thai B), C, D, and CRF01_AE2. Subtypes C and CRF01_AE were being transmitted via IDUs and heterosexual transmission, whereas subtypes B′ and D were circulating among commercial blood donors. Subsequent sequence analysis showed two geographically distinct, highly homogeneous HIV-1 strains in Guangxi6,7. B/C intersubtype recombinants were found in IDUs from Baise, near the Yunnan-Guangxi border. Circulating recombinant-form (CRF) AE strains (now referred to as CRF01_AE) were found in IDUs from Pingxiang, near the China-Vietnam border, and from Nanning, the capital city of Guangxi7. Although CRF07_BC and CRF08_BC were first detected among IDUs in Guangxi in 19977,8, it is likely that these two CRFs were initially established in Yunnan Province9,10 and spread through overland heroin trafficking routes to Guangxi and Xinjiang1. Sequence analysis showed that CRF07_BC and CRF08_BC were closely related and may have evolved from a common parental strain11. CRF01_AE, originated in central Africa in the 1970s and spread in Thailand in the 1980s through heterosexual transmission12,13, first identified among IDUs in Guangxi, showed significant clustering with strains found in northern Vietnam6,14. Subsequently, a national molecular epidemiological survey conducted across China in 2006 showed that there were three major HIV-1 subtypes circulating in Guangxi, i.e., CRF01_AE, CRF08_BC, and CRF07_BC, accounting for 60.0%, 29.1%, and 6.4% of cases, respectively15. Other studies performed from 2008 to 2009 showed that CRF01_AE was the dominant subtype in Guangxi16,17. Two large CRF01_AE clusters were identified among heterosexual transmission cases in Guangxi18,19. One cluster originated from Vietnamese strains that were previously reported in IDUs. A second, novel cluster was also identified and showed a close relationship to strains from the Fujian province of China.
HIV-1 molecular epidemiological surveys of recent infections are important for understanding the real-time dynamics of the HIV-1 epidemic in Guangxi and the complexities of HIV-1 subtype transmission. The BED-capture enzyme immunoassay (BED-CEIA) has been widely used to measure the proportion of HIV-1-specific IgGs among total IgGs in blood samples for the purpose of identifying recent infections20,21. However, BED-CEIA classifies some individuals having long-standing infections as recently infected22,23. Use of a combination of non-serological biomarkers, such as CD4 cell counts, with BED-CEIA offers an alternative method for the purpose of accurately identifying recent infections24,25. In this study, we performed a cross-sectional molecular epidemiological survey among recently infected individuals identified by a combination of BED-CEIA and CD4 results. We further used bioinformatics approaches to analyze the sources of HIV-1 transmission strains in Guangxi, China.
Results
Demographic and epidemiological characteristics of study participants
Among 275 plasma samples from individuals recently infected with HIV-1 from 2012 to 2013, pol sequences were successfully amplified and genotyped from a total of 229 samples (83.3%). The demographic and epidemiological characteristics of the 229 genotyped individuals are summarized in Table 1. Among the study participants, 64.6% were males, and the mean age of the participants was 44.9 ± 16.1 years. In total, 66.8% (153/229) of participants were of Han ethnicity, 27.1% (62/229) were of Zhuang nationality, and 6.1% (14/229) were of other minority ethnicities, consisting of Yao, Mulao, Tujia, and Dong. The median CD4 cell count was 515 cells/µl (IQR: 419–604). Individuals were classified into the following risk groups: heterosexual (202/229, 88.2%), MSM (18/229, 7.9%), IDUs (4/229, 1.7%), and unknown (5/229, 2.2%). In terms of the distribution of cases among geographic regions, 71 cases were from central Guangxi (31.0%), consisting of 48 cases from Nanning, 15 from Guigang, and eight from Laibin; 46 cases were from southwestern Guangxi (20.1%), consisting of 24 from Qinzhou, eight from Baise, seven from Beihai, six from Chongzuo, and one from Fangchenggang; 60 cases were from northwestern Guangxi (26.2%), consisting of 26 from Guilin, 26 from Liuzhou, and eight from Hechi; and 52 cases were from southeastern Guangxi (22.7%), consisting of 22 from Yulin, 16 from Hezhou, and 14 from Wuzhou.
HIV-1 genotypes and clusters
Among the 229 pol sequences, 140 (61.1%), 43 (18.8%), 38 (16.6%), seven (3.1%), and one (0.4%) were identified as CRF01_AE, CRF07_BC, CRF08_BC, CRF55_01B, and subtype B, respectively (Fig. 1, Table 1). Distribution of HIV-1 genotypes of recently infections in each prefecture were shown in Fig. 2(B). The greatest diversity of HIV subtypes and CRFs was observed among the heterosexual population, with CRF01_AE, CRF07_BC, CRF08_BC, CRF55_01B, and subtype B′ accounting for 62.4% (126/202), 16.8% (34/202), 17.8% (36/202), 2.5% (5/202), and 0.5% (1/202) of cases, respectively. Among the MSM population, CRF01_AE, CRF07_BC, CRF08_BC, and CRF55_01B accounted for 38.9% (7/18), 44.4% (8/18), 5.6% (1/18), and 11.1% (2/18) of cases, respectively. The HIV-1 genotype distributions of the heterosexual (n = 202) and MSM (n = 18) populations were significantly different (Fisher’s exact test, χ2=12.582, p = 0.009). Among the IDU population, CRF01_AE and CRF08_BC accounted for 75.0% (3/4) and 25.0% (1/4) of cases, respectively. The genotypes CRF01_AE (80.0%, 4/5) and CRF07_BC (20.0%, 1/5) were found in the five individuals whose risk status was unknown.
As shown in Fig. 1, among 140 CRF01_AE sequences, 131 (93.6%) were grouped into four clusters: CRF01_AE Cluster 1 (77/140, 55.0%), CRF01_AE Cluster 2 (41/140, 29.3%), CRF01_AE Cluster 4 (7/140, 5.0%), and CRF01_AE Cluster 5 (6/140, 4.3%). A further nine (6.4%) sequences remained ungrouped (Fig. 1). CRF01_AE Cluster 1 strains were found among 73 heterosexuals, one MSM, two IDUs, and one individual of unknown risk group. CRF01_AE Cluster 2 strains were found among 37 heterosexuals, one IDU, and three individuals of unknown risk group. CRF01_AE Cluster 4 strains were found among three heterosexuals and five MSMs. CRF01_AE Cluster 5 strains were found among five heterosexuals and one MSM. Among 43 CRF07_BC sequences, two clusters were identified among 26 (60.5%) sequences, consisting of CRF07_BC Cluster 1 (16/43, 37.2%) and CRF07_BC Cluster 2 (10/43, 23.3%), whereas 17 (39.5%) sequences were not grouped (Fig. 1). CRF07_BC Cluster 1 strains were found among nine heterosexuals and seven MSMs, and CRF07_BC Cluster 2 strains were found among 10 heterosexuals. All seven CRF55_01B sequences (100%) clustered together into a single cluster of five heterosexuals and two MSMs. Among 38 CRF08_BC sequences, no clusters were identified (Fig. 1).
Sources of HIV-1 transmission strains
The sources of HIV-1 transmission strains among recently infected individuals in Guangxi were estimated by reconstruction of Bayesian discrete phylogeographic approaches under a Bayesian skygrid demographic model (See Method). The results of estimation were visualized on maximum clade credibility (MCC) trees (Fig. 3) and summarized in Table 2. Among 229 pol gene sequences, 213 (93.0%) were derived from strains already circulating in Guangxi, with the remaining 16 (7.0%) derived from strains circulating outside Guangxi. As shown in Table 2, sources of HIV transmission differed by sex, age, marital status, and risk group of the study participants. The proportion of source strains derived from those circulating outside Guangxi was 2.5% among female and 9.5% among male participants (p = 0.047); 21.2% among participants aged 18–25 years, 5.9% among those aged 26–49 years, and 3.2% among those aged ≥50 years (p = 0.002); 3.0% among married, 14.5 among unmarried, and 8.6% among divorced/widowed participants (p = 0.013); and 5.9% among heterosexual and 22.2% among MSM participants (p = 0.024). Multivariate logistic regression analysis showed that only age was significantly associated with a source of HIV-1 transmission from outside of Guangxi, with those aged 18–25 years being at highest risk (compared to >25 years, AOR: 5.15, 95% CI: 1.18–22.48, p < 0.01). Among a total of 16 HIV strains found to have been introduced from outside Guangxi, the most probable origins were Beijing (five strains), Shanghai (five strains) and Shenzhen (two strains) (Fig. 3). The most probable origins of the remaining four strains were four different provinces, none of which were the neighboring provinces of Yunnan, Guizhou, Hunan, or Guangdong.
Discussion
Guangxi is a relatively poor area in rural southwestern China that accounts for approximately 19% of new reported HIV cases in 2011 in China. Although traditional field epidemiological surveys focusing on HIV infection and related risk factors have been conducted, this is the first study to use molecular epidemiology to track the sources of HIV transmission strains among recently infected individuals in China. In this study, we found that 93.0% of new HIV-1 infections were derived from strains already circulating in Guangxi. In a study of the international metropolis Shanghai, the transmission network was found to include not only strains circulating in Shanghai itself but also those transmitted between Shanghai and other provinces26. The most probable sources of HIV transmission strains circulating outside Guangxi in our study were from the large Chinese cities of Beijing, Shanghai, and Shenzhen, which illustrates the complexities of large sexual networks arising from population mobility in China. Young individuals in Guangxi were more likely to have been infected with an HIV strain circulating outside Guangxi, indicating increased mobility and higher-risk sexual activities in this population group.
Changes in HIV epidemic characteristics and new trends among recent HIV infections in Guangxi were also identified in this study. In total, five HIV-1 subtypes and CRFs, CRF01_AE, CRF07_BC, CRF08_BC, CRF55_01B, and subtype B′, were detected among study participants in Guangxi. CRF01_AE, CRF07_BC, and CRF08_BC are the three major HIV-1 genotypes circulating in China, whereas CRF55_01B and subtype B′ are minor HIV-1 genotypes. Recent studies have reported that CRF01_AE is the dominant subtype in Guangxi15,17,27, which is consistent with the findings of our study. However, the proportion of CRF01_AE (61.1%) cases in this study was lower than that previously found. Earlier studies found that the proportion of CRF01_AE cases was 77.3% among 481 treatment-naïve HIV-1-infected participants in 2009–201017 and 83.1% among 260 participants who were newly diagnosed, treatment naïve, and 16–25 years of age in the cities of Liuzhou, Hezhou, and Nanning in Guangxi in 2009–201327. As previously reported by Feng et al.28, HIV-1 CRF01_AE strains circulating in China can be categorized into seven independent lineages. In this study, four of these lineages were identified among CRF01_AE strains circulating in Guangxi, corresponding to the CRF01_AE Clusters 1, 2, 4, and 5 identified by Feng et al.; no strains from CRF01_AE Clusters 3, 6, or 7 were found in this study. Among the four CRF01_AE clusters identified in this study, CRF01_AE Cluster 1 accounted for more than half of the total, which is consistent with the findings of some recent studies27,28. This indicates that CRF01_AE Cluster 1 strains are the major strains circulating in Guangxi. CRF01_AE Cluster 2 strains, which originated among early IDU populations in Guangxi7 and sexually transmitted populations from northern Vietnam29, accounted for one-third of the total. We also found that CRF01_AE Clusters 1 and 2 predominated among heterosexuals, whereas CRF 01_AE Cluster 4 predominated among MSMs, which is consistent with the findings of a previous study28.
The proportion of CRF07_BC (18.8%) strains in this study was higher than that found in previous studies, in which the proportions of CRF07_BC were 6.4% among 110 HIV-positive persons newly diagnosed in 200615 and 7.3% among 481 treatment-naïve HIV-1-infected participants in 2009–2010 in Guangxi17. In addition to CRF07_BC Cluster 1, which corresponds to the cluster identified by Li et al.30, a novel CRF07_BC Cluster 2, composed of ten sequences, was identified in this study. The proportion of CRF08_BC appears to vary across studies, from 29.1% in a 2006 study15, to 11.4% in a 2009–2010 study17, to 16.6% in this study. Moreover, no clusters were identified among CRF08_BC strains. These findings indicate that CRF07_BC may be experiencing an increasing epidemic trend, whereas infection with CRF08_BC has decreased among recently infected individuals in Guangxi. It is possible that the new wave of HIV infections among heterosexuals and MSMs is being rapidly driven by subtype CRF07_BC virus.
CRF55_01B, a novel HIV-1 CRF composed of CRF01_AE and subtype B with four recombination breakpoints in the pol gene, was first identified from three epidemiologically unlinked MSMs in China31. CRF55_01B was found among seven recently infected individuals in our study: five heterosexuals and two MSMs. Although CRF55_01B has been circulating widely among MSMs in major cities in southern, eastern, and central China32, the spread of HIV-1 CRF55_01B among both heterosexuals and MSMs may foster future HIV epidemics in Guangxi.
The majority of strains we detected in Guangxi in recently infected individuals were CRF01_AE, CRF07_BC, CRF08_BC and CRF55_01B. Only one subtype B′ strain was detected. All of these strains have been found in China for many years. Circulating recombinant forms such as CRF07_BC, CRF08_BC, and CRF55_01B even originated in China2,5,24. Moreover, the CRF01_AE strain in China originated from Thailand and Vietnam through early cross-border transmission in the 1990s25. However, several clusters of CRF01_AE that are unique to China have been characterized, which indicates that strains of CRF01_AE have been localized in China25. At least two CRF01_AE clusters transmitted mainly through heterosexuals and IDUs have been characterized in Guangxi24. In this study, our results indicate that the epidemic of HIV in Guangxi is mainly driven by local transmission. Although some strains from other provinces of China are present, we did not find any evidence to support cross-border transmission in recently infected individuals.
Our study has several limitations. A sampling bias might be present if there are many HIV-infected individuals in Guangxi whose infections have not been detected and diagnosed. Bias may also have been introduced in our representative HIV-1 pol gene reference sequence dataset, as not all individuals living with HIV have had their virus sequenced and not all sequences have been submitted to GenBank. Additionally, our method of determining HIV-1 transmission strain sources may have some limitations. We were not able to determine the direct source of each transmission, as intermediaries may exist between a case and its transmission source. However, our method does identify infections that are closely linked.
In summary, our study found that HIV-1 transmission in Guangxi mainly occurs from strains already circulating locally. We also found a high diversity of HIV-1 strains and recombinants co-circulating among recently infected individuals in Guangxi. These results may aid in designing evidence-based prevention strategies and methods for countering the rapid spread of this sexually transmitted epidemic in Guangxi. Future studies should focus on using HIV transmission networks to inform real-time prevention and intervention strategies.
Methods
Study participants and blood collection
Eligible study participants included newly diagnosed HIV-positive cases 18 years or older having a positive BED-CEIA test and a first CD4 cell count >350 cells/μl within three months of western blotting (WB) diagnosis. Exclusion criteria for study participants included AIDS, antiretroviral therapy (ART), and a first CD4 cell count <200 cells/μl within three months of WB diagnosis22,23. In total, 6,647 HIV/AIDS cases aged ≥18 years were newly diagnosed between July 2012 and June 2013 in 14 prefectures in Guangxi, China. Excluding those ineligible for the BED-CEIA test, 3,739 plasma samples, which were also diagnosed as HIV positive by western bolting (WB), were shipped to our Laboratory of HIV/AIDS Confirmation and stored at −80 °C for subsequent tests. All of these samples were tested for recent infection through the BED-CEIA test22,23, and 896 individuals were identified as positive. Among them, 608 individuals had first CD4 cell counts >350 cells/μl within three months of WB diagnosis and were defined as recently infected22,23. Among them, 275 plasma samples were randomly selected from 550 plasma samples having sufficient plasma volume and used for genotyping and phylogenetic analysis (For detailed sampling information, see Fig. 2(A), Supplementary Table S1 and Fig. S1). Socio-demographic data, containing sex, age, ethnicity, marital status, date of sampling, prefecture of sampling, date of HIV-positive diagnosis, transmission route, and first CD4 cell count within three months of WB diagnosis, were collected from the Guangxi HIV/AIDS case reporting system database at the Guangxi Zhuang Autonomous Region Center for Disease Control and Prevention.
Ethics Statement
All HIV tests were informed and voluntary. Written consent for HIV testing was obtained, in which the participants agreed that if they were diagnosed with HIV infection, their plasma samples could be used in research for the purpose of controlling and preventing HIV. The study protocol was approved by the Ethics Review Committee of Guangxi Center for Disease Control and Prevention. All participants provided written informed consent. All samples of the participants were tested in this study as approved by Guangxi institutional review board (IRB) and anonymized for our study. This study and all methods were approved by the IRB of the Guangxi Center for Disease Control and Prevention. All research methods in this study were performed in accordance with the approved guidelines.
HIV-1 RNA extraction, amplification, and sequencing
In total, 275 plasma samples identified as recently infected were used for genotype analysis. Viral RNA was extracted from 200 µl of plasma using the NucliSENS easyMAG platform (BioMérieux, Boxtel, Netherlands) according to the manufacturer’s instructions. Then, viral RNA was subjected to nested polymerase chain reaction (PCR) to obtain fragments of the pol gene (HXB2, positions 2147–3462 for a total of 1315 bp)33. The pol fragment was first amplified using One Step Reverse Transcription PCR (Takara, Dalian, China) with primers MAW26 (5′-TGGAAATGTGGAAAGGAAGGAC-3′ and RT21 (5′-CTGTATTTCTGCTATTAAGTCTTTTGATGGG-3′) in 25 µl reaction volumes. Cycling conditions were as follows: 50 °C for 30 min; 94 °C for 2 min; 30 cycles of 94 °C for 30 s, 55 °C for 30 s, and 72 °C for 2 min 30 s; and 72 °C for 10 min. Then, the second PCR was performed using 2 × Taq PCR MasterMix (Tiangen, Beijing, China) with primers Pro-1 (5′-CAGAGCCAACAGCCCCACCA-3′) and RT-20 (5′-CTGCCAGTTCTAGCTCTGCTTC-3′) in 50 µl reaction volumes. Cycling conditions were as follows: 94 °C for 5 min; 30 cycles of 94 °C for 30 s, 63 °C for 30 s, and 72 °C for 2 min 30 s; and 72 °C for 10 min. Each step was performed with appropriate negative controls to detect possible contamination during the experiments. PCR products were analyzed using 1% agarose gel electrophoresis. Positive PCR products were purified using QIAquick Gel Extraction Kit (Qiagen, Valencia, CA, USA) and sequenced directly on an ABI 3730XL automated sequencer using BigDye terminators (Applied Biosystems, Foster City, CA, USA) by Beijing Biomed Technology Development Co., Ltd (Beijing, China).
HIV-1 phylogenetic analysis
HIV-1 genotypes were determined based on neighbor-joining tree analysis in comparison with Los Alamos 2013 HIV-1 subtyping references. The nucleotide sequences of 229 HIV-1 pol genes (PR-RT region) were aligned separately using Gene Cutter (http://www.hiv.lanl.gov/cotent/sequence/.html) with HIV-1 group M subtype reference sequences [Los Alamos National Laboratory (LANL) HIV Sequence Database (http://www.hiv.lanl.gov/cotent/sequence/NEWALIGN/align.html, accessed in April 2013)] as follows: A1 (3 sequences), A2 (3), B (4), C (4), D (4), F1 (4), F2 (4), G (4), H (4), J (3), K (2), CRF01_AE (3), CRF02_AG (3), CRF 06_cpx (3), CRF07_BC (3), CRF08_BC (2), and group N (3). In addition to these 56 reference strains, we used 74 reference sequences from viruses that represent subtypes/CRFs commonly identified in China as follows: CRF01_AE (37), CRF08_BC (11), CRF07_BC (9), CRF55_01B (5), CRF59_01B (5), CRF67_01B (2), CRF68_01B (3), and subtype B (2). We also used two CRF01_AE strains from the Central African Republic and four CRF01_AE strains from Vietnam for a total of 136 reference sequences. Following alignment, manual adjustments were made using BioEdit software34,35, taking into consideration protein coding sequences. Maximum-likelihood trees were generated in RAxML using a GTRGAMMA model. Bootstrap trees were produced using the rapid bootstrapping algorithm and 1000 bootstrap replicates5. Clusters with bootstrap value higher than 0.85 (85%) were defined as phylogenetic cluster.
HIV-1 transmission strain source analysis
To determine the origins of the strains in recently infected individuals, all available pol gene (PR-RT region) sequences from China were downloaded from the Los Alamos HIV Sequence Database (8,607 sequences, accessed March, 2017). In addition, considering that Guangxi is located in southern China, sequences from neighboring countries and regions in South Asia were also downloaded (613 from Vietnam, 1,420 from Thailand, 240 from Myanmar, 424 from Malaysia, 265 from Philippines, 1,450 from Cambodia, 292 from Hong Kong and 443 from Taiwan). All the downloaded sequences were combined with the 8,536 PR-RT region sequences obtained from the Chinese national HIV surveillance and drug resistance monitoring database (not open access) to yield a total dataset of 22,290 sequences. A local BLAST database was built with all of the 22,290 sequences using the application makeblastdb within the BLAST+ software package. BLAST+ searches were performed for the 229 pol sequences of our study participants against the local BLAST database, and the five sequences most similar to each query were selected36. After removal of duplicate sequences, 707 pol sequences were used for determining HIV-1 transmission strain sources, consisting of 478 unique pol sequences identified by BLAST and 229 pol gene sequences from our study participants.
For more precise analysis and better display of phylogenetic trees, we separated the 707 pol sequences into four data sets based on genotype (CRF01_AE, CRF07_BC, CRF08_BC, or CRF55_01B), for datasets of 482, 127, 73, and 25 sequences, respectively (Supplementary Fig. S2). As there was only one sequence belonging to subtype B′, no phylogenetic tree was needed to determine transmission source. To reconstruct the spatial dynamics and estimate the sources of the strains of our study participants, a Bayesian discrete phylogeographic approach was performed using Markov chain Monte Carlo (MCMC) runs of 300 million generations with BEAST v.1.8.2 under a Bayesian skygrid demographic model. The first 10–30% of the states from each run were discarded as burn-in37,38. All four data sets were analyzed using a general time-reversible (GTR) model specifying a gamma distribution as a prior on each relative substitution rate and a relaxed uncorrelated lognormal (UCLN) molecular clock model to infer the timescale of HIV evolution with a gamma distribution prior on the mean clock rate (shape = 0.001, scale = 1000)39. The Bayesian MCMC output was analyzed using Tracer v1.6 (http://beast.bio.ed.ac.uk/Tracer). The most probable origin of the study participants was estimated according to output of the posterior of Bayesian estimation and visualized on maximum clade credibility (MCC) trees using the program FigTree v1.4.0 (http://beast.bio.ed.ac.uk) with the same calculation.
Nucleotide sequence accession numbers
All HIV-1 pol gene nucleotide sequences obtained in this study were submitted to GenBank under accession numbers KY226003-KY226231.
Statistical analysis
Statistical analyses for this study were performed using the SPSS 17.0 statistical analysis software package (SPSS Inc. Chicago, IL, USA). Categorical variables were compared using the chi-squared test. A stepwise multivariate logistic regression model was constructed to select the variables that were independently associated with HIV transmission strain sources. All tests were two-tailed, and p < 0.05 was considered statistically significant.
Change history
27 November 2018
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.
References
Beyrer, C. et al. Overland heroin trafficking routes and HIV-1 spread in south and south-east Asia. AIDS 14, 75–83 (2000).
Chen, J., Liu, W. & Nancy, L. Y. Molecular-epidemiological analysis of HIV-1 initial prevalence in Guangxi, China. Zhonghua Liu Xing Bing Xue Za Zhi 20, 74–77 (1999).
Zhu, Q. et al. Analysis on HIV/AIDS epidemic situation in Guangxi from 1989 to 2006. Applied Prev Med 14, 70–73 (2008).
Wang, Y. et al. Epidemiological characteristics of HIV/AIDS in Guangxi, 2009–2011. South China J Prev Med 39, 6–11 (2013).
Stamatakis, A., Ludwig, T. & Meier, H. RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21, 456–463, https://doi.org/10.1093/bioinformatics/bti191 (2005).
Yu, X. F. et al. Emerging HIV infections with distinct subtypes of HIV-1 infection among injection drug users from geographically separate locations in Guangxi Province, China. J Acquir Immune Defic Syndr 22, 180–188 (1999).
Piyasirisilp, S. et al. A recent outbreak of human immunodeficiency virus type 1 infection in southern China was initiated by two highly homogeneous, geographically separated strains, circulating recombinant form AE and a novel BC recombinant. J Virol 74, 11286–11295 (2000).
Su, L. et al. Characterization of a virtually full-length human immunodeficiency virus type 1 genome of a prevalent intersubtype (C/B′) recombinant strain in China. J Virol 74, 11367–11376 (2000).
Qiu, Z. et al. Characterization of five nearly full-length genomes of early HIV type 1 strains in Ruili city: implications for the genesis of CRF07_BC and CRF08_BC circulating in China. AIDS Res Hum Retroviruses 21, 1051–1056, https://doi.org/10.1089/aid.2005.21.1051 (2005).
Tee, K. K. et al. Temporal and spatial dynamics of human immunodeficiency virus type 1 circulating recombinant forms 08_BC and 07_BC in Asia. J Virol 82, 9206–9215, https://doi.org/10.1128/JVI.00399-08 (2008).
McClutchan, F. E. et al. Precise mapping of recombination breakpoints suggests a common parent of two BC recombinant HIV type 1 strains circulating in China. AIDS Res Hum Retroviruses 18, 1135–1140, https://doi.org/10.1089/088922202320567879 (2002).
Gao, F. et al. The heterosexual human immunodeficiency virus type 1 epidemic in Thailand is caused by an intersubtype (A/E) recombinant of African origin. J Virol 70, 7013–7029 (1996).
McCutchan, F. E. et al. Diversity of the envelope glycoprotein among human immunodeficiency virus type 1 isolates of clade E from Asia and Africa. J Virol 70, 3331–3338 (1996).
Kato, K. et al. Closely related HIV-1 CRF01_AE variant among injecting drug users in northern Vietnam: evidence of HIV spread across the Vietnam-China border. AIDS Res Hum Retroviruses 17, 113–123, https://doi.org/10.1089/08892220150217201 (2001).
He, X. et al. A comprehensive mapping of HIV-1 genotypes in various risk groups and regions across China based on a nationwide molecular epidemiologic survey. PLoS One 7, e47289, https://doi.org/10.1371/journal.pone.0047289 (2012).
Liu, W. et al. Distribution of HIV-1 subtypes in Guangxi Zhuang Autonomous Region, 2008–2009. Zhonghua Liu Xing Bing Xue Za Zhi 34, 53–56 (2013).
Li, L. et al. Different distribution of HIV-1 subtype and drug resistance were found among treatment naive individuals in Henan, Guangxi, and Yunnan province of China. PLoS One 8, e75777, https://doi.org/10.1371/journal.pone.0075777 (2013).
Li, L. et al. Genetic characterization of 13 subtype CRF01_AE near full-length genomes in Guangxi, China. AIDS Res Hum Retroviruses 26, 699–704, https://doi.org/10.1089/aid.2010.0026 (2010).
Li, L. et al. Subtype CRF01_AE dominate the sexually transmitted human immunodeficiency virus type 1 epidemic in Guangxi, China. J Med Virol 85, 388–395, https://doi.org/10.1002/jmv.23360 (2013).
Parekh, B. S. et al. Quantitative detection of increasing HIV type 1 antibodies after seroconversion: a simple assay for detecting recent HIV infection and estimating incidence. AIDS Res Hum Retroviruses 18, 295–307, https://doi.org/10.1089/088922202753472874 (2002).
Dobbs, T., Kennedy, S., Pau, C. P., McDougal, J. S. & Parekh, B. S. Performance characteristics of the immunoglobulin G-capture BED-enzyme immunoassay, an assay to detect recent human immunodeficiency virus type 1 seroconversion. J Clin Microbiol 42, 2623–2628, https://doi.org/10.1128/JCM.42.6.2623-2628.2004 (2004).
UNAIDS_Reference_Group_on_Estimates_Modeling_and_Projections. Statement on the use of the BED assay for estimation of HIV-1 incidence or epidemic monitoring. Weekly Epidemiol Rec 81, 33–40 (2006).
Guy, R. et al. Accuracy of serological assays for detection of recent infection with HIV and estimation of population incidence: a systematic review. Lancet Infect Dis 9, 747–759, https://doi.org/10.1016/S1473-3099(09)70300-7 (2009).
Brookmeyer, R., Laeyendecker, O., Donnell, D. & Eshleman, S. H. Cross-sectional HIV incidence estimation in HIV prevention research. J Acquir Immune Defic Syndr 63(Suppl 2), S233–239, https://doi.org/10.1097/QAI.0b013e3182986fdf (2013).
Laeyendecker, O. et al. HIV incidence determination in the United States: a multiassay approach. J Infect Dis 207, 232–239, https://doi.org/10.1093/infdis/jis659 (2013).
Li, X. et al. Evolutionary Dynamics and Complicated Genetic Transmission Network Patterns of HIV-1 CRF01_AE among MSM in Shanghai, China. Scientific reports 6, 34729, https://doi.org/10.1038/srep34729 (2016).
Zhang, J. et al. Genetic Characteristics of CRF01_AE Among Newly Diagnosed HIV-1-Infected 16- to 25-Year Olds in 3 Geographic Regions of Guangxi, China. Medicine 94, e894, https://doi.org/10.1097/MD.0000000000000894 (2015).
Feng, Y. et al. The rapidly expanding CRF01_AE epidemic in China is driven by multiple lineages of HIV-1 viruses introduced in the 1990s. AIDS 27, 1793–1802, https://doi.org/10.1097/QAD.0b013e328360db2d (2013).
Liao, H. et al. Phylodynamic analysis of the dissemination of HIV-1 CRF01_AE in Vietnam. Virology 391, 51–56, https://doi.org/10.1016/j.virol.2009.05.023 (2009).
Li, X. et al. Molecular epidemiology of HIV-1 in Jilin province, northeastern China: emergence of a new CRF07_BC transmission cluster and intersubtype recombinants. PLoS One 9, e110738, https://doi.org/10.1371/journal.pone.0110738 (2014).
Han, X. et al. Genome Sequences of a Novel HIV-1 Circulating Recombinant Form, CRF55_01B, Identified in China. Genome Announc 1, https://doi.org/10.1128/genomeA.00050-12 (2013).
Han, X. et al. A Large-scale Survey of CRF55_01B from Men-Who-Have-Sex-with-Men in China: implying the Evolutionary History and Public Health Impact. Sci Rep 5, 18147, https://doi.org/10.1038/srep18147 (2015).
Liao, L. et al. Genotypic analysis of the protease and reverse transcriptase of HIV type 1 isolates from recently infected injecting drug users in western China. AIDS Res Hum Retroviruses 23, 1062–1065, https://doi.org/10.1089/aid.2007.0050 (2007).
Hall, T. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41, 95–98 (1999).
Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F. & Higgins, D. G. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25, 4876–4882 (1997).
Altschul, S. F., Gish, W., Miller, E. W. & Myers, E. W. Basic Local Alignment Search Tool. J. Mol. Biol. (1990) 215, 403–410 (1990).
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology 7, 214, https://doi.org/10.1186/1471-2148-7-214 (2007).
Gill, M. S. et al. Improving Bayesian Population Dynamics Inference: A Coalescent-Based Model for Multiple Loci. Molecular Biology and Evolution 30, 713–724, https://doi.org/10.1093/molbev/mss265 (2012).
Drummond, A. J., Matthew, S. Y. W. H. & Andrew Rambaut, J. P. Relaxed Phylogenetics and Dating with Confidence. PLoS Biology 4, 0699–0710, https://doi.org/10.1371/journal.pbio.0040088.g001 (2006).
Acknowledgements
The authors would like to thank Lingjie Liao (China CDC), and Xingguang Li (China CDC) for technical support and analysis of the data. The authors would also like to thank the Group for the HIV Molecular Epidemiologic Survey. Contributing members of the Group [name of contributor (facility of the contributor)]: Yi Luan (Nanning CDC); Houjin Ma (Guilin CDC); Xing Liu (Liuzhou CDC); Mei Li (Qinzhou CDC); Ruiguo Ye (Yulin CDC); Daquan Luo (Baise CDC); Yanfei Meng (Guigang CDC); Shuyin zhao (Hezhou CDC); Bingjian Liang (Wuzhou CDC); Tianji Huang (Chongzuo CDC); Baohui Tang (Hechi CDC); Shuzhi Chen (Laibin CDC). Hao Zhu (Beihai CDC), Weifei Pang (Fangchenggang CDC). We would like to thank Editage [www.editage.cn] for English language editing.
Author information
Authors and Affiliations
Contributions
Z.T., Z.S., and J.L. conceived and designed the study. J.L., Y.F., Y.L., R.X., H.Z., J.W., X.Z., Y.D., N.F., G.L., S.L., and Q.Z. performed the experiments and analyzed the data. J.L., Y.L., Z.T., and Y.R. drafted the manuscript. Z.T., Y.F., Z.S., H.Z., H.X., Y.R., and Y.S. interpreted data and provided critical review. All authors reviewed and approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, J., Feng, Y., Shen, Z. et al. HIV-1 Transmissions Among Recently Infected Individuals in Southwest China are Predominantly Derived from Circulating Local Strains. Sci Rep 8, 12831 (2018). https://doi.org/10.1038/s41598-018-29201-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-29201-3
- Springer Nature Limited
Keywords
This article is cited by
-
HIV-1 subtype diversity and transmission strain source among men who have sex with men in Guangxi, China
Scientific Reports (2021)
-
Declining trend in HIV new infections in Guangxi, China: insights from linking reported HIV/AIDS cases with CD4-at-diagnosis data
BMC Public Health (2020)
-
Effects of HIV-1 genotype on baseline CD4+ cell count and mortality before and after antiretroviral therapy
Scientific Reports (2020)