Background

Four decades have elapsed since the discovery of the first case of acquired immune deficient syndrome (AIDS) [1] in 1981 in the USA. Since then, the causal agent of AIDS, the human immunodeficiency virus (HIV), and HIV type 1 (HIV-1) in particular, has spread worldwide, causing a serious global public health problem. According to an official report [2] released by the Joint United Nations Program on HIV/AIDS, 37.7 million people were living with HIV worldwide in 2020, despite the success of HIV control efforts. HIV-1 genetic diversity is complex and constantly evolving. To date, 118 HIV-1 circulating recombinant forms (CRFs) have been registered in the HIV Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) and numerous unique recombinant forms (URFs) have also been identified [3]. A global survey of 383,519 samples from 116 countries found that the three most common HIV-1 subtypes globally were C, B, and A, accounting for 69% of all HIV isolates [4]. However, recombinant HIV-1 strains are more frequently observed in China than in other countries, with CRF07_BC and CRF01_AE as the dominant strains [5]. HIV-1 diversity can impact the results of diagnostic assays, viral load determination, and the development of HIV vaccines [4, 6]. Therefore, the continued surveillance of HIV-1 subtypes is essential.

Hebei Province in North China, covers 190,000 km2 and comprises 11 prefectures. At the end of October 2020, a total of 17,891 individuals with HIV-1/AIDS were reported in the whole province [7]. Among the newly reported cases, the proportion of homosexual transmission had increased from 41.8% to 2010 to 64.8% in 2020 in Hebei. Moreover, because of this shift in HIV-1 transmission routes (from blood to sexual contact), the most prevalent HIV-1 subtype in Hebei switched from subtype B (before 2005) to CRF01_AE (after 2005) [8, 9]. However, subtype B has remained in circulation in Hebei and now constitutes the third most prevalent subtype in this province [9]. Thus, the cocirculation and dual infection (via the sexual route of transmission) with the CRF01_AE and subtype B strains will undoubtedly provide opportunities for recombination between strains.

Of the 11 prefectures in Hebei, Cangzhou, which shares a border with Tianjin, is severely affected by HIV-1. Since the first case of HIV-1 infection in 2003, more than 100 HIV-1-positive cases across the eight counties of Cangzhou have been reported. Homosexual contact plays a key role in HIV-1 spread in Cangzhou Prefecture [10], with 77.6% of HIV-1 infections observed in individuals aged 15–49 years [10]. However, few studies of URFs have been conducted in the HIV-1-infected population of Cangzhou Prefecture.

In the present study, two novel URFs (hcz0017 and hcz0045), which originated from a recombination between CRF01_AE and subtype B, were detected in two recently infected men who have sex with men (MSM) based in Cangzhou Prefecture, Hebei, China by near full-length genome (NFLG) sequence analysis.

Materials and methods

Study participants

The hcz0017 URF originated from a divorced 55-year-old man, whereas hcz0045 was derived from an unmarried 23-year-old man (Table 1). Both subjects were infected with HIV-1 through homosexual contact, and their CD4+ T cell counts and viral loads were over 300 cells/mm3 and 3 × 104 copies/ml, respectively. The individuals were confirmed as anti-HIV-1-antibody-positive by western blotting in November 2019 and were diagnosed as having recently acquired HIV-1 infection using a limiting antigen avidity enzyme immunoassay, CD4+ T cell counts, and viral load quantification. Blood samples were obtained from the two donors during their voluntary counseling and testing appointments. Written informed consent was obtained from both HIV-1 individuals prior to blood collection. The study was approved by the Local Ethics Committee of Hebei Provincial Center for Disease Control and Prevention (No. IRB(S)2020-031).

Table 1 Baseline information of two individuals recently infected with HIV-1

Quantification of CD4 + T cell counts and HIV-1 viral load

CD4+ T cell counts were determined using the FACSCount System (Becton–Dickinson, Franklin Lakes, NJ, USA) from 50 µL of whole blood. HIV-1 viral load was quantified using the Amplicor HIV-1 Monitor test (COBAS TaqMan 48; Roche, Switzerland).

HIV-1 genome characterization

An HIV-1 NFLG assay was performed as previously described [11]. Raw sequences were assembled using Contig Express 9.1. Multiple sequence alignment with Clustal W and manual editing were performed using Bio-Edit 7.0 software. The standard reference sequences of HIV-1 subtypes were downloaded from the HIV Databases (http://www.hiv.lanl.gov/content/index), including all full-length CRF sequences associated with 01/B recombination. The neighbor-joining (N-J) phylogenetic trees were constructed using the Kimura two-parameter model with 1,000 bootstrap replicates in MEGA 6.0. The jpHMM and RIP 3.0 online tools were used to analyze recombination breakpoints of the two NFLGs.

Results

We found that hcz0017 and hcz0045 formed a distinct monophyletic branch, separately from other subtypes and CRFs in the NFLG N-J phylogenetic tree (Fig. 1), suggesting that these two NFLG sequences were novel recombinant forms. According to the results of the jpHMM and RIP 3.0 breakpoint analysis, the hcz0017 and hcz0045 NFLG sequences originated from a recombination between CRF01_AE and subtype B (Figs. 2 and 3) and each contained seven subregions. The hcz0017 NFLG was composed of three CRF01_AE fragments inserted to a subtype B backbone, while the hcz0045 NFLG was composed of three subtype B fragments within a CRF01_AE backbone.

Fig. 1
figure 1

Phylogenetic tree analysis of HIV-1 NFLG sequences. A neighbor-joining tree was constructed using MEGA 6.0, with 1,000 bootstrap replicates. The standard HIV-1 subtype reference sequences were downloaded from HIV Databases (http://www.hiv.lanl.gov/content/index). Bootstrap values ≥ 60% are shown in the tree. The scale bar indicates 5% nucleotide sequence divergence. Each black dot denotes a study subject

Fig. 2
figure 2

Recombination breakpoint analyses of hcz0017 (A) and hcz0045 (B). The NFLG mosaic maps were generated using the online tool jpHMM (http://jphmm.gobics.de/)

Fig. 3
figure 3

RIP analysis of the hcz0017 and hcz0045 NFLG sequences. Similarity distance analysis was performed using RIP (version 3.0; Siepel AC, Halpern AL, Macken C, Korber BT, http://hiv-web.lanl.gov) from the Los Alamos National Laboratory HIV Database with default settings, except for a window size of 300. Color images are available in the online version of this manuscript

Next, we performed a detailed analysis of the hcz0017 and hcz0045 NFLG sequence mosaic structures (Figs. 2 and 3). The following positions were assigned to the seven subregions of each URF according to the HXB2 numbering system: (1) hcz0017: IB (790–1,171 nt), IICRF01_AE (1,172–2,022 nt), IIIB (2,023–4,469 nt), IVCRF01_AE (4,470–5,866 nt), VB (5,867–7,462 nt), VICRF01_AE (7,463–8,379 nt), VIIB (8,380–9,411 nt); (2) hcz0045: ICRF01_AE (790–5,147 nt), IIB (5,148–5,614 nt), IIICRF01_AE (5,615–6,035 nt), IVB (6,036–6,241 nt); VCRF01_AE (6,242–7,325nt), VIB (7,326–8,254 nt), VIICRF01_AE (8,255–9,411 nt). The phylogenetic tree analysis (Figs. 4 and 5) based on each subregion of each of the NFLGs revealed that all CRF01_AE and B subregions in hcz0017 and hcz0045 clustered with their respective reference sequences. This also revealed the NFLG mosaic structures of the two novel, second-generation recombinant HIV-1 forms. Furthermore, all CRF01_AE and subtype B fragments within the two URFs were closely associated with the Thai-CRF01_AE and Euro-American subtype B, respectively, which have been prevalent in Hebei for 30 years.

Fig. 4
figure 4

Subregion phylogenetic tree of hcz0017. A neighbor-joining tree was constructed using MEGA 6.0, with 1,000 bootstrap replicates. Each black dot denotes a study subject. Bootstrap values ≥ 70% are shown at the corresponding nodes. The scale bar indicates 5% genetic distance

Fig. 5
figure 5

Subregion phylogenetic tree of hcz0045. A neighbor-joining tree was constructed using MEGA 6.0, with 1,000 bootstrap replicates. Each black dot denotes a study subject. Bootstrap values ≥ 70% are shown at the corresponding nodes. The scale bar indicates 5% genetic distance

Discussion

18.6% (22/118) of the CRFs listed in the HIV databases have originated from the recombination between CRF01_AE and subtype B. Moreover, six of these were found in MSM, including CRF55_01B, CRF67_01B, CRF69_01B, CRF59_01B, CRF68_01B, and CRF103_01B. Of the above six CRFs, five were found in China and one (CRF69_01B) in Japan. Indeed, novel, second-generation recombinant forms of HIV-1 have been identified continually in some prefectures of Hebei such as Baoding [12], Handan [13], and Shijiazhuang [14]. In this study, the identification of two CRF01_AE/B recombinant forms, with intricate genomic mosaic structures, highlights the complexity of HIV-1 genetics in Cangzhou Prefecture. Moreover, the two MSM in whom the CRF01_AE/B recombinant forms were diagnosed with recent HIV-1 infection, implying a high level of promiscuity among MSM in this region. Our previous study [15] showed that the transmission of HIV-1 occurred among MSM from neighboring provinces, including Beijing, Tianjin, and Hebei. At present, homosexual contact is the most frequent transmission route in the Beijing-Tianjin-Hebei region. Thus, HIV-1 diversity should be constantly monitored in Hebei and its neighboring provinces to more effectively control the spread of HIV-1 among MSM.

Conclusions

In our study, two novel URFs, which originated from a recombination between CRF01_AE and subtype B, were identified in two recently infected MSM based in Cangzhou Prefecture, Hebei, China by NFLG sequence analysis. Our results highlight the need to continually monitor HIV-1 diversity in Hebei and its neighboring provinces to achieve a more effective control of HIV-1 spread within the MSM community.