Background

Identifying the sources of persistent plasma HIV-1 during treatment with antiretroviral therapy (ART) is critical for designing strategies to target virus producing cells for elimination. Previous studies have attempted to determine whether persistent viremia during successful combination ART results from incomplete suppression of virus replication [19] or if it is produced by long-lived and/or proliferating infected cells [2, 918]. Our recent study on the genetics of persistent viremia during ART [18] showed that the plasma viral population after long-term ART consists of the same variants that were present prior to initiating ART, indicated that it originated from cells, or the descendants of cells, that were infected prior to therapy. Furthermore, investigations by Bailey et al. and Kearney et al. on the genetics of plasma viremia determined by single-genome sequencing revealed populations of identical viral sequences present during ART [11, 18]. These findings together imply that persistent viremia during ART may be the result of virus release from long-lived, proliferating cells and not from on-going, virus replication. Direct evidence now exists that supports the idea that persistent viremia is maintained by the expression of virus from proliferating blood cells [19]. This study showed clear evidence of proliferation of infected CD4+ T-cells collected from patients at various times before and during therapy. Furthermore, Maldarelli et al. identified a population of proliferating CD4+ cells that carried a viral variant that was also predominant in the plasma of a patient on ART [19]. These findings provide the strongest evidence to date that persistent viremia during ART can arise from expression of HIV-1 from expanded CD4+ cell populations that were infected prior to initiating therapy. A subset of such proliferating cells is likely the reservoir for viral rebound when ART is interrupted and, therefore, must be eliminated to control HIV-1 infection off ART.

The magnitude and the distribution of the HIV reservoir that contributes to viral rebound off ART are incompletely defined. One study (Simonetti et al. CROI 2015) showed that identical HIV-1 sequences found in the plasma of an individual being treated with ART originated from cells in the blood, gut, and lymph nodes that were clonally expanded from a single infected progenitor. These findings suggest that infected, proliferating cells are widely distributed without evidence of compartmentalization, and that persistent viremia during ART may be derived from multiple tissue sources. By contrast, other studies aimed at profiling HIV-1 genetics have concluded that the virus is sometimes compartmentalized across tissues [2039]. These conflicting results may be explained in part by a study by Bull et al. who showed that the appearance of compartmentalization, in at least some cases, is due to sampling error and/or over-representation of identical viral variants resulting from cell proliferation and not from unique HIV-1 populations existing in different tissues [40]. These findings suggest that infected cells are well distributed across tissues (the brain and genital secretions may be exceptions [41]) and that local expansion of infected cells may influence the frequency of viral variants detected within tissues, giving the false appearance of compartmentalization. A widely distributed and well mixed model of HIV-1 populations would predict that as new variants emerge in tissues, they would be rapidly disseminated through blood to other compartments, as opposed to local compartmentalization of new variants through ongoing replication. If the well mixed model is confirmed, it would have important implications including: (1) plasma virus population would mirror populations in tissues and, as such, could be used to monitor diversity and evolution in tissues, (2) variants arising from ongoing replication in sanctuary tissues sites would be rapidly disseminated to other tissues and readily detectable in the blood, and (3) expanded clones that may be an important reservoir for HIV would be spread throughout tissues.

Additional studies are needed to determine if HIV-1 is well distributed or is compartmentalized across tissues. These studies are challenging to perform in humans because of the difficulty in obtaining relevant human samples for testing. Animal models are an appropriate substitute for such investigations as they enable extensive sampling, including collection at scheduled necropsy, allowing for expansive and detailed analyses of the viral populations in blood and tissues. Animal models also eliminate the complications of inter-host viral diversity and incomplete treatment adherence, which are common issues in human studies. Previous studies of infected macaques have shown that intracellular viral RNA and viral DNA are present in many tissues [42, 43] and that plasma virus is derived from multiple tissue sources such as the gut [44, 45] or from the lymph nodes [46]. To investigate both the anatomic sources of persistent plasma virus during ART, the nature of proliferating cells (marked by expanding viral variants), and the possibility of on-going viral replication in the tissues during ART, we characterized the genetics of viral variants across anatomic sites in untreated and ART-treated pigtailed macaques infected with a strain of SIVmne containing HIV-1 RT (RT-SHIVmne) [43, 47, 48]. Infection with RT-SHIVmne has previously been shown to result in pre-therapy RNA levels that are similar to HIV-1 infection in humans, to display dynamics of viral decay on ART similar to that in humans, and to achieve sustained levels of suppression on ART to <30 copies/ml [47, 48]. Additionally, RT-SHIVmne animal models have been shown to be appropriate systems to investigate the viral genetics of persistent viremia [49]. In the current study, we used the RT-SHIVmne macaque model to investigate the sources of plasma viremia, the distribution of identical sequences in tissues, and the possibility of on-going replication in tissues during ART by analyzing single-genome sequences obtained from 11 different anatomic sites in four pigtailed macaques infected with a diverse population of RT-SHIVmne. We hypothesized that infected cells would be well dispersed across tissues, making the viral populations indistinguishable from plasma virus and implying multiple anatomic origins of persistent viremia and the lack of on-going replication in tissues [18].

Results and discussion

Viral DNA and RNA levels in plasma and tissues from RT-SHIV-infected macaques

We used a subset of samples from four macaques reported in a previous study to have persistent levels of infected cells and RT-SHIVmne expression (hereafter denoted RT-SHIV). As described previously, plasma samples were collected weekly from the 4 pigtailed macaques after inoculation with RT-SHIV and tissue samples were obtained at necropsy 30 weeks post-infection [43]. Two of the animals (6760 and 8232) were left untreated, and two (8272 and 8030) were treated daily with efavirenz, tenofovir, and emtricitabine, beginning 10 weeks post-infection, and continued until sacrifice [43]. As published by Kline et al. plasma viremia was measured weekly or biweekly throughout the original study and showed a thousand-fold difference in set point between the two untreated animals (6760–105 and 8232–102) as well as sustained suppression of viremia <30 copies/mL after 5–10 weeks of ART in both treated animals. DNA and RNA were isolated from necropsy tissues, quantified as previously reported [43] and as described briefly in the methods, and were used for single-genome sequencing in this study (Table 1, Additional file 1: Figure S1). By analyzing data from Kline et al. we found that, despite the thousand-fold difference in plasma RNA levels at the set point in the two untreated animals, the mean difference between levels of RT-SHIV DNA in their tissues was less than 2-fold. This finding suggests that the higher levels of viremia in untreated animal 6760 compared to untreated animal 8232 is more likely due to higher levels of expression of virus from infected cells rather than to a higher frequency of infected cells. This conclusion was confirmed by measuring levels of RT-SHIV RNA in the tissues in these animals, which showed a ratio of 108:1 (RNA:DNA) in animal 6760 and 8:1 in animal 8232. In the two ART-treated animals, RT-SHIV-infected cells were readily detectable but, the levels of viral RNA expression in tissues collected after 30 weeks of treatment were rarely above the limit of the assay detection, indicating that the infected cells that persisted during ART expressed few viral transcripts (Table 1).

Table 1 gag RNA and DNA copies in tissues at necropsy (week 30/31) and number of single-genome sequences obtained

RT-SHIV diversity of the challenge virus and diversity and divergence in animals 1 week post-infection vs. 30 weeks post-infection

Single-genome sequencing (SGS) of the RT-SHIV RT coding region was performed on the virus challenge stock and on plasma collected from each animal 1 week post-infection. The viral populations in each sample were analyzed using neighbor-joining trees which are capable of reliably identifying identical variants, a key part our analyses throughout the study (Fig. 1). The RT-SHIV population in the challenge stock was heterogeneous, likely due to its history of serial passage in cell culture [43], and similar sets of variants successfully established infection in all four macaques (Fig. 1). Although the populations in the plasma and the challenge stock were diverse, they did include several dominant variants that could be used to track the seeding and spread of specific variants across the tissues (Fig. 1, numbered 1–5). The similarities of the viral population after inoculation in all four animals imply that differences in viral set point were not related to the diversity of the variants that established infection. The heterogeneity early in infection in these animals resulted in populations that more closely resembled chronic HIV-1 infection in humans than homogeneous acute infections.

Fig. 1
figure 1

Phylogenetic relationships between single-genome sequences obtained from the challenge virus stock and from the plasma collected at one week post-infection from each animal. G to A hypermutants are marked by crosses. Numbers refer to clusters of sequences observed more than once here and in subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

In addition to comparing the early plasma diversity to the RT-SHIV challenge virus, we also compared the viral genetic diversity in the plasma at week 1 to the plasma diversity at week 30 in each animal to look for evidence of viral evolution over time by using phylogenetics and a test for panmixia (Fig. 2). The panmixia test is used to determine the probability that two subpopulations are derived from the same, well-mixed population. Due to the large numbers of comparisons between sequences and the possibility of sampling error, pairs of populations with panmixia probabilities of <10−3 are considered significantly different [18, 49, 50]. Plasma populations obtained at week 30 were found to be significantly different from week 1 populations in the untreated animals (p = 10−4 in animal 6760 and p < 10−10 in 8232), but not in the animals treated with ART (p = 0.08 in 8030 and p = 0.4 in 8272) (Fig. 2). We also normalized the data to account for differences in sampling between the treated and untreated animals to be sure that the panmixia test was not influenced by undersampling in the treated animals and found little difference in the panmixia results (p = 0.0005 in animal 6760 and p = 0.0003 in 8232). We accomplished this by generating a script to randomly select 20 sequences from week 1 in the untreated animals (to match the number of sequences in the treated animals at week 1) and to randomly select 7 sequences from week 30 in the untreated animals (to match the number of sequences in the treated animals at week 30), which were then tested for panmixia. This finding implies that significant levels of virus evolution due to replication occurred in the untreated animals over 29 weeks but that there was little or no RT-SHIV replication in the blood of the animals treated with ART, consistent with previous HIV-1 studies in humans [18]. The analyses showed that the change in panmixia in untreated animal 6760 was associated with increasing viral population diversity (0.3–0.5 %), corresponding to the emergence of new variants with longer branches on the trees than the variants present at week 1 (Fig. 2a). Only clones 4 and 5 were still detectable after 30 weeks of infection in animal 6760. By contrast, analysis of samples from untreated 8232 revealed evolution by evidence of a genetic bottleneck in this animal that occurred between week 1 and week 30, selecting especially for clone 4 and resulting in a decrease in viral diversity (0.4–0.2 %) (Fig. 2b). These data show that viral evolution in this model can result from the emergence of new variants, as in animal 6760, or from the selection of existing variants, as in animal 8232. Fewer sequences were obtained from the animals on suppressive ART but the sequences obtained were not phylogenetically different from the population present prior to ART initiation (Fig. 2c, d) indicating that no viral evolution could be detected in blood samples. Two additional clonal variants detected in the phylogenetic analyses in Fig. 2 that are also present in later analyses were labeled 6 and 7 for reference across figures and text.

Fig. 2
figure 2

Phylogenetic and panmixia relationships between single-genome plasma sequences obtained one week and 30 weeks post-infection from each animal. Neighbor-joining trees were prepared using consensus B as an out group (bottom, unlabeled branch) for the untreated (a, b) and treated (c, d) animals. The p values shown next to each animal identifier denote the probability that the virus populations sampled at one week and 30 weeks after infection are not different (panmixia). Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer. G to A hypermutants are marked by crosses. Asterisks indicate bootstrap values >80

Analyses of RT-SHIV proviral populations in multiple tissues of untreated animals

The genetics of the RT-SHIV DNA populations were compared across tissues after 30 weeks of infection in the untreated animals and were compared to the plasma virus at week 1 and week 30 to investigate the spread and persistence of viral variants within and between tissues and to investigate the anatomic sources of plasma viremia. The tissues included peripheral blood mononuclear cells (PBMCs), axillary and mesenteric lymph nodes (LN), small intestine (ileum, duodenum, jejunum), lung, thymus, spleen, bone marrow, and brain from one untreated animal (6760). Table 1 shows the number of DNA single-genome sequences obtained from each tissue. DNA populations were compared using (1) average pairwise distance (APD) calculations to measure the genetic diversity within each population (Table 2), (2) the test for panmixia to measure divergence between the tissue populations and the plasma virus at week 1 and week 30 (Table 2; Fig. 3), and (3) neighbor-joining analyses to investigate the clustering of sequences within and between anatomic sites (Fig. 3a, b).

Table 2 pol population diversity and divergence,between plasma (week 30) and tissues at necropsy (week 30/31)
Fig. 3
figure 3

Phylogenetic relationships between single-genome proviral sequences obtained from various anatomical compartments 30 weeks post-infection from the untreated animals a 6760 and b 8232. The panmixia probabilities comparing virus populations in the tissues (open colored circles) with sequences in the plasma (solid black circles) are shown next to each tissue in the key. G to A hypermutants are marked by crosses. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

A wide range of proviral diversity was found across the tissues (0.09–0.85 %) (Table 2), suggesting that there may be differences among tissue populations. Different tissues or anatomical compartments are likely to exert different selective pressures on viral populations, which could have dramatic effects on viral diversity. Proviral diversities in the tissues of animal 6760 increased relative to the week 1 plasma virus, while tissue diversities in animal 8232 decreased, findings consistent with those in the plasma of the same animals. These changes reflect the emergence of new variants in 6760 and the genetic bottleneck in 8232 and imply that the viral populations in the tissues and plasma virus are well mixed.

Further investigations of sequences from the untreated macaques using the panmixia test showed that although there was a wide range of diversity across the tissues; divergence measured from the plasma virus at the same time point (week 30) was significantly different for only two tissues sites: spleen in 6760 and mesenteric lymph node in 8232 (Table 2; Fig. 3). This finding indicates that the same viral variants are present in plasma and across most tissues after 30 weeks of infection and that the wide range in intra-tissue diversity must be due to either sampling error or to different frequencies of the same variants within each tissue compartment.

Neighbor-joining trees confirmed that the vast majority of proviral pol sequences obtained from different anatomical compartments were not different from each other or from the plasma virus at the same time point (Fig. 3a, b). Importantly, these data demonstrate that the evolution that occurred between week 1 and week 30 in these animals is reflected across the tissues analyzed, suggesting that these tissue compartments are more likely connected rather than compartmentalized. Single-genome env sequences were also obtained from one animal (6760) with the same finding of sequence intermingling across tissues as was found for RT (Additional file 2: Figure S2). Phylogenetic analyses also showed that the variable divergences in two tissues (spleen in 6760 and thymus in 8232) were due to differing frequencies of identical sequences between tissues and plasma; specifically, differences in frequencies of clone 4. Clone 4 was under-represented in the spleen in animal 6760 and in the mesenteric LN in animal 8232 resulting in statistically “divergent” populations. This observation was confirmed by collapsing all identical sequences to single variants, resulting in the loss of significant divergence by panmixia. The different frequencies of these variants across some tissues could be due to sampling error, differential seeding of variants from the challenge virus stock, or from clonal expansion of infected cells. Of note, clones 1, 3, 4, and 5 persisted in the tissues in animal 8232 after 30 weeks of infection whereas these variants (except 4) disappeared from the plasma due to a genetic bottleneck that occurred with the section of clone 4 (only reaching significance in the mesenteric LN). Clones 1, 4, and 5 remained detectable in at least some of the tissues of animal 6760 (Fig. 3a), while only clones 4 and 5 were present in the plasma (reaching significance in the spleen). Only 5 sequences were recovered from the brain tissue samples. All 5 sequences were obtained from the cerebrum of the untreated animal, 6760 with the highest viral set point. Three of the 5 variants matched those found in other tissues while two were unique but closely related (Fig. 3a). These findings show a lack of significant compartmentalization of virus in the brain of this pigtailed macaque, consistent with that observed across other tissues (albeit with limited sampling in this case). This observation is distinct from that in humans where clear evidence for HIV-1 compartmentalization has been shown in cerebral spinal fluid compared to blood in some individuals [29]. In total, the DNA populations in the tissues differed significantly only by the variable frequencies of identical sequences, indicating that viral variants and/or infected cells are well mixed across tissues. Plasma virus contained subpopulations that were not different from the proviral populations in the tissues examined. This finding strongly suggests that viremia is most likely derived from infected cell populations that have spread across most, if not all, tissues in the body.

Analyses of RT-SHIV intracellular RNA populations in tissues of untreated animals

To characterize the diversity and divergence of RT-SHIV variants expressed in different tissues and to further investigate the sources of plasma viremia, we obtained intracellular viral RNA sequences and compared these populations to those obtained from plasma (Fig. 4) and to the proviral populations from the same tissues (Figs. 5, 6). Table 1 shows the number of RNA sequences obtained from each tissue. Neighbor-joining trees of intracellular RNA sequences obtained across tissues from the untreated animals are shown in Fig. 4. Similar to the viral DNA populations, intracellular RNA variants in the untreated animals intermingled between tissue and plasma sequences and showed no divergence by panmixia (Table 2), revealing a lack of anatomic compartmentalization and a lack of isolated, local evolution or expression in these animals (Fig. 4). These data support our conclusions from the proviral sequence analyses that the infected cells are well mixed across anatomic compartments and that plasma virus could be derived from multiple anatomic sites. We also found that the expressed populations contained clusters of identical sequences (Fig. 4), suggesting either that multiple cells carrying identical variants are expressing viral RNA (likely the case for previously identified clones such as clone 4) or that a single cell per tissue is undergoing high levels of viral expression in comparison to other infected cells (possibly the case for identical sequences that were not found to be clonal in the DNA, such as those labeled 7, 9, and 10 in Fig. 4).

Fig. 4
figure 4

Phylogenetic relationships between intracellular RNA single-genome sequences obtained from different anatomical compartments 30 weeks post-infection from untreated animals 6760 and 8232. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

Fig. 5
figure 5

Phylogenetic relationships between intracellular RNA and DNA single-genome sequences obtained from the same anatomical compartments 30 weeks post-infection from untreated animal 6760. p values for panmixia are shown where more than 7 sequences were obtained for both RNA and DNA. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

Fig. 6
figure 6

Phylogenetic relationships between intracellular RNA and DNA single-genome sequences obtained from the same anatomical compartments 30 weeks post-infection from untreated animal 8232. p values for panmixia are shown where more than 7 sequences were obtained for both RNA and DNA. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

Direct phylogenetic and panmixia comparisons of intracellular RNA and proviral DNA populations from untreated animals 6760 and 8232, including expression from cells carrying identical sequences, are shown in Figs. 5 and 6. Clonal variants were expressed at varying levels in different tissues, ranging from undetectable to high levels of expression. Phylogenetic relationships between intracellular RNA and DNA single-genome sequences obtained from the same anatomical compartments after 30 weeks of infection from untreated animal 6760 showed no structural differences (Fig. 5), indicating that the infected cells expressing viral RNA are diverse, likely contributing to the high level of plasma viremia in this animal. Panmixia p values are shown where more than 7 sequences were obtained for both RNA and DNA and, with one exception, show a lack of significant probability that the RNA population is different from the DNA population (Fig. 5). This finding suggests that, in the absence of ART, a large fraction of the infected cells across all tissues appear to be expressing viral RNAs. By contrast, phylogenetic analyses of intracellular RNA and DNA from the same anatomical compartments after 30 weeks of infection in untreated animal 8232 with a low viral set point (1000-fold lower than in animal 6760) showed a significant difference in panmixia in some of tissues analyzed (Fig. 6). The difference was primarily due to dominance of viral expression from clonal proviruses 4 and 8, shown in the trees as RNA sequences matching the DNA variants and identified with their clonal reference numbers. This finding suggests that a smaller proportion of infected cells were expressing RT-SHIV RNA in this animal compared to animal 6760, consistent with the lower level of plasma viremia in animal 8232. Analyses of intracellular RNA populations revealed that the majority of proviruses in tissues from 8232 were not likely expressed, whereas tissues from 6760 had a greater proportion of proviruses being expressed. These data suggested two important findings: (1) compartmentalization of proviral populations in the tissues compared to viral populations in the blood was only very rarely statistically significant, and (2) when rare compartmentalized proviral populations did occur, the variants that were expressed were not different from the virus population in plasma. These findings are important because they imply that exchange of virus and/or infected cells between the blood and the tissues may be frequent and that viral evolution may not occur in isolation in the tissues in the animal model studied. Although local viral spread may occur, it appears to be in tandem with the mixing of variants between all the tissues and the blood.

Analyses of RT-SHIV proviral populations in multiple tissues of animals treated with ART

The genetics of the RT-SHIV DNA populations were compared across tissues and to the plasma virus at week 1 and week 30 in the two macaques treated for 20 weeks with combination ART to investigate the spread and persistence of viral variants within and between tissues and to investigate the anatomic sources of persistent viremia during ART. Table 1 shows the number of DNA single-genome sequences obtained from each tissue in the two treated macaques, 8030 and 8272. DNA populations were compared using (1) APD calculations to measure the genetic diversity within each population (Table 2), (2) the test for panmixia to measure divergence between the tissue populations and between tissues and plasma (Table 2), and (3) neighbor-joining analyses to investigate the clustering of sequences within and between anatomic sites and with plasma virus (Fig. 7a, b).

Fig. 7
figure 7

Phylogenetic relationships between single-genome proviral sequences obtained from various anatomical compartments after 20 weeks of combination ART from animals a 8272 and b 8030. The panmixia probabilities comparing virus populations in the tissues (open colored circles) with sequences in the plasma (open black circles) are shown next to each tissue in the key. G to A hypermutants are marked by crosses. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer. Asterisks indicate bootstrap values >80

As in the untreated animals, a wide range of proviral diversity was found across the tissues (0.14–1.19 %) with no overall reduction after 20 weeks of therapy (Table 2). This finding suggests that there may be differences among the populations between tissues (maybe due to different frequencies of G to A hypermutants) and that this distribution is not affected by short-term combination ART. A lack of change in viral diversity with short-term therapy is consistent with our previous observation of plasma virus in HIV-infected humans on ART [18] and in RT-SHIV-infected macaques on suppressive ART [18]. Further investigation of sequences from the treated macaques using the panmixia test showed that, although there was a wide range of diversities across the tissues, the populations were not significantly divergent from each other or from the plasma virus at the same time point (Table 2; Fig. 8). The panmixia results suggest a lack of compartmentalization across the tissues.

Fig. 8
figure 8

Phylogenetic relationships between intracellular RNA and DNA single-genome sequences obtained from the same anatomical compartments after 20 weeks of combination ART from animal 8272. p values for panmixia are shown where more than 7 sequences were obtained for both RNA and DNA. Numbers refer to clusters of sequences observed more than once here and in previous and subsequent figures. Symbols shown in boxes indicate identical sequences that are present on the tree at the location indicated by the pointer

Neighbor-joining trees revealed that proviral pol sequences obtained from different anatomical compartments intermingled among themselves, consistent with the panmixia results of no divergence, and among sequences obtained from plasma at the same time points (Fig. 8). The finding that the DNA populations in the tissues differed only by the variable frequencies of identical sequences, especially for the numbered clones, indicates that viral variants and/or infected cells are well mixed among all tissues analyzed. The differing frequencies of identical sequences are likely due to varying levels of persistence and/or proliferation of infected cells. Clone 1 remained dominant in the plasma from both treated animals, as it was in the virus challenge stock, but it did not persist in the untreated animals. Plasma virus collected during ART contained subpopulations that were not different from the proviral populations in the tissues examined, including the gastrointestinal tract, which has been proposed to be a sanctuary site for virus replication during ART [32, 51]. This finding suggests that persistent viremia during ART is most likely derived from multiple anatomic sites.

Analyses of RT-SHIV intracellular RNA populations in tissues of animals treated with ART

To characterize the diversity and divergence of RT-SHIV variants expressed in different tissues and to further investigate the sources of persistent plasma viremia during ART, we sequenced viral intracellular RNA variants and compared these populations to those in the plasma and to the proviral populations in the tissues of ART-treated animal 8272 (Fig. 7). The same analysis could not be done for animal 8030 due to the low numbers of intracellular RNA sequences obtained. Table 1 shows the number of RNA sequences obtained from each tissue from both treated macaques. In contrast to the untreated macaques, few intracellular RNA sequences were detected in animals treated with ART and those that were detected contained frame shift mutations or large deletions, indicating that they were not replication competent. Furthermore, these mutant RNAs did not cluster with plasma virus or the proviral DNA detected in the tissues. This finding suggests that a minority of the infected cells that persist during ART express viral RNA and those RNAs that are actively expressed are frequently defective (Fig. 7), explaining why only very low levels of plasma viremia persist during ART. Most likely, the majority of productively expressing RT-SHIV-infected cells die within 20 weeks of treatment with combination ART, leaving primarily latently infected cells or cells that contain defective proviruses. Although all the expressed RNA sequences detected in the treated animals were defective, no defects were found in the genomes of virus that persisted at very low levels in plasma during ART in these animals, revealing that some infected cells expressing variants that may be replication competent also persist during ART but at levels below the detection limit of our sampling. Similarly, replication competent HIV-1 variants have been reported to persist in HIV-infected humans treated with ART [52], and are likely to be present at some level in most if not all treated patients [53]. In addition to the expressed variants being defective, we also found that the expressed populations contained clusters of identical sequences (Fig. 7). Populations of identical intracellular RNA sequences suggest either that multiple cells carrying identical variants were expressing viral RNA or that single cells (or their descendants) were undergoing high levels of viral expression in comparison to other infected cells. The finding that the identical variants expressed during ART were all defective implies that either their expression is from a proliferating cell population or from a single cell, but not from local spread through viral replication.

Conclusions

The reservoir for HIV-1 during ART must consist of cells containing replication competent proviruses that persist and sometimes proliferate despite therapy and result in rebound viremia when ART is stopped. The number, identity, distribution, and expression levels of these persisting cells are currently unknown. Our findings show that the reservoir for RT-SHIV likely includes infected cells that are well mixed in different anatomic sites and that plasma virus may be derived from multiple tissue compartments. These findings are important because they imply that exchange of virus and/or infected cells between the blood and many tissues is frequent and that viral evolution does not likely occur and persist in isolation in any specific tissues investigated here, including but not limited to PBMC, lymph nodes, gastrointestinal tract, and spleen. Furthermore, the results of this study show that genetic analyses on plasma virus appear to reflect what is taking place in the tissues in both the untreated and treated animals. Consequently, if viral replication during ART is on-going in the gut, as has been proposed, we would expect the newly emergent variants to be detectable in the plasma during highly suppressive ART. Such an occurrence was not found here at the level of detection provided by single-genome sequencing. Lastly, we found RNA expression of defective proviruses in both treated and untreated macaques as well as a possible correlation between the levels proviral expression and plasma virus load in the untreated animals. In conclusion, these studies strongly suggest that the reservoir for RT-SHIVmne during ART is broadly disseminated throughout the tissues, that infected cells persisting and/or proliferating during ART only rarely express viral products and, when they do, the genomes are often defective, and that the genetics of residual viremia during ART reflect the state of expression in the gut and lymphoid tissues as well as the blood.

Methods

Virus

RT-SHIVmne is a pathogenic SIV/HIV chimeric virus in which SIVmne RT is replaced by HIV-1HxB2 RT [48]. The challenge stock was grown in CEMx174 cells and was titered on TZM-bl cells. The virus was previously shown to be replication competent and pathogenic in pigtailed macaques (Macaca nemestrina) as a useful in vivo model to study antiretroviral suppression and emergence of drug resistance [47].

Animals

Blood and tissue samples were obtained from pigtailed macaques infected for a study previously published [43]. Briefly, male pigtailed macaques were infected intravenously with approximately 1 × 105 infectious units of RT-SHIVmne. All animals were housed at the National Institutes of Health (NIH) in accordance with the American Association of Accreditation of Laboratory Animal Care standards and all procedures were performed according to protocols approved by the Institutional Animal Care and Use Committee of the National Cancer Institute. Two animals were left untreated for 30 weeks post-infection and two received daily ART beginning at 10 weeks of infection until necropsy: 20 mg/kg tenofovir and 40 mg/kg emtricitabine were administered subcutaneously and 400 mg EFV was administered orally in treats. Blood was drawn weekly or biweekly, from which plasma and PBMC were separated. At week 30 post-infection, animals were euthanized and necropsies were performed as previously described [43].

Viral and host cell DNA and RNA isolation and measurement in PBMC and tissues

Virus was pelleted from EDTA-anticoagulated plasma taken at each time point for each animal and quantitative RT-PCR (qRT-PCR) was performed to determine the number SIV RNA (gag) copy equivalents per ml (copy Eq/ml) of plasma essentially as previously described (Cline et al. 2005). The assay limit for quantitation was 30 copy Eq/ml of plasma. Total DNA (viral and genomic) was extracted from tissues with lysis buffer (Nuclei lysis solution, Promega, Madison, WI, USA) after extensive washing using a TissueLyser (Qiagen, Valencia, CA, USA). Tissues were processed and nucleic acids extracted as previously described [43]. Briefly, tissue DNA was extracted using the Wizard genomic purification kit (Promega) and PBMC DNA was extracted using the Blood DNA kit (Qiagen). qPCR (gag) assays were performed as described previously with a limit of detection of 1 copy/sample [43].

Viral and host cell RNA were extracted from tissues stored in RNAlater. Tissues were rinsed with nuclease-free water and homogenized using a TissueLyser in the presence of lysis buffer (RTL, Qiagen) and 20 U of RNase inhibitor (Ambion). RNA was extracted from lymphoid tissues and lung using the RNeasy kit (Qiagen). As this method did not yield good RNA recovery from gastrointestinal tract and brain tissues, TRI Reagent (Sigma-Aldrich, St. Louis, MO, USA) was used to extract RNA from brain and gut tissues, as previously described (10) [54]. Purified RNA was treated with 10U RNase-free DNase I (Roche, Indianapolis, IN, USA) and then precipitated and washed again using the RNeasy kit. qRT-PCR assays were performed as previously described [43].

Single-genome sequencing and genetic analyses

SGS of HIV-1 pol was performed as previously described [49]. Sequences were aligned using ClustalW. Population genetic diversity was calculated as average pairwise difference (APD) using MEGA4 (http://www.megasoftware.net) and an in-house program [55]. Shifts in population structure were calculated using a subdivision test for panmixia with a significance cut off level of p < 10−3 as described by the original report to account for the high number of comparisons between sequences and nucleotide sites (1, 34, 23). The probability of 10−3 for assigning a significant change in viral populations obtained from single-genome sequencing was derived statistically taking into consideration that every nucleotide position is compared in every two possible sets of sequences. This approach results in more than 1012 comparisons between populations of only 10 sequences. Neighbor-joining phylogenetic analyses were done using MEGA4. Trees were rooted on the subtype B consensus sequence (http://www.HIV-1.lanl.gov) shown as the lowest (unmarked) branch of each tree.