Background

The human immunodeficiency virus is the causal agent of AIDS. AIDS morbidity and mortality have led to efforts to identify effective inhibitors of the replication of this virus. Viral replication is driven by a molecular motor consisting of the three viral enzymes: the reverse transcriptase, protease and integrase (IN). The genomic RNA of the virus is used to produce a copy of viral DNA by reverse transcription, and the last of these enzymes, integrase, catalyses the covalent insertion of this DNA into the chromosomes of the infected cells. Once integrated, the provirus persists in the host cell and serves as a template for the transcription of viral genes and replication of the viral genome, leading to the production of new viruses. Integrase possesses two major catalytic activities: an endonucleolytic cleavage at each 3'-OH extremities of the viral genome, named 3'-processing, and a strand transfer reaction leading to the insertion of the processed viral DNA into the target DNA by a trans-esterification mechanism. These catalytic functions of the integrase are essential for the overall integration process and have thus been the object of intensive pharmacological research. Since the end of the 1990s, several inhibitors with genuine antiviral activity have been identified and developed. Two of these compounds – MK-0518 or raltegravir and GS9137 or elvitegravir – have shown great promise and should ensure that integrase inhibitors rapidly become an important class in the arsenal of antiretroviral drugs (ARVs) available [1]. In addition to 3'-processing and strand transfer, IN may efficiently catalyse other reactions: a third reaction, named disintegration, corresponds to the apparent inverse reaction of the strand transfer [2] although it is not clear whether it may occur in the cell context. More recently, a specific and internal cleavage catalysed by the full-length IN has been observed in vitro [3]. This reaction requires a symmetrical organisation of the DNA substrate as well as a tetrameric organisation of the protein. From a structural point of view, this reaction is related to the endonucleolytic reaction of a restriction enzyme.

In vivo, the integrase oligomer and viral DNA molecule form part of a preintegration complex (PIC), our knowledge of which remains limited. The reverse transcriptase (RT), matrix protein (MA), Vpr and the nucleocapsid protein (NC) are also present in this complex as well as cellular partners [47]. The presence of an intact integrase is required for the stabilisation of preintegration complexes and their transport into the nucleus: These non catalytic functions of IN are also crucial for the viral replication cycle. Indeed, a functional interaction between IN and RT has been observed, suggesting that IN is involved, at least indirectly, in controlling the synthesis of viral DNA [810]. Furthermore, the interaction of particular IN structures with one or several cellular cofactors plays a key role for the integration into host cell chromosomes. For instance, LEDGF/p75 acts as a chromatin tethering factor for IN [11, 12]. All these observations pave the way for the development of inhibitors targeting the interactions between IN and either viral or cellular cofactors. These alternative functions may constitute useful targets for the future development of integrase inhibitors.

Integrase

Integrase is a 288-amino acid protein (32 kDa) encoded by the end of the pol gene. It is produced as part of the Gag-Pol polypeptide precursor, from which it is released by viral protease-mediated cleavage. It has three independent domains: (i) The N-terminal domain (amino acids 1–49) that carries an HHCC motif analogous to a zinc finger, and effectively binds Zn2+ [13], possibly favouring protein multimerisation, a key process in integration [13, 14]. (ii) The central domain or catalytic domain (amino acids 50–212) encompassing a D, D-35, E motif which is indispensable for the catalytic activity and which is conserved between viral IN and transposases. This central domain is also implicated in the binding of the viral DNA extremities mainly via the residus Q148, K156 and K159 [1519]. All integrase activities strictly require the presence of a metallic cationic cofactor which is coordinated by two residues of the catalytic triad (D64 and D116 for HIV-1 IN) [20, 21]. (iii) The C-terminal domain (amino acids 213–288) binds non-specifically to DNA and therefore is mainly involved in the stability of the complex with DNA. No complete structure has yet been determined for the integrase protomer (IN1–288), or for oligomers or complexes of these structures with DNA, due to poor solubility and interdomain flexibility problems. However, several structures of isolated domains or of two consecutive domains have been reported [2025].

Integrase functions in a multimeric form, as shown by complementation experiments: mixtures of proteins, each individually inactive, were found to be active [2628]. For example, an inactive catalytic triad mutant can be complemented by an inactive integrase truncated at its C-terminal end. Such a functional complementation can be observed in virions [29]. In addition, the factors promoting integrase multimerisation such as Zn2+ also stimulate the specific Mg2+-dependent activity of the enzyme [14], indicating that functional enzyme is multimeric. Dimers form at either end of the viral DNA molecule. These dimers are responsible for 3'-processing activity [3034]. Pairs of dimers bring together the two ends of the viral DNA and leads to the formation of a tetramer (dimer of dimer), the active form for concerted integration [35, 36]. During its catalytic cycle, IN must bind simultaneously to the viral substrate DNA and the target DNA. Current knowledge of the organisation of this tetramer onto DNA is based exclusively on models constructed from partial structural and biochemical (cross-linking and site-directed mutagenesis) data [24, 3740]. In a recent model, an IN tetramer is bound to the two ends of the viral DNA, i.e. LTRs (Long Terminal Repeat), and to a 26 base pairs host DNA molecule in the presence of Mg2+ [40]. This model takes into account the structural constraints deduced from the model of the complex formed between DNA and a related enzyme, the Tn5 transposase, and the observation that the two ends of the viral DNA are integrated five base pairs apart, corresponding to a distance of about 16 Å. This model may provide a platform for the rational design of new inhibitors. It is important to note that most of these models support a symmetrical form of IN for concerted integration. However, recently, Ren et al. have proposed an asymmetric tetramer/DNA model for the concerted integration suggesting that at least a reaction intermediate could be asymmetric [39].

The catalytic activities of integrase (IN)

3'-processing and strand transfer

There is now substantial virological evidence that the precursor of integrated viral DNA, or provirus, is a linear viral DNA generated by reverse transcription of the viral genome. Two reactions are required for the covalent integration of viral DNA into the host DNA. The integrase (IN) first binds to a short sequence at each end of the viral DNA known as the long terminal repeat (LTR) and catalyses an endonucleotide cleavage known as 3'-processing, in which a dinucleotide is eliminated from each end of the viral DNA (Fig 1). The resulting cleaved DNA is then used as a substrate for integration or strand transfer leading to the covalent insertion of the viral DNA into the genome of the infected cell (Fig 1). This second reaction occurs simultaneously at both ends of the viral DNA molecule, with an offset of precisely five base pairs between the two opposite points of insertion.

Figure 1
figure 1

Catalytical activities of HIV-1 integrase. The catalytical activities 3'-processing (A), strand-transfer reaction (B), disintegration (C) and palindrome cleavage (D) are represented. The domains of the protein responsible for these activities are depicted in the table above.

These two reactions also occur in vivo in a sequential manner. The two reactions are also energetically independent. In both cases, the reaction is a single-step trans-esterification involving the disruption of a phosphodiester bond by nucleophilic attack. In the first reaction, the bond concerned is part of the viral DNA molecule and in the second, the bond is in the target DNA. There is therefore no covalent intermediate between the enzyme and the DNA as it is observed during catalytic reaction of topoisomerase or IN of lambda phage, for example. The removal of the dinucleotides from the 5' overhang, of viral origin, and DNA repair (i.e. polymerisation and ligation) are required to complete the full integration reaction. One study suggested that this might involve a DNA-dependent DNA polymerase activity of the IN [41], but, to date, such a polymerase activity of IN was not confirmed and it is generally thought that this DNA repair is performed by cellular mechanisms that can be reproduced in vitro with purified host cell factors [42]. The final reaction thus results in a viral DNA molecule, the provirus, integrated into and collinear with the genomic DNA, with a characteristic 5 base pairs duplication (in the case of HIV-1) of genomic sequence flanking the integration site. Several lines of evidence support a non-random integration with preferential integration in transcription units for HIV-1 [43]. Integration is then mainly directed by interactions between the pre-integration complex and chromatin. From a DNA sequence point of view, it was recently shown that integration occurs preferentially within symmetric sequences [4446] (see # 2.3).

Both reactions (3'-processing and strand transfer) can be reproduced in vitro using short double-stranded oligonucleotides mimicking the sequence of the ends of the viral LTR U5 or U3 in the presence of a recombinant integrase [47]. 3'-processing is a highly specific reaction. This reaction involves the removal of a dinucleotide, adjacent to the highly conserved CA dinucleotide, from the 3' strand of the U3 and U5 viral DNA LTRs. Mutations in this sequence completely abolish activity, whereas the integrity of flanking sequences is much less important [15, 48]. The 3'-processing reaction corresponds to a nucleophilic attack by a water molecule. However, other alternative nucleophilic agents can be used such as glycerol but generally conduct to non-specific endonucleolytic cleavage [4951]. This mainly occurs when Mn2+ is used. The 3'-OH of the unprocessed DNA can also be used directly as a nucleophilic agent leading to 3'-5' cyclic dinucleotide product [49]. The use of the physiological relevant cofactor Mg2+ improves the specificity of the cleavage with water as the mainly used nucleophilic agent.

During the same reaction, IN can catalyse, with a modest yield, the strand transfer. In the strand transfer reaction, the nucleophilic agent corresponds to the 3'-OH extremity of the processed strand. It is possible to increase the yield of the strand transfer with pre-processed oligonucleotides [36]. By using an oligonucleotide mimicking one LTR end, only a half-transfer reaction can be observed. In vitro, long DNA fragments with two viral extremities can be used to reproduce the concerted integration process which corresponds to the simultaneous integration of two viral ends [35, 36, 52, 53]. Concerted integration appears less tolerant to reaction conditions, i.e. enzyme preparation and oligomerization state than strand transfer. Although it was shown by different groups that IN alone is sufficient to catalyse the concerted integration, viral or cellular protein, acting as cofactors for the integration process, such as the viral nucleocapsid protein NC [54] and the cellular proteins HMG I(Y) [55] and LEDGF [5658] may increase its efficacy. Interestingly, it was recently shown that, in contrast to the half-transfer reaction, a higher reaction yield was obtained for the concerted integration starting from a blunt-ended as compared to a pre-processed DNA substrate [36]. Furthermore, activity of IN is strongly dependent on its oligomeric state [14, 47, 59]. In contrast to 3'-processing which requires the dimeric form of IN [31], it was shown that concerted integration requires a tetrameric organization [32].

Both the 3'-processing and strand transfer reactions require a metallic cofactor. This cofactor may be Mn2+ or Mg2+, but Mg2+ is preferentially used in vivo. Indeed, there is considerable experimental evidence to suggest that Mg2+ is more physiologically relevant, particularly as the specificity of the reaction is much greater in the presence of this cation: (i) IN displays strong non-specific nuclease activity in the presence of Mn2+ [60, 61]. (ii) The tolerance of sequence variation at the ends of the viral DNA molecule is much greater in the presence of Mn2+ than in the presence of Mg2+ [15, 48]. (iii) Many IN mutations remain silent in the presence of Mn2+ but not in the presence of Mg2+. For example, mutations of the HHCC domain that are deleterious to the virus in vivo affect 3'-processing and integration activities in in vitro tests using Mg2+, but have no such effect in tests using Mn2+ [62, 63]. Furthermore, zinc has no stimulatory effect on IN activity when using Mn2+ as a cofactor while zinc stimulates the Mg2+-dependent activity [14]. In the Pearson Hard-Soft Acid-Base theory (HSBA), hards metal ions such as Mg2+ (with d0 electron configuration) are characterized by electron clouds which are not easily deformed, in contrast to soft metals ions such as Mn2+, with direct consequences on the active site plasticity and reaction specificity for many metal-dependent enzymes when comparing their activities under either Mg2+ or Mn2+ context. The presence of Mg2+ generally leads to more stringent conditions for catalysis in term of reaction specificity as found for RAG1/2 proteins [64], Tn10 transposase [65], RNase H activity [66]. HIV-1 integrase also displays such a differential qualitative behaviour between Mg2+ and Mn2+-dependent catalysis. It was also reported that IN/DNA complexes display different stabilities depending on the cofactor context with IN/DNA complexes being more stable in the presence of Mn2+ than in the presence of Mg2+ [6769]. Such a differential stability of complexes is generally observed using IN purified in the presence of detergent and accounts for quantitative differences in term of enzymatic activity when comparing Mg2+ and Mn2+. Indeed, IN from detergent-containing preparations displays more Mn2+-dependant than Mg2+-dependant activity as compared to detergent-free preparations that quantitatively display similar activities. The difference between cofactors has pharmacological implications, as the apparent efficacy of various IN inhibitors differs between tests using Mg2+ or Mn2+ as a cofactor [7072], and the effects of mutations conferring drug resistance are often detectable only in tests using Mg2+ as the cofactor [73]. These considerations have led to the use of chemical groups chelating Mg2+ in the rational design of integrase inhibitors. Such groups are present in all the inhibitors developed to date, including raltegravir and elvitegravir [1].

Whatever the activity tested, IN is characterized by an overall slow cleavage efficiency. Furthermore, IN form stable complexes with both DNA substrate and DNA product, limiting multiple turnover [74]. Taken together, these features resemble to those observed for other polynucleotidyl-tranferases such as transposases. These enzymes share a peculiar enzymatic property: they have evolved to catalyse multi-sequential steps (two reactions for IN and four for Tn5 transposase) in a single active site. A multi-sequential reaction requires a strong binding of the enzyme to the DNA product after each chemical step to optimise the entire process but consequently diminishes the overall enzymatic efficacy in term of turnover. However, this weak catalytic activity is not detrimental for these enzymes in the cellular context, because a single event of integration or transposition is sufficient for the overall function. In vivo, this tight binding of IN to the viral processed DNA most likely allows the complex to remain associated after the 3'-processing reaction long enough for subsequent integration. Two strategies have been considered for the development of IN inhibitors: screening using the unbound protein (before complex formation) or screening with the preformed IN-viral DNA complex. The success of these two approaches has been demonstrated by the identification of (i) inhibitors of 3'-processing targeting the DNA free enzyme and blocking its binding to the viral DNA [75] and (ii) inhibitors of strand transfer targeting the preformed complex more related to the preintegration complex (PIC) [76]. These two families of compounds are respectively called INBI (IN DNA-Binding Inhibitors) and INSTI (IN Strand Transfer Inhibitors) (Fig 2). Since the early 1990s, a number of compounds inhibiting either 3'processing or strand transfer have been identified in vitro [77, 78]. The great stability of the PIC and its presence in the cell throughout most of the preintegration steps make this complex the most suitable target. Unfortunately, most of the INBI compounds are inactive on the preformed complexes. Indeed only strand transfer inhibitors or INSTIs have been shown to be potent antiviral compounds. As they selectively target the preformed IN-viral DNA complex and inhibit the binding of the acceptor DNA (i.e. target DNA or host DNA), INSTI compounds selectively inhibit the strand transfer reaction and have no effect on the 3'-processing reaction [79]. One such compound, Raltegravir (Isentress@), which was developed based on early studies by Hazuda et al. [76], was approved for clinical use in Autumn 2007 as the first antiretroviral drug (ARV) targeting the viral integrase (IN). This inhibitor act by binding to the IN-viral DNA complex, close to the 3' end of the donor DNA, thereby selectively blocking the strand transfer step; the IC50 values are in the nanomolar range both in vitro and ex vivo with a high therapeutic index [80]. Unfortunately, variants of the virus resistant to this inhibitor have already been reported [80]. The emergence of resistant virus in vivo should prompt both a search for new INSTIs and reassessment of the potential inhibitory activity of INBIs (such as styrylquinolines or SQL) which have been shown to be inhibitors of 3'processing in vitro with significant inhibitory activity against viral replication in cell cultures (Fig 2) [81].

Figure 2
figure 2

Some anti-integrase compounds. Styrylquinoline, a member of the INBI (IN DNA-Binding Inhibitors) compound and beta dicetonic acid, Raltegravir and Elvitegravir, members of the INSTI (IN Strand Transfer Inhibitors) compounds, are represented.

The presence either of the catechol or an another group on the SQLs able to form a complex of coordination with a divalent ion suggests that these compounds interact with the active site of the enzyme by a chelation with the metallic cofactor. These compounds are mainly inhibitors of the 3'-processing reaction, and their mechanism of action in vitro can be assimilated to a competitive mechanism. Recently, experiments based on fluorescence anisotropy demonstrated that SQLs are DNA-binding inhibitors of HIV-1 IN [75]. In summary, INBI compounds primarily compete with the binding of the donor DNA (viral DNA) while INSTI compounds compete with the binding of the acceptor DNA (target DNA). However, the mechanism of inhibition of SQLs in the cell context is not completely understood. These compounds appear to act at steps prior to integration, more particularly during RT [82] and nuclear import [83]. These effects are mediated by IN as evidenced by the appearance of resistance mutation in IN sequence. It is then suggested that, ex vivo, non catalytic region of IN are targeted by SQs (see paragraph "non catalytic role of IN"). It is interesting to note that the two classes of IN inhibitors, INBI and INSTI, induce distinct resistant mutations [76, 82, 8486].

Disintegration

A third reaction, disintegration, is observed in vitro (Fig 1). Disintegration may be considered to be the reverse of the strand transfer reaction [2]. Unlike the 3'-processing and strand transfer reactions which requires the full-length protein, the disintegration reaction can be catalysed by the catalytic domain alone (IN55–212) or by truncated proteins, IN1–212 or IN55–288 [47, 87, 88]. This activity was widely used for testing the competitive mechanisms of certain inhibitors. There is currently no experimental evidence to suggest that this reaction occurs in vivo.

A new internal specific activity

Recently, our group has identified a new internal and specific cleavage activity in vitro of HIV-1 IN [3]. Until now, all attempts to study a specific internal endonucleolytic cleavage in vitro have failed. Vink et al. have demonstrated that when the CA dinucleotide, indispensable for the 3'-processing, was separated by more than 2 nucleotides from the 3'-OH end, the activity was dramatically impaired [89]. Nevertheless, we have demonstrated that oligonucleotides mimicking the palindromic sequence found at the LTR-LTR junction of the 2-LTR circles (found in infected cells) were efficiently cleaved at internal positions by HIV-1 IN, with cleavage kinetics comparable to the 3'-processing reaction (Fig 1). This reaction occurs symmetrically on both strands, with a strong cleavage at the CA dinucleotide (corresponding to the CA sequence used for the 3'-processing reaction). A second weaker cleavage site appears after the next adenine (TA sequence) in the 5'-3' direction. Furthermore, HIV-1 IN can efficiently cleave a plasmid mimicking the 2-LTR circles specifically at the LTR-LTR junction. The specificity of this reaction is similar to the one catalysed by transposases which cleave the DNA substrate after a CA or TA dinucleotide [90]. Such internal cleavages are not observed using a mutant of the catalytic site (E152A) testifying that the DDE triad is also implicated in this reaction. In addition, this novel activity is stringent and highly specific as (i) it occurs with the physiological metallic cofactor (Mg2+) and not only Mn2+, (ii) only the full-length IN is competent for the internal cleavage of the palindrome, in contrast to the disintegration reaction that is efficiently catalysed by truncated proteins such as IN55–212, IN55–288, IN1–212 and (iii) it does not sustain any mutation in the sequence of the LTR-LTR junction. Furthermore, the cleavage of the LTR-LTR junction requires the tetrameric forms of IN whereas the 3'-processing reaction is efficiently catalysed by a dimer [31, 32]. This new activity seems to be generalised to other retroviral IN as reported earlier for PFV-1 IN [91, 92]. However, although PFV-1 IN performs this cleavage activity, it is important to note that both IN are strictly restricted to their own cognate palindromic sequence: HIV-1 IN is unable to cleave the PFV LTR-LTR junction and PFV-1 IN is unable to cleave the HIV-1 LTR-LTR junction.

Recently, mapping of extensive integration sites, notably for HIV-1, put in light the existence of a weak palindromic consensus [4446]. It is important to note that the sequence of the weak palindromic consensus is similar, although not identical, to the one found at the LTR-LTR junction. This specific endonucleolytic activity on a palindromic LTR-LTR junction as well as the symmetrical organization of integration sites reveal a common structural feature of IN: IN intrinsically prefers to bind to symmetric DNA sequences. Moreover, we have found that tetramers catalyses the cleavage of the palindromic sequence while others have suggested that the same oligomeric form is responsible for the concerted integration in the context of the synaptic complex [35, 36]. Therefore, one could reasonably imagine that the same multimeric organization of IN (i.e. the tetrameric form) is stabilised by a corresponding symmetry at the DNA level, either at the viral DNA (LTR-LTR junction) or at the target level (integration sites).

In vivo, unintegrated viral DNA could represent 99% of total viral DNA in infected cells [93] underlying that integration is a rare event. Un-integrated DNA is mainly linear but also circular – 1-LTR or 2-LTR circles. In the absence of integration (for example using strand-transfer inhibitors such as diketo acids), at least the 2-LTR circular forms of viral DNA, which are usually believed to be dead-end molecules, are accumulated [94]. It is tempting to speculate about a possible role of 2-LTR circles in a subsequent integration process after removing the drug pressure, mediated by the ability of IN to cleave the LTR-LTR junction. However, to date, although IN is able to cleave the LTR-LTR junction in vitro, there is no proof that such a cleavage can occur in vivo and thus that 2-LTR circles could be an efficient precursor for integration.

Modulation of IN activity

Several cellular and viral proteins have been reported to stimulate IN activities in vitro as well as in vivo. Among these cofactors, some proteins are known to interact directly with IN and thus enhance its solubility or favours an active conformation of IN, while other proteins do not physically interact with IN but could indirectly stimulate IN activities as found for proteins playing a structural role on DNA conformation.

For instance, in the group of IN interactors, the yeast chaperoning protein, yHSP60, was described by Parissi and colleagues to interact directly with HIV-1 IN [95]. It has also been demonstrated that the human counterpart of the yHSP60, hHSP60, was able to stimulate the in vitro processing as well as joining activities of IN, suggesting that hHSP60-IN interaction could allow IN to adopt a more competent conformation for activity or prevent IN from aggregation [95]. However, further investigations must be done to confirm the potential role of HSP60 in the viral life cycle.

LEDGF/p75, Lens Epithelial Derived Growth Factor, has been reported to interact with IN and stimulate both concerted integration and strand transfer. Addition of recombinant LEDGF/p75 to an in vitro mini HIV-based IN assay enhanced the strand transfer activity of the recombinant HIV-1 IN [56]. This stimulation is highly dependent of the ratio between IN and LEDGF used for the reaction [58]. Probably, LEDGF/p75 has a double effect on IN. The first one is similar to the one described for HSP60. Indeed, it was shown that LEDGF-IN complex displays a more favourable solubility profiles as compared to the free IN [96]. In the same publication, a second effect could explain the enhancement of IN activity as LEDGF/p75 binding to DNA concomitantly increases IN-DNA affinity [96]. Concerning more specifically the concerted integration, it has been reported that LEDGF increases the stabilisation of the tetrameric state of IN which is responsible for the concerted integration [97]. In vivo, LEDGF displays an important role in the targeting of the viral integration [98] (see also # 2.5).

It is important to note that IN activity is also highly regulated by the structure of the viral and host DNA substrates which can be influenced by protein interactions on DNA. Pruss et al. studied the propensity of IN to integrate an oligonucleotide mimicking the HIV LTR into either DNA molecules of known structure or nucleosomal complexes [99, 100]. Results highlight that the structure of the target greatly influences the site of integration, and that DNA curvature, flexibility/rigidity in solution, all parameters influence the frequency of integration. Furthermore, using a model target comprising a 13-nucleosome extended array that includes binding sites for specific transcription factors and which can be compacted into a higher-ordered structure, Taganov et al. demonstrated that the efficiency of the in vitro integration was decreased after compaction of this target with histone H1 [101]. Consequently, both intrinsic DNA structure and the folding of DNA into chromosomal structures will exert a major influence on both catalysis efficiency and target site selection for the viral genome integration. The structure of the viral DNA also greatly influences IN activity [102], as illustrated by alterations in the minor groove of the viral DNA which result in a greater decrease in 3'-processing activity than major groove substitutions, suggesting a great importance of the structure of the viral DNA for IN activities.

Several cellular proteins greatly influence the structure of the viral DNA and thus modulate IN activities. For example, BAF (Barrier-to-autointegration factor), a component of the functional HIV-1 pre-integration complex, stimulates the integration reaction in the PIC complex [103, 104]. The effect of BAF on integration is probably due, in vitro, to its DNA binding activity and its effect on the viral DNA structure [105]. HMG I(Y), a protein partner of the HIV-1 PICs, has been also described to stimulate concerted integration in vitro. Li and colleagues demonstrated that HMG I(Y) can condense model HIV-1 cDNA in vitro, possibly by approximating both LTR ends and facilitating IN binding by unwinding the LTR termini [106]. These data suggest that binding of HMG I(Y) to multiple cDNA sites compacts retroviral cDNA, thereby promoting formation of active integrase-cDNA complexes [106]. In addition, Carteau and colleagues led to the finding that concerted integration can be stimulated more than 1,000-fold in the presence of the nucleocapsid protein in comparison to integrase alone under some conditions of reaction [54]. To date, the effect of the NC on concerted integration is not clear but is probably due its capability to promote DNA distorsion.

Another IN cofactor, INI-1 (Integrase Interactor 1), has been described to enhance IN activity probably by structural and topological effect on DNA. INI-1, is one of the core subunits of the ATP-dependent chromatin remodelling complex SWI/SNF that regulates expression of numerous eukaryotic genes by altering DNA/histone interaction. INI-1 was identified by a two-hybrid system that binds to IN and enhances the strand transfer activity of the protein [107]. Taking into account that INI-1 interacts with IN, it is not excluded that a solubility effect induced by protein-protein interaction may account for the stimulation effect on IN activity as reported for LEDGF/p75. It is important to note that conflicting results concerning the role of INI-1 in the HIV-1 life cycle have been reported. It has been described that SNF5/INI-1 interferes with early steps of HIV-1 replication [108]. Boese and colleagues found no effects on viral integration in cells depleted for INI-1 [109], whereas it has been proposed that INI-1 was required for efficient activation of Tat-mediated transcription [110]. The comprehension of the role of such IN partners, as well as the discovery of novel partners will be crucial to reproduce more authentic integrase complexes for mechanistic studies and development of IN inhibitors.

Targeting viral integration

Additionally, interactions between IN and cellular protein partners play key role in the targeting of integration. A systematic study of the sites of HIV DNA integration into the host DNA has shown that integration is not entirely random. Analysis of integration sites in vivo indicates that HIV tends to integrate into sites of active transcription [43]. It is likely that this integration bias results from interactions between PICs and components of cellular origin in relationship with the chromatin tethering. Several cellular cofactors, including INI-1 [107, 111], BAF [103, 112], Ku [113] and LEDGF/p75 [114], are known to interact with the PIC in the nucleus. Among these proteins, at least INI-1 and LEDGF/p75 physically interact with IN [107, 115]. Recent work with LEDGF/p75 strongly suggests that this cofactor is actually responsible for targeting integration [11]. LEDGF/p75 silencing modifies the bias from transcription units to CpG islands [43, 116]. As LEDGF/p75 is essential for HIV-1 replication and LEDGF/p75 interacts directly with IN, the domain of interaction between these two proteins is therefore a promising target for the development of integrase ligands with antiviral activity. Although no direct interaction between IN and BAF or Ku was described, it is suggested that these two cofactors could influence the profile or efficiency of integration [117, 118]. For example, interaction of BAF with emerin, an internal-inner-nuclear-envelope protein, could favour the access of the PIC to the chromatin and thus facilitate integration [119]. In relationship with chromatin, it was recently described that the C-terminal domain of IN is acetylated by a histone acetyl transferase (HAT) [120]. However, the effect of IN acetylation on integration in vivo remains unclear [121].

Non catalytic activities of IN

IN plays a key role for retroviral replication because of its catalytical activities. In addition, IN has also non catalytic properties that are essential for the replication cycle. Mutations of IN can be divided into two groups. The first class of mutations (Class I) includes mutants that are affected in their catalytic activities. For instance, one mutation in either the three amino acids of the DDE triad abolishes the catalytic activities of IN. The second class of mutations (Class II mutants) disturbs other steps of the retroviral replication and corresponding purified integrase mutants display wild-type level of activity.

Several mutations of IN displayed an in vivo DNA synthesis defect and a block of viral replication at the reverse transcription level [8, 9, 122124]. A structural general defect at the level of the retrotranscription complex which includes RT and IN may account for such a phenotype. Indeed, several studies suggest a physical interaction between IN and RT [9]. Such a defect in DNA synthesis can be also observed when using SQL compounds which target integrase, as evidenced by resistance mutations study, but primarily affect the reverse transcription step [82].

Another role of IN prior to integration is related to the PIC translocation in the nucleus. In fact, in non-dividing infected cells, such as macrophages, the PIC must cross the nuclear membrane to reach the chromosomal DNA. This involves an active mechanism, the determinants of which remain unclear [125, 126]. It has been reported by De Soultrait et al. that L2, which corresponds to the C-end half of the yeast STU2p, a microtubule-associated protein (MAP), interacts with IN. STU2p is an essential component of the yeast spindle pole body (SPB), which is able to bind microtubules in vitro. This interaction was observed in vitro and also in vivo in a yeast model [127]. The identification of components of the microtubule network associated with IN suggests a role of this complex in the transport of HIV-1 PIC to the nucleus and supports recent particle tracking data suggesting that PIC is characterized by a microtubule-directed movement [128].

Integrase and at least two other components of the PIC, Vpr and MA, have karyophilic properties [129] suggesting that several distinct mechanisms could be involved in the nuclear import. The integrase enzyme includes several sequence motifs likely to act as nuclear localisation signals (NLSs), including at least one known to interact with the nuclear import receptor, this motif being located in the C-terminal domain [126]. A sequence within the catalytic core including the V165 and R166 residues may also contribute to the karyophilic properties of integrase [130], although this remains a matter of debate [124, 131]. In any case, the mutation of these various sequences does not completely abolish the nuclear translocation of PICs, confirming that there are complementary and/or redundant translocation mechanisms. Recently, a novel partner of IN in the nuclear translocation has been described by Christ and colleagues [132]. Using yeast two-hybrid and pull-down experiments, the transportin-SR2 (TRN-SR2) was shown to interact with IN. By RNAi experiment on infected cells, SR2 was clearly validated as an essential partner in the translocation of IN and consequently of the PIC into the nucleus of infected cells.

Finally, integrase could be indirectly involved in the regulation of transcription of integrated provirus. After the integration process, IN could be tightly bound to the integrated DNA and then, the degradation of IN by the proteasome-ubiquitin pathway was proposed to regulate the transcription of viral genes. Indeed, Dargemont and collaborators have found that integrase interacts with VBP1 (von Hippel-Lindau binding protein 1), a binding partner of Cul2/VHL ligase complex involved in the polyubiquitylation process [133].

Conclusion

In conclusion, remarkable progress has been made towards understanding the structure of the pre-integration complex formed by HIV integrase and viral DNA. This new knowledge has led to considerable improvements in the methods used to search for compounds active against this enzyme. Several families of inhibitors have now been identified, including at least one – strand transfer inhibitors – currently in the advanced stages of clinical development and giving results sufficiently promising for one molecule (Raltegravir) to have obtained a licence in October 2007 for release in the United States. The identification of several new integrase cofactors will provide us with a clearer picture of the determinants of integration in vivo, opening up new possibilities for pharmacological research [134]. There is no doubt that interest in the structural biology of integrase will be substantially stimulated by the release of the first integrase inhibitors onto the market and, unfortunately, by the likely emergence of resistant viruses.