TL;DR: Each of the three retroviruses studied showed unique integration site preferences, suggesting that virus-specific binding of integration complexes to chromatin features likely guides site selection.
Abstract: The completion of the human genome sequence has made possible genome-wide studies of retroviral DNA integration. Here we report an analysis of 3,127 integration site sequences from human cells. We compared retroviral vectors derived from human immunodeficiency virus (HIV), avian sarcoma-leukosis virus (ASLV), and murine leukemia virus (MLV). Effects of gene activity on integration targeting were assessed by transcriptional profiling of infected cells. Integration by HIV vectors, analyzed in two primary cell types and several cell lines, strongly favored active genes. An analysis of the effects of tissue-specific transcription showed that it resulted in tissue-specific integration targeting by HIV, though the effect was quantitatively modest. Chromosomal regions rich in expressed genes were favored for HIV integration, but these regions were found to be interleaved with unfavorable regions at CpG islands. MLV vectors showed a strong bias in favor of integration near transcription start sites, as reported previously. ASLV vectors showed only a weak preference for active genes and no preference for transcription start regions. Thus, each of the three retroviruses studied showed unique integration site preferences, suggesting that virus-specific binding of integration complexes to chromatin features likely guides site selection.
TL;DR: The definitive features of these novel elements are that they include site‐specific integration functions (the Integrase and the insertion site); (ii) that they are able to acquire various gene units and act as an expression cassette by supplying the promoter for the inserted genes.
Abstract: A family of novel mobile DNA elements is described, examples of which are found at several independent locations and encode a variety of antibiotic resistance genes. The complete elements consist of two conserved segments separated by a segment of variable length and sequence which includes inserted antibiotic resistance genes. The conserved segment located 3' to the inserted resistance genes was sequenced from Tn21 and R46, and the sequences are identical over a region of 2026 bases, which includes the sulphonamide resistance gene sull, and two further open reading frames of unknown function. The complete sequences of both the 3' and 5' conserved regions of the DNA element have been determined. A 59-base sequence element, found at the junctions of inserted DNA sequences and the conserved 3' segment, is also present at this location in the R46 sequence. A copy of one half of this 59-base element is found at the end of the sull gene, suggesting that sull, though part of the conserved region, was also originally inserted into an ancestral element by site-specific integration. Inverted or direct terminal repeats or short target site duplications, both of which are characteristics of class I and class II transposons, are not found at the outer boundaries of the elements described here. Furthermore, the conserved regions do not encode any proteins related to known transposition proteins, except the DNA integrase encoded by the 5' conserved region which is implicated in the gene insertion process. Mobilization of this element has not been observed experimentally; mobility is implied from the identification of the element in at least four independent locations, in Tn21, R46 (IncN), R388 (IncW) and Tn1696. The definitive features of these novel elements are (i) that they include site-specific integration functions (the integrase and the insertion site); (ii) that they are able to acquire various gene units and act as an expression cassette by supplying the promoter for the inserted genes. As a consequence of acquiring different inserted genes, the element exists in a variety of forms which differ in the number and nature of the inserted genes. This family of elements appears formally distinct from other known mobile DNA elements and we propose the name DNA integration elements, or integrons.
TL;DR: The crystal structure of the catalytically active core domain of HIV-1 integrase was determined and it is revealed that this domain of integrase belongs to a superfamily of polynucleotidyl transferases that includes ribonuclease H and the Holliday junction resolvase RuvC.
Abstract: HIV integrase is the enzyme responsible for inserting the viral DNA into the host chromosome; it is essential for HIV replication. The crystal structure of the catalytically active core domain (residues 50 to 212) of HIV-1 integrase was determined at 2.5 A resolution. The central feature of the structure is a five-stranded beta sheet flanked by helical regions. The overall topology reveals that this domain of integrase belongs to a superfamily of polynucleotidyl transferases that includes ribonuclease H and the Holliday junction resolvase RuvC. The active site region is identified by the position of two of the conserved carboxylate residues essential for catalysis, which are located at similar positions in ribonuclease H. In the crystal, two molecules form a dimer with a extensive solvent-inaccessible interface of 1300 A2 per monomer.
TL;DR: The crystal structure of full-length integrase from the prototype foamy virus in complex with its cognate DNA is presented, defining the structural basis of retroviral DNA integration, and will allow modelling of the HIV-1 intasome to aid in the development of antiretroviral drugs.
Abstract: Integrase is an essential retroviral enzyme that binds both termini of linear viral DNA and inserts them into a host cell chromosome. The structure of full-length retroviral integrase, either separately or in complex with DNA, has been lacking. Furthermore, although clinically useful inhibitors of HIV integrase have been developed, their mechanism of action remains speculative. Here we present a crystal structure of full-length integrase from the prototype foamy virus in complex with its cognate DNA. The structure shows the organization of the retroviral intasome comprising an integrase tetramer tightly associated with a pair of viral DNA ends. All three canonical integrase structural domains are involved in extensive protein–DNA and protein–protein interactions. The binding of strand-transfer inhibitors displaces the reactive viral DNA end from the active site, disarming the viral nucleoprotein complex. Our findings define the structural basis of retroviral DNA integration, and will allow modelling of the HIV-1 intasome to aid in the development of antiretroviral drugs.
TL;DR: Results suggest that both reactions occur by a one-step mechanism without involvement of a covalent protein-DNA intermediate in HIV-1 integration.