TL;DR: The polymerase chain reaction (PCR) process is used to show that several randomly chosen simple sequence loci with different nucleotide composition and from different species show extensive length polymorphisms.
Abstract: Short simple sequence stretches occur as highly repetitive elements in all eukaryotic genomes and partially also in prokaryotes and eubacteria. They are thought to arise by slippage like events working on randomly occurring internally repetitive sequence stretches. This predicts that they should be generally hypervariable in length. I have used the polymerase chain reaction (PCR) process to show that several randomly chosen simple sequence loci with different nucleotide composition and from different species show extensive length polymorphisms. These simple sequence length polymorphisms (SSLP) may be usefully exploited for identity testing, population studies, linkage analysis and genome mapping.
TL;DR: Evidence is presented that single-base repeats (the shortest possible motifs) are represented by longer runs in mammalian introns than would be expected on a random basis, supporting the idea that SSM may be a ubiquitous force in the evolution of the eukaryotic genome.
Abstract: Simple repetitive DNA sequences are a widespread and abundant feature of genomic DNA. The following several features characterize such sequences: (1) they typically consist of a variety of repeated motifs of 1-10 bases--but may include much larger repeats as well; (2) larger repeat units often include shorter ones within them; (3) long polypyrimidine and poly-CA tracts are often found; and (4) tandem arrangements of closely related motifs are often found. We propose that slipped-strand mispairing events, in concert with unequal crossing-over, can readily account for all of these features. The frequent occurrence of long tandem repeats of particular motifs (polypyrimidine and poly-CA tracts) appears to result from nonrandom patterns of nucleotide substitution. We argue that the intrahelical process of slipped-strand mispairing is much more likely to be the major factor in the initial expansion of short repeated motifs and that, after initial expansion, simple tandem repeats may be predisposed to further expansion by unequal crossing-over or other interhelical events because of their propensity to mispair. Evidence is presented that single-base repeats (the shortest possible motifs) are represented by longer runs in mammalian introns than would be expected on a random basis, supporting the idea that SSM may be a ubiquitous force in the evolution of the eukaryotic genome. Simple repetitive sequences may therefore represent a natural ground state of DNA unselected for coding functions.
TL;DR: A highly conserved repetitive DNA sequence, (TTAGGG)n, has been isolated from a human recombinant repetitive DNA library and its similarity to functional telomeres isolated from lower eukaryotes suggest that this sequence is a functional human telomere.
Abstract: A highly conserved repetitive DNA sequence, (TTAGGG)n, has been isolated from a human recombinant repetitive DNA library. Quantitative hybridization to chromosomes sorted by flow cytometry indicates that comparable amounts of this sequence are present on each human chromosome. Both fluorescent in situ hybridization and BAL-31 nuclease digestion experiments reveal major clusters of this sequence at the telomeres of all human chromosomes. The evolutionary conservation of this DNA sequence, its terminal chromosomal location in a variety of higher eukaryotes (regardless of chromosome number or chromosome length), and its similarity to functional telomeres isolated from lower eukaryotes suggest that this sequence is a functional human telomere.
TL;DR: A novel family of repetitive DNA sequences that is present among both domains of the prokaryotes but absent from eukaryotes or viruses is studied, characterized by direct repeats, varying in size from 21 to 37 bp, interspaced by similarly sized non‐repetitive sequences.
Abstract: Using in silico analysis we studied a novel family of repetitive DNA sequences that is present among both domains of the prokaryotes (Archaea and Bacteria), but absent from eukaryotes or viruses. This family is characterized by direct repeats, varying in size from 21 to 37 bp, interspaced by similarly sized non-repetitive sequences. To appreciate their characteri-stic structure, we will refer to this family as the clustered regularly interspaced short palindromic repeats (CRISPR). In most species with two or more CRISPR loci, these loci were flanked on one side by a common leader sequence of 300-500 b. The direct repeats and the leader sequences were conserved within a species, but dissimilar between species. The presence of multiple chromosomal CRISPR loci suggests that CRISPRs are mobile elements. Four CRISPR-associated (cas) genes were identified in CRISPR-containing prokaryotes that were absent from CRISPR-negative prokaryotes. The cas genes were invariably located adjacent to a CRISPR locus, indicating that the cas genes and CRISPR loci have a functional relationship. The cas3 gene showed motifs characteristic for helicases of the superfamily 2, and the cas4 gene showed motifs of the RecB family of exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression. The spatial coherence of CRISPR and cas genes may stimulate new research on the genesis and biological role of these repeats and genes.
TL;DR: Features of the organization of repetitive sequences in eukaryotic genomes, and their distribution in natural populations, reflect the evolutionary forces acting on selfish DNA.
Abstract: Repetitive DNA sequences form a large portion of the genomes of eukaryotes. The 'selfish DNA' hypothesis proposes that they are maintained by their ability to replicate within the genome. The behaviour of repetitive sequences can result in mutations that cause genetic diseases, and confer significant fitness losses on the organism. Features of the organization of repetitive sequences in eukaryotic genomes, and their distribution in natural populations, reflect the evolutionary forces acting on selfish DNA.