TL;DR: The results suggest that trimeric and tetrameric STR loci are useful markers for the study of new mutations and genetic linkage analysis and for application to personal identification in the medical and forensic sciences.
TL;DR: A method enabling rapid localization of STRs and determination of their flanking DNA sequences was developed, thus simplifying the identification of polymorphic STR loci.
Abstract: Tandemly reiterated sequences represent a rich source of highly polymorphic markers for genetic linkage, mapping, and personal identification. Human trimeric and tetrameric short tandem repeats (STRs) were studied for informativeness, frequency, distribution, and suitability for DNA typing and genetic mapping. The STRs were highly polymorphic and inherited stably. A STR-based multiplex PCR for personal identification is described. It features fluorescent detection of amplified products on sequencing gels, specific allele identification, simultaneous detection of independent loci, and internal size standards. Variation in allele frequencies were explored for four U.S. populations. The three STR loci (chromosomes 4, 11, and X) used in the fluorescent multiplex PCR have a combined average individualization potential of 1/500 individuals. STR loci appear common, being found every 300-500 kb on the X chromosome. The combined frequency of polymorphic trimeric and tetrameric STRs could be as high as 1 locus/20 kb. The markers should be useful for genetic mapping, as they are sequence based, and can be multiplexed with the PCR. A method enabling rapid localization of STRs and determination of their flanking DNA sequences was developed, thus simplifying the identification of polymorphic STR loci. The ease by which STRs may be identified, as well as their genetic and physical mapping utility, give them the properties of useful sequence tagged sites (STSs) for the human genome initiative.
TL;DR: The physical location of each STR locus in the human genome is delineated and allele ranges and variants observed in human populations are summarized as are mutation rates observed from parentage testing.
Abstract: Over the past decade, the human identity testing community has settled on a set of core short tandem repeat (STR) loci that are widely used for DNA typing applications. A variety of commercial kits enable robust amplification of these core STR loci. A brief history is presented regarding the selection of core autosomal and Y-chromosomal STR markers. The physical location of each STR locus in the human genome is delineated and allele ranges and variants observed in human populations are summarized as are mutation rates observed from parentage testing. Internet resources for additional information on core STR loci are reviewed. Additional topics are also discussed, including potential linkage of STR loci to genetic disease-causing genes, probabilistic predictions of sample ethnicity, and desirable characteristics for additional STR loci that may be added in the future to the current core loci. These core STR loci, which form the basis for DNA databases worldwide, will continue to play an important role in forensic science for many years to come.
TL;DR: The combination of multiplex STR-PCR and automatic fluorescence-based detection is thus a rapid and powerful technique for individual identification.
Abstract: Short tandem repeat (STR) loci are a class of polymorphic markers which occur throughout the human genome and which consist of simple tandemly repeated sequences 1–6bp in length. Their abundance, hypervariability and amenability to amplification by the polymerase chain reaction (PCR) make them ideal markers for use in the identification of individuals.
TL;DR: The speed and reliability of lobSTR exceed the performance of current mainstream algorithms for STR profiling, and the algorithm was used to conduct a comprehensive survey of STR variations in a deeply sequenced personal genome.
Abstract: Short tandem repeats (STRs) have a wide range of applications, including medical genetics, forensics, and genetic genealogy. High-throughput sequencing (HTS) has the potential to profile hundreds of thousands of STR loci. However, mainstream bioinformatics pipelines are inadequate for the task. These pipelines treat STR mapping as gapped alignment, which results in cumbersome processing times and a biased sampling of STR alleles. Here, we present lobSTR, a novel method for profiling STRs in personal genomes. lobSTR harnesses concepts from signal processing and statistical learning to avoid gapped alignment and to address the specific noise patterns in STR calling. The speed and reliability of lobSTR exceed the performance of current mainstream algorithms for STR profiling. We validated lobSTR's accuracy by measuring its consistency in calling STRs from whole-genome sequencing of two biological replicates from the same individual, by tracing Mendelian inheritance patterns in STR alleles in whole-genome sequencing of a HapMap trio, and by comparing lobSTR results to traditional molecular techniques. Encouraged by the speed and accuracy of lobSTR, we used the algorithm to conduct a comprehensive survey of STR variations in a deeply sequenced personal genome. We traced the mutation dynamics of close to 100,000 STR loci and observed more than 50,000 STR variations in a single genome. lobSTR's implementation is an end-to-end solution. The package accepts raw sequencing reads and provides the user with the genotyping results. It is written in C/C++, includes multi-threading capabilities, and is compatible with the BAM format.