Multiple structural alignment by secondary structures: algorithm and applications.
TL;DR: By using conserved structural motifs, one can guide protein–protein docking, which is a notoriously difficult problem and is shown to be a combination of several important characteristics of MASS.
read more
Abstract: We present MASS (Multiple Alignment by Secondary Structures), a novel highly efficient method for structural alignment of multiple protein molecules and detection of common structural motifs. MASS is based on a two-level alignment, using both secondary structure and atomic representation. Utilizing secondary structure information aids in filtering out noisy solutions and achieves efficiency and robustness. Currently, only a few methods are available for addressing the multiple structural alignment task. In addition to using secondary structure information, the advantage of MASS as compared to these methods is that it is a combination of several important characteristics: (1) While most existing methods are based on series of pairwise comparisons, and thus might miss optimal global solutions, MASS is truly multiple, considering all the molecules simultaneously; (2) MASS is sequence order-independent and thus capable of detecting nontopological structural motifs; (3) MASS is able to detect not only structural motifs, shared by all input molecules, but also motifs shared only by subsets of the molecules. Here, we show the application of MASS to various protein ensembles. We demonstrate its ability to handle a large number (order of tens) of molecules, to detect nontopological motifs and to find biologically meaningful alignments within nonpredefined subsets of the input. In particular, we show how by using conserved structural motifs, one can guide protein–protein docking, which is a notoriously difficult problem. MASS is freely available at http://bioinfo3d.cs.tau.ac.il/MASS/.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
MUSTANG: a multiple structural alignment algorithm.
TL;DR: A reliable and robust algorithm, MUSTANG (MUltiple STructural AligNment AlGorithm), for the alignment of multiple protein structures, based on the progressive pairwise heuristic, which performs comparably to popular pairwise and multiple structural alignment tools for closely related proteins.
A method for simultaneous alignment of multiple protein structures
TL;DR: It is their residue sequence order‐independence that allows application of MultiProt to derive multiple alignments of binding sites and of protein‐protein interfaces, making MultiProt an extremely useful structural tool.
On the relationship between sequence and structure similarities in proteomics
TL;DR: The results indicate that multiple conservation of residue identity is not common and that relationship between sequence and structure may be explained by a model based on the assumption that protein structure is tolerant to residue substitutions preserving hydropathic profile of the sequence.
155
Déjà vu all over again: finding and analyzing protein structure similarities.
Michael Sierk,Gerard J. Kleywegt +1 more
TL;DR: The main purpose is to encourage users to gain some understanding of the strengths and limitations of structural alignment, and to take these factors into account when interpreting the results of different programs.
155
Optimal simultaneous superpositioning of multiple structures with missing data
TL;DR: This work uses the expectation–maximization algorithm, a classic statistical technique for dealing with incomplete data, to find both maximum-likelihood solutions and the optimal least-squares solution as a special case for superposition when some of the data are missing.
References
The Protein Data Bank
Helen M. Berman,John D. Westbrook,Zukang Feng,Gary L. Gilliland,Talapady N. Bhat,Helge Weissig,Ilya N. Shindyalov,Philip E. Bourne +7 more
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features
Wolfgang Kabsch,Chris Sander +1 more
TL;DR: A set of simple and physically motivated criteria for secondary structure, programmed as a pattern‐recognition process of hydrogen‐bonded and geometrical features extracted from x‐ray coordinates is developed.
15.7K
A general method applicable to the search for similarities in the amino acid sequence of two proteins
TL;DR: A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
13.2K
SCOP: a structural classification of proteins database for the investigation of sequences and structures.
TL;DR: This database provides a detailed and comprehensive description of the structural and evolutionary relationships of the proteins of known structure and provides for each entry links to co-ordinates, images of the structure, interactive viewers, sequence data and literature references.
6.9K
•Book
Introduction to protein structure
Carl Ivar Branden,John Tooze +1 more
- 01 Jan 1991
TL;DR: Part 1 BASIC STRUCTURAL PRINCIPLES: The Building Blocks and Motifs of Protein Structure and Part 2 STRUCTURE, FUNCTION and ENGINEERING: Structure, Function and Engineering.
3.7K