An iterative knowledge-based scoring function for protein–protein recognition

doi:10.1002/PROT.21949

Journal Article

Sheng-You Huang, +1 more

- 01 Aug 2008

- Proteins

- Vol. 72, Iss: 2, pp 557-579

TL;DR: ITScore-PP is a distance-dependent knowledge-based scoring function for predicting protein–protein interactions; its binding scores correlated well with experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71.

Abstract: Using an efficient iterative method, we have developed a distance-dependent knowledge-based scoring function to predict protein-protein interactions. The function, referred to as ITScore-PP, was derived using the crystal structures of a training set of 851 protein-protein dimeric complexes containing true biological interfaces. The key idea of the iterative method for deriving ITScore-PP is to improve the interatomic pair potentials by iteration until the pair potentials can distinguish true binding modes from decoy modes for the protein-protein complexes in the training set. The iterative method circumvents the challenging reference state problem in deriving knowledge-based potentials. The derived scoring function was used to evaluate the ligand orientations generated by ZDOCK 2.1 and the native ligand structures on a diverse set of 91 protein-protein complexes. For the bound test cases, ITScore-PP yielded a success rate of 98.9% if the top 10 ranked orientations were considered. For the more realistic unbound test cases, the corresponding success rate was 40.7%. Furthermore, to speed up orientational sampling, several residue-level knowledge-based scoring functions were also derived following a similar iterative procedure. Among them, the scoring function that uses the side-chain center of mass (SCM) to represent a residue, referred to as ITScore-PP(SCM), showed the best performance and yielded success rates of 71.4% and 30.8% for the bound and unbound cases, respectively, when the top 10 orientations were considered. ITScore-PP was further tested using two other published protein-protein docking decoy sets, the ZDOCK decoy set and the RosettaDock decoy set. In addition to binding mode prediction, the binding scores predicted by ITScore-PP also correlated well with the experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71 on a test set of 74 protein-protein complexes with known affinities.
ITScore-PP is computationally efficient. The average run time for ITScore-PP was about 0.03 seconds per orientation (including optimization) on a personal computer with a 3.2 GHz Pentium IV CPU and 3.0 GB of RAM. ITScore-PP(SCM) is about an order of magnitude faster than ITScore-PP. ITScore-PP and/or ITScore-PP(SCM) can be combined with efficient protein docking software to study protein-protein recognition.
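
The key idea stated in the abstract, refining pair potentials by iteration until native binding modes outscore decoys, can be illustrated with a toy sketch. Everything specific below is an assumption made for illustration: the update rule (shift each distance bin by the gap between the Boltzmann-weighted predicted pair distribution and the native one), the step size `lam`, the single atom-pair type, and the hypothetical histogram data. The abstract does not give the update formula or parameters actually used for ITScore-PP.

```python
import numpy as np

# Hypothetical toy data: one array per complex, one row per binding mode,
# one column per distance bin (pair counts). Row 0 is the native mode,
# the remaining rows are decoys.
COMPLEXES = [
    np.array([[0, 5, 2, 1], [3, 1, 2, 2], [1, 2, 1, 4]]),
    np.array([[0, 6, 1, 1], [2, 2, 3, 1], [0, 1, 3, 4]]),
]


def boltzmann_weights(scores, kT=1.0):
    """Normalized Boltzmann weights; shifted by the minimum for stability."""
    w = np.exp(-(scores - scores.min()) / kT)
    return w / w.sum()


def native_ranks_first(counts, u):
    """True if the native mode (row 0) scores strictly lower than every decoy."""
    scores = counts @ u
    return bool(scores[0] < scores[1:].min())


def derive_potential(complexes, lam=1.0, kT=1.0, max_iter=100):
    """Iteratively adjust a binned pair potential u (assumed update rule)."""
    n_bins = complexes[0].shape[1]
    u = np.zeros(n_bins)  # start from a flat potential, no reference state

    # Observed distribution: pair counts pooled over the native modes only.
    g_obs = np.sum([c[0] for c in complexes], axis=0).astype(float)
    g_obs /= g_obs.sum()

    for iteration in range(1, max_iter + 1):
        # Predicted distribution: Boltzmann-weighted average over all modes,
        # using the current potential to score each mode.
        g_pred = np.zeros(n_bins)
        for counts in complexes:
            g_pred += boltzmann_weights(counts @ u, kT) @ counts
        g_pred /= g_pred.sum()

        # Bins over-represented in the prediction become less favorable,
        # bins under-represented become more favorable.
        u += lam * kT * (g_pred - g_obs)

        if all(native_ranks_first(c, u) for c in complexes):
            return u, iteration
    return u, max_iter


potential, n_iter = derive_potential(COMPLEXES)
print("iterations:", n_iter)
print("potential per bin:", np.round(potential, 3))
```

In this sketch the potential starts flat and is shaped only by the mismatch between predicted and observed distributions, which is one way to read the abstract's remark that the iterative method sidesteps the reference state problem. A real derivation would use many atom-pair types and far larger decoy sets.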
