Chemical Similarity Searching
TL;DR: The concept of similarity searching is introduced, differentiating it from the more common substructure searching, and the current generation of fragment-based measures that are used for searching chemical structure databases are discussed.
read more
Abstract: This paper reviews the use of similarity searching in chemical databases. It begins by introducing the concept of similarity searching, differentiating it from the more common substructure searching, and then discusses the current generation of fragment-based measures that are used for searching chemical structure databases. The next sections focus upon two of the principal characteristics of a similarity measure: the coefficient that is used to quantify the degree of structural resemblance between pairs of molecules and the structural representations that are used to characterize molecules that are being compared in a similarity calculation. New types of similarity measure are then compared with current approaches, and examples are given of several applications that are related to similarity searching.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Comprehensive survey of target prediction web servers for Traditional Chinese Medicine.
TL;DR: This review introduces the widely used web tools for compound-target interaction prediction databases and web resources in TCM pharmacology research, and compares and analyzes each web tool based on their basic properties, including the underlying theory, algorithms, datasets, and search results.
6
Weighted similarity-based clustering of chemical structures and bioactivity data in early drug discovery.
Nolen Perualila-Tan,Ziv Shkedy,Willem Talloen,Hinrich W. H. Göhlmann,Marijke Van Moerbeke,Adetayo Kasim +5 more
TL;DR: In this article, the authors proposed an integrative clustering approach that makes use of both biological (compound efficacy) and chemical (structural features) data sources for the purpose of discovering a subset of compounds with aligned structural and biological properties.
6
A fourier fingerprint-based method for protein surface representation.
TL;DR: Four surface representations that can be used locally to describe the global shape of specifically bounded local region models are presented and it is demonstrated that this provides an effective and an efficient technique for the matching of protein surfaces.
6
Computational characterization of SAR microenvironments in high-throughput screening data
Mathias Wawer,Su Sun,Jürgen Bajorath +2 more
- 01 Apr 2010
TL;DR: A computational approach is described to analyze structure–activity relationship (SAR) information contained in compound and screening data sets to aid in the selection of hits from high-throughput screening (HTS) data.
6
Ligand shape emerges in solvent dipole ordering region at ligand binding site of protein
TL;DR: A redocking experiment for p38 mitogen‐activated protein kinase ligands using model molecules for search queries succeeded in identifying the binding conformations and binding modes of known inhibitors.
6
References
Hierarchical Grouping to Optimize an Objective Function
TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.
19.8K
Features of Similarity
TL;DR: The metric and dimensional assumptions that underlie the geometric representation of similarity are questioned on both theoretical and empirical grounds and a set of qualitative assumptions are shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features.
Development and validation of a genetic algorithm for flexible docking.
TL;DR: GOLD (Genetic Optimisation for Ligand Docking) is an automated ligand docking program that uses a genetic algorithm to explore the full range of ligand conformational flexibility with partial flexibility of the protein, and satisfies the fundamental requirement that the ligand must displace loosely bound water on binding.
6.5K
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
3.8K
A Fast Flexible Docking Method using an Incremental Construction Algorithm
TL;DR: This work presents an automatic method for docking organic ligands into protein binding sites that combines an appropriate model of the physico-chemical properties of the docked molecules with efficient methods for sampling the conformational space of the ligand.
2.8K