Chemical Similarity Searching
TL;DR: The concept of similarity searching is introduced, differentiating it from the more common substructure searching, and the current generation of fragment-based measures that are used for searching chemical structure databases are discussed.
read more
Abstract: This paper reviews the use of similarity searching in chemical databases. It begins by introducing the concept of similarity searching, differentiating it from the more common substructure searching, and then discusses the current generation of fragment-based measures that are used for searching chemical structure databases. The next sections focus upon two of the principal characteristics of a similarity measure: the coefficient that is used to quantify the degree of structural resemblance between pairs of molecules and the structural representations that are used to characterize molecules that are being compared in a similarity calculation. New types of similarity measure are then compared with current approaches, and examples are given of several applications that are related to similarity searching.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices.
TL;DR: An approach that combines the analysis of similarity-based molecular networks and SAR index distributions to identify multiple SAR components present within sets of active compounds to identify key compounds that are major determinants of SAR characteristics is introduced.
162
Structural resemblances and comparisons of the relative pharmacological properties of imatinib and nilotinib.
Paul W. Manley,Nikolaus Stiefl,Sandra W. Cowan-Jacob,Susan Kaufman,Jürgen Mestan,Markus Wartmann,Marion Wiesmann,Richard C. Woodman,Neil Gallagher +8 more
TL;DR: D dissimilarity is reflected in the drugs having substantially different preclinical pharmacology and a lack of cross-intolerance in CML patients, which translates into nilotinib being an efficacious treatment for CML, with a favourable side-effect profile.
160
Combination of fingerprint-based similarity coefficients using data fusion.
TL;DR: This paper describes experiments in which data fusion is used to combine several binary similarity coefficients to get an overall estimate of similarity for searching databases of bioactive molecules, and shows that search performances can be improved by combining coefficients with little extra computational cost.
157
Predicting the Predictability: A Unified Approach to the Applicability Domain Problem of QSAR Models
TL;DR: A unified conceptual framework to describe and quantify the important issue of the Applicability Domains (AD) of Quantitative Structure-Activity Relationships (QSARs) and a first use of untrustworthiness scores resides in prioritization of predictions, without the need to specify a hard AD border.
157
Similarity searching in large combinatorial chemistry spaces.
Matthias Rarey,Martin Stahl +1 more
TL;DR: A novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming, which generates sets of compounds from a given chemistry space that are similar to the query.
157
References
Hierarchical Grouping to Optimize an Objective Function
TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.
19.8K
Features of Similarity
TL;DR: The metric and dimensional assumptions that underlie the geometric representation of similarity are questioned on both theoretical and empirical grounds and a set of qualitative assumptions are shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features.
Development and validation of a genetic algorithm for flexible docking.
TL;DR: GOLD (Genetic Optimisation for Ligand Docking) is an automated ligand docking program that uses a genetic algorithm to explore the full range of ligand conformational flexibility with partial flexibility of the protein, and satisfies the fundamental requirement that the ligand must displace loosely bound water on binding.
6.5K
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
3.8K
A Fast Flexible Docking Method using an Incremental Construction Algorithm
TL;DR: This work presents an automatic method for docking organic ligands into protein binding sites that combines an appropriate model of the physico-chemical properties of the docked molecules with efficient methods for sampling the conformational space of the ligand.
2.8K