Chemical Similarity Searching
TL;DR: The concept of similarity searching is introduced, differentiating it from the more common substructure searching, and the current generation of fragment-based measures that are used for searching chemical structure databases are discussed.
read more
Abstract: This paper reviews the use of similarity searching in chemical databases. It begins by introducing the concept of similarity searching, differentiating it from the more common substructure searching, and then discusses the current generation of fragment-based measures that are used for searching chemical structure databases. The next sections focus upon two of the principal characteristics of a similarity measure: the coefficient that is used to quantify the degree of structural resemblance between pairs of molecules and the structural representations that are used to characterize molecules that are being compared in a similarity calculation. New types of similarity measure are then compared with current approaches, and examples are given of several applications that are related to similarity searching.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Scaffold distributions in bioactive molecules, clinical trials compounds, and drugs.
Ye Hu,Jürgen Bajorath +1 more
TL;DR: A comparative molecular scaffold analysis of three sets of compounds representing different stages in drug discovery: biologically active molecules (hits or leads), compounds in clinical trials, and registered/approved drugs.
21
Physicochemical and DMPK In Silico Models: Facilitating Their Use by Medicinal Chemists
TL;DR: An overview of calculable physicochemical and DMPK properties, an assessment of their relative difficulty of their calculation and accuracy, and available software is provided.
21
A quantum-based similarity method in virtual screening
TL;DR: Simulated virtual screening experiment show that the effectiveness of SQB method was significantly increased due to the role of representational power of molecular compounds in complex numbers forms compared to Tanimoto benchmark similarity measure.
Molecular similarity concepts and search calculations.
Jens Auer,Jürgen Bajorath +1 more
TL;DR: Critical aspects of molecular similarity are discussed, an understanding of which is essential for the evaluation of method development in this field and studies designed to enhance the performance of molecular fingerprint searching are described, one of the most intuitive and widely used similarity-based methods.
21
Assessing the Structural and Pharmacological Similarity of Newly Identified Drugs of Abuse to Controlled Substances Using Public Health Assessment via Structural Evaluation.
Christopher R. Ellis,Rebecca Racz,Naomi L. Kruhlak,Marlene T. Kim,Edward G. Hawkins,David G. Strauss,Lidiya Stavitskaya +6 more
TL;DR: The US Food and Drug Administration's Center for Drug Evaluation and Research's CDER developed an investigational Public Health Assessment via Structural Evaluation (PHASE) methodology to provide a structure‐based evaluation of a newly identified opioid's risk to public safety.
21
References
Hierarchical Grouping to Optimize an Objective Function
TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.
19.8K
Features of Similarity
TL;DR: The metric and dimensional assumptions that underlie the geometric representation of similarity are questioned on both theoretical and empirical grounds and a set of qualitative assumptions are shown to imply the contrast model, which expresses the similarity between objects as a linear combination of the measures of their common and distinctive features.
Development and validation of a genetic algorithm for flexible docking.
TL;DR: GOLD (Genetic Optimisation for Ligand Docking) is an automated ligand docking program that uses a genetic algorithm to explore the full range of ligand conformational flexibility with partial flexibility of the protein, and satisfies the fundamental requirement that the ligand must displace loosely bound water on binding.
6.5K
Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.
TL;DR: The main features of the CoMFA approach, exemplified by analyses of the affinities of 21 varied steroids to corticosteroid and testosterone-binding globulins, and a number of advances in the methodology of molecular graphics are described.
3.8K
A Fast Flexible Docking Method using an Incremental Construction Algorithm
TL;DR: This work presents an automatic method for docking organic ligands into protein binding sites that combines an appropriate model of the physico-chemical properties of the docked molecules with efficient methods for sampling the conformational space of the ligand.
2.8K