Applicability domains for classification problems: Benchmarking of distance to models for Ames mutagenicity set.

doi:10.1021/CI100253R

Journal Article

Iurii Sushko, +32 more

- 29 Oct 2010

- Journal of Chemical Information and Mode...

- Vol. 50, Iss: 12, pp 2094-2111

240

TL;DR: This work demonstrates that distances to models (DMs) based on an ensemble (consensus) model perform systematically better than other DMs, and that they can be used to halve the cost of experimental measurements while providing similar prediction accuracy.
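
As a rough illustration of an ensemble-based DM, the sketch below uses the standard deviation of the ensemble members' predictions as the distance to model, then reports accuracy on the fraction of compounds with the lowest DM. The choice of standard deviation, the function names, the 0.5 threshold, and the toy numbers are all assumptions made for illustration; they are not taken from the paper.

```python
from statistics import pstdev


def ensemble_std_dm(member_predictions):
    """One DM value per compound: spread of the ensemble members' scores.

    member_predictions is a list of per-model score lists, all of the
    same length (one score per compound). Hypothetical helper.
    """
    return [pstdev(scores) for scores in zip(*member_predictions)]


def consensus_labels(member_predictions, threshold=0.5):
    """Consensus class per compound: mean member score against a threshold."""
    return [int(sum(s) / len(s) >= threshold) for s in zip(*member_predictions)]


def accuracy_at_coverage(dm, predicted, actual, coverage):
    """Accuracy on the `coverage` fraction of compounds with the lowest DM."""
    order = sorted(range(len(dm)), key=lambda i: dm[i])
    keep = order[:max(1, round(coverage * len(order)))]
    return sum(predicted[i] == actual[i] for i in keep) / len(keep)


# Toy data (invented): three ensemble members scoring four compounds.
members = [
    [0.90, 0.10, 0.60, 0.40],
    [0.80, 0.20, 0.30, 0.70],
    [0.95, 0.15, 0.70, 0.35],
]
actual = [1, 0, 0, 0]

dm = ensemble_std_dm(members)
predicted = consensus_labels(members)
print(accuracy_at_coverage(dm, predicted, actual, coverage=1.0))
print(accuracy_at_coverage(dm, predicted, actual, coverage=0.5))
```

In this toy case the members disagree most on the two compounds whose consensus score is near the threshold, so restricting predictions to the low-DM half removes the one misclassified compound.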

Citations

•Journal Article•10.3390/MOLECULES21010001

Extended Functional Groups (EFG): An Efficient Set for Chemical Characterization and Structure-Activity Relationship Studies of Chemical Compounds

Elena Salmina, +2 more

- 23 Dec 2015

- Molecules

TL;DR: Extended Functional Groups (EFG), an extension of the set previously used by the CheckMol software that additionally covers heterocyclic compound classes and periodic table groups, is described, and it is shown that EFG can be efficiently used to develop and interpret structure-activity relationship models.

1K

•Journal Article•10.1002/MINF.201501008

Deep Learning in Drug Discovery.

Erik Gawehn, +2 more

- 01 Jan 2016

- Molecular Informatics

TL;DR: An overview of this emerging field of molecular informatics is given, the basic concepts of prominent deep learning methods are presented, and motivation to explore these techniques for their usefulness in computer‐assisted drug discovery and design is offered.

691

•Journal Article•10.1021/CI400084K

Time-Split Cross-Validation as a Method for Estimating the Goodness of Prospective Prediction.

Robert P. Sheridan

- 05 Apr 2013

- Journal of Chemical Information and Mode...

TL;DR: Time-split selection should be used in addition to random selection as a standard for cross-validation in QSAR model building; it gives an R² that is more like that of true prospective prediction than the R² from random selection or from the analog of leave-class-out selection.

283

•Journal Article•10.1021/CI300245Q

ToxAlerts: a Web server of structural alerts for toxic chemicals and compounds with potential adverse reactions.

Iurii Sushko, +4 more

- 10 Aug 2012

- Journal of Chemical Information and Mode...

TL;DR: A Web-based platform for collecting and storing toxicological structural alerts from literature and for virtual screening of chemical libraries to flag potentially toxic chemicals and compounds that can cause adverse side effects is presented.

241

•Journal Article•10.3389/FCHEM.2020.00726

Computational Approaches in Preclinical Studies on Drug Discovery and Development.

Fengxu Wu, +8 more

- 11 Sep 2020

- Frontiers in Chemistry

TL;DR: A systematic classification and description of the databases and software commonly used for ADMET prediction and some applications that are related to the prediction categories and web tools are listed.

239

References

•Journal Article•10.1023/A:1010933404324

Random Forests

Leo Breiman

- 01 Oct 2001

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

113.1K

•Book

Elements of information theory

Thomas M. Cover, +1 more

- 01 Jan 1991

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

52.2K

•Journal Article•10.1145/1961189.1961199

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011

- ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems, theoretical convergence, multiclass classification, probability estimates, and parameter selection are discussed in detail.

46.3K

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

30.4K

•Book

Data Mining: Practical Machine Learning Tools and Techniques

Ian H. Witten, +2 more

- 25 Oct 1999

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

25.4K