Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations.

doi:10.15252/MSB.20199380

Open Access•Journal Article•10.15252/MSB.20199380

Benjamin J Livesey, +1 more

- 01 Jul 2020

- Molecular Systems Biology

- Vol. 16, Iss: 7

164

TL;DR: DeepSequence clearly stood out, showing both the strongest correlations with DMS data and having the best ability to predict pathogenic mutations, which is especially remarkable given that it is an unsupervised method.

Citations

Journal Article•10.1038/S41580-021-00407-0

A guide to machine learning for biologists.

Joe G Greener, +3 more

- 13 Sep 2021

- Nature Reviews Molecular Cell Biology

TL;DR: Machine learning is becoming a widely used tool for the analysis of biological data, but proper use of machine learning methods can be challenging for experimentalists. This paper discusses best practices and points to consider when embarking on experiments involving machine learning.

1.1K

Journal Article•10.1126/science.adg7492

Accurate proteome-wide missense variant effect prediction with AlphaMissense

Jun Cheng, +15 more

- 19 Sep 2023

- Science

TL;DR: AlphaMissense, an adaptation of AlphaFold fine-tuned on human and primate variant population frequency databases to predict missense variant pathogenicity, achieves state-of-the-art results across a wide range of genetic and experimental benchmarks, all without explicitly training on such data.

774

•Journal Article•10.1038/S41467-021-22732-W

Protein design and variant prediction using autoregressive generative models

Jung-Eun Shin, +9 more

- 23 Apr 2021

- Nature Communications

TL;DR: This article proposes a deep generative model, adapted from natural language processing, for the prediction and design of diverse functional sequences without the need for alignments. The model achieves state-of-the-art prediction of missense and indel effects and is used to successfully design and test a diverse 10^5-nanobody library.

268

•Journal Article•10.1038/s41587-023-01763-2

Efficient evolution of human antibodies from general protein language models

Brian Hie, +1 more

- 24 Apr 2023

- Nature Biotechnology

TL;DR: This paper showed that general protein language models can efficiently evolve human antibodies by suggesting mutations that are evolutionarily plausible, despite providing the model with no information about the target antigen, binding specificity or protein structure.

245

•Journal Article•10.1038/s41467-022-31686-6

Loss-of-function, gain-of-function and dominant-negative mutations have profoundly different effects on protein structure

Liyan Ye

- 06 Jul 2022

- Nature Communications

TL;DR: This paper investigates the protein-level effects of pathogenic missense mutations associated with different molecular mechanisms. It finds striking differences between recessive and dominant mutations, and between loss-of-function (LOF) and non-LOF mutations: dominant, non-LOF disease mutations have much milder effects on protein structure, and dominant-negative (DN) mutations are highly enriched at protein interfaces.

148

References

•Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014

- Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

43.7K

•Journal Article•10.1038/NMETH0410-248

A method and server for predicting damaging missense mutations.

Ivan Adzhubei, +7 more

- 01 Apr 2010

- Nature Methods

TL;DR: This paper presents a new method and corresponding software tool, PolyPhen-2, which differs from the earlier tool PolyPhen in its set of predictive features, alignment pipeline, and method of classification. Its performance, as shown by its receiver operating characteristic curves, was consistently superior.

13.4K

•Journal Article•10.1073/PNAS.89.22.10915

Amino acid substitution matrices from protein blocks

Steven Henikoff, +1 more

- 15 Nov 1992

- Proceedings of the National Academy of S...

TL;DR: This work has derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins, leading to marked improvements in alignments and in searches using queries from each of the groups.

7.2K

•Journal Article•10.1093/NAR/GKY1016

CADD: predicting the deleteriousness of variants throughout the human genome.

Philipp Rentzsch, +5 more

- 08 Jan 2019

- Nucleic Acids Research

TL;DR: This paper reviews the latest updates to CADD, including the most recent version, 1.4, which supports the human genome build GRCh38. It also presents updates to the website, including simplified variant lookup, extended documentation, an Application Program Interface and improved mechanisms for integrating CADD scores into other tools or applications.

3.2K

•Journal Article•10.1093/NAR/GKT1113

ClinVar: public archive of relationships among sequence variation and human phenotype

Melissa J. Landrum, +6 more

- 01 Jan 2014

- Nucleic Acids Research

TL;DR: To facilitate evaluation of the medical importance of each variant, ClinVar aggregates submissions with the same variation/phenotype combination, adds value from other NCBI databases, assigns a distinct accession of the format RCV000000000.0 and reports if there are conflicting clinical interpretations.

3K