Book Chapter10.1016/BS.MIE.2020.05.005
Machine learning-assisted enzyme engineering.
86
TL;DR: In this chapter, enzyme engineering strategies that combine random or (semi-)rational approaches with ML methods and allow an effective reengineering of enzymes to improve targeted properties are explained.
read more
Abstract: Directed evolution and rational design are powerful strategies in protein engineering to tailor enzyme properties to meet the demands in academia and industry. Traditional approaches for enzyme engineering and directed evolution are often experimentally driven, in particular when the protein structure-function relationship is not available. Though they have been successfully applied to engineer many enzymes, these methods are still facing significant challenges due to the tremendous size of the protein sequence space and the combinatorial problem. It can be ascertained that current experimental techniques and computational techniques might never be able to sample through the entire protein sequence space and benefit from nature's full potential for the generation of better enzymes. With advancements in next generation sequencing, high throughput screening methods, the growth of protein databases and artificial intelligence, especially machine learning (ML), data-driven enzyme engineering is emerging as a promising solution to these challenges. To date, ML-assisted approaches have efficiently and accurately determined the quantitative structure-property/activity relationship for the prediction of diverse enzyme properties. In addition, enzyme engineering can be accelerated much faster than ever through the combination of experimental library generation and ML-based prediction. In this chapter, we review the recent progresses in ML-assisted enzyme engineering and highlight several successful examples (e.g., to enhance activity, enantioselectivity, or thermostability). Herein we explain enzyme engineering strategies that combine random or (semi-)rational approaches with ML methods and allow an effective reengineering of enzymes to improve targeted properties. We further discuss the main challenges to solve in order to realize the full potential of ML methods in enzyme engineering. Finally, we describe the current limitations of ML-assisted enzyme engineering, and our perspective on future opportunities in this growing field.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Informed training set design enables efficient machine learning-assisted directed protein evolution.
TL;DR: In this paper, a path-independent machine learning-assisted directed evolution (MLDE) protocol was proposed for combinatorial libraries, which allows in-silico screening of full-combinatorial libraries and achieves the global fitness maximum up to 81 times more frequently than single-step greedy optimization.
144
Algorithm-aided engineering of aliphatic halogenase WelO5* for the asymmetric late-stage functionalization of soraphens
Johannes Büchler,Sumire Honda Malca,David Patsch,Moritz Voss,Nicholas J. Turner,Uwe T. Bornscheuer,Oliver Allemann,Camille Le Chapelain,Alexandre Lumbroso,Olivier Loiseleur,Rebecca Buller +10 more
TL;DR: In this article , a combination of smart library design and machine learning is used to engineer the iron/α-ketoglutarate dependent halogenase WelO5* for the late-stage functionalization of the complex and chemically difficult to derivatize macrolides soraphen A and C, potent anti-fungal agents.
Machine learning in bioprocess development: from promise to practice
TL;DR: In this article , the authors demonstrate how ML methods have been applied so far in bioprocess development, especially in strain engineering and selection, bi-process optimization, scale-up, monitoring, and control of bi-rocesses.
56
Machine Learning-assisted Nanozyme Design: Lessons from Materials and Engineered Enzymes.
TL;DR: In this article , the authors summarize recent progress in ML, its utilization in the design of catalytic materials and enzymes, and how emergent ML applications serve as promising strategies to circumvent challenges associated with time-expensive and laborious testing in nanozyme research and development.
55
High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering
TL;DR: A broad overview of current state-of-the-art approaches for enzyme engineering and evolution can be found in this paper , with a focus on novel readout systems based on enzyme cascades and new approaches to reaction compartmentalization including single-cell hydrogel encapsulation.
References
Mastering the game of Go without human knowledge
David Silver,Julian Schrittwieser,Karen Simonyan,Ioannis Antonoglou,Aja Huang,Arthur Guez,Thomas Hubert,Lucas Baker,Matthew Lai,Adrian Bolton,Yutian Chen,Timothy P. Lillicrap,Fan Hui,Laurent Sifre,George van den Driessche,Thore Graepel,Demis Hassabis +16 more
TL;DR: An algorithm based solely on reinforcement learning is introduced, without human data, guidance or domain knowledge beyond game rules, that achieves superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.
Rapid evolution of a protein in vitro by DNA shuffling.
TL;DR: It is reported here that selected mutants had a minimum inhibitory concentration of 640 μg ml-1, a 32,000-fold increase and 64-fold greater than any published TEM-1 derived enzyme.
2.6K
The FoldX web server: an online force field
TL;DR: The core functionality of FoldX, namely the calculation of the free energy of a macromolecule based on its high-resolution 3D structure, is now publicly available through a web server at FoldX.
2.6K
Probabilistic machine learning and artificial intelligence
TL;DR: This Review provides an introduction to this framework, and discusses some of the state-of-the-art advances in the field, namely, probabilistic programming, Bayesian optimization, data compression and automatic model discovery.