Open AccessPosted Content
Gryffin: An algorithm for Bayesian optimization for categorical variables informed by physical intuition with applications to chemistry
Florian Häse,Loïc M. Roch,Alán Aspuru-Guzik +2 more
- 26 Mar 2020
20
TL;DR: Gryffin is introduced, as a general purpose optimization framework for the autonomous selection of categorical variables driven by expert knowledge and augments Bayesian optimization with kernel density estimation using smooth approximations to categorical distributions.
read more
Abstract: Designing functional molecules and advanced materials requires complex interdependent design choices: tuning continuous process parameters such as temperatures or flow rates, while simultaneously selecting categorical variables like catalysts or solvents. To date, the development of data-driven experiment planning strategies for autonomous experimentation has largely focused on continuous process parameters despite the urge to devise efficient strategies for the selection of categorical variables to substantially accelerate scientific discovery. We introduce Gryffin, as a general purpose optimization framework for the autonomous selection of categorical variables driven by expert knowledge. Gryffin augments Bayesian optimization with kernel density estimation using smooth approximations to categorical distributions. Leveraging domain knowledge from physicochemical descriptors to characterize categorical options, Gryffin can significantly accelerate the search for promising molecules and materials. Gryffin can further highlight relevant correlations between the provided descriptors to inspire physical insights and foster scientific intuition. In addition to comprehensive benchmarks, we demonstrate the capabilities and performance of Gryffin on three examples in materials science and chemistry: (i) the discovery of non-fullerene acceptors for organic solar cells, (ii) the design of hybrid organic-inorganic perovskites for light-harvesting, and (iii) the identification of ligands and process parameters for Suzuki-Miyaura reactions. Our observations suggest that Gryffin, in its simplest form without descriptors, constitutes a competitive categorical optimizer compared to state-of-the-art approaches. However, when leveraging domain knowledge provided via descriptors, Gryffin can optimize at considerable higher rates and refine this domain knowledge to spark scientific understanding.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Bayesian reaction optimization as a tool for chemical synthesis.
Benjamin J. Shields,Jason M. Stevens,Jun Li,Marvin Parasram,Farhan Damani,Jesus I. Martinez Alvarado,Jacob M. Janey,Ryan P. Adams,Abigail G. Doyle +8 more
TL;DR: In this paper, the authors report the development of a framework for Bayesian reaction optimization and an open-source software tool that allows chemists to easily integrate state-of-the-art optimization algorithms into their everyday laboratory practices.
633
Data-Driven Strategies for Accelerated Materials Design.
Robert Pollice,Gabriel dos Passos Gomes,Matteo Aldeghi,Riley J. Hickman,Mario Krenn,Cyrille Lavigne,Michael Lindner-D’Addario,AkshatKumar Nigam,Cher Tian Ser,Zhenpeng Yao,Alán Aspuru-Guzik +10 more
TL;DR: The most recent contributions of this group in this thriving field of machine learning for material science are reviewed, focusing on small molecules as organic electronic materials and crystalline materials and the data-driven approaches they employed to speed up discovery and derive material design strategies.
311
Materials Acceleration Platforms: On the way to autonomous experimentation
Martha M. Flores-Leonar,L.M. Mejía-Mendoza,Andrés Aguilar-Granda,Benjamin Sanchez-Lengeling,Hermann Tribukait,Carlos Amador-Bedolla,Alán Aspuru-Guzik +6 more
TL;DR: This work presents state-of-the-art robotic platforms and machine learning approaches for autonomous experimentation, their integration, and applications, particularly in the field of materials for clean energy technologies.
133
Gryffin: An algorithm for Bayesian optimization of categorical variables informed by expert knowledge
TL;DR: Gryffin this article augments Bayesian optimization based on kernel density estimation with smooth approximations to categorical distributions, which can significantly accelerate the search for promising molecules and materials.
113
Ready, Set, Flow! Automated Continuous Synthesis and Optimization
Christopher P. Breen,Anirudh Manoj K. Nambiar,Timothy F. Jamison,Klavs F. Jensen +3 more
- 01 May 2021
TL;DR: Recent case studies that present strategies towards realizing automated synthesis with a further focus on works that leverage continuous flow chemistry as an enabling technology are discussed.
103
References
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Genetic algorithms in search, optimization and machine learning
David E. Goldberg
- 01 Jan 1989
TL;DR: This book brings together the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields.
58.6K
•Book
Genetic algorithms in search, optimization, and machine learning
David E. Goldberg
- 01 Sep 1988
TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.
•Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma,Max Welling +1 more
- 01 Jan 2014
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
•Book
Genetic Programming: On the Programming of Computers by Means of Natural Selection
John R. Koza
- 01 Jan 1992
TL;DR: This book discusses the evolution of architecture, primitive functions, terminals, sufficiency, and closure, and the role of representation and the lens effect in genetic programming.
15K