Open AccessProceedings Article
Reinforcement Learning for Algorithm Selection
Michail G. Lagoudakis,Michael L. Littman +1 more
- 30 Jul 2000
- pp 1081
TL;DR: This work considers the problem of algorithm selection: dynamically choose an algorithm to attack an instance or subinstances of a problem with the goal of minimizing the overall execution time, and uses ideas from reinforcement learning to solve it.
read more
Abstract: Many computational problems can be solved by multiple algorithms, with different algorithms fastest for different problem sizes, input distributions, and hardware characteristics. We consider the problem of algorithm selection: dynamically choose an algorithm to attack an instance or subinstances (due to recursive calls) of a problem with the goal of minimizing the overall execution time. We formulate the problem as a kind of Markov Decision Process (MDP), and use ideas from reinforcement learning (RL) to solve it. The process’ state consists of a set of instance features, such as problem size. Actions are the different algorithms we can choose from. Non-recursive algorithms are terminal in that they solve the problem completely (terminal state). Recursive algorithms create subproblems and therefore cause transitions to other states, making the task a sequential decision task. The immediate cost of a decision is the real time taken for executing the selected algorithm on the current instance, excluding time taken in recursive calls. Thus, the total (undiscounted) cost during an episode is the time taken to solve the problem. The goal is a policy that minimizes the total cost/time. This process differs from a standard MDP as it allows one-to-many state transitions (multiple recursive calls at one level). Our initial experiments focus on the problem of order statistic selection: given an array of (unordered) numbers and some index , select the number that would rank -th if the array were sorted. We picked two algorithms such that neither is best in all cases, otherwise learning would not help. DETERMINISTIC SELECT ( )i s an recursive algorithm and HEAP SELECT ( )i s an algorithm
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder,Raghu Rajan,Xingyou Song,André Biedenkapp,Yingjie Miao,Theresa Eimer,Vu Nguyen,Roberto Calandra,Aleksandra Faust,Frank Hutter,Marius Lindauer +10 more
TL;DR: This survey seeks to unify the field of AutoRL, provide a common taxonomy, discuss each area in detail and pose open problems of interest to researchers going forward.
•Proceedings Article
Collaborative Evolutionary Reinforcement Learning.
Shauharda Khadka,Somdeb Majumdar,Tarek Nassar,Zach Dwiel,Evren Tumer,Santiago Miret,Yinyin Liu,Kagan Tumer +7 more
- 24 May 2019
TL;DR: Collaborative Evolutionary Reinforcement Learning (CERL) is introduced, a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space and significantly outperforms its composite learners while remaining overall more sample-efficient.
Parameter Adjustment Based on Performance Prediction: Towards an Instance-Aware Problem Solver
Frank Hutter,Youssef Hamadi +1 more
- 01 Jan 2005
TL;DR: It is demonstrated that it is possible to fairly accurately predict the runtime of SAPS, one of the best-performing stochastic local search algorithms for SAT, and sequential Bayesian learning is introduced to the problem of runtime prediction, enabling an incremental learning approach and yielding very informative estimates of predictive uncertainty.
Learning When to Use Lazy Learning in Constraint Solving
Ian P. Gent,Christopher Jefferson,Lars Kotthoff,Ian Miguel,Neil C. A. Moore,Peter Nightingale,Karen E. Petrie +6 more
- 04 Aug 2010
TL;DR: This work shows that, in the context of a large benchmark set, standard ML approaches can be used to learn a simple, cheap classifier which performs well in identifying instances on which lazy learning should or should not be used.
53
•Posted Content
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka,Somdeb Majumdar,Tarek Nassar,Zach Dwiel,Evren Tumer,Santiago Miret,Yinyin Liu,Kagan Tumer +7 more
TL;DR: In this paper, the authors introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space.
39
References
•Book
Introduction to Algorithms
Thomas H. Cormen,Charles E. Leiserson,Ronald L. Rivest +2 more
- 01 Jan 1990
TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
24.8K
•Proceedings Article
Algorithm Selection using Reinforcement Learning
Michail G. Lagoudakis,Michael L. Littman +1 more
- 29 Jun 2000
TL;DR: A kind of MDP that models the algorithm selection problem by allowing multiple state transitions is introduced, and the well known Q-learning algorithm is adapted for this case in a way that combines both Monte-Carlo and Temporal Difference methods.
168