Reinforcement Learning for Algorithm Selection

Open AccessProceedings Article

Reinforcement Learning for Algorithm Selection

- 30 Jul 2000

- pp 1081

39

TL;DR: This work considers the problem of algorithm selection: dynamically choose an algorithm to attack an instance or subinstances of a problem with the goal of minimizing the overall execution time, and uses ideas from reinforcement learning to solve it.

Abstract: Many computational problems can be solved by multiple algorithms, with different algorithms fastest for different problem sizes, input distributions, and hardware characteristics. We consider the problem of algorithm selection: dynamically choose an algorithm to attack an instance or subinstances (due to recursive calls) of a problem with the goal of minimizing the overall execution time. We formulate the problem as a kind of Markov Decision Process (MDP), and use ideas from reinforcement learning (RL) to solve it. The process’ state consists of a set of instance features, such as problem size. Actions are the different algorithms we can choose from. Non-recursive algorithms are terminal in that they solve the problem completely (terminal state). Recursive algorithms create subproblems and therefore cause transitions to other states, making the task a sequential decision task. The immediate cost of a decision is the real time taken for executing the selected algorithm on the current instance, excluding time taken in recursive calls. Thus, the total (undiscounted) cost during an episode is the time taken to solve the problem. The goal is a policy that minimizes the total cost/time. This process differs from a standard MDP as it allows one-to-many state transitions (multiple recursive calls at one level). Our initial experiments focus on the problem of order statistic selection: given an array of (unordered) numbers and some index , select the number that would rank -th if the array were sorted. We picked two algorithms such that neither is best in all cases, otherwise learning would not help. DETERMINISTIC SELECT ( )i s an recursive algorithm and HEAP SELECT ( )i s an algorithm

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1613/jair.1.13596

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Jack Parker-Holder, +10 more

- 11 Jan 2022

- Journal of Artificial Intelligence Resea...

TL;DR: This survey seeks to unify the field of AutoRL, provide a common taxonomy, discuss each area in detail and pose open problems of interest to researchers going forward.

...read moreread less

99

•Proceedings Article

Collaborative Evolutionary Reinforcement Learning.

Shauharda Khadka, +7 more

- 24 May 2019

TL;DR: Collaborative Evolutionary Reinforcement Learning (CERL) is introduced, a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space and significantly outperforms its composite learners while remaining overall more sample-efficient.

...read moreread less

68

Parameter Adjustment Based on Performance Prediction: Towards an Instance-Aware Problem Solver

Frank Hutter, +1 more

- 01 Jan 2005

TL;DR: It is demonstrated that it is possible to fairly accurately predict the runtime of SAPS, one of the best-performing stochastic local search algorithms for SAT, and sequential Bayesian learning is introduced to the problem of runtime prediction, enabling an incremental learning approach and yielding very informative estimates of predictive uncertainty.

...read moreread less

54

•Proceedings Article•10.3233/978-1-60750-606-5-873

Learning When to Use Lazy Learning in Constraint Solving

Ian P. Gent, +6 more

- 04 Aug 2010

TL;DR: This work shows that, in the context of a large benchmark set, standard ML approaches can be used to learn a simple, cheap classifier which performs well in identifying instances on which lazy learning should or should not be used.

...read moreread less

53

•Posted Content

Collaborative Evolutionary Reinforcement Learning

Shauharda Khadka, +7 more

- 02 May 2019

- arXiv: Learning

TL;DR: In this paper, the authors introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space.

...read moreread less

39

...

Expand

References

•Book

Introduction to Algorithms

Thomas H. Cormen, +2 more

- 01 Jan 1990

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.

...read moreread less

24.8K

•Proceedings Article

Algorithm Selection using Reinforcement Learning

Michail G. Lagoudakis, +1 more

- 29 Jun 2000

TL;DR: A kind of MDP that models the algorithm selection problem by allowing multiple state transitions is introduced, and the well known Q-learning algorithm is adapted for this case in a way that combines both Monte-Carlo and Temporal Difference methods.

...read moreread less

168

Reinforcement Learning for Algorithm Selection

Chat with Paper

AI Agents for this Paper

Citations

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Collaborative Evolutionary Reinforcement Learning.

Parameter Adjustment Based on Performance Prediction: Towards an Instance-Aware Problem Solver

Learning When to Use Lazy Learning in Constraint Solving

Collaborative Evolutionary Reinforcement Learning

References

Introduction to Algorithms

Algorithm Selection using Reinforcement Learning

Related Papers (5)

Dynamic Algorithm Selection Using Reinforcement Learning

The Algorithm Selection Problem

Complexity Analysis of Real-Time Reinforcement Learning Applied to Finding Shortest Paths in Deterministic Domains

A machine learning-based branch and price algorithm for a sampled vehicle routing problem

Algorithm portfolio selection as a bandit problem with unbounded losses