Mathieu Reymond

9 Papers

3 Citations

Mathieu Reymond is an academic researcher. The author has contributed to research in topics: Computer science & Reinforcement learning. The author has an hindex of 3, co-authored 7 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Mathieu Reymond's Top Papers

Chat about Author

Papers

•Journal Article•10.1007/s10458-022-09552-y

A practical guide to multi-objective reinforcement learning and planning

Conor Hayes, +17 more

- 01 Apr 2022

- Autonomous Agents and Multi-Agent System...

TL;DR: In this article , a guide to the application of multi-objective decision-making methods to difficult problems is presented, aimed at researchers who are already familiar with singleobjective reinforcement learning and planning methods and who wish to adopt a multiobjective perspective on their research.

...read moreread less

150

Journal Article•10.1007/s10458-023-09604-x

Actor-critic multi-objective reinforcement learning for non-linear utility functions

Mathieu Reymond, +4 more

- 28 Apr 2023

- Autonomous Agents and Multi-Agent System...

TL;DR: A novel multi-objective reinforcement learning algorithm that successfully learns the optimal policy even for non-linear utility functions, avoiding the need to learn the full Pareto front.

...read moreread less

Proceedings Article•10.48550/arXiv.2204.05036

Pareto Conditioned Networks

Mathieu Reymond, +2 more

- 11 Apr 2022

TL;DR: Pareto Conditioned Networks (PCN) is proposed, a method that uses a single neural network to encompass all non-dominated policies and is stable as it learns in a supervised fashion, thus avoiding moving target issues.

...read moreread less

•Journal Article•10.1007/s10458-022-09596-0

Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning

Conor Hayes, +4 more

- 23 Nov 2022

- Autonomous Agents and Multi-Agent System...

TL;DR: In this article , a Monte Carlo tree search algorithm is proposed to compute policies for nonlinear utility functions (NLU-MCTS) by optimising the utility of the different possible returns attainable from individual policy executions, resulting in good policies for both risk-aware and multiobjective settings.

...read moreread less

Journal Article•10.48550/arXiv.2204.05027

Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning

Mathieu Reymond, +10 more

- 11 Apr 2022

- arXiv.org

TL;DR: This work contributes a multi-objective Markov decision process that encapsulates the stochastic compartment model that was used to inform policy makers during the COVID-19 epidemic and evaluates the solution returned by PCN, which correctly learns to reduce the social burden whenever the hospitalization rates are sufficiently low.

...read moreread less