Ryan D'Orazio
Université de Montréal
13 Papers
20 Citations
Ryan D'Orazio is an academic researcher from Université de Montréal. The author has contributed to research in topics: Computer science & Regret. The author has an hindex of 4, co-authored 13 publications. Previous affiliations of Ryan D'Orazio include University of Alberta.
Chat about Author
Papers
•Posted Content
Solving Common-Payoff Games with Approximate Policy Iteration.
Samuel Sokota,Edward Lockhart,Finbarr Timbers,Elnaz Davoodi,Ryan D'Orazio,Neil Burch,Martin Schmid,Michael Bowling,Marc Lanctot +8 more
TL;DR: This work proposes CAPI, a novel algorithm which, like BAD, combines common knowledge with deep reinforcement learning, however, unlike BAD, CAPI prioritizes the propensity to discover optimal joint policies over scalability, which precludes CAPI from scaling to games as large as Hanabi.
21
•Posted Content
Hindsight and Sequential Rationality of Correlated Play.
Dustin Morrill,Ryan D'Orazio,Reca Sarfati,Marc Lanctot,James R. Wright,Amy Greenwald,Michael Bowling +6 more
TL;DR: This work develops and advocate for this hindsight rationality framing of learning in general sequential decision-making settings, and re-examines mediated equilibrium and deviation types in extensive-form games, thereby gaining a more complete understanding and resolving past misconceptions.
21
•Posted Content
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization
TL;DR: This work derives approximation error-aware regret bounds for $(\Phi, f)$-regret matching, which applies to a general class of link functions and regret objectives and provides a theoretical justification for RCFR implementations with alternative policy parameterizations, including softmax.
10
•Posted Content
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
TL;DR: In this paper, the authors formalize behavioral deviations as a general class of deviations that respect the structure of extensive-form games, and introduce an extensive form regret minimization (EFR) algorithm that achieves hindsight rationality for any given set of behavioral deviations with computation that scales closely with the complexity of the set.
9
•Proceedings Article
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of ƒ-Regression Counterfactual Regret Minimization
Ryan D'Orazio,Dustin Morrill,James R. Wright,Michael Bowling +3 more
- 05 May 2020
TL;DR: In this article, the authors derive approximation error-aware regret bounds for (¶hi, ƒ)-regret matching, which applies to a general class of link functions and regret objectives.
6