Ryan D'Orazio

Université de Montréal

13 Papers

20 Citations

Ryan D'Orazio is an academic researcher from Université de Montréal. The author has contributed to research in topics: Computer science & Regret. The author has an h-index of 4 and has co-authored 13 publications. Previous affiliations of Ryan D'Orazio include University of Alberta.

Papers

•Posted Content

Solving Common-Payoff Games with Approximate Policy Iteration.

Samuel Sokota, +8 more

- 11 Jan 2021

- arXiv: Artificial Intelligence

TL;DR: This work proposes CAPI, a novel algorithm which, like BAD, combines common knowledge with deep reinforcement learning. Unlike BAD, however, CAPI prioritizes the propensity to discover optimal joint policies over scalability, which precludes it from scaling to games as large as Hanabi.

•Posted Content

Hindsight and Sequential Rationality of Correlated Play.

Dustin Morrill, +6 more

- 10 Dec 2020

- arXiv: Computer Science and Game Theory

TL;DR: This work develops and advocates for a hindsight rationality framing of learning in general sequential decision-making settings, and re-examines mediated equilibrium and deviation types in extensive-form games, thereby gaining a more complete understanding and resolving past misconceptions.

•Posted Content

Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization

Ryan D'Orazio, +3 more

- 06 Dec 2019

- arXiv: Artificial Intelligence

TL;DR: This work derives approximation error-aware regret bounds for $(\Phi, f)$-regret matching, which applies to a general class of link functions and regret objectives and provides a theoretical justification for RCFR implementations with alternative policy parameterizations, including softmax.

•Posted Content

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games

Dustin Morrill, +5 more

- 13 Feb 2021

- arXiv: Computer Science and Game Theory

TL;DR: In this paper, the authors formalize behavioral deviations as a general class of deviations that respect the structure of extensive-form games, and introduce an extensive-form regret minimization (EFR) algorithm that achieves hindsight rationality for any given set of behavioral deviations with computation that scales closely with the complexity of the set.

•Proceedings Article

Alternative Function Approximation Parameterizations for Solving Games: An Analysis of ƒ-Regression Counterfactual Regret Minimization

Ryan D'Orazio, +3 more

- 05 May 2020

TL;DR: In this article, the authors derive approximation error-aware regret bounds for (Φ, ƒ)-regret matching, which applies to a general class of link functions and regret objectives.
