John Schultz
3 Papers
John Schultz is an academic researcher. The author has contributed to research in topics: Computer science & Regret. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
•Journal Article
The Advantage Regret-Matching Actor-Critic
Audrunas Gruslys,Marc Lanctot,Rémi Munos,Finbarr Timbers,Martin Schmid,Julien Perolet,Dustin Morrill,Vinicius Zambaldi,Jean-Baptiste Lespiau,John Schultz,Mohammad Gheshlaghi Azar,Michael Bowling,Karl Tuyls +12 more
TL;DR: A model-free RL algorithm, the AdvantageRegret-Matching Actor-Critic (ARMAC), which learns from sampled trajectories in a centralized training setting, without requiring the application of importance sampling commonly used in Monte Carlo counterfactual regret (CFR) minimization; hence, it does not suffer from excessive variance in large environments.
Learning to Navigate Wikipedia by Taking Random Walks
Manzil Zaheer,Kenneth Marino,Will Grathwohl,John Schultz,Wendy Shang,Sheila Babayan,Arika Ahuja,Ishita Dasgupta,Christine Kaeser-Chen,Rob Fergus +9 more
- 31 Oct 2022
TL;DR: In this paper , behavioral cloning of randomly sampled trajectories is used to learn an effective link selection policy, and the model is able to efficiently navigate between nodes 5 and 20 steps apart 96% and 92% of the time.
5
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Marc Lanctot,John Schultz,Neil Burch,Max O. Smith,Daniel Hennes,Thomas Anthony,Julien Perolat +6 more
TL;DR: In this paper, the authors propose a benchmark for multiagent learning based on repeated play of the simple game Rock, Paper, Scissors along with a population of forty-three tournament entries, some of which are intentionally sub-optimal.