Simon Schmitt

Google

17 Papers

66 Citations

Simon Schmitt is an academic researcher from Google. The author has contributed to research in topics: Computer science & Reinforcement learning. The author has an hindex of 8, co-authored 14 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Simon Schmitt's Top Papers

Chat about Author

Papers

•Journal Article•10.1038/S41586-020-03051-4

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Julian Schrittwieser, +11 more

- 19 Nov 2019

- arXiv: Learning

TL;DR: The MuZero algorithm is presented, which, by combining a tree-based search with a learned model, achieves superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics.

...read moreread less

1.4K

•Journal Article•10.1038/S41586-020-03051-4

Mastering Atari, Go, chess and shogi by planning with a learned model

Julian Schrittwieser, +11 more

- 23 Dec 2020

- Nature

TL;DR: MuZero as discussed by the authors is a reinforcement learning algorithm that combines a tree-based search with a learned model to achieve state-of-the-art performance in high-performance planning and visually complex domains.

...read moreread less

1.2K

•Journal Article•10.1609/AAAI.V33I01.33013796

Multi-task Deep Reinforcement Learning with PopArt

Matteo Hessel, +5 more

- 17 Jul 2019

TL;DR: This work proposes to automatically adapt the contribution of each task to the agent’s updates, so that all tasks have a similar impact on the learning dynamics, and learns a single trained policy that exceeds median human performance on this multi-task domain.

...read moreread less

366

•Posted Content

Kickstarting Deep Reinforcement Learning

Simon Schmitt, +10 more

- 10 Mar 2018

- arXiv: Learning

TL;DR: It is shown that, on a challenging and computationally-intensive multi-task benchmark (DMLab-30), kickstarted training improves the data efficiency of new agents, making it significantly easier to iterate on their design.

...read moreread less

120

•Posted Content

Muesli: Combining Improvements in Policy Optimization.

Matteo Hessel, +8 more

- 13 Apr 2021

- arXiv: Learning

TL;DR: A novel policy update that combines regularized policy optimization with model learning as an auxiliary loss and does so without using deep search: it acts directly with a policy network and has computation speed comparable to model-free baselines.

...read moreread less

...

Expand