Richie Steigerwald
3 Papers
Richie Steigerwald is an academic researcher. The author has contributed to research in topics: Computer science. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
Genie: Generative Interactive Environments
Jake Bruce,Michael Dennis,Ashley Edwards,Jack Parker-Holder,Yuge Shi,Edward Hughes,Matthew Lai,Aditi Mavalankar,Richie Steigerwald,Chris Apps,Yusuf Aytar,Sarah Bechtle,Feryal Behbahani,Stephanie Chan,Nicolas Heess,Lucy Gonzalez,Simon Osindero,Sherjil Ozair,Scott Reed,Jingwei Zhang,Konrad Żołna,Jeff Clune,Nando de Freitas,Satinder Singh,Tim Rocktaschel +24 more
TL;DR: Genie is introduced, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos, which enables users to act in the generated environments on a frame-by-frame basis despite training without any ground-truth action labels or other domain-specific requirements typically found in the world model literature.
39
Vision-Language Models as a Source of Rewards
Kate Baumli,Satinder Singh Baveja,Feryal Behbahani,Harris Chan,Gheorghe Comanici,Sebastian Flennerhag,Maxime Gazeau,Kristian Holsheimer,Dan Horgan,Michael Laskin,Clare Lyle,Hussain Masoom,Kay McKinney,Volodymyr Mnih,Alexander Neitz,Fabio Pardo,Jack Parker-Holder,John Quan,Tim Rocktaschel,Himanshu Sahni,Tom Schaul,Yannick Schroecker,Stephen Spencer,Richie Steigerwald,Luyu Wang,Lei Zhang +25 more
TL;DR: This work investigates the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of rewards for reinforcement learning agents and presents a scaling trend showing how larger VLMs lead to more accurate rewards for visual goal achievement, which in turn produces more capable RL agents.
13
In-context Reinforcement Learning with Algorithm Distillation
Michael Laskin,Luyu Wang,Junhyuk Oh,Emilio Parisotto,Stephen Spencer,Richie Steigerwald,DJ Strouse,Steven Hansen,Angelos Filos,Ethan Brooks,Maxime Gazeau,Himanshu Sahni,Satinder Singh,Volodymyr Mnih +13 more
- 25 Oct 2022
TL;DR: It is demonstrated that AD can reinforcement learn in-context in a variety of environments with sparse rewards, combinatorial task structure, and pixel-based observations, and that AD learns a more data-efficient RL algorithm than the one that generated the source data.