Tom Everitt
Australian National University
51 Papers
185 Citations
Tom Everitt is an academic researcher from Australian National University. The author has contributed to research in topics: Reinforcement learning & Computer science. The author has an hindex of 13, co-authored 44 publications. Previous affiliations of Tom Everitt include Stockholm University & Google.
Chat about Author
Papers
•Posted Content
Scalable agent alignment via reward modeling: a research direction.
TL;DR: This work outlines a high-level research direction to solve the agent alignment problem centered around reward modeling: learning a reward function from interaction with the user and optimizing the learned reward function with reinforcement learning.
303
AGI Safety Literature Review
Tom Everitt,Gary Lea,Marcus Hutter +2 more
- 03 May 2018
TL;DR: In this paper, the authors provide an easily accessible and up-to-date collection of references for the emerging field of AGI safety, and review the current public policy on AGI.
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin,Suraj Narayanan Sasikumar,Tom Everitt,Marcus Hutter +3 more
- 01 Aug 2017
TL;DR: In this article, a count-based optimistic exploration algorithm for reinforcement learning (RL) that is feasible in environments with high-dimensional state-action spaces is introduced. But this algorithm requires the agent to explore in feature space rather than in untransformed state space.
•Posted Content
Count-Based Exploration in Feature Space for Reinforcement Learning
TL;DR: In this article, a count-based optimistic exploration algorithm for reinforcement learning (RL) that is feasible in environments with high-dimensional state-action spaces is introduced. But this algorithm requires the agent to explore in feature space rather than in untransformed state space.
54
Avoiding wireheading with value reinforcement learning
Tom Everitt,Marcus Hutter +1 more
- 16 Jul 2016
TL;DR: In this paper, value reinforcement learning (VRL) is proposed to remove the incentive to wirehead by placing a constraint on the agent's actions, which does not require explicit specification of which actions constitute wireheading.
37