Benja Fallenstein
Machine Intelligence Research Institute
9 Papers
71 Citations
Benja Fallenstein is an academic researcher from Machine Intelligence Research Institute. The author has contributed to research in topics: Oracle & HOL. The author has an hindex of 6, co-authored 9 publications.
Chat about Author
Papers
•Posted Content
Toward Idealized Decision Theory
Nate Soares,Benja Fallenstein +1 more
TL;DR: The shortcomings of two standard formulations of decision theory are discussed, and it is demonstrated that they cannot be used to describe an idealized decision procedure suitable for approximation by artificial systems.
34
•Posted Content
Robust Cooperation in the Prisoner's Dilemma: Program Equilibrium via Provability Logic
Mihály Bárász,Paul F. Christiano,Benja Fallenstein,Marcello Herreshoff,Patrick LaVictoire,Eliezer Yudkowsky +5 more
TL;DR: This work considers the one-shot Prisoner's Dilemma between algorithms with read-access to one anothers' source codes, and uses the modal logic of provability to build agents that can achieve mutual cooperation in a manner that is robust, in that cooperation does not require exact equality of the agents' source code.
Problems of Self-reference in Self-improving Space-Time Embedded Intelligence
Benja Fallenstein,Nate Soares +1 more
- 01 Aug 2014
TL;DR: It is shown that in one particular model based on formal logic, naive approaches either lead to incorrect reasoning that allows an agent to put off an important task forever (the procrastination paradox), or fail to allow the agent to justify even obviously safe rewrites (the Lobian obstacle).
Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence
Benja Fallenstein,Jessica Taylor,Paul F. Christiano +2 more
- 27 Oct 2015
TL;DR: This paper proposes a framework in which agents and their environments are both modelled as probablistic oracle machines with access to a “reflective” oracle, which is able to answer questions about the outputs of other machines with Access to the same oracle.
16
Proof-Producing Reflection for HOL
Benja Fallenstein,Ramana Kumar +1 more
- 24 Aug 2015
TL;DR: In this article, the authors present a reflection principle of the form "If the cardinality of a cardinal is provable, then the cardinal has the same meaning both inside and outside of the HOL4 theorem prover".
10