Agent Foundations for Aligning Machine Intelligence with Human Interests: A Technical Research Agenda

doi:10.1007/978-3-662-54033-6_5

Book Chapter•10.1007/978-3-662-54033-6_5

Nate Soares, +1 more

- 01 Jan 2017

- pp 103-125

70

TL;DR: In this chapter, a host of technical problems that AI scientists could work on to ensure that the creation of smarter-than-human machine intelligence has a positive impact are discussed.

Citations

Journal Article•10.1080/09332480.2017.1302723

Superintelligence: Paths, Dangers, Strategies

Christian P. Robert

- 10 Mar 2017

- Chance

TL;DR: The first ultraintelligent machine is the last invention that man need ever make, provided that the machine i... Hardcover: 352 pages. Year: 2014. Publisher: Oxford University Press. ISBN-13: 9780199678112.

790

Book Chapter•10.1002/9781118914762.CH1

The logic of decision

Franco Taroni, +4 more

- 18 Jul 2014

449

Posted Content

Scalable agent alignment via reward modeling: a research direction.

Jan Leike, +5 more

- 19 Nov 2018

- arXiv: Learning

TL;DR: This work outlines a high-level research direction to solve the agent alignment problem centered around reward modeling: learning a reward function from interaction with the user and optimizing the learned reward function with reinforcement learning.

303

Journal Article•10.1017/S0020589320000366

Artificial intelligence and the limits of legal personality

Simon Chesterman

- 01 Oct 2020

- International and Comparative Law Quarte...

TL;DR: In this article, the authors argue that although most legal systems could create a novel category of legal persons, such arguments are insufficient to show that they should, and they argue that such categories should be replaced by natural persons.

121

Proceedings Article•10.24963/IJCAI.2018/768

AGI Safety Literature Review

Tom Everitt, +2 more

- 03 May 2018

TL;DR: In this paper, the authors provide an easily accessible and up-to-date collection of references for the emerging field of AGI safety, and review the current public policy on AGI.

94

References

Monograph•10.1017/CBO9780511803161

Causality: models, reasoning, and inference

Judea Pearl

- 14 Sep 2009

- Tijdschrift Voor Filosofie

TL;DR: The art and science of cause and effect have long been studied in the social sciences; see, e.g., the theory of inferred causation, causal diagrams, and the identification of causal effects.

14.9K

Journal Article•10.1609/AIMAG.V27I4.1904

A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence, August 31, 1955

John J. McCarthy, +3 more

- 15 Dec 2006

- AI Magazine

TL;DR: The 1956 Dartmouth summer research project on artificial intelligence was initiated by this August 31, 1955 proposal, authored by John McCarthy, Marvin Minsky, Nathaniel Rochester, and Claude Shannon, along with the short autobiographical statements of the proposers.

1.7K

Book

Superintelligence: Paths, Dangers, Strategies

Nick Bostrom

- 03 Jul 2014

TL;DR: In this paper, Bostrom's work picks its way carefully through a vast tract of forbiddingly difficult intellectual terrain, and the writing is so lucid that it somehow makes it all seem easy.

1.5K

Journal Article•10.1016/S0925-2312(01)00330-7

Causality: Models, Reasoning, and Inference: Judea Pearl; Cambridge University Press, Cambridge, UK, 2000, pp. 384. ISBN 0-521-77362-8

Ram Shanmugam

- 01 Oct 2001

- Neurocomputing

1.4K

Journal Article

Programming a computer for playing chess

Shannon

- 01 Jan 1950

- Philosophical Magazine

TL;DR: This paper is concerned with the problem of constructing a computing routine or “program” for a modern general purpose computer which will enable it to play chess.

940