Data, Knowledge, and Computation
TL;DR: Richard Sutton’s 2019 essay “The Bitter Lesson” is worth reading, and I would like to share my thoughts about the essay with you in this editorial.
read more
About: This article is published in Künstliche Intelligenz. The article was published on 11 Aug 2021. and is currently open access. The article focuses on the topics: Computation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Dustin Wright,Christian Igel,Gabrielle Samuel,R. Arul Selvan +3 more
TL;DR: It is argued that efficiency alone is not enough to make ML as a technology environmentally sustainable, and systems thinking as a viable path towards improving the environmental sustainability of ML holistically is presented.
References
•Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown,Benjamin Mann,Nick Ryder,Melanie Subbiah,Jared Kaplan,Prafulla Dhariwal,Arvind Neelakantan,Pranav Shyam,Girish Sastry,Amanda Askell,Sandhini Agarwal,Ariel Herbert-Voss,Gretchen Krueger,Thomas Henighan,Rewon Child,Aditya Ramesh,Daniel M. Ziegler,Jeffrey Wu,Clemens Winter,Christopher Hesse,Mark Chen,Eric Sigler,Mateusz Litwin,Scott Gray,Benjamin Chess,Jack Clark,Christopher Berner,Samuel McCandlish,Alec Radford,Ilya Sutskever,Dario Amodei +30 more
- 28 May 2020
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
•Proceedings Article
Policy Gradient Methods for Reinforcement Learning with Function Approximation
Richard S. Sutton,David McAllester,Satinder Singh,Yishay Mansour +3 more
- 29 Nov 1999
TL;DR: This paper proves for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.
Learning to Predict by the Methods of Temporal Differences
TL;DR: This article introduces a class of incremental learning procedures specialized for prediction – that is, for using past experience with an incompletely known system to predict its future behavior – and proves their convergence and optimality for special cases and relation to supervised-learning methods.
•Book
A Probabilistic Theory of Pattern Recognition
Luc Devroye,László Györfi,Gábor Lugosi +2 more
- 01 Jan 1996
TL;DR: The Bayes Error and Vapnik-Chervonenkis theory are applied as guide for empirical classifier selection on the basis of explicit specification and explicit enforcement of the maximum likelihood principle.
Neuronlike adaptive elements that can solve difficult learning control problems
Andrew G. Barto,Richard S. Sutton,Charles W. Anderson +2 more
- 01 Sep 1983
TL;DR: In this article, a system consisting of two neuron-like adaptive elements can solve a difficult learning control problem, where the task is to balance a pole that is hinged to a movable cart by applying forces to the cart base.
3.4K