Reinforcement learning algorithms as function optimizers

doi:10.1109/IJCNN.1989.118683

Proceedings Article•10.1109/IJCNN.1989.118683

Williams, +1 more

- 01 Jan 1989

- pp 89-95

22

TL;DR: The results of simulations in which the optima of several deterministic functions studied by D.H. Ackley were sought using variants of REINFORCE algorithms compare favorably to the best results found by Ackley.
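As a rough illustration of the idea summarized above, the sketch below uses a REINFORCE-style update on independent Bernoulli-logistic units to search for a maximizer of a function over bit strings. It is a minimal sketch under stated assumptions, not the configuration used in the paper: the test function `onemax`, the moving-average reward baseline, and all parameter values (`lr`, `decay`, `steps`) are hypothetical choices made for this example, and none of Ackley's functions are reproduced here.

```python
import math
import random


def onemax(bits):
    """Hypothetical test function: fraction of bits set to 1 (maximum 1.0)."""
    return sum(bits) / len(bits)


def reinforce_optimize(f, n_bits=10, steps=200, lr=0.1, decay=0.9, seed=0):
    """Search for a maximizer of f over bit strings with Bernoulli-logistic units."""
    rng = random.Random(seed)
    theta = [0.0] * n_bits      # one logit per bit, so each p starts at 0.5
    baseline = 0.0              # moving average of past rewards (an assumption)
    best_bits, best_value = None, float("-inf")
    for _ in range(steps):
        probs = [1.0 / (1.0 + math.exp(-t)) for t in theta]
        bits = [1 if rng.random() < p else 0 for p in probs]
        reward = f(bits)
        if reward > best_value:
            best_bits, best_value = bits, reward
        # REINFORCE update for a Bernoulli-logistic unit:
        # delta_theta_i = lr * (reward - baseline) * (y_i - p_i)
        for i in range(n_bits):
            theta[i] += lr * (reward - baseline) * (bits[i] - probs[i])
        baseline = decay * baseline + (1.0 - decay) * reward
    final_probs = [1.0 / (1.0 + math.exp(-t)) for t in theta]
    return best_bits, best_value, final_probs


best_bits, best_value, final_probs = reinforce_optimize(onemax)
print(best_value, [round(p, 2) for p in final_probs])
```

The function being optimized plays the role of the reinforcement signal, so the search needs only function evaluations, never gradients of the function itself.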

Citations

Journal Article•10.1080/09540099108946587

Function Optimization using Connectionist Reinforcement Learning Algorithms

Ronald J. Williams, +1 more

- 01 Jan 1991

- Connection Science

TL;DR: One of these variants, called REINFORCE/MENT, represents a novel but principled approach to reinforcement learning in nontrivial networks which incorporates an entropy maximization strategy.

438

Journal Article•10.1088/0954-898X_8_4_001

Basal ganglia: structure and computations.

Jeff Wickens

- 01 Jan 1997

- Network: Computation In Neural Systems

TL;DR: Computational modelling can help to advance theoretical understanding of the role of basal ganglia in the selection and performance of learnt behaviours, and also in the effects of reinforcement on acquisition and maintenance of new behaviours.

83

Journal Article•10.1016/J.NUCENGDES.2020.110966

Physics-informed reinforcement learning optimization of nuclear assembly design

Majdi I. Radaideh, +7 more

- 01 Feb 2021

- Nuclear Engineering and Design

TL;DR: This work proposes a physics-informed AI optimization methodology that uses reward shaping to connect RL with the tactics fuel designers follow in practice, namely moving fuel rods in the assembly to meet specific constraints and objectives. It also demonstrates the effectiveness of RL as another decision support tool for nuclear fuel assembly optimization.

62

Journal Article•10.1109/72.125861

Using random weights to train multilayer networks of hard-limiting units

P.L. Bartlett, +1 more

- 01 Mar 1992

- IEEE Transactions on Neural Networks

TL;DR: A gradient descent algorithm suitable for training multilayer feedforward networks of processing units with hard-limiting output functions is presented and its performance is similar to that of conventional backpropagation applied to networks of units with sigmoidal characteristics.

48

Journal Article•10.1016/J.INS.2004.09.004

A learning automata based algorithm for optimization of continuous complex functions

Xianyi Zeng, +1 more

- 11 Aug 2005

- Information Sciences

TL;DR: This method can be regarded as a form of active learning that selects the most significant data samples on-line, so that it converges quickly to a quasi-global optimum of the functions being optimized with fewer tests or calculations.

38

References

Journal Article•10.1126/SCIENCE.220.4598.671

Optimization by Simulated Annealing

Scott Kirkpatrick, +2 more

- 13 May 1983

- Science

TL;DR: There is a deep and useful connection between statistical mechanics and multivariate or combinatorial optimization (finding the minimum of a given function depending on many parameters), and a detailed analogy with annealing in solids provides a framework for optimization of very large and complex systems.

46.9K

Book

Computers and Intractability: A Guide to the Theory of NP-Completeness

Michael Randolph Garey, +1 more

- 01 Jan 1979

TL;DR: A quarterly column, here in its second edition, provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and D. S. Johnson in their book "Computers and Intractability: A Guide to the Theory of NP-Completeness," W. H. Freeman & Co., San Francisco, 1979.

46.2K

Johnson: Computers and Intractability-A Guide to the Theory of NP-Completeness

Michael Randolph Garey

- 01 Jan 1979

42.6K

Book

Adaptation in natural and artificial systems

John H. Holland

- 01 Jan 1975

TL;DR: A founding work in the area of adaptation and modification, which aims to mimic biological optimization, and in some (non-GA) branches of AI.

40.3K

Journal Article•10.1007/BF00339943

Neural computation of decisions in optimization problems

John J. Hopfield, +1 more

- 01 Jul 1985

- Biological Cybernetics

TL;DR: Results of computer simulations of a network designed to solve a difficult but well-defined optimization problem, the Traveling-Salesman Problem, are presented and used to illustrate the computational power of the networks.

6K

Related Papers (5)

Learning internal representations by error propagation

Identification and optimal control of nonlinear systems using recurrent neural networks and reinforcement learning: An overview

Function Optimization using Connectionist Reinforcement Learning Algorithms

Learning to Predict by the Methods of Temporal Differences

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning