Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

Journal Article

Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

- 07 Jan 2022

- Vol. abs/2201.02571

TL;DR: This method combines two ideas: neural network pruning and taking into account input data correlations; it makes it possible to update neuron states only when changes in them exceed a certain threshold, and reduces the number of multiplications when running neural networks.

Abstract: This article proposes a sparse computation-based method for optimizing neural networks for reinforcement learning (RL) tasks. This method combines two ideas: neural network pruning and taking into account input data correlations; it makes it possible to update neuron states only when changes in them exceed a certain threshold. It signiﬁcantly reduces the number of multiplications when running neural networks. We tested diﬀerent RL tasks and achieved 20-150x reduction in the number of multiplications. There were no substantial performance losses; sometimes the performance even improved.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Table 3: Number of multiplications in an unoptimized neural network

Figure 4: Results for Robotank, BattleZone, Enduro and Freeway.

Figure 1: DQN architecture. DQN consists of three convolutional layers and two dense layers. This architecture is suitable for all the video games (the number of outputs at the last layer is the only value that changes)

Table 4: Number of multiplications in Breakout with 0.79 sparsity and 0.001 threshold

Table 5: Number of multiplications in SpaceInvaders with 0.74 sparsity and 0.001 threshold

References

•Proceedings Article

Optimal Brain Damage

Yann LeCun, +2 more

- 01 Jan 1989

TL;DR: A class of practical and nearly optimal schemes for adapting the size of a neural network by using second-derivative information to make a tradeoff between network complexity and training set error is derived.

...read moreread less

4.5K

•Proceedings Article

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks.

Jonathan Frankle, +1 more

- 04 Mar 2019

TL;DR: This work finds that dense, randomly-initialized, feed-forward networks contain subnetworks ("winning tickets") that - when trained in isolation - reach test accuracy comparable to the original network in a similar number of iterations, and articulate the "lottery ticket hypothesis".

...read moreread less

2.6K

•Proceedings Article

Second order derivatives for network pruning: Optimal Brain Surgeon

Babak Hassibi, +1 more

- 30 Nov 1992

TL;DR: Of OBS, Optimal Brain Damage, and magnitude-based methods, only OBS deletes the correct weights from a trained XOR network in every case, and thus yields better generalization on test data.

...read moreread less

2.1K

•Journal Article

Double Q-Learning

Hado van Hasselt

- 01 Jan 2010

- IEEE Intelligent Systems

TL;DR: An alternative way to approximate the maximum expected value for any set of random variables is introduced and the obtained double estimator method is shown to sometimes underestimate rather than overestimate themaximum expected value.

...read moreread less

1K

Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

Chat with Paper

AI Agents for this Paper

Figures

References

Human-level control through deep reinforcement learning

Optimal Brain Damage

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks.

Second order derivatives for network pruning: Optimal Brain Surgeon

Double Q-Learning