Some methods of speeding up the convergence of iteration methods

doi:10.1016/0041-5553(64)90137-5

Journal Article

Boris T. Polyak

- 01 Jan 1964

- USSR Computational Mathematics and Mathe...

- Vol. 4, Iss: 5, pp 1-17

3.1K

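TL;DR: This article considers multistep iteration methods that converge substantially faster than one-step methods for solving $P(x) = 0$. The methods also apply to minimizing a differentiable functional $f(x)$ in Hilbert space, insofar as that problem reduces to solving $\operatorname{grad} f(x) = 0$.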

Abstract: For the solution of the functional equation $P(x) = 0$ (1), where $P$ is an operator, usually linear, from $B$ into $B$, and $B$ is a Banach space, iteration methods are generally used. These consist of the construction of a sequence $x_0, \ldots, x_n, \ldots$ which converges to the solution (see, for example, [1]). Continuous analogues of these methods are also known, in which a trajectory $x(t)$, $0 \le t < \infty$, is constructed which satisfies an ordinary differential equation in $B$ and is such that $x(t)$ approaches the solution of (1) as $t \to \infty$ (see [2]). We shall call a method a $k$-step method if the construction of each successive iterate $x_{n+1}$ uses the $k$ previous iterates $x_n, \ldots, x_{n-k+1}$. The same term will also be used for continuous methods if $x(t)$ satisfies a differential equation of the $k$-th order or $k$-th degree. The most widely used iteration methods are one-step methods (e.g. the method of successive approximations). They are generally computationally simple but often converge very slowly, as confirmed both by estimates of the rate of convergence and by practical computation (for more details see below). The question of the rate of convergence is therefore most important. Some multistep methods, considered further on, are only slightly more complicated than the corresponding one-step methods yet make it possible to speed up convergence substantially. Note that all the methods mentioned below are also applicable to the problem of minimizing a differentiable functional $f(x)$ in Hilbert space, insofar as this problem reduces to the solution of the equation $\operatorname{grad} f(x) = 0$.
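To make the one-step versus multistep distinction concrete, here is a minimal sketch in Python. The abstract gives no formulas, so everything below is an assumption made for illustration: the two-step update $x_{n+1} = x_n - \alpha \operatorname{grad} f(x_n) + \beta (x_n - x_{n-1})$ is the form commonly attributed to this paper (often called the heavy-ball method), and the diagonal quadratic test problem, the step sizes, and the function names `one_step` and `two_step` are hypothetical choices, not taken from the source.

```python
import math

def grad(x, eigs):
    """Gradient of the assumed test functional f(x) = 0.5 * sum(eigs[i] * x[i]**2)."""
    return [lam * xi for lam, xi in zip(eigs, x)]

def norm(x):
    """Euclidean norm; the minimizer is 0, so this is also the error."""
    return math.sqrt(sum(xi * xi for xi in x))

def one_step(x0, eigs, alpha, tol=1e-8, max_iter=100_000):
    """One-step method: x_{n+1} = x_n - alpha * grad f(x_n). Returns iterations used."""
    x = list(x0)
    for n in range(max_iter):
        if norm(x) < tol:
            return n
        g = grad(x, eigs)
        x = [xi - alpha * gi for xi, gi in zip(x, g)]
    return max_iter

def two_step(x0, eigs, alpha, beta, tol=1e-8, max_iter=100_000):
    """Two-step method: x_{n+1} = x_n - alpha * grad f(x_n) + beta * (x_n - x_{n-1})."""
    x_prev = list(x0)  # assumption: start with x_{-1} = x_0
    x = list(x0)
    for n in range(max_iter):
        if norm(x) < tol:
            return n
        g = grad(x, eigs)
        x_next = [xi - alpha * gi + beta * (xi - xpi)
                  for xi, gi, xpi in zip(x, g, x_prev)]
        x_prev, x = x, x_next
    return max_iter

# Assumed ill-conditioned problem: eigenvalues between mu and L, condition number 1000.
mu, L = 1.0, 1000.0
eigs = [mu, 10.0, 100.0, L]
x0 = [1.0, 1.0, 1.0, 1.0]

# Standard parameter choices for a quadratic with spectrum in [mu, L].
alpha_gd = 2.0 / (L + mu)
sqrt_L, sqrt_mu = math.sqrt(L), math.sqrt(mu)
alpha_hb = 4.0 / (sqrt_L + sqrt_mu) ** 2
beta_hb = ((sqrt_L - sqrt_mu) / (sqrt_L + sqrt_mu)) ** 2

iters_one = one_step(x0, eigs, alpha_gd)
iters_two = two_step(x0, eigs, alpha_hb, beta_hb)
print("one-step iterations:", iters_one)
print("two-step iterations:", iters_two)
```

With condition number $\kappa = L/\mu$, the one-step iteration contracts the error by a factor of $(\kappa - 1)/(\kappa + 1)$ per step on such a quadratic, while the two-step iteration with these parameters contracts at an asymptotic rate of $(\sqrt{\kappa} - 1)/(\sqrt{\kappa} + 1)$. The two-step method should therefore need far fewer iterations on this example, at the cost of storing one extra iterate.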

Citations

•Posted Content

GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium

Martin Heusel, +5 more

- 26 Jun 2017

- arXiv: Learning

TL;DR: In this article, a two time-scale update rule (TTUR) was proposed for training GANs with stochastic gradient descent on arbitrary GAN loss functions, which has an individual learning rate for both the discriminator and the generator.

9.2K

•Posted Content

Character-level Convolutional Networks for Text Classification

Xiang Zhang, +2 more

- 04 Sep 2015

- arXiv: Learning

TL;DR: This article constructed several large-scale datasets to show that character-level convolutional networks could achieve state-of-the-art or competitive results in text classification.

5.4K

•Proceedings Article

On the importance of initialization and momentum in deep learning

Ilya Sutskever, +3 more

- 16 Jun 2013

TL;DR: It is shown that when stochastic gradient descent with momentum uses a well-designed random initialization and a particular type of slowly increasing schedule for the momentum parameter, it can train both DNNs and RNNs to levels of performance that were previously achievable only with Hessian-Free optimization.

5K

•Journal Article•10.1137/16M1080173

Optimization Methods for Large-Scale Machine Learning

Léon Bottou, +2 more

- 08 May 2018

- SIAM Review

TL;DR: The authors provide a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications, and discuss how optimization problems arise in machine learning and what makes them challenging.

3.7K

•Proceedings Article•10.1109/CVPR.2018.00957

Boosting Adversarial Attacks with Momentum

Yinpeng Dong, +6 more

- 18 Jun 2018

TL;DR: This paper proposes a broad class of momentum-based iterative algorithms to boost adversarial attacks. Integrating a momentum term into the iterative attack process stabilizes update directions and helps escape poor local maxima during the iterations, resulting in more transferable adversarial examples.

3.4K

References

•Book

Stability theory of differential equations

Richard Bellman

- 01 Jan 1953

1.2K

•Journal Article•10.1090/S0025-5718-1950-0046149-3

Convergence rates of iterative treatments of partial differential equations

Stanley P. Frankel

- 01 May 1950

- Mathematics of Computation

TL;DR: Renwick and Wilkes as mentioned in this paper present a sequence of initial orders, known as program orders, permanently wired onto a set of uniselectors (rotary telephone switches), which can be transferred to the store by pressing a button.

274

•Book Chapter•10.1007/978-94-015-8308-4_30

Numerical Methods in Linear Algebra

Jitka Segethová, +1 more

- 01 Jan 1994

TL;DR: This chapter studies the numerical solution of linear systems with real coefficients and real right-hand sides, and introduces the concepts of a system of m linear equations in n unknowns, the matrix of the system, the right-hand side vector, the solution vector, and the augmented matrix.

43