Journal Article · DOI: 10.1016/0041-5553(64)90137-5
Some methods of speeding up the convergence of iteration methods
3.1K
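TL;DR: This article considers multistep iteration methods for solving the functional equation P(x) = 0 that converge substantially faster than the corresponding one-step methods. The methods also apply to minimizing a differentiable functional f(x) in Hilbert space, so long as this problem reduces to the solution of the equation grad f(x) = 0.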
Abstract: Iteration methods are generally used to solve the functional equation P(x) = 0 (1), where P is an operator, usually linear, from B into B, and B is a Banach space. These methods construct a sequence x_0, …, x_n, … which converges to the solution (see, for example, [1]). Continuous analogues of these methods are also known, in which a trajectory x(t), 0 ⩽ t < ∞, is constructed which satisfies an ordinary differential equation in B and is such that x(t) approaches the solution of (1) as t → ∞ (see [2]). We shall call a method a k-step method if the construction of each successive iterate x_{n+1} uses the k previous iterates x_n, …, x_{n−k+1}. The same term will also be used for continuous methods if x(t) satisfies a differential equation of the k-th order or k-th degree. The most widely used iteration methods are one-step methods (e.g. the method of successive approximations). They are generally simple computationally but often converge very slowly, as confirmed both by estimates of the rate of convergence and by practical computation (for more details see below). The rate of convergence is therefore of central importance. Some multistep methods, considered further below, are only slightly more complicated than the corresponding one-step methods yet make it possible to speed up the convergence substantially. Note that all the methods mentioned below are also applicable to the problem of minimizing a differentiable functional f(x) in Hilbert space, so long as this problem reduces to the solution of the equation grad f(x) = 0.
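To make the one-step versus multistep distinction concrete, the sketch below compares a one-step gradient iteration with a two-step iteration on a small quadratic minimization problem. The abstract does not spell out any formulas, so everything here is an illustrative assumption: the two-step update is the form commonly known as the heavy-ball (momentum) method, the test functional f(x) = ½ Σ λ_i x_i² and its eigenvalues are hypothetical, and the step sizes are the standard tuned choices for a quadratic with known extreme eigenvalues.

```python
import math

# Hypothetical test problem: f(x) = 0.5 * sum(lam_i * x_i**2), so grad f(x) = (lam_i * x_i).
# The minimizer is x = 0, which also solves grad f(x) = 0.
EIGS = [1.0, 100.0]      # assumed eigenvalues (smallest l = 1, largest L = 100)
X0 = [1.0, 1.0]          # assumed starting point
TOL = 1e-8
MAX_ITER = 10000


def grad(x):
    return [lam * xi for lam, xi in zip(EIGS, x)]


def norm(x):
    return math.sqrt(sum(xi * xi for xi in x))


def one_step(alpha):
    """One-step method: x_{n+1} = x_n - alpha * grad f(x_n)."""
    x = list(X0)
    for n in range(MAX_ITER):
        if norm(x) < TOL:
            return n
        g = grad(x)
        x = [xi - alpha * gi for xi, gi in zip(x, g)]
    return MAX_ITER


def two_step(alpha, beta):
    """Two-step method: x_{n+1} = x_n - alpha * grad f(x_n) + beta * (x_n - x_{n-1})."""
    x_prev = list(X0)    # take x_{-1} = x_0, so the first step is a plain gradient step
    x = list(X0)
    for n in range(MAX_ITER):
        if norm(x) < TOL:
            return n
        g = grad(x)
        x_next = [xi - alpha * gi + beta * (xi - xp)
                  for xi, gi, xp in zip(x, g, x_prev)]
        x_prev, x = x, x_next
    return MAX_ITER


l, L = min(EIGS), max(EIGS)
# Standard tuned parameters for a quadratic with spectrum in [l, L] (assumed known here).
gd_iters = one_step(alpha=2.0 / (l + L))
hb_iters = two_step(alpha=4.0 / (math.sqrt(L) + math.sqrt(l)) ** 2,
                    beta=((math.sqrt(L) - math.sqrt(l)) / (math.sqrt(L) + math.sqrt(l))) ** 2)

print("one-step iterations:", gd_iters)
print("two-step iterations:", hb_iters)
```

With these parameters the one-step error contracts by a factor of about (L − l)/(L + l) per iteration, while the two-step error contracts by roughly (√L − √l)/(√L + √l), so the two-step run needs far fewer iterations at the cost of storing one extra iterate.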
Citations
•Posted Content
GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium
Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, Günter Klambauer, Sepp Hochreiter
TL;DR: In this article, a two time-scale update rule (TTUR) was proposed for training GANs with stochastic gradient descent on arbitrary GAN loss functions, which has an individual learning rate for both the discriminator and the generator.
9.2K
•Posted Content
Character-level Convolutional Networks for Text Classification
TL;DR: This article constructed several large-scale datasets to show that character-level convolutional networks could achieve state-of-the-art or competitive results in text classification.
5.4K
•Proceedings Article
On the importance of initialization and momentum in deep learning
Ilya Sutskever, James Martens, George E. Dahl, Geoffrey E. Hinton
- 16 Jun 2013
TL;DR: It is shown that when stochastic gradient descent with momentum uses a well-designed random initialization and a particular type of slowly increasing schedule for the momentum parameter, it can train both DNNs and RNNs to levels of performance that were previously achievable only with Hessian-Free optimization.
5K
Optimization Methods for Large-Scale Machine Learning
TL;DR: The authors provide a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications, and discuss how optimization problems arise in machine learning and what makes them challenging.
3.7K
Boosting Adversarial Attacks with Momentum
Yinpeng Dong, Fangzhou Liao, Tianyu Pang, Hang Su, Jun Zhu, Xiaolin Hu, Jianguo Li
- 18 Jun 2018
TL;DR: This paper proposes a broad class of momentum-based iterative algorithms to boost adversarial attacks. Integrating a momentum term into the iterative attack process stabilizes update directions and helps escape poor local maxima during the iterations, resulting in more transferable adversarial examples.
References
Convergence rates of iterative treatments of partial differential equations
TL;DR: Renwick and Wilkes as mentioned in this paper present a sequence of initial orders, known as program orders, permanently wired onto a set of uniselectors (rotary telephone switches), which can be transferred to the store by pressing a button.
Numerical Methods in Linear Algebra
Jitka Segethová, Karel Segeth
- 01 Jan 1994
TL;DR: This paper studies the numerical solution of linear systems with real coefficients and real right-hand sides, and introduces the concepts of a system of m linear equations in n unknowns, the matrix of the system, the right-hand side vector, the solution vector, and the augmented matrix.
43
Related Papers (5)
Diederik P. Kingma, Jimmy Ba
- 01 Jan 2015
Herbert Robbins, Sutton Monro
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
- 27 Jun 2016