First-Order Objective-Function-Free Optimization Algorithms and Their Complexity

- 03 Mar 2022

8

TL;DR: Limited numerical experiments show that the new methods’ performance may be comparable to that of standard steepest descent, despite using significantly less information, and that this performance is relatively insensitive to noise.
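To make the idea of an objective-function-free first-order method concrete, here is a minimal sketch that uses only gradient values and never evaluates the objective. It assumes an Adagrad-style per-coordinate scaling and a small illustrative quadratic; the function names, the test problem, and the parameters `eta` and `eps` are hypothetical choices for illustration, not the algorithm or settings from the paper.

```python
import math

def grad(x):
    # Gradient of the illustrative quadratic f(x) = 0.5 * (x0^2 + 10 * x1^2).
    # The objective f itself is never evaluated anywhere in this sketch.
    return [x[0], 10.0 * x[1]]

def offo_adagrad(x0, steps=500, eta=1.0, eps=1e-8):
    """Gradient-only descent with Adagrad-style per-coordinate scaling (assumed variant)."""
    x = list(x0)
    acc = [eps] * len(x)  # running sums of squared gradient components
    for _ in range(steps):
        g = grad(x)
        for i in range(len(x)):
            acc[i] += g[i] ** 2
            # Step length is set by the gradient history alone, with no
            # line search or acceptance test on function values.
            x[i] -= eta * g[i] / math.sqrt(acc[i])
    return x

x_final = offo_adagrad([3.0, -2.0])
grad_norm = math.sqrt(sum(gi * gi for gi in grad(x_final)))
```

Because the step is determined entirely by accumulated gradient information, no function value is needed to accept or reject an iterate, which is the sense in which such methods use less information than standard steepest descent with a line search.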

Citations

•Journal Article•10.1007/s10589-022-00435-2

OFFO minimization algorithms for second-order optimality and their complexity

Serge Gratton, +1 more

- 07 Mar 2022

- Computational Optimization and Applicati...

TL;DR: This paper presents an Adagrad-inspired class of algorithms for smooth unconstrained optimization in which the objective function is never evaluated, and yet the gradient norms decrease at least as fast as $\mathcal{O}(1/\sqrt{k+1})$ while second-order optimality measures converge to zero at least as fast as $\mathcal{O}(1/(k+1)^{1/3})$.

5

Convergence properties of an Objective-Function-Free Optimization regularization algorithm, including an $\mathcal{O}(\epsilon^{-3/2})$ complexity bound

Serge Gratton, +2 more

- 18 Mar 2022

TL;DR: It is shown that excellent complexity bounds for adaptive regularization methods are also valid for the new algorithm, despite the fact that significantly less information is used.

3

Complexity of a Class of First-Order Objective-Function-Free Optimization Algorithms

Serge Gratton, +2 more

- 03 Mar 2022

TL;DR: This article considers a parametric class of trust-region algorithms for unconstrained nonconvex optimization in which the value of the objective function is never computed. The rate of convergence of methods in the class is analyzed and shown to be identical to that known for first-order optimization methods using both function and gradient values.

2

Iteration Complexity of Fixed-Step-Momentum Methods for Convex Quadratic Functions

Melinda Hagedorn

TL;DR: This article derives an explicit bound, up to a constant factor, on the number of iterations needed to guarantee a reduction of the Euclidean distance to the optimal solution by a given factor; the bound complements earlier asymptotically optimal results.

1

•Journal Article•10.1007/s10957-023-02261-w

Iteration Complexity of Fixed-Step Methods by Nesterov and Polyak for Convex Quadratic Functions

Melinda Hagedorn, +1 more

- 18 Nov 2022

- Journal of Optimization Theory and Appli...

TL;DR: This paper derives an explicit bound on the number of iterations needed to guarantee a reduction of the Euclidean distance to the optimal solution by a given factor, which complements earlier asymptotically optimal results for the momentum method and Nesterov's accelerated gradient method.

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

138.5K
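
As a rough illustration of what "adaptive estimates of lower-order moments" means in practice, the sketch below applies one Adam update in plain Python. The defaults for `beta1`, `beta2`, and `eps` follow the values suggested in the Adam paper; the step size `lr=0.1`, the helper name `adam_step`, and the sample gradient are illustrative assumptions.

```python
import math

def adam_step(x, g, m, v, t, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update at iteration t (t starts at 1); returns new (x, m, v)."""
    new_x, new_m, new_v = [], [], []
    for xi, gi, mi, vi in zip(x, g, m, v):
        mi = beta1 * mi + (1.0 - beta1) * gi        # first-moment estimate
        vi = beta2 * vi + (1.0 - beta2) * gi * gi   # second-moment estimate
        m_hat = mi / (1.0 - beta1 ** t)             # bias correction
        v_hat = vi / (1.0 - beta2 ** t)
        new_x.append(xi - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(mi)
        new_v.append(vi)
    return new_x, new_m, new_v

# One step from an illustrative point with an illustrative gradient.
x, m, v = [3.0, -2.0], [0.0, 0.0], [0.0, 0.0]
g = [3.0, -20.0]
x, m, v = adam_step(x, g, m, v, t=1)
```

On the very first step the bias-corrected ratio reduces to roughly the sign of each gradient component, so every coordinate moves by about `lr` regardless of the gradient's scale. Like the methods in the main paper, the update uses gradients only and never evaluates the objective.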

•Journal Article•10.1007/BF01589116

On the limited memory BFGS method for large scale optimization

Dong C. Liu, +1 more

- 01 Dec 1989

- Mathematical Programming

TL;DR: The numerical tests indicate that the L-BFGS method is faster than the method of Buckley and LeNir, and is better able to use additional storage to accelerate convergence, and the convergence properties are studied to prove global convergence on uniformly convex problems.

8.8K

•Proceedings Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

John C. Duchi, +2 more

- 01 Jan 2010

TL;DR: Adaptive subgradient methods dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning, which makes it possible to find needles in haystacks in the form of very predictive but rarely seen features.

8.7K

•Journal Article•10.1093/IMANUM/8.1.141

Two-Point Step Size Gradient Methods

Jonathan Barzilai, +1 more

- 01 Jan 1988

- IMA Journal of Numerical Analysis

TL;DR: A study of new gradient descent methods for the approximate solution of the unconstrained minimization problem.

3K
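
The "two-point" step size in the title can be sketched as follows: the step length is computed from the last two iterates and gradients as $\alpha = s^\top s / s^\top y$, with $s$ the change in the iterate and $y$ the change in the gradient. The quadratic test problem, the initial step `alpha0`, the stopping tolerance, and the function names below are illustrative assumptions, and only one of the two-point formulas is shown.

```python
import math

def grad(x):
    # Gradient of the illustrative quadratic f(x) = 0.5 * (x0^2 + 10 * x1^2).
    return [x[0], 10.0 * x[1]]

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def bb_descent(x0, steps=100, alpha0=0.05, tol_sq=1e-20):
    """Gradient descent with a two-point (Barzilai-Borwein style) step size."""
    x = list(x0)
    g = grad(x)
    alpha = alpha0  # first step has no history, so use a fixed guess
    for _ in range(steps):
        if dot(g, g) < tol_sq:  # stop once the squared gradient norm is tiny
            break
        x_new = [xi - alpha * gi for xi, gi in zip(x, g)]
        g_new = grad(x_new)
        s = [a - b for a, b in zip(x_new, x)]   # change in iterate
        y = [a - b for a, b in zip(g_new, g)]   # change in gradient
        sy = dot(s, y)
        if sy > 0.0:  # keep the previous step if curvature info is unusable
            alpha = dot(s, s) / sy
        x, g = x_new, g_new
    return x

x_bb = bb_descent([3.0, -2.0])
bb_grad_norm = math.sqrt(dot(grad(x_bb), grad(x_bb)))
```

As with the objective-function-free methods in the main paper, this iteration needs no function values and no line search, although the gradient norm is not guaranteed to decrease monotonically from one step to the next.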

•Book

First-Order Methods in Optimization

Amir Beck

- 02 Oct 2017

1.8K
