Accelerated Stochastic Optimization Methods under Quasar-convexity

Accelerated Stochastic Optimization Methods under Quasar-convexity

- 08 May 2023

1

TL;DR: In this paper , a stochastic algorithm for minimizing quasar-convex functions is proposed. But it is not a deterministic algorithm, and it has high complexity and slow convergence.

Abstract: Non-convex optimization plays a key role in a growing number of machine learning applications. This motivates the identification of specialized structure that enables sharper theoretical analysis. One such identified structure is quasar-convexity, a non-convex generalization of convexity that subsumes convex functions. Existing algorithms for minimizing quasar-convex functions in the stochastic setting have either high complexity or slow convergence, which prompts us to derive a new class of stochastic methods for optimizing smooth quasar-convex functions. We demonstrate that our algorithms have fast convergence and outperform existing algorithms on several examples, including the classical problem of learning linear dynamical systems. We also present a unified analysis of our newly proposed algorithms and a previously studied deterministic algorithm.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2306.02601

Aiming towards the minimizers: fast convergence of SGD for overparametrized problems

Chaoyue Liu, +4 more

- 05 Jun 2023

- arXiv.org

TL;DR: In this article , the authors propose a regularity condition within the interpolation regime which endows the stochastic gradient method with the same worst-case iteration complexity as the deterministic gradient method, while using only a single sampled gradient (or a minibatch) in each iteration.

...read moreread less

6

References

•Journal Article

A method for solving the convex programming problem with convergence rate O(1/k^2)

Yurii Nesterov

- 01 Jan 1983

- Proceedings of the USSR Academy of Scien...

4.1K

Journal Article•10.1137/070704277

Robust Stochastic Approximation Approach to Stochastic Programming

Arkadi Nemirovski, +3 more

- 01 Dec 2008

- Siam Journal on Optimization

TL;DR: It is intended to demonstrate that a properly modified SA approach can be competitive and even significantly outperform the SAA method for a certain class of convex stochastic problems.

...read moreread less

2.9K

•Proceedings Article

Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent

Benjamin Recht, +3 more

- 12 Dec 2011

TL;DR: In this paper, the authors present an update scheme called HOGWILD!, which allows processors access to shared memory with the possibility of overwriting each other's work, which achieves a nearly optimal rate of convergence.

...read moreread less

2.3K

Journal Article•10.1007/S10107-006-0706-8

Cubic regularization of Newton method and its global performance

Yurii Nesterov, +1 more

- 01 Aug 2006

- Mathematical Programming

TL;DR: This paper provides theoretical analysis for a cubic regularization of Newton method as applied to unconstrained minimization problem and proves general local convergence results for this scheme.

...read moreread less

1.2K

•Journal Article•10.1007/S10107-015-0871-8

Accelerated gradient methods for nonconvex nonlinear and stochastic programming

Saeed Ghadimi, +1 more

- 01 Mar 2016

- Mathematical Programming

TL;DR: The AG method is generalized to solve nonconvex and possibly stochastic optimization problems and it is demonstrated that by properly specifying the stepsize policy, the AG method exhibits the best known rate of convergence for solving general non Convex smooth optimization problems by using first-order information, similarly to the gradient descent method.

...read moreread less

760

...

Expand