Accelerated Stochastic Optimization Methods under Quasar-convexity
Qiang Fu,Ashia C. Wilson +1 more
- 08 May 2023
1
TL;DR: In this paper , a stochastic algorithm for minimizing quasar-convex functions is proposed. But it is not a deterministic algorithm, and it has high complexity and slow convergence.
read more
Abstract: Non-convex optimization plays a key role in a growing number of machine learning applications. This motivates the identification of specialized structure that enables sharper theoretical analysis. One such identified structure is quasar-convexity, a non-convex generalization of convexity that subsumes convex functions. Existing algorithms for minimizing quasar-convex functions in the stochastic setting have either high complexity or slow convergence, which prompts us to derive a new class of stochastic methods for optimizing smooth quasar-convex functions. We demonstrate that our algorithms have fast convergence and outperform existing algorithms on several examples, including the classical problem of learning linear dynamical systems. We also present a unified analysis of our newly proposed algorithms and a previously studied deterministic algorithm.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Aiming towards the minimizers: fast convergence of SGD for overparametrized problems
TL;DR: In this article , the authors propose a regularity condition within the interpolation regime which endows the stochastic gradient method with the same worst-case iteration complexity as the deterministic gradient method, while using only a single sampled gradient (or a minibatch) in each iteration.
References
Robust Stochastic Approximation Approach to Stochastic Programming
TL;DR: It is intended to demonstrate that a properly modified SA approach can be competitive and even significantly outperform the SAA method for a certain class of convex stochastic problems.
2.9K
•Proceedings Article
Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Benjamin Recht,Christopher Ré,Stephen J. Wright,Feng Niu +3 more
- 12 Dec 2011
TL;DR: In this paper, the authors present an update scheme called HOGWILD!, which allows processors access to shared memory with the possibility of overwriting each other's work, which achieves a nearly optimal rate of convergence.
Cubic regularization of Newton method and its global performance
Yurii Nesterov,Boris T. Polyak +1 more
TL;DR: This paper provides theoretical analysis for a cubic regularization of Newton method as applied to unconstrained minimization problem and proves general local convergence results for this scheme.
1.2K
Accelerated gradient methods for nonconvex nonlinear and stochastic programming
Saeed Ghadimi,Guanghui Lan +1 more
TL;DR: The AG method is generalized to solve nonconvex and possibly stochastic optimization problems and it is demonstrated that by properly specifying the stepsize policy, the AG method exhibits the best known rate of convergence for solving general non Convex smooth optimization problems by using first-order information, similarly to the gradient descent method.