Stochastic Recursive Gradient Algorithm for Nonconvex Optimization

doi:10.48550/arxiv.1705.07261

10.48550/arxiv.1705.07261

Stochastic Recursive Gradient Algorithm for Nonconvex Optimization

Lam M. Nguyen, +3 more

16

TL;DR: This paper analyzes the mini-batch SARAH algorithm for nonconvex optimization, providing sublinear and linear convergence rates for general and gradient-dominated functions, respectively, outperforming other stochastic gradient algorithms for nonconvex losses.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1007/978-3-030-58292-0_30904

Convex

70

Proceedings Article•10.1109/WCICA.2018.8630692

Anomaly Detection in Manufacturing Systems Using Structured Neural Networks

Jie Liu, +6 more

- 04 Jul 2018

TL;DR: The proposed structured neural networks outperform the unstructured neural networks in terms of anomaly detection accuracy and can reduce test error by 20% and reduce anomaly detection misclassification error by as much as 64%.

...read moreread less

55

•Proceedings Article•10.24963/IJCAI.2019/354

Zeroth-Order Stochastic Alternating Direction Method of Multipliers for Nonconvex Nonsmooth Optimization

Feihu Huang, +4 more

- 01 Aug 2019

TL;DR: A class of fast zeroth-order stochastic ADMM methods for solving nonconvex problems with multiple nonsmooth penalties, based on the coordinate smoothing gradient estimator, which not only reach the best convergence rate for the non Convex optimization, but also are able to effectively solve many complex machine learning problems withmultiple regularized penalties and constraints.

...read moreread less

29

10.48550/arxiv.1910.06378

SCAFFOLD Stochastic Controlled Averaging for Federated Learning

Sai Praneeth Karimireddy, +5 more

TL;DR: This paper proposes SCAFFOLD, a federated learning algorithm that corrects for "client-drift" using control variates, achieving faster convergence and requiring fewer communication rounds, especially in heterogeneous data settings.

...read moreread less

•Journal Article•10.1109/TPAMI.2019.2933841

Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds

Pan Zhou, +3 more

- 01 Feb 2021

- IEEE Transactions on Pattern Analysis an...

TL;DR: In this article, the Riemannian Stochastic path integrated differential estimator (R-SPIDER) was proposed to solve the finite-sum and online RiemANNian non-convex minimization problems.

...read moreread less

...

Expand

References

Journal Article•10.1007/S10107-006-0706-8

Cubic regularization of Newton method and its global performance

Yurii Nesterov, +1 more

- 01 Aug 2006

- Mathematical Programming

TL;DR: This paper provides theoretical analysis for a cubic regularization of Newton method as applied to unconstrained minimization problem and proves general local convergence results for this scheme.

...read moreread less

1.2K

•Journal Article•10.1137/120880811

Stochastic First- and Zeroth-Order Methods for Nonconvex Stochastic Programming

Saeed Ghadimi, +1 more

- 03 Dec 2013

- Siam Journal on Optimization

TL;DR: The randomized stochastic gradient (RSG) algorithm as mentioned in this paper is a type of approximation algorithm for non-convex nonlinear programming problems, and it has a nearly optimal rate of convergence if the problem is convex.

...read moreread less

1.1K

10.48550/arxiv.1407.2710

Finito: A Faster, Permutable Incremental Gradient Method for Big Data Problems.

Aaron J. Defazio, +2 more

TL;DR: Researchers introduce Finito, a faster incremental gradient method for big data problems, achieving four times faster convergence rate than existing methods, with further speed-ups through sampling without replacement, and demonstrating state-of-the-art performance in empirical results.

...read moreread less

10.1007/978-3-319-91578-4

Introductory Lectures on Convex Optimization: A Basic Course

Yurii Nesterov

TL;DR: This book offers a modern, comprehensive introduction to convex optimization, a crucial field in applied mathematics, economics, finance, engineering, and computer science, with significant applications in data science and machine learning.

...read moreread less

•Journal Article•10.1007/S10107-014-0839-0

Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization

Shai Shalev-Shwartz, +1 more

- 21 Jun 2014

TL;DR: The runtime of the framework is analyzed and rates that improve state-of-the-art results for various key machine learning optimization problems including SVM, logistic regression, ridge regression, Lasso, and multiclass SVM are obtained.

...read moreread less