Gradient Descent with Random Initialization: Fast Global Convergence for Nonconvex Phase Retrieval
TL;DR: In this article, the authors investigated the efficiency of gradient descent designed for the nonconvex least squares problem and showed that under Gaussian designs, gradient descent yields an accurate solution in O(log n+log(1/ε) ) iterations given nearly minimal samples, thus achieving near optimal computational and sample complexities at once.
read more
Abstract: This paper considers the problem of solving systems of quadratic equations, namely, recovering an object of interest $\mathbf{x}^{\natural}\in\mathbb{R}^{n}$ from $m$ quadratic equations/samples $y_{i}=(\mathbf{a}_{i}^{\top}\mathbf{x}^{\natural})^{2}$, $1\leq i\leq m$. This problem, also dubbed as phase retrieval, spans multiple domains including physical sciences and machine learning.
We investigate the efficiency of gradient descent (or Wirtinger flow) designed for the nonconvex least squares problem. We prove that under Gaussian designs, gradient descent --- when randomly initialized --- yields an $\epsilon$-accurate solution in $O\big(\log n+\log(1/\epsilon)\big)$ iterations given nearly minimal samples, thus achieving near-optimal computational and sample complexities at once. This provides the first global convergence guarantee concerning vanilla gradient descent for phase retrieval, without the need of (i) carefully-designed initialization, (ii) sample splitting, or (iii) sophisticated saddle-point escaping schemes. All of these are achieved by exploiting the statistical models in analyzing optimization algorithms, via a leave-one-out approach that enables the decoupling of certain statistical dependency between the gradient descent iterates and the data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Quantum Generative Adversarial Networks for Learning and Loading Random Distributions
TL;DR: In this article, a hybrid quantum-classical algorithm for efficient, approximate quantum state loading is presented, which uses quantum Generative Adversarial Networks (qGANs) to facilitate efficient learning and loading of generic probability distributions, implicitly given by data samples.
A selective overview of deep learning.
TL;DR: This work introduces common neural network models and training techniques from a statistical point of view and highlights new characteristics of deep learning (including depth and over-parametrization) and explains their practical and theoretical benefits.
137
Spectral method and regularized mle are both optimal for top-k ranking.
TL;DR: In this article, it was shown that the spectral method alone or the regularized maximum likelihood estimator alone is minimax optimal in terms of the sample complexity, i.e., the number of paired comparisons needed to ensure exact top-k$ identification, for the fixed dynamic range regime.
Consensus-Based Optimization on Hypersurfaces: Well-Posedness and Mean-Field Limit
TL;DR: In this paper, the authors introduced a new stochastic differential model for global optimization of nonconvex functions on compact hypersurfaces, inspired by the Stochastic Kuramoto-Vicsek system and belongs to the class of Consensus-Based Optimization methods.
26
Sparse Signal Recovery from Phaseless Measurements via Hard Thresholding Pursuit
TL;DR: Inspired by the hard thresholding pursuit (HTP) algorithm in compressed sensing, this paper proposes an efficient second-order algorithm for sparse phase retrieval that is theoretically guaranteed to give an exact sparse signal recovery in finite steps.
14
References
Introduction to the non-asymptotic analysis of random matrices.
Roman Vershynin
- 01 May 2012
TL;DR: This is a tutorial on some basic non-asymptotic methods and concepts in random matrix theory, particularly for the problem of estimating covariance matrices in statistics and for validating probabilistic constructions of measurementMatrices in compressed sensing.
2.9K
PhaseLift: Exact and Stable Signal Recovery from Magnitude Measurements via Convex Programming
TL;DR: It is shown that in some instances, the combinatorial phase retrieval problem can be solved by convex programming techniques, and it is proved that the methodology is robust vis‐à‐vis additive noise.
Phase Retrieval via Wirtinger Flow: Theory and Algorithms
TL;DR: This paper develops a nonconvex formulation of the phase retrieval problem as well as a concrete solution algorithm that is shown to rigorously allow the exact retrieval of phase information from a nearly minimal number of random measurements.
1.4K
Cubic regularization of Newton method and its global performance
Yurii Nesterov,Boris T. Polyak +1 more
TL;DR: This paper provides theoretical analysis for a cubic regularization of Newton method as applied to unconstrained minimization problem and proves general local convergence results for this scheme.
1.2K
Matrix Completion From a Few Entries
TL;DR: OptimSpace as mentioned in this paper reconstructs an n? × n matrix from a uniformly random subset of its entries with probability larger than 1 - 1/n3, which is a generalization of the result of Friedman-Kahn-Szemeredi and Feige-Ofek.
1.1K