Accelerated Log-Regularized Convolutional Transform Learning and its Convergence Guarantee.

doi:10.1109/TCYB.2021.3067352

Journal Article10.1109/TCYB.2021.3067352

Accelerated Log-Regularized Convolutional Transform Learning and its Convergence Guarantee.

Zhenni Li, +4 more

- 19 Apr 2021

- IEEE Transactions on Systems, Man, and C...

- pp 1-15

19

TL;DR: Wang et al. as discussed by the authors presented a new CTL framework with a log regularizer, which can not only obtain accurate representations but also yield strong sparsity, and provided a rigorous convergence analysis for the proposed algorithm under the accelerated PDCA.

Abstract: Convolutional transform learning (CTL), learning filters by minimizing the data fidelity loss function in an unsupervised way, is becoming very pervasive, resulting from keeping the best of both worlds: the benefit of unsupervised learning and the success of the convolutional neural network. There have been growing interests in developing efficient CTL algorithms. However, developing a convergent and accelerated CTL algorithm with accurate representations simultaneously with proper sparsity is an open problem. This article presents a new CTL framework with a log regularizer that can not only obtain accurate representations but also yield strong sparsity. To efficiently address our nonconvex composite optimization, we propose to employ the proximal difference of the convex algorithm (PDCA) which relies on decomposing the nonconvex regularizer into the difference of two convex parts and then optimizes the convex subproblems. Furthermore, we introduce the extrapolation technology to accelerate the algorithm, leading to a fast and efficient CTL algorithm. In particular, we provide a rigorous convergence analysis for the proposed algorithm under the accelerated PDCA. The experimental results demonstrate that the proposed algorithm can converge more stably to desirable solutions with lower approximation error and simultaneously with stronger sparsity and, thus, learn filters efficiently. Meanwhile, the convergence speed is faster than the existing CTL algorithms.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/J.KNOSYS.2021.107244

Semi-supervised multi-view learning by using label propagation based non-negative matrix factorization

Naiyao Liang, +5 more

- 27 Sep 2021

- Knowledge Based Systems

TL;DR: Zhang et al. as discussed by the authors proposed a label propagation based non-negative matrix factorization (LPNMF) method to address the problem of sparse labeled data, which can classify a large number of unlabeled data with few labeled data.

...read moreread less

30

Journal Article•10.1109/jsac.2022.3213283

Compact Learning Model for Dynamic Off-Chain Routing in Blockchain-Based IoT

01 Dec 2022

- IEEE Journal on Selected Areas in Commun...

TL;DR: Wang et al. as discussed by the authors proposed a compact deep reinforcement learning (DRL) algorithm to learn the joint dynamic and lightweight routing policy for maximizing long-term transaction efficiency in payment channel network (PCN)-based IoT.

...read moreread less

22

Journal Article•10.1109/jas.2022.105602

Structured Sparse Coding With the Group Log-regularizer for Key Frame Extraction

01 Oct 2022

- IEEE/CAA Journal of Automatica Sinica

TL;DR: Wang et al. as mentioned in this paper proposed a nonconvex group log-regularizer with strong sparsity and a low reconstruction error for key frame extraction, which can reduce the redundancy of continuous frames and concisely express the entire video.

...read moreread less

17

Journal Article•10.48550/arXiv.2303.00565

AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks

Hao Sun, +8 more

- 01 Mar 2023

- arXiv.org

TL;DR: In this paper , the convergence rate of AdaSAM with adaptive learning rate and momentum acceleration is analyzed in the stochastic non-convex setting, and the authors theoretically show that AdaSAM admits a √ O(1/ √ bT ) convergence rate, which achieves linear speedup property with respect to mini-batch size.

...read moreread less

16

Journal Article•10.1016/j.neunet.2023.10.044

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks

Hao Sun, +8 more

- 01 Jan 2024

- Neural Networks

TL;DR: AdaSAM optimizer achieves a O(1/bT) convergence rate in the stochastic non-convex setting, achieving linear speedup property with respect to mini-batch size.

...read moreread less

14

...

Expand

References

•Dissertation

Learning Multiple Layers of Features from Tiny Images

Alex Krizhevsky

- 01 Jan 2009

TL;DR: In this paper, the authors describe how to train a multi-layer generative model of natural images, using a dataset of millions of tiny colour images, described in the next section.

...read moreread less

23.7K

Journal Article•10.1137/080716542

A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems

Amir Beck, +1 more

- 01 Jan 2009

- Siam Journal on Imaging Sciences

TL;DR: A new fast iterative shrinkage-thresholding algorithm (FISTA) which preserves the computational simplicity of ISTA but with a global rate of convergence which is proven to be significantly better, both theoretically and practically.

...read moreread less

14.3K

•Journal Article•10.1002/CPA.20042

An Iterative Thresholding Algorithm for Linear Inverse Problems with a Sparsity Constraint

Ingrid Daubechies, +2 more

- 01 Nov 2004

- Communications on Pure and Applied Mathe...

TL;DR: It is proved that replacing the usual quadratic regularizing penalties by weighted 𝓁p‐penalized penalties on the coefficients of such expansions, with 1 ≤ p ≤ 2, still regularizes the problem.

...read moreread less

5.5K

•Journal Article•10.1109/TPAMI.2017.2723009

Places: A 10 Million Image Database for Scene Recognition

Bolei Zhou, +4 more

- 01 Jun 2018

- IEEE Transactions on Pattern Analysis an...

TL;DR: The Places Database is described, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world, using the state-of-the-art Convolutional Neural Networks as baselines, that significantly outperform the previous approaches.

...read moreread less

4.9K

•Journal Article•10.1007/S10107-011-0484-9

Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods

Hedy Attouch, +2 more

- 01 Feb 2013

- Mathematical Programming

TL;DR: This work proves an abstract convergence result for descent methods satisfying a sufficient-decrease assumption, and allowing a relative error tolerance, that guarantees the convergence of bounded sequences under the assumption that the function f satisfies the Kurdyka–Łojasiewicz inequality.

...read moreread less

1.6K

...

Expand

Accelerated Log-Regularized Convolutional Transform Learning and its Convergence Guarantee.

Chat with Paper

AI Agents for this Paper

Citations

Semi-supervised multi-view learning by using label propagation based non-negative matrix factorization

Compact Learning Model for Dynamic Off-Chain Routing in Blockchain-Based IoT

Structured Sparse Coding With the Group Log-regularizer for Key Frame Extraction

AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks

References

Learning Multiple Layers of Features from Tiny Images

A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems

An Iterative Thresholding Algorithm for Linear Inverse Problems with a Sparsity Constraint

Places: A 10 Million Image Database for Scene Recognition

Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods

Related Papers (5)

Finite-Time Distributed Economic Dispatch Over Network Systems With Coupled Local Costs

Direct l_(2, p)-Norm Learning for Feature Selection.

FSMRank: Feature Selection Algorithm for Learning to Rank

A majorization-minimization algorithm for (multiple) hyperparameter learning

Optimality and convergence for convex ensemble learning with sparsity and diversity based on fixed point optimization