Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
Ke Yu,Chao Dong,Liang Lin,Chen Change Loy +3 more
- 18 Jun 2018
- pp 2443-2452
TL;DR: Zhang et al. as discussed by the authors proposed a toolbox consisting of small-scale convolutional networks of different complexities and specialized in different tasks to select appropriate tools from the toolbox to progressively restore the quality of corrupted image.
read more
Abstract: We investigate a novel approach for image restoration by reinforcement learning. Unlike existing studies that mostly train a single large network for a specialized task, we prepare a toolbox consisting of small-scale convolutional networks of different complexities and specialized in different tasks. Our method, RL-Restore, then learns a policy to select appropriate tools from the toolbox to progressively restore the quality of a corrupted image. We formulate a stepwise reward function proportional to how well the image is restored at each step to learn the action policy. We also devise a joint learning scheme to train the agent and tools for better performance in handling uncertainty. In comparison to conventional human-designed networks, RL-Restore is capable of restoring images corrupted with complex and unknown distortions in a more parameter-efficient manner using the dynamically formed toolchain1.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
DRL360: 360-degree Video Streaming with Deep Reinforcement Learning
Yuanxing Zhang,Pengyu Zhao,Kaigui Bian,Yunxin Liu,Lingyang Song,Xiaoming Li +5 more
- 01 Apr 2019
TL;DR: A Deep Reinforcement Learning (DRL) based framework for 360-degree video streaming, named DRL360, which can adapt to all considered scenarios, and outperform the state-of-the-art approaches by 20%–30% on average given different QoE objectives.
181
Unpaired Image Super-Resolution Using Pseudo-Supervision
Shunta Maeda
- 14 Jun 2020
TL;DR: An unpaired SR method using a generative adversarial network that does not require a paired/aligned training dataset is proposed and experiments show that the proposed method is superior to existing solutions to the unpairedSR problem.
•Posted Content
Residual Dense Network for Image Restoration.
TL;DR: This work proposes residual dense block (RDB) to extract abundant local features via densely connected convolutional layers and proposes local feature fusion in RDB to adaptively learn more effective features from preceding and current local features and stabilize the training of wider network.
146
•Posted Content
EDVR: Video Restoration with Enhanced Deformable Convolutional Networks
TL;DR: Zhang et al. as mentioned in this paper proposed a novel Video Restoration framework with Enhanced Deformable networks, termed EDVR, to address the challenges of aligning multiple frames given large motions and effectively fusing different frames with diverse motion and blur.
134
Deep Learning-Based Video Coding: A Review and A Case Study
TL;DR: Deep Learning Video Coding (DLVC) as discussed by the authors is a deep learning-based video coding framework, which is based on convolutional neural network (CNN) and block adaptive resolution coding (BARC).
124
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
99K
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Image quality assessment: from error visibility to structural similarity
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
Human-level control through deep reinforcement learning
Volodymyr Mnih,Koray Kavukcuoglu,David Silver,Andrei Rusu,Joel Veness,Marc G. Bellemare,Alex Graves,Martin Riedmiller,Andreas K. Fidjeland,Georg Ostrovski,Stig Petersen,Charles Beattie,Amir Sadik,Ioannis Antonoglou,Helen King,Dharshan Kumaran,Daan Wierstra,Shane Legg,Demis Hassabis +18 more
TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015