Wide Residual Networks
Sergey Zagoruyko,Nikos Komodakis +1 more
- 01 Jan 2016
TL;DR: This paper conducts a detailed experimental study on the architecture of ResNet blocks and proposes a novel architecture where the depth and width of residual networks are decreased and the resulting network structures are called wide residual networks (WRNs), which are far superior over their commonly used thin and very deep counterparts.
read more
Abstract: Deep residual networks were shown to be able to scale up to thousands of layers and still have improving performance. However, each fraction of a percent of improved accuracy costs nearly doubling the number of layers, and so training very deep residual networks has a problem of diminishing feature reuse, which makes these networks very slow to train. To tackle these problems, in this paper we conduct a detailed experimental study on the architecture of ResNet blocks, based on which we propose a novel architecture where we decrease depth and increase width of residual networks. We call the resulting network structures wide residual networks (WRNs) and show that these are far superior over their commonly used thin and very deep counterparts. For example, we demonstrate that even a simple 16-layer-deep wide residual network outperforms in accuracy and efficiency all previous deep residual networks, including thousand-layer-deep networks, achieving new state-of-the-art results on CIFAR, SVHN, COCO, and significant improvements on ImageNet. Our code and models are available at this https URL
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition
TL;DR: A multi-scale multi-view feature aggregation (MSMVFA) scheme that can aggregate high-level semantic features, mid-level attribute features, and deep visual features into a unified representation for food recognition and achieves state-of-the-art recognition performance on three popular large-scale food benchmark datasets.
189
Semi-Supervised Learning under Class Distribution Mismatch.
Yanbei Chen,Xiatian Zhu,Wei Li,Shaogang Gong +3 more
- 03 Apr 2020
TL;DR: This work addresses under-studied and realistic SSL problem by a novel algorithm named Uncertainty-Aware Self-Distillation (UASD), which produces soft targets that avoid catastrophic error propagation, and empower learning effectively from unconstrained unlabelled data with out-of-distribution (OOD) samples.
•Posted Content
SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization
TL;DR: This work proposes SaliencyMix, a new state-of-the-art top-1 error-reducing model that carefully selects a representative image patch with the help of a saliency map and mixes this indicative patch with a target image that leads the model to learn more appropriate feature representation.
189
•Posted Content
DO-Conv: Depthwise Over-parameterized Convolutional Layer.
Jinming Cao,Yangyan Li,Mingchao Sun,Ying Chen,Dani Lischinski,Daniel Cohen-Or,Baoquan Chen,Changhe Tu +7 more
TL;DR: This paper shows with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation.
188
•Proceedings Article
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning.
Paola Cascante-Bonilla,Fuwen Tan,Yanjun Qi,Vicente Ordonez +3 more
- 18 May 2021
TL;DR: In this article, a semi-supervised pseudo-labeling method is proposed to apply pseudo-labels to samples in the unlabeled set by using a model trained on combination of the labeled samples and any previously pseudolabeled samples, and iteratively repeating this process in a self-training cycle.
184
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Gao Huang,Zhuang Liu,Laurens van der Maaten,Kilian Q. Weinberger +3 more
- 21 Jul 2017