Open AccessPosted Content
DO-Conv: Depthwise Over-parameterized Convolutional Layer.
Jinming Cao,Yangyan Li,Mingchao Sun,Ying Chen,Dani Lischinski,Daniel Cohen-Or,Baoquan Chen,Changhe Tu +7 more
TL;DR: This paper shows with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation.
read more
Abstract: Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs) In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear operation can be expressed by a single convolution layer We refer to this depthwise over-parameterized convolutional layer as DO-Conv We show with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, reducing the computation to be exactly equivalent to that of a convolutional layer without over-parameterization As DO-Conv introduces performance gains without incurring any computational complexity increase for inference, we advocate it as an alternative to the conventional convolutional layer We open-source a reference implementation of DO-Conv in Tensorflow, PyTorch and GluonCV at this https URL
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
RepVGG: Making VGG-style ConvNets Great Again
Xiaohan Ding,Xiangyu Zhang,Ningning Ma,Jungong Han,Guiguang Ding,Jian Sun +5 more
- 11 Jan 2021
TL;DR: RepVGG as mentioned in this paper decouples the training-time and inference-time architecture by a structural re-parameterization technique and achieves state-of-the-art accuracy on ImageNet.
•Posted Content
RepVGG: Making VGG-style ConvNets Great Again.
TL;DR: A simple but powerful architecture of convolutional neural network, which has a VGG-like inference-time body composed of nothing but a stack of 3 × 3 convolution and ReLU, while the training-time model has a multi-branch topology.
1K
Diverse Branch Block: Building a Convolution as an Inception-like Unit
Xiaohan Ding,Xiangyu Zhang,Jungong Han,Guiguang Ding +3 more
- 01 Jun 2021
TL;DR: Diverse Branch Block (DBB) as mentioned in this paper enhances the representational capacity of a single convolution by combining diverse branches of different scales and complexities to enrich the feature space, including sequences of convolutions, multiscale convolutions and average pooling.
U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
Chenxin Li,Xinyu Liu,Wuyang Li,Cheng Wang,Han-Wen Liu,Yixuan Yuan +5 more
- 05 Jun 2024
TL;DR: U-KAN makes a strong backbone for medical image segmentation and generation by improving the accuracy and interpretability of U-Net through the integration of Kolmogorov-Arnold Networks (KANs).
An Improved Yolov5 for Multi-Rotor UAV Detection
Bailing Liu,Huan Luo +1 more
TL;DR: The Yolov5 backbone is replaced with Efficientlite, thus reducing the number of parameters in the model, and adaptively spatial feature fusion is injected into the head of the baseline model to facilitate the fusion of feature maps with different spatial resolutions.
40
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Posted Content
Deep Residual Learning for Image Recognition
TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
117.9K
•Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
102.6K
ImageNet classification with deep convolutional neural networks
TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014