Aggregated Residual Transformations for Deep Neural Networks

doi:10.1109/CVPR.2017.634

Open AccessProceedings Article10.1109/CVPR.2017.634

Aggregated Residual Transformations for Deep Neural Networks

Saining Xie, +4 more

- 21 Jul 2017

- pp 5987-5995

11K

TL;DR: ResNeXt as discussed by the authors is a simple, highly modularized network architecture for image classification, which is constructed by repeating a building block that aggregates a set of transformations with the same topology.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/TPAMI.2021.3069237

Learning Generalisable Omni-Scale Representations for Person Re-Identification.

Kaiyang Zhou, +3 more

- 26 Mar 2021

- IEEE Transactions on Pattern Analysis an...

TL;DR: Zhang et al. as discussed by the authors proposed omni-scale network (OSNet) to learn features that not only capture different spatial scales but also encapsulate a synergistic combination of multiple scales.

...read moreread less

178

•Book Chapter•10.1007/978-3-030-58580-8_27

Learning Delicate Local Representations for Multi-person Pose Estimation.

Yuanhao Cai, +9 more

- 09 Mar 2020

TL;DR: Wang et al. as discussed by the authors proposed Residual Steps Network (RSN), which aggregates features with the same spatial size (Intra-level features) efficiently to obtain delicate local representations, which retain rich low-level spatial information and result in precise keypoint localization.

...read moreread less

178

Journal Article•10.1109/TGRS.2021.3113912

Asymmetric Siamese Networks for Semantic Change Detection in Aerial Images

Kunping Yang, +6 more

- 01 Oct 2021

- IEEE Transactions on Geoscience and Remo...

TL;DR: An asymmetric Siamese network is presented to locate and identify semantic changes through feature pairs obtained from modules of widely different structures, which involves areas of various sizes and applies different quantities of parameters to factor in the discrepancy across land-cover distributions during different times.

...read moreread less

177

•Proceedings Article•10.1109/ICCV.2019.00971

Human Uncertainty Makes Classification More Robust

Joshua C. Peterson, +3 more

- 01 Oct 2019

TL;DR: In this paper, the authors present a new benchmark dataset, CIFAR10H, containing a full distribution of human labels for each image of the CIFARS10 test set, and show that explicit training on their dataset closes this gap, supports improved generalization to increasingly out-of-training-distribution test datasets, and confers robustness to adversarial attacks.

...read moreread less

176

•Posted Content

Learning the Best Pooling Strategy for Visual Semantic Embedding.

Jiacheng Chen, +4 more

- 09 Nov 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A Generalized Pooling Operator (GPO) is proposed, which learns to automatically adapt itself to the best pooling strategy for different features, requiring no manual tuning while staying effective and efficient and can be a plug-and-play feature aggregation module for standard VSE models.

...read moreread less

176

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

Journal Article•10.1023/B:VISI.0000029664.99615.94

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004

- International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

59.3K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

...

Expand

Aggregated Residual Transformations for Deep Neural Networks

Chat with Paper

AI Agents for this Paper

Citations

Learning Generalisable Omni-Scale Representations for Person Re-Identification.

Learning Delicate Local Representations for Multi-person Pose Estimation.

Asymmetric Siamese Networks for Semantic Change Detection in Aerial Images

Human Uncertainty Makes Classification More Robust

Learning the Best Pooling Strategy for Visual Semantic Embedding.

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Distinctive Image Features from Scale-Invariant Keypoints

Going deeper with convolutions

Related Papers (5)

Deep Residual Learning for Image Recognition

Densely Connected Convolutional Networks

Going deeper with convolutions

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition