Concurrent Spatial and Channel Squeeze & Excitation in Fully Convolutional Networks

Open AccessPosted Content

Concurrent Spatial and Channel Squeeze & Excitation in Fully Convolutional Networks

- 07 Mar 2018

- arXiv: Computer Vision and Pattern Recog...

609

TL;DR: This paper introduces three variants of SE modules for image segmentation, and effectively incorporates these SE modules within three different state-of-the-art F-CNNs (DenseNet, SD-Net, U-Net) and observes consistent improvement of performance across all architectures, while minimally effecting model complexity.

Abstract: Fully convolutional neural networks (F-CNNs) have set the state-of-the-art in image segmentation for a plethora of applications. Architectural innovations within F-CNNs have mainly focused on improving spatial encoding or network connectivity to aid gradient flow. In this paper, we explore an alternate direction of recalibrating the feature maps adaptively, to boost meaningful features, while suppressing weak ones. We draw inspiration from the recently proposed squeeze & excitation (SE) module for channel recalibration of feature maps for image classification. Towards this end, we introduce three variants of SE modules for image segmentation, (i) squeezing spatially and exciting channel-wise (cSE), (ii) squeezing channel-wise and exciting spatially (sSE) and (iii) concurrent spatial and channel squeeze & excitation (scSE). We effectively incorporate these SE modules within three different state-of-the-art F-CNNs (DenseNet, SD-Net, U-Net) and observe consistent improvement of performance across all architectures, while minimally effecting model complexity. Evaluations are performed on two challenging applications: whole brain segmentation on MRI scans (Multi-Atlas Labelling Challenge Dataset) and organ segmentation on whole body contrast enhanced CT scans (Visceral Dataset).

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Fig. 1: Illustration of network architecture with squeeze & excitation (SE) blocks. (a) The proposed integration of SE blocks within F-CNN. (b-d) The architectural design of cSE, sSE and scSE blocks, respectively, for recalibrating feature map U.

Table 1: Mean and standard deviation of the global Dice scores for the different FCNN models without and with cSE, sSE and scSE blocks on both datasets.

Fig. 2: Boxplot of Dice scores for all brain structures on the left hemisphere (due to space constraints), using DenseNets on MALC dataset, without and with proposed cSE, sSE, scSE blocks. Grey and white matter are abbreviated as GM and WM, respectively.

Fig. 4: Input scan, ground truth annotations, DenseNet segmentation and DenseNet+scSE segmentation for both whole-brain MRI T1 (a-d) and whole-body ceCT (e-h) are shown. ROIs are indicated by white box and red arrow highlighting regions where the scSE block improved the segmentation, for both applications.

Fig. 3: Structure-wise Dice performance of DenseNets on Visceral dataset, without and with proposed cSE, sSE, scSE blocks. Left and right are indicated as L. and R. Psoas major muscle is abbreviated as PM.

Citations

Journal Article•10.48550/arxiv.2308.10417

The Change You Want to See (Now in 3D)

Ragav Sachdeva, +1 more

- 21 Aug 2023

- arXiv.org

TL;DR: This work contributes a change detection model that is trained entirely on synthetic data and is class-agnostic, yet it is performant out-of-the-box on real world images without requiring fine-tuning.

...read moreread less

Preprint•10.21203/rs.3.rs-3472743/v1

A Novel Carbon Stocking Estimation Through Continuous Catalog Learning

Dror Haor, +5 more

- 27 Oct 2023

TL;DR: A novel carbon stocking estimation method using continuous learning and high-resolution imagery data improves accuracy and provides valuable insights into diverse forest types.

...read moreread less

Journal Article•10.48550/arxiv.2308.15327

Enhancing Robot Learning through Learned Human-Attention Feature Maps

Daniel Scheuchenstuhl, +4 more

- 29 Aug 2023

- arXiv.org

TL;DR: A novel approach to model and emulate the human attention with an approximate prediction model and feed it as a structured auxiliary feature map into downstream learning tasks to enhance efficiency and robustness of the learning process in robotics.

...read moreread less

Journal Article•10.1016/j.compeleceng.2024.109299

A multi-focus image fusion network combining dilated convolution with learnable spacings and residual dense network

Jidong Fang, +6 more

- 01 Jul 2024

- Computers & Electrical Engineering

•Journal Article•10.1002/mp.15723

Decoupled Pyramid Correlation Network for Liver Tumor Segmentation from CT images

Yao Zhang, +8 more

- 26 May 2022

- Medical Physics

TL;DR: A Decoupled Pyramid Correlation Network (DPC-Net) that exploits attention mechanisms to fully leverage both low- and high-level features embedded in FCN to segment liver tumor and can effectively model the multi-level correlation from both semantic and spatial dimensions is proposed.

...read moreread less

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Book Chapter•10.1007/978-3-319-24574-4_28

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

- 05 Oct 2015

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

92K

•Proceedings Article•10.1109/CVPR.2015.7298965

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

- 07 Jun 2015

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

42.6K

Concurrent Spatial and Channel Squeeze & Excitation in Fully Convolutional Networks

Chat with Paper

AI Agents for this Paper

Figures

Citations

The Change You Want to See (Now in 3D)

A Novel Carbon Stocking Estimation Through Continuous Catalog Learning

Enhancing Robot Learning through Learned Human-Attention Feature Maps

A multi-focus image fusion network combining dilated convolution with learnable spacings and residual dense network

Decoupled Pyramid Correlation Network for Liver Tumor Segmentation from CT images

References

Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition

ImageNet classification with deep convolutional neural networks

U-Net: Convolutional Networks for Biomedical Image Segmentation

Fully convolutional networks for semantic segmentation

Related Papers (5)

U-Net: Convolutional Networks for Biomedical Image Segmentation

Squeeze-and-Excitation Networks

Deep Residual Learning for Image Recognition

Fully convolutional networks for semantic segmentation

Pyramid Scene Parsing Network