DCM: A Dense-Attention Context Module For Semantic Segmentation

doi:10.1109/ICIP40778.2020.9190675

Proceedings Article•10.1109/ICIP40778.2020.9190675

Li Shenghua, +6 more

- 01 Oct 2020

- pp 1431-1435

3

TL;DR: This paper presents the Dense-attention Context Module (DCM), an attention-augmented module that connects common backbones to decoding heads, and reports promising results on the Cityscapes dataset.

Citations

Journal Article•10.1109/ACCESS.2023.3289968

Multi-Encoder Context Aggregation Network for Structured and Unstructured Urban Street Scene Analysis

Tanmay Singha, +2 more

- 01 Jan 2023

- IEEE Access

TL;DR: This paper proposes a multi-encoder Context Aggregation Network (MCANet) for real-time semantic scene segmentation, which offers the best combination of low model complexity and state-of-the-art performance on benchmark datasets.

Journal Article•10.1016/j.compeleceng.2023.108698

Mixture lightweight transformer for scene understanding

Quan Zhou, +4 more

- 01 May 2023

- Computers & Electrical Engineering

TL;DR: This paper proposes a mixture lightweight Transformer backbone for image understanding, in which each Transformer block, called SH-Transformer, adopts Single-Head Self-Attention (SHSA) and a Convolutional Inception Module (CIM).

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: This paper proposes a residual learning framework that eases the training of networks substantially deeper than those used previously; the resulting residual networks won 1st place on the ILSVRC 2015 classification task.

198.7K

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposes the Transformer, a simple network architecture based solely on attention mechanisms that dispenses with recurrence and convolutions entirely, and achieves state-of-the-art single-model performance on English-to-French translation.

94.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: This paper trains a deep convolutional neural network for ImageNet classification, consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax, and reports state-of-the-art results.

88.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: This paper introduces Inception, a deep convolutional neural network architecture that set a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

56.6K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles, by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

51.8K