Open AccessPosted Content
BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation
TL;DR: This work proposes an efficient and effective architecture with a good trade-off between speed and accuracy, termed Bilateral Segmentation Network (BiSeNet V2), which performs favourably against a few state-of-the-art real-time semantic segmentation approaches.
read more
Abstract: The low-level details and high-level semantics are both essential to the semantic segmentation task. However, to speed up the model inference, current approaches almost always sacrifice the low-level details, which leads to a considerable accuracy decrease. We propose to treat these spatial details and categorical semantics separately to achieve high accuracy and high efficiency for realtime semantic segmentation. To this end, we propose an efficient and effective architecture with a good trade-off between speed and accuracy, termed Bilateral Segmentation Network (BiSeNet V2). This architecture involves: (i) a Detail Branch, with wide channels and shallow layers to capture low-level details and generate high-resolution feature representation; (ii) a Semantic Branch, with narrow channels and deep layers to obtain high-level semantic context. The Semantic Branch is lightweight due to reducing the channel capacity and a fast-downsampling strategy. Furthermore, we design a Guided Aggregation Layer to enhance mutual connections and fuse both types of feature representation. Besides, a booster training strategy is designed to improve the segmentation performance without any extra inference cost. Extensive quantitative and qualitative evaluations demonstrate that the proposed architecture performs favourably against a few state-of-the-art real-time semantic segmentation approaches. Specifically, for a 2,048x1,024 input, we achieve 72.6% Mean IoU on the Cityscapes test set with a speed of 156 FPS on one NVIDIA GeForce GTX 1080 Ti card, which is significantly faster than existing methods, yet we achieve better segmentation accuracy.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
GLA-STDeepLab: SAR Enhancing Glacier and Ice Shelf Front Detection Using Swin-TransDeepLab with Global-local Attention
Qiaoying Zhu,Huadong Guo,Lu Zhang,Dong Liang,Zherong Wu,Yiming Liu,Zhuoran Lv +6 more
TL;DR: The findings demonstrate that the proposed novel calving front detection network enables precise monitoring of the dynamic information of glacier and ice shelf fronts, as well as their far-reaching implications for ice sheet mass balance and global climate change.
Attention based lightweight asymmetric network for real-time semantic segmentation
Qian Liu,Cunbao Wang,Zhensheng Li,Jiongtao Fang +3 more
TL;DR: This paper proposes ALANet, an attention-based lightweight asymmetric network for real-time semantic segmentation, achieving 74.4% mIoU on Cityscapes and 69.5% on CamVid with 115.6FPS and 113.2FPS inference speed, respectively.
Real-Time Semantic Segmentation of Remote Sensing Images for Land Management
Yinsheng Zhang,Ru Ji,Yuxiang Hu,Yan Yang,Xin Chen,Xiaoran Duan,Huilin Shan +6 more
TL;DR: Real-time semantic segmentation of remote sensing images for land management is achieved using a novel dual-path feature aggregation network (DPFANet). DPFANet achieves high accuracy with low latency and a small model size.
From Appearance to Inherence: A Hyperspectral Image Dataset and Benchmark of Material Classiffcation for Surveillance
Likun Gao,Hai-Miao Hu,Xinhui Xue,Haoxin Hu +3 more
TL;DR: This paper introduces a hyperspectral image dataset and benchmark for material classification in surveillance scenarios, leveraging 128 spectral bands and 28 material categories, and proposes a novel texture-metric-based fusion network achieving state-of-the-art results.
Exploring Compact and Efficient Neural Networks for Real-Time Semantic Segmentation
Zijia Li,ZhenZhong Xiao,Zhan Song +2 more
- 24 Sep 2023
TL;DR: A novel method is presented to build an efficient semantic segmentation network that provides a better balance between inference speed and prediction accuracy than the existing methods and is built upon a two-branch network architecture.
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Posted Content
Deep Residual Learning for Image Recognition
TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
117.9K
•Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
102.6K
ImageNet classification with deep convolutional neural networks
TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger,Philipp Fischer,Thomas Brox +2 more
- 05 Oct 2015
TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Related Papers (5)
Jonathan Long,Evan Shelhamer,Trevor Darrell +2 more
- 07 Jun 2015
Hengshuang Zhao,Jianping Shi,Xiaojuan Qi,Xiaogang Wang,Jiaya Jia +4 more
- 21 Jul 2017
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016