Semantic Instance Segmentation for Autonomous Driving

doi:10.1109/CVPRW.2017.66

Open Access•Proceedings Article•10.1109/CVPRW.2017.66

Bert De Brabandere, +2 more

- 01 Jul 2017

- pp 478-480

242

TL;DR: This work proposes a discriminative loss function, operating at pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step.
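The idea in the summary above can be sketched in code. The following is a minimal NumPy illustration of a pull/push style loss on per-pixel embeddings, followed by a simple threshold-based grouping step. The listing itself does not give the formula, so the three terms, the margins `delta_v` and `delta_d`, the weights `alpha`, `beta`, `gamma`, and the function names are all assumptions made for illustration, not a reproduction of the authors' implementation.

```python
import numpy as np


def discriminative_loss(emb, labels, delta_v=0.5, delta_d=1.5,
                        alpha=1.0, beta=1.0, gamma=0.001):
    """Illustrative pixel-level loss (assumed form, see lead-in).

    emb:    (N, D) array, one embedding vector per pixel.
    labels: (N,) array of ground-truth instance ids.
    """
    ids = np.unique(labels)
    num = len(ids)
    # Mean embedding of each instance, shape (C, D).
    means = np.stack([emb[labels == i].mean(axis=0) for i in ids])

    # Pull term: pixels farther than delta_v from their own mean are penalised.
    l_var = 0.0
    for c, i in enumerate(ids):
        dist = np.linalg.norm(emb[labels == i] - means[c], axis=1)
        l_var += np.mean(np.maximum(dist - delta_v, 0.0) ** 2)
    l_var /= num

    # Push term: pairs of instance means closer than 2 * delta_d are penalised.
    l_dist = 0.0
    if num > 1:
        pair = np.linalg.norm(means[:, None, :] - means[None, :, :], axis=2)
        hinge = np.maximum(2.0 * delta_d - pair, 0.0) ** 2
        off_diag = ~np.eye(num, dtype=bool)
        l_dist = hinge[off_diag].sum() / (num * (num - 1))

    # Small regulariser keeping the means close to the origin.
    l_reg = np.mean(np.linalg.norm(means, axis=1))
    return alpha * l_var + beta * l_dist + gamma * l_reg


def cluster_by_threshold(emb, radius):
    """Assumed post-processing: greedily group unlabeled pixels near a seed."""
    labels = np.full(len(emb), -1)
    next_id = 0
    while (labels == -1).any():
        seed = np.flatnonzero(labels == -1)[0]
        near = np.linalg.norm(emb - emb[seed], axis=1) < radius
        labels[near & (labels == -1)] = next_id
        next_id += 1
    return labels


# Toy example: two tight, well-separated groups of "pixels" in a 2-D embedding.
emb = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 0.0], [5.1, 0.0]])
good = np.array([0, 0, 1, 1])   # labels that match the groups
bad = np.array([0, 1, 0, 1])    # labels that cut across the groups
loss_good = discriminative_loss(emb, good, gamma=0.0)
loss_bad = discriminative_loss(emb, bad, gamma=0.0)
pred = cluster_by_threshold(emb, radius=1.5)
```

In this toy setup the loss is zero when the labels agree with the embedding geometry and large when they do not, which is the property that makes the later grouping step simple. The greedy seed-and-threshold grouping is only one possible choice of post-processing.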

Citations

•Journal Article•10.1038/S41592-019-0403-1

Deep learning for cellular image analysis

Erick Moen, +5 more

- 27 May 2019

- Nature Methods

TL;DR: The intersection between deep learning and cellular image analysis is reviewed, and an overview of both the mathematical mechanics and the programming frameworks of deep learning that are pertinent to life scientists is provided.

1K

•Journal Article•10.1016/J.ENG.2018.11.030

Advances in Computer Vision-Based Civil Infrastructure Inspection and Monitoring

Billie F. Spencer, +2 more

- 08 May 2019

- Engineering

TL;DR: An overview of recent advances in computer vision techniques as they apply to civil infrastructure condition assessment is presented, along with some of the key challenges that persist toward the goal of automated vision-based civil infrastructure inspection and monitoring.

867

•Book Chapter•10.1007/978-3-030-01264-9_17

PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

George Papandreou, +5 more

- 08 Sep 2018

TL;DR: In this article, a CNN is used to detect individual keypoints and predict their relative displacements, so that keypoints can be grouped into person pose instances; semantic person pixels are then associated with their corresponding person instance, delivering instance-level person segmentations.

771

•Book Chapter•10.1007/978-3-030-58523-5_38

SOLO: Segmenting Objects by Locations

Xinlong Wang, +4 more

- 23 Aug 2020

TL;DR: This work introduces the notion of instance categories, which assigns a category to each pixel within an instance according to the instance's location and size, thereby converting instance segmentation into a single-shot, classification-solvable problem.

605

•Journal Article•10.1109/TPAMI.2020.3014297

YOLACT++: Better Real-time Instance Segmentation.

Daniel Bolya, +3 more

- 05 Aug 2020

- IEEE Transactions on Pattern Analysis an...

TL;DR: A simple, fully-convolutional model for real-time instance segmentation that achieves competitive results on MS COCO evaluated on a single Titan Xp, which is significantly faster than any previous state-of-the-art approach.

522

References

•Journal Article•10.1109/TPAMI.2016.2644615

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

Vijay Badrinarayanan, +2 more

- 01 Dec 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: Quantitative assessments show that SegNet provides good performance with competitive inference time and most efficient inference memory-wise as compared to other architectures, including FCN and DeconvNet.

19.6K

•Proceedings Article•10.1109/CVPR.2015.7298682

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, +2 more

- 12 Mar 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: FaceNet uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

14.2K

•Proceedings Article•10.1109/CVPR.2016.350

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

- 01 Jun 2016

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

11.5K

•Proceedings Article

Multi-Scale Context Aggregation by Dilated Convolutions

Fisher Yu, +1 more

- 30 Apr 2016

TL;DR: This work develops a new convolutional network module that is specifically designed for dense prediction, and shows that the presented context module increases the accuracy of state-of-the-art semantic segmentation systems.

9.3K

•Proceedings Article•10.1109/CVPR.2015.7298682

FaceNet: A unified embedding for face recognition and clustering

Florian Schroff, +2 more

- 07 Jun 2015

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

8.4K

Related Papers (5)

Mask R-CNN

Deep Residual Learning for Image Recognition

Microsoft COCO: Common Objects in Context

Fully convolutional networks for semantic segmentation

U-Net: Convolutional Networks for Biomedical Image Segmentation