Monocular Object Instance Segmentation and Depth Ordering with CNNs

doi:10.1109/ICCV.2015.300

Open AccessProceedings Article10.1109/ICCV.2015.300

Monocular Object Instance Segmentation and Depth Ordering with CNNs

Ziyu Zhang, +3 more

- 07 Dec 2015

- pp 2614-2622

200

TL;DR: In this article, a Markov Random Field (MRF) is proposed to predict instance-level segmentation and depth ordering from a single monocular image, where the instance ID encodes the depth ordering within image patches.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR.2016.350

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

- 01 Jun 2016

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

...read moreread less

11.5K

•Proceedings Article•10.1109/CVPR.2018.00913

Path Aggregation Network for Instance Segmentation

Shu Liu, +4 more

- 18 Jun 2018

TL;DR: PANet as mentioned in this paper enhances the entire feature hierarchy with accurate localization signals in lower layers by bottom-up path augmentation, which shortens the information path between lower layers and topmost feature.

...read moreread less

7.8K

•Proceedings Article•10.1109/CVPR.2018.00214

Deep Ordinal Regression Network for Monocular Depth Estimation

Huan Fu, +4 more

- 18 Jun 2018

TL;DR: Deep Ordinal Regression Network (DORN) as discussed by the authors discretizes depth and recast depth network learning as an ordinal regression problem by training the network using an ordinary regression loss, which achieves much higher accuracy and faster convergence in synch.

...read moreread less

2.2K

•Proceedings Article•10.1109/CVPR.2019.00511

Hybrid Task Cascade for Instance Segmentation

Kai Chen, +11 more

- 15 Jun 2019

TL;DR: Chen et al. as discussed by the authors proposed a Hybrid Task Cascade (HTC) framework, which interweaves the two tasks for a joint multi-stage processing and adopted a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background.

...read moreread less

1.3K

Proceedings Article•10.1109/CVPR.2016.236

Monocular 3D Object Detection for Autonomous Driving

Xiaozhi Chen, +5 more

- 27 Jun 2016

TL;DR: This work proposes an energy minimization approach that places object candidates in 3D using the fact that objects should be on the ground-plane, and achieves the best detection performance on the challenging KITTI benchmark, among published monocular competitors.

...read moreread less

1.3K

...

Expand

References

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

51.9K

•Proceedings Article•10.1109/CVPR.2015.7298965

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

- 07 Jun 2015

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

42.6K

•Proceedings Article•10.1109/CVPR.2014.81

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

- 23 Jun 2014

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

...read moreread less

33.7K

•Book

A wavelet tour of signal processing

Stéphane Mallat

- 01 Jan 1998

TL;DR: An introduction to a Transient World and an Approximation Tour of Wavelet Packet and Local Cosine Bases.

...read moreread less

20.3K

...

Expand

Monocular Object Instance Segmentation and Depth Ordering with CNNs

Chat with Paper

AI Agents for this Paper

Citations

The Cityscapes Dataset for Semantic Urban Scene Understanding

Path Aggregation Network for Instance Segmentation

Deep Ordinal Regression Network for Monocular Depth Estimation

Hybrid Task Cascade for Instance Segmentation

Monocular 3D Object Detection for Autonomous Driving

References

Very Deep Convolutional Networks for Large-Scale Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Fully convolutional networks for semantic segmentation

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

A wavelet tour of signal processing

Related Papers (5)

Fully convolutional networks for semantic segmentation

Microsoft COCO: Common Objects in Context

Deep Residual Learning for Image Recognition

Faster R-CNN: towards real-time object detection with region proposal networks

Mask R-CNN