Learning Instance Activation Maps for Weakly Supervised Instance Segmentation

doi:10.1109/CVPR.2019.00323

Proceedings Article10.1109/CVPR.2019.00323

Learning Instance Activation Maps for Weakly Supervised Instance Segmentation

Yi Zhu, +5 more

- 01 Jun 2019

- pp 3111-3120

114

TL;DR: This work designs a process to selectively collect pseudo supervision from noisy segment proposals obtained with previously published techniques and uses it to learn a differentiable filling module that predicts a class-agnostic activation map for each instance given the image and an incomplete region response.

Abstract: Discriminative region responses residing inside an object instance can be extracted from networks trained with image-level label supervision. However, learning the full extent of pixel-level instance response in a weakly supervised manner remains unexplored. In this work, we tackle this challenging problem by using a novel instance extent filling approach. We first design a process to selectively collect pseudo supervision from noisy segment proposals obtained with previously published techniques. The pseudo supervision is used to learn a differentiable filling module that predicts a class-agnostic activation map for each instance given the image and an incomplete region response. We refer to the above maps as Instance Activation Maps (IAMs), which provide a fine-grained instance-level representation and allow instance masks to be extracted by lightweight CRF. Extensive experiments on the PASCAL VOC12 dataset show that our approach beats the state-of-the-art weakly supervised instance segmentation methods by a significant margin and increases the inference speed by an order of magnitude. Our method also generalizes well across domains and to unseen object categories. Without fine-tuning for the specific tasks, our model trained on VOC12 dataset (20 classes) obtains top performance for weakly supervised object localization on the CUB dataset (200 classes) and achieves competitive results on three widely used salient object detection benchmarks.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

Wei Gao, +7 more

- 27 Mar 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper introduces the token semantic coupled attention map (TS-CAM) to take full advantage of the self-attention mechanism in visual transformer for long-range dependency extraction and achieves state-of-the-art performance.

...read moreread less

224

•Proceedings Article•10.1109/CVPR46437.2021.00267

BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation

Jungbeom Lee, +3 more

- 01 Jun 2021

TL;DR: In this paper, a bounding-box attribution map (BBAM) was proposed to identify the target object in its bounding box and thus serve as pseudo ground truth for weakly supervised semantic and instance segmentation.

...read moreread less

224

•Journal Article•10.1016/J.MEDIA.2020.101908

An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization

Yiqiu Shen, +10 more

- 01 Feb 2021

- Medical Image Analysis

TL;DR: This work proposes a novel neural network model that is trained with only image-level labels and can generate pixel-level saliency maps indicating possible malignant findings in screening mammography interpretation: predicting the presence or absence of benign and malignant lesions.

...read moreread less

201

Journal Article•10.1109/TPAMI.2020.3023152

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation.

Yun Liu, +5 more

- 10 Sep 2020

- IEEE Transactions on Pattern Analysis an...

TL;DR: This article proposes a multiple instance learning (MIL) framework, which can be trained in an end-to-end manner using training images with image-level labels and achieves state-of-the-art performance for both weakly supervised instance segmentation and semantic segmentation.

...read moreread less

190

•Journal Article•10.1016/J.COMPIND.2021.103459

Mixed supervision for surface-defect detection: From weakly to fully supervised learning

Jakob Božič, +2 more

- 01 Aug 2021

- Computers in Industry

TL;DR: In this article, a deep learning architecture for surface-defect detection in industrial quality control has been proposed, which is composed of two sub-networks yielding defect segmentation and classification results.

...read moreread less

173

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article•10.1109/CVPR.2017.106

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

- 21 Jul 2017

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.

...read moreread less

29.5K

Proceedings Article•10.1109/ICCV.2017.322

Mask R-CNN

Kaiming He, +3 more

- 20 Mar 2017

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.

...read moreread less

23.6K

•Proceedings Article

Mask R-CNN

Kaiming He, +3 more

- 20 Mar 2017

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.

...read moreread less

19.7K

•Proceedings Article•10.1109/CVPR.2016.350

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts, +8 more

- 01 Jun 2016

TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

...read moreread less

11.5K

...

Expand

Learning Instance Activation Maps for Weakly Supervised Instance Segmentation

Chat with Paper

AI Agents for this Paper

Citations

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation

An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation.

Mixed supervision for surface-defect detection: From weakly to fully supervised learning

References

Deep Residual Learning for Image Recognition

Feature Pyramid Networks for Object Detection

Mask R-CNN

Mask R-CNN

The Cityscapes Dataset for Semantic Urban Scene Understanding

Related Papers (5)

Learning Deep Features for Discriminative Localization

Deep Residual Learning for Image Recognition

Microsoft COCO: Common Objects in Context

Mask R-CNN

"GrabCut": interactive foreground extraction using iterated graph cuts