Proceedings Article10.1109/CVPR.2019.00323
Learning Instance Activation Maps for Weakly Supervised Instance Segmentation
Yi Zhu,Yanzhao Zhou,Huijuan Xu,Qixiang Ye,David Doermann,Jianbin Jiao +5 more
- 01 Jun 2019
- pp 3111-3120
TL;DR: This work designs a process to selectively collect pseudo supervision from noisy segment proposals obtained with previously published techniques and uses it to learn a differentiable filling module that predicts a class-agnostic activation map for each instance given the image and an incomplete region response.
read more
Abstract: Discriminative region responses residing inside an object instance can be extracted from networks trained with image-level label supervision. However, learning the full extent of pixel-level instance response in a weakly supervised manner remains unexplored. In this work, we tackle this challenging problem by using a novel instance extent filling approach. We first design a process to selectively collect pseudo supervision from noisy segment proposals obtained with previously published techniques. The pseudo supervision is used to learn a differentiable filling module that predicts a class-agnostic activation map for each instance given the image and an incomplete region response. We refer to the above maps as Instance Activation Maps (IAMs), which provide a fine-grained instance-level representation and allow instance masks to be extracted by lightweight CRF. Extensive experiments on the PASCAL VOC12 dataset show that our approach beats the state-of-the-art weakly supervised instance segmentation methods by a significant margin and increases the inference speed by an order of magnitude. Our method also generalizes well across domains and to unseen object categories. Without fine-tuning for the specific tasks, our model trained on VOC12 dataset (20 classes) obtains top performance for weakly supervised object localization on the CUB dataset (200 classes) and achieves competitive results on three widely used salient object detection benchmarks.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.
TL;DR: This paper introduces the token semantic coupled attention map (TS-CAM) to take full advantage of the self-attention mechanism in visual transformer for long-range dependency extraction and achieves state-of-the-art performance.
BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation
Jungbeom Lee,Jihun Yi,Chaehun Shin,Sungroh Yoon +3 more
- 01 Jun 2021
TL;DR: In this paper, a bounding-box attribution map (BBAM) was proposed to identify the target object in its bounding box and thus serve as pseudo ground truth for weakly supervised semantic and instance segmentation.
An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization
Yiqiu Shen,Nan Wu,Jason Phang,Jungkyu Park,Kangning Liu,Sudarshini Tyagi,Laura Heacock,S. Gene Kim,Linda Moy,Kyunghyun Cho,Krzysztof J. Geras +10 more
TL;DR: This work proposes a novel neural network model that is trained with only image-level labels and can generate pixel-level saliency maps indicating possible malignant findings in screening mammography interpretation: predicting the presence or absence of benign and malignant lesions.
Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation.
TL;DR: This article proposes a multiple instance learning (MIL) framework, which can be trained in an end-to-end manner using training images with image-level labels and achieves state-of-the-art performance for both weakly supervised instance segmentation and semantic segmentation.
190
Mixed supervision for surface-defect detection: From weakly to fully supervised learning
TL;DR: In this article, a deep learning architecture for surface-defect detection in industrial quality control has been proposed, which is composed of two sub-networks yielding defect segmentation and classification results.
173
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin,Piotr Dollár,Ross Girshick,Kaiming He,Bharath Hariharan,Serge Belongie +5 more
- 21 Jul 2017
TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.
Mask R-CNN
Kaiming He,Georgia Gkioxari,Piotr Dollár,Ross Girshick +3 more
- 20 Mar 2017
TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.
•Proceedings Article
Mask R-CNN
Kaiming He,Georgia Gkioxari,Piotr Dollár,Ross Girshick +3 more
- 20 Mar 2017
TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.
19.7K
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts,Mohamed Omran,Sebastian Ramos,Timo Rehfeld,Markus Enzweiler,Rodrigo Benenson,Uwe Franke,Stefan Roth,Bernt Schiele +8 more
- 01 Jun 2016
TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.
11.5K
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
[...]
Kaiming He,Georgia Gkioxari,Piotr Dollár,Ross Girshick +3 more
- 20 Mar 2017