Detecting 11K Classes: Large Scale Object Detection Without Fine-Grained Bounding Boxes

doi:10.1109/ICCV.2019.00990

Open Access•Proceedings Article•10.1109/ICCV.2019.00990

Hao Yang, +2 more

- 14 Aug 2019

- pp 9805-9813

39

TL;DR: This paper proposes a semi-supervised, large-scale, fine-grained detection method that needs bounding-box annotations for only a small number of coarse-grained classes, plus image-level labels for the large-scale fine-grained classes, and can detect all classes at nearly fully supervised accuracy.

Citations

•Book Chapter•10.1007/978-3-031-20077-9_21

Detecting Twenty-Thousand Classes Using Image-Level Supervision

Xingyi Zhou, +4 more

- 07 Jan 2022

- Lecture Notes in Computer Science

TL;DR: Detic trains the classifiers of a detector on image classification data, expanding the vocabulary of detectors to tens of thousands of concepts; the approach is much easier to implement and is compatible with a range of detection architectures and backbones.

611

•Book Chapter•10.1007/978-3-030-58526-6_35

Improving Object Detection with Selective Self-supervised Self-training

Yandong Li, +4 more

- 23 Aug 2020

TL;DR: A selective net is proposed to rectify the supervision signals in Web images; it not only identifies positive bounding boxes but also creates a safe zone for mining hard negative boxes.

89

•Book Chapter•10.1007/978-3-030-58548-8_19

Grounded Situation Recognition

Sarah M Pratt, +5 more

- 23 Aug 2020

TL;DR: In this article, the authors introduce Grounded Situation Recognition (GSR), a task that requires producing structured semantic summaries of images describing: the primary activity, entities engaged in the activity with their roles, and bounding-box groundings of entities.

73

•Journal Article•10.1109/lra.2022.3146922

Learning Open-World Object Proposals Without Learning to Classify

- 01 Apr 2022

TL;DR: Object Localization Network (OLN) estimates the objectness of each region purely by how well the location and shape of the region overlap with any ground-truth object (e.g., centerness and IoU).

61

•Book Chapter•10.1007/978-3-030-58568-6_11

Object Detection with a Unified Label Space from Multiple Datasets

Xiangyun Zhao, +5 more

- 23 Aug 2020

TL;DR: Zhao et al. propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels to train a single object detector that predicts over the union of all the label spaces.

50

References

•Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

75.9K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals, and further merges RPN and Fast R-CNN into a single network by sharing their convolutional features.

64.4K

•Book Chapter•10.1007/978-3-319-10602-1_48

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

- 06 Sep 2014

TL;DR: A new dataset is presented with the goal of advancing the state of the art in object recognition by placing it in the context of the broader question of scene understanding; the dataset gathers images of complex everyday scenes containing common objects in their natural context.

51.7K

•Proceedings Article•10.1109/CVPR.2016.91

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

- 27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

45.7K

•Book Chapter•10.1007/978-3-319-46448-0_2

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

- 08 Oct 2016

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.

35.5K

Related Papers (5)

Microsoft COCO: Common Objects in Context

YOLO9000: Better, Faster, Stronger

Feature Pyramid Networks for Object Detection

ImageNet: A large-scale hierarchical image database

Fast R-CNN