HOGgles: Visualizing Object Detection Features

doi:10.1109/ICCV.2013.8

Open Access • Proceedings Article • 10.1109/ICCV.2013.8

Carl Vondrick, +3 more

- 01 Dec 2013

- pp 1-8

358

TL;DR: This paper introduces algorithms to visualize the feature spaces used by object detectors. The visualizations let a human put on 'HOG goggles' and perceive the visual world as a HOG-based object detector sees it, which makes it possible to analyze object detection systems in new ways and gain new insight into a detector's failures.

Figures

Figure 1: An image from PASCAL and a high scoring car detection from DPM [8]. Why did the detector fail?

Figure 2: We show the crop for the false car detection from Figure 1. On the right, we show our visualization of the HOG features for the same patch. Our visualization reveals that this false alarm actually looks like a car in HOG space.

Table 1: We evaluate the performance of our inversion algorithm by comparing the inverse to the ground truth image using the mean normalized cross correlation. Higher is better; a score of 1 is perfect. See supplemental for full table.
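The metric named in the Table 1 caption can be sketched as follows. This is a minimal illustration that assumes the common zero-mean definition of normalized cross correlation; the caption does not give the paper's exact formula, and the function names `normalized_cross_correlation` and `mean_ncc` are hypothetical, not taken from the authors' code. Images are represented as flat lists of grayscale values to keep the example dependency-free.

```python
import math


def normalized_cross_correlation(original, inverse):
    """Zero-mean NCC between two equally sized grayscale images (flat lists).

    Assumed definition: subtract each image's mean, then divide the dot
    product by the product of the two norms. Result lies in [-1, 1].
    """
    if len(original) != len(inverse) or not original:
        raise ValueError("images must be non-empty and the same size")
    mean_o = sum(original) / len(original)
    mean_i = sum(inverse) / len(inverse)
    dev_o = [x - mean_o for x in original]
    dev_i = [y - mean_i for y in inverse]
    numerator = sum(x * y for x, y in zip(dev_o, dev_i))
    denominator = math.sqrt(
        sum(x * x for x in dev_o) * sum(y * y for y in dev_i)
    )
    # A constant image has zero variance; report 0.0 instead of dividing by zero.
    return numerator / denominator if denominator > 0 else 0.0


def mean_ncc(pairs):
    """Average the NCC over (ground truth, inversion) image pairs."""
    scores = [normalized_cross_correlation(o, r) for o, r in pairs]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    truth = [1.0, 2.0, 3.0, 4.0]
    # Same structure with different brightness and contrast.
    brighter = [12.0, 14.0, 16.0, 18.0]
    print(normalized_cross_correlation(truth, brighter))
```

Under this definition the score is unchanged by brightness offsets and positive contrast scaling, which is consistent with the caption's statement that a score of 1 means a perfect reconstruction.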

Table 2: We evaluate visualization performance across twenty PASCAL VOC categories by asking MTurk workers to classify our inversions. Numbers are percent classified correctly; higher is better. Chance is 0.05. Glyph refers to the standard black-and-white HOG diagram popularized by [3]. Paired dictionary learning provides the best visualizations for humans. Expert refers to MIT PhD students in computer vision performing the same visualization challenge with HOG glyphs. See supplemental for full table.

Figure 13: HOG inversion reveals the world that object detectors see. The left shows a man standing in a dark room. If we compute HOG on this image and invert it, the previously dark scene behind the man emerges. Notice the wall structure, the lamp post, and the chair in the bottom right hand corner.

Figure 4: In this paper, we present algorithms to visualize HOG features. Our visualizations are perceptually intuitive for humans to understand.

Citations

•Proceedings Article•10.1109/CVPR.2014.81

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

- 23 Jun 2014

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

33.7K

•Proceedings Article•10.1109/ICCV.2017.74

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Ramprasaath R. Selvaraju, +5 more

- 01 Oct 2017

TL;DR: This work combines existing fine-grained visualizations to create a high-resolution class-discriminative visualization, Guided Grad-CAM, and applies it to image classification, image captioning, and visual question answering (VQA) models, including ResNet-based architectures.

14.7K

•Posted Content

Rich feature hierarchies for accurate object detection and semantic segmentation

Ross Girshick, +3 more

- 11 Nov 2013

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%.

13.1K

•Book Chapter•10.1007/978-3-319-46475-6_43

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Justin Johnson, +2 more

- 08 Oct 2016

TL;DR: In this paper, the authors combine the benefits of both approaches, and propose the use of perceptual loss functions for training feed-forward networks for image style transfer, where a feedforward network is trained to solve the optimization problem proposed by Gatys et al. in real-time.

10.1K

•Posted Content

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Justin Johnson, +2 more

- 27 Mar 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work considers image transformation problems and proposes the use of perceptual loss functions for training feed-forward networks for image transformation tasks. It shows results on image style transfer, where a feed-forward network is trained to solve the optimization problem proposed by Gatys et al. in real time.

8.3K

References

•Proceedings Article•10.1109/CVPR.2005.177

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

- 20 Jun 2005

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

36.7K

•Journal Article•10.1007/S11263-009-0275-4

The Pascal Visual Object Classes (VOC) Challenge

Mark Everingham, +4 more

- 01 Jun 2010

- International Journal of Computer Vision

TL;DR: This paper reviews the state of the art in evaluated methods for both classification and detection, and analyzes whether the methods are statistically different, what they are learning from the images, and what they find easy or confuse.

21.3K

•Proceedings Article•10.1109/ICCV.1999.790410

Object recognition from local scale-invariant features

David G. Lowe

- 20 Sep 1999

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

19.3K

•Journal Article•10.1109/TPAMI.2009.167

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010

- IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

11.9K

•Journal Article•10.1007/S11263-014-0733-5

The Pascal Visual Object Classes Challenge: A Retrospective

Mark Everingham, +5 more

- 01 Jan 2015

- International Journal of Computer Vision

TL;DR: A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.

7.8K

Related Papers (5)

Histograms of oriented gradients for human detection

ImageNet: A large-scale hierarchical image database

ImageNet Classification with Deep Convolutional Neural Networks

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Distinctive Image Features from Scale-Invariant Keypoints