DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

Open AccessPosted Content

DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

- 10 May 2016

- arXiv: Computer Vision and Pattern Recog...

783

TL;DR: In this paper, an incremental optimization strategy was proposed to explore the search space more efficiently, leading both to better performance and significant speed-up factors for multi-person pose estimation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation

Bowen Cheng, +5 more

- 27 Aug 2019

TL;DR: Higher-Resolution Network (HigherHRNet) is proposed, which is a simple extension of the High-Res resolution Network (HRNet), which generates higher-resolution feature maps by deconvolving the high- resolution feature maps outputted by HRNet, which are spatially more accurate for small and medium persons.

...read moreread less

57

•Posted Content

Cascade Feature Aggregation for Human Pose Estimation

Zhihui Su, +4 more

- 21 Feb 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A novel Cascade Feature Aggregation (CFA) method, which cascades several hourglass networks for robust human pose estimation, which outperforms the state-of-the-art and achieves the best performance on the state of theart benchmark MPII.

...read moreread less

55

•Journal Article•10.7554/ELIFE.58145

anTraX, a software package for high-throughput video tracking of color-tagged insects

Asaf Gal, +2 more

- 19 Nov 2020

- eLife

TL;DR: An algorithm and software package for high-throughput video tracking of color-tagged insects that combines neural network classification of animals with a novel approach for representing tracking data as a graph, enabling individual tracking even in cases where it is difficult to segment animals from one another, or where tags are obscured.

...read moreread less

53

•Proceedings Article•10.1109/ICCV.2019.00609

Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty

Serim Ryou, +2 more

- 01 Oct 2019

TL;DR: In this paper, a novel loss function is proposed to dynamically re-scales the cross entropy based on prediction difficulty regarding a sample, where the prediction difficulty is defined as a relative property coming from the confidence score gap between positive and negative labels.

...read moreread less

52

Journal Article•10.1016/J.CVIU.2018.03.007

A dual-source approach for 3D human pose estimation from single images

Umar Iqbal, +5 more

- 01 Jul 2018

- Computer Vision and Image Understanding

TL;DR: In this paper, a dual-source approach is proposed to estimate 2D pose from motion capture data and then estimate the 3D pose map from the 2D motion capture space to the image.

...read moreread less

51

...

Expand

References

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article•10.1109/CVPR.2015.7298965

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

- 07 Jun 2015

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

42.6K

...

Expand

DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

Chat with Paper

AI Agents for this Paper

Citations

Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation

Cascade Feature Aggregation for Human Pose Estimation

anTraX, a software package for high-throughput video tracking of color-tagged insects

Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty

A dual-source approach for 3D human pose estimation from single images

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Going deeper with convolutions

Fully convolutional networks for semantic segmentation

Related Papers (5)

2D Human Pose Estimation: New Benchmark and State of the Art Analysis

Deep Residual Learning for Image Recognition

DeepPose: Human Pose Estimation via Deep Neural Networks

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Microsoft COCO: Common Objects in Context