CityPersons: A Diverse Dataset for Pedestrian Detection
Shanshan Zhang,Rodrigo Benenson,Bernt Schiele +2 more
- 21 Jul 2017
- pp 4457-4465
TL;DR: In this paper, a new set of person annotations on top of the Cityscapes dataset is introduced, CityPersons, which allows the first time to train one single CNN model that generalizes well over multiple benchmarks.
read more
Abstract: Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person annotations on top of the Cityscapes dataset. The diversity of CityPersons allows us for the first time to train one single CNN model that generalizes well over multiple benchmarks. Moreover, with additional training with CityPersons, we obtain top results using FasterRCNN on Caltech, improving especially for more difficult cases (heavy occlusion and small scale) and providing higher localization quality.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A machine vision system for avian song classification with CNN’s
Gabriel Palma,Ana Aquino,Patrícia Ferreira Monticelli,Luciano Martins Verdade,Charles Markham,Rafael Moral +5 more
- 31 Aug 2022
TL;DR: In this article , the authors used a pre-trained VGG16 CNN architecture to identify two nocturnal avian species, namely Antrostomus rufus and Megascops choliba, commonly encountered in Brazilian forests.
•Posted Content
Compensation Tracker: Reprocessing for Lost Object
Zhibo Zou,Junjie Huang,Ping Luo +2 more
TL;DR: Wang et al. as mentioned in this paper proposed a compensation tracker based on motion compensation and objects selection, which can be embedded into other non-end-to-end tracking frameworks, and achieved 66% MOTA and 67% IDF1 in the 2020 datasets.
1
Research on small object detection methods based on deep learning
29 Jul 2022
TL;DR: In this article , the authors systematically describe on small object detection methods based on deep learning, and divide them into four categories based on small-object detection optimization methods, such as data augmentation, multi-scale feature fusion, contextual features, and optimized backbone networks, and analyzes the benefits and drawbacks of each method.
1
Real-Time Pedestrian Detection Using Local Regression Based YOLOv6
Ravinder Kaur,Jitendra Singh +1 more
- 23 Jun 2023
TL;DR: The starting point is the YOLO-LR model, in which it is provided a highly fast method for learning a single neural network to do all of the aforementioned tasks, and the technique demonstrated competitive results.
1
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception
Sheng Jin,S. Li,Tong Shu Li,Wentao Liu,Chen Qian,Ping Luo +5 more
1
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon,Santosh K. Divvala,Ross Girshick,Ali Farhadi +3 more
- 27 Jun 2016
TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.