DiffusionDet: Diffusion Model for Object Detection

doi:10.1109/iccv51070.2023.01816

Journal Article10.1109/iccv51070.2023.01816

DiffusionDet: Diffusion Model for Object Detection

Shoufa Chen, +3 more

- 01 Oct 2023

176

TL;DR: DiffusionDet is a novel object detection framework based on a diffusion process, achieving competitive performance with flexibility in the number of boxes and iterations.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/j.media.2023.102846

Diffusion models in medical imaging: A comprehensive survey.

A Kazerouni, +6 more

- 01 May 2023

- Medical Image Analysis

TL;DR: A comprehensive overview of diffusion models in the discipline of medical imaging can be found in this article , where the authors provide a taxonomy based on their application, imaging modality, organ of interest, and algorithms.

...read moreread less

196

Journal Article•10.1109/tkde.2024.3361474

A Survey on Generative Diffusion Models

Hanqun Cao, +6 more

- 06 Sep 2022

- IEEE Transactions on Knowledge and Data ...

TL;DR: This survey comprehensively elucidates the fundamental formulation of diffusion, algorithmic enhancements, and the manifold applications of diffusion from three distinct angles: the fundamental formulation of diffusion, algorithmic enhancements, and the manifold applications of diffusion.

...read moreread less

145

Journal Article•10.1109/cvpr52729.2023.00708

Dense Distinct Query for End-to-End Object Detection

Shilong Zhang, +7 more

- 01 Jun 2023

TL;DR: Dense Distinct Query (DDQ) significantly improves object detection performance by combining the advantages of traditional and recent end-to-end detectors.

...read moreread less

75

Journal Article•10.1109/iccv51070.2023.00527

Unleashing Text-to-Image Diffusion Models for Visual Perception

Wenliang Zhao, +5 more

- 01 Oct 2023

TL;DR: VPD framework utilizes pre-trained text-to-image diffusion models for visual perception tasks, leveraging their high-level knowledge and achieving state-of-the-art performance.

...read moreread less

63

Journal Article•10.1109/tip.2023.3322046

Dif-Fusion: Toward High Color Fidelity in Infrared and Visible Image Fusion With Diffusion Models

Jun Yue, +4 more

- 01 Jan 2023

- IEEE Transactions on Image Processing

TL;DR: Dif-Fusion achieves high color fidelity in infrared and visible image fusion by generating the distribution of multi-channel input data with diffusion models.

...read moreread less

50

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2016.91

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

- 27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

45.7K

•Book Chapter•10.1007/978-3-319-46448-0_2

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

- 08 Oct 2016

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.

...read moreread less

35.5K

...

Expand