ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

Open Access • Posted Content

- 06 Aug 2021

- arXiv: Computer Vision and Pattern Recog...

310

TL;DR: The paper proposes Iterative Latent Variable Refinement (ILVR), a method that guides the generative process of a DDPM to produce high-quality images based on a given reference image.
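
The refinement idea in the summary above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `low_pass` and `ilvr_refine` are hypothetical, the low-pass filter is assumed to be block averaging followed by nearest-neighbour upsampling, and random arrays stand in for the DDPM proposal and the noised reference image. The sketch shows only the single step in which the low-frequency content of a proposed sample is swapped for that of the reference.

```python
import numpy as np

def low_pass(img, n):
    """Assumed low-pass filter: average n x n blocks, then upsample back by repetition."""
    h, w = img.shape
    blocks = img.reshape(h // n, n, w // n, n)
    means = blocks.mean(axis=(1, 3))
    return np.repeat(np.repeat(means, n, axis=0), n, axis=1)

def ilvr_refine(x_proposal, y_noisy, n):
    """Keep the proposal's high frequencies, take low frequencies from the reference."""
    return x_proposal + low_pass(y_noisy, n) - low_pass(x_proposal, n)

# Stand-ins (assumptions): in a real sampler x_proposal would come from one
# reverse DDPM step and y_noisy from noising the reference to the same timestep.
rng = np.random.default_rng(0)
x_proposal = rng.standard_normal((8, 8))
y_noisy = rng.standard_normal((8, 8))

x_refined = ilvr_refine(x_proposal, y_noisy, n=4)
```

With this particular filter, the refined sample matches the reference after filtering, while its residual detail is unchanged from the proposal. A larger `n` would constrain only coarser structure, leaving more of the sample to the generative model.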

Citations

Journal Article•10.1109/cvpr52729.2023.02155

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Nataniel Ruiz, +5 more

- 01 Jun 2023

TL;DR: DreamBooth fine-tunes text-to-image diffusion models to generate subject-driven images from text prompts, leveraging unique subject identifiers and a new autogenous class-specific prior preservation loss.

871

•Proceedings Article•10.1109/cvpr52688.2022.01117

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

01 Jun 2022

TL;DR: RePaint employs a pretrained unconditional DDPM as the generative prior. To condition the generation process, it alters only the reverse diffusion iterations, sampling the unmasked regions using the given image information.

676

•Journal Article•10.1109/tpami.2023.3261988

Diffusion Models in Vision: A Survey

01 Jan 2023

- IEEE Transactions on Pattern Analysis an...

TL;DR: Denoising diffusion models are a recent emerging topic in computer vision, demonstrating remarkable results in generative modeling. They are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens.

568

Journal Article•10.1109/cvpr52729.2023.00582

Imagic: Text-Based Real Image Editing with Diffusion Models

Bahjat Kawar, +7 more

- 01 Jun 2023

TL;DR: Imagic is the first method to apply complex text-based semantic edits to a single real image. It requires only a single input image and a target text, and can produce high-quality complex semantic edits.

358

•Journal Article•10.1016/j.neucom.2022.01.029

SRDiff: Single image super-resolution with diffusion probabilistic models

01 Mar 2022

- Neurocomputing

TL;DR: SRDiff is a diffusion-based model for single image super-resolution (SISR), optimized with a variant of the variational bound on the data likelihood.

309

References

•Book Chapter•10.1007/978-3-319-24574-4_28

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

- 05 Oct 2015

TL;DR: Ronneberger et al. propose a network and training strategy that relies on heavy use of data augmentation to use the available annotated samples more efficiently. The network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

92K

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously trained: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

48.6K

•Proceedings Article•10.1109/CVPR.2017.632

Image-to-Image Translation with Conditional Adversarial Networks

Phillip Isola, +3 more

- 21 Jul 2017

TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.

19.6K

•Proceedings Article•10.1109/ICCV.2017.244

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu, +3 more

- 01 Oct 2017

TL;DR: CycleGAN learns a mapping G : X → Y such that the distribution of images G(X) is indistinguishable from the distribution Y under an adversarial loss.

19.5K

Book Chapter•10.1007/978-3-658-40442-0_9