RePaint: Inpainting using Denoising Diffusion Probabilistic Models

doi:10.1109/CVPR52688.2022.01117

Proceedings Article•10.1109/CVPR52688.2022.01117

Andreas Lugmayr, +5 more

- 24 Jan 2022

pp 11451-11461

854

TL;DR: This work proposes RePaint, a Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable even to extreme masks and outperforms state-of-the-art autoregressive and GAN approaches for at least five out of six mask distributions.

Citations

Journal Article•10.48550/arXiv.2209.00796

Diffusion Models: A Comprehensive Survey of Methods and Applications

Ling Yang, +8 more

- 02 Sep 2022

- arXiv.org

TL;DR: A comprehensive review of existing variants of diffusion models and a thorough investigation into their applications, including computer vision, natural language processing, waveform signal processing, multi-modal modeling, molecular graph generation, time series modeling, and adversarial purification.

734

Journal Article•10.1109/TPAMI.2023.3261988

Diffusion Models in Vision: A Survey

Florinel-Alin Croitoru, +3 more

- 10 Sep 2022

- IEEE Transactions on Pattern Analysis an...

TL;DR: A multi-perspective categorization of diffusion models applied in computer vision is introduced, and their relations to other deep generative models, including variational auto-encoders, generative adversarial networks, energy-based models, autoregressive models and normalizing flows, are discussed.

635

Journal Article•10.48550/arXiv.2211.01324

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Yogesh Balaji, +12 more

- 02 Nov 2022

- arXiv.org

TL;DR: The authors propose training an ensemble of text-to-image diffusion models, each specialized for a different synthesis stage. This improves text alignment while keeping the same inference cost and preserving high visual quality, and it outperforms previous large-scale text-to-image diffusion models on the standard benchmark.

515

Journal Article•10.48550/arXiv.2304.08818

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Andreas Blattmann, +6 more

- 18 Apr 2023

- arXiv.org

TL;DR: The authors apply the LDM paradigm to high-resolution video generation, a particularly resource-intensive task, by introducing a temporal dimension to the latent space diffusion model and fine-tuning on encoded image sequences, i.e., videos.

480

Journal Article•10.48550/arXiv.2301.13188

Extracting Training Data from Diffusion Models

Nicholas Carlini, +8 more

- 30 Jan 2023

- arXiv.org

TL;DR: The authors show that diffusion models memorize individual images from their training data and emit them at generation time, and that mitigating these vulnerabilities may require new advances in privacy-preserving training.

318

References

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

48.6K

•Journal Article

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Apr 2015

- Springer US

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) has been running annually for five years (since 2010) and has become the standard benchmark for large-scale object recognition.

23.9K

•Proceedings Article•10.1109/CVPR.2019.00453

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, +2 more

- 15 Jun 2019

TL;DR: This paper proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.

11.7K

•Posted Content

Denoising Diffusion Probabilistic Models

Jonathan Ho, +2 more

- 19 Jun 2020

- arXiv: Learning

TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.

11.7K

•Proceedings Article•10.1109/ICCV.2015.425

Deep Learning Face Attributes in the Wild

Ziwei Liu, +3 more

- 07 Dec 2015

TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.

10.1K
