GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium

Open AccessPosted Content

GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium

- 26 Jun 2017

9.2K

TL;DR: In this article, a two time-scale update rule (TTUR) was proposed for training GANs with stochastic gradient descent on arbitrary GAN loss functions, which has an individual learning rate for both the discriminator and the generator.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR.2019.00224

Joint Discriminative and Generative Learning for Person Re-Identification

Zhedong Zheng, +5 more

- 15 Jun 2019

TL;DR: In this paper, a joint learning framework that couples re-id learning and data generation is proposed to improve learned re-ID embeddings by better leveraging the generated data, which leads to state-of-the-art performance on several benchmark datasets.

...read moreread less

859

Proceedings Article•10.1109/CVPR52688.2022.01117

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

Andreas Lugmayr, +5 more

- 24 Jan 2022

TL;DR: This work proposes RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks and outperforms state-of-the-art Autoregressive, and GAN approaches for at least five out of six mask distributions.

...read moreread less

854

•Posted Content

EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning

Kamyar Nazeri, +4 more

- 01 Jan 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A new approach for image inpainting that does a better job of reproducing filled regions exhibiting fine details is developed and outperforms current state-of-the-art techniques quantitatively and qualitatively.

...read moreread less

849

Proceedings Article•10.48550/arXiv.2204.03458

Video Diffusion Models

Jonathan Ho, +5 more

- 07 Apr 2022

TL;DR: The authors proposed a diffusion model for video generation, which is a natural extension of the standard image diffusion architecture and enables jointly training from image and video data, which they find to reduce the variance of minibatch gradients and speed up optimization.

...read moreread less

841

Journal Article•10.48550/arXiv.2210.02303

Imagen Video: High Definition Video Generation with Diffusion Models

Jonathan Ho, +10 more

- 05 Oct 2022

- arXiv.org

TL;DR: Imagen Video is presented, a text-conditional video generation system based on a cascade of video diffusion models not only capable of generating videos of high quality, but also having a high degree of controllability and world knowledge, including the ability to generate diverse videos and text animations in various artistic styles and with 3D object understanding.

...read moreread less

838

...

Expand

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

138.5K

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

48.6K

•Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, +1 more

- 06 Jul 2015

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

...read moreread less

43.7K

...

Expand

GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium

Chat with Paper

AI Agents for this Paper

Citations

Joint Discriminative and Generative Learning for Person Re-Identification

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning

Video Diffusion Models

Imagen Video: High Definition Video Generation with Diffusion Models

References

Adam: A Method for Stochastic Optimization

ImageNet classification with deep convolutional neural networks

ImageNet Classification with Deep Convolutional Neural Networks

Generative Adversarial Nets

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Related Papers (5)

Generative Adversarial Nets

Deep Residual Learning for Image Recognition

Image-to-Image Translation with Conditional Adversarial Networks

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

A Style-Based Generator Architecture for Generative Adversarial Networks