Self-Attention Generative Adversarial Networks

Open AccessProceedings Article

Self-Attention Generative Adversarial Networks

- 24 May 2019

- pp 7354-7363

2.1K

TL;DR: The proposed SAGAN achieves the state-of-the-art results, boosting the best published Inception score from 36.8 to 52.52 and reducing Frechet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset.

Abstract: In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover, the discriminator can check that highly detailed features in distant portions of the image are consistent with each other. Furthermore, recent work has shown that generator conditioning affects GAN performance. Leveraging this insight, we apply spectral normalization to the GAN generator and find that this improves training dynamics. The proposed SAGAN achieves the state-of-the-art results, boosting the best published Inception score from 36.8 to 52.52 and reducing Frechet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset. Visualization of the attention layers shows that the generator leverages neighborhoods that correspond to object shapes rather than local regions of fixed shape.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR.2019.00453

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, +2 more

- 15 Jun 2019

TL;DR: This paper proposed an alternative generator architecture for GANs, borrowing from style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.

...read moreread less

11.7K

•Proceedings Article•10.1109/CVPR.2019.00326

Dual Attention Network for Scene Segmentation

Jun Fu, +6 more

- 15 Jun 2019

TL;DR: New state-of-the-art segmentation performance on three challenging scene segmentation datasets, i.e., Cityscapes, PASCAL Context and COCO Stuff dataset is achieved without using coarse data.

...read moreread less

7.2K

•Proceedings Article•10.1109/CVPR42600.2020.00813

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

- 14 Jun 2020

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

...read moreread less

4.7K

•Posted Content

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

- 03 Dec 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work redesigns the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images, and thereby redefines the state of the art in unconditional image modeling.

...read moreread less

3.9K

•Proceedings Article•10.1109/CVPR.2019.00244

Semantic Image Synthesis With Spatially-Adaptive Normalization

Taesung Park, +3 more

- 18 Mar 2019

TL;DR: S spatially-adaptive normalization is proposed, a simple but effective layer for synthesizing photorealistic images given an input semantic layout that allows users to easily control the style and content of image synthesis results as well as create multi-modal results.

...read moreread less

3.5K

...

Expand

Self-Attention Generative Adversarial Networks

Chat with Paper

AI Agents for this Paper

Citations

A Style-Based Generator Architecture for Generative Adversarial Networks

Dual Attention Network for Scene Segmentation

Analyzing and Improving the Image Quality of StyleGAN

Analyzing and Improving the Image Quality of StyleGAN

Semantic Image Synthesis With Spatially-Adaptive Normalization

Related Papers (5)

Generative Adversarial Nets

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Image-to-Image Translation with Conditional Adversarial Networks

Attention is All you Need