Analyzing and Improving the Image Quality of StyleGAN

doi:10.1109/CVPR42600.2020.00813

Open AccessProceedings Article10.1109/CVPR42600.2020.00813

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, +5 more

- 14 Jun 2020

- pp 8110-8119

4.7K

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/cvpr52729.2023.02152

SceneComposer: Any-Level Semantic Image Synthesis

Y. Zeng, +6 more

- 01 Jun 2023

TL;DR: SceneComposer generates high-quality images from semantic layouts of any precision level, ranging from text to detailed shapes. It supports flexible control over image generation based on user input and offers a wide range of capabilities for different drawing expertise and stages of the creative process.

...read moreread less

18

Journal Article•10.1109/cvpr52733.2024.00829

Prompt-Free Diffusion: Taking “Text” Out of Text-to-Image Diffusion Models

Xingqian Xu, +5 more

- 16 Jun 2024

18

•Journal Article•10.1007/978-3-031-19790-1_15

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

01 Jan 2022

- Lecture Notes in Computer Science

TL;DR: Wang et al. as mentioned in this paper proposed a 3D-aware GAN, SURF-GAN, which is capable of discovering semantic attributes during training and controlling them in an unsupervised manner.

...read moreread less

18

•Posted Content

MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation

Tianxiang Ma, +3 more

- 18 Nov 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A novel multi-level statistics transfer model is proposed, which disentangles and transfers multi- level appearance features from person images and merges them with pose features to reconstruct the source person images themselves so that the source images can be used as supervision for self-driven person image generation.

...read moreread less

18

Journal Article•10.1109/iccv51070.2023.02085

InfiniCity: Infinite-Scale City Synthesis

Chieh Hubert Lin, +6 more

- 01 Oct 2023

TL;DR: InfiniCity synthesizes infinite-scale 3D city environments from random noises, leveraging 2D and 3D data.

...read moreread less

18

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Book Chapter•10.1007/978-3-319-24574-4_28

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

- 05 Oct 2015

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

92K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

51.9K

•Journal Article•10.1007/S11263-015-0816-Y

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015

- International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

41.6K

...

Expand

Analyzing and Improving the Image Quality of StyleGAN

Chat with Paper

AI Agents for this Paper

Citations

SceneComposer: Any-Level Semantic Image Synthesis

Prompt-Free Diffusion: Taking “Text” Out of Text-to-Image Diffusion Models

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation

InfiniCity: Infinite-Scale City Synthesis

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet Large Scale Visual Recognition Challenge

Related Papers (5)

A Style-Based Generator Architecture for Generative Adversarial Networks

Generative Adversarial Nets

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Image-to-Image Translation with Conditional Adversarial Networks

Deep Residual Learning for Image Recognition