Analyzing and Improving the Image Quality of StyleGAN

doi:10.1109/CVPR42600.2020.00813

Open Access•Proceedings Article•10.1109/CVPR42600.2020.00813

Tero Karras, +5 more

- 14 Jun 2020

- pp 8110-8119

4.7K

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

Citations

Journal Article•10.1115/1.4056500

DDE-GAN: Integrating a Data-driven Design Evaluator into Generative Adversarial Networks for Desirable and Diverse Concept Generation

Chenxi Yuan, +2 more

- 19 Dec 2022

- Journal of mechanical design

TL;DR: In this article, a multimodal Data-driven Design Evaluation (DDE) model is developed to guide the generative process by automatically predicting user sentiments for the generated samples based on large-scale user reviews of previous designs.

19

Journal Article•10.1609/aaai.v38i5.28313

Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style

Shuai Tan, +2 more

- 11 Mar 2024

TL;DR: This paper presents Style2Talker, an audio-driven talking face generation method that integrates emotion style and art style, utilizing large-scale pretrained models and latent diffusion models to produce high-resolution, artistically stylized talking head videos with improved audio-lip synchronization and emotional expression.

19

•Posted Content

STEEX: Steering Counterfactual Explanations with Semantics

Paul Jacob, +6 more

- 17 Nov 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, a new generative counterfactual explanation framework is proposed to produce plausible and sparse modifications that preserve the overall scene structure; users can guide the generation of counterfactuals by specifying a set of semantic regions of the query image that the explanation must be about.

19

•Posted Content

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

Hui-Po Wang, +2 more

- 28 Nov 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work shows that state-of-the-art GAN models, as publicly released by researchers and industry, can be used for a range of applications beyond unconditional image generation, via an iterative scheme that also gives control over the image generation process despite the highly non-linear latent spaces of the latest GAN models.

19

Journal Article•10.1109/tpami.2023.3283551

One-Shot Adaptation of GAN in Just One CLIP

01 Jan 2023

- IEEE Transactions on Pattern Analysis an...

TL;DR: OneshotCLIP as discussed by the authors employs a two-step training strategy: reference image search in the source generator using a CLIP-guided latent optimization, followed by generator fine-tuning with a novel loss function that imposes CLIP space consistency between the source and adapted generators.

19

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

102.6K

•Book Chapter•10.1007/978-3-319-24574-4_28

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

- 05 Oct 2015

TL;DR: Ronneberger et al. proposed a network and training strategy that relies on heavy use of data augmentation to use the available annotated samples more efficiently. The network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

92K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

51.9K

•Journal Article•10.1007/S11263-015-0816-Y

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015

- International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

41.6K

Related Papers (5)

A Style-Based Generator Architecture for Generative Adversarial Networks

Generative Adversarial Nets

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Image-to-Image Translation with Conditional Adversarial Networks

Deep Residual Learning for Image Recognition