SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

doi:10.1109/cvpr52688.2022.01097

Open Access•Proceedings Article•10.1109/cvpr52688.2022.01097

01 Jun 2022

41

TL;DR: SemanticStyleGAN trains a generator to model local semantic parts separately and to synthesize images compositionally, with the structure and texture of each local part controlled by its own latent code.

Citations

•Journal Article•10.1145/3550454.3555506

IDE-3D

Jin Sun, +5 more

- 30 Nov 2022

- ACM Transactions on Graphics

TL;DR: The authors propose a 3D-semantics-aware generative model that produces view-consistent, disentangled face images and semantic masks, together with a hybrid GAN inversion approach that initializes the latent codes from the semantic and texture encoders and then optimizes them further for faithful reconstruction.

65

•Book Chapter•10.1007/978-3-031-19781-9_42

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

Yuedong Chen, +4 more

- 21 Mar 2022

TL;DR: This paper introduces a new task, Semantic-to-NeRF translation, which aims to reconstruct a 3D scene modelled by NeRF, conditioned on a single-view semantic mask as input.

41

•Proceedings Article•10.1109/icassp49357.2023.10096372

Generative Model based Highly Efficient Semantic Communication Approach for Image Transmission

Tiancheng Han, +5 more

- 18 Nov 2022

TL;DR: The authors propose a generative-model-based semantic communication approach that improves the efficiency of image transmission and protects private information, using a privacy filter and a knowledge base to erase private information and replace it with natural features from the knowledge base.

29

•Journal Article•10.1109/tpami.2023.3298868

GAN-Based Facial Attribute Manipulation

Yunfan Liu, +4 more

- 01 Dec 2023

- IEEE Transactions on Pattern Analysis an...

TL;DR: This survey reviews existing GAN-based facial attribute manipulation methods and explores future directions in the field.

21

•Journal Article•10.1109/tmi.2024.3382043

Mutual Information Guided Diffusion for Zero-shot Cross-modality Medical Image Translation

Zihao Wang, +6 more

- 29 Mar 2024

- IEEE Transactions on Medical Imaging

TL;DR: This study proposes the Mutual Information guided Diffusion Model (MIDiffusion) for zero-shot cross-modality medical image translation, leveraging statistical consistency between modalities and a differentiable local-wise mutual information layer for iterative denoising and domain adaptation.

9

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: This paper proposes a residual learning framework that eases the training of networks substantially deeper than those used previously; the resulting networks won 1st place in the ILSVRC 2015 classification task.

198.7K

•Proceedings Article•10.1109/CVPR.2019.00453

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, +2 more

- 15 Jun 2019

TL;DR: This paper proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, that leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.

11.7K

•Proceedings Article•10.1109/ICCV.2015.425

Deep Learning Face Attributes in the Wild

Ziwei Liu, +3 more

- 07 Dec 2015

TL;DR: This paper proposes a deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are pre-trained differently but fine-tuned jointly with attribute tags.

10.1K

•Proceedings Article•10.1109/CVPR.2018.00917

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Ting-Chun Wang, +5 more

- 18 Jun 2018

TL;DR: In this paper, a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs) is presented.

5.4K

•Proceedings Article

Improved Techniques for Training GANs

Tim Salimans, +5 more

- 05 Dec 2016

TL;DR: This paper applies a variety of new architectural features and training procedures to the generative adversarial networks (GANs) framework, achieving state-of-the-art results in semi-supervised classification on MNIST, CIFAR-10 and SVHN.

5.2K