29 Papers
8 Citations
Yaohui Wang is an academic researcher from French Institute for Research in Computer Science and Automation. The author has contributed to research in topics: Computer science & Generative model. The author has an hindex of 5, co-authored 14 publications.
Chat about Author
Papers
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo,Ceyuan Yang,Anyi Rao,Yaohui Wang,Yu Qiao,Dahua Lin,B. Z. Dai +6 more
- 10 Jul 2023
TL;DR: The authors proposed a framework to animate personalized text-to-image models by inserting a newly initialized motion modeling module into the T2I model and training it on video clips to distill reasonable motion priors.
ImaGINator: Conditional Spatio-Temporal GAN for Video Generation
Yaohui Wang,Piotr Bilinski,Francois Bremond,Antitza Dantcheva +3 more
- 01 Mar 2020
TL;DR: A novel conditional GAN architecture, namely ImaGINator, which given a single image, a condition (label of a facial expression or action) and noise, decomposes appearance and motion in both latent and high level feature spaces, generating realistic videos.
Joint Generative and Contrastive Learning for Unsupervised Person Re-identification
Hao Chen,Yaohui Wang,Benoit Lagadec,Antitza Dantcheva,Francois Bremond +4 more
- 01 Jun 2021
TL;DR: Li et al. as discussed by the authors proposed a mesh-based view generator to generate novel views of a person and proposed a view-invariant loss to facilitate contrastive learning between original and generated views.
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang,Xinyuan Chen,Xin Ma,Shangchen Zhou,Ziqi Huang,Yian Wang,Ceyuan Yang,Yinan He,Jiashuo Yu,Pe-der Yang,Yuwei Guo,Tianxing Wu,Chenyang Si,Yuming Jiang,Cunjian Chen,Chen Change Loy,B. Z. Dai,Dahua Lin,Yu Qiao,Ziwei Liu +19 more
TL;DR: The incorporation of simple temporal self-attentions, coupled with rotary positional encoding, adequately captures the temporal correlations inherent in video data and validate that the process of joint image-video fine-tuning plays a pivotal role in producing high-quality and creative outcomes.
•Posted Content
Joint Generative and Contrastive Learning for Unsupervised Person Re-identification
TL;DR: This paper incorporates a Generative Adversarial Network and a contrastive learning module into one joint training framework, and proposes a mesh-based view generator that significantly outperforms state-of-the-art methods under both, fully unsupervised and unsuper supervised domain adaptive settings on several large scale ReID dat-sets.