Xuhui Jia
University of Hong Kong
23 Papers
92 Citations
Xuhui Jia is an academic researcher from University of Hong Kong. The author has contributed to research in topics: Computer science & Random forest. The author has an hindex of 8, co-authored 19 publications. Previous affiliations of Xuhui Jia include University of Central Florida & Google.
Chat about Author
Papers
Search to Distill: Pearls Are Everywhere but Not the Eyes
Yu Liu,Xuhui Jia,Mingxing Tan,Raviteja Vemulapalli,Yukun Zhu,Bradley Ray Green,Xiaogang Wang +6 more
- 14 Jun 2020
TL;DR: This work presents a new architecture-aware Knowledge Distillation approach that finds student models (pearls for the teacher) that are best for distilling the given teacher model and leverages Neural Architecture Search (NAS), equipped with the authors' KD-guided reward, to search for the best student architectures for a given teacher.
Subject-driven Text-to-Image Generation via Apprenticeship Learning
TL;DR: SuTI as mentioned in this paper is a subject-driven text-to-image generator that replaces subject-specific fine-tuning with in-context learning, given a few demonstrations of a new subject.
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia,Yang Zhao,Kelvin C.K. Chan,Yandong Li,Hanzhe Zhang,Boqing Gong,Tingbo Hou,Huisheng Wang,Yu Su +8 more
TL;DR: Zhang et al. as mentioned in this paper proposed an encoder to capture high-level identifiable semantics of objects, producing an object-specific embedding with only a single feed-forward pass.
67
•Posted Content
An Empirical Study of Recent Face Alignment Methods
TL;DR: A new evaluation metric for face alignment on a set of images, i.e., area under error distribution curve within a threshold, AUC$_\alpha$, is proposed given the fact that the traditional evaluation measure (mean error) is very sensitive to big alignment error.
51
•Journal Article
Global Self-Attention Networks for Image Recognition
TL;DR: A new global self-attention module, referred to as the GSA module, which is efficient enough to serve as the backbone component of a deep network, and introduces new standalone global attention-based deep networks that use GSA modules instead of convolutions to model pixel interactions.