Lu Hou
Huawei
31 Papers
178 Citations
Lu Hou is an academic researcher from Huawei. The author has contributed to research in topics: Computer science & Quantization (signal processing). The author has an hindex of 10, co-authored 21 publications. Previous affiliations of Lu Hou include Chinese Academy of Sciences & Hong Kong University of Science and Technology.
Chat about Author
Papers
•Proceedings Article
DynaBERT: Dynamic BERT with Adaptive Width and Depth
Lu Hou,Zhiqi Huang,Lifeng Shang,Xin Jiang,Xiao Chen,Qun Liu +5 more
- 01 Jan 2020
TL;DR: A novel dynamic BERT model, which can run at adaptive width and depth, is proposed (abbreviated as DynaBERT), which has comparable performance as BERT (or RoBERTa), while at smaller widths and depths consistently outperforms existing BERT compression methods.
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei Zhang,Lu Hou,Yichun Yin,Lifeng Shang,Xiao Chen,Xin Jiang,Qun Liu +6 more
- 01 Nov 2020
TL;DR: This work proposes TernaryBERT, which ternarizes the weights in a fine-tuned BERT model, and leverages the knowledge distillation technique in the training process to reduce the accuracy degradation caused by the lower capacity of low bits.
•Posted Content
FILIP: Fine-grained Interactive Language-Image Pre-Training
Lewei Yao,Runhui Huang,Lu Hou,Guansong Lu,Minzhe Niu,Hang Xu,Xiaodan Liang,Zhenguo Li,Xin Jiang,Chunjing Xu +9 more
TL;DR: In this paper, a fine-grained interactive language-image pre-training (FILIP) is proposed to achieve finer-level alignment through a cross-modal late interaction mechanism, which uses a token-wise maximum similarity between visual and textual tokens to guide the contrastive objective.
149
•Posted Content
Loss-aware Binarization of Deep Networks
TL;DR: In this article, a proximal Newton algorithm with diagonal Hessian approximation is proposed to directly minimize the loss w.r.t. the binarized weights, and the second order information can be efficiently obtained from the second moments already computed by the Adam optimizer.
125
•Posted Content
DynaBERT: Dynamic BERT with Adaptive Width and Depth.
TL;DR: In this paper, the authors propose a dynamic BERT model, which can flexibly adjust the size and latency by selecting adaptive width and depth, by distilling knowledge from the full-sized model to small sub-networks.
115