Shaochen Zhong
12 Papers
Shaochen Zhong is an academic researcher who has contributed to research in computer science. The author has an h-index of 2 and has co-authored 4 publications.
Papers
Data-centric Artificial Intelligence: A Survey
TL;DR: This paper presents a comprehensive survey of data-centric AI that gives a global view of a spectrum of tasks across the stages of the data lifecycle and equips readers with techniques and further research ideas for systematically engineering data to build AI systems.
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
Zirui Liu, Jiayi Yuan, Hongye Jin, Shaochen Zhong, Zhaozhuo Xu, Vladimir Braverman, Beidi Chen, Xia Hu
TL;DR: This paper presents KIVI, a tuning-free 2-bit KV cache quantization algorithm that enables Llama (Llama-2), Falcon, and Mistral models to maintain almost the same quality while using $\mathbf{2.6\times}$ less peak memory (including the model weights).
Proceedings Article
Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions
TL;DR: This paper revisits the idea of kernel pruning (pruning only one or several k × k kernels out of a 3D filter) and develops a simple yet cost-efficient greedy approximation algorithm to determine which grouped kernels to keep within each filter group.
16
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
Zirui Liu, Guanchu Wang, Shaochen Zhong, Zhaozhuo Xu, Daochen Zha, Ruixiang Tang, Zhimeng Jiang, Kaixiong Zhou, Vipin Chaudhary, Xia Hu
TL;DR: This paper proposes WTA-CRS, a new family of unbiased estimators for matrix products with reduced variance, which requires storing only the sub-sampled activations to calculate the gradient.
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui, Yu-Neng Chuang, Guanchu Wang, Jiamu Zhang, Tianyi Zhang, Jiayi Yuan, Hongyi Liu, Andrew Wen, Shaochen Zhong, Hanjie Chen, Xia Hu