Q. Zhou
10 Papers
Q. Zhou is an academic researcher. The author has contributed to research in topics: Computer science. The author has an hindex of 2, co-authored 4 publications.
Chat about Author
Papers
SALMON: Self-Alignment with Principle-Following Reward Models
Zhiqing Sun,Yikang Shen,Hongxin Zhang,Q. Zhou,Zhenfang Chen,David Cox,Yiming Yang,Chuang Gan +7 more
TL;DR: A novel approach, namely SALMON (Self-ALignMent with principle-fOllowiNg reward models), to align base language models with minimal human supervision, using only a small set of human-defined principles, yet achieving superior performance.
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning
Zhenfang Chen,Q. Zhou,Yikang Shen,Yining Hong,Zhiqing Sun,Dan Gutfreund,Chuang Gan +6 more
- 24 Mar 2024
TL;DR: This paper proposes Visual Chain-of-Thought Prompting (VCTP) for knowledge-based visual reasoning, integrating visual perception and language-based reasoning through iterative steps, achieving better performance, transparency, and efficiency on various datasets.
AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation
Q. Zhou,Peng Fei Li,Yang Liu,Yuyang Guan,Qizhou Xing,Ming Chen,Maosong Sun,Yang Liu +7 more
- AI open
TL;DR: This study proposes AdaDS, a framework that adaptively selects data for knowledge distillation, leveraging various strategies to improve performance across tasks, data sizes, and training stages, achieving comparable results to DistilBERT with reduced computational cost.
1
Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models
Q. Zhou,Zonghan Yang,Peng-Fei Li,Yang Lu +3 more
- 15 Jun 2023
TL;DR: This paper proposed a decision-based knowledge distillation method to estimate logits from the decision distributions, where decision distributions can be both derived as a function of logits theoretically and estimated with test-time data augmentation empirically.
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Zhiqing Sun,Yikang Shen,Q. Zhou,Hongxin Zhang,Zhenfang Chen,David Cox,Yiming Yang,Chuang Gan +7 more
TL;DR: This paper proposed a self-alignment approach called Self-ALIGN, which combines principle-driven reasoning and the generative power of LLMs for the self alignment of AI agents with minimal human supervision.