Ping Yu
9 Papers
9 Citations
Ping Yu is an academic researcher. The author has contributed to research in topics: Computer science & Question answering. The author has an hindex of 2, co-authored 2 publications.
Chat about Author
Papers
LIMA: Less Is More for Alignment
Chunting Zhou,Pengfei Liu,Srinivasan Iyer,Jiao Sun,Yuning Mao,Xuezhe Ma,Avia Efrat,Ping Yu,Lili Yu,Susan Zhang,Gargi Ghosh,M. Lewis,Luke Zettlemoyer,Omer Levy +13 more
TL;DR: This paper trained a 65B parameter LLaMa language model fine-tuned with the standard supervised loss on only 1,000 carefully curated prompts and responses, without any reinforcement learning or human preference modeling.
427
Self-Alignment with Instruction Backtranslation
TL;DR: This work presents a scalable method to build a high quality instruction following language model by automatically labelling human-written text with corresponding instructions, not relying on distillation data, demonstrating highly effective self-alignment.
Shepherd: A Critic for Language Model Generation
Tianlu Wang,Ping Yu,Xiaoqing E. Tan,Sean O'Brien,Ramakanth Pasunuru,Jane Dwivedi-Yu,Olga Golovneva,Luke Zettlemoyer,Maryam Fazel-Zarandi,A. Celikyilmaz +9 more
TL;DR: This work introduces Shepherd, a language model specifically tuned to critique responses and suggest refinements, extending beyond the capabilities of an untuned model to identify diverse errors and provide suggestions to remedy them.
49
ALERT: Adapting Language Models to Reasoning Tasks
TL;DR: The authors introduce ALERT, a benchmark and suite of analyses for assessing language models' reasoning ability comparing pre-trained and finetuned models on complex tasks that require reasoning skills to solve.
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Badr AlKhamissi,Siddharth Verma,Ping Yu,Zhijing Jin,A. Celikyilmaz,Mona Diab +5 more
TL;DR: This paper investigated the role of explanations on different reasoning skills of large language models and found that having explanations in the few-shot exemplar has no significant impact on the model's performance when the model is finetuned, while positively affecting the non-finetuned counterpart.