Ping Yu

9 Papers

9 Citations

Ping Yu is an academic researcher. The author has contributed to research in topics: Computer science & Question answering. The author has an hindex of 2, co-authored 2 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Ping Yu's Top Papers

Chat about Author

Papers

Journal Article•10.48550/arXiv.2305.11206

LIMA: Less Is More for Alignment

Chunting Zhou, +13 more

- 18 May 2023

- arXiv.org

TL;DR: This paper trained a 65B parameter LLaMa language model fine-tuned with the standard supervised loss on only 1,000 carefully curated prompts and responses, without any reinforcement learning or human preference modeling.

...read moreread less

427

Journal Article•10.48550/arxiv.2308.06259

Self-Alignment with Instruction Backtranslation

Xian Li, +7 more

- 11 Aug 2023

- arXiv.org

TL;DR: This work presents a scalable method to build a high quality instruction following language model by automatically labelling human-written text with corresponding instructions, not relying on distillation data, demonstrating highly effective self-alignment.

...read moreread less

Journal Article•10.48550/arxiv.2308.04592

Shepherd: A Critic for Language Model Generation

Tianlu Wang, +9 more

- 08 Aug 2023

- arXiv.org

TL;DR: This work introduces Shepherd, a language model specifically tuned to critique responses and suggest refinements, extending beyond the capabilities of an untuned model to identify diverse errors and provide suggestions to remedy them.

...read moreread less

Journal Article•10.48550/arXiv.2212.08286

ALERT: Adapting Language Models to Reasoning Tasks

Ping Yu, +6 more

- 16 Dec 2022

- arXiv.org

TL;DR: The authors introduce ALERT, a benchmark and suite of analyses for assessing language models' reasoning ability comparing pre-trained and finetuned models on complex tasks that require reasoning skills to solve.

...read moreread less

Journal Article•10.48550/arXiv.2305.12001

OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

Badr AlKhamissi, +5 more

- 19 May 2023

- arXiv.org

TL;DR: This paper investigated the role of explanations on different reasoning skills of large language models and found that having explanations in the few-shot exemplar has no significant impact on the model's performance when the model is finetuned, while positively affecting the non-finetuned counterpart.

...read moreread less