Least-to-Most Prompting Enables Complex Reasoning in Large Language
  Models

doi:10.48550/arxiv.2205.10625

Open AccessPosted Content10.48550/arxiv.2205.10625

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models

21 May 2022

206

TL;DR: This article proposed a least-to-most prompting strategy, which reduces a complex problem into a list of subproblems, and then sequentially solves these sub-problems by solving a given subproblem by the answers to previously solved subproproblems.

Abstract: Although chain-of-thought prompting has shown impressive results on many natural language reasoning tasks, it often performs poorly on tasks which need to solve problems harder than the demonstration examples. To tackle such easy-to-hard generalization issues, we propose a novel prompting strategy, least-to-most prompting. It reduces a complex problem into a list of subproblems, and then sequentially solve these subproblems, whereby solving a given subproblem is facilitated by the answers to previously solved subproblems. Experiments on symbolic manipulation, compositional generalization and math reasoning show that least-to-most prompting can generalize to the examples that are harder than those seen in the prompt, and outperform chain-of-thought prompting by a large margin. A notable result is that the GPT-3 code-davinci-002 model with least-to-most-prompting solves the SCAN benchmark regardless of splits (such as length split) with an accuracy of 99.7% using 14 examples versus an accuracy of 16.2% by chain-of-thought prompting, and neural-symbolic models in the literature specialized for solving SCAN are trained with the full training set of more than 15,000 examples.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.18653/v1/2023.acl-long.147

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

Lei Wang, +6 more

- 01 Jan 2023

TL;DR: Plan-and-Solve prompting improves zero-shot chain-of-thought reasoning by large language models by addressing missing-step errors and improving the quality of generated reasoning steps.

...read moreread less

81

Journal Article•10.48550/arxiv.2307.14385

Leveraging Large Language Models for Mental Health Prediction via Online Text Data

Xuhai Xu, +6 more

- arXiv.org

TL;DR: This work presents the first comprehensive evaluation of multiple LLMs, including AlPaca, Alpaca-LoRA, and GPT-3.5, on various mental health prediction tasks via online text data and shows that instruction finetuning can significantly boost the performance of LLMs for all tasks simultaneously.

...read moreread less

74

•Posted Content•10.1145/3568813.3600142

Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses

Jaromir Savelka, +4 more

- 15 Jun 2023

- arXiv.org

TL;DR: In this paper , the authors report the performance of GPT-4, comparing it to the previous generations of generative AI models, on three Python courses with assessments ranging from simple multiple-choice questions (no code involved) to complex programming projects with code bases distributed into multiple files (599 exercises overall).

...read moreread less

72

Journal Article•10.18653/v1/2023.findings-emnlp.272

Multi-step Jailbreaking Privacy Attacks on ChatGPT

Haoran Li, +6 more

- 01 Jan 2023

TL;DR: Multi-step jailbreaking privacy attacks on ChatGPT reveal potential privacy threats from application-integrated LLMs.

...read moreread less

66

Journal Article•10.18653/v1/2023.acl-long.294

Reasoning with Language Model Prompting: A Survey

Shuofei Qiao, +8 more

- 01 Jan 2023

TL;DR: A survey on reasoning with language model prompting explores the latest research on reasoning abilities in language models and provides a comprehensive overview of the field.

...read moreread less

64

...

Expand

References

10.48550/arxiv.2003.05562

Learning Compositional Rules Via Neural Program Synthesis

Maxwell I. Nye, +3 more

TL;DR: Researchers propose a neuro-symbolic model that learns compositional rules from few examples, outperforming neural meta-learning techniques in three domains, including artificial instruction-learning and language translation, by inducing explicit rule systems.

...read moreread less

10.48550/arxiv.1503.01007

Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets

Armand Joulin, +1 more

TL;DR: This paper explores overcoming limitations of standard deep learning approaches by introducing a stack-augmented recurrent network that can learn to count and memorize sequences, enabling the prediction of algorithmically generated sequences beyond standard recurrent networks' capabilities.

...read moreread less

10.48550/arxiv.2106.04537

Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

Avi Schwarzschild, +6 more

TL;DR: Recurrent neural networks can learn to solve complex problems by extending reasoning strategies learned on simple problems, achieving algorithmic behavior through additional recurrences, demonstrated on prefix sum computation, mazes, and chess.

...read moreread less