Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
21 May 2022
TL;DR: This article proposed a least-to-most prompting strategy, which reduces a complex problem into a list of subproblems, and then sequentially solves these sub-problems by solving a given subproblem by the answers to previously solved subproproblems.
read more
Abstract: Although chain-of-thought prompting has shown impressive results on many natural language reasoning tasks, it often performs poorly on tasks which need to solve problems harder than the demonstration examples. To tackle such easy-to-hard generalization issues, we propose a novel prompting strategy, least-to-most prompting. It reduces a complex problem into a list of subproblems, and then sequentially solve these subproblems, whereby solving a given subproblem is facilitated by the answers to previously solved subproblems. Experiments on symbolic manipulation, compositional generalization and math reasoning show that least-to-most prompting can generalize to the examples that are harder than those seen in the prompt, and outperform chain-of-thought prompting by a large margin. A notable result is that the GPT-3 code-davinci-002 model with least-to-most-prompting solves the SCAN benchmark regardless of splits (such as length split) with an accuracy of 99.7% using 14 examples versus an accuracy of 16.2% by chain-of-thought prompting, and neural-symbolic models in the literature specialized for solving SCAN are trained with the full training set of more than 15,000 examples.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Lei Wang,Wanyu Xu,Yihuai Lan,Zhiqiang Hu,Yunshi Lan,Roy Ka-Wei Lee,Ee-Peng Lim +6 more
- 01 Jan 2023
TL;DR: Plan-and-Solve prompting improves zero-shot chain-of-thought reasoning by large language models by addressing missing-step errors and improving the quality of generated reasoning steps.
Leveraging Large Language Models for Mental Health Prediction via Online Text Data
Xuhai Xu,Bingsheng Yao,Yuanzhe Dong,Hong Yu,James A. Hendler,Anind K. Dey,Dakuo Wang +6 more
TL;DR: This work presents the first comprehensive evaluation of multiple LLMs, including AlPaca, Alpaca-LoRA, and GPT-3.5, on various mental health prediction tasks via online text data and shows that instruction finetuning can significantly boost the performance of LLMs for all tasks simultaneously.
74
Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses
TL;DR: In this paper , the authors report the performance of GPT-4, comparing it to the previous generations of generative AI models, on three Python courses with assessments ranging from simple multiple-choice questions (no code involved) to complex programming projects with code bases distributed into multiple files (599 exercises overall).
Multi-step Jailbreaking Privacy Attacks on ChatGPT
Haoran Li,Dadi Guo,Fan Wang,Mingshi Xu,Jie Huang,Fanbo Meng,Yangqiu Song +6 more
- 01 Jan 2023
TL;DR: Multi-step jailbreaking privacy attacks on ChatGPT reveal potential privacy threats from application-integrated LLMs.
Reasoning with Language Model Prompting: A Survey
Shuofei Qiao,Yixin Ou,Ningyu Zhang,Xiang Chen,Yunzhi Yao,Shumin Deng,Chuanqi Tan,Fei Huang,Huajun Chen +8 more
- 01 Jan 2023
TL;DR: A survey on reasoning with language model prompting explores the latest research on reasoning abilities in language models and provides a comprehensive overview of the field.
References
Learning Compositional Rules Via Neural Program Synthesis
Maxwell I. Nye,Armando Solar-Lezama,Joshua B. Tenenbaum,Brenden M. Lake +3 more
TL;DR: Researchers propose a neuro-symbolic model that learns compositional rules from few examples, outperforming neural meta-learning techniques in three domains, including artificial instruction-learning and language translation, by inducing explicit rule systems.
Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets
Armand Joulin,Tomas Mikolov +1 more
TL;DR: This paper explores overcoming limitations of standard deep learning approaches by introducing a stack-augmented recurrent network that can learn to count and memorize sequences, enabling the prediction of algorithmically generated sequences beyond standard recurrent networks' capabilities.
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks
Avi Schwarzschild,Eitan Borgnia,Arjun Gupta,Furong Huang,Uzi Vishkin,Micah Goldblum,Tom Goldstein +6 more
TL;DR: Recurrent neural networks can learn to solve complex problems by extending reasoning strategies learned on simple problems, achieving algorithmic behavior through additional recurrences, demonstrated on prefix sum computation, mazes, and chess.