Open AccessPosted Content
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh,Albert Webson,Colin Raffel,Stephen H. Bach,Lintang Sutawika,Zaid Alyafeai,Antoine Chaffin,Arnaud Stiegler,Teven Le Scao,Arun Raja,Manan Dey,M Saiful Bari,Canwen Xu,Urmish Thakker,Shanya Sharma,Eliza Szczechla,Taewoon Kim,Gunjan Chhablani,Nihal V. Nayak,Debajyoti Datta,Jonathan Chang,Mike Tian-Jian Jiang,Han Wang,Matteo Manica,Sheng Shen,Zheng Xin Yong,Harshit Pandey,Rachel Bawden,Tom Wang,Trishala Neeraj,Jos Rozen,Abheesht Sharma,Andrea Santilli,Thibault Févry,Jason A. Fries,Ryan Teehan,Stella Biderman,Leo Gao,Tali Bers,Thomas Wolf,Alexander M. Rush +40 more
TL;DR: This article developed a system for easily mapping general natural language tasks into a human-readable prompted form, and fine-tuned a pretrained encoder-decoder model on this multitask mixture covering a wide variety of tasks.
read more
Abstract: Large language models have recently been shown to attain reasonable zero-shot
generalization on a diverse set of tasks. It has been hypothesized that this is
a consequence of implicit multitask learning in language model training. Can
zero-shot generalization instead be directly induced by explicit multitask
learning? To test this question at scale, we develop a system for easily
mapping general natural language tasks into a human-readable prompted form. We
convert a large set of supervised datasets, each with multiple prompts using
varying natural language. These prompted datasets allow for benchmarking the
ability of a model to perform completely unseen tasks specified in natural
language. We fine-tune a pretrained encoder-decoder model on this multitask
mixture covering a wide variety of tasks. The model attains strong zero-shot
performance on several standard datasets, often outperforming models 16x its
size. Further, our approach attains strong performance on a subset of tasks
from the BIG-Bench benchmark, outperforming models 6x its size. All prompts and
trained models are available at github.com/bigscience-workshop/promptsource/.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts
J. D. Zamfirescu-Pereira,Richmond Y. Wong,Bjoern Hartmann,Qiang Yang +3 more
- 19 Apr 2023
TL;DR: The authors explored whether non-AI-experts can successfully engage in "end-user prompt engineering" using a design probe, a prototype LLM-based chatbot design tool supporting development and systematic evaluation of prompting strategies.
Academic integrity considerations of AI Large Language Models in the post-pandemic era: ChatGPT and beyond
TL;DR: In this article , the authors examine the academic integrity concerns of students' use of Artificial Intelligence (AI) tools using Large Language Models (LLMs) such as ChatGPT in formal assessments, and conclude that it is not the student use of any AI tools that defines whether plagiarism or a breach of academic integrity has occurred, but whether any use is made clear by the student.
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Sewon Min,Xinxi Lyu,Ari Holtzman,Mikel Artetxe,Michael Lewis,Hannaneh Hajishirzi,Luke Zettlemoyer +6 more
- 01 Jan 2022
TL;DR: In-context learning works primarily through the provision of examples of the label space, input text distribution, and sequence format rather than the need for ground truth demonstrations.
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Stephen H. Bach,Victor Sanh,Zheng-Xin Yong,Albert Webson,Colin Raffel,Nihal V. Nayak,Abheesht Sharma,Taewoon Kim,M Saiful Bari,Thibault Févry,Zaid Alyafeai,Manan Dey,Andrea Santilli,Zhiqing Sun,Srulik Ben-David,Canwen Xu,Gunjan Chhablani,Han Wang,Jason A. Fries,Maged S. Al-shaibani,Shanya Sharma,Urmish Thakker,Khalid Almubarak,Xiangru Tang,Mike Tian-Jian Jiang,Alexander M. Rush +25 more
- 02 Feb 2022
TL;DR: PromptSource addresses the emergent challenges in this new setting with a templating language for defining data-linked prompts, an interface that lets users quickly iterate on prompt development by observing outputs of their prompts on many examples, and a community-driven set of guidelines for contributing new prompts to a common pool.
PTR: Prompt Tuning with Rules for Text Classification
TL;DR: The authors proposed to encode the prior knowledge of a classification task into rules, then design sub-prompts according to the rules, and finally combine the sub-problems to handle the task.
317
References
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
•Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown,Benjamin Mann,Nick Ryder,Melanie Subbiah,Jared Kaplan,Prafulla Dhariwal,Arvind Neelakantan,Pranav Shyam,Girish Sastry,Amanda Askell,Sandhini Agarwal,Ariel Herbert-Voss,Gretchen Krueger,Thomas Henighan,Rewon Child,Aditya Ramesh,Daniel M. Ziegler,Jeffrey Wu,Clemens Winter,Christopher Hesse,Mark Chen,Eric Sigler,Mateusz Litwin,Scott Gray,Benjamin Chess,Jack Clark,Christopher Berner,Samuel McCandlish,Alec Radford,Ilya Sutskever,Dario Amodei +30 more
- 28 May 2020
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin,Ming-Wei Chang,Kenton Lee,Kristina Toutanova +3 more
- 11 Oct 2018
TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
24.6K
Multitask Learning
Rich Caruana
- 01 Jul 1997
TL;DR: Multi-task Learning (MTL) as mentioned in this paper is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias.
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang,Amanpreet Singh,Julian Michael,Felix Hill,Omer Levy,Samuel R. Bowman +5 more
- 01 Nov 2018
TL;DR: The gluebenchmark as mentioned in this paper is a benchmark of nine diverse NLU tasks, an auxiliary dataset for probing models for understanding of specific linguistic phenomena, and an online platform for evaluating and comparing models.