PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

doi:10.18653/v1/2022.acl-demo.9

Open AccessProceedings Article10.18653/v1/2022.acl-demo.9

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Stephen H. Bach, +25 more

- 02 Feb 2022

Vol. abs/2202.01279

340

TL;DR: PromptSource addresses the emergent challenges in this new setting with a templating language for defining data-linked prompts, an interface that lets users quickly iterate on prompt development by observing outputs of their prompts on many examples, and a community-driven set of guidelines for contributing new prompts to a common pool.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2211.05100

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao, +386 more

- 09 Nov 2022

- arXiv.org

TL;DR: BLOOM as discussed by the authors is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total).

...read moreread less

1.4K

Proceedings Article•10.48550/arXiv.2212.10560

Self-Instruct: Aligning Language Models with Self-Generated Instructions

Yizhong Wang, +6 more

- 20 Dec 2022

TL;DR: The authors propose Self-Instruct, a framework for improving the instruction-following capabilities of pre-trained language models by bootstrapping off their own generations, which generates instructions, input, and output samples from a language model, then filters invalid or similar ones before using them to finetune the original model.

...read moreread less

1.2K

Journal Article•10.48550/arXiv.2305.14314

QLoRA: Efficient Finetuning of Quantized LLMs

Tim Dettmers, +3 more

- 23 May 2023

- arXiv.org

TL;DR: QLoRA as discussed by the authors proposes to backpropagate gradients through a frozen, 4-bit quantized pretrained language model into Low Rank Adapters (LoRA) and achieves state-of-the-art performance.

...read moreread less

1.1K

Proceedings Article•10.48550/arXiv.2210.02414

GLM-130B: An Open Bilingual Pre-trained Model

Aohan Zeng, +17 more

- 05 Oct 2022

TL;DR: An attempt to open-source a 100B-scale model at least as good as GPT-3 and unveil how models of such a scale can be successfully pre-trained, including its design choices, training strategies for both efficiency and stability, and engineering efforts is introduced.

...read moreread less

707

...

Expand

References

•Proceedings Article

Language Models are Few-Shot Learners

Tom B. Brown, +30 more

- 28 May 2020

TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.

...read moreread less

25.2K

•Proceedings Article•10.18653/V1/D15-1075

A large annotated corpus for learning natural language inference

Samuel R. Bowman, +3 more

- 21 Aug 2015

TL;DR: The Stanford Natural Language Inference (SNLI) corpus as discussed by the authors is a large-scale collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning.

...read moreread less

5.2K

•Proceedings Article

brat: a Web-based Tool for NLP-Assisted Text Annotation

Pontus Stenetorp, +5 more

- 23 Apr 2012

TL;DR: The brat rapid annotation tool (BRAT) is introduced, an intuitive web-based tool for text annotation supported by Natural Language Processing (NLP) technology and an evaluation of annotation assisted by semantic class disambiguation on a multicategory entity mention annotation task, showing a 15% decrease in total annotation time.

...read moreread less

1.3K

•Proceedings Article•10.18653/V1/2021.NAACL-MAIN.185

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

Timo Schick, +1 more

- 01 Jun 2021

TL;DR: This work shows that performance similar to GPT-3 can be obtained with language models that are much “greener” in that their parameter count is several orders of magnitude smaller, and identifies key factors required for successful natural language understanding with small language models.

...read moreread less

1.1K

•Posted Content

Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference

Timo Schick, +1 more

- 21 Jan 2020

- arXiv: Computation and Language

TL;DR: This work introduces Pattern-Exploiting Training (PET), a semi-supervised training procedure that reformulates input examples as cloze-style phrases to help language models understand a given task.

...read moreread less

1K