Topic

Constructed language

About: Constructed language is a research topic. Over the lifetime, 2699 publications have been published within this topic receiving 68403 citations. The topic is also known as: conlang & artificial language.

...read moreread less

Topic Tools

Find unexplored research gaps

Generate a literature review

Explore related concepts

Papers published on a yearly basis

1 / 2

Papers

Proceedings Article•

Language Models are Few-Shot Learners

[...]

Tom B. Brown¹, Benjamin Mann, Nick Ryder², Melanie Subbiah, Jared Kaplan³, Prafulla Dhariwal¹, Arvind Neelakantan⁴, Pranav Shyam, Girish Sastry¹, Amanda Askell¹, Sandhini Agarwal¹, Ariel Herbert-Voss¹, Gretchen Krueger¹, Thomas Henighan¹, Rewon Child¹, Aditya Ramesh¹, Daniel M. Ziegler⁵, Jeffrey Wu¹, Clemens Winter, Christopher Hesse¹, Mark Chen¹, Eric Sigler, Mateusz Litwin, Scott Gray¹, Benjamin Chess¹, Jack Clark¹, Christopher Berner, Samuel McCandlish¹, Alec Radford¹, Ilya Sutskever¹, Dario Amodei¹ - Show less +27 more•Institutions (5)

OpenAI¹, University of California, Berkeley², Johns Hopkins University³, Google⁴, Massachusetts Institute of Technology⁵

28 May 2020

TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.

...read moreread less

Abstract: Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general.

...read moreread less

25,208 citations

Journal Article•10.1080/10618600.1996.10474713•

R: A Language for Data Analysis and Graphics

[...]

Ross Ihaka¹, Robert Gentleman¹•Institutions (1)

University of Auckland¹

01 Sep 1996-Journal of Computational and Graphical Statistics

TL;DR: In this article, the authors discuss their experience designing and implementing a statistical computing language, which combines what they felt were useful features from two existing computer languages, and they feel that the new language provides advantages in the areas of portability, computational efficiency, memory management, and scope.

...read moreread less

Abstract: In this article we discuss our experience designing and implementing a statistical computing language. In developing this new language, we sought to combine what we felt were useful features from two existing computer languages. We feel that the new language provides advantages in the areas of portability, computational efficiency, memory management, and scoping.

...read moreread less

10,904 citations

Posted Content•

Language Models are Few-Shot Learners

[...]

OpenAI¹, University of California, Berkeley², Johns Hopkins University³, Google⁴, Massachusetts Institute of Technology⁵

28 May 2020-arXiv: Computation and Language

TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.

...read moreread less

1,956 citations

Journal Article•10.4324/9781315836027-6•

From communicative competence to communicative language pedagogy

[...]

Michael Canale

06 Jun 2014-Language & Communication

TL;DR: The authors argue that the individual who wishes to learn a new language must, in addition to acquiring a new vocabulary and a new set of phonological and syntactic rules, learn what Hymes calls the rules of speaking: the patterns of sociolinguistic behaviour of the target language.

...read moreread less

1,882 citations

Journal Article•10.1006/JMLA.1996.0032•

Word segmentation : the role of distributional cues

[...]

Jenny R. Saffran¹, Elissa L. Newport¹, Richard N. Aslin¹•Institutions (1)

University of Rochester¹

01 Aug 1996-Journal of Memory and Language

TL;DR: This article showed that distributional cues may play an important role in the initial word segmentation of language learners, and that the addition of certain prosodic cues served to enhance performance of infants.

...read moreread less

1,406 citations

...

Expand

Performance Metrics

2,711

Papers

15,132

Citations

No. of papers in the topic in previous years
Year	Papers
2024	1
2023	2
2022	4
2021	126
2020	157
2019	136

Constructed language

Topic Tools

Papers published on a yearly basis

Papers

Language Models are Few-Shot Learners

R: A Language for Data Analysis and Graphics

Language Models are Few-Shot Learners

From communicative competence to communicative language pedagogy

Word segmentation : the role of distributional cues

Related Topics (5)

Performance Metrics