Meta Learning for Natural Language Processing: A Survey

doi:10.48550/arXiv.2205.01500

Proceedings Article10.48550/arXiv.2205.01500

Meta Learning for Natural Language Processing: A Survey

Hung-yi Lee, +2 more

- 03 May 2022

pp 666-684

28

TL;DR: The goal with this survey paper is to offer researchers pointers to relevant meta-learning works in NLP and attract more attention from the NLP community to drive future innovation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2304.13712

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

Jingfeng Yang, +7 more

- 26 Apr 2023

- arXiv.org

TL;DR: Mooler et al. as mentioned in this paper presented a comprehensive and practical guide for practitioners and end-users working with large language models (LLMs) in their downstream natural language processing (NLP) tasks.

...read moreread less

302

•Journal Article•10.1145/3593042

A Survey of Adversarial Defences and Robustness in NLP

Shreyansh Goyal, +3 more

- 12 Mar 2022

- ACM Computing Surveys

TL;DR: In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack as mentioned in this paper .

...read moreread less

99

AugGPT: Leveraging ChatGPT for Text Data Augmentation

Haixing Dai, +16 more

- 25 Feb 2023

TL;DR: This article proposed a text data augmentation approach based on ChatGPT (named AugGPT), which rephrases each sentence in the training samples into multiple conceptually similar but semantically different samples.

...read moreread less

77

Journal Article•10.48550/arxiv.2310.08184

Learn From Model Beyond Fine-Tuning: A Survey

Hongling Zheng, +6 more

- 12 Oct 2023

- arXiv.org

TL;DR: A comprehensive review of the current methods based on FM from the perspective of LFM is given to help readers better understand the current research status and ideas, and highlights several critical areas for future exploration and addressing open issues that require further attention.

...read moreread less

27

Journal Article•10.1109/comst.2023.3330910

Federated Learning and Meta Learning: Approaches, Applications, and Directions

Xiaonan Liu, +3 more

- 24 Oct 2022

TL;DR: This tutorial presents a comprehensive review of FL, meta learning, and federated meta learning (FedMeta), and their applications over wireless networks and analyzes the relationships among these learning algorithms and examine their advantages and disadvantages in real-world applications.

...read moreread less

20

...

Expand

References

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

•Posted Content

Distilling the Knowledge in a Neural Network

Geoffrey E. Hinton, +2 more

- 09 Mar 2015

- arXiv: Machine Learning

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.

...read moreread less

21.2K

•Posted Content

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Colin Raffel, +8 more

- 23 Oct 2019

- arXiv: Learning

TL;DR: This systematic study compares pre-training objectives, architectures, unlabeled datasets, transfer approaches, and other factors on dozens of language understanding tasks and achieves state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more.

...read moreread less

12.9K

•Proceedings Article

Model-agnostic meta-learning for fast adaptation of deep networks

Chelsea Finn, +2 more

- 06 Aug 2017

TL;DR: An algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning is proposed.

...read moreread less

11.3K

•Book

Foundations of Statistical Natural Language Processing

Christopher D. Manning, +1 more

- 28 May 1999

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.

...read moreread less

10.9K

...

Expand