A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios

Open AccessPosted Content

A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios

- 23 Oct 2020

8

TL;DR: In this paper, a survey of low-resource natural language processing methods is presented, including data augmentation, distant supervision, and transfer learning settings that reduce the need for target supervision.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.3390/IJERPH18010218

A Sentiment Analysis Approach to Predict an Individual's Awareness of the Precautionary Procedures to Prevent COVID-19 Outbreaks in Saudi Arabia.

Sumayh S. Aljameel, +7 more

- 30 Dec 2020

- International Journal of Environmental R...

TL;DR: A model that predicts an individual’s awareness of the precautionary procedures in five main regions in Saudi Arabia can support the medical sectors and decision-makers to decide the appropriate procedures for each region based on their attitudes towards the pandemic.

...read moreread less

91

•Journal Article•10.3390/APP11188319

A Survey on Recent Named Entity Recognition and Relationship Extraction Techniques on Clinical Texts

Priyankar Bose, +5 more

- 08 Sep 2021

- Applied Sciences

TL;DR: This comprehensive survey on clinical NER and RE encompass current challenges, state-of-the-art practices, and future directions in information extraction from clinical text.

...read moreread less

84

•Posted Content

Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey.

Danielle Saunders

- 14 Apr 2021

- arXiv: Computation and Language

TL;DR: The authors focus on domain adaptation for NMT, particularly the case where a system may need to translate sentences from multiple domains, and divide techniques into those relating to data selection, model architecture, parameter adaptation procedure, and inference procedure.

...read moreread less

46

•Posted Content

An Empirical Survey of Data Augmentation for Limited Data Learning in NLP

Jiaao Chen, +4 more

- 14 Jun 2021

- arXiv: Computation and Language

TL;DR: The authors provided an empirical survey of recent progress on data augmentation for NLP in the limited labeled data setting, summarizing the landscape of methods and carrying out experiments on 11 datasets covering topics/news classification, inference tasks, paraphrasing tasks, and single-sentence tasks.

...read moreread less

16

•Journal Article•10.1613/jair.1.13566

Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

29 Sep 2022

- Journal of Artificial Intelligence Resea...

TL;DR: The authors survey approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains, and highlight the benefits of domain adaptation and multidomain adaptation techniques to other lines of NMT research.

...read moreread less

7

References

•Proceedings Article•10.18653/V1/P17-2090

Data Augmentation for Low-Resource Neural Machine Translation

Marzieh Fadaee, +2 more

- 01 May 2017

- arXiv: Computation and Language

TL;DR: This article proposed a data augmentation approach that targets low-frequency words by generating new sentence pairs containing rare words in new, synthetically created contexts, which improves translation quality by up to 2.9 BLEU points over the baseline and up to 3.2BLEU over back-translation.

...read moreread less

181

Journal Article•10.1016/J.WPI.2020.101965

Patent classification by fine-tuning BERT language model

Jieh Sheng Lee, +1 more

- 01 Jun 2020

- World Patent Information

TL;DR: When applied to large datasets of over two million patents, this approach outperforms the state of the art by an approach using CNN with word embeddings and shows that patent claims alone are sufficient to achieve state-of-the-art results for classification task, in contrast to conventional wisdom.

...read moreread less

180

•Proceedings Article•10.18653/V1/N18-1089

Robust Multilingual Part-of-Speech Tagging via Adversarial Training

Michihiro Yasunaga, +2 more

- 01 Jun 2018

TL;DR: It is found that AT not only improves the overall tagging accuracy, but also prevents over-fitting well in low resource languages and boosts tagging accuracy for rare / unseen words.

...read moreread less

179

•Proceedings Article•10.18653/V1/E17-1088

Cross-Lingual Word Embeddings for Low-Resource Language Modeling

Oliver Adams, +4 more

- 01 Apr 2017

TL;DR: This work investigates the use of bilingual lexicons to improve language models when textual training data is limited to as few as a thousand sentences, and involves learning cross-lingual word embeddings as a preliminary step in training monolingual language models.

...read moreread less

177

Journal Article•10.1007/S10462-016-9482-X

Urdu language processing: a survey

Ali Daud, +2 more

- 01 Mar 2017

- Artificial Intelligence Review

TL;DR: The goal of this paper is to organize the ULP work in a way that it can provide a platform for ULP research activities in future and to describe in detail the recent increase in interest and progress made in Urdu language processing research.

...read moreread less

175