Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification

Open AccessProceedings Article

Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification

- 01 Nov 2021

- pp 498-517

14

TL;DR: This article showed that a multitask learning approach can combine discourse datasets from similar and diverse domains to improve discourse classification and showed an improvement of 4.9% Micro F1 score over current state-of-the-art benchmarks on the NewsDiscourse dataset, one of the largest discourse datasets recently published.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1186/s40537-023-00727-2

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Laith Alzubaidi, +13 more

- 14 Apr 2023

- Journal of Big Data

TL;DR: In this article , the authors present a survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization.

...read moreread less

303

Journal Article•10.1109/tai.2022.3160658

On Supervised Class-Imbalanced Learning: An Updated Perspective and Some Key Challenges

01 Dec 2022

- IEEE transactions on artificial intellig...

TL;DR: In this article , the authors provide a comprehensive summary of the rich pool of research works attempting to combat the adversarial effects of class imbalance efficiently and highlight the need for techniques tailored for such a paradigm.

...read moreread less

32

•Posted Content

Sequential Sentence Classification in Research Papers using Cross-Domain Multi-Task Learning.

Arthur Brack, +3 more

- 11 Feb 2021

- arXiv: Computation and Language

TL;DR: This paper proposed a uniform deep learning architecture and multi-task learning to improve sequential sentence classification in scientific texts across domains by exploiting training data from multiple domains, which can enhance academic search engines to support researchers in finding and exploring research literature more effectively.

...read moreread less

12

Journal Article•10.18653/v1/2023.eacl-main.38

A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

Sophie Henning, +3 more

- 01 Jan 2023

TL;DR: A survey of methods for addressing class imbalance in deep-learning based natural language processing (NLP) tasks. Covers various types of imbalance, approaches based on sampling, data augmentation, loss functions, staged learning, and model design.

...read moreread less

9

•Proceedings Article•10.1145/3529372.3530922

Cross-domain multi-task learning for sequential sentence classification in research papers

20 Jun 2022

TL;DR: In this paper , a novel uniform deep learning architecture and multi-task learning for cross-domain sequential sentence classification in scientific texts are proposed. But the authors do not consider the issue of different text structure of full papers and abstracts.

...read moreread less

8

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

...read moreread less

51.8K

•Journal Article•10.1613/JAIR.953

SMOTE: synthetic minority over-sampling technique

Nitesh V. Chawla, +3 more

- 01 Jan 2002

- Journal of Artificial Intelligence Resea...

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

...read moreread less

27.7K

•Posted Content

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, +9 more

- 26 Jul 2019

- arXiv: Computation and Language

TL;DR: It is found that BERT was significantly undertrained, and can match or exceed the performance of every model published after it, and the best model achieves state-of-the-art results on GLUE, RACE and SQuAD.

...read moreread less

26.2K