Journal Article10.1016/J.ESWA.2021.115067
Sequential targeting: A continual learning approach for data imbalance in text classification
16
TL;DR: A novel training method, Sequential Targeting (ST), is proposed, independent of the effectiveness of the representation method, which enforces an incremental learning setting by splitting the data into mutually exclusive subsets and training the learner adaptively.
read more
Abstract: Text classification has numerous use cases including sentiment analysis, spam detection, document classification, hate speech detection, etc. In realistic settings, classification on text data confronts imbalanced data conditions where classes of interest usually compose a minor fraction. Deep neural networks used for text classification, such as recurrent neural networks and transformer networks, suffer from a lack of efficient methods addressing imbalanced data. Traditional data-level methods attempting to mitigate distributional skew include oversampling and undersampling. The oversampling methods destruct the quality of original language representation of the sparse data coming from minority classes whereas the undersampling methods fail to fully utilize the rich context of majority classes. We address such issues in data-driven approaches by enforcing continual learning on imbalanced data by partitioning the training data distribution into mutually exclusive subsets and performing continual learning, treating the individual subsets as distinct tasks. We demonstrate the effectiveness of our method through experiments on the IMDB dataset and constructed datasets from real-world data. The experimental results show that the proposed method improves by 56.38 %p on the IMDB dataset and by 16.89 %p and 34.76 %p on the constructed datasets compared to the baseline method in terms of the F1-score metric.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Learning without Forgetting
Zhizhong Li,Derek Hoiem +1 more
TL;DR: This work proposes the Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities, and performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques.
2.6K
A survey on hate speech detection and sentiment analysis using machine learning and deep learning models
Malliga Subramanian,Veerappampalayam Easwaramoorthy Sathiskumar,G. Deepalakshmi,JaeHyuk Cho,G. Manikandan +4 more
TL;DR: This survey article provides a comprehensive overview of recent advancements in hate speech detection and sentiment analysis using machine learning and deep learning models, highlighting methodologies, datasets, challenges, and areas for future research to promote a more inclusive online environment.
27
Text FCG: Fusing Contextual Information via Graph Learning for text classification
TL;DR: Wang et al. as mentioned in this paper proposed TextFCGNN (Text Contextual Information via Graph Neural Networks), which constructs a single graph for all words in each text and labels the edges by fusing its various contextual relations.
24
Evolving Long Short-Term Memory Network-Based Text Classification
Arjun Singh,Shashi K. Dargar,Amita Gupta,Ashish Kumar,Atul Srivastava,Mitali Srivastava,Pradeep Kumar Tiwari,Mohammad Aman Ullah +7 more
TL;DR: An evolving LSTM (ELSTM) network is proposed using a multiobjective genetic algorithm (MOGA) to optimize the architecture and weights of L STM.
Effective Text Classification using BERT, MTM LSTM, and DT
Saman Jamshidi,Mahin Mohammadi,Saeed Bagheri,Hamid Esmaeili Najafabadi,Alireza Rezvanian,Mehdi Gheisari,Mustafa Ghaderzadeh,Amir Shahab Shahabi,Zongda Wu +8 more
- 01 Apr 2024
10
References
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Glove: Global Vectors for Word Representation
Jeffrey Pennington,Richard Socher,Christopher D. Manning +2 more
- 01 Oct 2014
TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.
SMOTE: synthetic minority over-sampling technique
TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.
•Proceedings Article
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov,Kai Chen,Greg S. Corrado,Jeffrey Dean +3 more
- 16 Jan 2013
TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.
27.5K
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin,Ming-Wei Chang,Kenton Lee,Kristina Toutanova +3 more
- 11 Oct 2018
TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
24.6K