Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation

doi:10.1007/978-3-030-32248-9_45

Open AccessBook Chapter10.1007/978-3-030-32248-9_45

Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation

Zeju Li, +2 more

- 13 Oct 2019

- pp 402-410

127

TL;DR: In this article, the distribution of logit activations when processing unseen test samples of an underrepresented class tends to shift towards and even across the decision boundary, while the over-represented class seems unaffected.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Fig. 3. The illustration of the proposed asymmetric modifications for the existing four techniques. We make the logit activations of foreground class far away from the decision boundary by setting bias for the foreground class in different ways.

Table 1. Evaluation of the tumor core segmentation with different amounts of training data and different techniques to counter overfitting.

Fig. 4. Activations of the classification layer when processing (top) foreground and (bottom) background samples, using 5% training data. Asymmetric modifications lead to better separation of the logits of unseen foreground samples.

Citations

Journal Article•10.1016/J.MEDIA.2020.101911

Discriminative ensemble learning for few-shot chest x-ray diagnosis.

Angshuman Paul, +3 more

- 01 Feb 2021

- Medical Image Analysis

TL;DR: The proposed method for few-shot diagnosis of diseases and conditions from chest x-rays using discriminative ensemble learning is modular and easily adaptable to new tasks requiring the training of only the saliency-based classifier.

...read moreread less

52

Journal Article•10.1016/j.asoc.2022.109631

Multimodal brain tumor detection using multimodal deep transfer learning

Parvin Razzaghi, +3 more

- 01 Sep 2022

- Applied Soft Computing

TL;DR: In this paper , the authors proposed a new multimodal deep transfer learning for MRI brain image segmentation, where the knowledge transfer between and within modalities is considered to tackle the challenge of having different distributions between the training and test sets.

...read moreread less

34

Journal Article•10.1016/j.heliyon.2023.e20382

A review of fake news detection approaches: A critical analysis of relevant studies and highlighting key challenges associated with the dataset, feature representation, and data fusion

Suhaib Kh. Hamed, +2 more

- 01 Sep 2023

- Heliyon

TL;DR: The investigation of fake news detection studies relied on the following aspects and their impact on detection accuracy, namely datasets, overfitting/underfitting, image-based features, feature vector representation, machine learning models, and data fusion.

...read moreread less

32

•Journal Article•10.3390/w15091750

Revolutionizing Groundwater Management with Hybrid AI Models: A Practical Review

M Zaresefat, +1 more

- 02 May 2023

- Water

TL;DR: In this paper , the state-of-the-art hybrid machine learning (ML) models used for groundwater management are reviewed and a review of the most cited hybrid ML models employed in this domain is presented.

...read moreread less

30

•Journal Article•10.1016/j.media.2022.102597

Enhancing MR image segmentation with realistic adversarial data augmentation

01 Nov 2022

- Medical Image Analysis

TL;DR: In this article , the authors proposed AdvChain, a generic adversarial data augmentation framework to improve both the diversity and effectiveness of training data for medical image segmentation tasks by generating randomly chained photo-metric and geometric transformations to expand training data.

...read moreread less

30

...

Expand

References

•Proceedings Article•10.1109/ICCV.2017.324

Focal Loss for Dense Object Detection

Tsung-Yi Lin, +4 more

- 07 Aug 2017

TL;DR: This paper proposes to address the extreme foreground-background class imbalance encountered during training of dense detectors by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples, and develops a novel Focal Loss, which focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training.

...read moreread less

21.3K

•Posted Content

Explaining and Harnessing Adversarial Examples

Ian Goodfellow, +2 more

- 20 Dec 2014

- arXiv: Machine Learning

TL;DR: The authors argue that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature, which is supported by new quantitative results while giving the first explanation of the most intriguing fact about adversarial examples: their generalization across architectures and training sets.

...read moreread less

15.9K

•Proceedings Article

mixup: Beyond Empirical Risk Minimization

Hongyi Zhang, +3 more

- 25 Oct 2017

TL;DR: This work proposes mixup, a simple learning principle that trains a neural network on convex combinations of pairs of examples and their labels, which improves the generalization of state-of-the-art neural network architectures.

...read moreread less

6.8K

•Posted Content

mixup: Beyond Empirical Risk Minimization

Hongyi Zhang, +3 more

- 25 Oct 2017

- arXiv: Learning

TL;DR: Mixup as discussed by the authors trains a neural network on convex combinations of pairs of examples and their labels, and regularizes the neural network to favor simple linear behavior in between training examples, which improves the generalization of state-of-the-art neural network architectures.

...read moreread less

4.2K

•Journal Article•10.1016/J.MEDIA.2016.10.004

Efficient Multi-Scale 3D CNN with Fully Connected CRF for Accurate Brain Lesion Segmentation

Konstantinos Kamnitsas, +7 more

- 01 Feb 2017

- Medical Image Analysis

TL;DR: An efficient and effective dense training scheme which joins the processing of adjacent image patches into one pass through the network while automatically adapting to the inherent class imbalance present in the data, and improves on the state-of-the‐art for all three applications.

...read moreread less

3.6K

Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation

Chat with Paper

AI Agents for this Paper

Figures

Citations

Discriminative ensemble learning for few-shot chest x-ray diagnosis.

Multimodal brain tumor detection using multimodal deep transfer learning

A review of fake news detection approaches: A critical analysis of relevant studies and highlighting key challenges associated with the dataset, feature representation, and data fusion

Revolutionizing Groundwater Management with Hybrid AI Models: A Practical Review

Enhancing MR image segmentation with realistic adversarial data augmentation

References

Focal Loss for Dense Object Detection

Explaining and Harnessing Adversarial Examples

mixup: Beyond Empirical Risk Minimization

mixup: Beyond Empirical Risk Minimization

Efficient Multi-Scale 3D CNN with Fully Connected CRF for Accurate Brain Lesion Segmentation

Related Papers (5)

A survey on deep learning in medical image analysis

U-Net: Convolutional Networks for Biomedical Image Segmentation

Deep Residual Learning for Image Recognition

Focal Loss for Dense Object Detection

Deep learning