Learning to Compose Domain-Specific Transformations for Data Augmentation

Open AccessPosted Content

Learning to Compose Domain-Specific Transformations for Data Augmentation

- 06 Sep 2017

311

TL;DR: In this article, a generative adversarial approach is proposed to learn a sequence model over user-specified transformation functions using GANs, which can make use of arbitrary, non-deterministic transformation functions, and is robust to misspecified user input.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Figure 1: Three examples of transformation functions (TFs) in different domains: Two example sequences of incremental image TFs applied to CIFAR-10 images (left); a conditional word-swap TF using an externally trained language model and specifically targeting nouns (NN) between entity mentions (E1,E2) for a relation extraction task (middle); and an unsupervised segementation-based translation TF applied to mass-containing mammography images (right).

Figure 7: Accuracy scores on random 10% subsamples of test data (dotted lines) and on versions augmented with a single transformation (vertical bars) with parameters drawn uniformly at random.

Figure 2: A high-level diagram of our method. Users input a set of transformation functions h1, ..., hK and unlabeled data. A generative adversarial approach is then used to train a null class discriminator, D∅, and a generator, G, which produces TF sequences hτ1 , ..., hτL . Finally, the trained generator is used to perform data augmentation for an end discriminative model Df .

Table 4: A simple study of the effect of adding a transformation regularization (TR) term to the objective function, evaluated on a labeled validation set. We see that adding the term improves performance for both heuristic (random) TF sequences and for TF sequences generated by the trained LSTM model, and that there is a larger positive effect for the latter.

Table 3: Current state-of-the-art image classification models as ranked by reported performance on the CIFAR-10 and CIFAR-100 tasks, and their error with (Err. w/ DA) and without (Err. w/o DA) data augmentation. We include both scores and particular data augmentation techniques when reported, although the latter is rarely reported with great precision. 13

Table 1: Test set performance of end models trained on subsamples of the labeled training data (% ), not including validation splits, using various data augmentation approaches. None indicates performance with no augmentation. All tasks are measured in accuracy, except ACE which is measured by F1 score.

Citations

Proceedings Article•10.1109/CVPR.2019.00020

AutoAugment: Learning Augmentation Strategies From Data

Ekin D. Cubuk, +4 more

- 15 Jun 2019

TL;DR: This paper describes a simple procedure called AutoAugment to automatically search for improved data augmentation policies, which achieves state-of-the-art accuracy on CIFAR-10, CIFar-100, SVHN, and ImageNet (without additional data).

...read moreread less

3.2K

•Proceedings Article

RandAugment: Practical Automated Data Augmentation with a Reduced Search Space

Ekin D. Cubuk, +3 more

- 01 Jan 2020

TL;DR: This work proposes a simplified search space that vastly reduces the computational expense of automated augmentation, and permits the removal of a separate proxy task.

...read moreread less

2.5K

Proceedings Article•10.1109/CVPRW50498.2020.00359

Randaugment: Practical automated data augmentation with a reduced search space

Ekin D. Cubuk, +3 more

- 14 Jun 2020

TL;DR: EfficientNet-B7 as mentioned in this paper proposes a simplified search space that greatly reduces the computational expense of automated augmentation, and permits the removal of a separate proxy task, and achieves state-of-the-art results in image classification and object detection.

...read moreread less

2.2K

•Journal Article•10.3390/INFO11020125

Albumentations: fast and flexible image augmentations

Alexander Buslaev, +5 more

- 24 Feb 2020

- Information-an International Interdiscip...

TL;DR: Albumentations as mentioned in this paper is a fast and flexible open source library for image augmentation with many various image transform operations available that is also an easy-to-use wrapper around other augmentation libraries.

...read moreread less

1.8K

•Posted Content

AutoAugment: Learning Augmentation Policies from Data

Ekin D. Cubuk, +4 more

- 24 May 2018

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper describes a simple procedure called AutoAugment to automatically search for improved data augmentation policies, which achieves state-of-the-art accuracy on CIFAR-10, CIFar-100, SVHN, and ImageNet (without additional data).

...read moreread less

1.6K

...

Expand

References

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

Gradient-based learning applied to document recognition

Yann LeCun, +7 more

- 01 Jan 2001

TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task, and Convolutional neural networks are shown to outperform all other techniques.

...read moreread less

32.7K

•Posted Content

Explaining and Harnessing Adversarial Examples

Ian Goodfellow, +2 more

- 20 Dec 2014

- arXiv: Machine Learning

TL;DR: The authors argue that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature, which is supported by new quantitative results while giving the first explanation of the most intriguing fact about adversarial examples: their generalization across architectures and training sets.

...read moreread less

15.9K

•Posted Content

Improved Techniques for Training GANs

Tim Salimans, +5 more

- 10 Jun 2016

- arXiv: Learning

TL;DR: In this article, the authors present a variety of new architectural features and training procedures that apply to the generative adversarial networks (GANs) framework and achieve state-of-the-art results in semi-supervised classification on MNIST, CIFAR-10 and SVHN.

...read moreread less

7.4K

•Posted Content

Enriching Word Vectors with Subword Information

Piotr Bojanowski, +3 more

- 15 Jul 2016

- arXiv: Computation and Language

TL;DR: A new approach based on the skipgram model, where each word is represented as a bag of character n-grams, with words being represented as the sum of these representations, which achieves state-of-the-art performance on word similarity and analogy tasks.

...read moreread less

6.1K

...

Expand

Learning to Compose Domain-Specific Transformations for Data Augmentation

Chat with Paper

AI Agents for this Paper

Figures

Citations

AutoAugment: Learning Augmentation Strategies From Data

RandAugment: Practical Automated Data Augmentation with a Reduced Search Space

Randaugment: Practical automated data augmentation with a reduced search space

Albumentations: fast and flexible image augmentations

AutoAugment: Learning Augmentation Policies from Data

References

Deep Residual Learning for Image Recognition

Gradient-based learning applied to document recognition

Explaining and Harnessing Adversarial Examples

Improved Techniques for Training GANs

Enriching Word Vectors with Subword Information

Related Papers (5)

Deep Residual Learning for Image Recognition

Generative Adversarial Nets

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks