Unshuffling Data for Improved Generalization.

Open AccessPosted Content

Unshuffling Data for Improved Generalization.

- 27 Feb 2020

- arXiv: Computer Vision and Pattern Recog...

83

TL;DR: This work describes a training procedure to capture the patterns that are stable across environments while discarding spurious ones, and demonstrates multiple use cases with the task of visual question answering, which is notorious for dataset biases.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

In Search of Lost Domain Generalization

Ishaan Gulrajani, +1 more

- 02 Jul 2020

- arXiv: Learning

TL;DR: This paper implements DomainBed, a testbed for domain generalization including seven multi-domain datasets, nine baseline algorithms, and three model selection criteria, and finds that, when carefully implemented, empirical risk minimization shows state-of-the-art performance across all datasets.

...read moreread less

775

•Proceedings Article•10.1109/CVPR46437.2021.01251

Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu, +5 more

- 01 Jun 2021

TL;DR: The authors proposed a counterfactual inference framework to mitigate language bias in VQA models, which enables them to capture the language bias as the direct causal effect of questions on answers and reduce language bias by subtracting the direct language effect from the total causal effect.

...read moreread less

329

•Posted Content

Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu, +5 more

- 08 Jun 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A novel counterfactual inference framework is proposed, which enables the language bias to be captured as the direct causal effect of questions on answers and reduced by subtracting the direct language effect from the total causal effect.

...read moreread less

301

Proceedings Article•10.48550/arXiv.2204.02937

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations

Polina Kirichenko, +2 more

- 06 Apr 2022

TL;DR: It is demonstrated that simple last layer retraining on large ImageNet-trained models can match or outperform state-of-the-art approaches on spurious correlation benchmarks, but with profoundly lower complexity and computational expenses.

...read moreread less

188

Proceedings Article•10.1109/CVPR42600.2020.01006

Counterfactual Vision and Language Learning

Ehsan Abbasnejad, +4 more

- 14 Jun 2020

TL;DR: This work proposes a method that addresses the problem of visual question answering by introducing counterfactuals in the training, and shows that simulating plausible alternative training data through this process results in better generalization.

...read moreread less

159

...

Expand

References

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

Proceedings Article•10.3115/V1/D14-1162

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

41.6K

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

30.4K

•Journal Article•10.1023/A:1018054314350

Bagging predictors

Leo Breiman

- 01 Aug 1996

TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

...read moreread less

16.6K