NAS-Bench-101: Towards Reproducible Neural Architecture Search

Open Access • Proceedings Article

- 24 May 2019

- pp 7105-7114

549

TL;DR: This work introduces NAS-Bench-101, the first public architecture dataset for NAS research, which allows researchers to evaluate the quality of a diverse range of models in milliseconds by querying the pre-computed dataset.
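The idea of querying a pre-computed dataset instead of training a model can be illustrated with a minimal sketch. Everything below is hypothetical and is not the NAS-Bench-101 API: the `TabularBenchmark` class, the `arch_key` helper, the operation labels, and the metric values are placeholders chosen for illustration. The sketch only assumes that an architecture can be described by an adjacency matrix plus a list of operation labels.

```python
import hashlib


def arch_key(matrix, ops):
    """Build a stable lookup key from an adjacency matrix and op labels.

    Simplification: two isomorphic graphs written with different node
    orderings get different keys here.
    """
    flat = "".join(str(int(bit)) for row in matrix for bit in row)
    return hashlib.sha256(f"{flat}|{','.join(ops)}".encode("utf-8")).hexdigest()


class TabularBenchmark:
    """Hypothetical in-memory table of pre-computed training results."""

    def __init__(self):
        self._table = {}

    def add(self, matrix, ops, metrics):
        # Store a copy so later changes to `metrics` do not alter the table.
        self._table[arch_key(matrix, ops)] = dict(metrics)

    def query(self, matrix, ops):
        # A dictionary lookup replaces a full training run.
        # Raises KeyError if the architecture is not in the table.
        return self._table[arch_key(matrix, ops)]


bench = TabularBenchmark()
matrix = [[0, 1, 1],
          [0, 0, 1],
          [0, 0, 0]]
ops = ["input", "conv3x3", "output"]  # illustrative labels only

# Placeholder numbers, not values taken from the benchmark.
bench.add(matrix, ops, {"validation_accuracy": 0.9, "training_seconds": 100.0})
result = bench.query(matrix, ops)
```

Because each evaluation is a table lookup, a search algorithm built on such a table can be rerun many times at negligible cost, which is what makes comparisons between algorithms repeatable.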

Citations

•Posted Content

An Analysis of Super-Net Heuristics in Weight-Sharing NAS

Kaicheng Yu, +2 more

- 04 Oct 2021

- arXiv: Learning

TL;DR: In this article, the authors disentangle super-net training from the search algorithm, isolate 14 frequently-used training heuristics, and evaluate them over three benchmark search spaces.

•Posted Content

Neural networks adapting to datasets: learning network size and topology.

Romuald A. Janik, +1 more

- 22 Jun 2020

- arXiv: Learning

TL;DR: A flexible setup allowing for a neural network to learn both its size and topology during the course of a standard gradient-based training is introduced, which has the structure of a graph tailored to the particular learning task and dataset.

•Posted Content

A Novel Evolutionary Algorithm for Hierarchical Neural Architecture Search.

Aristeidis Christoforidis, +2 more

- 18 Jul 2021

- arXiv: Neural and Evolutionary Computing

TL;DR: In this paper, an evolutionary algorithm for neural architecture search is proposed, which organizes the topology in multiple hierarchical modules, while the design process exploits this representation, in order to explore the search space.

•Posted Content

EPE-NAS: Efficient Performance Estimation Without Training for Neural Architecture Search

Vasco Lopes, +2 more

- 16 Feb 2021

- arXiv: Learning

TL;DR: In this paper, an efficient performance estimation strategy, EPE-NAS, is proposed, which mitigates the problem of evaluating networks by scoring untrained networks and creating a correlation with their trained performance.

Journal Article•10.1109/ACCESS.2022.3208591

Learning a Unified Latent Space for NAS: Toward Leveraging Structural and Symbolic Information

Saeed Godeyri Eslami, +2 more

- IEEE Access

TL;DR: How to build a proper representation of network architecture that preserves explicit or implicit information inside the architecture is discussed, and the effectiveness of the proposed method as compared with the state-of-the-art predictors is demonstrated.

References

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

117.9K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

75.9K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

20.1K

•Posted Content

Decoupled Weight Decay Regularization

Ilya Loshchilov, +1 more

- 14 Nov 2017

- arXiv: Learning

TL;DR: This work proposes a simple modification to recover the original formulation of weight decay regularization by decoupling the weight decay from the optimization steps taken w.r.t. the loss function, and provides empirical evidence that this modification substantially improves Adam's generalization performance.

14.4K

•Journal Article

Random search for hyper-parameter optimization

James Bergstra, +1 more

- 01 Mar 2012

- Journal of Machine Learning Research

TL;DR: This paper shows empirically and theoretically that randomly chosen trials are more efficient for hyper-parameter optimization than trials on a grid, and shows that random search is a natural baseline against which to judge progress in the development of adaptive (sequential) hyper-parameter optimization algorithms.

9.7K

Related Papers (5)

DARTS: Differentiable Architecture Search

Learning Transferable Architectures for Scalable Image Recognition

Regularized Evolution for Image Classifier Architecture Search

Deep Residual Learning for Image Recognition

Progressive Neural Architecture Search