NAS-Bench-101: Towards Reproducible Neural Architecture Search

Open AccessProceedings Article

NAS-Bench-101: Towards Reproducible Neural Architecture Search

- 24 May 2019

- pp 7105-7114

549

TL;DR: This work introduces NAS-Bench-101, the first public architecture dataset for NAS research, which allows researchers to evaluate the quality of a diverse range of models in milliseconds by querying the pre-computed dataset.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Exploring the Loss Landscape in Neural Architecture Search

Colin White, +2 more

- 06 May 2020

- arXiv: Learning

TL;DR: In this article, the authors show that the simple hill-climbing algorithm is a powerful baseline for NAS, and when the noise in popular NAS benchmark datasets is reduced to a minimum, hill climbing outperforms many popular state-of-the-art algorithms.

...read moreread less

•Posted Content

iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

Miao Zhang, +5 more

- 21 Jun 2021

- arXiv: Learning

TL;DR: In this paper, a stochastic hypergradient approximation for differentiable NAS is proposed, and theoretically show that the architecture optimization with the proposed method, named iDARTS, is expected to converge to a stationary point, making it only depend on the obtained solution to the inner-loop optimization and agnostic to the optimization path.

...read moreread less

•Posted Content

Towards Green Automated Machine Learning: Status Quo and Future Directions

Tanja Tornede, +5 more

- 10 Nov 2021

- arXiv: Learning

TL;DR: In this paper, the authors identify four categories of actions the community may take towards more sustainable research on AutoML, namely approach design, benchmarking, research incentives, and transparency.

...read moreread less

Proceedings Article•10.1145/3528535.3533273

EdgeTune: Inference-Aware Multi-Parameter Tuning

Isabelly Rocha, +3 more

- 07 Nov 2022

TL;DR: A novel one-fold tuning algorithm that employs the principle of multi-fidelity and simultaneously explores multiple tuning budgets, which the prior art can only handle as suboptimal case of single type of budget is proposed.

...read moreread less

•Posted Content

GPNAS: A Neural Network Architecture Search Framework Based on Graphical Predictor

Dige Ai, +1 more

- 19 Mar 2021

- arXiv: Learning

TL;DR: In this article, the authors propose a framework to decouple network structure from operator search space, and use two BOHBs to search alternatively, which can not only improve the search efficiency, but also solve the dimension curse.

...read moreread less

...

Expand

References

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

20.1K

•Posted Content

Decoupled Weight Decay Regularization

Ilya Loshchilov, +1 more

- 14 Nov 2017

- arXiv: Learning

TL;DR: This work proposes a simple modification to recover the original formulation of weight decay regularization by decoupling the weight decay from the optimization steps taken w.r.t. the loss function, and provides empirical evidence that this modification substantially improves Adam's generalization performance.

...read moreread less

14.4K

•Journal Article

Random search for hyper-parameter optimization

James Bergstra, +1 more

- 01 Mar 2012

- Journal of Machine Learning Research

TL;DR: This paper shows empirically and theoretically that randomly chosen trials are more efficient for hyper-parameter optimization than trials on a grid, and shows that random search is a natural baseline against which to judge progress in the development of adaptive (sequential) hyper- parameter optimization algorithms.

...read moreread less

9.7K

...

Expand

NAS-Bench-101: Towards Reproducible Neural Architecture Search

Chat with Paper

AI Agents for this Paper

Citations

Exploring the Loss Landscape in Neural Architecture Search

iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

Towards Green Automated Machine Learning: Status Quo and Future Directions

EdgeTune: Inference-Aware Multi-Parameter Tuning

GPNAS: A Neural Network Architecture Search Framework Based on Graphical Predictor

References

Deep Residual Learning for Image Recognition

ImageNet: A large-scale hierarchical image database

Sequence to Sequence Learning with Neural Networks

Decoupled Weight Decay Regularization

Random search for hyper-parameter optimization

Related Papers (5)

DARTS: Differentiable Architecture Search

Learning Transferable Architectures for Scalable Image Recognition

Regularized Evolution for Image Classifier Architecture Search

Deep Residual Learning for Image Recognition

Progressive Neural Architecture Search