Hyperparameter Optimization with Factorized Multilayer Perceptrons

doi:10.1007/978-3-319-23525-7_6

Book Chapter

Nicolas Schilling, +3 more

- 07 Sep 2015

- pp 87-103

47

TL;DR: This paper proposes multilayer perceptrons as surrogate models because they can model highly nonlinear hyperparameter response surfaces. It improves their performance through an explicit factorization of the interaction weights and calls the resulting model a factorized multilayer perceptron.

Abstract: In machine learning, hyperparameter optimization is a challenging task that is usually approached by experienced practitioners or in a computationally expensive brute-force manner such as grid search. Therefore, recent research proposes to use observed hyperparameter performance on already solved problems (i.e., data sets) in order to speed up the search for promising hyperparameter configurations in the sequential model-based optimization framework. In this paper, we propose multilayer perceptrons as surrogate models as they are able to model highly nonlinear hyperparameter response surfaces. However, since interactions of hyperparameters, data sets and metafeatures are only implicitly learned in the subsequent layers, we improve the performance of multilayer perceptrons by means of an explicit factorization of the interaction weights and call the resulting model a factorized multilayer perceptron. Additionally, we evaluate different ways of obtaining predictive uncertainty, which is a key ingredient for a decent tradeoff between exploration and exploitation. Our experimental results on two public meta data sets demonstrate the efficiency of our approach compared to a variety of published baselines. For reproduction purposes, we make our data sets and all the program code publicly available on our supplementary webpage.
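The abstract does not spell out how the interaction weights are factorized. As a rough illustration only, the sketch below assumes a factorization-machine-style first layer, in which the weight for the interaction of input features i and j in a hidden unit is the inner product of two k-dimensional latent vectors. The function names, shapes, and the choice of NumPy are assumptions made for this example, not the authors' code.

```python
import numpy as np


def factorized_layer(x, W, V, b):
    """Pre-activations of a hidden layer with factorized pairwise interactions.

    Hypothetical shapes (assumed for this sketch):
      x: (d,)      input vector, e.g. hyperparameters, data set indicator, metafeatures
      W: (h, d)    ordinary linear weights
      V: (h, d, k) latent factors; the interaction weight of features i and j
                   in hidden unit u is the inner product <V[u, i], V[u, j]>
      b: (h,)      biases
    """
    linear = W @ x + b
    # Identity: sum_{i<j} <v_i, v_j> x_i x_j
    #   = 0.5 * sum_f [ (sum_i v_if x_i)^2 - sum_i v_if^2 x_i^2 ]
    s = np.einsum("hdk,d->hk", V, x)
    s_sq = np.einsum("hdk,d->hk", V**2, x**2)
    pairwise = 0.5 * (s**2 - s_sq).sum(axis=1)
    return linear + pairwise


def factorized_layer_naive(x, W, V, b):
    """Same quantity computed with explicit loops over feature pairs (for checking)."""
    h, d, _ = V.shape
    out = W @ x + b
    for u in range(h):
        for i in range(d):
            for j in range(i + 1, d):
                out[u] += (V[u, i] @ V[u, j]) * x[i] * x[j]
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=6)
    W = rng.normal(size=(4, 6))
    V = rng.normal(size=(4, 6, 3))
    b = rng.normal(size=4)
    # A nonlinearity such as tanh would follow before the next layer.
    hidden = np.tanh(factorized_layer(x, W, V, b))
    print(hidden.shape)
```

Under this assumption, each hidden unit needs d·k interaction parameters instead of one weight per feature pair, and the rewritten sum costs O(d·k) per unit rather than O(d²·k).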

Citations

Pattern Recognition and Machine Learning

Christopher M. Bishop

- 01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models in the context of machine learning and classification.

10.1K

Posted Content

Meta-Learning: A Survey

Joaquin Vanschoren

- 08 Oct 2018

- arXiv: Learning

TL;DR: This chapter provides an overview of the state of the art in meta-learning, the science of systematically observing how different machine learning approaches perform on a wide range of learning tasks and then learning from this experience, or meta-data, to learn new tasks much faster than otherwise possible.

600

Journal Article (doi:10.1016/J.PHYSA.2019.123592)

BreastNet: A novel convolutional neural network model through histopathological images for the diagnosis of breast cancer

Mesut Toğaçar, +3 more

- 01 May 2020

- Physica A-statistical Mechanics and Its ...

TL;DR: A novel deep learning model based on a convolutional neural network, named BreastNet, was developed. The proposed model increased classification success and yielded better results than other studies that use the current BreakHis dataset.

231

Proceedings Article

Scalable Hyperparameter Transfer Learning

Valerio Perrone, +3 more

- 01 Jan 2018

TL;DR: This work proposes a multi-task adaptive Bayesian linear regression model for transfer learning in BO, whose complexity is linear in the function evaluations: one Bayesian linear regression model is associated with each black-box function optimization problem (or task), while transfer learning is achieved by coupling the models through a shared deep neural net.

144

Proceedings Article (doi:10.1109/DSAA.2015.7344817)

Learning hyperparameter optimization initializations

Martin Wistuba, +2 more

- 07 Dec 2015

TL;DR: Empirical evidence is given that the proposed initialization strategy for hyperparameter optimization provides an improvement over the state of the art for initialization strategies and further methods that are able to transfer knowledge between data sets.

122

References

Journal Article (doi:10.1198/TECH.2007.S518)

Pattern Recognition and Machine Learning

Radford M. Neal

- 01 Aug 2007

- Technometrics

TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.

30.8K

Journal Article (doi:10.1038/323533A0)

Learning representations by back-propagating errors

David E. Rumelhart, +2 more

- 09 Oct 1986

- Nature

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.

30.1K

Book

Pattern Recognition and Machine Learning

Christopher M. Bishop

- 17 Aug 2006

TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.

23.4K

Book

Machine Learning: A Probabilistic Perspective

Kevin P. Murphy

- 24 Aug 2012

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

11.8K

Book

Pattern Recognition and Machine Learning (Information Science and Statistics)

Christopher M. Bishop

- 01 Aug 2006

10.1K

Related Papers (5)

Sequential model-based optimization for general algorithm configuration

Efficient Global Optimization of Expensive Black-Box Functions

Random search for hyper-parameter optimization

Practical Bayesian Optimization of Machine Learning Algorithms

Initializing Bayesian hyperparameter optimization via meta-learning