Book Chapter10.1007/978-3-319-23525-7_6
Hyperparameter Optimization with Factorized Multilayer Perceptrons
Nicolas Schilling,Martin Wistuba,Lucas Drumond,Lars Schmidt-Thieme +3 more
- 07 Sep 2015
- pp 87-103
TL;DR: This paper proposes multilayer perceptrons as surrogate models as they are able to model highly nonlinear hyperparameter response surfaces and improves the performance of multilier perceptrons by means of an explicit factorization of the interaction weights and call the resulting model a factorized multilayers perceptron.
read more
Abstract: In machine learning, hyperparameter optimization is a challenging task that is usually approached by experienced practitioners or in a computationally expensive brute-force manner such as grid-search. Therefore, recent research proposes to use observed hyperparameter performance on already solved problems (i.e. data sets) in order to speed up the search for promising hyperparameter configurations in the sequential model based optimization framework.
In this paper, we propose multilayer perceptrons as surrogate models as they are able to model highly nonlinear hyperparameter response surfaces. However, since interactions of hyperparameters, data sets and metafeatures are only implicitly learned in the subsequent layers, we improve the performance of multilayer perceptrons by means of an explicit factorization of the interaction weights and call the resulting model a factorized multilayer perceptron. Additionally, we evaluate different ways of obtaining predictive uncertainty, which is a key ingredient for a decent tradeoff between exploration and exploitation. Our experimental results on two public meta data sets demonstrate the efficiency of our approach compared to a variety of published baselines. For reproduction purposes, we make our data sets and all the program code publicly available on our supplementary webpage.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
10.1K
•Posted Content
Meta-Learning: A Survey
TL;DR: This chapter provides an overview of the state of the art in meta-learning, the science of systematically observing how different machine learning approaches perform on a wide range of learning tasks and then learning from this experience, or meta-data, to learn new tasks much faster than otherwise possible.
BreastNet: A novel convolutional neural network model through histopathological images for the diagnosis of breast cancer
TL;DR: A novel deep learning model developed based on a convolutional neural network named as BreastNet was developed and the success of the classification was increased by using the proposed model, which yielded better results than the other studies that use the current BreakHis dataset.
231
•Proceedings Article
Scalable Hyperparameter Transfer Learning
Valerio Perrone,Rodolphe Jenatton,Matthias Seeger,Cédric Archambeau +3 more
- 01 Jan 2018
TL;DR: This work proposes a multi-task adaptive Bayesian linear regression model for transfer learning in BO, whose complexity is linear in the function evaluations: one Bayesianlinear regression model is associated to each black-box function optimization problem (or task), while transfer learning is achieved by coupling the models through a shared deep neural net.
Learning hyperparameter optimization initializations
Martin Wistuba,Nicolas Schilling,Lars Schmidt-Thieme +2 more
- 07 Dec 2015
TL;DR: Empirical evidence is given that the proposed initialization strategy for hyperparameter optimization provides an improvement over the state of the art for initialization strategies and further methods that are able to transfer knowledge between data sets.
References
Pattern Recognition and Machine Learning
TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.
30.8K
Learning representations by back-propagating errors
TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
30.1K
•Book
Pattern Recognition and Machine Learning
Christopher M. Bishop
- 17 Aug 2006
TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.
•Book
Machine Learning : A Probabilistic Perspective
Kevin P. Murphy
- 24 Aug 2012
TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.
11.8K
•Book
Pattern Recognition and Machine Learning (Information Science and Statistics)
Christopher M. Bishop
- 01 Aug 2006
TL;DR: Looking for competent reading resources?
10.1K