Hierarchical Knowledge Gradient for Sequential Sampling
TL;DR: The paper proposes a hierarchical aggregation technique that uses the features shared by alternatives to learn about many alternatives from a single measurement. This greatly reduces the measurement effort, but requires prior knowledge of the smoothness of the function, supplied in the form of an aggregation function.
Abstract: We propose a sequential sampling policy for noisy discrete global optimization and ranking and selection, in which we aim to efficiently explore a finite set of alternatives before selecting an alternative as best when exploration stops. Each alternative may be characterized by a multi-dimensional vector of categorical and numerical attributes and has independent normal rewards. We use a Bayesian probability model for the unknown reward of each alternative and follow a fully sequential sampling policy called the knowledge-gradient policy. This policy myopically optimizes the expected increment in the value of sampling information in each time period. We propose a hierarchical aggregation technique that uses the common features shared by alternatives to learn about many alternatives from even a single measurement. This approach greatly reduces the measurement effort required, but it requires some prior knowledge on the smoothness of the function in the form of an aggregation function. In addition, computational issues limit the number of alternatives that can be easily considered to the thousands. We prove that our policy is consistent, finding a globally optimal alternative when given enough measurements, and show through simulations that it performs competitively with or significantly better than other policies.
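To make the idea of a myopic knowledge-gradient step concrete, here is a minimal sketch for the simpler case of independent normal beliefs with known measurement noise. It does not include the hierarchical aggregation proposed in the paper, and the function names (`knowledge_gradient`, `choose_next`, `update`) and the shared `noise_var` parameter are assumptions made for illustration only.

```python
import math


def normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def knowledge_gradient(mu, sigma2, noise_var):
    """Expected one-step gain in the best posterior mean for each alternative.

    mu, sigma2: prior means and variances (independent normal beliefs).
    noise_var: known measurement noise variance (assumed equal for all).
    Requires at least two alternatives.
    """
    values = []
    for x in range(len(mu)):
        # Standard deviation of the change in the posterior mean of x
        # caused by one more measurement of x.
        sigma_tilde = sigma2[x] / math.sqrt(sigma2[x] + noise_var)
        if sigma_tilde == 0.0:
            values.append(0.0)
            continue
        best_other = max(m for i, m in enumerate(mu) if i != x)
        zeta = -abs(mu[x] - best_other) / sigma_tilde
        values.append(sigma_tilde * (zeta * normal_cdf(zeta) + normal_pdf(zeta)))
    return values


def choose_next(mu, sigma2, noise_var):
    """Index of the alternative with the largest knowledge-gradient value."""
    values = knowledge_gradient(mu, sigma2, noise_var)
    return max(range(len(values)), key=values.__getitem__)


def update(mu, sigma2, x, y, noise_var):
    """Return beliefs after observing reward y for alternative x."""
    precision = 1.0 / sigma2[x] + 1.0 / noise_var
    new_mu, new_sigma2 = list(mu), list(sigma2)
    new_mu[x] = (mu[x] / sigma2[x] + y / noise_var) / precision
    new_sigma2[x] = 1.0 / precision
    return new_mu, new_sigma2
```

In this sketch a measurement of one alternative only updates the belief about that alternative. The hierarchical approach described in the abstract differs precisely on this point: one measurement also informs alternatives that share features with the one measured.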
Citations
A review of recent research on green road freight transportation
TL;DR: This review of recent research on green road freight transportation explains vehicle emission models and how they are incorporated into existing optimization methods.
723
Flexible Heuristics Miner (FHM)
A.J.M.M. Weijters,Joel Ribeiro +1 more
- 11 Apr 2011
TL;DR: A new process representation language is presented in combination with an accompanying process mining algorithm that results in easy to understand process models even in the case of non-trivial constructs, low structured domains and the presence of noise.
A unified framework for stochastic optimization
TL;DR: It is argued that the principles of bandit problems should become a core dimension of mainstream stochastic optimization, and a universal modeling framework is proposed that encompasses all of these competing approaches.
279
Parallel Gaussian Process Optimization with Upper Confidence Bound and Pure Exploration
TL;DR: The Gaussian Process Upper Confidence Bound and Pure exploration algorithm (GP-UCB-PE) is introduced which combines the UCB strategy and Pure Exploration in the same batch of evaluations along the parallel iterations and proves theoretical upper bounds on the regret with batches of size K for this procedure.
215
Maintenance spare parts planning and control : a framework for control and agenda for future research
M.A. Driessen,Jacobus J. Arts,Geert-Jan van Houtum,Jan Willem Rustenburg,Bob Huisman +4 more
TL;DR: In this article, a framework for planning and control of the spare parts supply chain in organizations that use and maintain high-value capital assets is presented, where decisions in the framework are decomposed hierarchically and interfaces are described.