Hierarchical Knowledge Gradient for Sequential Sampling

doi:10.5555/1953048.2078200

Open Access•Journal Article•10.5555/1953048.2078200

Martijn R.K. Mes, +2 more

- 01 Feb 2011

- Journal of Machine Learning Research

- Vol. 12, Iss: 10, pp 2931-2974

113

TL;DR: A hierarchical aggregation technique that uses the common features shared by alternatives to learn about many alternatives from even a single measurement is proposed, which greatly reduces the measurement effort but requires some prior knowledge on the smoothness of the function in the form of an aggregation function.

Abstract: We propose a sequential sampling policy for noisy discrete global optimization and ranking and selection, in which we aim to efficiently explore a finite set of alternatives before selecting an alternative as best when exploration stops. Each alternative may be characterized by a multi-dimensional vector of categorical and numerical attributes and has independent normal rewards. We use a Bayesian probability model for the unknown reward of each alternative and follow a fully sequential sampling policy called the knowledge-gradient policy. This policy myopically optimizes the expected increment in the value of sampling information in each time period. We propose a hierarchical aggregation technique that uses the common features shared by alternatives to learn about many alternatives from even a single measurement. This approach greatly reduces the measurement effort required, but it requires some prior knowledge on the smoothness of the function in the form of an aggregation function, and computational issues limit the number of alternatives that can be easily considered to the thousands. We prove that our policy is consistent, finding a globally optimal alternative when given enough measurements, and show through simulations that it performs competitively with or significantly better than other policies.
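To make the knowledge-gradient idea in the abstract concrete, the following is a minimal sketch of the standard knowledge-gradient computation for independent normal beliefs with known measurement noise. It is not the hierarchical policy proposed in the paper: it omits the aggregation step entirely, and the function names (`kg_factors`, `update_belief`) and the shared noise variance are assumptions made for illustration.

```python
import math


def _pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def _cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def kg_factors(mu, var, noise_var):
    """Knowledge-gradient value of one more measurement of each alternative.

    mu, var: posterior means and variances (independent normal beliefs).
    noise_var: measurement noise variance, assumed known and shared.
    Requires at least two alternatives.
    """
    values = []
    for x in range(len(mu)):
        # Standard deviation of the change in the posterior mean of x.
        sigma_tilde = var[x] / math.sqrt(var[x] + noise_var)
        if sigma_tilde == 0.0:
            values.append(0.0)
            continue
        best_other = max(m for i, m in enumerate(mu) if i != x)
        zeta = -abs(mu[x] - best_other) / sigma_tilde
        values.append(sigma_tilde * (zeta * _cdf(zeta) + _pdf(zeta)))
    return values


def update_belief(mu, var, x, y, noise_var):
    """Return new (mu, var) lists after observing reward y for alternative x."""
    precision = 1.0 / var[x] + 1.0 / noise_var
    new_mu, new_var = list(mu), list(var)
    new_mu[x] = (mu[x] / var[x] + y / noise_var) / precision
    new_var[x] = 1.0 / precision
    return new_mu, new_var


def choose_next(mu, var, noise_var):
    """Myopic decision: measure the alternative with the largest KG value."""
    values = kg_factors(mu, var, noise_var)
    return max(range(len(values)), key=values.__getitem__)
```

In this simplified setting each measurement updates only the belief about the alternative that was measured. The hierarchical technique described in the abstract differs precisely here, since it lets a single measurement inform the beliefs about many alternatives that share features.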

Citations

•Journal Article•10.1016/J.EJOR.2013.12.033

A review of recent research on green road freight transportation

Emrah Demir, +2 more

- 16 Sep 2014

- European Journal of Operational Research

TL;DR: This review of recent research on green road freight transportation provides an understanding of vehicle emission models and their inclusion in existing optimization methods.

723

•Proceedings Article•10.1109/CIDM.2011.5949453

Flexible Heuristics Miner (FHM)

A.J.M.M. Weijters, +1 more

- 11 Apr 2011

TL;DR: A new process representation language is presented in combination with an accompanying process mining algorithm that results in easy to understand process models even in the case of non-trivial constructs, low structured domains and the presence of noise.

552

•Journal Article•10.1016/J.EJOR.2018.07.014

A unified framework for stochastic optimization

Warren B. Powell

- 16 Jun 2019

- European Journal of Operational Research

TL;DR: It is argued that the principles of bandit problems should become a core dimension of mainstream stochastic optimization, and a universal modeling framework is proposed that encompasses all of these competing approaches.

279

•Book Chapter•10.1007/978-3-642-40988-2_15

Parallel Gaussian Process Optimization with Upper Confidence Bound and Pure Exploration

Emile Contal, +3 more

- 19 Apr 2013

- arXiv: Learning

TL;DR: The Gaussian Process Upper Confidence Bound and Pure exploration algorithm (GP-UCB-PE) is introduced which combines the UCB strategy and Pure Exploration in the same batch of evaluations along the parallel iterations and proves theoretical upper bounds on the regret with batches of size K for this procedure.

215

•Journal Article•10.1080/09537287.2014.907586

Maintenance spare parts planning and control : a framework for control and agenda for future research

M.A. Driessen, +4 more

- 14 Apr 2014

- Production Planning & Control

TL;DR: In this article, a framework for planning and control of the spare parts supply chain in organizations that use and maintain high-value capital assets is presented, where decisions in the framework are decomposed hierarchically and interfaces are described.

193
