Top 169 papers published in the topic of Asymptotically optimal algorithm in 2019

Showing papers on "Asymptotically optimal algorithm published in 2019"

Journal Article•10.1109/TWC.2019.2940454•

Fast mmwave Beam Alignment via Correlated Bandit Learning

[...]

Wen Wu¹, Nan Cheng², Ning Zhang³, Peng Yang¹, Weihua Zhuang¹, Xuemin Shen¹ - Show less +2 more•Institutions (3)

University of Waterloo¹, Xidian University², Texas A&M University–Corpus Christi³

17 Sep 2019-IEEE Transactions on Wireless Communications

TL;DR: A learning algorithm to reduce BA latency, namely Hierarchical Beam Alignment (HBA) algorithm, which takes advantage of the correlation structure among beams such that the information from nearby beams is extracted to identify the optimal beam, instead of searching the entire beam space.

...read moreread less

Abstract: Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair, which incurs significant BA latency on the order of seconds in the worst case. In this paper, we develop a learning algorithm to reduce BA latency, namely Hierarchical Beam Alignment (HBA) algorithm. We first formulate the BA problem as a stochastic multi-armed bandit problem with the objective to maximize the cumulative received signal strength within a certain period. The proposed algorithm takes advantage of the correlation structure among beams such that the information from nearby beams is extracted to identify the optimal beam, instead of searching the entire beam space. Furthermore, the prior knowledge on the channel fluctuation is incorporated in the proposed algorithm to further accelerate the BA process. Theoretical analysis indicates that the proposed algorithm is asymptotically optimal. Extensive simulation results demonstrate that the proposed algorithm can identify the optimal beam with a high probability and reduce the BA latency from hundreds of milliseconds to a few milliseconds in the multipath channel, as compared to the existing BA method in IEEE 802.11ad.

...read moreread less

160 citations

Journal Article•10.1007/S10898-019-00798-7•

Efficient computation of expected hypervolume improvement using box decomposition algorithms

[...]

Kaifeng Yang¹, Michael Emmerich¹, André H. Deutz¹, Thomas Bäck¹•Institutions (1)

Leiden University¹

03 Jul 2019-Journal of Global Optimization

TL;DR: An efficient algorithm for the exact calculation of the EHVI for in a generic case is proposed, based on partitioning the integration volume into a set of axis-parallel slices and a new hyperbox decomposition technique, which is proposed by Dachert et al.

...read moreread less

Abstract: In the field of multi-objective optimization algorithms, multi-objective Bayesian Global Optimization (MOBGO) is an important branch, in addition to evolutionary multi-objective optimization algorithms. MOBGO utilizes Gaussian Process models learned from previous objective function evaluations to decide the next evaluation site by maximizing or minimizing an infill criterion. A commonly used criterion in MOBGO is the Expected Hypervolume Improvement (EHVI), which shows a good performance on a wide range of problems, with respect to exploration and exploitation. However, so far, it has been a challenge to calculate exact EHVI values efficiently. This paper proposes an efficient algorithm for the exact calculation of the EHVI for in a generic case. This efficient algorithm is based on partitioning the integration volume into a set of axis-parallel slices. Theoretically, the upper bound time complexities can be improved from previously $$O (n^2)$$ and $$O(n^3)$$ , for two- and three-objective problems respectively, to $$\varTheta (n\log n)$$ , which is asymptotically optimal. This article generalizes the scheme in higher dimensional cases by utilizing a new hyperbox decomposition technique, which is proposed by Dachert et al. (Eur J Oper Res 260(3):841–855, 2017). It also utilizes a generalization of the multilayered integration scheme that scales linearly in the number of hyperboxes of the decomposition. The speed comparison shows that the proposed algorithm in this paper significantly reduces computation time. Finally, this decomposition technique is applied in the calculation of the Probability of Improvement (PoI).

...read moreread less

89 citations

Journal Article•10.1109/TAC.2018.2853558•

Sampling-Based Optimal Control Synthesis for Multirobot Systems Under Global Temporal Tasks

[...]

Yiannis Kantaros¹, Michael M. Zavlanos¹•Institutions (1)

Duke University¹

01 May 2019-IEEE Transactions on Automatic Control

TL;DR: In this article, the authors propose a sampling-based algorithm that builds incrementally trees that approximate the state space and transitions of the synchronous product automata, which is probabilistically complete and asymptotically optimal.

...read moreread less

Abstract: This paper proposes a new optimal control synthesis algorithm for multirobot systems under global temporal logic tasks. Existing planning approaches under global temporal goals rely on graph search techniques applied to a product automaton constructed among the robots. In this paper, we propose a new sampling-based algorithm that builds incrementally trees that approximate the state space and transitions of the synchronous product automaton. By approximating the product automaton by a tree rather than representing it explicitly, we require much fewer memory resources to store it and motion plans can be found by tracing sequences of parent nodes without the need for sophisticated graph search methods. This significantly increases the scalability of our algorithm compared to existing optimal control synthesis methods. We also show that the proposed algorithm is probabilistically complete and asymptotically optimal. Finally, we present numerical experiments showing that our approach can synthesize optimal plans from product automata with billions of states, which is not possible using standard optimal control synthesis algorithms or model checkers.

...read moreread less

81 citations

Journal Article•10.1109/TIT.2018.2866257•

Active Anomaly Detection in Heterogeneous Processes

[...]

Boshuang Huang¹, Kobi Cohen², Qing Zhao¹•Institutions (2)

Cornell University¹, Ben-Gurion University of the Negev²

01 Apr 2019-IEEE Transactions on Information Theory

TL;DR: A low-complexity deterministic test is shown to enjoy the same asymptotic optimality while offering significantly better performance in the finite regime and faster convergence to the optimal rate function, especially when the number of processes is large.

...read moreread less

Abstract: An active inference problem of detecting anomalies among heterogeneous processes is considered. At each time, a subset of processes can be probed. The objective is to design a sequential probing strategy that dynamically determines which processes to observe at each time and when to terminate the search so that the expected detection time is minimized under a constraint on the probability of misclassifying any process. This problem falls into the general setting of sequential design of experiments pioneered by Chernoff in 1959, in which a randomized strategy, referred to as the Chernoff test, was proposed and shown to be asymptotically optimal as the error probability approaches zero. For the problem considered in this paper, a low-complexity deterministic test is shown to enjoy the same asymptotic optimality while offering significantly better performance in the finite regime and faster convergence to the optimal rate function, especially when the number of processes is large. Furthermore, the proposed test offers considerable reduction in computation complexity.

...read moreread less

72 citations

Proceedings Article•

Asymptotically Optimal Planning for Non-Myopic Multi-Robot Information Gathering

[...]

Yiannis Kantaros¹, Brent Schlotfeldt², Nikolay Atanasov³, George J. Pappas²•Institutions (3)

Duke University¹, University of Pennsylvania², University of California, San Diego³

22 Jun 2019

TL;DR: This paper proposes a novel highly scalable sampling-based planning algorithm for multi-robot active information acquisition tasks in complex environments that can address large-scale estimation tasks which were previously infeasible.

...read moreread less

Abstract: This paper proposes a novel highly scalable sampling-based planning algorithm for multi-robot active information acquisition tasks in complex environments. Active information gathering scenarios include target localization and tracking, active SLAM, surveillance, environmental monitoring and others. The objective is to compute control policies for sensing robots which minimize the accumulated uncertainty of a dynamic hidden state over an a priori unknown horizon. To address this problem, we propose a new sampling-based algorithm that simultaneously explores both the robot motion space and the reachable information space. Unlike relevant samplingbased approaches, we show that the proposed algorithm is probabilistically complete, asymptotically optimal and is supported by convergence rate bounds. Moreover, we demonstrate that by introducing bias in the sampling process towards informative areas, the proposed method can quickly compute sensor policies that achieve desired levels of uncertainty in large-scale estimation tasks that may involve large sensor teams, workspaces, and dimensions of the hidden state. We provide extensive simulation results that corroborate the theoretical analysis and show that the proposed algorithm can address large-scale estimation tasks which were previously infeasible.

...read moreread less

63 citations

Posted Content•

Adaptive Exploration in Linear Contextual Bandit

[...]

Botao Hao¹, Tor Lattimore², Csaba Szepesvári²•Institutions (2)

Princeton University¹, Google²

15 Oct 2019-arXiv: Learning

TL;DR: This work designs an algorithm that is asymptotically optimal and has good finite-time empirical performance and makes connections to the recent literature on when exploration-free methods are effective.

...read moreread less

Abstract: Contextual bandits serve as a fundamental model for many sequential decision making tasks. The most popular theoretically justified approaches are based on the optimism principle. While these algorithms can be practical, they are known to be suboptimal asymptotically. On the other hand, existing asymptotically optimal algorithms for this problem do not exploit the linear structure in an optimal way and suffer from lower-order terms that dominate the regret in all practically interesting regimes. We start to bridge the gap by designing an algorithm that is asymptotically optimal and has good finite-time empirical performance. At the same time, we make connections to the recent literature on when exploration-free methods are effective. Indeed, if the distribution of contexts is well behaved, then our algorithm acts mostly greedily and enjoys sub-logarithmic regret. Furthermore, our approach is adaptive in the sense that it automatically detects the nice case. Numerical results demonstrate significant regret reductions by our method relative to several baselines.

...read moreread less

53 citations

Journal Article•10.1017/APR.2019.29•

An Asymptotically Optimal Heuristic for General Non-Stationary Finite-Horizon Restless Multi-Armed Multi-Action Bandits

[...]

Gabriel Zayas-Cabán¹, Stefanus Jasin², Guihua Wang²•Institutions (2)

University of Wisconsin-Madison¹, University of Michigan²

01 Sep 2019-Advances in Applied Probability

TL;DR: Numerical simulations show that the Randomized Assignment Control is asymptotically optimal for a dynamic population, where bandits can randomly arrive and depart the system, and discuss how the framework extends to more general costs and constraints.

...read moreread less

Abstract: We propose an asymptotically optimal heuristic, which we term randomized assignment control (RAC) for a restless multi-armed bandit problem with discrete-time and finite states. It is constructed using a linear programming relaxation of the original stochastic control formulation. In contrast to most of the existing literature, we consider a finite-horizon problem with multiple actions and time-dependent (i.e. nonstationary) upper bound on the number of bandits that can be activated at each time period; indeed, our analysis can also be applied in the setting with nonstationary transition matrix and nonstationary cost function. The asymptotic setting is obtained by letting the number of bandits and other related parameters grow to infinity. Our main contribution is that the asymptotic optimality of RAC in this general setting does not require indexability properties or the usual stability conditions of the underlying Markov chain (e.g. unichain) or fluid approximation (e.g. global stable attractor). Moreover, our multi-action setting is not restricted to the usual dominant action concept. Finally, we show that RAC is also asymptotically optimal for a dynamic population, where bandits can randomly arrive and depart the system.

...read moreread less

43 citations

Posted Content•

Dynamic optimization with side information.

[...]

Dimitris Bertsimas¹, Christopher McCord¹, Bradley Sturt¹•Institutions (1)

Massachusetts Institute of Technology¹

17 Jul 2019-arXiv: Optimization and Control

TL;DR: Through a novel measure concentration result for a class of machine learning methods, it is proved that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information.

...read moreread less

Abstract: We develop a tractable and flexible approach for incorporating side information into dynamic optimization under uncertainty. The proposed framework uses predictive machine learning methods (such as $k$-nearest neighbors, kernel regression, and random forests) to weight the relative importance of various data-driven uncertainty sets in a robust optimization formulation. Through a novel measure concentration result for a class of machine learning methods, we prove that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information. We also describe a general-purpose approximation for these optimization problems, based on overlapping linear decision rules, which is computationally tractable and produces high-quality solutions for dynamic problems with many stages. Across a variety of examples in inventory management, finance, and shipment planning, our method achieves improvements of up to 15\% over alternatives and requires less than one minute of computation time on problems with twelve stages.

...read moreread less

43 citations

Proceedings Article•

Optimized Score Transformation for Fair Classification.

[...]

Dennis Wei¹, Karthikeyan Natesan Ramamurthy¹, Flavio P. Calmon¹•Institutions (1)

IBM¹

31 May 2019

TL;DR: Comprehensive experiments show that the proposed FairScoreTransformer has advantages for score-based metrics such as Brier score and AUC while remaining competitive for binary label-based metric such as accuracy.

...read moreread less

Abstract: This paper considers fair probabilistic classification where the outputs of primary interest are predicted probabilities, commonly referred to as scores We formulate the problem of transforming scores to satisfy fairness constraints that are linear in conditional means of scores while minimizing the loss in utility The formulation can be applied either to post-process classifier outputs or to pre-process training data, thus allowing maximum freedom in selecting a classification algorithm We derive a closed-form expression for the optimal transformed scores and a convex optimization problem for the transformation parameters In the population limit, the transformed score function is the fairness-constrained minimizer of cross-entropy with respect to the optimal unconstrained scores In the finite sample setting, we propose to approach this solution using a combination of standard probabilistic classifiers and ADMM The transformation parameters obtained from the finite-sample procedure are shown to be asymptotically optimal Comprehensive experiments comparing to 10 existing methods show that the proposed FairScoreTransformer has advantages for score-based metrics such as Brier score and AUC while remaining competitive for binary label-based metrics such as accuracy

...read moreread less

41 citations

Proceedings Article•

Pure Exploration with Multiple Correct Answers

[...]

Rémy Degenne¹, Wouter M. Koolen¹•Institutions (1)

Centrum Wiskunde & Informatica¹

1 Jan 2019

TL;DR: In this article, the authors derived a lower bound for the sample complexity of pure exploration bandit problems with multiple good answers and presented a new algorithm which has asymptotic sample complexity matching the lower bound.

...read moreread less

Abstract: We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensure that the existing Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.

...read moreread less

37 citations

Journal Article•10.1007/S10463-017-0642-9•

Frequentist model averaging for threshold models

[...]

Yan Gao¹, Yan Gao², Xinyu Zhang¹, Shouyang Wang¹, Terence Tai-Leung Chong³, Guohua Zou⁴ - Show less +2 more•Institutions (4)

Chinese Academy of Sciences¹, Minzu University of China², The Chinese University of Hong Kong³, Capital Normal University⁴

01 Apr 2019-Annals of the Institute of Statistical Mathematics

TL;DR: This paper develops a frequentist model averaging approach for threshold model specifications that is proved to be asymptotically optimal in the sense of achieving the lowest possible squared errors when combining estimators from threshold autoregressive models.

...read moreread less

Abstract: This paper develops a frequentist model averaging approach for threshold model specifications. The resulting estimator is proved to be asymptotically optimal in the sense of achieving the lowest possible squared errors. In particular, when combining estimators from threshold autoregressive models, this approach is also proved to be asymptotically optimal. Simulation results show that for the situation where the existing model averaging approach is not applicable, our proposed model averaging approach has a good performance; for the other situations, our proposed model averaging approach performs marginally better than other commonly used model selection and model averaging methods. An empirical application of our approach on the US unemployment data is given.

...read moreread less

Journal Article•10.1016/J.AUTOMATICA.2018.12.008•

An efficient simulation procedure for ranking the top simulated designs in the presence of stochastic constraints

[...]

Hui Xiao¹, Hu Chen¹, Loo Hay Lee²•Institutions (2)

Southwestern University of Finance and Economics¹, National University of Singapore²

01 May 2019-Automatica

TL;DR: A lower bound on the PCR is proposed and an asymptotically optimal allocation rule is developed based on the lower bound to derive an efficient simulation budget allocation strategy such that the probability of correct ranking (PCR) can be maximized.

...read moreread less

Posted Content•

From Predictions to Prescriptions in Multistage Optimization Problems.

[...]

Dimitris Bertsimas¹, Christopher McCord¹•Institutions (1)

Massachusetts Institute of Technology¹

26 Apr 2019-arXiv: Machine Learning

TL;DR: This paper introduces a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data, and establishes finite sample guarantees for the optimality of the method with $k$NN weight functions.

...read moreread less

Abstract: In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data. We assume the joint distribution of the uncertain quantities is unknown, but noisy observations, along with observations of auxiliary covariates, are available. We utilize effective predictive methods from machine learning (ML), including $k$-nearest neighbors regression ($k$NN), classification and regression trees (CART), and random forests (RF), to develop specific methods that are applicable to a wide variety of problems. We demonstrate that our solution methods are asymptotically optimal under mild conditions. Additionally, we establish finite sample guarantees for the optimality of our method with $k$NN weight functions. Finally, we demonstrate the practicality of our approach with computational examples. We see a significant decrease in cost by taking into account the auxiliary data in the multistage setting.

...read moreread less

Proceedings Article•

Adaptive Exploration in Linear Contextual Bandit.

[...]

Botao Hao¹, Tor Lattimore², Csaba Szepesvári²•Institutions (2)

Princeton University¹, Google²

1 Oct 2019

TL;DR: In this article, the authors propose an algorithm that is asymptotically optimal and has good finite-time empirical performance, and make connections to the recent literature on when exploration-free methods are effective.

...read moreread less

Abstract: Contextual bandits serve as a fundamental model for many sequential decision making tasks The most popular theoretically justified approaches are based on the optimism principle While these algorithms can be practical, they are known to be suboptimal asymptotically On the other hand, existing asymptotically optimal algorithms for this problem do not exploit the linear structure in an optimal way and suffer from lower-order terms that dominate the regret in all practically interesting regimes We start to bridge the gap by designing an algorithm that is asymptotically optimal and has good finite-time empirical performance At the same time, we make connections to the recent literature on when exploration-free methods are effective Indeed, if the distribution of contexts is well behaved, then our algorithm acts mostly greedily and enjoys sub-logarithmic regret Furthermore, our approach is adaptive in the sense that it automatically detects the nice case Numerical results demonstrate significant regret reductions by our method relative to several baselines

...read moreread less

Proceedings Article•10.1109/COASE.2019.8842957•

Selecting the Optimal System Design under Covariates

[...]

Siyang Gao¹, Jianzhong Du¹, Chun-Hung Chen²•Institutions (2)

City University of Hong Kong¹, George Mason University²

1 Aug 2019

TL;DR: This work introduces three measures to evaluate the selection quality over the covariate space and investigates their rate functions of convergence and shows that the selection algorithm can recover the asymptotical optimal allocation in the limit.

...read moreread less

Abstract: In this research, we consider the ranking and selection problem in the presence of covariates. It is an important problem in personalized decision making. The performance of each design alternative depends on the values of the covariates to the simulation model for which the relationship is hard to describe analytically. Therefore the optimal design under each possible covariate value needs to be estimated by simulation. This work first introduces three measures to evaluate the selection quality over the covariate space and investigates their rate functions of convergence. By optimizing the rate functions, an asymptotically optimal budget allocation rule is developed and a corresponding selection algorithm is devised. We further show that the selection algorithm can recover the asymptotical optimal allocation in the limit. The high efficiency of the selection algorithm is illustrated via numerical testing.

...read moreread less

Posted Content•

Minimax Linear Estimation of the Retargeted Mean

[...]

David A. Hirshberg, Arian Maleki, José R. Zubizarreta

11 Jan 2019-arXiv: Statistics Theory

TL;DR: In this article, the authors study the minimax linear estimator of the treatment-specific mean outcome on a target population and provide a theoretical basis for inference based on it, which is the common practice of ignoring bias when building confidence intervals with these linear estimators.

...read moreread less

Abstract: Evaluating treatments received by one population for application to a different target population of scientific interest is a central problem in causal inference from observational studies. We study the minimax linear estimator of the treatment-specific mean outcome on a target population and provide a theoretical basis for inference based on it. In particular, we provide a justification for the common practice of ignoring bias when building confidence intervals with these linear estimators. Focusing on the case that the class of the unknown outcome function is the unit ball of a reproducing kernel Hilbert space, we show that the resulting linear estimator is asymptotically optimal under conditions only marginally stronger than those used with augmented estimators. We establish bounds attesting to the estimator's good finite sample properties. In an extensive simulation study, we observe promising performance of the estimator throughout a wide range of sample sizes, noise levels, and levels of overlap between the covariate distributions of the treated and target populations.

...read moreread less

Proceedings Article•10.1109/ISIT.2019.8849721•

Second-Order Asymptotically Optimal Statistical Classification

[...]

Lin Zhou¹, Vincent Y. F. Tan², Mehul Motani²•Institutions (2)

University of Michigan¹, National University of Singapore²

7 Jul 2019

TL;DR: In this article, the authors consider the second-order asymptotics tradeoff between type-I and type-II error probabilities for tests which ensure that the type I error probability for all pairs of distributions decays exponentially fast and the type II error probability of a particular pair of distributions is nonvanishing.

...read moreread less

Abstract: Motivated by real-world machine learning applications, we analyze approximations to the non-asymptotic fundamental limits of statistical classification. In the binary version of this problem, given two training sequences generated according to two unknown distributions P 1 and P 2 , one is tasked to classify a test sequence which is known to be generated according to either P 1 or P 2 . This problem can be thought of as an analogue of the binary hypothesis testing problem but in the present setting, the generating distributions are unknown. Due to finite sample considerations, we consider the second-order asymptotics (or dispersion-type) tradeoff between type-I and type-II error probabilities for tests which ensure that (i) the type-I error probability for all pairs of distributions decays exponentially fast and (ii) the type-II error probability for a particular pair of distributions is non-vanishing. We generalize our results to classification of multiple hypotheses with the rejection option.

...read moreread less

Journal Article•10.5705/SS.202015.0392•

Optimal model averaging estimation for partially linear models

[...]

Xinyu Zhang, Wendun Wang

01 Jan 2019-Statistica Sinica

TL;DR: In this article, optimal model averaging for partially linear models with heteroscedasticity is studied and the resulting estimator is proved to be asymptotically optimal under some regularity conditions.

...read moreread less

Abstract: textThis article studies optimal model averaging for partially linear models with heteroscedasticity. A Mallows-type criterion is proposed to choose the weight. The resulting model averaging estimator is proved to be asymptotically optimal under some regularity conditions. Simulation experiments suggest that the proposed model averaging method is superior to other commonly used model selection and averaging methods. The proposed procedure is further applied to study Japan’s sovereign credit default swap spreads.

...read moreread less

Proceedings Article•10.24963/IJCAI.2019/855•

The Provable Virtue of Laziness in Motion Planning.

[...]

Nika Haghtalab¹, Simon Mackenzie², Ariel D. Procaccia³, Oren Salzman³, Siddhartha S. Srinivasa⁴ - Show less +1 more•Institutions (4)

Microsoft¹, University of New South Wales², Carnegie Mellon University³, University of Washington⁴

1 Aug 2019

TL;DR: In this paper, an analytical upper bound on the number of edge evaluations required by Lazy Shortest Path (LazySP) algorithms is derived, and a matching lower bound is shown that these algorithms are asymptotically optimal in the worst case.

...read moreread less

Abstract: The Lazy Shortest Path (LazySP) class consists of motion-planning algorithms that only evaluate edges along candidate shortest paths between the source and target. These algorithms were designed to minimize the number of edge evaluations in settings where edge evaluation dominates the running time of the algorithm; but how close to optimal are LazySP algorithms in terms of this objective? Our main result is an analytical upper bound, in a probabilistic model, on the number of edge evaluations required by LazySP algorithms; a matching lower bound shows that these algorithms are asymptotically optimal in the worst case.

...read moreread less

Proceedings Article•10.1109/ICASSP.2019.8683450•

Active Anomaly Detection with Switching Cost

[...]

Da Chen¹, Qiwei Huang¹, Hui Feng¹, Qing Zhao², Bo Hu¹ - Show less +1 more•Institutions (2)

Fudan University¹, Cornell University²

8 May 2019

TL;DR: In this paper, an active inference strategy consisting of a selection rule governing which process to probe at each time, a stopping rule on when to terminate the detection, and a decision rule on the final detection outcome is considered.

...read moreread less

Abstract: The problem of anomaly detection among multiple processes is considered within the framework of sequential design of experiments. The objective is an active inference strategy consisting of a selection rule governing which process to probe at each time, a stopping rule on when to terminate the detection, and a decision rule on the final detection outcome. The performance measure is the Bayes risk that takes into account not only sample complexity and detection errors, but also costs associated with switching across processes. While the problem is a partially observable Markov decision process to which optimal solutions are generally intractable, a low-complexity deterministic policy is shown to be asymptotically optimal and offer significant performance improvement over existing methods in the finite regime.

...read moreread less

Journal Article•10.1016/J.JFA.2019.108280•

Improved log-Sobolev inequalities, hypercontractivity and uncertainty principle on the hypercube

[...]

Yury Polyanskiy¹, Alex Samorodnitsky²•Institutions (2)

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem²

01 Dec 2019-Journal of Functional Analysis

TL;DR: In this paper, a family of entropy-energy inequalities for binary hypercube functions with small support is presented. But the tradeoff between the sizes that they derive is asymptotically optimal.

...read moreread less

Journal Article•10.1007/S12095-018-0331-4•

Two constructions of asymptotically optimal codebooks

[...]

Gaojun Luo¹, Xiwang Cao², Xiwang Cao¹•Institutions (2)

Nanjing University of Aeronautics and Astronautics¹, Chinese Academy of Sciences²

01 Jul 2019-Cryptography and Communications

TL;DR: The constructions of complex codebooks from the operations of certain sets are proposed and are proved to be asymptotically optimal with respect to the Welch bound.

...read moreread less

Abstract: Codebooks with low-coherence have wide applications in many fields such as direct spread code division multiple access communications, compressed sensing, signal processing and so on. In this paper, we propose two constructions of complex codebooks from the operations of certain sets. The complex codebooks produced by these constructions are proved to be asymptotically optimal with respect to the Welch bound. In addition, the parameters of the complex codebooks presented in this paper are new and flexible in some cases.

...read moreread less

Journal Article•10.1137/17M113592X•

Trading Fractional Brownian Motion

[...]

Paolo Guasoni, Zsolt Nika, Miklós Rásonyi

05 Sep 2019-Siam Journal on Financial Mathematics

TL;DR: In a market with an asset price described by fractional Brownian motion, which can be traded with temporary nonlinear price impact, the authors find asymptotically optimal strategies for the maximization of the asset price.

...read moreread less

Abstract: In a market with an asset price described by fractional Brownian motion, which can be traded with temporary nonlinear price impact, we find asymptotically optimal strategies for the maximization of...

...read moreread less

Journal Article•10.1109/TNNLS.2018.2854796•

Asymptotically Optimal Contextual Bandit Algorithm Using Hierarchical Structures

[...]

Mohammadreza Mohaghegh Neyshabouri¹, Kaan Gokcesu², Hakan Gökcesu³, Huseyin Ozkan⁴, Suleyman S. Kozat¹ - Show less +1 more•Institutions (4)

Bilkent University¹, Massachusetts Institute of Technology², École Polytechnique Fédérale de Lausanne³, Sabancı University⁴

01 Mar 2019-IEEE Transactions on Neural Networks

TL;DR: This work proposes an online algorithm for sequential learning in the contextual multiarmed bandit setting and provides significant performance improvements by introducing upper bounds (with respect to the best arm selection policy) that are mathematically proven to vanish in the average loss per round sense at a faster rate compared to the state of the art.

...read moreread less

Abstract: We propose an online algorithm for sequential learning in the contextual multiarmed bandit setting. Our approach is to partition the context space and, then, optimally combine all of the possible mappings between the partition regions and the set of bandit arms in a data-driven manner. We show that in our approach, the best mapping is able to approximate the best arm selection policy to any desired degree under mild Lipschitz conditions. Therefore, we design our algorithm based on the optimal adaptive combination and asymptotically achieve the performance of the best mapping as well as the best arm selection policy. This optimality is also guaranteed to hold even in adversarial environments since we do not rely on any statistical assumptions regarding the contexts or the loss of the bandit arms. Moreover, we design an efficient implementation for our algorithm using various hierarchical partitioning structures, such as lexicographical or arbitrary position splitting and binary trees (BTs) (and several other partitioning examples). For instance, in the case of BT partitioning, the computational complexity is only log-linear in the number of regions in the finest partition. In conclusion, we provide significant performance improvements by introducing upper bounds (with respect to the best arm selection policy) that are mathematically proven to vanish in the average loss per round sense at a faster rate compared to the state of the art. Our experimental work extensively covers various scenarios ranging from bandit settings to multiclass classification with real and synthetic data. In these experiments, we show that our algorithm is highly superior to the state-of-the-art techniques while maintaining the introduced mathematical guarantees and a computationally decent scalability.

...read moreread less

Posted Content•

Gradient Ascent for Active Exploration in Bandit Problems.

[...]

Pierre Ménard¹•Institutions (1)

University of Toulouse¹

20 May 2019-arXiv: Machine Learning

TL;DR: It is proved that this algorithm based on an gradient ascent for a general Active Exploration bandit problem in the fixed confidence setting is asymptotically optimal and computationally efficient.

...read moreread less

Abstract: We present a new algorithm based on an gradient ascent for a general Active Exploration bandit problem in the fixed confidence setting. This problem encompasses several well studied problems such that the Best Arm Identification or Thresholding Bandits. It consists of a new sampling rule based on an online lazy mirror ascent. We prove that this algorithm is asymptotically optimal and, most importantly, computationally efficient.

...read moreread less

Posted Content•

Streaming Quantiles Algorithms with Small Space and Update Time.

[...]

Nikita Ivkin¹, Edo Liberty¹, Kevin J. Lang, Zohar Karnin¹, Vladimir Braverman² - Show less +1 more•Institutions (2)

Amazon.com¹, Johns Hopkins University²

29 Jun 2019-arXiv: Data Structures and Algorithms

TL;DR: A practical variants of the first asymptotically optimal algorithm for approximating quantiles and distributions over streaming data with improved constants is provided by providing a specialized data structure for these sketches which reduces both their memory footprints and update times.

...read moreread less

Abstract: Approximating quantiles and distributions over streaming data has been studied for roughly two decades now. Recently, Karnin, Lang, and Liberty proposed the first asymptotically optimal algorithm for doing so. This manuscript complements their theoretical result by providing a practical variants of their algorithm with improved constants. For a given sketch size, our techniques provably reduce the upper bound on the sketch error by a factor of two. These improvements are verified experimentally. Our modified quantile sketch improves the latency as well by reducing the worst case update time from $O(1/\varepsilon)$ down to $O(\log (1/\varepsilon))$. We also suggest two algorithms for weighted item streams which offer improved asymptotic update times compared to naive extensions. Finally, we provide a specialized data structure for these sketches which reduces both their memory footprints and update times.

...read moreread less

Proceedings Article•10.1109/CAC48633.2019.8996415•

Path planning of mobile robot based on Improved RRT Algorithm

[...]

Yang Ying¹, Zhang Li¹, Guo Ruihong¹, Han Yisa¹, Tan Haiyan¹, Meng Junxi¹ - Show less +2 more•Institutions (1)

Xi'an Polytechnic University¹

1 Nov 2019

TL;DR: In the path planning of mobile robot, aiming at the problem that the rapidly-exploring random tree (RRT) algorithm leads to strong randomness and non-progressive optimization in the process of path growth, an improved RRT algorithm is proposed and the ”Bridge Test” is introduced to guide the robot to walk.

...read moreread less

Abstract: In the path planning of mobile robot, aiming at the problem that the rapidly-exploring random tree (RRT) algorithm, which adopts the global uniform sampling strategy, leads to strong randomness and non-progressive optimization in the process of path growth, an improved RRT algorithm is proposed. Firstly, in the sampling stage, we establish the region sampling to select target points, and add gravitational component to solve the problem of RRT algorithm which is too random. Secondly, the obstacle expansion method is adopted to solve the limitation of long sampling time when the growing tree is close to the obstacle area. In view of the difficulty of the narrow-channel robot to pass, the ”Bridge Test” is introduced to guide the robot to walk. The improved algorithm effectively reduces the occupied space memory, running time and number of nodes, shortens the path length at the same time. By comparing the improved RRT algorithm with the basic RRT algorithm and the asymptotically optimal bi-directional rapidly-exploring random tree (B-RRT*) algorithm, the simulation results show that the improved RRT algorithm has shorter path and better time, and is more efficiently.

...read moreread less

Posted Content•

A Bayesian Theory of Change Detection in Statistically Periodic Random Processes

[...]

Taposh Banerjee¹, Prudhvi Gurram², Gene T. Whipps³•Institutions (3)

University of Texas at San Antonio¹, Booz Allen Hamilton², United States Army Research Laboratory³

06 Apr 2019-arXiv: Signal Processing

TL;DR: It is shown that the Bayesian change point problem can be expressed as an optimal control problem of a Markov decision process (MDP) with periodic transition and cost structures and that a nonstationary but periodic policy is optimal.

...read moreread less

Abstract: A new class of stochastic processes called independent and periodically identically distributed (i.p.i.d.) processes is defined to capture periodically varying statistical behavior. A novel Bayesian theory is developed for detecting a change in the distribution of an i.p.i.d. process. It is shown that the Bayesian change point problem can be expressed as a problem of optimal control of a Markov decision process (MDP) with periodic transition and cost structures. Optimal control theory is developed for periodic MDPs for discounted and undiscounted total cost criteria. A fixed-point equation is obtained that is satisfied by the optimal cost function. It is shown that the optimal policy for the MDP is nonstationary but periodic in nature. A value iteration algorithm is obtained to compute the optimal cost function. The results from the MDP theory are then applied to detect changes in i.p.i.d. processes. It is shown that while the optimal change point algorithm is a stopping rule based on a periodic sequence of thresholds, a single-threshold policy is asymptotically optimal, as the probability of false alarm goes to zero. Numerical results are provided to demonstrate that the asymptotically optimal policy is not strictly optimal.

...read moreread less

Journal Article•10.2139/SSRN.3375203•

Asymptotic Optimality of Constant-Order Policies in Joint Pricing and Inventory Control Models

[...]

Xin Chen¹, Alexander L. Stolyar¹, Linwei Xin²•Institutions (2)

University of Illinois at Urbana–Champaign¹, University of Chicago²

09 May 2019-Social Science Research Network

TL;DR: It is proved that the best constant-order dynamic pricing policy is asymptotically optimal as the lead time grows large, which is exactly the setting in which the problem becomes computationally intractable due to the curse of dimensionality.

...read moreread less

Abstract: We consider a classical joint pricing and inventory control problem with lead times, which has been extensively studied in the literature but is notoriously difficult to solve due to the complex structure of the optimal policy. In this work, rather than analyzing the optimal policy, we propose a class of constant-order dynamic pricing policies, which are fundamentally different from base-stock list price policies, the primary emphasis in the existing literature. Under such a policy, a constant-order amount of new inventory is ordered every period and a pricing decision is made based on the inventory level. The policy is independent of the lead time. We prove that the best constant-order dynamic pricing policy is asymptotically optimal as the lead time grows large, which is exactly the setting in which the problem becomes computationally intractable due to the curse of dimensionality. As our main methodological contributions, we establish the convergence to a long-run average random yield inventory model with zero lead time and ordering capacities by its discounted counterpart as the discount factor goes to one, non-trivially extending the previous results in Federgruen and Yang (2014) that analyze a similar model but without capacity constraints.

...read moreread less

Posted Content•

On the asymptotic optimality of the comb strategy for prediction with expert advice

[...]

Erhan Bayraktar, Ibrahim Ekren, Yili Zhang

06 Feb 2019-arXiv: Probability

TL;DR: This work compute the exact leading order expansion for the long time behavior of the value function and uses this expansion to prove that as conjectured in Gravin et al.

...read moreread less

Abstract: For the problem of prediction with expert advice in the adversarial setting with geometric stopping, we compute the exact leading order expansion for the long time behavior of the value function. Then, we use this expansion to prove that as conjectured in Gravin et al. [12], the comb strategies are indeed asymptotically optimal for the adversary in the case of 4 experts.

...read moreread less

...

Expand