Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Asymptotically optimal algorithm
  4. 2019
  1. Home
  2. Topics
  3. Asymptotically optimal algorithm
  4. 2019
Showing papers on "Asymptotically optimal algorithm published in 2019"
Journal Article•10.1109/TWC.2019.2940454•
Fast mmwave Beam Alignment via Correlated Bandit Learning

[...]

Wen Wu1, Nan Cheng2, Ning Zhang3, Peng Yang1, Weihua Zhuang1, Xuemin Shen1 •
University of Waterloo1, Xidian University2, Texas A&M University–Corpus Christi3
17 Sep 2019-IEEE Transactions on Wireless Communications
TL;DR: A learning algorithm to reduce BA latency, namely Hierarchical Beam Alignment (HBA) algorithm, which takes advantage of the correlation structure among beams such that the information from nearby beams is extracted to identify the optimal beam, instead of searching the entire beam space.
Abstract: Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair, which incurs significant BA latency on the order of seconds in the worst case. In this paper, we develop a learning algorithm to reduce BA latency, namely Hierarchical Beam Alignment (HBA) algorithm. We first formulate the BA problem as a stochastic multi-armed bandit problem with the objective to maximize the cumulative received signal strength within a certain period. The proposed algorithm takes advantage of the correlation structure among beams such that the information from nearby beams is extracted to identify the optimal beam, instead of searching the entire beam space. Furthermore, the prior knowledge on the channel fluctuation is incorporated in the proposed algorithm to further accelerate the BA process. Theoretical analysis indicates that the proposed algorithm is asymptotically optimal. Extensive simulation results demonstrate that the proposed algorithm can identify the optimal beam with a high probability and reduce the BA latency from hundreds of milliseconds to a few milliseconds in the multipath channel, as compared to the existing BA method in IEEE 802.11ad.

160 citations

Journal Article•10.1007/S10898-019-00798-7•
Efficient computation of expected hypervolume improvement using box decomposition algorithms

[...]

Kaifeng Yang1, Michael Emmerich1, André H. Deutz1, Thomas Bäck1•
Leiden University1
03 Jul 2019-Journal of Global Optimization
TL;DR: An efficient algorithm for the exact calculation of the EHVI for in a generic case is proposed, based on partitioning the integration volume into a set of axis-parallel slices and a new hyperbox decomposition technique, which is proposed by Dachert et al.
Abstract: In the field of multi-objective optimization algorithms, multi-objective Bayesian Global Optimization (MOBGO) is an important branch, in addition to evolutionary multi-objective optimization algorithms. MOBGO utilizes Gaussian Process models learned from previous objective function evaluations to decide the next evaluation site by maximizing or minimizing an infill criterion. A commonly used criterion in MOBGO is the Expected Hypervolume Improvement (EHVI), which shows a good performance on a wide range of problems, with respect to exploration and exploitation. However, so far, it has been a challenge to calculate exact EHVI values efficiently. This paper proposes an efficient algorithm for the exact calculation of the EHVI for in a generic case. This efficient algorithm is based on partitioning the integration volume into a set of axis-parallel slices. Theoretically, the upper bound time complexities can be improved from previously $$O (n^2)$$ and $$O(n^3)$$ , for two- and three-objective problems respectively, to $$\varTheta (n\log n)$$ , which is asymptotically optimal. This article generalizes the scheme in higher dimensional cases by utilizing a new hyperbox decomposition technique, which is proposed by Dachert et al. (Eur J Oper Res 260(3):841–855, 2017). It also utilizes a generalization of the multilayered integration scheme that scales linearly in the number of hyperboxes of the decomposition. The speed comparison shows that the proposed algorithm in this paper significantly reduces computation time. Finally, this decomposition technique is applied in the calculation of the Probability of Improvement (PoI).

89 citations

Journal Article•10.1109/TAC.2018.2853558•
Sampling-Based Optimal Control Synthesis for Multirobot Systems Under Global Temporal Tasks

[...]

Yiannis Kantaros1, Michael M. Zavlanos1•
Duke University1
01 May 2019-IEEE Transactions on Automatic Control
TL;DR: In this article, the authors propose a sampling-based algorithm that builds incrementally trees that approximate the state space and transitions of the synchronous product automata, which is probabilistically complete and asymptotically optimal.
Abstract: This paper proposes a new optimal control synthesis algorithm for multirobot systems under global temporal logic tasks. Existing planning approaches under global temporal goals rely on graph search techniques applied to a product automaton constructed among the robots. In this paper, we propose a new sampling-based algorithm that builds incrementally trees that approximate the state space and transitions of the synchronous product automaton. By approximating the product automaton by a tree rather than representing it explicitly, we require much fewer memory resources to store it and motion plans can be found by tracing sequences of parent nodes without the need for sophisticated graph search methods. This significantly increases the scalability of our algorithm compared to existing optimal control synthesis methods. We also show that the proposed algorithm is probabilistically complete and asymptotically optimal. Finally, we present numerical experiments showing that our approach can synthesize optimal plans from product automata with billions of states, which is not possible using standard optimal control synthesis algorithms or model checkers.

81 citations

Journal Article•10.1109/TIT.2018.2866257•
Active Anomaly Detection in Heterogeneous Processes

[...]

Boshuang Huang1, Kobi Cohen2, Qing Zhao1•
Cornell University1, Ben-Gurion University of the Negev2
01 Apr 2019-IEEE Transactions on Information Theory
TL;DR: A low-complexity deterministic test is shown to enjoy the same asymptotic optimality while offering significantly better performance in the finite regime and faster convergence to the optimal rate function, especially when the number of processes is large.
Abstract: An active inference problem of detecting anomalies among heterogeneous processes is considered. At each time, a subset of processes can be probed. The objective is to design a sequential probing strategy that dynamically determines which processes to observe at each time and when to terminate the search so that the expected detection time is minimized under a constraint on the probability of misclassifying any process. This problem falls into the general setting of sequential design of experiments pioneered by Chernoff in 1959, in which a randomized strategy, referred to as the Chernoff test, was proposed and shown to be asymptotically optimal as the error probability approaches zero. For the problem considered in this paper, a low-complexity deterministic test is shown to enjoy the same asymptotic optimality while offering significantly better performance in the finite regime and faster convergence to the optimal rate function, especially when the number of processes is large. Furthermore, the proposed test offers considerable reduction in computation complexity.

72 citations

Proceedings Article•
Asymptotically Optimal Planning for Non-Myopic Multi-Robot Information Gathering

[...]

Yiannis Kantaros1, Brent Schlotfeldt2, Nikolay Atanasov3, George J. Pappas2•
Duke University1, University of Pennsylvania2, University of California, San Diego3
22 Jun 2019
TL;DR: This paper proposes a novel highly scalable sampling-based planning algorithm for multi-robot active information acquisition tasks in complex environments that can address large-scale estimation tasks which were previously infeasible.
Abstract: This paper proposes a novel highly scalable sampling-based planning algorithm for multi-robot active information acquisition tasks in complex environments. Active information gathering scenarios include target localization and tracking, active SLAM, surveillance, environmental monitoring and others. The objective is to compute control policies for sensing robots which minimize the accumulated uncertainty of a dynamic hidden state over an a priori unknown horizon. To address this problem, we propose a new sampling-based algorithm that simultaneously explores both the robot motion space and the reachable information space. Unlike relevant samplingbased approaches, we show that the proposed algorithm is probabilistically complete, asymptotically optimal and is supported by convergence rate bounds. Moreover, we demonstrate that by introducing bias in the sampling process towards informative areas, the proposed method can quickly compute sensor policies that achieve desired levels of uncertainty in large-scale estimation tasks that may involve large sensor teams, workspaces, and dimensions of the hidden state. We provide extensive simulation results that corroborate the theoretical analysis and show that the proposed algorithm can address large-scale estimation tasks which were previously infeasible.

63 citations

Posted Content•
Adaptive Exploration in Linear Contextual Bandit

[...]

Botao Hao1, Tor Lattimore2, Csaba Szepesvári2•
Princeton University1, Google2
15 Oct 2019-arXiv: Learning
TL;DR: This work designs an algorithm that is asymptotically optimal and has good finite-time empirical performance and makes connections to the recent literature on when exploration-free methods are effective.
Abstract: Contextual bandits serve as a fundamental model for many sequential decision making tasks. The most popular theoretically justified approaches are based on the optimism principle. While these algorithms can be practical, they are known to be suboptimal asymptotically. On the other hand, existing asymptotically optimal algorithms for this problem do not exploit the linear structure in an optimal way and suffer from lower-order terms that dominate the regret in all practically interesting regimes. We start to bridge the gap by designing an algorithm that is asymptotically optimal and has good finite-time empirical performance. At the same time, we make connections to the recent literature on when exploration-free methods are effective. Indeed, if the distribution of contexts is well behaved, then our algorithm acts mostly greedily and enjoys sub-logarithmic regret. Furthermore, our approach is adaptive in the sense that it automatically detects the nice case. Numerical results demonstrate significant regret reductions by our method relative to several baselines.

53 citations

Journal Article•10.1017/APR.2019.29•
An Asymptotically Optimal Heuristic for General Non-Stationary Finite-Horizon Restless Multi-Armed Multi-Action Bandits

[...]

Gabriel Zayas-Cabán1, Stefanus Jasin2, Guihua Wang2•
University of Wisconsin-Madison1, University of Michigan2
01 Sep 2019-Advances in Applied Probability
TL;DR: Numerical simulations show that the Randomized Assignment Control is asymptotically optimal for a dynamic population, where bandits can randomly arrive and depart the system, and discuss how the framework extends to more general costs and constraints.
Abstract: We propose an asymptotically optimal heuristic, which we term randomized assignment control (RAC) for a restless multi-armed bandit problem with discrete-time and finite states. It is constructed using a linear programming relaxation of the original stochastic control formulation. In contrast to most of the existing literature, we consider a finite-horizon problem with multiple actions and time-dependent (i.e. nonstationary) upper bound on the number of bandits that can be activated at each time period; indeed, our analysis can also be applied in the setting with nonstationary transition matrix and nonstationary cost function. The asymptotic setting is obtained by letting the number of bandits and other related parameters grow to infinity. Our main contribution is that the asymptotic optimality of RAC in this general setting does not require indexability properties or the usual stability conditions of the underlying Markov chain (e.g. unichain) or fluid approximation (e.g. global stable attractor). Moreover, our multi-action setting is not restricted to the usual dominant action concept. Finally, we show that RAC is also asymptotically optimal for a dynamic population, where bandits can randomly arrive and depart the system.

43 citations

Posted Content•
Dynamic optimization with side information.

[...]

Dimitris Bertsimas1, Christopher McCord1, Bradley Sturt1•
Massachusetts Institute of Technology1
17 Jul 2019-arXiv: Optimization and Control
TL;DR: Through a novel measure concentration result for a class of machine learning methods, it is proved that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information.
Abstract: We develop a tractable and flexible approach for incorporating side information into dynamic optimization under uncertainty. The proposed framework uses predictive machine learning methods (such as $k$-nearest neighbors, kernel regression, and random forests) to weight the relative importance of various data-driven uncertainty sets in a robust optimization formulation. Through a novel measure concentration result for a class of machine learning methods, we prove that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information. We also describe a general-purpose approximation for these optimization problems, based on overlapping linear decision rules, which is computationally tractable and produces high-quality solutions for dynamic problems with many stages. Across a variety of examples in inventory management, finance, and shipment planning, our method achieves improvements of up to 15\% over alternatives and requires less than one minute of computation time on problems with twelve stages.

43 citations

Proceedings Article•
Optimized Score Transformation for Fair Classification.

[...]

Dennis Wei1, Karthikeyan Natesan Ramamurthy1, Flavio P. Calmon1•
IBM1
31 May 2019
TL;DR: Comprehensive experiments show that the proposed FairScoreTransformer has advantages for score-based metrics such as Brier score and AUC while remaining competitive for binary label-based metric such as accuracy.
Abstract: This paper considers fair probabilistic classification where the outputs of primary interest are predicted probabilities, commonly referred to as scores We formulate the problem of transforming scores to satisfy fairness constraints that are linear in conditional means of scores while minimizing the loss in utility The formulation can be applied either to post-process classifier outputs or to pre-process training data, thus allowing maximum freedom in selecting a classification algorithm We derive a closed-form expression for the optimal transformed scores and a convex optimization problem for the transformation parameters In the population limit, the transformed score function is the fairness-constrained minimizer of cross-entropy with respect to the optimal unconstrained scores In the finite sample setting, we propose to approach this solution using a combination of standard probabilistic classifiers and ADMM The transformation parameters obtained from the finite-sample procedure are shown to be asymptotically optimal Comprehensive experiments comparing to 10 existing methods show that the proposed FairScoreTransformer has advantages for score-based metrics such as Brier score and AUC while remaining competitive for binary label-based metrics such as accuracy

41 citations

Proceedings Article•
Pure Exploration with Multiple Correct Answers

[...]

Rémy Degenne1, Wouter M. Koolen1•
Centrum Wiskunde & Informatica1
1 Jan 2019
TL;DR: In this article, the authors derived a lower bound for the sample complexity of pure exploration bandit problems with multiple good answers and presented a new algorithm which has asymptotic sample complexity matching the lower bound.
Abstract: We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensure that the existing Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.

37 citations

Journal Article•10.1007/S10463-017-0642-9•
Frequentist model averaging for threshold models

[...]

Yan Gao1, Yan Gao2, Xinyu Zhang1, Shouyang Wang1, Terence Tai-Leung Chong3, Guohua Zou4 •
Chinese Academy of Sciences1, Minzu University of China2, The Chinese University of Hong Kong3, Capital Normal University4
01 Apr 2019-Annals of the Institute of Statistical Mathematics
TL;DR: This paper develops a frequentist model averaging approach for threshold model specifications that is proved to be asymptotically optimal in the sense of achieving the lowest possible squared errors when combining estimators from threshold autoregressive models.
Abstract: This paper develops a frequentist model averaging approach for threshold model specifications. The resulting estimator is proved to be asymptotically optimal in the sense of achieving the lowest possible squared errors. In particular, when combining estimators from threshold autoregressive models, this approach is also proved to be asymptotically optimal. Simulation results show that for the situation where the existing model averaging approach is not applicable, our proposed model averaging approach has a good performance; for the other situations, our proposed model averaging approach performs marginally better than other commonly used model selection and model averaging methods. An empirical application of our approach on the US unemployment data is given.
Journal Article•10.1016/J.AUTOMATICA.2018.12.008•
An efficient simulation procedure for ranking the top simulated designs in the presence of stochastic constraints

[...]

Hui Xiao1, Hu Chen1, Loo Hay Lee2•
Southwestern University of Finance and Economics1, National University of Singapore2
01 May 2019-Automatica
TL;DR: A lower bound on the PCR is proposed and an asymptotically optimal allocation rule is developed based on the lower bound to derive an efficient simulation budget allocation strategy such that the probability of correct ranking (PCR) can be maximized.
Posted Content•
From Predictions to Prescriptions in Multistage Optimization Problems.

[...]

Dimitris Bertsimas1, Christopher McCord1•
Massachusetts Institute of Technology1
26 Apr 2019-arXiv: Machine Learning
TL;DR: This paper introduces a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data, and establishes finite sample guarantees for the optimality of the method with $k$NN weight functions.
Abstract: In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data. We assume the joint distribution of the uncertain quantities is unknown, but noisy observations, along with observations of auxiliary covariates, are available. We utilize effective predictive methods from machine learning (ML), including $k$-nearest neighbors regression ($k$NN), classification and regression trees (CART), and random forests (RF), to develop specific methods that are applicable to a wide variety of problems. We demonstrate that our solution methods are asymptotically optimal under mild conditions. Additionally, we establish finite sample guarantees for the optimality of our method with $k$NN weight functions. Finally, we demonstrate the practicality of our approach with computational examples. We see a significant decrease in cost by taking into account the auxiliary data in the multistage setting.
Proceedings Article•
Adaptive Exploration in Linear Contextual Bandit.

[...]

Botao Hao1, Tor Lattimore2, Csaba Szepesvári2•
Princeton University1, Google2
1 Oct 2019
TL;DR: In this article, the authors propose an algorithm that is asymptotically optimal and has good finite-time empirical performance, and make connections to the recent literature on when exploration-free methods are effective.
Abstract: Contextual bandits serve as a fundamental model for many sequential decision making tasks The most popular theoretically justified approaches are based on the optimism principle While these algorithms can be practical, they are known to be suboptimal asymptotically On the other hand, existing asymptotically optimal algorithms for this problem do not exploit the linear structure in an optimal way and suffer from lower-order terms that dominate the regret in all practically interesting regimes We start to bridge the gap by designing an algorithm that is asymptotically optimal and has good finite-time empirical performance At the same time, we make connections to the recent literature on when exploration-free methods are effective Indeed, if the distribution of contexts is well behaved, then our algorithm acts mostly greedily and enjoys sub-logarithmic regret Furthermore, our approach is adaptive in the sense that it automatically detects the nice case Numerical results demonstrate significant regret reductions by our method relative to several baselines
Proceedings Article•10.1109/COASE.2019.8842957•
Selecting the Optimal System Design under Covariates

[...]

Siyang Gao1, Jianzhong Du1, Chun-Hung Chen2•
City University of Hong Kong1, George Mason University2
1 Aug 2019
TL;DR: This work introduces three measures to evaluate the selection quality over the covariate space and investigates their rate functions of convergence and shows that the selection algorithm can recover the asymptotical optimal allocation in the limit.
Abstract: In this research, we consider the ranking and selection problem in the presence of covariates. It is an important problem in personalized decision making. The performance of each design alternative depends on the values of the covariates to the simulation model for which the relationship is hard to describe analytically. Therefore the optimal design under each possible covariate value needs to be estimated by simulation. This work first introduces three measures to evaluate the selection quality over the covariate space and investigates their rate functions of convergence. By optimizing the rate functions, an asymptotically optimal budget allocation rule is developed and a corresponding selection algorithm is devised. We further show that the selection algorithm can recover the asymptotical optimal allocation in the limit. The high efficiency of the selection algorithm is illustrated via numerical testing.
Posted Content•
Minimax Linear Estimation of the Retargeted Mean

[...]

David A. Hirshberg, Arian Maleki, José R. Zubizarreta
11 Jan 2019-arXiv: Statistics Theory
TL;DR: In this article, the authors study the minimax linear estimator of the treatment-specific mean outcome on a target population and provide a theoretical basis for inference based on it, which is the common practice of ignoring bias when building confidence intervals with these linear estimators.
Abstract: Evaluating treatments received by one population for application to a different target population of scientific interest is a central problem in causal inference from observational studies. We study the minimax linear estimator of the treatment-specific mean outcome on a target population and provide a theoretical basis for inference based on it. In particular, we provide a justification for the common practice of ignoring bias when building confidence intervals with these linear estimators. Focusing on the case that the class of the unknown outcome function is the unit ball of a reproducing kernel Hilbert space, we show that the resulting linear estimator is asymptotically optimal under conditions only marginally stronger than those used with augmented estimators. We establish bounds attesting to the estimator's good finite sample properties. In an extensive simulation study, we observe promising performance of the estimator throughout a wide range of sample sizes, noise levels, and levels of overlap between the covariate distributions of the treated and target populations.
Proceedings Article•10.1109/ISIT.2019.8849721•
Second-Order Asymptotically Optimal Statistical Classification

[...]

Lin Zhou1, Vincent Y. F. Tan2, Mehul Motani2•
University of Michigan1, National University of Singapore2
7 Jul 2019
TL;DR: In this article, the authors consider the second-order asymptotics tradeoff between type-I and type-II error probabilities for tests which ensure that the type I error probability for all pairs of distributions decays exponentially fast and the type II error probability of a particular pair of distributions is nonvanishing.
Abstract: Motivated by real-world machine learning applications, we analyze approximations to the non-asymptotic fundamental limits of statistical classification. In the binary version of this problem, given two training sequences generated according to two unknown distributions P 1 and P 2 , one is tasked to classify a test sequence which is known to be generated according to either P 1 or P 2 . This problem can be thought of as an analogue of the binary hypothesis testing problem but in the present setting, the generating distributions are unknown. Due to finite sample considerations, we consider the second-order asymptotics (or dispersion-type) tradeoff between type-I and type-II error probabilities for tests which ensure that (i) the type-I error probability for all pairs of distributions decays exponentially fast and (ii) the type-II error probability for a particular pair of distributions is non-vanishing. We generalize our results to classification of multiple hypotheses with the rejection option.
Journal Article•10.5705/SS.202015.0392•
Optimal model averaging estimation for partially linear models

[...]

Xinyu Zhang, Wendun Wang
01 Jan 2019-Statistica Sinica
TL;DR: In this article, optimal model averaging for partially linear models with heteroscedasticity is studied and the resulting estimator is proved to be asymptotically optimal under some regularity conditions.
Abstract: textThis article studies optimal model averaging for partially linear models with heteroscedasticity. A Mallows-type criterion is proposed to choose the weight. The resulting model averaging estimator is proved to be asymptotically optimal under some regularity conditions. Simulation experiments suggest that the proposed model averaging method is superior to other commonly used model selection and averaging methods. The proposed procedure is further applied to study Japan’s sovereign credit default swap spreads.
Proceedings Article•10.24963/IJCAI.2019/855•
The Provable Virtue of Laziness in Motion Planning.

[...]

Nika Haghtalab1, Simon Mackenzie2, Ariel D. Procaccia3, Oren Salzman3, Siddhartha S. Srinivasa4 •
Microsoft1, University of New South Wales2, Carnegie Mellon University3, University of Washington4
1 Aug 2019
TL;DR: In this paper, an analytical upper bound on the number of edge evaluations required by Lazy Shortest Path (LazySP) algorithms is derived, and a matching lower bound is shown that these algorithms are asymptotically optimal in the worst case.
Abstract: The Lazy Shortest Path (LazySP) class consists of motion-planning algorithms that only evaluate edges along candidate shortest paths between the source and target. These algorithms were designed to minimize the number of edge evaluations in settings where edge evaluation dominates the running time of the algorithm; but how close to optimal are LazySP algorithms in terms of this objective? Our main result is an analytical upper bound, in a probabilistic model, on the number of edge evaluations required by LazySP algorithms; a matching lower bound shows that these algorithms are asymptotically optimal in the worst case.
Proceedings Article•10.1109/ICASSP.2019.8683450•
Active Anomaly Detection with Switching Cost

[...]

Da Chen1, Qiwei Huang1, Hui Feng1, Qing Zhao2, Bo Hu1 •
Fudan University1, Cornell University2
8 May 2019
TL;DR: In this paper, an active inference strategy consisting of a selection rule governing which process to probe at each time, a stopping rule on when to terminate the detection, and a decision rule on the final detection outcome is considered.
Abstract: The problem of anomaly detection among multiple processes is considered within the framework of sequential design of experiments. The objective is an active inference strategy consisting of a selection rule governing which process to probe at each time, a stopping rule on when to terminate the detection, and a decision rule on the final detection outcome. The performance measure is the Bayes risk that takes into account not only sample complexity and detection errors, but also costs associated with switching across processes. While the problem is a partially observable Markov decision process to which optimal solutions are generally intractable, a low-complexity deterministic policy is shown to be asymptotically optimal and offer significant performance improvement over existing methods in the finite regime.
Journal Article•10.1016/J.JFA.2019.108280•
Improved log-Sobolev inequalities, hypercontractivity and uncertainty principle on the hypercube

[...]

Yury Polyanskiy1, Alex Samorodnitsky2•
Massachusetts Institute of Technology1, Hebrew University of Jerusalem2
01 Dec 2019-Journal of Functional Analysis
TL;DR: In this paper, a family of entropy-energy inequalities for binary hypercube functions with small support is presented. But the tradeoff between the sizes that they derive is asymptotically optimal.
Journal Article•10.1007/S12095-018-0331-4•
Two constructions of asymptotically optimal codebooks

[...]

Gaojun Luo1, Xiwang Cao2, Xiwang Cao1•
Nanjing University of Aeronautics and Astronautics1, Chinese Academy of Sciences2
01 Jul 2019-Cryptography and Communications
TL;DR: The constructions of complex codebooks from the operations of certain sets are proposed and are proved to be asymptotically optimal with respect to the Welch bound.
Abstract: Codebooks with low-coherence have wide applications in many fields such as direct spread code division multiple access communications, compressed sensing, signal processing and so on. In this paper, we propose two constructions of complex codebooks from the operations of certain sets. The complex codebooks produced by these constructions are proved to be asymptotically optimal with respect to the Welch bound. In addition, the parameters of the complex codebooks presented in this paper are new and flexible in some cases.
Journal Article•10.1137/17M113592X•
Trading Fractional Brownian Motion

[...]

Paolo Guasoni, Zsolt Nika, Miklós Rásonyi
05 Sep 2019-Siam Journal on Financial Mathematics
TL;DR: In a market with an asset price described by fractional Brownian motion, which can be traded with temporary nonlinear price impact, the authors find asymptotically optimal strategies for the maximization of the asset price.
Abstract: In a market with an asset price described by fractional Brownian motion, which can be traded with temporary nonlinear price impact, we find asymptotically optimal strategies for the maximization of...
Journal Article•10.1109/TNNLS.2018.2854796•
Asymptotically Optimal Contextual Bandit Algorithm Using Hierarchical Structures

[...]

Mohammadreza Mohaghegh Neyshabouri1, Kaan Gokcesu2, Hakan Gökcesu3, Huseyin Ozkan4, Suleyman S. Kozat1 •
Bilkent University1, Massachusetts Institute of Technology2, École Polytechnique Fédérale de Lausanne3, Sabancı University4
01 Mar 2019-IEEE Transactions on Neural Networks
TL;DR: This work proposes an online algorithm for sequential learning in the contextual multiarmed bandit setting and provides significant performance improvements by introducing upper bounds (with respect to the best arm selection policy) that are mathematically proven to vanish in the average loss per round sense at a faster rate compared to the state of the art.
Abstract: We propose an online algorithm for sequential learning in the contextual multiarmed bandit setting. Our approach is to partition the context space and, then, optimally combine all of the possible mappings between the partition regions and the set of bandit arms in a data-driven manner. We show that in our approach, the best mapping is able to approximate the best arm selection policy to any desired degree under mild Lipschitz conditions. Therefore, we design our algorithm based on the optimal adaptive combination and asymptotically achieve the performance of the best mapping as well as the best arm selection policy. This optimality is also guaranteed to hold even in adversarial environments since we do not rely on any statistical assumptions regarding the contexts or the loss of the bandit arms. Moreover, we design an efficient implementation for our algorithm using various hierarchical partitioning structures, such as lexicographical or arbitrary position splitting and binary trees (BTs) (and several other partitioning examples). For instance, in the case of BT partitioning, the computational complexity is only log-linear in the number of regions in the finest partition. In conclusion, we provide significant performance improvements by introducing upper bounds (with respect to the best arm selection policy) that are mathematically proven to vanish in the average loss per round sense at a faster rate compared to the state of the art. Our experimental work extensively covers various scenarios ranging from bandit settings to multiclass classification with real and synthetic data. In these experiments, we show that our algorithm is highly superior to the state-of-the-art techniques while maintaining the introduced mathematical guarantees and a computationally decent scalability.
Posted Content•
Gradient Ascent for Active Exploration in Bandit Problems.

[...]

Pierre Ménard1•
University of Toulouse1
20 May 2019-arXiv: Machine Learning
TL;DR: It is proved that this algorithm based on an gradient ascent for a general Active Exploration bandit problem in the fixed confidence setting is asymptotically optimal and computationally efficient.
Abstract: We present a new algorithm based on an gradient ascent for a general Active Exploration bandit problem in the fixed confidence setting. This problem encompasses several well studied problems such that the Best Arm Identification or Thresholding Bandits. It consists of a new sampling rule based on an online lazy mirror ascent. We prove that this algorithm is asymptotically optimal and, most importantly, computationally efficient.
Posted Content•
Streaming Quantiles Algorithms with Small Space and Update Time.

[...]

Nikita Ivkin1, Edo Liberty1, Kevin J. Lang, Zohar Karnin1, Vladimir Braverman2 •
Amazon.com1, Johns Hopkins University2
29 Jun 2019-arXiv: Data Structures and Algorithms
TL;DR: A practical variants of the first asymptotically optimal algorithm for approximating quantiles and distributions over streaming data with improved constants is provided by providing a specialized data structure for these sketches which reduces both their memory footprints and update times.
Abstract: Approximating quantiles and distributions over streaming data has been studied for roughly two decades now. Recently, Karnin, Lang, and Liberty proposed the first asymptotically optimal algorithm for doing so. This manuscript complements their theoretical result by providing a practical variants of their algorithm with improved constants. For a given sketch size, our techniques provably reduce the upper bound on the sketch error by a factor of two. These improvements are verified experimentally. Our modified quantile sketch improves the latency as well by reducing the worst case update time from $O(1/\varepsilon)$ down to $O(\log (1/\varepsilon))$. We also suggest two algorithms for weighted item streams which offer improved asymptotic update times compared to naive extensions. Finally, we provide a specialized data structure for these sketches which reduces both their memory footprints and update times.
Proceedings Article•10.1109/CAC48633.2019.8996415•
Path planning of mobile robot based on Improved RRT Algorithm

[...]

Yang Ying1, Zhang Li1, Guo Ruihong1, Han Yisa1, Tan Haiyan1, Meng Junxi1 •
Xi'an Polytechnic University1
1 Nov 2019
TL;DR: In the path planning of mobile robot, aiming at the problem that the rapidly-exploring random tree (RRT) algorithm leads to strong randomness and non-progressive optimization in the process of path growth, an improved RRT algorithm is proposed and the ”Bridge Test” is introduced to guide the robot to walk.
Abstract: In the path planning of mobile robot, aiming at the problem that the rapidly-exploring random tree (RRT) algorithm, which adopts the global uniform sampling strategy, leads to strong randomness and non-progressive optimization in the process of path growth, an improved RRT algorithm is proposed. Firstly, in the sampling stage, we establish the region sampling to select target points, and add gravitational component to solve the problem of RRT algorithm which is too random. Secondly, the obstacle expansion method is adopted to solve the limitation of long sampling time when the growing tree is close to the obstacle area. In view of the difficulty of the narrow-channel robot to pass, the ”Bridge Test” is introduced to guide the robot to walk. The improved algorithm effectively reduces the occupied space memory, running time and number of nodes, shortens the path length at the same time. By comparing the improved RRT algorithm with the basic RRT algorithm and the asymptotically optimal bi-directional rapidly-exploring random tree (B-RRT*) algorithm, the simulation results show that the improved RRT algorithm has shorter path and better time, and is more efficiently.
Posted Content•
A Bayesian Theory of Change Detection in Statistically Periodic Random Processes

[...]

Taposh Banerjee1, Prudhvi Gurram2, Gene T. Whipps3•
University of Texas at San Antonio1, Booz Allen Hamilton2, United States Army Research Laboratory3
06 Apr 2019-arXiv: Signal Processing
TL;DR: It is shown that the Bayesian change point problem can be expressed as an optimal control problem of a Markov decision process (MDP) with periodic transition and cost structures and that a nonstationary but periodic policy is optimal.
Abstract: A new class of stochastic processes called independent and periodically identically distributed (i.p.i.d.) processes is defined to capture periodically varying statistical behavior. A novel Bayesian theory is developed for detecting a change in the distribution of an i.p.i.d. process. It is shown that the Bayesian change point problem can be expressed as a problem of optimal control of a Markov decision process (MDP) with periodic transition and cost structures. Optimal control theory is developed for periodic MDPs for discounted and undiscounted total cost criteria. A fixed-point equation is obtained that is satisfied by the optimal cost function. It is shown that the optimal policy for the MDP is nonstationary but periodic in nature. A value iteration algorithm is obtained to compute the optimal cost function. The results from the MDP theory are then applied to detect changes in i.p.i.d. processes. It is shown that while the optimal change point algorithm is a stopping rule based on a periodic sequence of thresholds, a single-threshold policy is asymptotically optimal, as the probability of false alarm goes to zero. Numerical results are provided to demonstrate that the asymptotically optimal policy is not strictly optimal.
Journal Article•10.2139/SSRN.3375203•
Asymptotic Optimality of Constant-Order Policies in Joint Pricing and Inventory Control Models

[...]

Xin Chen1, Alexander L. Stolyar1, Linwei Xin2•
University of Illinois at Urbana–Champaign1, University of Chicago2
09 May 2019-Social Science Research Network
TL;DR: It is proved that the best constant-order dynamic pricing policy is asymptotically optimal as the lead time grows large, which is exactly the setting in which the problem becomes computationally intractable due to the curse of dimensionality.
Abstract: We consider a classical joint pricing and inventory control problem with lead times, which has been extensively studied in the literature but is notoriously difficult to solve due to the complex structure of the optimal policy. In this work, rather than analyzing the optimal policy, we propose a class of constant-order dynamic pricing policies, which are fundamentally different from base-stock list price policies, the primary emphasis in the existing literature. Under such a policy, a constant-order amount of new inventory is ordered every period and a pricing decision is made based on the inventory level. The policy is independent of the lead time. We prove that the best constant-order dynamic pricing policy is asymptotically optimal as the lead time grows large, which is exactly the setting in which the problem becomes computationally intractable due to the curse of dimensionality. As our main methodological contributions, we establish the convergence to a long-run average random yield inventory model with zero lead time and ordering capacities by its discounted counterpart as the discount factor goes to one, non-trivially extending the previous results in Federgruen and Yang (2014) that analyze a similar model but without capacity constraints.
Posted Content•
On the asymptotic optimality of the comb strategy for prediction with expert advice

[...]

Erhan Bayraktar, Ibrahim Ekren, Yili Zhang
06 Feb 2019-arXiv: Probability
TL;DR: This work compute the exact leading order expansion for the long time behavior of the value function and uses this expansion to prove that as conjectured in Gravin et al.
Abstract: For the problem of prediction with expert advice in the adversarial setting with geometric stopping, we compute the exact leading order expansion for the long time behavior of the value function. Then, we use this expansion to prove that as conjectured in Gravin et al. [12], the comb strategies are indeed asymptotically optimal for the adversary in the case of 4 experts.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve