Top 2974 papers published in the topic of Function (mathematics) in 2017

Showing papers on "Function (mathematics) published in 2017"

Journal Article•10.1007/S10107-016-1034-2•

On the linear convergence of the alternating direction method of multipliers

[...]

Mingyi Hong¹, Zhi-Quan Luo²•Institutions (2)

Iowa State University¹, The Chinese University of Hong Kong²

01 Mar 2017-Mathematical Programming

TL;DR: This paper establishes the global R-linear convergence of the ADMM for minimizing the sum of any number of convex separable functions, assuming that a certain error bound condition holds true and the dual stepsize is sufficiently small.

...read moreread less

Abstract: We analyze the convergence rate of the alternating direction method of multipliers (ADMM) for minimizing the sum of two or more nonsmooth convex separable functions subject to linear constraints. Previous analysis of the ADMM typically assumes that the objective function is the sum of only two convex functions defined on two separable blocks of variables even though the algorithm works well in numerical experiments for three or more blocks. Moreover, there has been no rate of convergence analysis for the ADMM without strong convexity in the objective function. In this paper we establish the global R-linear convergence of the ADMM for minimizing the sum of any number of convex separable functions, assuming that a certain error bound condition holds true and the dual stepsize is sufficiently small. Such an error bound condition is satisfied for example when the feasible set is a compact polyhedron and the objective function consists of a smooth strictly convex function composed with a linear mapping, and a nonsmooth $$\ell _1$$l1 regularizer. This result implies the linear convergence of the ADMM for contemporary applications such as LASSO without assuming strong convexity of the objective function.

...read moreread less

862 citations

Journal Article•10.1016/J.JQSRT.2016.05.028•

LEVEL: A computer program for solving the radial Schrödinger equation for bound and quasibound levels

[...]

Robert J. Le Roy¹•Institutions (1)

University of Waterloo¹

01 Jan 2017-Journal of Quantitative Spectroscopy & Radiative Transfer

TL;DR: Level as mentioned in this paper can automatically locate the bound and/or quasibounded levels of any smooth single- or double-minimum potential, and calculate inertial rotation and centrifugal distortion constants and various expectation values for those levels.

...read moreread less

Abstract: This paper describes program LEVEL, which can solve the radial or one-dimensional Schrodinger equation and automatically locate either all of, or a selected number of, the bound and/or quasibound levels of any smooth single- or double-minimum potential, and calculate inertial rotation and centrifugal distortion constants and various expectation values for those levels. It can also calculate Franck–Condon factors and other off-diagonal matrix elements, either between levels of a single potential or between levels of two different potentials. The potential energy function may be defined by any one of a number of analytic functions, or by a set of input potential function values which the code will interpolate over and extrapolate beyond to span the desired range.

...read moreread less

827 citations

Journal Article•10.1007/S40304-017-0117-6•

Deep Learning-Based Numerical Methods for High-Dimensional Parabolic Partial Differential Equations and Backward Stochastic Differential Equations

[...]

Weinan E¹, Weinan E², Jiequn Han¹, Arnulf Jentzen³•Institutions (3)

Princeton University¹, Peking University², ETH Zurich³

15 Jun 2017

TL;DR: In this article, a new algorithm for solving parabolic partial differential equations and backward stochastic differential equations (BSDEs) in high dimension, which is based on an analogy between the BSDE and reinforcement learning with the gradient of the solution playing the role of the policy function, and the loss function given by the error between the prescribed terminal condition and the solution of BSDE.

...read moreread less

Abstract: We study a new algorithm for solving parabolic partial differential equations (PDEs) and backward stochastic differential equations (BSDEs) in high dimension, which is based on an analogy between the BSDE and reinforcement learning with the gradient of the solution playing the role of the policy function, and the loss function given by the error between the prescribed terminal condition and the solution of the BSDE. The policy function is then approximated by a neural network, as is done in deep reinforcement learning. Numerical results using TensorFlow illustrate the efficiency and accuracy of the studied algorithm for several 100-dimensional nonlinear PDEs from physics and finance such as the Allen–Cahn equation, the Hamilton–Jacobi–Bellman equation, and a nonlinear pricing model for financial derivatives.

...read moreread less

408 citations

Posted Content•

Joint Distribution Optimal Transportation for Domain Adaptation

[...]

Nicolas Courty¹, Rémi Flamary, Amaury Habrard, Alain Rakotomamonjy•Institutions (1)

University of Paris-Sud¹

24 May 2017-arXiv: Machine Learning

TL;DR: This paper proposes a solution of the unsupervised domain adaptation problem with optimal transport, that allows to recover an estimated target $\mathcal{P}^f_t=(X,f(X))$ by optimizing simultaneously the optimal coupling and $f$.

...read moreread less

Abstract: This paper deals with the unsupervised domain adaptation problem, where one wants to estimate a prediction function $f$ in a given target domain without any labeled sample by exploiting the knowledge available from a source domain where labels are known Our work makes the following assumption: there exists a non-linear transformation between the joint feature/label space distributions of the two domain $\mathcal{P}_s$ and $\mathcal{P}_t$ We propose a solution of this problem with optimal transport, that allows to recover an estimated target $\mathcal{P}^f_t=(X,f(X))$ by optimizing simultaneously the optimal coupling and $f$ We show that our method corresponds to the minimization of a bound on the target error, and provide an efficient algorithmic solution, for which convergence is proved The versatility of our approach, both in terms of class of hypothesis or loss functions is demonstrated with real world classification and regression problems, for which we reach or surpass state-of-the-art results

...read moreread less

344 citations

Journal Article•10.1109/TEVC.2017.2694221•

DG2: A Faster and More Accurate Differential Grouping for Large-Scale Black-Box Optimization

[...]

Mohammad Nabi Omidvar¹, Ming Yang², Yi Mei³, Xiaodong Li⁴, Xin Yao¹ - Show less +1 more•Institutions (4)

University of Birmingham¹, China University of Geosciences (Wuhan)², Victoria University of Wellington³, RMIT University⁴

25 Apr 2017-IEEE Transactions on Evolutionary Computation

TL;DR: The proposed improved variant of the differential grouping (DG) algorithm, DG2, finds a reliable threshold value by estimating the magnitude of roundoff errors and automatic calculation of its threshold parameter, which makes it parameter-free.

...read moreread less

Abstract: Identification of variable interaction is essential for an efficient implementation of a divide-and-conquer algorithm for large-scale black-box optimization. In this paper, we propose an improved variant of the differential grouping (DG) algorithm, which has a better efficiency and grouping accuracy. The proposed algorithm, DG2, finds a reliable threshold value by estimating the magnitude of roundoff errors. With respect to efficiency, DG2 reuses the sample points that are generated for detecting interactions and saves up to half of the computational resources on fully separable functions. We mathematically show that the new sampling technique achieves the lower bound with respect to the number of function evaluations. Unlike its predecessor, DG2 checks all possible pairs of variables for interactions and has the capacity to identify overlapping components of an objective function. On the accuracy aspect, DG2 outperforms the state-of-the-art decomposition methods on the latest large-scale continuous optimization benchmark suites. DG2 also performs reliably in the presence of imbalance among contribution of components in an objective function. Another major advantage of DG2 is the automatic calculation of its threshold parameter ( $\epsilon $ ), which makes it parameter-free. Finally, the experimental results show that when DG2 is used within a cooperative co-evolutionary framework, it can generate competitive results as compared to several state-of-the-art algorithms.

...read moreread less

336 citations

Posted Content•

On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning

[...]

Bolin Gao, Lacra Pavel

03 Apr 2017-arXiv: Optimization and Control

TL;DR: This paper shows that the softmax function is the monotone gradient map of the log-sum-exp function and exploits the inverse temperature parameter to derive the Lipschitz and co-coercivity properties of thesoftmax function.

...read moreread less

Abstract: In this paper, we utilize results from convex analysis and monotone operator theory to derive additional properties of the softmax function that have not yet been covered in the existing literature. In particular, we show that the softmax function is the monotone gradient map of the log-sum-exp function. By exploiting this connection, we show that the inverse temperature parameter determines the Lipschitz and co-coercivity properties of the softmax function. We then demonstrate the usefulness of these properties through an application in game-theoretic reinforcement learning.

...read moreread less

319 citations

Journal Article•10.1109/TAC.2016.2593899•

Distributed Continuous-Time Convex Optimization With Time-Varying Cost Functions

[...]

Salar Rahili¹, Wei Ren¹•Institutions (1)

University of California, Riverside¹

01 Apr 2017-IEEE Transactions on Automatic Control

TL;DR: A time-varying distributed convex optimization problem is studied for continuous-time multi-agent systems and it is shown that the center of the agents tracks the optimal trajectory, the connectivity of the Agents is maintained, and interagent collision is avoided.

...read moreread less

Abstract: In this paper, a time-varying distributed convex optimization problem is studied for continuous-time multi-agent systems. The objective is to minimize the sum of local time-varying cost functions, each of which is known to only an individual agent, through local interaction. Here, the optimal point is time varying and creates an optimal trajectory. Control algorithms are designed for the cases of single-integrator and double-integrator dynamics. In both cases, a centralized approach is first introduced to solve the optimization problem. Then, this problem is solved in a distributed manner and a discontinuous algorithm based on the signum function is proposed in each case. In the case of single-integrator (respectively, double-integrator) dynamics, each agent relies only on its own position and the relative positions (respectively, positions and velocities) between itself and its neighbors. A gain adaption scheme is introduced in both algorithms to eliminate certain global information requirement. To relax the restricted assumption imposed on feasible cost functions, an estimator based algorithm using the signum function is proposed, where each agent uses dynamic average tracking as a tool to estimate the centralized control input. As a tradeoff, the estimator-based algorithm necessitates communication between neighbors. Then, in the case of double-integrator dynamics, the proposed algorithms are further extended. Two continuous algorithms based on, respectively, a time-varying and a fixed boundary layer are proposed as continuous approximations of the signum function. To account for interagent collision for physical agents, a distributed convex optimization problem with swarm tracking behavior is introduced for both single-integrator and double-integrator dynamics. It is shown that the center of the agents tracks the optimal trajectory, the connectivity of the agents is maintained, and interagent collision is avoided.

...read moreread less

284 citations

Journal Article•10.1038/S41598-017-09098-0•

Machine learning quantum phases of matter beyond the fermion sign problem

[...]

Peter Broecker¹, Juan Carrasquilla², Roger G. Melko³, Roger G. Melko², Simon Trebst¹ - Show less +1 more•Institutions (3)

University of Cologne¹, Perimeter Institute for Theoretical Physics², University of Waterloo³

18 Aug 2017-Scientific Reports

TL;DR: In this article, a convolutional neural network (CNN) was used to identify and locate quantum phase transitions in quantum many-fermion systems using auxiliary-field quantum Monte Carlo simulations.

...read moreread less

Abstract: State-of-the-art machine learning techniques promise to become a powerful tool in statistical mechanics via their capacity to distinguish different phases of matter in an automated way. Here we demonstrate that convolutional neural networks (CNN) can be optimized for quantum many-fermion systems such that they correctly identify and locate quantum phase transitions in such systems. Using auxiliary-field quantum Monte Carlo (QMC) simulations to sample the many-fermion system, we show that the Green’s function holds sufficient information to allow for the distinction of different fermionic phases via a CNN. We demonstrate that this QMC + machine learning approach works even for systems exhibiting a severe fermion sign problem where conventional approaches to extract information from the Green’s function, e.g. in the form of equal-time correlation functions, fail.

...read moreread less

272 citations

Journal Article•

Information-geometric optimization algorithms: a unifying picture via invariance principles

[...]

Yann Ollivier¹, Ludovic Arnold, Anne Auger², Nikolaus Hansen²•Institutions (2)

Université Paris-Saclay¹, École Polytechnique²

01 Jan 2017-Journal of Machine Learning Research

TL;DR: A canonical way to turn any smooth parametric family of probability distributions on an arbitrary search space X into a continuous-time black-box optimization method on X, the information-geometric optimization (IGO) method, which achieves maximal invariance properties.

...read moreread less

Abstract: We present a canonical way to turn any smooth parametric family of probability distributions on an arbitrary search space X into a continuous-time black-box optimization method on X, the information-geometric optimization (IGO) method. Invariance as a major design principle keeps the number of arbitrary choices to a minimum. The resulting IGO flow is the flow of an ordinary differential equation conducting the natural gradient ascent of an adaptive, time-dependent transformation of the objective function. It makes no particular assumptions on the objective function to be optimized. The IGO method produces explicit IGO algorithms through time discretization. It naturally recovers versions of known algorithms and offers a systematic way to derive new ones. In continuous search spaces, IGO algorithms take a form related to natural evolution strategies (NES). The cross-entropy method is recovered in a particular case with a large time step, and can be extended into a smoothed, parametrization-independent maximum likelihood update (IGO-ML). When applied to the family of Gaussian distributions on Rd, the IGO framework recovers a version of the well-known CMA-ES algorithm and of xNES. For the family of Bernoulli distributions on {0, 1}d, we recover the seminal PBIL algorithm and cGA. For the distributions of restricted Boltzmann machines, we naturally obtain a novel algorithm for discrete optimization on {0, 1}d. All these algorithms are natural instances of, and unified under, the single information-geometric optimization framework. The IGO method achieves, thanks to its intrinsic formulation, maximal invariance properties: invariance under reparametrization of the search space X, under a change of parameters of the probability distribution, and under increasing transformation of the function to be optimized. The latter is achieved through an adaptive, quantile-based formulation of the objective. Theoretical considerations strongly suggest that IGO algorithms are essentially characterized by a minimal change of the distribution over time. Therefore they have minimal loss in diversity through the course of optimization, provided the initial diversity is high. First experiments using restricted Boltzmann machines confirm this insight. As a simple consequence, IGO seems to provide, from information theory, an elegant way to simultaneously explore several valleys of a fitness landscape in a single run.

...read moreread less

265 citations

Journal Article•

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

[...]

Ohad Shamir

01 Jan 2017-Journal of Machine Learning Research

TL;DR: In this paper, a bandit convex optimization with two-point feedback and zero-order stochastic convex optimisation with two function evaluations per round is considered. And the algorithm is based on a small but surprisingly powerful modification of the gradient estimator.

...read moreread less

Abstract: We consider the closely related problems of bandit convex optimization with two-point feedback, and zero-order stochastic convex optimization with two function evaluations per round. We provide a simple algorithm and analysis which is optimal for convex Lipschitz functions. This improves on \cite{dujww13}, which only provides an optimal result for smooth functions; Moreover, the algorithm and analysis are simpler, and readily extend to non-Euclidean problems. The algorithm is based on a small but surprisingly powerful modification of the gradient estimator.

...read moreread less

253 citations

Proceedings Article•10.14288/1.0357417•

Joint Distribution Optimal Transportation for Domain Adaptation

[...]

Nicolas Courty¹, Rémi Flamary, Amaury Habrard, Alain Rakotomamonjy•Institutions (1)

University of Paris-Sud¹

24 May 2017

TL;DR: In this article, a non-linear transformation between the joint feature/label space distributions of the two domains P s and P t can be estimated with optimal transport, which corresponds to the minimization of a bound on the target error.

...read moreread less

Abstract: This paper deals with the unsupervised domain adaptation problem, where one wants to estimate a prediction function f in a given target domain without any labeled sample by exploiting the knowledge available from a source domain where labels are known. Our work makes the following assumption: there exists a non-linear transformation between the joint feature/label space distributions of the two domain P s and P t that can be estimated with optimal transport. We propose a solution of this problem that allows to recover an estimated target P f t = (X, f (X)) by optimizing simultaneously the optimal coupling and f. We show that our method corresponds to the minimization of a bound on the target error, and provide an efficient algorithmic solution, for which convergence is proved. The versatility of our approach, both in terms of class of hypothesis or loss functions is demonstrated with real world classification and regression problems, for which we reach or surpass state-of-the-art results.

...read moreread less

Journal Article•10.1109/TAC.2016.2628807•

Distributed Continuous-Time Algorithm for Constrained Convex Optimizations via Nonsmooth Analysis Approach

[...]

Xianlin Zeng¹, Peng Yi², Yiguang Hong¹•Institutions (2)

Chinese Academy of Sciences¹, University of Toronto²

01 Oct 2017-IEEE Transactions on Automatic Control

TL;DR: In this article, a distributed continuous-time projected algorithm for convex cost functions with local constraints is proposed, in which each agent knows its local cost function and local constraint set, and proves that all the agents of the algorithm can find the same optimal solution.

...read moreread less

Abstract: This technical note studies the distributed optimization problem of a sum of nonsmooth convex cost functions with local constraints. At first, we propose a novel distributed continuous-time projected algorithm, in which each agent knows its local cost function and local constraint set, for the constrained optimization problem. Then we prove that all the agents of the algorithm can find the same optimal solution, and meanwhile, keep the states bounded while seeking the optimal solutions. We conduct a complete convergence analysis by employing nonsmooth Lyapunov functions for the stability analysis of differential inclusions. Finally, we provide a numerical example for illustration.

...read moreread less

Posted Content•

Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities

[...]

Guo-Jun Qi

23 Jan 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a loss-sensitive GAN (LS-GAN) is proposed to distinguish between real and fake samples by designated margins, while learning a generator alternately to produce realistic samples by minimizing their losses.

...read moreread less

Abstract: In this paper, we present the Lipschitz regularization theory and algorithms for a novel Loss-Sensitive Generative Adversarial Network (LS-GAN). Specifically, it trains a loss function to distinguish between real and fake samples by designated margins, while learning a generator alternately to produce realistic samples by minimizing their losses. The LS-GAN further regularizes its loss function with a Lipschitz regularity condition on the density of real data, yielding a regularized model that can better generalize to produce new data from a reasonable number of training examples than the classic GAN. We will further present a Generalized LS-GAN (GLS-GAN) and show it contains a large family of regularized GAN models, including both LS-GAN and Wasserstein GAN, as its special cases. Compared with the other GAN models, we will conduct experiments to show both LS-GAN and GLS-GAN exhibit competitive ability in generating new images in terms of the Minimum Reconstruction Error (MRE) assessed on a separate test set. We further extend the LS-GAN to a conditional form for supervised and semi-supervised learning problems, and demonstrate its outstanding performance on image classification tasks.

...read moreread less

Journal Article•10.1109/TAC.2017.2648041•

Non-Convex Distributed Optimization

[...]

Tatiana Tatarenko¹, Behrouz Touri²•Institutions (2)

Technische Universität Darmstadt¹, University of Colorado Boulder²

05 Jan 2017-IEEE Transactions on Automatic Control

TL;DR: In this article, the authors study distributed non-convex optimization on a time-varying multi-agent network, where each node has access to its own smooth local cost function, and the collective goal is to minimize the sum of these functions.

...read moreread less

Abstract: We study distributed non-convex optimization on a time-varying multi-agent network. Each node has access to its own smooth local cost function, and the collective goal is to minimize the sum of these functions. The perturbed push-sum algorithm was previously used for convex distributed optimization. We generalize the result obtained for the convex case to the case of non-convex functions. Under some additional technical assumptions on the gradients we prove the convergence of the distributed push-sum algorithm to some critical point of the objective function. By utilizing perturbations on the update process, we show the almost sure convergence of the perturbed dynamics to a local minimum of the global objective function, if the objective function has no saddle points. Our analysis shows that this perturbed procedure converges at a rate of $O(1/t)$ .

...read moreread less

Journal Article•10.1287/MOOR.2016.0842•

Optimal Approximation for Submodular and Supermodular Optimization with Bounded Curvature

[...]

Maxim Sviridenko, Jan Vondrák, Justin Ward

16 May 2017-Mathematics of Operations Research

TL;DR: In this paper, a (1 − c/e)-approximation algorithm was proposed for the problem of maximizing a monotone increasing submodular function subject to a single matroid constraint.

...read moreread less

Abstract: We design new approximation algorithms for the problems of optimizing submodular and supermodular functions subject to a single matroid constraint. Specifically, we consider the case in which we wish to maximize a monotone increasing submodular function or minimize a monotone decreasing supermodular function with a bounded total curvature c. Intuitively, the parameter c represents how nonlinear a function f is: when c = 0, f is linear, while for c = 1, f may be an arbitrary monotone increasing submodular function. For the case of submodular maximization with total curvature c, we obtain a (1 − c/e)-approximation—the first improvement over the greedy algorithm of of Conforti and Cornuejols from 1984, which holds for a cardinality constraint, as well as a recent analogous result for an arbitrary matroid constraint. Our approach is based on modifications of the continuous greedy algorithm and nonoblivious local search, and allows us to approximately maximize the sum of a nonnegative, monotone increasing subm...

...read moreread less

Journal Article•10.1109/TCYB.2016.2542923•

Discrete-Time Deterministic $Q$ -Learning: A Novel Convergence Analysis

[...]

Qinglai Wei¹, Frank L. Lewis², Qiuye Sun³, Pengfei Yan¹, Ruizhuo Song⁴ - Show less +1 more•Institutions (4)

Chinese Academy of Sciences¹, University of Texas at Arlington², Northeastern University (China)³, University of Science and Technology Beijing⁴

01 May 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: A novel discrete-time deterministic deterministic inline-formula-learning algorithm is developed and the convergence criterion for the discounted case is established, and the iterative control law of the developed algorithm is simplified.

...read moreread less

Abstract: In this paper, a novel discrete-time deterministic $ Q$ -learning algorithm is developed. In each iteration of the developed $ Q$ -learning algorithm, the iterative $ Q$ function is updated for all the state and control spaces, instead of updating for a single state and a single control in traditional $ Q$ -learning algorithm. A new convergence criterion is established to guarantee that the iterative $ Q$ function converges to the optimum, where the convergence criterion of the learning rates for traditional $ Q$ -learning algorithms is simplified. During the convergence analysis, the upper and lower bounds of the iterative $ Q$ function are analyzed to obtain the convergence criterion, instead of analyzing the iterative $ Q$ function itself. For convenience of analysis, the convergence properties for undiscounted case of the deterministic $ Q$ -learning algorithm are first developed. Then, considering the discounted factor, the convergence criterion for the discounted case is established. Neural networks are used to approximate the iterative $ Q$ function and compute the iterative control law, respectively, for facilitating the implementation of the deterministic $ Q$ -learning algorithm. Finally, simulation results and comparisons are given to illustrate the performance of the developed algorithm.

...read moreread less

Proceedings Article•

Sobolev Training for Neural Networks

[...]

Wojciech Marian Czarnecki¹, Simon Osindero², Max Jaderberg¹, Grzegorz Swirszcz¹, Razvan Pascanu¹ - Show less +1 more•Institutions (2)

Google¹, Yahoo!²

15 Jun 2017

TL;DR: Sobolev Training for neural networks is introduced, which is a method for incorporating target derivatives in addition the to target values while training, and results in models with higher accuracy and stronger generalisation on three distinct domains.

...read moreread less

Abstract: At the heart of deep learning we aim to use neural networks as function approximators - training them to produce outputs from inputs in emulation of a ground truth function or data creation process. In many cases we only have access to input-output pairs from the ground truth, however it is becoming more common to have access to derivatives of the target output with respect to the input -- for example when the ground truth function is itself a neural network such as in network compression or distillation. Generally these target derivatives are not computed, or are ignored. This paper introduces Sobolev Training for neural networks, which is a method for incorporating these target derivatives in addition the to target values while training. By optimising neural networks to not only approximate the function’s outputs but also the function’s derivatives we encode additional information about the target function within the parameters of the neural network. Thereby we can improve the quality of our predictors, as well as the data-efficiency and generalization capabilities of our learned function approximation. We provide theoretical justifications for such an approach as well as examples of empirical evidence on three distinct domains: regression on classical optimisation datasets, distilling policies of an agent playing Atari, and on large-scale applications of synthetic gradients. In all three domains the use of Sobolev Training, employing target derivatives in addition to target values, results in models with higher accuracy and stronger generalisation.

...read moreread less

Journal Article•10.1137/15M1026924•

Saddle-point dynamics: conditions for asymptotic stability of saddle points

[...]

Ashish Cherukuri¹, Bahman Gharesifard², Jorge E. Cortes¹•Institutions (2)

University of California, San Diego¹, Queen's University²

22 Feb 2017-Siam Journal on Control and Optimization

TL;DR: In this article, the authors consider continuous differentiable functions with min-max saddle points and study the asymptotic convergence properties of the associated saddle-point dynamics (gradient descent in the first variable and gradient ascent in the second one).

...read moreread less

Abstract: This paper considers continuously differentiable functions of two vector variables that have (possibly a continuum of) min-max saddle points. We study the asymptotic convergence properties of the associated saddle-point dynamics (gradient descent in the first variable and gradient ascent in the second one). We identify a suite of complementary conditions under which the set of saddle points is asymptotically stable under the saddle-point dynamics. Our first set of results is based on the convexity-concavity of the function defining the saddle-point dynamics to establish the convergence guarantees. For functions that do not enjoy this feature, our second set of results relies on properties of the linearization of the dynamics, the function along the proximal normals to the saddle set, and the linearity of the function in one variable. We also provide global versions of the asymptotic convergence results. Various examples illustrate our discussion.

...read moreread less

Journal Article•10.1109/TSP.2018.2835403•

Global optimality in low-rank matrix optimization

[...]

Zhihui Zhu¹, Qiuwei Li¹, Gongguo Tang¹, Michael B. Wakin¹•Institutions (1)

Colorado School of Mines¹

25 Feb 2017

TL;DR: In this paper, the authors consider the minimization of a general objective function over a set of rectangular matrices that have rank at most r. Despite the resulting nonconvexity, recent studies in matrix completion and sensing have shown that the factored problem has no spurious local minima and obeys the strict saddle property.

...read moreread less

Abstract: This paper considers the minimization of a general objective function $f(\boldsymbol{X})$ over the set of rectangular $n\times m$ matrices that have rank at most $r$ . To reduce the computational burden, we factorize the variable $\boldsymbol{X}$ into a product of two smaller matrices and optimize over these two matrices instead of $\boldsymbol{X}$ . Despite the resulting nonconvexity, recent studies in matrix completion and sensing have shown that the factored problem has no spurious local minima and obeys the so-called strict saddle property (the function has a directional negative curvature at all critical points but local minima). We analyze the global geometry for a general and yet well-conditioned objective function $f(\boldsymbol{X})$ whose restricted strong convexity and restricted strong smoothness constants are comparable. In particular, we show that the reformulated objective function has no spurious local minima and obeys the strict saddle property. These geometric properties imply that a number of iterative optimization algorithms (such as gradient descent) can provably solve the factored problem with global convergence.

...read moreread less

Journal Article•10.1007/JHEP08(2017)146•

More on Supersymmetric and 2d Analogs of the SYK Model

[...]

Jeff Murugan¹, Jeff Murugan², Douglas Stanford¹, Edward Witten¹•Institutions (2)

Institute for Advanced Study¹, University of Cape Town²

16 Jun 2017-arXiv: High Energy Physics - Theory

TL;DR: In this article, supersymmetric and 2D analogies of the SYK model were explored, which consist of super-conformal eigenfunctions appropriate for expanding a four-point function.

...read moreread less

Abstract: In this paper, we explore supersymmetric and 2d analogs of the SYK model. We begin by working out a basis of (super)conformal eigenfunctions appropriate for expanding a four-point function. We use this to clarify some details of the 1d supersymmetric SYK model. We then introduce new bosonic and supersymmetric analogs of SYK in two dimensions. These theories consist of $N$ fields interacting with random $q$-field interactions. Although models built entirely from bosons appear to be problematic, we find a supersymmetric model that flows to a large $N$ CFT with interaction strength of order one. We derive an integral formula for the four-point function at order $1/N$, and use it to compute the central charge, chaos exponent and some anomalous dimensions. We describe a problem that arises if one tries to find a 2d SYK-like CFT with a continuous global symmetry.

...read moreread less

Journal Article•10.1002/INT.21898•

A Novel Improved Accuracy Function for Interval Valued Pythagorean Fuzzy Sets and Its Applications in the Decision-Making Process

[...]

Harish Garg¹•Institutions (1)

Thapar University¹

01 Dec 2017-International Journal of Intelligent Systems

TL;DR: An improved accuracy function for the ranking order of interval‐valued Pythagorean fuzzy sets (IVPFSs) is presented and multicriteria decision‐making method has been proposed for finding the desirable alternative(s).

...read moreread less

Abstract: The objective of this work is to present an improved accuracy function for the ranking order of interval-valued Pythagorean fuzzy sets (IVPFSs). Shortcomings of the existing score and accuracy functions in interval-valued Pythagorean environment have been overcome by the proposed accuracy function. In the proposed function, degree of hesitation between the element of IVPFS has been taken into account during the analysis. Based on it, multicriteria decision-making method has been proposed for finding the desirable alternative(s). Finally, an illustrative example for solving the decision-making problem has been presented to demonstrate application of the proposed approach.

...read moreread less

Journal Article•10.1007/S10107-017-1161-4•

Accelerated schemes for a class of variational inequalities

[...]

Yunmei Chen¹, Guanghui Lan², Yuyuan Ouyang³•Institutions (3)

University of Florida¹, Georgia Institute of Technology², Clemson University³

01 Sep 2017-Mathematical Programming

TL;DR: In this article, a stochastic accelerated mirror-prox (SAMP) method was proposed for solving a class of monotone variational inequalities (SVI), which is based on a multi-step acceleration scheme.

...read moreread less

Abstract: We propose a novel stochastic method, namely the stochastic accelerated mirror-prox (SAMP) method, for solving a class of monotone stochastic variational inequalities (SVI). The main idea of the proposed algorithm is to incorporate a multi-step acceleration scheme into the stochastic mirror-prox method. The developed SAMP method computes weak solutions with the optimal iteration complexity for SVIs. In particular, if the operator in SVI consists of the stochastic gradient of a smooth function, the iteration complexity of the SAMP method can be accelerated in terms of their dependence on the Lipschitz constant of the smooth function. For SVIs with bounded feasible sets, the bound of the iteration complexity of the SAMP method depends on the diameter of the feasible set. For unbounded SVIs, we adopt the modified gap function introduced by Monteiro and Svaiter for solving monotone inclusion, and show that the iteration complexity of the SAMP method depends on the distance from the initial point to the set of strong solutions. It is worth noting that our study also significantly improves a few existing complexity results for solving deterministic variational inequality problems. We demonstrate the advantages of the SAMP method over some existing algorithms through our preliminary numerical experiments.

...read moreread less

Journal Article•10.1016/J.AUTOMATICA.2017.01.002•

Sliding mode control for singular stochastic Markovian jump systems with uncertainties

[...]

Qingling Zhang¹, Qingling Zhang², Li Li¹, Xing-Gang Yan², Sarah K. Spurgeon³ - Show less +1 more•Institutions (3)

Northeastern University (China)¹, University of Kent², University College London³

01 May 2017-Automatica

TL;DR: A set of new sufficient conditions is developed which not only guarantees the stochastic admissibility of the sliding mode dynamics, but also determines all the parameter matrices in the integral sliding function.

...read moreread less

Journal Article•10.1080/00401706.2016.1251495•

Bayesian Design of Experiments Using Approximate Coordinate Exchange

[...]

Antony M. Overstall¹, David C. Woods¹•Institutions (1)

University of Southampton¹

27 Apr 2017-Technometrics

TL;DR: In this article, a Gaussian process emulator is used to approximate the expected utility as a function of a single design coordinate in a series of conditional optimization steps to find multi-variable designs without resorting to asymptotic approximations to the posterior distribution or expected utility.

...read moreread less

Abstract: The construction of decision-theoretical Bayesian designs for realistically complex nonlinear models is computationally challenging, as it requires the optimization of analytically intractable expected utility functions over high-dimensional design spaces. We provide the most general solution to date for this problem through a novel approximate coordinate exchange algorithm. This methodology uses a Gaussian process emulator to approximate the expected utility as a function of a single design coordinate in a series of conditional optimization steps. It has flexibility to address problems for any choice of utility function and for a wide range of statistical models with different numbers of variables, numbers of runs and randomization restrictions. In contrast to existing approaches to Bayesian design, the method can find multi-variable designs in large numbers of runs without resorting to asymptotic approximations to the posterior distribution or expected utility. The methodology is demonstrated on...

...read moreread less

Journal Article•10.1103/PHYSREVD.96.043505•

Real no-boundary wave function in Lorentzian quantum cosmology

[...]

J. Diaz Dorronsoro¹, Jonathan J. Halliwell², James B. Hartle³, Thomas Hertog¹, Oliver Janssen - Show less +1 more•Institutions (3)

Katholieke Universiteit Leuven¹, Imperial College London², University of California, Santa Barbara³

07 Aug 2017-Physical Review D

TL;DR: In this article, the standard no-boundary wave function has a natural expression in terms of a Lorentzian path integral with its contour defined by Picard-Lefschetz theory.

...read moreread less

Abstract: It is shown that the standard no-boundary wave function has a natural expression in terms of a Lorentzian path integral with its contour defined by Picard-Lefschetz theory The wave function is real, satisfies the Wheeler-DeWitt equation and predicts an ensemble of asymptotically classical, inflationary universes with nearly-Gaussian fluctuations and with a smooth semiclassical origin

...read moreread less

Journal Article•10.1016/J.AUTOMATICA.2017.06.035•

Function perturbations on singular Boolean networks

[...]

Yang Liu¹, Yang Liu², Baowen Li², Hongwei Chen¹, Jinde Cao¹, Jinde Cao³ - Show less +2 more•Institutions (3)

Southeast University¹, Zhejiang Normal University², Shandong Normal University³

01 Oct 2017-Automatica

TL;DR: The algebraic form of an SBN is given, and how the transition matrix of the SBN changes under function perturbations is discussed, under which the impacts of function perturbed structure changes on the topological structure are investigated.

...read moreread less

Journal Article•10.1007/JHEP06(2017)127•

A non-planar two-loop three-point function beyond multiple polylogarithms

[...]

Andreas von Manteuffel¹, Andreas von Manteuffel², Lorenzo Tancredi³•Institutions (3)

Michigan State University¹, University of Mainz², Karlsruhe Institute of Technology³

20 Jan 2017-arXiv: High Energy Physics - Phenomenology

TL;DR: In this article, the analytic calculation of a two-loop non-planar three-point function which contributes to the twoloop amplitudes for $t \bar{t}$ production and $gamma \gamma$ production in gluon fusion through a massive top-quark loop is considered.

...read moreread less

Abstract: We consider the analytic calculation of a two-loop non-planar three-point function which contributes to the two-loop amplitudes for $t \bar{t}$ production and $\gamma \gamma$ production in gluon fusion through a massive top-quark loop. All subtopology integrals can be written in terms of multiple polylogarithms over an irrational alphabet and we employ a new method for the integration of the differential equations which does not rely on the rationalization of the latter. The top topology integrals, instead, in spite of the absence of a massive three-particle cut, cannot be evaluated in terms of multiple polylogarithms and require the introduction of integrals over complete elliptic integrals and polylogarithms. We provide one-fold integral representations for the solutions and continue them analytically to all relevant regions of the phase space in terms of real function, extracting all imaginary parts explicitly. The numerical evaluation of our expressions becomes straightforward in this way.

...read moreread less

Journal Article•

Distributed Learning with Regularized Least Squares

[...]

Shaobo Lin, Xin Guo, Ding-Xuan Zhou

01 Jan 2017-Journal of Machine Learning Research

TL;DR: In this article, the authors study distributed learning with the least squares regularization scheme in a reproducing kernel Hilbert space (RKHS) and show that the global output function of this distributed learning is a good approximation to the algorithm processing the whole data in one single machine.

...read moreread less

Abstract: We study distributed learning with the least squares regularization scheme in a reproducing kernel Hilbert space (RKHS). By a divide-and-conquer approach, the algorithm partitions a data set into disjoint data subsets, applies the least squares regularization scheme to each data subset to produce an output function, and then takes an average of the individual output functions as a final global estimator or predictor. We show with error bounds in expectation in both the $L^2$-metric and RKHS-metric that the global output function of this distributed learning is a good approximation to the algorithm processing the whole data in one single machine. Our error bounds are sharp and stated in a general setting without any eigenfunction assumption. The analysis is achieved by a novel second order decomposition of operator differences in our integral operator approach. Even for the classical least squares regularization scheme in the RKHS associated with a general kernel, we give the best learning rate in the literature.

...read moreread less

Journal Article•10.1016/J.IFACOL.2017.08.1026•

Transfer Function Estimation in System Identification Toolbox via Vector Fitting

[...]

Ahmet Arda Ozdemir¹, Suat Gumussoy¹•Institutions (1)

MathWorks¹

01 Jul 2017-IFAC-PapersOnLine

TL;DR: The first contribution is a bilinear mapping of the original problem from the imaginary axis onto the unit disk, which improves the numerics of the underlying Sanathanan-Koerner iterations and the more recent instrumental-variable iterations.

...read moreread less

Journal Article•10.1002/RNC.3596•

Adaptive finite‐time fault‐tolerant tracking control for a class of MIMO nonlinear systems with output constraints

[...]

Xu Jin¹•Institutions (1)

Georgia Institute of Technology¹

25 Mar 2017-International Journal of Robust and Nonlinear Control

TL;DR: It is shown that under the proposed control scheme, finite-time convergence of the output tracking error into a small set around zero is guaranteed, while the constraint requirement on the system outputtracking error will not be violated during operation.

...read moreread less

Abstract: Summary In this work, we present a novel adaptive finite-time fault-tolerant control algorithm for a class of multi-input multi-output nonlinear systems with constraint requirement on the system output tracking error. Both parametric and nonparametric system uncertainties can be effectively dealt with by the proposed control scheme. The gain functions of the nonlinear systems under discussion, especially the control input gain function, can be not fully known and state-dependent. Backstepping design with a tan-type barrier Lyapunov function and a new structure of stabilizing function is presented. We show that under the proposed control scheme, finite-time convergence of the output tracking error into a small set around zero is guaranteed, while the constraint requirement on the system output tracking error will not be violated during operation. An illustrative example on a robot manipulator model is presented in the end to further demonstrate the effectiveness of the proposed control scheme. Copyright © 2016 John Wiley & Sons, Ltd.

...read moreread less

...

Expand