Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Function (mathematics)
  4. 2017
  1. Home
  2. Topics
  3. Function (mathematics)
  4. 2017
Showing papers on "Function (mathematics) published in 2017"
Journal Article•10.1007/S10107-016-1034-2•
On the linear convergence of the alternating direction method of multipliers

[...]

Mingyi Hong1, Zhi-Quan Luo2•
Iowa State University1, The Chinese University of Hong Kong2
01 Mar 2017-Mathematical Programming
TL;DR: This paper establishes the global R-linear convergence of the ADMM for minimizing the sum of any number of convex separable functions, assuming that a certain error bound condition holds true and the dual stepsize is sufficiently small.
Abstract: We analyze the convergence rate of the alternating direction method of multipliers (ADMM) for minimizing the sum of two or more nonsmooth convex separable functions subject to linear constraints. Previous analysis of the ADMM typically assumes that the objective function is the sum of only two convex functions defined on two separable blocks of variables even though the algorithm works well in numerical experiments for three or more blocks. Moreover, there has been no rate of convergence analysis for the ADMM without strong convexity in the objective function. In this paper we establish the global R-linear convergence of the ADMM for minimizing the sum of any number of convex separable functions, assuming that a certain error bound condition holds true and the dual stepsize is sufficiently small. Such an error bound condition is satisfied for example when the feasible set is a compact polyhedron and the objective function consists of a smooth strictly convex function composed with a linear mapping, and a nonsmooth $$\ell _1$$l1 regularizer. This result implies the linear convergence of the ADMM for contemporary applications such as LASSO without assuming strong convexity of the objective function.

862 citations

Journal Article•10.1016/J.JQSRT.2016.05.028•
LEVEL: A computer program for solving the radial Schrödinger equation for bound and quasibound levels

[...]

Robert J. Le Roy1•
University of Waterloo1
01 Jan 2017-Journal of Quantitative Spectroscopy & Radiative Transfer
TL;DR: Level as mentioned in this paper can automatically locate the bound and/or quasibounded levels of any smooth single- or double-minimum potential, and calculate inertial rotation and centrifugal distortion constants and various expectation values for those levels.
Abstract: This paper describes program LEVEL, which can solve the radial or one-dimensional Schrodinger equation and automatically locate either all of, or a selected number of, the bound and/or quasibound levels of any smooth single- or double-minimum potential, and calculate inertial rotation and centrifugal distortion constants and various expectation values for those levels. It can also calculate Franck–Condon factors and other off-diagonal matrix elements, either between levels of a single potential or between levels of two different potentials. The potential energy function may be defined by any one of a number of analytic functions, or by a set of input potential function values which the code will interpolate over and extrapolate beyond to span the desired range.

827 citations

Journal Article•10.1007/S40304-017-0117-6•
Deep Learning-Based Numerical Methods for High-Dimensional Parabolic Partial Differential Equations and Backward Stochastic Differential Equations

[...]

Weinan E1, Weinan E2, Jiequn Han1, Arnulf Jentzen3•
Princeton University1, Peking University2, ETH Zurich3
15 Jun 2017
TL;DR: In this article, a new algorithm for solving parabolic partial differential equations and backward stochastic differential equations (BSDEs) in high dimension, which is based on an analogy between the BSDE and reinforcement learning with the gradient of the solution playing the role of the policy function, and the loss function given by the error between the prescribed terminal condition and the solution of BSDE.
Abstract: We study a new algorithm for solving parabolic partial differential equations (PDEs) and backward stochastic differential equations (BSDEs) in high dimension, which is based on an analogy between the BSDE and reinforcement learning with the gradient of the solution playing the role of the policy function, and the loss function given by the error between the prescribed terminal condition and the solution of the BSDE. The policy function is then approximated by a neural network, as is done in deep reinforcement learning. Numerical results using TensorFlow illustrate the efficiency and accuracy of the studied algorithm for several 100-dimensional nonlinear PDEs from physics and finance such as the Allen–Cahn equation, the Hamilton–Jacobi–Bellman equation, and a nonlinear pricing model for financial derivatives.

408 citations

Posted Content•
Joint Distribution Optimal Transportation for Domain Adaptation

[...]

Nicolas Courty1, Rémi Flamary, Amaury Habrard, Alain Rakotomamonjy•
University of Paris-Sud1
24 May 2017-arXiv: Machine Learning
TL;DR: This paper proposes a solution of the unsupervised domain adaptation problem with optimal transport, that allows to recover an estimated target $\mathcal{P}^f_t=(X,f(X))$ by optimizing simultaneously the optimal coupling and $f$.
Abstract: This paper deals with the unsupervised domain adaptation problem, where one wants to estimate a prediction function $f$ in a given target domain without any labeled sample by exploiting the knowledge available from a source domain where labels are known Our work makes the following assumption: there exists a non-linear transformation between the joint feature/label space distributions of the two domain $\mathcal{P}_s$ and $\mathcal{P}_t$ We propose a solution of this problem with optimal transport, that allows to recover an estimated target $\mathcal{P}^f_t=(X,f(X))$ by optimizing simultaneously the optimal coupling and $f$ We show that our method corresponds to the minimization of a bound on the target error, and provide an efficient algorithmic solution, for which convergence is proved The versatility of our approach, both in terms of class of hypothesis or loss functions is demonstrated with real world classification and regression problems, for which we reach or surpass state-of-the-art results

344 citations

Journal Article•10.1109/TEVC.2017.2694221•
DG2: A Faster and More Accurate Differential Grouping for Large-Scale Black-Box Optimization

[...]

Mohammad Nabi Omidvar1, Ming Yang2, Yi Mei3, Xiaodong Li4, Xin Yao1 •
University of Birmingham1, China University of Geosciences (Wuhan)2, Victoria University of Wellington3, RMIT University4
25 Apr 2017-IEEE Transactions on Evolutionary Computation
TL;DR: The proposed improved variant of the differential grouping (DG) algorithm, DG2, finds a reliable threshold value by estimating the magnitude of roundoff errors and automatic calculation of its threshold parameter, which makes it parameter-free.
Abstract: Identification of variable interaction is essential for an efficient implementation of a divide-and-conquer algorithm for large-scale black-box optimization. In this paper, we propose an improved variant of the differential grouping (DG) algorithm, which has a better efficiency and grouping accuracy. The proposed algorithm, DG2, finds a reliable threshold value by estimating the magnitude of roundoff errors. With respect to efficiency, DG2 reuses the sample points that are generated for detecting interactions and saves up to half of the computational resources on fully separable functions. We mathematically show that the new sampling technique achieves the lower bound with respect to the number of function evaluations. Unlike its predecessor, DG2 checks all possible pairs of variables for interactions and has the capacity to identify overlapping components of an objective function. On the accuracy aspect, DG2 outperforms the state-of-the-art decomposition methods on the latest large-scale continuous optimization benchmark suites. DG2 also performs reliably in the presence of imbalance among contribution of components in an objective function. Another major advantage of DG2 is the automatic calculation of its threshold parameter ( $\epsilon $ ), which makes it parameter-free. Finally, the experimental results show that when DG2 is used within a cooperative co-evolutionary framework, it can generate competitive results as compared to several state-of-the-art algorithms.

336 citations

Posted Content•
On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning

[...]

Bolin Gao, Lacra Pavel
03 Apr 2017-arXiv: Optimization and Control
TL;DR: This paper shows that the softmax function is the monotone gradient map of the log-sum-exp function and exploits the inverse temperature parameter to derive the Lipschitz and co-coercivity properties of thesoftmax function.
Abstract: In this paper, we utilize results from convex analysis and monotone operator theory to derive additional properties of the softmax function that have not yet been covered in the existing literature. In particular, we show that the softmax function is the monotone gradient map of the log-sum-exp function. By exploiting this connection, we show that the inverse temperature parameter determines the Lipschitz and co-coercivity properties of the softmax function. We then demonstrate the usefulness of these properties through an application in game-theoretic reinforcement learning.

319 citations

Journal Article•10.1109/TAC.2016.2593899•
Distributed Continuous-Time Convex Optimization With Time-Varying Cost Functions

[...]

Salar Rahili1, Wei Ren1•
University of California, Riverside1
01 Apr 2017-IEEE Transactions on Automatic Control
TL;DR: A time-varying distributed convex optimization problem is studied for continuous-time multi-agent systems and it is shown that the center of the agents tracks the optimal trajectory, the connectivity of the Agents is maintained, and interagent collision is avoided.
Abstract: In this paper, a time-varying distributed convex optimization problem is studied for continuous-time multi-agent systems. The objective is to minimize the sum of local time-varying cost functions, each of which is known to only an individual agent, through local interaction. Here, the optimal point is time varying and creates an optimal trajectory. Control algorithms are designed for the cases of single-integrator and double-integrator dynamics. In both cases, a centralized approach is first introduced to solve the optimization problem. Then, this problem is solved in a distributed manner and a discontinuous algorithm based on the signum function is proposed in each case. In the case of single-integrator (respectively, double-integrator) dynamics, each agent relies only on its own position and the relative positions (respectively, positions and velocities) between itself and its neighbors. A gain adaption scheme is introduced in both algorithms to eliminate certain global information requirement. To relax the restricted assumption imposed on feasible cost functions, an estimator based algorithm using the signum function is proposed, where each agent uses dynamic average tracking as a tool to estimate the centralized control input. As a tradeoff, the estimator-based algorithm necessitates communication between neighbors. Then, in the case of double-integrator dynamics, the proposed algorithms are further extended. Two continuous algorithms based on, respectively, a time-varying and a fixed boundary layer are proposed as continuous approximations of the signum function. To account for interagent collision for physical agents, a distributed convex optimization problem with swarm tracking behavior is introduced for both single-integrator and double-integrator dynamics. It is shown that the center of the agents tracks the optimal trajectory, the connectivity of the agents is maintained, and interagent collision is avoided.

284 citations

Journal Article•10.1038/S41598-017-09098-0•
Machine learning quantum phases of matter beyond the fermion sign problem

[...]

Peter Broecker1, Juan Carrasquilla2, Roger G. Melko3, Roger G. Melko2, Simon Trebst1 •
University of Cologne1, Perimeter Institute for Theoretical Physics2, University of Waterloo3
18 Aug 2017-Scientific Reports
TL;DR: In this article, a convolutional neural network (CNN) was used to identify and locate quantum phase transitions in quantum many-fermion systems using auxiliary-field quantum Monte Carlo simulations.
Abstract: State-of-the-art machine learning techniques promise to become a powerful tool in statistical mechanics via their capacity to distinguish different phases of matter in an automated way. Here we demonstrate that convolutional neural networks (CNN) can be optimized for quantum many-fermion systems such that they correctly identify and locate quantum phase transitions in such systems. Using auxiliary-field quantum Monte Carlo (QMC) simulations to sample the many-fermion system, we show that the Green’s function holds sufficient information to allow for the distinction of different fermionic phases via a CNN. We demonstrate that this QMC + machine learning approach works even for systems exhibiting a severe fermion sign problem where conventional approaches to extract information from the Green’s function, e.g. in the form of equal-time correlation functions, fail.

272 citations

Journal Article•
Information-geometric optimization algorithms: a unifying picture via invariance principles

[...]

Yann Ollivier1, Ludovic Arnold, Anne Auger2, Nikolaus Hansen2•
Université Paris-Saclay1, École Polytechnique2
01 Jan 2017-Journal of Machine Learning Research
TL;DR: A canonical way to turn any smooth parametric family of probability distributions on an arbitrary search space X into a continuous-time black-box optimization method on X, the information-geometric optimization (IGO) method, which achieves maximal invariance properties.
Abstract: We present a canonical way to turn any smooth parametric family of probability distributions on an arbitrary search space X into a continuous-time black-box optimization method on X, the information-geometric optimization (IGO) method. Invariance as a major design principle keeps the number of arbitrary choices to a minimum. The resulting IGO flow is the flow of an ordinary differential equation conducting the natural gradient ascent of an adaptive, time-dependent transformation of the objective function. It makes no particular assumptions on the objective function to be optimized. The IGO method produces explicit IGO algorithms through time discretization. It naturally recovers versions of known algorithms and offers a systematic way to derive new ones. In continuous search spaces, IGO algorithms take a form related to natural evolution strategies (NES). The cross-entropy method is recovered in a particular case with a large time step, and can be extended into a smoothed, parametrization-independent maximum likelihood update (IGO-ML). When applied to the family of Gaussian distributions on Rd, the IGO framework recovers a version of the well-known CMA-ES algorithm and of xNES. For the family of Bernoulli distributions on {0, 1}d, we recover the seminal PBIL algorithm and cGA. For the distributions of restricted Boltzmann machines, we naturally obtain a novel algorithm for discrete optimization on {0, 1}d. All these algorithms are natural instances of, and unified under, the single information-geometric optimization framework. The IGO method achieves, thanks to its intrinsic formulation, maximal invariance properties: invariance under reparametrization of the search space X, under a change of parameters of the probability distribution, and under increasing transformation of the function to be optimized. The latter is achieved through an adaptive, quantile-based formulation of the objective. Theoretical considerations strongly suggest that IGO algorithms are essentially characterized by a minimal change of the distribution over time. Therefore they have minimal loss in diversity through the course of optimization, provided the initial diversity is high. First experiments using restricted Boltzmann machines confirm this insight. As a simple consequence, IGO seems to provide, from information theory, an elegant way to simultaneously explore several valleys of a fitness landscape in a single run.

265 citations

Journal Article•
An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

[...]

Ohad Shamir
01 Jan 2017-Journal of Machine Learning Research
TL;DR: In this paper, a bandit convex optimization with two-point feedback and zero-order stochastic convex optimisation with two function evaluations per round is considered. And the algorithm is based on a small but surprisingly powerful modification of the gradient estimator.
Abstract: We consider the closely related problems of bandit convex optimization with two-point feedback, and zero-order stochastic convex optimization with two function evaluations per round. We provide a simple algorithm and analysis which is optimal for convex Lipschitz functions. This improves on \cite{dujww13}, which only provides an optimal result for smooth functions; Moreover, the algorithm and analysis are simpler, and readily extend to non-Euclidean problems. The algorithm is based on a small but surprisingly powerful modification of the gradient estimator.

253 citations

Proceedings Article•10.14288/1.0357417•
Joint Distribution Optimal Transportation for Domain Adaptation

[...]

Nicolas Courty1, Rémi Flamary, Amaury Habrard, Alain Rakotomamonjy•
University of Paris-Sud1
24 May 2017
TL;DR: In this article, a non-linear transformation between the joint feature/label space distributions of the two domains P s and P t can be estimated with optimal transport, which corresponds to the minimization of a bound on the target error.
Abstract: This paper deals with the unsupervised domain adaptation problem, where one wants to estimate a prediction function f in a given target domain without any labeled sample by exploiting the knowledge available from a source domain where labels are known. Our work makes the following assumption: there exists a non-linear transformation between the joint feature/label space distributions of the two domain P s and P t that can be estimated with optimal transport. We propose a solution of this problem that allows to recover an estimated target P f t = (X, f (X)) by optimizing simultaneously the optimal coupling and f. We show that our method corresponds to the minimization of a bound on the target error, and provide an efficient algorithmic solution, for which convergence is proved. The versatility of our approach, both in terms of class of hypothesis or loss functions is demonstrated with real world classification and regression problems, for which we reach or surpass state-of-the-art results.
Journal Article•10.1109/TAC.2016.2628807•
Distributed Continuous-Time Algorithm for Constrained Convex Optimizations via Nonsmooth Analysis Approach

[...]

Xianlin Zeng1, Peng Yi2, Yiguang Hong1•
Chinese Academy of Sciences1, University of Toronto2
01 Oct 2017-IEEE Transactions on Automatic Control
TL;DR: In this article, a distributed continuous-time projected algorithm for convex cost functions with local constraints is proposed, in which each agent knows its local cost function and local constraint set, and proves that all the agents of the algorithm can find the same optimal solution.
Abstract: This technical note studies the distributed optimization problem of a sum of nonsmooth convex cost functions with local constraints. At first, we propose a novel distributed continuous-time projected algorithm, in which each agent knows its local cost function and local constraint set, for the constrained optimization problem. Then we prove that all the agents of the algorithm can find the same optimal solution, and meanwhile, keep the states bounded while seeking the optimal solutions. We conduct a complete convergence analysis by employing nonsmooth Lyapunov functions for the stability analysis of differential inclusions. Finally, we provide a numerical example for illustration.
Posted Content•
Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities

[...]

Guo-Jun Qi
23 Jan 2017-arXiv: Computer Vision and Pattern Recognition
TL;DR: In this article, a loss-sensitive GAN (LS-GAN) is proposed to distinguish between real and fake samples by designated margins, while learning a generator alternately to produce realistic samples by minimizing their losses.
Abstract: In this paper, we present the Lipschitz regularization theory and algorithms for a novel Loss-Sensitive Generative Adversarial Network (LS-GAN). Specifically, it trains a loss function to distinguish between real and fake samples by designated margins, while learning a generator alternately to produce realistic samples by minimizing their losses. The LS-GAN further regularizes its loss function with a Lipschitz regularity condition on the density of real data, yielding a regularized model that can better generalize to produce new data from a reasonable number of training examples than the classic GAN. We will further present a Generalized LS-GAN (GLS-GAN) and show it contains a large family of regularized GAN models, including both LS-GAN and Wasserstein GAN, as its special cases. Compared with the other GAN models, we will conduct experiments to show both LS-GAN and GLS-GAN exhibit competitive ability in generating new images in terms of the Minimum Reconstruction Error (MRE) assessed on a separate test set. We further extend the LS-GAN to a conditional form for supervised and semi-supervised learning problems, and demonstrate its outstanding performance on image classification tasks.
Journal Article•10.1109/TAC.2017.2648041•
Non-Convex Distributed Optimization

[...]

Tatiana Tatarenko1, Behrouz Touri2•
Technische Universität Darmstadt1, University of Colorado Boulder2
05 Jan 2017-IEEE Transactions on Automatic Control
TL;DR: In this article, the authors study distributed non-convex optimization on a time-varying multi-agent network, where each node has access to its own smooth local cost function, and the collective goal is to minimize the sum of these functions.
Abstract: We study distributed non-convex optimization on a time-varying multi-agent network. Each node has access to its own smooth local cost function, and the collective goal is to minimize the sum of these functions. The perturbed push-sum algorithm was previously used for convex distributed optimization. We generalize the result obtained for the convex case to the case of non-convex functions. Under some additional technical assumptions on the gradients we prove the convergence of the distributed push-sum algorithm to some critical point of the objective function. By utilizing perturbations on the update process, we show the almost sure convergence of the perturbed dynamics to a local minimum of the global objective function, if the objective function has no saddle points. Our analysis shows that this perturbed procedure converges at a rate of $O(1/t)$ .
Journal Article•10.1287/MOOR.2016.0842•
Optimal Approximation for Submodular and Supermodular Optimization with Bounded Curvature

[...]

Maxim Sviridenko, Jan Vondrák, Justin Ward
16 May 2017-Mathematics of Operations Research
TL;DR: In this paper, a (1 − c/e)-approximation algorithm was proposed for the problem of maximizing a monotone increasing submodular function subject to a single matroid constraint.
Abstract: We design new approximation algorithms for the problems of optimizing submodular and supermodular functions subject to a single matroid constraint. Specifically, we consider the case in which we wish to maximize a monotone increasing submodular function or minimize a monotone decreasing supermodular function with a bounded total curvature c. Intuitively, the parameter c represents how nonlinear a function f is: when c = 0, f is linear, while for c = 1, f may be an arbitrary monotone increasing submodular function. For the case of submodular maximization with total curvature c, we obtain a (1 − c/e)-approximation—the first improvement over the greedy algorithm of of Conforti and Cornuejols from 1984, which holds for a cardinality constraint, as well as a recent analogous result for an arbitrary matroid constraint. Our approach is based on modifications of the continuous greedy algorithm and nonoblivious local search, and allows us to approximately maximize the sum of a nonnegative, monotone increasing subm...
Journal Article•10.1109/TCYB.2016.2542923•
Discrete-Time Deterministic $Q$ -Learning: A Novel Convergence Analysis

[...]

Qinglai Wei1, Frank L. Lewis2, Qiuye Sun3, Pengfei Yan1, Ruizhuo Song4 •
Chinese Academy of Sciences1, University of Texas at Arlington2, Northeastern University (China)3, University of Science and Technology Beijing4
01 May 2017-IEEE Transactions on Systems, Man, and Cybernetics
TL;DR: A novel discrete-time deterministic deterministic inline-formula-learning algorithm is developed and the convergence criterion for the discounted case is established, and the iterative control law of the developed algorithm is simplified.
Abstract: In this paper, a novel discrete-time deterministic $ Q$ -learning algorithm is developed. In each iteration of the developed $ Q$ -learning algorithm, the iterative $ Q$ function is updated for all the state and control spaces, instead of updating for a single state and a single control in traditional $ Q$ -learning algorithm. A new convergence criterion is established to guarantee that the iterative $ Q$ function converges to the optimum, where the convergence criterion of the learning rates for traditional $ Q$ -learning algorithms is simplified. During the convergence analysis, the upper and lower bounds of the iterative $ Q$ function are analyzed to obtain the convergence criterion, instead of analyzing the iterative $ Q$ function itself. For convenience of analysis, the convergence properties for undiscounted case of the deterministic $ Q$ -learning algorithm are first developed. Then, considering the discounted factor, the convergence criterion for the discounted case is established. Neural networks are used to approximate the iterative $ Q$ function and compute the iterative control law, respectively, for facilitating the implementation of the deterministic $ Q$ -learning algorithm. Finally, simulation results and comparisons are given to illustrate the performance of the developed algorithm.
Proceedings Article•
Sobolev Training for Neural Networks

[...]

Wojciech Marian Czarnecki1, Simon Osindero2, Max Jaderberg1, Grzegorz Swirszcz1, Razvan Pascanu1 •
Google1, Yahoo!2
15 Jun 2017
TL;DR: Sobolev Training for neural networks is introduced, which is a method for incorporating target derivatives in addition the to target values while training, and results in models with higher accuracy and stronger generalisation on three distinct domains.
Abstract: At the heart of deep learning we aim to use neural networks as function approximators - training them to produce outputs from inputs in emulation of a ground truth function or data creation process. In many cases we only have access to input-output pairs from the ground truth, however it is becoming more common to have access to derivatives of the target output with respect to the input -- for example when the ground truth function is itself a neural network such as in network compression or distillation. Generally these target derivatives are not computed, or are ignored. This paper introduces Sobolev Training for neural networks, which is a method for incorporating these target derivatives in addition the to target values while training. By optimising neural networks to not only approximate the function’s outputs but also the function’s derivatives we encode additional information about the target function within the parameters of the neural network. Thereby we can improve the quality of our predictors, as well as the data-efficiency and generalization capabilities of our learned function approximation. We provide theoretical justifications for such an approach as well as examples of empirical evidence on three distinct domains: regression on classical optimisation datasets, distilling policies of an agent playing Atari, and on large-scale applications of synthetic gradients. In all three domains the use of Sobolev Training, employing target derivatives in addition to target values, results in models with higher accuracy and stronger generalisation.
Journal Article•10.1137/15M1026924•
Saddle-point dynamics: conditions for asymptotic stability of saddle points

[...]

Ashish Cherukuri1, Bahman Gharesifard2, Jorge E. Cortes1•
University of California, San Diego1, Queen's University2
22 Feb 2017-Siam Journal on Control and Optimization
TL;DR: In this article, the authors consider continuous differentiable functions with min-max saddle points and study the asymptotic convergence properties of the associated saddle-point dynamics (gradient descent in the first variable and gradient ascent in the second one).
Abstract: This paper considers continuously differentiable functions of two vector variables that have (possibly a continuum of) min-max saddle points. We study the asymptotic convergence properties of the associated saddle-point dynamics (gradient descent in the first variable and gradient ascent in the second one). We identify a suite of complementary conditions under which the set of saddle points is asymptotically stable under the saddle-point dynamics. Our first set of results is based on the convexity-concavity of the function defining the saddle-point dynamics to establish the convergence guarantees. For functions that do not enjoy this feature, our second set of results relies on properties of the linearization of the dynamics, the function along the proximal normals to the saddle set, and the linearity of the function in one variable. We also provide global versions of the asymptotic convergence results. Various examples illustrate our discussion.
Journal Article•10.1109/TSP.2018.2835403•
Global optimality in low-rank matrix optimization

[...]

Zhihui Zhu1, Qiuwei Li1, Gongguo Tang1, Michael B. Wakin1•
Colorado School of Mines1
25 Feb 2017
TL;DR: In this paper, the authors consider the minimization of a general objective function over a set of rectangular matrices that have rank at most r. Despite the resulting nonconvexity, recent studies in matrix completion and sensing have shown that the factored problem has no spurious local minima and obeys the strict saddle property.
Abstract: This paper considers the minimization of a general objective function $f(\boldsymbol{X})$ over the set of rectangular $n\times m$ matrices that have rank at most $r$ . To reduce the computational burden, we factorize the variable $\boldsymbol{X}$ into a product of two smaller matrices and optimize over these two matrices instead of $\boldsymbol{X}$ . Despite the resulting nonconvexity, recent studies in matrix completion and sensing have shown that the factored problem has no spurious local minima and obeys the so-called strict saddle property (the function has a directional negative curvature at all critical points but local minima). We analyze the global geometry for a general and yet well-conditioned objective function $f(\boldsymbol{X})$ whose restricted strong convexity and restricted strong smoothness constants are comparable. In particular, we show that the reformulated objective function has no spurious local minima and obeys the strict saddle property. These geometric properties imply that a number of iterative optimization algorithms (such as gradient descent) can provably solve the factored problem with global convergence.
Journal Article•10.1007/JHEP08(2017)146•
More on Supersymmetric and 2d Analogs of the SYK Model

[...]

Jeff Murugan1, Jeff Murugan2, Douglas Stanford1, Edward Witten1•
Institute for Advanced Study1, University of Cape Town2
16 Jun 2017-arXiv: High Energy Physics - Theory
TL;DR: In this article, supersymmetric and 2D analogies of the SYK model were explored, which consist of super-conformal eigenfunctions appropriate for expanding a four-point function.
Abstract: In this paper, we explore supersymmetric and 2d analogs of the SYK model. We begin by working out a basis of (super)conformal eigenfunctions appropriate for expanding a four-point function. We use this to clarify some details of the 1d supersymmetric SYK model. We then introduce new bosonic and supersymmetric analogs of SYK in two dimensions. These theories consist of $N$ fields interacting with random $q$-field interactions. Although models built entirely from bosons appear to be problematic, we find a supersymmetric model that flows to a large $N$ CFT with interaction strength of order one. We derive an integral formula for the four-point function at order $1/N$, and use it to compute the central charge, chaos exponent and some anomalous dimensions. We describe a problem that arises if one tries to find a 2d SYK-like CFT with a continuous global symmetry.
Journal Article•10.1002/INT.21898•
A Novel Improved Accuracy Function for Interval Valued Pythagorean Fuzzy Sets and Its Applications in the Decision-Making Process

[...]

Harish Garg1•
Thapar University1
01 Dec 2017-International Journal of Intelligent Systems
TL;DR: An improved accuracy function for the ranking order of interval‐valued Pythagorean fuzzy sets (IVPFSs) is presented and multicriteria decision‐making method has been proposed for finding the desirable alternative(s).
Abstract: The objective of this work is to present an improved accuracy function for the ranking order of interval-valued Pythagorean fuzzy sets (IVPFSs). Shortcomings of the existing score and accuracy functions in interval-valued Pythagorean environment have been overcome by the proposed accuracy function. In the proposed function, degree of hesitation between the element of IVPFS has been taken into account during the analysis. Based on it, multicriteria decision-making method has been proposed for finding the desirable alternative(s). Finally, an illustrative example for solving the decision-making problem has been presented to demonstrate application of the proposed approach.
Journal Article•10.1007/S10107-017-1161-4•
Accelerated schemes for a class of variational inequalities

[...]

Yunmei Chen1, Guanghui Lan2, Yuyuan Ouyang3•
University of Florida1, Georgia Institute of Technology2, Clemson University3
01 Sep 2017-Mathematical Programming
TL;DR: In this article, a stochastic accelerated mirror-prox (SAMP) method was proposed for solving a class of monotone variational inequalities (SVI), which is based on a multi-step acceleration scheme.
Abstract: We propose a novel stochastic method, namely the stochastic accelerated mirror-prox (SAMP) method, for solving a class of monotone stochastic variational inequalities (SVI). The main idea of the proposed algorithm is to incorporate a multi-step acceleration scheme into the stochastic mirror-prox method. The developed SAMP method computes weak solutions with the optimal iteration complexity for SVIs. In particular, if the operator in SVI consists of the stochastic gradient of a smooth function, the iteration complexity of the SAMP method can be accelerated in terms of their dependence on the Lipschitz constant of the smooth function. For SVIs with bounded feasible sets, the bound of the iteration complexity of the SAMP method depends on the diameter of the feasible set. For unbounded SVIs, we adopt the modified gap function introduced by Monteiro and Svaiter for solving monotone inclusion, and show that the iteration complexity of the SAMP method depends on the distance from the initial point to the set of strong solutions. It is worth noting that our study also significantly improves a few existing complexity results for solving deterministic variational inequality problems. We demonstrate the advantages of the SAMP method over some existing algorithms through our preliminary numerical experiments.
Journal Article•10.1016/J.AUTOMATICA.2017.01.002•
Sliding mode control for singular stochastic Markovian jump systems with uncertainties

[...]

Qingling Zhang1, Qingling Zhang2, Li Li1, Xing-Gang Yan2, Sarah K. Spurgeon3 •
Northeastern University (China)1, University of Kent2, University College London3
01 May 2017-Automatica
TL;DR: A set of new sufficient conditions is developed which not only guarantees the stochastic admissibility of the sliding mode dynamics, but also determines all the parameter matrices in the integral sliding function.
Journal Article•10.1080/00401706.2016.1251495•
Bayesian Design of Experiments Using Approximate Coordinate Exchange

[...]

Antony M. Overstall1, David C. Woods1•
University of Southampton1
27 Apr 2017-Technometrics
TL;DR: In this article, a Gaussian process emulator is used to approximate the expected utility as a function of a single design coordinate in a series of conditional optimization steps to find multi-variable designs without resorting to asymptotic approximations to the posterior distribution or expected utility.
Abstract: The construction of decision-theoretical Bayesian designs for realistically complex nonlinear models is computationally challenging, as it requires the optimization of analytically intractable expected utility functions over high-dimensional design spaces. We provide the most general solution to date for this problem through a novel approximate coordinate exchange algorithm. This methodology uses a Gaussian process emulator to approximate the expected utility as a function of a single design coordinate in a series of conditional optimization steps. It has flexibility to address problems for any choice of utility function and for a wide range of statistical models with different numbers of variables, numbers of runs and randomization restrictions. In contrast to existing approaches to Bayesian design, the method can find multi-variable designs in large numbers of runs without resorting to asymptotic approximations to the posterior distribution or expected utility. The methodology is demonstrated on...
Journal Article•10.1103/PHYSREVD.96.043505•
Real no-boundary wave function in Lorentzian quantum cosmology

[...]

J. Diaz Dorronsoro1, Jonathan J. Halliwell2, James B. Hartle3, Thomas Hertog1, Oliver Janssen •
Katholieke Universiteit Leuven1, Imperial College London2, University of California, Santa Barbara3
07 Aug 2017-Physical Review D
TL;DR: In this article, the standard no-boundary wave function has a natural expression in terms of a Lorentzian path integral with its contour defined by Picard-Lefschetz theory.
Abstract: It is shown that the standard no-boundary wave function has a natural expression in terms of a Lorentzian path integral with its contour defined by Picard-Lefschetz theory The wave function is real, satisfies the Wheeler-DeWitt equation and predicts an ensemble of asymptotically classical, inflationary universes with nearly-Gaussian fluctuations and with a smooth semiclassical origin
Journal Article•10.1016/J.AUTOMATICA.2017.06.035•
Function perturbations on singular Boolean networks

[...]

Yang Liu1, Yang Liu2, Baowen Li2, Hongwei Chen1, Jinde Cao1, Jinde Cao3 •
Southeast University1, Zhejiang Normal University2, Shandong Normal University3
01 Oct 2017-Automatica
TL;DR: The algebraic form of an SBN is given, and how the transition matrix of the SBN changes under function perturbations is discussed, under which the impacts of function perturbed structure changes on the topological structure are investigated.
Journal Article•10.1007/JHEP06(2017)127•
A non-planar two-loop three-point function beyond multiple polylogarithms

[...]

Andreas von Manteuffel1, Andreas von Manteuffel2, Lorenzo Tancredi3•
Michigan State University1, University of Mainz2, Karlsruhe Institute of Technology3
20 Jan 2017-arXiv: High Energy Physics - Phenomenology
TL;DR: In this article, the analytic calculation of a two-loop non-planar three-point function which contributes to the twoloop amplitudes for $t \bar{t}$ production and $gamma \gamma$ production in gluon fusion through a massive top-quark loop is considered.
Abstract: We consider the analytic calculation of a two-loop non-planar three-point function which contributes to the two-loop amplitudes for $t \bar{t}$ production and $\gamma \gamma$ production in gluon fusion through a massive top-quark loop. All subtopology integrals can be written in terms of multiple polylogarithms over an irrational alphabet and we employ a new method for the integration of the differential equations which does not rely on the rationalization of the latter. The top topology integrals, instead, in spite of the absence of a massive three-particle cut, cannot be evaluated in terms of multiple polylogarithms and require the introduction of integrals over complete elliptic integrals and polylogarithms. We provide one-fold integral representations for the solutions and continue them analytically to all relevant regions of the phase space in terms of real function, extracting all imaginary parts explicitly. The numerical evaluation of our expressions becomes straightforward in this way.
Journal Article•
Distributed Learning with Regularized Least Squares

[...]

Shaobo Lin, Xin Guo, Ding-Xuan Zhou
01 Jan 2017-Journal of Machine Learning Research
TL;DR: In this article, the authors study distributed learning with the least squares regularization scheme in a reproducing kernel Hilbert space (RKHS) and show that the global output function of this distributed learning is a good approximation to the algorithm processing the whole data in one single machine.
Abstract: We study distributed learning with the least squares regularization scheme in a reproducing kernel Hilbert space (RKHS). By a divide-and-conquer approach, the algorithm partitions a data set into disjoint data subsets, applies the least squares regularization scheme to each data subset to produce an output function, and then takes an average of the individual output functions as a final global estimator or predictor. We show with error bounds in expectation in both the $L^2$-metric and RKHS-metric that the global output function of this distributed learning is a good approximation to the algorithm processing the whole data in one single machine. Our error bounds are sharp and stated in a general setting without any eigenfunction assumption. The analysis is achieved by a novel second order decomposition of operator differences in our integral operator approach. Even for the classical least squares regularization scheme in the RKHS associated with a general kernel, we give the best learning rate in the literature.
Journal Article•10.1016/J.IFACOL.2017.08.1026•
Transfer Function Estimation in System Identification Toolbox via Vector Fitting

[...]

Ahmet Arda Ozdemir1, Suat Gumussoy1•
MathWorks1
01 Jul 2017-IFAC-PapersOnLine
TL;DR: The first contribution is a bilinear mapping of the original problem from the imaginary axis onto the unit disk, which improves the numerics of the underlying Sanathanan-Koerner iterations and the more recent instrumental-variable iterations.
Journal Article•10.1002/RNC.3596•
Adaptive finite‐time fault‐tolerant tracking control for a class of MIMO nonlinear systems with output constraints

[...]

Xu Jin1•
Georgia Institute of Technology1
25 Mar 2017-International Journal of Robust and Nonlinear Control
TL;DR: It is shown that under the proposed control scheme, finite-time convergence of the output tracking error into a small set around zero is guaranteed, while the constraint requirement on the system outputtracking error will not be violated during operation.
Abstract: Summary In this work, we present a novel adaptive finite-time fault-tolerant control algorithm for a class of multi-input multi-output nonlinear systems with constraint requirement on the system output tracking error. Both parametric and nonparametric system uncertainties can be effectively dealt with by the proposed control scheme. The gain functions of the nonlinear systems under discussion, especially the control input gain function, can be not fully known and state-dependent. Backstepping design with a tan-type barrier Lyapunov function and a new structure of stabilizing function is presented. We show that under the proposed control scheme, finite-time convergence of the output tracking error into a small set around zero is guaranteed, while the constraint requirement on the system output tracking error will not be violated during operation. An illustrative example on a robot manipulator model is presented in the end to further demonstrate the effectiveness of the proposed control scheme. Copyright © 2016 John Wiley & Sons, Ltd.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve