Top 3535 papers published in the topic of Function (mathematics) in 2021

Showing papers on "Function (mathematics) published in 2021"

Journal Article•10.1038/S42256-021-00302-5•

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators

[...]

Lu Lu¹, Pengzhan Jin², Pengzhan Jin³, Guofei Pang³, Zhongqiang Zhang⁴, George Em Karniadakis³ - Show less +2 more•Institutions (4)

Massachusetts Institute of Technology¹, Chinese Academy of Sciences², Brown University³, Worcester Polytechnic Institute⁴

01 Mar 2021-Nature Machine Intelligence

TL;DR: A new deep neural network called DeepONet can lean various mathematical operators with small generalization error and can learn various explicit operators, such as integrals and fractional Laplacians, as well as implicit operators that represent deterministic and stochastic differential equations.

...read moreread less

Abstract: It is widely known that neural networks (NNs) are universal approximators of continuous functions. However, a less known but powerful result is that a NN with a single hidden layer can accurately approximate any nonlinear continuous operator. This universal approximation theorem of operators is suggestive of the structure and potential of deep neural networks (DNNs) in learning continuous operators or complex systems from streams of scattered data. Here, we thus extend this theorem to DNNs. We design a new network with small generalization error, the deep operator network (DeepONet), which consists of a DNN for encoding the discrete input function space (branch net) and another DNN for encoding the domain of the output functions (trunk net). We demonstrate that DeepONet can learn various explicit operators, such as integrals and fractional Laplacians, as well as implicit operators that represent deterministic and stochastic differential equations. We study different formulations of the input function space and its effect on the generalization error for 16 different diverse applications. Neural networks are known as universal approximators of continuous functions, but they can also approximate any mathematical operator (mapping a function to another function), which is an important capability for complex systems such as robotics control. A new deep neural network called DeepONet can lean various mathematical operators with small generalization error.

...read moreread less

1,667 citations

Posted Content•

Focal and Efficient IOU Loss for Accurate Bounding Box Regression

[...]

Yi-Fan Zhang, Weiqiang Ren, Zhang Zhang¹, Zhen Jia¹, Liang Wang¹, Tieniu Tan¹ - Show less +2 more•Institutions (1)

Chinese Academy of Sciences¹

20 Jan 2021-arXiv: Computer Vision and Pattern Recognition

TL;DR: Wang et al. as discussed by the authors proposed an Efficient Intersection over Union (EIOU) loss, which explicitly measures the discrepancies of three geometric factors in BBR, i.e., the overlap area, the central point and the side length.

...read moreread less

Abstract: In object detection, bounding box regression (BBR) is a crucial step that determines the object localization performance. However, we find that most previous loss functions for BBR have two main drawbacks: (i) Both $\ell_n$-norm and IOU-based loss functions are inefficient to depict the objective of BBR, which leads to slow convergence and inaccurate regression results. (ii) Most of the loss functions ignore the imbalance problem in BBR that the large number of anchor boxes which have small overlaps with the target boxes contribute most to the optimization of BBR. To mitigate the adverse effects caused thereby, we perform thorough studies to exploit the potential of BBR losses in this paper. Firstly, an Efficient Intersection over Union (EIOU) loss is proposed, which explicitly measures the discrepancies of three geometric factors in BBR, i.e., the overlap area, the central point and the side length. After that, we state the Effective Example Mining (EEM) problem and propose a regression version of focal loss to make the regression process focus on high-quality anchor boxes. Finally, the above two parts are combined to obtain a new loss function, namely Focal-EIOU loss. Extensive experiments on both synthetic and real datasets are performed. Notable superiorities on both the convergence speed and the localization accuracy can be achieved over other BBR losses.

...read moreread less

917 citations

Journal Article•10.1109/TNNLS.2021.3107600•

Adaptive Neural Network Control for a Class of Nonlinear Systems With Function Constraints on States.

[...]

Yan-Jun Liu¹, Wei Zhao¹, Lei Liu¹, Dapeng Li², Shaocheng Tong¹, C. L. Philip Chen³ - Show less +2 more•Institutions (3)

Liaoning University of Technology¹, Beijing University of Technology², South China University of Technology³

14 Sep 2021-IEEE Transactions on Neural Networks

TL;DR: In this article, the problem of tracking control for a class of nonlinear time-varying full state constrained systems is investigated, and the intelligent controller and adaptive law are developed.

...read moreread less

Abstract: In this article, the problem of tracking control for a class of nonlinear time-varying full state constrained systems is investigated. By constructing the time-varying asymmetric barrier Lyapunov function (BLF) and combining it with the backstepping algorithm, the intelligent controller and adaptive law are developed. Neural networks (NNs) are utilized to approximate the uncertain function. It is well known that in the past research of nonlinear systems with state constraints, the state constraint boundary is either a constant or a time-varying function. In this article, the constraint boundaries both related to state and time are investigated, which makes the design of control algorithm more complex and difficult. Furthermore, by employing the Lyapunov stability analysis, it is proven that all signals in the closed-loop system are bounded and the time-varying full state constraints are not violated. In the end, the effectiveness of the control algorithm is verified by numerical simulation.

...read moreread less

246 citations

Journal Article•10.1109/TAC.2020.3008125•

Bipartite Consensus Tracking for Second-Order Multiagent Systems: A Time-Varying Function-Based Preset-Time Approach

[...]

Boda Ning¹, Qing-Long Han¹, Zongyu Zuo²•Institutions (2)

Swinburne University of Technology¹, Beihang University²

01 Jun 2021-IEEE Transactions on Automatic Control

TL;DR: A time-varying function-based preset-time approach is proposed to realize the convergence in predetermined time to achieve bipartite consensus tracking for second-order multiagent systems with signed directed graphs.

...read moreread less

Abstract: This article is concerned with bipartite consensus tracking for second-order multiagent systems with signed directed graphs. A time-varying function-based preset-time approach is proposed to realize the convergence in predetermined time. First, a class of time-varying functions with generalized properties are presented. Second, two time-varying function-based auxiliaries and a corresponding manifold are constructed. Under a structurally balanced and strongly connected graph, a time-varying function-based controller considering the neighboring state is proposed to guarantee that the system trajectory is constrained on the manifold such that bipartite consensus tracking is achieved in preset-time. Third, for first-order multiagent systems, a preset-time controller is further developed with simplified design. Finally, numerical examples are provided to demonstrate the effectiveness of the proposed controllers.

...read moreread less

201 citations

Journal Article•10.1103/PHYSREVA.103.052416•

Solving nonlinear differential equations with differentiable quantum circuits

[...]

Oleksandr Kyriienko¹, Annie E. Paine¹, Vincent E. Elfving•Institutions (1)

University of Exeter¹

17 May 2021-Physical Review A

TL;DR: A hybrid quantum-classical workflow where DQCs are trained to satisfy differential equations and specified boundary conditions is described, and how this approach can implement a spectral method for solving differential equations in a high-dimensional feature space is shown.

...read moreread less

Abstract: We propose a quantum algorithm to solve systems of nonlinear differential equations. Using a quantum feature map encoding, we define functions as expectation values of parametrized quantum circuits. We use automatic differentiation to represent function derivatives in an analytical form as differentiable quantum circuits (DQCs), thus avoiding inaccurate finite difference procedures for calculating gradients. We describe a hybrid quantum-classical workflow where DQCs are trained to satisfy differential equations and specified boundary conditions. As a particular example setting, we show how this approach can implement a spectral method for solving differential equations in a high-dimensional feature space. From a technical perspective, we design a Chebyshev quantum feature map that offers a powerful basis set of fitting polynomials and possesses rich expressivity. We simulate the algorithm to solve an instance of Navier-Stokes equations and compute density, temperature, and velocity profiles for the fluid flow in a convergent-divergent nozzle.

...read moreread less

181 citations

Proceedings Article•10.1109/PIMRC50174.2021.9569465•

Path Loss in Reconfigurable Intelligent Surface-Enabled Channels

[...]

Steven W. Ellingson¹•Institutions (1)

Virginia Tech¹

13 Sep 2021

TL;DR: In this article, the path loss of a channel consisting of a passive reflectarray-type RIS is characterized as a function of RIS size, link geometry, and the method used to set the element states.

...read moreread less

Abstract: A reconfigurable intelligent surface (RIS) employs an array of individually-controllable elements to scatter incident signals in a desirable way; for example, to facilitate links between base stations and mobile stations that would otherwise be blocked. A principal consideration in the study of RIS-enabled propagation channels is path loss. This paper presents a simple yet broadly-applicable method for calculating the path loss of a channel consisting of a passive reflectarray-type RIS. This model is then used to characterize path loss as a function of RIS size, link geometry, and the method used to set the element states. Whereas previous work presumes either (1) an array of parameterizable element patterns and spacings (most useful for analysis of specific designs) or (2) a continuous electromagnetic surface (most useful for determining scaling laws and theoretical limits), this work begins with (1) and is then shown to be consistent with (2), making it possible to identify specific practical designs and scenarios that exhibit the performance predicted using (2). This model is used to further elucidate the matter of path loss of the RIS-enabled channel relative to that of the free space direct and specular reflection channels, which is an important consideration in the design of networks employing RIS technology.

...read moreread less

173 citations

Journal Article•10.1287/MNSC.2020.3678•

From data to decisions: Distributionally robust optimization is optimal

[...]

Bart Van Parys¹, Peyman Mohajerin Esfahani², Daniel Kuhn³•Institutions (3)

Massachusetts Institute of Technology¹, Delft University of Technology², École Polytechnique Fédérale de Lausanne³

01 Jun 2021-Management Science

TL;DR: This paper proposes a meta-optimization problem to find the least conservative predictors and prescriptors subject to constraints on their out-of-sample disappointment and proves that the best predictor-prescriptor-pair is obtained by solving a distributionally robust optimization problem over all distributions within a given relative entropy distance from the empirical distribution of the data.

...read moreread less

Abstract: We study stochastic programs where the decision-maker cannot observe the distribution of the exogenous uncertainties but has access to a finite set of independent samples from this distribution. In this setting, the goal is to find a procedure that transforms the data to an estimate of the expected cost function under the unknown data-generating distribution, i.e., a predictor, and an optimizer of the estimated cost function that serves as a near-optimal candidate decision, i.e., a prescriptor. As functions of the data, predictors and prescriptors constitute statistical estimators. We propose a meta-optimization problem to find the least conservative predictors and prescriptors subject to constraints on their out-of-sample disappointment. The out-of-sample disappointment quantifies the probability that the actual expected cost of the candidate decision under the unknown true distribution exceeds its predicted cost. Leveraging tools from large deviations theory, we prove that this meta-optimization problem admits a unique solution: The best predictor-prescriptor pair is obtained by solving a distributionally robust optimization problem over all distributions within a given relative entropy distance from the empirical distribution of the data.

...read moreread less

135 citations

Journal Article•10.1088/1751-8121/ABFAC7•

On barren plateaus and cost function locality in variational quantum algorithms

[...]

Alexey Uvarov, Jacob Biamonte

18 Jun 2021-Journal of Physics A

TL;DR: In this paper, the authors derived a lower bound on the variance of the gradient, which depends mainly on the width of the circuit causal cone of each term in the Pauli decomposition of the cost function.

...read moreread less

Abstract: Variational quantum algorithms rely on gradient based optimization to iteratively minimize a cost function evaluated by measuring output(s) of a quantum processor. A barren plateau is the phenomenon of exponentially vanishing gradients in sufficiently expressive parametrized quantum circuits. It has been established that the onset of a barren plateau regime depends on the cost function, although the particular behavior has been demonstrated only for certain classes of cost functions. Here we derive a lower bound on the variance of the gradient, which depends mainly on the width of the circuit causal cone of each term in the Pauli decomposition of the cost function. Our result further clarifies the conditions under which barren plateaus can occur.

...read moreread less

124 citations

Journal Article•10.1109/TIT.2021.3062161•

Deep Neural Network Approximation Theory

[...]

Dennis Elbrächter¹, Dmytro Perekrestenko², Philipp Grohs¹, Helmut Bölcskei²•Institutions (2)

University of Vienna¹, ETH Zurich²

24 Feb 2021-IEEE Transactions on Information Theory

TL;DR: In this paper, it was shown that deep networks are Kolmogorov-optimal approximants for unit balls in Besov spaces and modulation spaces, and that for sufficiently smooth functions finite-width deep networks require strictly smaller connectivity than finite-depth wide networks.

...read moreread less

Abstract: This paper develops fundamental limits of deep neural network learning by characterizing what is possible if no constraints are imposed on the learning algorithm and on the amount of training data. Concretely, we consider Kolmogorov-optimal approximation through deep neural networks with the guiding theme being a relation between the complexity of the function (class) to be approximated and the complexity of the approximating network in terms of connectivity and memory requirements for storing the network topology and the associated quantized weights. The theory we develop establishes that deep networks are Kolmogorov-optimal approximants for markedly different function classes, such as unit balls in Besov spaces and modulation spaces. In addition, deep networks provide exponential approximation accuracy—i.e., the approximation error decays exponentially in the number of nonzero weights in the network—of the multiplication operation, polynomials, sinusoidal functions, and certain smooth functions. Moreover, this holds true even for one-dimensional oscillatory textures and the Weierstrass function—a fractal function, neither of which has previously known methods achieving exponential approximation accuracy. We also show that in the approximation of sufficiently smooth functions finite-width deep networks require strictly smaller connectivity than finite-depth wide networks.

...read moreread less

120 citations

Journal Article•10.1109/TCYB.2020.2970969•

Policy Iteration Q -Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems

[...]

Biao Luo¹, Yin Yang², Derong Liu³•Institutions (3)

Central South University¹, Khalifa University², Guangdong University of Technology³

23 Jun 2021-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: In this article, the data-based two-player zero-sum game problem is considered for linear discrete-time systems and it is proved that the PIQL algorithm is equivalent to the Newton iteration method in the Banach space by using the Fréchet derivative.

...read moreread less

Abstract: In this article, the data-based two-player zero-sum game problem is considered for linear discrete-time systems. This problem theoretically depends on solving the discrete-time game algebraic Riccati equation (DTGARE), while it requires complete system dynamics. To avoid solving the DTGARE, the $Q$ -function is introduced and a data-based policy iteration $Q$ -learning (PIQL) algorithm is developed to learn the optimal $Q$ -function by using data collected from the real system. Writing the $Q$ -function in a quadratic form, it is proved that the PIQL algorithm is equivalent to the Newton iteration method in the Banach space by using the Frechet derivative. Then, the convergence of the PIQL algorithm can be guaranteed by Kantorovich’s theorem. For the realization of the PIQL algorithm, the off-policy learning scheme is proposed using real data rather than the system model. Finally, the efficiency of the developed data-based PIQL method is validated through simulation studies.

...read moreread less

120 citations

Journal Article•10.1016/J.NEUNET.2021.04.011•

Neural network approximation: Three hidden layers are enough.

[...]

Zuowei Shen¹, Haizhao Yang², Shijun Zhang¹•Institutions (2)

National University of Singapore¹, Purdue University²

17 Apr 2021-Neural Networks

TL;DR: A three-hidden-layer neural network with super approximation power is introduced, which overcomes the curse of dimensionality in approximation power when the variation of ωf(r) as r→0 is moderate and is extended to general bounded continuous functions on a bounded set E⊆Rd.

...read moreread less

Journal Article•10.1109/TSMC.2021.3050993•

Approximation-Based Nussbaum Gain Adaptive Control of Nonlinear Systems With Periodic Disturbances

[...]

Hui Ma¹, Hongru Ren¹, Qi Zhou¹, Renquan Lu¹, Hongyi Li¹ - Show less +1 more•Institutions (1)

Guangdong University of Technology¹

29 Jan 2021-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: The Fourier series expansion and radial basis function neural network are incorporated into a function approximator to model time-varying-disturbed function with a known period in nonlinear systems to deal with the problems of the dead zone output and unknown control direction.

...read moreread less

Abstract: This article considers the Nussbaum gain adaptive control issue for a type of nonlinear systems, in which some sophisticated and challenging problems, such as periodic disturbances, dead zone output, and unknown control direction are addressed. The Fourier series expansion and radial basis function neural network are incorporated into a function approximator to model time-varying-disturbed function with a known period in nonlinear systems. To deal with the problems of the dead zone output and unknown control direction, the Nussbaum-type function is recommended in the design of the control algorithm. Applying the Lyapunov stability theory and backstepping technique, the proposed control strategy ensures that the tracking error is pulled back to a small neighborhood of origin and all closed-loop signals are bounded. Finally, simulation results are presented to show the availability and validity of the analysis approach.

...read moreread less

Journal Article•10.1109/TSMC.2019.2915663•

Logarithmic Hyperbolic Cosine Adaptive Filter and Its Performance Analysis

[...]

Shiyuan Wang¹, Wenyue Wang¹, Kui Xiong¹, Herbert Ho-Ching Iu², Chi K. Tse³ - Show less +1 more•Institutions (3)

Southwest University¹, University of Western Australia², Hong Kong Polytechnic University³

01 Apr 2021-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: A novel normalization based on the logarithmic hyperbolic cosine function is proposed to achieve the stabilization for the case of large initial weight errors, which generates a logarathmic HCAF (LHCAF) and a variable scaling factor and step-size LHCAF and VSS-LH CAF are proposed to improve the filtering accuracy and stability.

...read moreread less

Abstract: The hyperbolic cosine function with high-order errors can be utilized to improve the accuracy of adaptive filters. However, when initial weight errors are large, the hyperbolic cosine-based adaptive filter (HCAF) may be unstable. In this paper, a novel normalization based on the logarithmic hyperbolic cosine function is proposed to achieve the stabilization for the case of large initial weight errors, which generates a logarithmic HCAF (LHCAF). Actually, the cost function of LHCAF is the logarithmic hyperbolic cosine function that is robust to large errors and smooth to small errors. The transient and steady-state analyses of LHCAF in terms of the mean-square deviation (MSD) are performed for a stationary white input with an even probability density function in a stationary zero-mean white noise. The convergence and stability of LHCAF can be therefore guaranteed as long as the filtering parameters satisfy certain conditions. The theoretical results based on the MSD are supported by the simulations. In addition, a variable scaling factor and step-size LHCAF (VSS-LHCAF) is proposed to improve the filtering accuracy of LHCAF further. The proposed LHCAF and VSS-LHCAF are superior to HCAF and other robust adaptive filters in terms of filtering accuracy and stability.

...read moreread less

Journal Article•10.1016/J.JCP.2020.109985•

DeepMoD: Deep learning for model discovery in noisy data

[...]

Gert-Jan Both, Subham Choudhury, Pierre Sens, Remy Kusters

01 Mar 2021-Journal of Computational Physics

TL;DR: DeepMoD as discussed by the authors discovers the partial differential equation underlying a spatio-temporal data set using sparse regression on a library of possible functions and their derivatives, using a neural network as function approximator and its output to construct the function library, allowing to perform the sparse regression within the neural network.

...read moreread less

Journal Article•10.1109/TSMC.2020.2994808•

Global Predefined Time and Accuracy Adaptive Neural Network Control for Uncertain Strict-Feedback Systems With Output Constraint and Dead Zone

[...]

Junkang Ni¹, Peng Shi²•Institutions (2)

Northwestern Polytechnical University¹, University of Adelaide²

01 Dec 2021-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: A global adaptive neural network tracking control method for uncertain strict-feedback nonlinear system with output constraint and dead zone to achieve predefined-time convergence of the tracking error to predefined accuracy within predefined time is proposed.

...read moreread less

Abstract: This article proposes a global adaptive neural network tracking control method for uncertain strict-feedback nonlinear system with output constraint and dead zone to achieve predefined-time convergence of the tracking error to predefined accuracy. First, an integral-type Barrier Lyapunov function (BLF) is constructed to handle output constraint. Next, radial basis function neural network (RBFNN) control and robust control are used to tackle unknown nonlinear function. The continuous switching function is designed to switch RBFNN control to robust control when the arguments of the unknown function exceed the active region of neural network. Utilizing the property of radial basis function, we derive the upper bound of the term containing the unknown nonlinear function and design adaptive laws to determine the derived upper bound and robust control gain. Then, the predefined time virtual control inputs are obtained and their derivatives are estimated by finite time differentiator. Finally, we use the dead zone inverse technique to obtain the actual control input. Stability analysis shows the presented control scheme achieves global convergence of the errors to predefined accuracy within predefined time. The simulation results verify the effectiveness of the presented control scheme.

...read moreread less

Journal Article•10.3982/ECTA18312•

Extreme Points and Majorization: Economic Applications

[...]

Andreas Kleiner, Benny Moldovanu, Philipp Strack

01 Jul 2021-Econometrica

TL;DR: In this paper, the authors characterize the set of extreme points of monotonic functions that are either majorized by a given function f or themselves majorize f and show that these extreme points play a crucial role in many economic design problems.

...read moreread less

Abstract: We characterize the set of extreme points of monotonic functions that are either majorized by a given function f or themselves majorize f and show that these extreme points play a crucial role in many economic design problems. Our main results show that each extreme point is uniquely characterized by a countable collection of intervals. Outside these intervals the extreme point equals the original function f and inside the function is constant. Further consistency conditions need to be satisfied pinning down the value of an extreme point in each interval where it is constant. We apply these insights to a varied set of economic problems: equivalence and optimality of mechanisms for auctions and (matching) contests, Bayesian persuasion, optimal delegation, and decision making under uncertainty.

...read moreread less

Journal Article•10.1109/TAC.2020.2997282•

Robustness for Stability and Stabilization of Boolean Networks With Stochastic Function Perturbations

[...]

Haitao Li¹, Xinrong Yang¹, Shuling Wang¹•Institutions (1)

Shandong Normal University¹

01 Mar 2021-IEEE Transactions on Automatic Control

TL;DR: The robust stability and stabilization of Boolean networks with stochastic function perturbations is studied and it is proved that the finite-time stability is reduced to stability in distribution when the intersection of perturbed set and complement set of parameterized set is nonempty.

...read moreread less

Abstract: In genetic regulatory networks (GRNs), gene mutations often occur in a stochastic manner. As an important model of GRNs, gene mutations of Boolean networks are always described as function perturbations. This article studies the robust stability and stabilization of Boolean networks with stochastic function perturbations. A kind of parameterized set is constructed, and it is revealed that under the stochastic function perturbations, the property of finite-time stability remains unchanged when the perturbed set and the parameterized set are disjoint. In addition, it is proved that the finite-time stability is reduced to stability in distribution when the intersection of perturbed set and complement set of parameterized set is nonempty. As an application, the robust stabilization problem of Boolean control networks with stochastic function perturbations is discussed, and several necessary and sufficient conditions are presented for the robustness of feedback stabilizers. Finally, the obtained results are used to study the Drosophila melanogaster segmentation polarity gene network and the lac operon in the bacterium Escherichia coil.

...read moreread less

Journal Article•10.1016/J.NEUNET.2021.01.020•

The Kolmogorov-Arnold representation theorem revisited.

[...]

Johannes Schmidt-Hieber¹•Institutions (1)

University of Twente¹

29 Jan 2021-Neural Networks

TL;DR: In this article, the authors derive modifications of the Kolmogorov-Arnold representation that transfer smoothness properties of the represented function to the outer function and can be well approximated by ReLU networks.

...read moreread less

Journal Article•10.1093/ECTJ/UTAA027•

Debiased machine learning of conditional average treatment effects and other causal functions

[...]

Vira Semenova¹, Victor Chernozhukov²•Institutions (2)

University of California, Berkeley¹, Massachusetts Institute of Technology²

29 Jun 2021-Econometrics Journal

TL;DR: In this article, the best linear predictor of a structural function is estimated using machine learning techniques, such as conditional average structural and treatment effects, and structural derivatives, based on modern machine learning (ML) tools.

...read moreread less

Abstract: This paper provides estimation and inference methods for the best linear predictor (approximation) of a structural function, such as conditional average structural and treatment effects, and structural derivatives, based on modern machine learning (ML) tools. We represent this structural function as a conditional expectation of an unbiased signal that depends on a nuisance parameter, which we estimate by modern machine learning techniques. We first adjust the signal to make it insensitive (Neyman-orthogonal) with respect to the first-stage regularization bias. We then project the signal onto a set of basis functions, growing with sample size, which gives us the best linear predictor of the structural function. We derive a complete set of results for estimation and simultaneous inference on all parameters of the best linear predictor, conducting inference by Gaussian bootstrap. When the structural function is smooth and the basis is sufficiently rich, our estimation and inference result automatically targets this function. When basis functions are group indicators, the best linear predictor reduces to group average treatment/structural effect, and our inference automatically targets these parameters. We demonstrate our method by estimating uniform confidence bands for the average price elasticity of gasoline demand conditional on income.

...read moreread less

Journal Article•10.1109/TCYB.2019.2939424•

Adaptive Neural Network-Based Finite-Time Online Optimal Tracking Control of the Nonlinear System With Dead Zone

[...]

Liang Ding¹, Shu Li¹, Haibo Gao¹, Yan-Jun Liu², Lan Huang¹, Zongquan Deng¹ - Show less +2 more•Institutions (2)

Harbin Institute of Technology¹, Liaoning University of Technology²

01 Jan 2021-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: Considering the uncertain nonstrict nonlinear system with dead-zone input, an adaptive neural network (NN)-based finite-time online optimal tracking control algorithm is proposed, using the tracking errors and the Lipschitz linearized desired tracking function as the new state vector.

...read moreread less

Abstract: Considering the uncertain nonstrict nonlinear system with dead-zone input, an adaptive neural network (NN)-based finite-time online optimal tracking control algorithm is proposed. By using the tracking errors and the Lipschitz linearized desired tracking function as the new state vector, an extended system is present. Then, a novel Hamilton–Jacobi–Bellman (HJB) function is defined to associate with the nonquadratic performance function. Further, the upper limit of integration is selected as the finite-time convergence time, in which the dead-zone input is considered. In addition, the Bellman error function can be obtained from the Hamiltonian function. Then, the adaptations of the critic and action NN are updated by using the gradient descent method on the Bellman error function. The semiglobal practical finite-time stability (SGPFS) is guaranteed, and the tracking errors convergence to a compact set by zero in a finite time.

...read moreread less

Proceedings Article•10.18653/V1/2021.ACL-LONG.336•

PairRE: Knowledge Graph Embeddings via Paired Relation Vectors

[...]

Linlin Chao, Jianshan He, Taifeng Wang¹, Wei Chu²•Institutions (2)

Alibaba Group¹, Sichuan University²

1 Aug 2021

TL;DR: PairRE as mentioned in this paper proposes PairRE, a model with paired vectors for each relation representation, which enable an adaptive adjustment of the margin in loss function to fit for different complex relations.

...read moreread less

Abstract: Distance based knowledge graph embedding methods show promising results on link prediction task, on which two topics have been widely studied: one is the ability to handle complex relations, such as N-to-1, 1-to-N and N-to-N, the other is to encode various relation patterns, such as symmetry/antisymmetry. However, the existing methods fail to solve these two problems at the same time, which leads to unsatisfactory results. To mitigate this problem, we propose PairRE, a model with paired vectors for each relation representation. The paired vectors enable an adaptive adjustment of the margin in loss function to fit for different complex relations. Besides, PairRE is capable of encoding three important relation patterns, symmetry/antisymmetry, inverse and composition. Given simple constraints on relation representations, PairRE can encode subrelation further. Experiments on link prediction benchmarks demonstrate the proposed key capabilities of PairRE. Moreover, We set a new state-of-the-art on two knowledge graph datasets of the challenging Open Graph Benchmark.

...read moreread less

Journal Article•10.1103/PHYSREVLETT.126.056402•

Nevanlinna Analytical Continuation.

[...]

Jiani Fei¹, Chia-Nan Yeh¹, Emanuel Gull¹•Institutions (1)

University of Michigan¹

04 Feb 2021-Physical Review Letters

TL;DR: It is shown that explicitly respecting the analytic "Nevanlinna" structure of the Green's function leads to intrinsically positive and normalized spectral functions, and a continued fraction expansion that yields all possible functions consistent with the analytic structure is presented.

...read moreread less

Abstract: Simulations of finite temperature quantum systems provide imaginary frequency Green's functions that correspond one to one to experimentally measurable real-frequency spectral functions. However, due to the bad conditioning of the continuation transform from imaginary to real frequencies, established methods tend to either wash out spectral features at high frequencies or produce spectral functions with unphysical negative parts. Here, we show that explicitly respecting the analytic "Nevanlinna" structure of the Green's function leads to intrinsically positive and normalized spectral functions, and we present a continued fraction expansion that yields all possible functions consistent with the analytic structure. Application to synthetic trial data shows that sharp, smooth, and multipeak data is resolved accurately. Application to the band structure of silicon demonstrates that high energy features are resolved precisely. Continuations in a realistic correlated setup reveal additional features that were previously unresolved. By substantially increasing the resolution of real frequency calculations our work overcomes one of the main limitations of finite-temperature quantum simulations.

...read moreread less

Posted Content•

Equivalence of quantum barren plateaus to cost concentration and narrow gorges.

[...]

Andrew Arrasmith, Zoë Holmes, Marco Cerezo, Patrick J. Coles

12 Apr 2021-arXiv: Quantum Physics

TL;DR: In this article, the authors investigate the connection between three different landscape features that have been observed for PQCs: (1) exponentially vanishing gradients (called barren plateaus), (2) exponential cost concentration about the mean, and (3) the exponential narrowness of minina.

...read moreread less

Abstract: Optimizing parameterized quantum circuits (PQCs) is the leading approach to make use of near-term quantum computers. However, very little is known about the cost function landscape for PQCs, which hinders progress towards quantum-aware optimizers. In this work, we investigate the connection between three different landscape features that have been observed for PQCs: (1) exponentially vanishing gradients (called barren plateaus), (2) exponential cost concentration about the mean, and (3) the exponential narrowness of minina (called narrow gorges). We analytically prove that these three phenomena occur together, i.e., when one occurs then so do the other two. A key implication of this result is that one can numerically diagnose barren plateaus via cost differences rather than via the computationally more expensive gradients. More broadly, our work shows that quantum mechanics rules out certain cost landscapes (which otherwise would be mathematically possible), and hence our results are interesting from a quantum foundations perspective.

...read moreread less

Journal Article•10.1093/IMANUM/DRAA038•

Accurately Computing the Log-Sum-Exp and Softmax Functions

[...]

Pierre Blanchard¹, Desmond J. Higham², Nicholas J. Higham¹•Institutions (2)

University of Manchester¹, University of Edinburgh²

22 Oct 2021-Ima Journal of Numerical Analysis

TL;DR: It is concluded, based on the analysis and numerical experiments, that the shifted formulas are of similar accuracy to the unshifted ones, so can safely be used, but that a division-free variant of softmax can suffer from loss of accuracy.

...read moreread less

Abstract: Evaluating the log-sum-exp function or the softmax function is a key step in many modern data science algorithms, notably in inference and classification. Because of the exponentials that these functions contain, the evaluation is prone to overflow and underflow, especially in low precision arithmetic. Software implementations commonly use alternative formulas that avoid overflow and reduce the chance of harmful underflow, employing a shift or another rewriting. Although mathematically equivalent, these variants behave differently in floating-point arithmetic ew{and shifting can introduce subtractive cancellation}. We give rounding error analyses of different evaluation algorithms and interpret the error bounds using condition numbers for the functions. We conclude, based on the analysis and numerical experiments, that the shifted formulas are of similar accuracy to the unshifted ones, so can safely be used, but that a division-free variant of softmax can suffer from loss of accuracy.

...read moreread less

Journal Article•10.1016/J.ESWA.2021.115633•

A level set method based on additive bias correction for image segmentation

[...]

Guirong Weng¹, Bin Dong¹, Yu Lei¹•Institutions (1)

Soochow University (Suzhou)¹

15 Dec 2021-Expert Systems With Applications

TL;DR: In this article, an additive bias correction (ABC) model based on intensity inhomogeneity is proposed, which divides the observed image into three parts: additive bias function, reflection edge structure function and Gaussian noise.

...read moreread less

Abstract: Intensity inhomogeneity brings great difficulties to image segmentation. This problem is partly solved by the multiplicative bias field correction model. However, some other problems still exist, such as slow segmentation speed and narrow application field. In this paper, an additive bias correction (ABC) model based on intensity inhomogeneity is proposed. The model divides the observed image into three parts: additive bias function, reflection edge structure function and Gaussian noise. Firstly, the local area and local clustering criterion of intensity inhomogeneity are defined. Secondly, by introducing the level set function, the local clustering criterion is transformed into an energy function based on the level set model. Finally, the structure of the estimated bias field and the reflection edge is computed through the process of minimizing the energy function while the image is segmented. In order to improve the stability of the system, a de-parameterized regularization function and an adaptive data-driven term function are designed. Compared with the traditional multiplicative model, the addition model has faster calculation speed. The proposed model can obtain ideal segmentation effect for images with intensity inhomogeneity. Experiment results show that the proposed method is more robust, faster and more accurate than traditional piecewise and multiplicative models.

...read moreread less

Journal Article•10.1016/J.CMA.2020.113453•

Stress constrained multi-material topology optimization with the ordered SIMP method

[...]

Shuzhi Xu¹, Shuzhi Xu², Jikai Liu², Bin Zou², Quhao Li², Yongsheng Ma¹ - Show less +2 more•Institutions (2)

University of Alberta¹, Shandong University²

01 Jan 2021-Computer Methods in Applied Mechanics and Engineering

TL;DR: A novel ordered SIMP-like interpolation function is proposed to realize the relaxed and scaled stress interpolation, so that only a unique set of density variables are required for the SMMTO problem.

...read moreread less

Journal Article•10.1109/TSP.2021.3086579•

Distributed Algorithms for Composite Optimization: Unified Framework and Convergence Analysis

[...]

Jinming Xu¹, Ye Tian², Ying Sun³, Gesualdo Scutari²•Institutions (3)

Zhejiang University¹, Purdue University², Pennsylvania State University³

07 Jun 2021-IEEE Transactions on Signal Processing

TL;DR: A by-product of this analysis is a tuning recommendation for several existing (non-accelerated) distributed algorithms yielding provably faster (worst-case) convergence rate for the class of problems under consideration.

...read moreread less

Abstract: We study distributed composite optimization over networks: agents minimize a sum of smooth (strongly) convex functions–the agents’ sum-utility–plus a nonsmooth (extended-valued) convex one. We propose a general unified algorithmic framework for such a class of problems and provide a convergence analysis leveraging the theory of operator splitting. Distinguishing features of our scheme are: (i) When each of the agent’s functions is strongly convex, the algorithm converges at a linear rate, whose dependence on the agents’ functions and network topology is decoupled ; (ii) When the objective function is convex (but not strongly convex), similar decoupling as in (i) is established for the coefficient of the proved sublinear rate. This also reveals the role of function heterogeneity on the convergence rate. (iii) The algorithm can adjust the ratio between the number of communications and computations to achieve a rate (in terms of computations) independent on the network connectivity; and (iv) A by-product of our analysis is a tuning recommendation for several existing (non-accelerated) distributed algorithms yielding provably faster (worst-case) convergence rate for the class of problems under consideration.

...read moreread less

Journal Article•10.1016/J.CIE.2020.107049•

Novel approach to multi-attribute group decision-making based on interval-valued Pythagorean fuzzy power Maclaurin symmetric mean operator

[...]

Zhimin Mu¹, Shouzhen Zeng², Shouzhen Zeng³, Pinyue Wang•Institutions (3)

Tianjin Agricultural University¹, Zhejiang Gongshang University², Ningbo University³

01 May 2021-Computers & Industrial Engineering

TL;DR: A new comparison rule is obtained, whereby two different IVPFNs may be distinguished, and the proposed operators in MAGDM problems can eliminate bad influences of extreme evaluation values from biased decision makers and capture the interaction between attributes.

...read moreread less

Posted Content•

Certified dimension reduction in nonlinear Bayesian inverse problems

[...]

Olivier Zahm, Tiangang Cui¹, Kody J. H. Law², Alessio Spantini³, Youssef M. Marzouk³ - Show less +1 more•Institutions (3)

Monash University¹, University of Manchester², Massachusetts Institute of Technology³

09 Mar 2021-arXiv: Probability

TL;DR: A dimension reduction technique for Bayesian inverse problems with nonlinear forward operators, non-Gaussian priors, and non- Gaussian observation noise is proposed and an analysis that enables control of the posterior approximation error due to this sampling is provided.

...read moreread less

Abstract: We propose a dimension reduction technique for Bayesian inverse problems with nonlinear forward operators, non-Gaussian priors, and non-Gaussian observation noise. The likelihood function is approximated by a ridge function, i.e., a map which depends non-trivially only on a few linear combinations of the parameters. We build this ridge approximation by minimizing an upper bound on the Kullback-Leibler divergence between the posterior distribution and its approximation. This bound, obtained via logarithmic Sobolev inequalities, allows one to certify the error of the posterior approximation. Computing the bound requires computing the second moment matrix of the gradient of the log-likelihood function. In practice, a sample-based approximation of the upper bound is then required. We provide an analysis that enables control of the posterior approximation error due to this sampling. Numerical and theoretical comparisons with existing methods illustrate the benefits of the proposed methodology.

...read moreread less

Journal Article•10.1109/TFUZZ.2021.3097811•

Feature selection with fuzzy-rough minimum classification error criterion

[...]

Changzhong Wang¹, Yuhua Qian², Weiping Ding³, Xiaodong Feng¹•Institutions (3)

Bohai University¹, Shanxi University², Nantong University³

19 Jul 2021-IEEE Transactions on Fuzzy Systems

TL;DR: A novel concept of dependency is proposed: inner product dependency to describe the classification error, and a criterion function to evaluate the importance of candidate features is proposed to overcome this weakness.

...read moreread less

Abstract: Classical fuzzy rough set often uses fuzzy rough dependency as an evaluation function of feature selection. However, this function only retains the maximum member- ship degree of a sample to one decision class, it can not describe the classification error. Therefore, in this work, a novel criterion function for feature selection is proposed to overcome this weakness. To characterize the classification error rate, we first introduce a class of irreflexive and symmetric fuzzy binary relations to redefine the concepts of fuzzy rough approximations. Then, we propose a novel concept of dependency: inner product dependency to describe the classification error, and construct a criterion function to evaluate the importance of candidate features. The proposed criterion function not only can maintain a maximum dependency function, but also guarantees the minimum classification error. The experimental analysis shows that the proposed criterion function is effective for data sets with a large overlap between different categories.

...read moreread less

...

Expand