Function (mathematics)

Topic Tools

Papers published on a yearly basis

1 / 3

Papers

Journal Article•10.1093/COMJNL/7.4.308•

A simplex method for function minimization

[...]

John A. Nelder, R. Mead¹•Institutions (1)

University of Warwick¹

01 Jan 1965-The Computer Journal

TL;DR: A method is described for the minimization of a function of n variables, which depends on the comparison of function values at the (n 41) vertices of a general simplex, followed by the replacement of the vertex with the highest value by another point.

...read moreread less

Abstract: A method is described for the minimization of a function of n variables, which depends on the comparison of function values at the (n 41) vertices of a general simplex, followed by the replacement of the vertex with the highest value by another point. The simplex adapts itself to the local landscape, and contracts on to the final minimum. The method is shown to be effective and computationally compact. A procedure is given for the estimation of the Hessian matrix in the neighbourhood of the minimum, needed in statistical estimation problems.

...read moreread less

30,616 citations

Book Chapter•10.1007/11681878_14•

Calibrating noise to sensitivity in private data analysis

[...]

Cynthia Dwork¹, Frank McSherry¹, Kobbi Nissim², Adam Smith³•Institutions (3)

Microsoft¹, Ben-Gurion University of the Negev², Weizmann Institute of Science³

4 Mar 2006

TL;DR: In this article, the authors show that for several particular applications substantially less noise is needed than was previously understood to be the case, and also show the separation results showing the increased value of interactive sanitization mechanisms over non-interactive.

...read moreread less

Abstract: We continue a line of research initiated in [10,11]on privacy-preserving statistical databases. Consider a trusted server that holds a database of sensitive information. Given a query function f mapping databases to reals, the so-called true answer is the result of applying f to the database. To protect privacy, the true answer is perturbed by the addition of random noise generated according to a carefully chosen distribution, and this response, the true answer plus noise, is returned to the user. Previous work focused on the case of noisy sums, in which f = ∑ig(xi), where xi denotes the ith row of the database and g maps database rows to [0,1]. We extend the study to general functions f, proving that privacy can be preserved by calibrating the standard deviation of the noise according to the sensitivity of the function f. Roughly speaking, this is the amount that any single argument to f can change its output. The new analysis shows that for several particular applications substantially less noise is needed than was previously understood to be the case. The first step is a very clean characterization of privacy in terms of indistinguishability of transcripts. Additionally, we obtain separation results showing the increased value of interactive sanitization mechanisms over non-interactive.

...read moreread less

8,919 citations

Proceedings Article•

Policy Gradient Methods for Reinforcement Learning with Function Approximation

[...]

Richard S. Sutton¹, David McAllester¹, Satinder Singh¹, Yishay Mansour¹•Institutions (1)

AT&T Labs¹

29 Nov 1999

TL;DR: This paper proves for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

...read moreread less

Abstract: Function approximation is essential to reinforcement learning, but the standard approach of approximating a value function and determining a policy from it has so far proven theoretically intractable. In this paper we explore an alternative approach in which the policy is explicitly represented by its own function approximator, independent of the value function, and is updated according to the gradient of expected reward with respect to the policy parameters. Williams's REINFORCE method and actor-critic methods are examples of this approach. Our main new result is to show that the gradient can be written in a form suitable for estimation from experience aided by an approximate action-value or advantage function. Using this result, we prove for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

...read moreread less

7,133 citations

Journal Article•10.1016/0893-6080(91)90009-T•

Approximation capabilities of multilayer feedforward networks

[...]

Kurt Hornik¹•Institutions (1)

Vienna University of Technology¹

01 Mar 1991-Neural Networks

TL;DR: It is shown that standard multilayer feedforward networks with as few as a single hidden layer and arbitrary bounded and nonconstant activation function are universal approximators with respect to L p (μ) performance criteria, for arbitrary finite input environment measures μ.

...read moreread less

6,753 citations

Journal Article•10.1109/TASSP.1978.1163055•

Dynamic programming algorithm optimization for spoken word recognition

[...]

H. Sakoe¹, S. Chiba¹•Institutions (1)

NEC¹

01 Feb 1978-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition, in which the warping function slope is restricted so as to improve discrimination between words in different categories.

...read moreread less

Abstract: This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition. First, a general principle of time-normalization is given using time-warping function. Then, two time-normalized distance definitions, called symmetric and asymmetric forms, are derived from the principle. These two forms are compared with each other through theoretical discussions and experimental studies. The symmetric form algorithm superiority is established. A new technique, called slope constraint, is successfully introduced, in which the warping function slope is restricted so as to improve discrimination between words in different categories. The effective slope constraint characteristic is qualitatively analyzed, and the optimum slope constraint condition is determined through experiments. The optimized algorithm is then extensively subjected to experimental comparison with various DP-algorithms, previously applied to spoken word recognition by different research groups. The experiment shows that the present algorithm gives no more than about two-thirds errors, even compared to the best conventional algorithm.

...read moreread less

6,712 citations

...

Expand

Year	Papers
2022	88
2021	3,535
2020	3,575
2019	3,556
2018	3,408
2017	2,974

Topic Tools

Papers published on a yearly basis

Papers

A simplex method for function minimization

Calibrating noise to sensitivity in private data analysis

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Approximation capabilities of multilayer feedforward networks

Dynamic programming algorithm optimization for spoken word recognition

Related Topics (5)

Performance Metrics