Function approximation

Papers published on a yearly basis

Papers

Journal Article • DOI: 10.1214/AOS/1013203451

Greedy function approximation: A gradient boosting machine.

Jerome H. Friedman¹•Institutions (1)

Stanford University¹

01 Oct 2001-Annals of Statistics

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.

Abstract: Function estimation/approximation is viewed from the perspective of numerical optimization in function space, rather than parameter space. A connection is made between stagewise additive expansions and steepest-descent minimization. A general gradient descent “boosting” paradigm is developed for additive expansions based on any fitting criterion. Specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification. Special enhancements are derived for the particular case where the individual additive components are regression trees, and tools for interpreting such “TreeBoost” models are presented. Gradient boosting of regression trees produces competitive, highly robust, interpretable procedures for both regression and classification, especially appropriate for mining less than clean data. Connections between this approach and the boosting methods of Freund and Schapire and Friedman, Hastie and Tibshirani are discussed.

26,422 citations
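
The stagewise idea in the abstract above can be sketched for the least-squares case, where the negative gradient of the loss is simply the current residual. This is a minimal illustration, not the paper's TreeBoost procedure: it assumes one-dimensional inputs with at least two distinct values, uses depth-one stumps as the additive components, and the names `fit_stump`, `gradient_boost`, `predict`, `n_stages`, and `learning_rate` are hypothetical.

```python
def fit_stump(xs, residuals):
    """Least-squares stump on 1-D inputs: returns (threshold, left_mean, right_mean)."""
    best = None
    values = sorted(set(xs))
    # Candidate thresholds are midpoints between consecutive distinct inputs.
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2.0
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        lm = sum(left) / len(left)
        rm = sum(right) / len(right)
        sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    _, t, lm, rm = best
    return t, lm, rm


def gradient_boost(xs, ys, n_stages=50, learning_rate=0.1):
    """Stagewise additive fit for squared-error loss (illustrative sketch)."""
    f0 = sum(ys) / len(ys)            # constant initial model
    preds = [f0] * len(ys)
    stumps = []
    for _ in range(n_stages):
        # Negative gradient of 0.5 * (y - F)^2 with respect to F is the residual.
        residuals = [y - p for y, p in zip(ys, preds)]
        t, lm, rm = fit_stump(xs, residuals)
        stumps.append((t, lm, rm))
        # Take a shrunken step along the fitted component.
        preds = [p + learning_rate * (lm if x <= t else rm)
                 for x, p in zip(xs, preds)]
    return f0, stumps, learning_rate


def predict(model, x):
    """Evaluate the additive expansion at a single input."""
    f0, stumps, lr = model
    return f0 + sum(lr * (lm if x <= t else rm) for t, lm, rm in stumps)
```

Swapping the residual for the negative gradient of another loss (absolute deviation, Huber) is what makes the paradigm general; the stump fitter itself stays unchanged.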

Proceedings Article

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Richard S. Sutton¹, David McAllester¹, Satinder Singh¹, Yishay Mansour¹•Institutions (1)

AT&T Labs¹

29 Nov 1999

TL;DR: This paper proves for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

Abstract: Function approximation is essential to reinforcement learning, but the standard approach of approximating a value function and determining a policy from it has so far proven theoretically intractable. In this paper we explore an alternative approach in which the policy is explicitly represented by its own function approximator, independent of the value function, and is updated according to the gradient of expected reward with respect to the policy parameters. Williams's REINFORCE method and actor-critic methods are examples of this approach. Our main new result is to show that the gradient can be written in a form suitable for estimation from experience aided by an approximate action-value or advantage function. Using this result, we prove for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

7,133 citations
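
The update "according to the gradient of expected reward with respect to the policy parameters" can be illustrated with a REINFORCE-style sketch, which the abstract names as one example of the approach. The setting here is an assumption chosen for brevity: a stateless bandit with a softmax policy and fixed per-arm rewards, with no value-function approximator at all. The names `softmax`, `reinforce_bandit`, `alpha`, and `seed` are hypothetical.

```python
import math
import random


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_bandit(rewards, steps=2000, alpha=0.1, seed=0):
    """Sampled policy-gradient ascent on a bandit with fixed per-arm rewards."""
    rng = random.Random(seed)
    theta = [0.0] * len(rewards)      # policy parameters (action preferences)
    for _ in range(steps):
        probs = softmax(theta)
        action = rng.choices(range(len(rewards)), weights=probs)[0]
        r = rewards[action]
        for k in range(len(theta)):
            # d/d theta_k of log pi(action) for a softmax policy.
            grad_log = (1.0 if k == action else 0.0) - probs[k]
            theta[k] += alpha * r * grad_log
    return softmax(theta)


if __name__ == "__main__":
    # Arm 1 pays more, so its probability should grow during training.
    print(reinforce_bandit([0.0, 1.0]))
```

The paper's result concerns replacing the sampled return `r` with an approximate action-value or advantage function; the sketch keeps the sampled reward to stay short.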

Posted Content

Addressing Function Approximation Error in Actor-Critic Methods

Scott Fujimoto¹, Herke van Hoof², David Meger¹•Institutions (2)

McGill University¹, University of Amsterdam²

26 Feb 2018-arXiv: Artificial Intelligence

TL;DR: This paper builds on Double Q-learning, by taking the minimum value between a pair of critics to limit overestimation, and draws the connection between target networks and overestimation bias.

Abstract: In value-based reinforcement learning methods such as deep Q-learning, function approximation errors are known to lead to overestimated value estimates and suboptimal policies. We show that this problem persists in an actor-critic setting and propose novel mechanisms to minimize its effects on both the actor and the critic. Our algorithm builds on Double Q-learning, by taking the minimum value between a pair of critics to limit overestimation. We draw the connection between target networks and overestimation bias, and suggest delaying policy updates to reduce per-update error and further improve performance. We evaluate our method on the suite of OpenAI gym tasks, outperforming the state of the art in every environment tested.

4,354 citations
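
"Taking the minimum value between a pair of critics" amounts to a small change in the bootstrapped target. The sketch below shows only that step, for a single transition with scalar critic outputs; the function name, its arguments, and the default `gamma` are assumptions for illustration, and the delayed policy updates mentioned in the abstract are not shown.

```python
def clipped_double_q_target(reward, done, next_q1, next_q2, gamma=0.99):
    """Bootstrapped target using the smaller of two critic estimates.

    next_q1 and next_q2 are the two critics' values for the next state and
    the next action; using their minimum limits overestimation.
    """
    if done:
        # No bootstrapping past a terminal transition.
        return reward
    return reward + gamma * min(next_q1, next_q2)


if __name__ == "__main__":
    # The pessimistic critic (2.0) determines the target, not the larger one.
    print(clipped_double_q_target(1.0, False, 2.0, 3.0, gamma=0.5))
```

Both critics would then be regressed toward this single shared target.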

Journal Article • DOI: 10.1109/61.772353

Rational approximation of frequency domain responses by vector fitting

Bjorn Gustavsen, Adam Semlyen¹•Institutions (1)

University of Toronto¹

01 Jul 1999-IEEE Transactions on Power Delivery

TL;DR: The paper describes a general methodology for the fitting of measured or calculated frequency domain responses with rational function approximations by replacing a set of starting poles with an improved set of poles via a scaling procedure.

Abstract: The paper describes a general methodology for the fitting of measured or calculated frequency domain responses with rational function approximations. This is achieved by replacing a set of starting poles with an improved set of poles via a scaling procedure. A previous paper (Gustavsen et al., 1997) described the application of the method to smooth functions using real starting poles. This paper extends the method to functions with a high number of resonance peaks by allowing complex starting poles. Fundamental properties of the method are discussed and details of its practical implementation are described. The method is demonstrated to be very suitable for fitting network equivalents and transformer responses. The computer code is in the public domain, available from the first author.

3,386 citations

Journal Article • DOI: 10.1109/18.256500

Universal approximation bounds for superpositions of a sigmoidal function

Andrew R. Barron¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

01 May 1993-IEEE Transactions on Information Theory

TL;DR: The approximation rate and the parsimony of the parameterization of the networks are shown to be advantageous in high-dimensional settings, whereas for series expansions with n terms the integrated squared approximation error cannot be made smaller than order (1/n)^{2/d} uniformly for functions satisfying the same smoothness assumption.

Abstract: Approximation properties of a class of artificial neural networks are established. It is shown that feedforward networks with one layer of sigmoidal nonlinearities achieve integrated squared error of order O(1/n), where n is the number of nodes. The approximated function is assumed to have a bound on the first moment of the magnitude distribution of the Fourier transform. The nonlinear parameters associated with the sigmoidal nodes, as well as the parameters of linear combination, are adjusted in the approximation. In contrast, it is shown that for series expansions with n terms, in which only the parameters of linear combination are adjusted, the integrated squared approximation error cannot be made smaller than order (1/n)^{2/d} uniformly for functions satisfying the same smoothness assumption, where d is the dimension of the input to the function. For the class of functions examined, the approximation rate and the parsimony of the parameterization of the networks are shown to be advantageous in high-dimensional settings.

3,311 citations
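
The two rates quoted in the abstract above can be set side by side. The notation here is an assumption introduced for illustration: $\hat f$ is the Fourier transform of $f$, $C_f$ its first-moment bound, $f_n$ an $n$-node sigmoidal network, $g_n$ an $n$-term linear combination of fixed basis functions, and constants are omitted.

```latex
% Smoothness assumption: bounded first moment of the Fourier magnitude.
C_f = \int |\omega| \, |\hat f(\omega)| \, d\omega < \infty

% One hidden layer of n sigmoidal nodes, all parameters adjusted:
\| f - f_n \|^2 = O\!\left( \frac{1}{n} \right)

% n-term series expansion, only linear coefficients adjusted
% (worst case over functions satisfying the same assumption):
\sup_{f} \, \| f - g_n \|^2 \;\gtrsim\; \left( \frac{1}{n} \right)^{2/d}

% Worked comparison with d = 10 and n = 100, ignoring constants:
\frac{1}{n} = 0.01
\qquad \text{versus} \qquad
\left( \frac{1}{n} \right)^{2/d} = 100^{-0.2} \approx 0.40
```

For $d > 2$ the exponent $2/d$ is below 1, so the lower bound for linear expansions decays more slowly than the network rate, and the gap widens as $d$ grows.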

Year	Papers
2026	1
2025	18
2024	18
2023	59
2022	106
2021	186
