Representer theorem

Topic Tools

Papers published on a yearly basis

1 / 2

Papers

Journal Article•

Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples

[...]

Mikhail Belkin, Partha Niyogi, Vikas Sindhwani

01 Dec 2006-Journal of Machine Learning Research

TL;DR: A semi-supervised framework that incorporates labeled and unlabeled data in a general-purpose learner is proposed and properties of reproducing kernel Hilbert spaces are used to prove new Representer theorems that provide theoretical basis for the algorithms.

...read moreread less

Abstract: We propose a family of learning algorithms based on a new form of regularization that allows us to exploit the geometry of the marginal distribution. We focus on a semi-supervised framework that incorporates labeled and unlabeled data in a general-purpose learner. Some transductive graph learning algorithms and standard methods including support vector machines and regularized least squares can be obtained as special cases. We use properties of reproducing kernel Hilbert spaces to prove new Representer theorems that provide theoretical basis for the algorithms. As a result (in contrast to purely graph-based approaches) we obtain a natural out-of-sample extension to novel examples and so are able to handle both transductive and truly semi-supervised settings. We present experimental evidence suggesting that our semi-supervised algorithms are able to use unlabeled data effectively. Finally we have a brief discussion of unsupervised and fully supervised learning within our general framework.

...read moreread less

4,184 citations

Book Chapter•10.1007/3-540-44581-1_27•

A Generalized Representer Theorem

[...]

Bernhard Schölkopf¹, Bernhard Schölkopf², Ralf Herbrich¹, Ralf Herbrich², Alexander J. Smola¹ - Show less +1 more•Institutions (2)

Australian National University¹, Microsoft²

16 Jul 2001

TL;DR: The result shows that a wide range of problems have optimal solutions that live in the finite dimensional span of the training examples mapped into feature space, thus enabling us to carry out kernel algorithms independent of the (potentially infinite) dimensionality of the feature space.

...read moreread less

Abstract: Wahba's classical representer theorem states that the solutions of certain risk minimization problems involving an empirical risk term and a quadratic regularizer can be written as expansions in terms of the training examples. We generalize the theorem to a larger class of regularizers and empirical risk terms, and give a self-contained proof utilizing the feature space associated with a kernel. The result shows that a wide range of problems have optimal solutions that live in the finite dimensional span of the training examples mapped into feature space, thus enabling us to carry out kernel algorithms independent of the (potentially infinite) dimensionality of the feature space.

...read moreread less

1,990 citations

Report•10.21236/ADA446572•

Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces

[...]

Kenji Fukumizu, Francis Bach¹, Michael I. Jordan²•Institutions (2)

University of Chicago¹, University of California, Berkeley²

1 Dec 2004

TL;DR: A novel method of dimensionality reduction for supervised learning problems that requires neither assumptions on the marginal distribution of X, nor a parametric model of the conditional distribution of Y, and establishes a general nonparametric characterization of conditional independence using covariance operators on reproducing kernel Hilbert spaces.

...read moreread less

Abstract: We propose a novel method of dimensionality reduction for supervised learning problems. Given a regression or classification problem in which we wish to predict a response variable Y from an explanatory variable X, we treat the problem of dimensionality reduction as that of finding a low-dimensional "effective subspace" for X which retains the statistical relationship between X and Y. We show that this problem can be formulated in terms of conditional independence. To turn this formulation into an optimization problem we establish a general nonparametric characterization of conditional independence using covariance operators on reproducing kernel Hilbert spaces. This characterization allows us to derive a contrast function for estimation of the effective subspace. Unlike many conventional methods for dimensionality reduction in supervised learning, the proposed method requires neither assumptions on the marginal distribution of X, nor a parametric model of the conditional distribution of Y. We present experiments that compare the performance of the method with conventional methods.

...read moreread less

957 citations

Journal Article•10.1109/TKDE.2013.111•

Adaptation Regularization: A General Framework for Transfer Learning

[...]

Mingsheng Long¹, Jianmin Wang¹, Guiguang Ding¹, Sinno Jialin Pan, Philip S. Yu² - Show less +1 more•Institutions (2)

Tsinghua University¹, University of Illinois at Chicago²

01 May 2014-IEEE Transactions on Knowledge and Data Engineering

TL;DR: A novel transfer learning framework, referred to as Adaptation Regularization based Transfer Learning (ARTL), to model adaptive classifiers in a unified way based on the structural risk minimization principle and the regularization theory, and can significantly outperform state-of-the-art learning methods on several public text and image datasets.

...read moreread less

Abstract: Domain transfer learning, which learns a target classifier using labeled data from a different distribution, has shown promising value in knowledge discovery yet still been a challenging problem. Most previous works designed adaptive classifiers by exploring two learning strategies independently: distribution adaptation and label propagation. In this paper, we propose a novel transfer learning framework, referred to as Adaptation Regularization based Transfer Learning (ARTL), to model them in a unified way based on the structural risk minimization principle and the regularization theory. Specifically, ARTL learns the adaptive classifier by simultaneously optimizing the structural risk functional, the joint distribution matching between domains, and the manifold consistency underlying marginal distribution. Based on the framework, we propose two novel methods using Regularized Least Squares (RLS) and Support Vector Machines (SVMs), respectively, and use the Representer theorem in reproducing kernel Hilbert space to derive corresponding solutions. Comprehensive experiments verify that ARTL can significantly outperform state-of-the-art learning methods on several public text and image datasets.

...read moreread less

685 citations

Book•

Support vector machines, reproducing kernel Hilbert spaces, and randomized GACV

[...]

Grace Wahba

8 Feb 1999

474 citations

...

Expand

Year	Papers
2025	6
2024	20
2023	17
2022	44
2021	23
2020	24

Topic Tools

Papers published on a yearly basis

Papers

Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples

A Generalized Representer Theorem

Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces

Adaptation Regularization: A General Framework for Transfer Learning

Support vector machines, reproducing kernel Hilbert spaces, and randomized GACV

Related Topics (5)

Performance Metrics