Two-dimensional solution path for support vector regression

doi:10.1145/1143844.1143969

Open Access•Proceedings Article•10.1145/1143844.1143969

Gang Wang, +2 more

- 25 Jun 2006

- Vol. 148, pp 993-1000

39

TL;DR: This paper shows that the solution path for ε-SVR is also piecewise linear with respect to ε, and proposes an efficient algorithm for exploring the two-dimensional solution space defined by the regularization and error parameters.
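As a rough illustration of the two parameters named in the summary, the sketch below evaluates an ε-insensitive loss and a regularized objective for a one-dimensional linear model. The function names, the `lam / 2` scaling of the penalty, and the toy data are assumptions made for illustration and are not taken from the paper; the code does not compute a solution path.

```python
def eps_insensitive_loss(residual, eps):
    """Standard epsilon-insensitive loss: zero inside the tube, linear outside."""
    return max(0.0, abs(residual) - eps)


def svr_objective(xs, ys, beta0, beta, lam, eps):
    """Objective for f(x) = beta0 + beta * x with error parameter eps
    and regularization parameter lam.

    Assumption: the penalty is written as (lam / 2) * beta**2; the paper
    may use a different but equivalent scaling.
    """
    data_term = sum(
        eps_insensitive_loss(y - (beta0 + beta * x), eps)
        for x, y in zip(xs, ys)
    )
    return data_term + 0.5 * lam * beta ** 2


if __name__ == "__main__":
    # Hypothetical toy data, only to show how (lam, eps) enter the objective.
    xs = [0.0, 1.0, 2.0]
    ys = [0.0, 1.0, 4.0]
    print(svr_objective(xs, ys, beta0=0.0, beta=1.0, lam=2.0, eps=0.5))
```

Each pair (λ, ε) defines a different objective, which is why exploring the whole two-dimensional parameter space by retraining on a grid is costly compared with following a path.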

Figures

Figure 1. Linear SVR results for four different combinations of values for λ and ε. (a) proper values of λ and ε are specified; (b) λ = ∞; (c) ε > (ymax − ymin)/2; (d) ε < (ymax − ymin)/2, but all the data points are inside the ε-tube.

Figure 4. Based on three ε-paths with λ = 1 and γ = 0.05, 0.5, 5, the optimal solution for each path in terms of the mean squared error on the validation set is plotted.

Figure 5. Change in elbow size as a function of ε for three ε-paths with λ = 1 and γ = 0.05, 0.5, 5. Since ε decreases rapidly in the beginning, the horizontal axis is shown in log scale.

Figure 7. Shifting from the ε-path algorithm to the λ-path algorithm at four shifting points with different values of ε. The horizontal axis is in log scale.

Figure 2. The set of data points is partitioned into five subsets according to the ε-insensitive loss function.

Figure 6. Relationships between MSE, ε, and the number of steps in the algorithm for different values of λ. (a) MSE vs. ε, with the horizontal axis in log scale; (b) ε vs. number of steps; (c) MSE vs. number of steps.
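The five-way partition mentioned in the Figure 2 caption can be sketched as follows. This is a minimal illustration, assuming the subsets are points outside the tube on either side, points exactly on either tube boundary (the "elbows"), and points strictly inside the tube. The set names `L`, `E_L`, `C`, `E_R`, `R`, the sign convention for residuals, and the tolerance `tol` are assumptions, not definitions taken from the paper.

```python
def partition_by_tube(residuals, eps, tol=1e-9):
    """Split point indices into five sets by residual r_i = y_i - f(x_i).

    Assumed naming (hypothetical):
      L   : r < -eps   (outside the tube, one side)
      E_L : r == -eps  (on the boundary, within tol)
      C   : |r| < eps  (strictly inside the tube)
      E_R : r == +eps  (on the boundary, within tol)
      R   : r > +eps   (outside the tube, other side)
    """
    sets = {"L": [], "E_L": [], "C": [], "E_R": [], "R": []}
    for i, r in enumerate(residuals):
        if abs(r + eps) <= tol:
            sets["E_L"].append(i)
        elif abs(r - eps) <= tol:
            sets["E_R"].append(i)
        elif r < -eps:
            sets["L"].append(i)
        elif r > eps:
            sets["R"].append(i)
        else:
            sets["C"].append(i)
    return sets


if __name__ == "__main__":
    # Hypothetical residuals chosen so that every set receives one point.
    print(partition_by_tube([-1.0, -0.5, 0.0, 0.5, 2.0], eps=0.5))
```

Under this reading, the case in Figure 1(d), where all data points are inside the ε-tube, corresponds to every index landing in `C` or on the boundary sets.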

Citations

•Dissertation•10.3929/ETHZ-A-007050453

Sparse convex optimization methods for machine learning

Martin Jaggi

- 01 Jan 2011

TL;DR: A convergence proof guaranteeing ε-small error after O(1/ε) iterations is given, and the sparsity of approximate solutions for any ℓ1-regularized convex optimization problem (and for optimization over the simplex) is expressed as a function of the approximation quality.

107

Journal Article•10.1080/10556780802102586

Classification model selection via bilevel programming

Gautam Kunapuli, +3 more

- 01 Aug 2008

- Optimization Methods & Software

TL;DR: This work proposes a bilevel program that is significantly more versatile than commonly used grid search procedures, enabling the use of models with many hyper-parameters, and demonstrates the practicality of this approach for model selection in machine learning.

95

Journal Article•10.1007/S10822-007-9125-Z

Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules

Timon Schroeter, +7 more

- 01 Dec 2007

- Journal of Computer-aided Molecular Desi...

TL;DR: This work investigates the use of different machine learning methods to construct models for aqueous solubility, evaluating all approaches in terms of their prediction accuracy and how faithfully the individual error bars represent the actual prediction error.

83

•Proceedings Article•10.1145/1273496.1273616

A kernel path algorithm for support vector machines

Gang Wang, +2 more

- 20 Jun 2007

TL;DR: This paper learns the hyperparameter of the kernel function for a support vector machine (SVM) without having to train the model multiple times, and finds that the solutions of the neighborhood hyperparameters can be calculated exactly.

57

Journal Article•10.1021/MP0700413

Machine learning models for lipophilicity and their domain of applicability.

Timon Schroeter, +7 more

- 19 Jul 2007

- Molecular Pharmaceutics

TL;DR: This study constructs a log D7 model based on 14,556 drug discovery compounds of Bayer Schering Pharma, and considers error bars for each method, and investigates how well they quantify the domain of applicability of each model.

33

References

•Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

- 01 Jan 1995

TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

46K

Book•10.7551/MITPRESS/4175.001.0001

Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

Bernhard Schölkopf, +1 more

- 01 Dec 2001

TL;DR: Learning with Kernels provides an introduction to SVMs and related kernel methods that provide all of the concepts necessary to enable a reader equipped with some basic mathematical knowledge to enter the world of machine learning using theoretically well-founded yet easy-to-use kernel algorithms.

10.2K

•Journal Article•10.1214/009053604000000067

Least angle regression

Bradley Efron, +19 more

- 01 Apr 2004

- Annals of Statistics

TL;DR: A publicly available algorithm that requires only the same order of magnitude of computational effort as ordinary least squares applied to the full set of covariates is described.

9.4K

•Journal Article•10.1162/089976600300015565

New Support Vector Algorithms

Bernhard Schölkopf, +3 more

- 01 May 2000

- Neural Computation

TL;DR: A new class of support vector algorithms for regression and classification that eliminates one of the other free parameters of the algorithm: the accuracy parameter in the regression case, and the regularization constant C in the classification case.

3K

•Proceedings Article

1-norm Support Vector Machines

Ji Zhu, +3 more

- 09 Dec 2003

TL;DR: It is argued that the 1-norm SVM may have some advantage over the standard 2-norm SVM, especially when there are redundant noise features, and an efficient algorithm is proposed that computes the whole solution path of the 1-norm SVM, hence facilitates adaptive selection of the tuning parameter for the 1

1K