Regression with multiple candidate models: selecting or mixing?

Open Access Journal Article

Yuhong Yang

- 01 Jan 1999

- Statistica Sinica

- Vol. 13, Iss: 3, pp 783-809

163

TL;DR: An improved risk bound for ARM is obtained and it is demonstrated that when AIC and BIC are combined, the mixed estimator automatically behaves like the better one, and ARM also performs better than BMA techniques based on BIC approximation.

Abstract: Model combining (mixing) provides an alternative to model selection. An algorithm ARM was recently proposed by the author to combine different regression models/methods. In this work, an improved risk bound for ARM is obtained. In addition to some theoretical observations on the issue of selection versus combining, simulations are conducted in the context of linear regression to compare performance of ARM with the familiar model selection criteria AIC and BIC, and also with some Bayesian model averaging (BMA) methods. The simulation suggests the following. Selection can yield a smaller risk when the random error is weak relative to the signal. However, when the random noise level gets higher, ARM produces a better or even much better estimator. That is, mixing appropriately is advantageous when there is a certain degree of uncertainty in choosing the best model. In addition, it is demonstrated that when AIC and BIC are combined, the mixed estimator automatically behaves like the better one. A comparison with bagging (Breiman (1996)) suggests that ARM does better than simply stabilizing model selection estimators. In our simulation, ARM also performs better than BMA techniques based on BIC approximation.
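To make the idea of mixing concrete, here is a minimal Python sketch of a split-sample weighting scheme in the spirit of ARM for linear regression candidates. It is an illustration under stated assumptions, not the paper's exact procedure: Gaussian errors, ordinary least squares for every candidate, a half/half data split, and averaging of weights over random splits are all assumed here, and the names `fit_ols`, `arm_weights`, `arm_predict`, and `candidates` are hypothetical. Each candidate is fitted on one half of the data, scored by its Gaussian predictive likelihood on the other half, and the normalized scores become mixing weights.

```python
import numpy as np

def fit_ols(X, y):
    """Least-squares coefficients for design matrix X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta

def arm_weights(X, y, candidates, n_splits=50, seed=0):
    """Mixing weights for candidate linear models (assumed ARM-style scheme).

    candidates: list of column-index lists, one per candidate model.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    half = n // 2
    weights = np.zeros(len(candidates))
    for _ in range(n_splits):
        perm = rng.permutation(n)
        train, test = perm[:half], perm[half:]
        log_lik = np.empty(len(candidates))
        for j, cols in enumerate(candidates):
            X_tr, X_te = X[np.ix_(train, cols)], X[np.ix_(test, cols)]
            beta = fit_ols(X_tr, y[train])
            # Error variance estimated from the fitting half only.
            resid_tr = y[train] - X_tr @ beta
            dof = max(len(train) - len(cols), 1)
            sigma2 = max(resid_tr @ resid_tr / dof, 1e-12)
            # Gaussian predictive log-likelihood on the held-out half.
            resid_te = y[test] - X_te @ beta
            log_lik[j] = (-0.5 * len(test) * np.log(2 * np.pi * sigma2)
                          - 0.5 * (resid_te @ resid_te) / sigma2)
        # Subtract the maximum before exponentiating for numerical stability.
        w = np.exp(log_lik - log_lik.max())
        weights += w / w.sum()
    return weights / n_splits

def arm_predict(X, y, candidates, weights, X_new):
    """Weighted combination of candidates refitted on the full data."""
    preds = np.zeros(X_new.shape[0])
    for w, cols in zip(weights, candidates):
        beta = fit_ols(X[:, cols], y)
        preds += w * (X_new[:, cols] @ beta)
    return preds

# Synthetic usage example: the true model uses the intercept and one predictor.
rng = np.random.default_rng(1)
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=(n, 3))])
y = X[:, 0] + 2.0 * X[:, 1] + rng.normal(size=n)
candidates = [[0], [0, 1], [0, 1, 2], [0, 1, 2, 3]]  # nested candidate models
w = arm_weights(X, y, candidates)
y_hat = arm_predict(X, y, candidates, w, X[:5])
```

In this sketch, a candidate that predicts the held-out half poorly receives a weight near zero, so with a strong signal the mixture behaves much like selection, while with a noisy signal the weight is spread over several candidates. That matches the qualitative picture in the abstract, though the simulation settings above are made up for illustration.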

Citations

Journal Article•10.2307/2982683

Subset Selection in Regression

Anthony B. Atkinson

- 01 Jan 1992

- Journal of The Royal Statistical Society...

TL;DR: Chapman and Miller as mentioned in this paper, Subset Selection in Regression (Monographs on Statistics and Applied Probability, no. 40, 1990) and Section 5.8.

1.5K

•Book

Model selection

H Linhart, +1 more

- 01 Jan 1986

TL;DR: Model selection is the task of choosing a model with the correct inductive bias, which in practice means selecting parameters in an attempt to create a model of optimal complexity for the given (finite) data.

1.2K

Journal Article•10.1093/BIOMET/92.4.937

Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation

Yuhong Yang

- 01 Dec 2005

- Biometrika

TL;DR: In this paper, the author shows that for any model selection criterion to be consistent, it must behave suboptimally for estimating the regression function in terms of minimax rate of convergence; and Bayesian model averaging cannot be minimax-rate optimal for regression estimation.

419

Journal Article•10.2514/1.J052375

Metamodeling in Multidisciplinary Design Optimization: How Far Have We Really Come?

Felipe A. C. Viana, +3 more

- 17 Mar 2014

- AIAA Journal

TL;DR: The extent to which the use of metamodeling techniques in multidisciplinary design optimization has evolved in the 25 years since the seminal paper on design and analysis of computer experiments is addressed.

389

•Journal Article•10.1198/016214501753168262

Adaptive Regression by Mixing

Yuhong Yang

- 01 Jun 2001

- Journal of the American Statistical Asso...

TL;DR: Under mild conditions, it is shown that the squared L2 risk of the estimator based on ARM is basically bounded above by the risk of each candidate procedure plus a small penalty term of order 1/n, giving the automatically optimal rate of convergence for ARM.

375

References

•Journal Article•10.1214/AOS/1176344136

Estimating the Dimension of a Model

Gideon Schwarz

- 01 Mar 1978

- Annals of Statistics

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

45K

Estimating the dimension of a model

Gideon Schwarz

- 01 Jan 2005

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

40.6K

•Proceedings Article

Information Theory and an Extension of the Maximum Likelihood Principle

H. Akaike

- 01 Jan 1973

TL;DR: The classical maximum likelihood principle can be considered to be a method of asymptotic realization of an optimum estimate with respect to a very general information theoretic criterion to provide answers to many practical problems of statistical model fitting.

20.2K

•Journal Article•10.1023/A:1018054314350

Bagging predictors

Leo Breiman

- 01 Aug 1996

TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

16.6K

Book Chapter•10.1007/978-1-4612-1694-0_15

Information Theory and an Extension of the Maximum Likelihood Principle

Hirotugu Akaike

- 01 Jan 1973

TL;DR: In this paper, it is shown that the classical maximum likelihood principle can be considered to be a method of asymptotic realization of an optimum estimate with respect to a very general information theoretic criterion.

16.3K

Related Papers (5)

Model selection: An integral part of inference

Estimating the Dimension of a Model

Bayesian Model Averaging: A Tutorial

Information Theory and an Extension of the Maximum Likelihood Principle

Frequentist Model Average Estimators