Open AccessPosted Content
High-Dimensional Semiparametric Selection Models: Estimation Theory with an Application to the Retail Gasoline Market
TL;DR: In this paper, a multi-stage projection-based Lasso procedure for the semiparametric sample selection model in high-dimensional settings under a weak nonparametric restriction on the selection correction is proposed.
read more
Abstract: This paper proposes a multi-stage projection-based Lasso procedure for the semiparametric sample selection model in high-dimensional settings under a weak nonparametric restriction on the selection correction. In particular, the number of regressors in the main equation, p, and the number of regressors in the selection equation, d, can grow with and exceed the sample size n. The analysis considers the exact sparsity case and the approximate sparsity case. The main theoretical results are finite-sample bounds from which sufficient scaling conditions on the sample size for estimation consistency and variable-selection consistency are established. Statistical efficiency of the proposed estimators is studied via lower bounds on minimax risks and the result shows that, for a family of models with exactly sparse structure on the coefficient vector in the main equation, one of the proposed estimators attains the smallest estimation error up to the (n,d,p)-scaling among a class of procedures in worst-case scenarios. Inference procedures for the coefficients of the main equation, one based on a pivotal Dantzig selector to construct non-asymptotic confidence sets and one based on a post-selection strategy, are discussed. Other theoretical contributions include establishing the non-asymptotic counterpart of the familiar asymptotic oracle results from previous literature: the estimator of the coefficients in the main equation behaves as if the unknown nonparametric component were known, provided the nonparametric component is sufficiently smooth. Small-sample performance of the high-dimensional multi-stage estimation procedure is evaluated by Monte-Carlo simulations and illustrated with an empirical application to the retail gasoline market in the Greater Saint Louis area.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Some deviation inequalities
TL;DR: In this article, a concentration property called Property~($\scriptstyle{R^n}$ was introduced for probability measures, which has an interesting stability under products and contractions.
119
Sparse Linear Models and l1−Regularized 2SLS with High-Dimensional Endogenous Regressors and Instruments
TL;DR: In this paper, the authors explore the validity of the 2-stage least squares estimator with l 1 -regularization in both stages, for linear regression models where the number of endogenous regressors in the main equation and instruments in the first-stage equations can exceed the sample size, and the regression coefficients are sufficiently sparse.
14
•Journal Article
Reconstruction from anisotropic random measurements
Mark Rudelson,Shuheng Zhou +1 more
TL;DR: In this paper, the restricted eigenvalue (RE) condition was shown to guarantee the recovery of sparse matrices with dependent entries by checking the restricted isometry on a certain family of low-dimensional subspaces.
11
Sparse Linear Models and L1-Regularized 2SLS with High-Dimensional Endogenous Regressors and Instruments
TL;DR: In this article, the validity of the 2-stage least squares estimator with L1-regularization in both stages was explored for linear models where the number of endogenous regressors in the main equation and instruments in the first-stage equations can exceed the sample size, and the regression coefficients are exactly or approximately sparse.
References
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Regularization Paths for Generalized Linear Models via Coordinate Descent
TL;DR: In comparative timings, the new algorithms are considerably faster than competing methods and can handle large problems and can also deal efficiently with sparse features.
Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Jianqing Fan,Runze Li +1 more
TL;DR: In this article, penalized likelihood approaches are proposed to handle variable selection problems, and it is shown that the newly proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well if the correct submodel were known.
•Book
Spline models for observational data
Grace Wahba
- 01 Mar 1990
TL;DR: In this paper, a theory and practice for the estimation of functions from noisy data on functionals is developed, where convergence properties, data based smoothing parameter selection, confidence intervals, and numerical methods are established which are appropriate to a number of problems within this framework.
6.9K
Weak Convergence and Empirical Processes
TL;DR: This chapter discusses Convergence: Weak, Almost Uniform, and in Probability, which focuses on the part of Convergence of the Donsker Property which is concerned with Uniformity and Metrization.