TL;DR: The Elements of Statistical Learning: Data Mining, Inference, and Prediction as discussed by the authors is a popular book for data mining and machine learning, focusing on data mining, inference, and prediction.
Abstract: (2004). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Journal of the American Statistical Association: Vol. 99, No. 466, pp. 567-567.
TL;DR: Algebra of Vectors and Matrices, Probability Theory, Tools and Techniques, and Continuous Probability Models.
Abstract: Algebra of Vectors and Matrices. Probability Theory, Tools and Techniques. Continuous Probability Models. The Theory of Least Squares and Analysis of Variance. Criteria and Methods of Estimation. Large Sample Theory and Methods. Theory of Statistical Inference. Multivariate Analysis. Publications of the Author. Author Index. Subject Index.
TL;DR: In this paper, the authors present a formal justification for the use of the Bootstrap in statistical inference. But they do not discuss future limitations of the bootstrap and their application in the statistical verification of confidence intervals.
Abstract: PART ONE: INTRODUCTION Traditional Parametric Statistical Inference Bootstrap Statistical Inference Bootstrapping a Regression Model Theoretical Justification The Jackknife Monte Carlo Evaluation of the Bootstrap PART TWO: STATISTICAL INFERENCE USING THE BOOTSTRAP Bias Estimation Bootstrap Confidence Intervals PART THREE: APPLICATIONS OF BOOTSTRAP CONFIDENCE INTERVALS Confidence Intervals for Statistics With Unknown Sampling Distributions Inference When Traditional Distributional Assumptions Are Violated PART FOUR: CONCLUSION Future Work Limitations of the Bootstrap Concluding Remarks
TL;DR: In this paper, the problem of post-selection inference is reduced to one of simultaneous inference, and the authors propose to use simultaneous inference for all linear functions that arise as coefficient estimates in all submodels.
Abstract: It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid ``post-selection inference'' by reducing the problem to one of simultaneous inference and hence suitably widening conventional confidence and retention intervals. Simultaneity is required for all linear functions that arise as coefficient estimates in all submodels. By purchasing ``simultaneity insurance'' for all possible submodels, the resulting post-selection inference is rendered universally valid under all possible model selection procedures. This inference is therefore generally conservative for particular selection procedures, but it is always less conservative than full Scheffe protection. Importantly it does not depend on the truth of the selected submodel, and hence it produces valid inference even in wrong models. We describe the structure of the simultaneous inference problem and give some asymptotic results.