Journal Article10.1111/J.0006-341X.2005.031010.X
Multiple imputation for model checking: completed-data plots with missing and latent data.
TL;DR: The methods of missing‐data model checking can be interpreted as “predictive inference” in a non‐Bayesian context and the graphical diagnostics within this framework are considered.
read more
Abstract: In problems with missing or latent data, a standard approach is to first impute the unobserved data, then perform all statistical analyses on the completed dataset--corresponding to the observed data and imputed unobserved data--using standard procedures for complete-data inference. Here, we extend this approach to model checking by demonstrating the advantages of the use of completed-data model diagnostics on imputed completed datasets. The approach is set in the theoretical framework of Bayesian posterior predictive checks (but, as with missing-data imputation, our methods of missing-data model checking can also be interpreted as "predictive inference" in a non-Bayesian context). We consider the graphical diagnostics within this framework. Advantages of the completed-data approach include: (1) One can often check model fit in terms of quantities that are of key substantive interest in a natural way, which is not always possible using observed data alone. (2) In problems with missing data, checks may be devised that do not require to model the missingness or inclusion mechanism; the latter is useful for the analysis of ignorable but unknown data collection mechanisms, such as are often assumed in the analysis of sample surveys and observational studies. (3) In many problems with latent data, it is possible to check qualitative features of the model (for example, independence of two variables) that can be naturally formalized with the help of the latent data. We illustrate with several applied examples.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Statistical Analysis with Missing Data
TL;DR: Theory of Inference Based on the Likelihood Function.PART I: OVERVIEW AND BASIC APPROACHES.Preface.Subject Index.Author Index.
7K
Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box
TL;DR: The mi package in R has features that allow the user to get inside the imputation process and evaluate the reasonableness of the resulting models and imputations, and uses Bayesian models and weakly informative prior distributions to construct more stable estimates of imputation models.
How should variable selection be performed with multiply imputed data
TL;DR: Most methods improve on the naïve complete‐case analysis for variable selection, but importantly the type 1 error is only preserved if selection is based on RR, which is the recommended approach.
468
The rise of multiple imputation: a review of the reporting and implementation of the method in medical research
TL;DR: This review outlined deficiencies in the documenting of missing data and the details provided about imputation in medical research articles, and only a few articles performed sensitivity analyses following MI even though this is strongly recommended in guidelines.
Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models
TL;DR: This work describes probabilistic graphical models, a language for formulating latent variable models, and describes mean field variational inference, a generic algorithm for approximating conditional distributions.
307
References
•Book
Multiple imputation for nonresponse in surveys
Donald B. Rubin
- 01 Jan 1987
TL;DR: In this article, a survey of drinking behavior among men of retirement age was conducted and the results showed that the majority of the participants reported that they did not receive any benefits from the Social Security Administration.
18.8K
•Book
Statistical Analysis with Missing Data
Roderick J. A. Little,Donald B. Rubin +1 more
- 01 Jan 1987
TL;DR: This work states that maximum Likelihood for General Patterns of Missing Data: Introduction and Theory with Ignorable Nonresponse and large-Sample Inference Based on Maximum Likelihood Estimates is likely to be high.
18.3K
Statistical Analysis With Missing Data
TL;DR: Generalized Estimating Equations is a good introductory book for analyzing continuous and discrete correlated data using GEE methods and provides good guidance for analyzing correlated data in biomedical studies and survey studies.
10.6K
Related Papers (5)
Roderick J. A. Little,Donald B. Rubin +1 more
- 01 Jan 1987
Donald B. Rubin
- 01 Jan 1987
J.L. Schafer
- 01 Aug 1997