Missing Data Imputation – A Survey

doi:10.4018/ijdsst.292446

Journal Article10.4018/ijdsst.292446

Missing Data Imputation – A Survey

01 Jan 2022

- International Journal of Decision Suppor...

- Vol. 14, Iss: 1, pp 1-20

6

TL;DR: In this article , a comprehensive review of the approaches to tackle the missing data problem is discussed with a comprehensive discussion on the effectiveness of three imputation methods namely, imputation based on Multiple Linear Regression (MLR), Predictive Mean Matching (PMM), and Classification And Regression Tree (CART) in the context of subspace clustering.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.3390/math11010073

A Noise-Aware Multiple Imputation Algorithm for Missing Data

Fangfan Li, +3 more

- 25 Dec 2022

- Mathematics

TL;DR: Wang et al. as discussed by the authors proposed a noise-aware missing data multiple imputation algorithm NPMI in static data, and the method to determine the imputation order of multivariables missing is given.

...read moreread less

3

Journal Article•10.1109/access.2024.3357533

A Comprehensive Bibliometric Analysis of Missing Value imputation

Heru Nugroho, +1 more

- IEEE Access

TL;DR: To systematically explore various aspects of missing data imputation, a conceptual framework was used to uncover potential research directions and underlying themes and a thematic map serves as a valuable tool for providing a comprehensive understanding.

...read moreread less

2

•Posted Content•10.21203/rs.3.rs-1729251/v1

A Novel Algorithm for Imputing the Missing Values in Incomplete Datasets

16 Jun 2022

TL;DR: In this article , a splitting-based IMV-RE algorithm is proposed to estimate missing values within a dataset, where an upper limit is set for every class containing missing values that assist the algorithm to predict the missing values more accurately.

...read moreread less

Journal Article•10.1007/s42044-023-00154-9

A novel algorithm for imputing the missing values in incomplete datasets

Hutashan Vishal Bhagat, +1 more

- 07 Aug 2023

- Iran Journal of Computer Science

TL;DR: A new algorithm, known as the IMV-RE (imputing the missing values in real-time environment) algorithm, which is based on a novel approach and outperforms existing techniques in terms of sensitivity to accuracy, root mean square error (RMSE), and coefficient of determination ( R ^2).

...read moreread less

Journal Article•10.62762/tis.2024.751418

Improving Effort Estimation Accuracy in Software Development Projects Using Multiple Imputation Techniques for Missing Data Handling

S. Hayat, +7 more

- 12 Nov 2024

TL;DR: This study improves effort estimation accuracy in software development projects by employing Multiple Imputation (MI) to handle missing data, enhancing the Analogy-Based Effort Estimation (ABEE) model's performance and providing more accurate and efficient outcomes.

...read moreread less

References

•Book

Classification and regression trees

Leo Breiman

- 01 Jan 1983

TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

...read moreread less

22.7K

•Book

Multiple imputation for nonresponse in surveys

Donald B. Rubin

- 01 Jan 1987

TL;DR: In this article, a survey of drinking behavior among men of retirement age was conducted and the results showed that the majority of the participants reported that they did not receive any benefits from the Social Security Administration.

...read moreread less

18.8K

Journal Article•10.1093/BIOMET/63.3.581

Inference and missing data

Donald B. Rubin

- 01 Dec 1976

- Biometrika

TL;DR: In this article, it was shown that ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are missing at random and the observed data are observed at random, and then such inferences are generally conditional on the observed pattern of missing data.

...read moreread less

10K

•Journal Article•10.1002/MPR.329

Multiple Imputation by Chained Equations: What is it and how does it work?

Melissa Azur, +3 more

- 01 Mar 2011

- International Journal of Methods in Psyc...

TL;DR: This paper provides an introduction to the MICE method with a focus on practical aspects and challenges in using this method.

...read moreread less

3K

•Journal Article•10.1007/S11121-007-0070-9

How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory

John W. Graham, +2 more

- 05 Jun 2007

- Prevention Science

TL;DR: It is recommended that researchers using MI should perform many more imputations than previously considered sufficient, based on γ, and take into consideration one’s tolerance for a preventable power falloff due to using too few imputations.

...read moreread less

2.7K