Missing value estimation for DNA microarray gene expression data: local least squares imputation

doi:10.1093/BIOINFORMATICS/BTH499

Open Access•Journal Article•10.1093/BIOINFORMATICS/BTH499

Hyunsoo Kim, +2 more

- 15 Jan 2005

- Bioinformatics

- Vol. 21, Iss: 2, pp 187-198

548

TL;DR: Imputation methods based on a least squares formulation are proposed to estimate missing values in gene expression data; they exploit local similarity structures in the data as well as a least squares optimization process.
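
The idea summarized above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that neighbours are chosen by Euclidean distance over the target gene's observed columns, that only genes with no missing values are candidates, and that the k neighbours are linearly independent so the normal equations can be solved directly (a general least squares or pseudoinverse solver would be needed otherwise). The function names `lls_impute` and `solve_linear` are hypothetical.

```python
import math


def solve_linear(M, y):
    """Solve the square system M x = y by Gaussian elimination with partial pivoting."""
    n = len(M)
    aug = [row[:] + [y[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular system: neighbours are linearly dependent")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def lls_impute(target, complete_genes, k):
    """Fill the None entries of `target` using its k nearest complete genes."""
    obs = [j for j, v in enumerate(target) if v is not None]
    mis = [j for j, v in enumerate(target) if v is None]
    w = [target[j] for j in obs]

    # Local similarity step (assumption: Euclidean distance on observed columns).
    def dist(g):
        return math.sqrt(sum((g[j] - target[j]) ** 2 for j in obs))

    neighbours = sorted(complete_genes, key=dist)[:k]

    # A holds the neighbours restricted to the observed columns (k rows).
    A = [[g[j] for j in obs] for g in neighbours]

    # Least squares step: minimise ||A^T x - w||_2 via the normal equations
    # (A A^T) x = A w, which is a k-by-k system.
    G = [[sum(a * b for a, b in zip(ri, rj)) for rj in A] for ri in A]
    rhs = [sum(a * wi for a, wi in zip(ri, w)) for ri in A]
    x = solve_linear(G, rhs)

    # Each missing entry is the same linear combination of the neighbours' values.
    filled = list(target)
    for j in mis:
        filled[j] = sum(xi * g[j] for xi, g in zip(x, neighbours))
    return filled


if __name__ == "__main__":
    genes = [
        [1.0, 0.0, 2.0, 3.0],
        [0.0, 1.0, 1.0, 5.0],
        [10.0, 10.0, 10.0, 10.0],
    ]
    # The target equals 2 * genes[0] + genes[1] on its observed columns.
    print(lls_impute([2.0, 1.0, 5.0, None], genes, k=2))
```

In this toy example the target is an exact linear combination of its two nearest genes, so the estimate for the missing entry should come out close to 11. On real data the fit is only approximate, and the choice of k matters.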

Citations

Singular Value Decomposition for Genome-Wide Expression Data Processing and Modeling

Orly Alter, +2 more

- 01 Mar 2001

TL;DR: Using singular value decomposition in transforming genome-wide expression data from genes × arrays space to reduced diagonalized "eigengenes" × "eigenarrays" space gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype.

1.9K

•Journal Article•10.1093/BIOINFORMATICS/BTM069

pcaMethods—a bioconductor package providing PCA methods for incomplete data

Wolfram Stacklies, +4 more

- 06 Mar 2007

- Bioinformatics

TL;DR: pcaMethods is a Bioconductor-compliant library for computing principal component analysis (PCA) on incomplete data sets; the results can be analyzed directly or used to estimate missing values, enabling the use of statistical methods that are sensitive to missing values.

1.1K

•Book

Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability)

Guojun Gan, +2 more

- 01 May 2007

Abstract: Preface Part I. Clustering, Data and Similarity Measures: 1. Data clustering 2. Data types 3. Scale conversion 4. Data standardization and transformation 5. Data visualization 6. Similarity and dissimilarity measures Part II. Clustering Algorithms: 7. Hierarchical clustering techniques 8. Fuzzy clustering algorithms 9. Center based clustering algorithms 10. Search based clustering algorithms 11. Graph based clustering algorithms 12. Grid based clustering algorithms 13. Density based clustering algorithms 14. Model based clustering algorithms 15. Subspace clustering 16. Miscellaneous algorithms 17. Evaluation of clustering algorithms Part III. Applications of Clustering: 18. Clustering gene expression data Part IV. Matlab and C++ for Clustering: 19. Data clustering in Matlab 20. Clustering in C/C++ A. Some clustering algorithms B. The kd-tree data structure C. Matlab Codes D. C++ Codes Subject index Author index.

908

Journal Article•10.1007/S00521-009-0295-6

Pattern classification with missing data: a review

Pedro J. García-Laencina, +2 more

- 01 Mar 2010

- Neural Computing and Applications

TL;DR: The aim of this work is to analyze the missing data problem in pattern classification tasks, and to summarize and compare some of the well-known methods used for handling missing values.

804

•Journal Article•10.1109/TSMCC.2008.2007252

A Survey of Evolutionary Algorithms for Clustering

Eduardo R. Hruschka, +3 more

- 01 Mar 2009

TL;DR: An up-to-date overview that is fully devoted to evolutionary algorithms for clustering, is not limited to any particular kind of evolutionary approach, and comprises advanced topics like multiobjective and ensemble-based evolutionary clustering.

787

References

•Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

- 01 Jan 1995

TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

46K

Journal Article•10.1103/PHYSREV.106.620

Information Theory and Statistical Mechanics. II

E. T. Jaynes

- 15 Oct 1957

- Physical Review

TL;DR: In this article, the authors consider statistical mechanics as a form of statistical inference rather than as a physical theory, and show that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle.

14K

Journal Article•10.1126/SCIENCE.286.5439.531

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Todd R. Golub, +12 more

- 15 Oct 1999

- Science

TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.

13.3K

•Journal Article•10.1038/415530A

Gene expression profiling predicts clinical outcome of breast cancer

Laura J. van't Veer, +15 more

- 31 Jan 2002

- Nature

TL;DR: DNA microarray analysis on primary breast tumours of 117 young patients is used and supervised classification is applied to identify a gene expression signature strongly predictive of a short interval to distant metastases (‘poor prognosis’ signature) in patients without tumour cells in local lymph nodes at diagnosis, providing a strategy to select patients who would benefit from adjuvant therapy.

10.3K