Using Rank-One Biclusters to Classify Microarray Data

Open Access

Using Rank-One Biclusters to Classify Microarray Data

- 01 Jan 2007

17

TL;DR: This paper proposes a novel algorithm for learning a microarray classier by reducing the dimensionality of the data matrix using biclusters, where each bicluster is a subset of genes andA subset of samples whose expression values have similar patterns.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Singular Value Decomposition for Genome-Wide Expression Data Processing and Modeling

Orly Alter, +2 more

- 01 Mar 2001

TL;DR: Using singular value decomposition in transforming genome-wide expression data from genes x arrays space to reduced diagonalized "eigengenes" x "eigenarrays" space gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype.

...read moreread less

1.9K

•Journal Article•10.1137/070709967

On the Complexity of Nonnegative Matrix Factorization

Stephen A. Vavasis

- 01 Aug 2009

- Siam Journal on Optimization

TL;DR: An exact version of nonnegative matrix factorization is defined and it is established that it is equivalent to a problem in polyhedral combinatorics; it is NP-hard; and that a polynomial-time local search heuristic exists.

...read moreread less

743

Journal Article•10.1111/J.1541-0420.2010.01392.X

Biclustering via Sparse Singular Value Decomposition

Mihee Lee, +3 more

- 01 Dec 2010

- Biometrics

TL;DR: Sparse singular value decomposition (SSVD) is proposed as a new exploratory analysis tool for biclustering or identifying interpretable row–column associations within high‐dimensional data matrices.

...read moreread less

322

•Journal Article•10.1214/09-AOAS239

Finding large average submatrices in high dimensional data

Andrey A. Shabalin, +3 more

- 11 May 2009

- arXiv: Genomics

TL;DR: A statistically motivated biclustering procedure that finds large average submatrices within a given real-valued data matrix and is driven by a Bonferroni-based significance score that effectively trades off between submatrix size and average value is proposed.

...read moreread less

219

•Journal Article•10.1214/09-AOAS239

Finding large average submatrices in high dimensional data

Andrey A. Shabalin, +3 more

- 01 Sep 2009

- The Annals of Applied Statistics

TL;DR: In this article, a statistically motivated biclustering procedure (LAS) is proposed to find large average submatrices within a given real-valued data matrix, and the procedure operates in an iterative-residual fashion, and is driven by a Bonferroni-based significance score that effectively trades off between submatrix size and average value.

...read moreread less

154

...

Expand

References

•Book

Data Mining: Practical Machine Learning Tools and Techniques

Ian H. Witten, +2 more

- 25 Oct 1999

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

...read moreread less

25.4K

Journal Article•10.1126/SCIENCE.286.5439.531

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Todd R. Golub, +12 more

- 15 Oct 1999

- Science

TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.

...read moreread less

13.3K

•Journal Article•10.1038/415530A

Gene expression profiling predicts clinical outcome of breast cancer

Laura J. van't Veer, +15 more

- 31 Jan 2002

- Nature

TL;DR: DNA microarray analysis on primary breast tumours of 117 young patients is used and supervised classification is applied to identify a gene expression signature strongly predictive of a short interval to distant metastases (‘poor prognosis’ signature) in patients without tumour cells in local lymph nodes at diagnosis, providing a strategy to select patients who would benefit from adjuvant therapy.

...read moreread less

10.3K

Journal Article•10.1016/S0893-6080(05)80023-1

Original Contribution: Stacked generalization

David H. Wolpert

- 05 Feb 1992

- Neural Networks

TL;DR: The conclusion is that for almost any real-world generalization problem one should use some version of stacked generalization to minimize the generalization error rate.

...read moreread less

7.5K

•Journal Article•10.1073/PNAS.96.12.6745

Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Uri Alon, +7 more

- 08 Jun 1999

- Proceedings of the National Academy of S...

TL;DR: In this paper, a two-way clustering algorithm was applied to both the genes and the tissues, revealing broad coherent patterns that suggest a high degree of organization underlying gene expression in these tissues.

...read moreread less

4.5K