Comparison of Affymetrix GeneChip expression measures
TL;DR: Background correction, one of the main preprocessing steps, has the largest effect on performance: it appears to improve accuracy but, in general, to worsen precision.
Abstract: Motivation: In the Affymetrix GeneChip system, preprocessing occurs before one obtains expression level measurements. Because the number of competing preprocessing methods was large and growing, we developed a benchmark to help users identify the best method for their application. A webtool was made available for developers to benchmark their procedures. At the time of writing, over 50 methods had been submitted.
Results: We benchmarked 31 probe set algorithms using a U95A dataset of spike in controls. Using this dataset, we found that background correction, one of the main steps in preprocessing, has the largest effect on performance. In particular, background correction appears to improve accuracy but, in general, worsen precision. The benchmark results put this balance in perspective. Furthermore, we have improved some of the original benchmark metrics to provide more detailed information regarding precision and accuracy. A handful of methods stand out as providing the best balance using spike-in data with the older U95A array, although different experiments on more current arrays may benchmark differently.
Availability: The affycomp package, now version 1.5.2, continues to be available as part of the Bioconductor project (http://www.bioconductor.org). The webtool continues to be available at http://affycomp.biostat.jhsph.edu
Contact: [email protected]
Supplementary information: Supplementary data are available at Bioinformatics online.
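The accuracy and precision trade-off described in the Results can be illustrated with a small simulation. The sketch below is not the affycomp implementation and does not use its metrics verbatim: the function names, the additive background model, the noise level, and the background estimate of 90 are all assumptions chosen for illustration. Accuracy is approximated by the slope of observed against nominal log2 concentration for spiked-in genes (ideal value 1), and precision by the standard deviation of log ratios for genes whose true expression does not change (ideal value 0).

```python
import numpy as np


def signal_slope(nominal_log2, observed_log2):
    """Accuracy proxy: least-squares slope of observed on nominal log2 values (ideal is 1)."""
    slope, _intercept = np.polyfit(nominal_log2, observed_log2, deg=1)
    return float(slope)


def null_log_ratio_sd(log2_a, log2_b):
    """Precision proxy: SD of log ratios for genes that should not change (ideal is 0)."""
    return float(np.std(np.asarray(log2_a) - np.asarray(log2_b), ddof=1))


def log2_expression(intensity, background_estimate=0.0):
    """Subtract a background estimate, clip at 1 so the log is defined, then take log2."""
    adjusted = np.asarray(intensity, dtype=float) - background_estimate
    return np.log2(np.maximum(adjusted, 1.0))


def simulate(seed=0, background=100.0, noise_sd=3.0):
    """Compare both proxies with and without a (hypothetical) background subtraction."""
    rng = np.random.default_rng(seed)
    # Spiked-in genes: 20 replicates at each nominal concentration 2^0 .. 2^10.
    nominal_log2 = np.repeat(np.arange(11.0), 20)
    spike = 2.0 ** nominal_log2 + background + rng.normal(0.0, noise_sd, nominal_log2.size)
    # Null genes: the same true signal (20) on two arrays, so the true log ratio is 0.
    null_a = 20.0 + background + rng.normal(0.0, noise_sd, 1000)
    null_b = 20.0 + background + rng.normal(0.0, noise_sd, 1000)
    results = {}
    # Assumed background estimates: 0 (no correction) and 90 (slightly below the true 100).
    for label, bg in (("uncorrected", 0.0), ("corrected", 90.0)):
        results[label] = {
            "slope": signal_slope(nominal_log2, log2_expression(spike, bg)),
            "null_sd": null_log_ratio_sd(
                log2_expression(null_a, bg), log2_expression(null_b, bg)
            ),
        }
    return results


if __name__ == "__main__":
    for label, metrics in simulate().items():
        print(f"{label}: slope={metrics['slope']:.2f}, null SD={metrics['null_sd']:.3f}")
```

Under these assumptions, subtracting the background moves the slope closer to 1 (better accuracy) while inflating the spread of the null log ratios (worse precision), which is the balance the benchmark is designed to expose.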
Citations
•Dissertation
Microarray Data Analysis using Probabilistic Methods
Xuejun Liu
- 10 Nov 2006
TL;DR: This thesis extends a previously developed probabilistic model, mgMOS, to obtain a model that is more accurate and more computationally efficient than the alternatives and that provides a level of uncertainty associated with the measured gene expression level.
Tissue-specific RMA models to incrementally normalize Affymetrix GeneChip data
Steven A. Eschrich,Andrew M. Hoerter,Gregory C. Bloom,David Fenstermacher +3 more
- 14 Oct 2008
TL;DR: Through several large datasets of patient samples, this work provides evidence that RMA models of normalization converge to a common model in homogeneous samples, and offers the promise of maintaining large data warehouses of patient microarray samples without the requirement of constant renormalization.
5
Challenges in DNA microarray studies from the regulatory perspective.
TL;DR: This article discusses some of the major challenges, and the associated difficulties, in developing and validating genomic classifiers.
5
Assessing Numerical Dependence in Gene Expression Summaries with the Jackknife Expression Difference
John R. Stevens,Gabriel Nicholas +1 more
TL;DR: A diagnostic measure of numerical dependence for gene expression summaries from any preprocessing method is introduced and the relative performance of several common preprocessing methods with respect to this measure is discussed.
References
Exploration, normalization, and summaries of high density oligonucleotide array probe level data
Rafael A. Irizarry,Bridget G. Hobbs,Francois Collin,Yasmin Beazer-Barclay,Kristen J. Antonellis,Uwe Scherf,Terence P. Speed +6 more
TL;DR: Exploratory analyses of the probe-level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. There is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model that removes probe-specific affinities.
A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
TL;DR: Three complete data methods of performing normalization at the probe intensity level are presented and compared, in terms of the variability and bias of an expression measure, with two baseline-array methods: a one-number scaling algorithm and a method that uses a non-linear normalizing relation. The simplest and quickest complete data method is found to perform favorably.
9K
Variance stabilization applied to microarray data calibration and to the quantification of differential expression.
Wolfgang Huber,Anja von Heydebreck,Holger Sültmann,Annemarie Poustka,Martin Vingron +4 more
- 01 Jul 2002
TL;DR: A statistical model for microarray gene expression data that comprises data calibration, the quantification of differential expression, and the quantification of measurement error is introduced, and a difference statistic Δh whose variance is approximately constant along the whole intensity range is derived.
2.7K
A variance-stabilizing transformation for gene-expression microarray data.
Blythe Durbin,Johanna Hardin,Douglas M. Hawkins,David M. Rocke +3 more
- 01 Jul 2002
TL;DR: A transformation is introduced that stabilizes the variance of microarray data across the full range of expression, and simulation studies suggest that this transformation approximately symmetrizes microarray data.
Robust singular value decomposition analysis of microarray data
TL;DR: A robust analysis method is developed for the understanding of large-scale shifts in gene effects and the isolation of particular sample-by-gene effects that might be either unusual interactions or the result of experimental flaws.
193
Related Papers (5)
Robert Gentleman,Vincent J. Carey,Douglas M. Bates,Benjamin M. Bolstad,Marcel Dettling,Sandrine Dudoit,Byron Ellis,Laurent Gautier,Yongchao Ge,Jeff Gentry,Kurt Hornik,Torsten Hothorn,Wolfgang Huber,Stefano Maria Iacus,Rafael A. Irizarry,Friedrich Leisch,Cheng Li,Martin Maechler,A. J. Rossini,Günther Sawitzki,Colin A. Smith,Gordon K. Smyth,Luke Tierney,Jean Yang,Jianhua Zhang +24 more