Proceedings Article | DOI: 10.1109/CIBCB.2006.331013
Integrating Affymetrix microarray data sets using probe-level test statistic for predicting prostate cancer
Pingzhao Hu, Celia M. T. Greenwood, Joseph Beyene +2 more
- 01 Sep 2006
- pp 1-8
6
TL;DR: These analyses show that the prognostic gene expression signatures identified through the probe-level test statistics are more strongly differentially expressed and have better prediction accuracy than signatures derived from a probeset-level model.
Abstract: Microarray technology has previously been used to identify differentially expressed genes between tumor and normal prostate samples, both in single studies and in syntheses of multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies have used probeset-level data, which may lose information contained at the probe level. Here, we propose a new approach for combining results across studies, based on a probe-level test statistic. Each probe-level test statistic is transformed into an effect size measure for each probeset, and a random-effects model (REM) is used to integrate effect sizes across studies. We compared the statistical and biological significance of the prognostic gene expression signatures identified by the probe-level model (PLM) with those identified by the probeset-level model (PSLM). Support vector machine (SVM)-based predictive models were built using these two sets of signatures, and their performance was evaluated on independent test datasets. Our analyses show that the prognostic gene expression signatures identified through the probe-level test statistics are more strongly differentially expressed and have better prediction accuracy than signatures derived from a probeset-level model.
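The abstract does not give formulas for the effect size transformation or the random-effects model, so the following is only a minimal sketch of how such an integration step is commonly done, not the authors' implementation. It assumes a generic two-sample t statistic per probeset, Hedges' bias-corrected standardized mean difference with a common large-sample variance approximation, and the DerSimonian-Laird estimator of between-study variance. The function names `hedges_g_from_t` and `random_effects_pool` are hypothetical.

```python
import math


def hedges_g_from_t(t, n1, n2):
    """Convert a two-sample t statistic into a bias-corrected effect size.

    Assumption: t comes from a pooled-variance two-sample test with group
    sizes n1 (e.g. tumor) and n2 (e.g. normal). Returns (g, variance of g).
    """
    d = t * math.sqrt(1.0 / n1 + 1.0 / n2)        # standardized mean difference
    j = 1.0 - 3.0 / (4.0 * (n1 + n2) - 9.0)       # small-sample bias correction
    g = j * d
    # Large-sample approximation to the sampling variance of g.
    var = (n1 + n2) / (n1 * n2) + g * g / (2.0 * (n1 + n2))
    return g, var


def random_effects_pool(effects, variances):
    """Pool per-study effect sizes with a DerSimonian-Laird random-effects model.

    Returns (pooled effect, standard error, z score, tau^2), where tau^2 is
    the estimated between-study variance.
    """
    k = len(effects)
    w = [1.0 / v for v in variances]              # fixed-effect weights
    sum_w = sum(w)
    mu_fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sum_w
    # Cochran's Q measures heterogeneity around the fixed-effect mean.
    q = sum(wi * (yi - mu_fixed) ** 2 for wi, yi in zip(w, effects))
    c = sum_w - sum(wi * wi for wi in w) / sum_w
    tau2 = max(0.0, (q - (k - 1)) / c) if k > 1 and c > 0 else 0.0
    # Random-effects weights add tau^2 to each within-study variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    sum_w_star = sum(w_star)
    mu = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum_w_star
    se = math.sqrt(1.0 / sum_w_star)
    return mu, se, mu / se, tau2


if __name__ == "__main__":
    # Made-up t statistics for one probeset in three hypothetical studies.
    studies = [(3.1, 20, 18), (2.4, 35, 30), (4.0, 12, 15)]
    pairs = [hedges_g_from_t(t, n1, n2) for t, n1, n2 in studies]
    mu, se, z, tau2 = random_effects_pool([g for g, _ in pairs],
                                          [v for _, v in pairs])
    print(f"pooled effect={mu:.3f} se={se:.3f} z={z:.2f} tau2={tau2:.3f}")
```

In a workflow like the one described, the pooled z score for each probeset would then be used to rank genes, and the top-ranked genes would form the signature passed to a classifier. The numbers in the usage section are invented for illustration only.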
Citations
Introducing the “Bioinformatics” Special Issue
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
4.8K
Semi-Supervised Learning with Ensemble Self-Training for Cancer Classification
Qingyong Wang, Liang-Yong Xia, Hua Chai, Yun Zhou +3 more
- 01 Oct 2018
TL;DR: An ensemble self-training learning (ESTL) method is proposed that selects high-quality unlabeled samples more effectively and improves the robustness of the model.
10
Integrative analysis of gene expression data including an assessment of pathway enrichment for predicting prostate cancer.
TL;DR: By integrating information from the insulin signalling pathway into the prediction model, the authors achieved better prediction of prostate cancer and identified significant gene expression phenotypes that have the potential to characterize complex genetic alterations in prostate cancer.
6
A Bayesian Network Model for the Parkinson’s Disease: A Study of Gene Expression Levels
Sonia Lilia Mestizo-Gutiérrez, Joan Arturo Jácome-Delgado, Viviana Yarel Rosales-Morales, Nicandro Cruz-Ramírez, Gonzalo E. Aranda-Abreu +4 more
- 01 Jan 2019
TL;DR: Gene expression profiles of peripheral blood samples from 105 individuals are modeled using Bayesian networks with different dimensionality reduction techniques to create several sets of genes that could be considered PD candidates; some genes previously reported in connection with this disease were corroborated.
3