Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data
Jun Li, Robert Tibshirani
TL;DR: A simple, non-parametric method with resampling to account for the different sequencing depths is introduced, and it is found that the method discovers more consistent patterns than competing methods.
Abstract: We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or 'sequencing depths'. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by 'outliers' in the data. We introduce a simple, non-parametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods.
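The abstract does not spell out the resampling scheme or the test statistic, so the following is only a minimal sketch of the general idea under stated assumptions: each sample is thinned to the smallest sequencing depth by binomial down-sampling, a Wilcoxon rank-sum statistic is computed per feature for a two-class outcome, and the statistic is averaged over resamples. The function names (`downsample`, `centered_rank_sum`, `resampled_rank_stats`), the choice of thinning, the number of resamples, and the toy data are all illustrative assumptions, not the authors' implementation.

```python
import random
from statistics import mean


def downsample(counts, keep_prob, rng):
    """Binomial thinning: keep each read independently with probability keep_prob."""
    return [sum(rng.random() < keep_prob for _ in range(c)) for c in counts]


def centered_rank_sum(values, labels, rng):
    """Wilcoxon rank-sum statistic for class 1, centered at its null mean.

    Labels must be 0 or 1. Ties are broken at random via a secondary sort key.
    """
    order = sorted(range(len(values)), key=lambda i: (values[i], rng.random()))
    n, n1 = len(values), sum(labels)
    rank_sum = sum(rank for rank, i in enumerate(order, start=1) if labels[i] == 1)
    return rank_sum - n1 * (n + 1) / 2


def resampled_rank_stats(count_matrix, labels, n_resamples=20, seed=0):
    """Average the centered rank-sum statistic of each feature over resamples.

    count_matrix[g][j] is the read count of feature g in sample j.
    Every sample is thinned to the smallest sequencing depth before ranking.
    """
    rng = random.Random(seed)
    n_samples = len(labels)
    depths = [sum(row[j] for row in count_matrix) for j in range(n_samples)]
    target = min(depths)
    per_feature = [[] for _ in count_matrix]
    for _ in range(n_resamples):
        # Thin sample j (a column) so its expected depth equals the target.
        columns = [
            downsample([row[j] for row in count_matrix], target / depths[j], rng)
            for j in range(n_samples)
        ]
        for g in range(len(count_matrix)):
            values = [columns[j][g] for j in range(n_samples)]
            per_feature[g].append(centered_rank_sum(values, labels, rng))
    return [mean(stats) for stats in per_feature]


# Toy data (hypothetical): class-1 samples are sequenced 10x deeper.
labels = [0, 0, 1, 1]
counts = [
    [100, 100, 1000, 1000],  # same proportion of reads in both classes
    [10, 10, 1000, 1000],    # 10x higher proportion in class 1
    [890, 890, 8000, 8000],  # bulk of the library
]
raw_stat = centered_rank_sum(counts[0], labels, random.Random(0))
stats = resampled_rank_stats(counts, labels)
```

On the raw counts, the first feature reaches the maximum statistic (`raw_stat` is 2.0) purely because of the depth difference. After down-sampling, its averaged statistic moves away from that maximum, while the second feature, whose share of reads really is higher in class 1, stays at 2.0. Because the statistic depends only on ranks, a single extreme count cannot dominate it, which is the sense in which a rank-based approach can be more robust to outliers than a parametric fit.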
Citations
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2
TL;DR: This work presents DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates, which enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.
Integrated Genomic Characterization of Papillary Thyroid Carcinoma
Nishant Agrawal,Rehan Akbani,B. Arman Aksoy,Adrian Ally,Harindra Arachchi,Sylvia L. Asa,J. Todd Auman,Miruna Balasundaram,Saianand Balu,Stephen B. Baylin,Madhusmita Behera,Brady Bernard,Rameen Beroukhim,Justin A. Bishop,Aaron D. Black,Tom Bodenheimer,Lori Boice,Moiz S. Bootwalla,Jay Bowen,Reanne Bowlby,Christopher A. Bristow,Robin Brookens,Denise Brooks,Robert Bryant,Elizabeth Buda,Yaron S.N. Butterfield,Tobias Carling,Rebecca Carlsen,Scott L. Carter,Sally E. Carty,Timothy A. Chan,Amy Chen,Andrew D. Cherniack,Dorothy Cheung,Lynda Chin,Juok Cho,Andy Chu,Eric Chuah,Kristian Cibulskis,Giovanni Ciriello,Amanda Clarke,Gary L. Clayman,Leslie Cope,John A. Copland,Kyle R. Covington,Ludmila Danilova,Tanja Davidsen,John A. Demchok,Daniel DiCara,Noreen Dhalla,Rajiv Dhir,Sheliann S. Dookran,Gideon Dresdner,Jonathan V. Eldridge,Greg Eley,Adel K El-Naggar,Stephanie Eng,James A. Fagin,Timothy R. Fennell,Robert L. Ferris,Sheila Fisher,Scott Frazer,Jessica Frick,Stacey Gabriel,Ian Ganly,Jianjiong Gao,Levi A. Garraway,Julie M. Gastier-Foster,Gad Getz,Nils Gehlenborg,Ronald Ghossein,Richard A. Gibbs,Thomas J. Giordano,Karen Gomez-Hernandez,Jonna Grimsby,Benjamin Gross,Ranabir Guin,Angela Hadjipanayis,Hollie A. Harper,D. Neil Hayes,David I. Heiman,James G. Herman,Katherine A. Hoadley,Matan Hofree,Robert A. Holt,Alan P. Hoyle,Franklin W. Huang,Mei Huang,Carolyn M. Hutter,Trey Ideker,Lisa Iype,Anders Jacobsen,Stuart R. Jefferys,Corbin D. Jones,Steven J.M. Jones,Katayoon Kasaian,Electron Kebebew,Fadlo R. Khuri,Jaegil Kim,Roger Kramer,Richard Kreisberg,Raju Kucherlapati,David J. Kwiatkowski,Marc Ladanyi,Phillip H. Lai,Peter W. Laird,Eric S. Lander,Michael S. Lawrence,Darlene Lee,Eunjung Lee,Semin Lee,William Lee,Kristen M. Leraas,Tara M. Lichtenberg,Lee Lichtenstein,Pei Lin,Shiyun Ling,Jinze Liu,Wen-Bin Liu,Yingchun Liu,Virginia A. LiVolsi,Yiling Lu,Yussanne Ma,Harshad S. Mahadeshwar,Marco A. Marra,Michael Mayo,David G. McFadden,Shaowu Meng,Matthew Meyerson,Piotr A. Mieczkowski,Michael L. 
Miller,Gordon B. Mills,Richard A. Moore,Lisle E. Mose,Andrew J. Mungall,Bradley A. Murray,Yuri E. Nikiforov,Michael S. Noble,Akinyemi I. Ojesina,Taofeek K. Owonikoko,Bradley A. Ozenberger,Angeliki Pantazi,Michael Parfenov,Peter J. Park,Joel S. Parker,Evan O. Paull,Chandra Sekhar Pedamallu,Charles M. Perou,Jan F. Prins,Alexei Protopopov,Suresh Ramalingam,Nilsa C. Ramirez,Ricardo Ramirez,Benjamin J. Raphael,W. Kimryn Rathmell,Xiaojia Ren,Sheila Reynolds,Esther Rheinbay,Matthew D. Ringel,Michael Rivera,Jeffrey Roach,A. Gordon Robertson,Mara Rosenberg,Matthew Rosenthal,Sara Sadeghi,Gordon Saksena,Chris Sander,Netty Santoso,Jacqueline E. Schein,Nikolaus Schultz,Steven E. Schumacher,Raja R. Seethala,Jonathan G. Seidman,Yasin Senbabaoglu,Sahil Seth,Samantha Sharpe,Kenna R. Mills Shaw,John Paul Shen,Ronglai Shen,Steven I. Sherman,Margi Sheth,Yan Shi,Ilya Shmulevich,Gabriel Sica,Janae V. Simons,Rileen Sinha,Payal Sipahimalani,Robert C. Smallridge,Heidi J. Sofia,Matthew G. Soloway,Xingzhi Song,Carrie Sougnez,Chip Stewart,Petar Stojanov,Joshua M. Stuart,S. Onur Sumer,Yichao Sun,Barbara Tabak,Angela Tam,Donghui Tan,Jiabin Tang,Roy Tarnuzzer,Barry S. Taylor,Nina Thiessen,Leigh B. Thorne,Vesteinn Thorsson,R. Michael Tuttle,Christopher B. Umbricht,David Van Den Berg,Fabio Vandin,Umadevi Veluvolu,Roeland Verhaak,Michelle Vinco,Doug Voet,Vonn Walter,Zhining Wang,Scot Waring,Paul M. Weinberger,Nils Weinhold,John N. Weinstein,Daniel J. Weisenberger,David A. Wheeler,Matthew D. Wilkerson,Jocelyn Wilson,Michelle D. Williams,Daniel A. Winer,Lisa Wise,Junyuan Wu,Liu Xi,Andrew Wei Xu,Liming Yang,Lixing Yang,Travis I. Zack,Martha A. Zeiger,Dong Zeng,Jean C. Zenklusen,Ni Zhao,Hailei Zhang,Jianhua Zhang,Jiashan Zhang,Wei Zhang,Erik Zmuda,Lihua Zou +242 more
TL;DR: The genomic landscape of 496 PTCs is described and a reclassification of thyroid cancers into molecular subtypes that better reflect their underlying signaling and differentiation properties is proposed, which has the potential to improve their pathological classification and better inform the management of the disease.
A survey of best practices for RNA-seq data analysis
Ana Conesa, Pedro Madrigal, Sonia Tarazona, David Gomez-Cabrero, Alejandra Cervera, Andrew McPherson, Michał Wojciech Szcześniak, Daniel J. Gaffney, Laura L. Elo, Xuegong Zhang, Ali Mortazavi
TL;DR: All of the major steps in RNA-seq data analysis are reviewed, including experimental design, quality control, read alignment, quantification of gene and transcript levels, visualization, differential gene expression, alternative splicing, functional analysis, gene fusion detection and eQTL mapping.
Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer
A. Gordon Robertson,Jaegil Kim,Hikmat Al-Ahmadie,Joaquim Bellmunt,Guangwu Guo,Andrew D. Cherniack,Toshinori Hinoue,Peter W. Laird,Katherine A. Hoadley,Rehan Akbani,Mauro A. A. Castro,Ewan A. Gibb,Rupa S. Kanchi,Dmitry A. Gordenin,Sachet A. Shukla,Francisco Sanchez-Vega,Donna E. Hansel,Bogdan Czerniak,Victor E. Reuter,Xiaoping Su,Benilton S. Carvalho,Vinicius S Chagas,Karen Mungall,Sara Sadeghi,Chandra Sekhar Pedamallu,Yiling Lu,Leszek J. Klimczak,Jiexin Zhang,Caleb Choo,Akinyemi I. Ojesina,Susan Bullman,Kristen M. Leraas,Tara M. Lichtenberg,Catherine J. Wu,N. Schultz,Gad Getz,Matthew Meyerson,Gordon B. Mills,David J. McConkey,Monique Albert,Iakovina Alexopoulou,Adrian Ally,Tatjana Antic,Manju Aron,Miruna Balasundaram,John M. S. Bartlett,Stephen B. Baylin,Allison Beaver,Inanc Birol,Lori Boice,Moiz S. Bootwalla,Jay Bowen,Reanne Bowlby,Denise Brooks,Bradley M. Broom,Wiam Bshara,Eric J. Burks,Flavio Mavignier Carcano,Rebecca Carlsen,André Lopes Carvalho,Eric P. Castle,Patricia Castro,James W.F. Catto,David Chesla,Eric Chuah,Sudha Chudamani,Victoria K. Cortessis,Sandra Cottingham,Daniel Crain,Erin Curley,Siamak Daneshmand,John A. Demchok,Noreen Dhalla,Hooman Djaladat,John Eckman,Sophie C. Egea,Jay Engel,Ina Felau,Martin L. Ferguson,Johanna Gardner,Julie M. Gastier-Foster,Mark Gerken,Carmen Gomez-Fernandez,Jodi Harr,Arndt Hartmann,Lynn M. Herbert,Thai H. Ho,Robert A. Holt,Carolyn M. Hutter,Steven J.M. Jones,Merce Jorda,Richard J. Kahnoski,Katayoon Kasaian,David J. Kwiatkowski,Phillip H. Lai,Brian R. Lane,Seth P. Lerner,Jia Liu,Laxmi Lolla,Yair Lotan,Fabiano R. Lucchesi,Yussanne Ma,Roberto Dias Machado,Dennis T. Maglinte,David Mallery,Marco A. Marra,Sue E. Martin,Michael Mayo,Anoop Meraney,Alireza Moinzadeh,Richard A. Moore,Edna M. Mora Pinero,Scott Morris,Carl Morrison,Andrew J. Mungall,Jerome Myers,Rashi Naresh,Peter H. O'Donnell,Dipen J. Parekh,Jeremy Parfitt,Joseph Paulauskis,Robert Penny,Todd Pihl,Sima P. Porten,Mario Quintero-Aguilo,Nilsa C. Ramirez,W. 
Kimryn Rathmell,Kimberly M. Rieger-Christ,Charles Saller,Andrew Salner,George E. Sandusky,Cristovam Scapulatempo-Neto,Jacqueline E. Schein,Anne Schuckman,Candace Shelton,Troy Shelton,Jeff Simko,Parminder Singh,Payal Sipahimalani,Norm D. Smith,Heidi J. Sofia,Andrea Sorcini,Melissa L. Stanton,Gary D. Steinberg,Robert Stoehr,Travis Sullivan,Qiang Sun,Angela Tam,Roy Tarnuzzer,Katherine Tarvin,Helge Taubert,Nina Thiessen,Leigh B. Thorne,Kane Tse,Kelinda Tucker,David Van Den Berg,Kim E.M. van Kessel,Sven Wach,Yunhu Wan,Zhining Wang,John N. Weinstein,Daniel J. Weisenberger,Lisa Wise,Tina Wong,Ye Wu,Liming Yang,Leigh Anne Zach,Jean C. Zenklusen,Jiashan Zhang,Erik Zmuda,Ellen C. Zwarthoff +170 more
TL;DR: An analysis of 412 muscle-invasive bladder cancers characterized by multiple TCGA analytical platforms identified 5 expression subtypes that may stratify response to different treatments and identified a poor-survival "neuronal" subtype in which the majority of tumors lacked small cell or neuroendocrine histology.
References
R: A language and environment for statistical computing.
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini, Yosef Hochberg
TL;DR: This paper presents a different approach to problems of multiple significance testing: controlling the expected proportion of falsely rejected hypotheses, the false discovery rate (FDR). The FDR is equivalent to the family-wise error rate (FWER) when all null hypotheses are true but is smaller otherwise.
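The summary above does not describe the procedure itself. As an illustration, here is a minimal sketch of the step-up rule as it is commonly stated: sort the m p-values, find the largest rank k with p_(k) <= (k/m)·q, and reject the k smallest. The function name `benjamini_hochberg` and the example p-values are hypothetical, and the FDR guarantee at level q is usually stated for independent test statistics.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans, True where the hypothesis is rejected."""
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest rank k whose p-value lies under the line (k / m) * q.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            k_max = rank
    # Step-up: reject every hypothesis ranked at or below k_max.
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


# Hypothetical p-values: only the smallest falls under its threshold.
few = benjamini_hochberg([0.01, 0.04, 0.03, 0.5])
# Step-up behaviour: 0.035 passes at rank 3, so ranks 1 and 2 are rejected too.
step_up = benjamini_hochberg([0.02, 0.03, 0.035, 0.9])
```

The second call shows why the rule is called step-up: the two smallest p-values miss their own thresholds, yet they are rejected because a larger p-value passes at a higher rank.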
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Differential expression analysis for sequence count data.
Simon Anders, Wolfgang Huber
TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.
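For orientation, one common way to write the mean-variance relationship of a negative binomial count is given below. The notation (count K_{gj} for gene g in sample j, mean \mu_{gj}, dispersion \alpha_g) is assumed here for illustration and is not taken from the summary above.

```latex
\operatorname{E}[K_{gj}] = \mu_{gj}, \qquad
\operatorname{Var}(K_{gj}) = \mu_{gj} + \alpha_g \, \mu_{gj}^{2}
```

Setting \alpha_g = 0 recovers the Poisson case, where the variance equals the mean. Under this reading, linking variance and mean by local regression amounts to estimating the extra-Poisson term as a smooth function of the mean rather than separately for each gene.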
Mapping and quantifying mammalian transcriptomes by RNA-Seq.
TL;DR: Although >90% of uniquely mapped reads fell within known exons, the remaining data suggest new and revised gene models, including changed or additional promoters, exons and 3′ untranscribed regions, as well as new candidate microRNA precursors.