Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments
TL;DR: The steps involved in preprocessing sRNA sequencing data are outlined, both the principles behind and the current options for normalization are reviewed, and differential expression analysis in the absence and presence of biological replicates is discussed.
read more
Abstract: Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in the design of sRNA sequencing experiments, including choosing a sequencing platform, inherent biases that affect sRNA measurements and replication. We outline the steps involved in preprocessing sRNA sequencing data and review both the principles behind and the current options for normalization. Finally, we discuss differential expression analysis in the absence and presence of biological replicates. While our focus is on sRNA sequencing experiments, many of the principles discussed are applicable to the sequencing of other RNA populations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Deep sequencing of small RNAs from human skin reveals major alterations in the psoriasis miRNAome
Cailin E. Joyce,Xiang Zhou,Jing Xia,Caitriona Ryan,Breck Thrash,Alan Menter,Weixiong Zhang,Anne M. Bowcock +7 more
TL;DR: A comprehensive analysis of the normal and psoriatic skin miRNAome with next-generation sequencing in a large patient cohort lays a critical framework for functional characterization of miRNAs in healthy and diseased skin.
253
Mobile small RNAs regulate genome-wide DNA methylation
Mathew G. Lewsey,Thomas J. Hardcastle,Charles W. Melnyk,Attila Molnar,Adrian Valli,Mark A. Urich,Joseph R. Nery,David C. Baulcombe,Joseph R. Ecker +8 more
TL;DR: In this article, it was shown that RNA-directed DNA methylation occurs predominantly in non-CG contexts and is largely dependent on the DOMAINS REARRANGED METHYLTRANSFERASES 1/2 (DRM1/DRM2) RdDM pathway.
Characterization of the stress associated microRNAs in Glycine max by deep sequencing.
Haiyan Li,Haiyan Li,Dong Yuanyuan,Hailong Yin,Nan Wang,Jing Yang,Xiuming Liu,Xiuming Liu,Yanfang Wang,Yanfang Wang,Jinyu Wu,Jinyu Wu,Xiaokun Li +12 more
TL;DR: This study has important implications for further identification of gene regulation under abiotic stresses and significantly contributes a complete profile of miRNAs in Glycine max.
Small RNA deep sequencing discriminates subsets of extracellular vesicles released by melanoma cells--Evidence of unique microRNA cargos.
Taral R. Lunavat,Lesley Cheng,Dae-Kyum Kim,Joydeep Bhadury,Su Chul Jang,Cecilia Lässer,Robyn A. Sharples,Marcela Dávila López,Jonas Nilsson,Yong Song Gho,Andrew F. Hill,Jan Lötvall +11 more
TL;DR: This study shows for the first time the presence of distinct small RNAs in subsets of EVs released by melanoma cells, with significant similarities to clinical melanoma tissue, and provides unique insights into the contribution of EV associated extracellular RNA in cancer.
181
Autotetraploid rice methylome analysis reveals methylation variation of transposable elements and their effects on gene expression.
TL;DR: It is found that WGD prompts increased methylation in class II transposable elements, which then suppress the genome-wide expression level of nearby genes, and chromosome doubling induces methylation variation in TEs that affect gene expression and may become a “genome shock” response factor to help neoautopolyploids adapt to genome-dosage effects.
178
References
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
39.8K
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.
Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes
Jo Vandesompele,Katleen De Preter,Filip Pattyn,Bruce Poppe,Nadine Van Roy,Anne De Paepe,Franki Speleman +6 more
TL;DR: The normalization strategy presented here is a prerequisite for accurate RT-PCR expression profiling, which opens up the possibility of studying the biological relevance of small expression differences.
Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans
Andrew Fire,SiQun Xu,Mary K. Montgomery,Steven A. Kostas,Steven A. Kostas,Samuel E. Driver,Craig C. Mello +6 more
TL;DR: To their surprise, it was found that double-stranded RNA was substantially more effective at producing interference than was either strand individually, arguing against stochiometric interference with endogenous mRNA and suggesting that there could be a catalytic or amplification component in the interference process.
16.7K
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
Cole Trapnell,Cole Trapnell,Brian A. Williams,Geo Pertea,Ali Mortazavi,Gordon Kwan,Marijke J. van Baren,Steven L. Salzberg,Barbara J. Wold,Lior Pachter +9 more
TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.