is-rSNP
TL;DR: The results demonstrate that is-rSNP is suitable for high-throughput screening of SNPs for potential regulatory function and is a useful and important tool in the interpretation of GWAS.
read more
Abstract: Motivation: Determining the functional impact of non-coding disease-associated single nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) is challenging. Many of these SNPs are likely to be regulatory SNPs (rSNPs): variations which affect the ability of a transcription factor (TF) to bind to DNA. However, experimental procedures for identifying rSNPs are expensive and labour intensive. Therefore, in silico methods are required for rSNP prediction. By scoring two alleles with a TF position weight matrix (PWM), it can be determined which SNPs are likely rSNPs. However, predictions in this manner are noisy and no method exists that determines the statistical significance of a nucleotide variation on a PWM score.
Results: We have designed an algorithm for in silico rSNP detection called is-rSNP. We employ novel convolution methods to determine the complete distributions of PWM scores and ratios between allele scores, facilitating assignment of statistical significance to rSNP effects. We have tested our method on 41 experimentally verified rSNPs, correctly predicting the disrupted TF in 28 cases. We also analysed 146 disease-associated SNPs with no known functional impact in an attempt to identify candidate rSNPs. Of the 11 significantly predicted disrupted TFs, 9 had previous evidence of being associated with the disease in the literature. These results demonstrate that is-rSNP is suitable for high-throughput screening of SNPs for potential regulatory function. This is a useful and important tool in the interpretation of GWAS.
Availability: is-rSNP software is available for use at: www.genomics.csse.unimelb.edu.au/is-rSNP
Contact:gmaci@csse.unimelb.edu.au; adam.kowalczyk@nicta.com.au
Supplementary information:Supplementary data are available at Bioinformatics online.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants
Lucas D. Ward,Manolis Kellis +1 more
TL;DR: HaploReg is presented, a tool for exploring annotations of the non-coding genome among the results of published GWAS or novel sets of variants, and will be useful to researchers developing mechanistic hypotheses of the impact of non-Coding variants on clinical phenotypes and normal variation.
Role of non-coding sequence variants in cancer
Ekta Khurana,Yao Fu,Dimple Chakravarty,Francesca Demichelis,Francesca Demichelis,Mark A. Rubin,Mark Gerstein +6 more
TL;DR: The current understanding of non-coding variants in cancer is reviewed, including the great diversity of the mutation types — from single nucleotide variants to large genomic rearrangements — and the wide range of mechanisms by which they affect gene expression to promote tumorigenesis.
495
The Genetics of Transcription Factor DNA Binding Variation
TL;DR: The findings that led to this important paradigm shift are summarized and proposed mechanisms for local, proximal, or distal genetic variation-driven variable TF-DNA binding are reviewed.
418
CopraRNA and IntaRNA: predicting small RNA targets, networks and interaction domains
Patrick R. Wright,Jens Georg,Martin Mann,Dragos Alexandru Sorescu,Andreas S. Richter,Steffen C. Lott,Robert Kleinkauf,Wolfgang R. Hess,Rolf Backofen +8 more
TL;DR: The functionality of the CopraRNA and IntaRNA webservers are introduced and detailed explanations on their postprocessing functionalities are given.
351
Bioinformatics challenges for personalized medicine
TL;DR: This review outlines recent developments in sequencing technologies and genome analysis methods for application in personalized medicine and outlines new methods needed in four areas to realize the potential of personalized medicine.
References
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini,Yosef Hochberg +1 more
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
WebLogo: A Sequence Logo Generator
TL;DR: WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment that provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive.
TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes
V. Matys,Olga V. Kel-Margoulis,Ellen Fricke,Ines Liebich,Sigrid Land,A. Barre-Dirrie,Ingmar Reuter,D. Chekmenev,Mathias Krull,Klaus Hornischer,Nico Voss,Philip Stegmaier,Birgit Lewicki-Potapov,H. Saxel,Alexander E. Kel,Edgar Wingender +15 more
TL;DR: The TRANSFAC® database on transcription factors, their binding sites, nucleotide distribution matrices and regulated genes as well as the complementing database TRANSCompel® on composite elements have been further enhanced on various levels.
An oestrogen-receptor-α-bound human chromatin interactome
Melissa J. Fullwood,Mei Hui Liu,You Fu Pan,Jun Liu,Han Xu,Yusoff Bin Mohamed,Yuriy L. Orlov,Stoyan Velkov,Andrea Ho,Poh Huay Mei,Elaine G.Y. Chew,Phillips Yao Hui Huang,Willem Welboren,Yuyuan Han,Hong Sain Ooi,Pramila N. Ariyaratne,Vinsensius B. Vega,Yanquan Luo,Peck Yean Tan,Pei Ye Choy,K. D. Senali Abayratna Wansa,Bing Zhao,Kar Sian Lim,Shi Chi Leow,Jit Sin Yow,Roy Joseph,Haixia Li,Kartiki V. Desai,Jane S. Thomsen,Yew Kok Lee,R. Krishna Murthy Karuturi,Thoreau Herve,Guillaume Bourque,Hendrik G. Stunnenberg,Xiaoan Ruan,Valere Cacheux-Rataboul,Wing-Kin Sung,Wing-Kin Sung,Edison T. Liu,Chia-Lin Wei,Edwin Cheung,Edwin Cheung,Edwin Cheung,Yijun Ruan,Yijun Ruan +44 more
TL;DR: It is proposed that chromatin interactions constitute a primary mechanism for regulating transcription in mammalian genomes and is described as a new strategy, chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) for the de novo detection of global Chromatin interactions.
1.8K
Mechanisms of disease: pathogenesis of Crohn's disease and ulcerative colitis.
TL;DR: These studies indicate that Crohn's disease and ulcerative colitis are heterogeneous diseases characterized by various genetic abnormalities that lead to overly aggressive T-cell responses to a subset of commensal enteric bacteria.
1.8K