Detecting simultaneous changepoints in multiple sequences.
TL;DR: It is shown using replicates and parent-child comparisons that pooling data across samples results in more accurate detection of copy number variants and the multisample segmentation algorithm is applied to the analysis of a cohort of tumour samples containing complex nested and overlapping copy number aberrations.
read more
Abstract: We discuss the detection of local signals that occur at the same location in multiple one-dimensional noisy sequences, with particular attention to relatively weak signals that may occur in only a fraction of the sequences. We propose simple scan and segmentation algorithms based on the sum of the chi-squared statistics for each individual sample, which is equivalent to the generalized likelihood ratio for a model where the errors in each sample are independent. The simple geometry of the statistic allows us to derive accurate analytic approximations to the significance level of such scans. The formulation of the model is motivated by the biological problem of detecting recurrent DNA copy number variants in multiple samples. We show using replicates and parent-child comparisons that pooling data across samples results in more accurate detection of copy number variants. We also apply the multisample segmentation algorithm to the analysis of a cohort of tumour samples containing complex nested and overlapping copy number aberrations, for which our method gives a sparse and intuitive cross-sample summary.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
On optimal multiple changepoint algorithms for large data
TL;DR: Empirical results show that FPOP is substantially faster than existing dynamic programming methods, and unlike the existing methods its computational efficiency is robust to the number of changepoints in the data.
•Posted Content
The group fused Lasso for multiple change-point detection
TL;DR: The group fused Lasso is presented for detection of multiple change-points shared by a set of co-occurring one-dimensional signals and fast algorithms are proposed to solve the resulting optimization problems.
Graph-based change-point detection
TL;DR: In this paper, a nonparametric graph-based approach is proposed to detect change points in a data sequence, which can be applied to any data set as long as an informative similarity measure on the sample space can be defined.
169
SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing
TL;DR: Evaluated on a diverse set of scDNA-seq data in cancer genomics and it is shown that SCOPE offers accurate copy-number estimates and successfully reconstructs subclonal structure.
84
Fast detection of multiple change-points shared by many signals
Jean-Philippe Vert,Kevin Bleakley +1 more
- 01 Jan 2008
TL;DR: A fast algorithm for the detection of multiple change-points when each is frequently shared by members of a set of co-occurring one-dimensional signals is presented.
80
References
Global variation in copy number in the human genome
Richard Redon,Shumpei Ishikawa,Karen R. Fitch,Lars Feuk,George H. Perry,T. Daniel Andrews,Heike Fiegler,Michael H. Shapero,Andrew R. Carson,Wenwei Chen,Eun Kyung Cho,Stephanie Dallaire,Jennifer L. Freeman,Juan R. González,Mònica Gratacòs,Jing Huang,Dimitrios Kalaitzopoulos,Daisuke Komura,Jeffrey R. MacDonald,Christian R. Marshall,Rui Mei,Lyndal Montgomery,Keunihiro Nishimura,Kohji Okamura,Fan Shen,Martin J. Somerville,Joelle Tchinda,Armand Valsesia,Cara Woodwark,Fengtang Yang,Junjun Zhang,Tatiana Zerjal,Jane Zhang,Lluís Armengol,Donald F. Conrad,Xavier Estivill,Chris Tyler-Smith,Nigel P. Carter,Hiroyuki Aburatani,Charles Lee,Keith W. Jones,Stephen W. Scherer,Matthew E. Hurles +42 more
TL;DR: A first-generation CNV map of the human genome is constructed through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia, underscoring the importance of CNV in genetic diversity and evolution and the utility of this resource for genetic disease studies.
4.6K
Detection of large-scale variation in the human genome.
A. John Iafrate,Lars Feuk,Miguel Rivera,Miguel Rivera,Marc L. Listewnik,Patricia K. Donahoe,Ying Qi,Stephen W. Scherer,Charles Lee,Charles Lee +9 more
TL;DR: This article identified 255 loci across the human genome that contain genomic imbalances among unrelated individuals, and revealed that half of these regions overlap with genes, and many coincide with segmental duplications or gaps in human genome assembly.
Circular binary segmentation for the analysis of array-based DNA copy number data.
TL;DR: A modification ofbinary segmentation is developed, which is called circular binary segmentation, to translate noisy intensity measurements into regions of equal copy number in DNA sequence copy number.
Diet and the evolution of human amylase gene copy number variation.
George H. Perry,Nathaniel J. Dominy,Katrina G. Claw,Arthur Lee,Heike Fiegler,Richard Redon,John C. Werner,Fernando A. Villanea,Joanna L. Mountain,Rajeev Misra,Nigel P. Carter,Charles Lee,Anne C. Stone +12 more
TL;DR: It is found that copy number of the salivary amylase gene (AMY1) is correlated positively with salivaries protein level and that individuals from populations with high-starch diets have, on average, more AMY1 copies than those with traditionally low-st starch diets.
Mapping and sequencing of structural variation from eight human genomes
Jeffrey M. Kidd,Gregory M. Cooper,William F. Donahue,Hillary S. Hayden,Nick Sampas,Tina Graves,Nancy F. Hansen,Brian Teague,Can Alkan,Francesca Antonacci,Eric Haugen,Troy Zerr,N. Alice Yamada,Peter Tsang,Tera L. Newman,Eray Tüzün,Ze Cheng,Heather Ebling,Nadeem Tusneem,Robert David,Will D. Gillett,Karen A. Phelps,Molly Weaver,David J. Saranga,Adrianne Brand,Wei Tao,Erik Gustafson,Kevin McKernan,Lin Chen,Maika Malig,Joshua D. Smith,Joshua M. Korn,Steven A. McCarroll,David Altshuler,Daniel A. Peiffer,Michael O. Dorschner,John A. Stamatoyannopoulos,David C. Schwartz,Deborah A. Nickerson,James C. Mullikin,Richard K. Wilson,Laurakay Bruhn,Maynard V. Olson,Rajinder Kaul,Douglas R. Smith,Evan E. Eichler +45 more
TL;DR: This work employs a clone-based method to interrogate intermediate structural variation in eight individuals of diverse geographic ancestry and provides the first high-resolution sequence map of human structural variation—a standard for genotyping platforms and a prelude to future individual genome sequencing projects.