Detecting simultaneous changepoints in multiple sequences.

Biometrika

Department of Statistics , Stanford University, 390 Serra Mall, Stanford, California 94305-4065 , U.S.A.

Published: September 2010

We discuss the detection of local signals that occur at the same location in multiple one-dimensional noisy sequences, with particular attention to relatively weak signals that may occur in only a fraction of the sequences. We propose simple scan and segmentation algorithms based on the sum of the chi-squared statistics for each individual sample, which is equivalent to the generalized likelihood ratio for a model where the errors in each sample are independent. The simple geometry of the statistic allows us to derive accurate analytic approximations to the significance level of such scans. The formulation of the model is motivated by the biological problem of detecting recurrent DNA copy number variants in multiple samples. We show using replicates and parent-child comparisons that pooling data across samples results in more accurate detection of copy number variants. We also apply the multisample segmentation algorithm to the analysis of a cohort of tumour samples containing complex nested and overlapping copy number aberrations, for which our method gives a sparse and intuitive cross-sample summary.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3372242PMC
http://dx.doi.org/10.1093/biomet/asq025DOI Listing

Publication Analysis

Top Keywords

copy number
12
signals occur
8
number variants
8
detecting simultaneous
4
simultaneous changepoints
4
changepoints multiple
4
multiple sequences
4
sequences discuss
4
discuss detection
4
detection local
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!