Graph-based representations are considered to be the future for reference genomes, as they allow integrated representation of the steadily increasing data on individual variation. Currently available tools allow de novo assembly of graph-based reference genomes, alignment of new read sets to the graph representation, and certain analyses such as variant calling and haplotyping. Here we present a first method for calling ChIP-Seq peaks on read data aligned to a graph-based reference genome. The method is a graph generalization of the peak caller MACS2 and is implemented in an open-source tool, Graph Peak Caller. Using the existing tool vg to build a pan-genome of Arabidopsis thaliana, we validate our approach by showing that Graph Peak Caller with a pan-genome reference graph can trace variants within peaks that are not part of the linear reference genome, and that it finds peaks that are in general more motif-enriched than those found by MACS2.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396939
DOI: http://dx.doi.org/10.1371/journal.pcbi.1006731
