Genome Gerrymandering: optimal division of the genome into regions with cancer type specific differences in mutation rates.

Pac Symp Biocomput

Department of Computer Science, University of Toronto, 40 St. George Street, Room 7224, Toronto, ON M5S 2E4, Canada.

Published: March 2021

The activity of mutational processes differs across the genome, and is influenced by chromatin state and spatial genome organization. At the scale of one megabase-pair (Mb), regional mutation density correlate strongly with chromatin features and mutation density at this scale can be used to accurately identify cancer type. Here, we explore the relationship between genomic region and mutation rate by developing an information theory driven, dynamic programming algorithm for dividing the genome into regions with differing relative mutation rates between cancer types. Our algorithm improves mutual information when compared to the naive approach, effectively reducing the average number of mutations required to identify cancer type. Our approach provides an efficient method for associating regional mutation density with mutation labels, and has future applications in exploring the role of somatic mutations in a number of diseases.

Download full-text PDF

Source

Publication Analysis

Top Keywords

cancer type
12
mutation density
12
genome regions
8
mutation rates
8
regional mutation
8
identify cancer
8
mutation
7
genome
5
genome gerrymandering
4
gerrymandering optimal
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!