We describe a new method for finding haplotype blocks based on the use of the minimum description length principle. We give a rigorous definition of the quality of a segmentation of a genomic region into blocks, and describe a dynamic programming algorithm for finding the optimal segmentation with respect to this measure. We also describe a method for finding the probability of a block boundary for each pair of adjacent markers: this gives a tool for evaluating the significance of each block boundary. We have applied the method to the published data of Daly et al. The results are in relatively good agreement with the published results, but also show clear differences in the predicted block boundaries and their strengths. We also give results on the block structure in population isolates.

Download full-text PDF

Source
http://dx.doi.org/10.1142/9789812776303_0047DOI Listing

Publication Analysis

Top Keywords

method finding
12
finding haplotype
8
haplotype blocks
8
block boundaries
8
describe method
8
block boundary
8
block
5
mdl method
4
finding
4
blocks estimating
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!