Mining poly-regions in DNA.

Int J Data Min Bioinform

Department of Information and Computer Science, Aalto University 00076, Finland.

Published: January 2013

We study the problem of mining poly-regions in DNA. A poly-region is defined as a bursty DNA area, i.e., an area in which a DNA pattern occurs with elevated frequency. We introduce a general formulation that covers a range of meaningful types of poly-regions and develop three efficient detection methods. The first is entropy-based and applies recursive segmentation. The second uses a set of sliding windows that summarize each sequence segment using several statistics. The third employs a technique based on majority vote. The proposed algorithms are evaluated on DNA sequences of four different organisms in terms of recall and runtime.
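
To make the notion of a bursty area concrete, here is a minimal sketch of a single-statistic sliding-window scan. It is not the authors' algorithm: the abstract does not give the window statistics or thresholds, so the function name `find_bursty_regions` and the parameters `window` and `min_freq` are assumptions chosen for illustration only. The sketch flags every window in which the pattern's frequency reaches `min_freq` and merges overlapping windows into regions.

```python
from typing import List, Tuple


def find_bursty_regions(
    seq: str, pattern: str, window: int, min_freq: float
) -> List[Tuple[int, int]]:
    """Return merged [start, end) spans covered by windows whose pattern
    frequency is at least min_freq (hypothetical parameters)."""
    k = len(pattern)
    if k == 0 or window < k or len(seq) < window:
        return []

    # hits[i] is 1 when the pattern occurs starting at position i.
    hits = [1 if seq.startswith(pattern, i) else 0 for i in range(len(seq) - k + 1)]

    # Number of start positions at which the pattern fits inside one window.
    starts_per_window = window - k + 1
    count = sum(hits[:starts_per_window])

    regions: List[Tuple[int, int]] = []
    for start in range(len(seq) - window + 1):
        if start > 0:
            # Slide by one: add the newly covered position, drop the old one.
            count += hits[start + starts_per_window - 1] - hits[start - 1]
        if count / starts_per_window >= min_freq:
            end = start + window
            if regions and start <= regions[-1][1]:
                # Overlaps or touches the previous region: extend it.
                regions[-1] = (regions[-1][0], end)
            else:
                regions.append((start, end))
    return regions
```

For example, on the toy sequence `"ACGT" * 3 + "A" * 12 + "ACGT" * 3` with pattern `"A"`, `window=6`, and `min_freq=0.8`, the scan reports the single region `(11, 26)`, which covers the run of consecutive `A` bases plus the window margins on either side.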

Source
http://dx.doi.org/10.1504/ijdmb.2012.049278

