In this work, a novel method was developed to distinguish nucleosome DNA and linker DNA based on increment of diversity combined with quadratic discriminant analysis (IDQD), using k-mer frequency of nucleotides in genome. When used to predict DNA potential for forming nucleosomes, the model achieved a high accuracy of 94.94%, 77.60%, and 86.81%, respectively, for Saccharomyces cerevisiae, Homo sapiens, and Drosophila melanogaster. The area under the receiver operator characteristics curve of our classifier was 0.982 for S. cerevisiae. Our results indicate that DNA sequence preference is critical for nucleosome formation potential and is likely conserved across eukaryotes. The model successfully identified nucleosome-enriched or nucleosome-depleted regions in S. cerevisiae genome, suggesting nucleosome positioning depends on DNA sequence preference. Thus, IDQD classifier is useful for predicting nucleosome positioning.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s10577-010-9160-9 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!