A biological compression model, expert model, is presented which is superior to existing compression algorithms in both compression performance and speed. The model is able to compress whole eukaryotic genomes. Most importantly, the model provides a framework for knowledge discovery from biological data. It can be used for repeat element discovery, sequence alignment and phylogenetic analysis. We demonstrate that the model can handle statistically biased sequences and distantly related sequences where conventional knowledge discovery tools often fail.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/978-1-4419-7046-6_67 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!