Using Huffman coding method to visualize and analyze DNA sequences.

J Comput Chem

College of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, People's Republic of China.

Published: November 2011

On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequences. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. A multicomponent vector, whose components are derived from the graphical representation of the DNA primary sequence, is then introduced to characterize DNA sequences quantitatively. An examination of the similarities and dissimilarities among the complete coding sequences of the β-globin gene of 11 species and among six ND6 proteins shows the utility of the scheme.
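
The abstract does not say how the Huffman codes are mapped onto the graphical representation, so the following is only a minimal sketch of the general Huffman coding step applied to a nucleotide string, not the authors' method. The function names (`huffman_codes`, `encode`, `decode`) and the toy sequence are assumptions made for illustration. The sketch shows that frequent bases receive shorter codes and that the encoding can be inverted without loss.

```python
import heapq
from collections import Counter


def huffman_codes(sequence):
    """Map each symbol in `sequence` to a prefix-free bit string (hypothetical helper)."""
    freq = Counter(sequence)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs a one-bit code.
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, unique tie-breaker, {symbol: partial code}).
    # The unique integer guarantees the dicts are never compared.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        # Prepend one bit to every code in each merged subtree.
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]


def encode(sequence, codes):
    """Concatenate the code of every symbol in order."""
    return "".join(codes[s] for s in sequence)


def decode(bits, codes):
    """Invert `encode`; works because Huffman codes are prefix-free."""
    inverse = {c: s for s, c in codes.items()}
    out, buf = [], ""
    for b in bits:
        buf += b
        if buf in inverse:
            out.append(inverse[buf])
            buf = ""
    return "".join(out)


# Toy sequence (an assumption, not data from the paper).
toy = "AAAAAAAACCCCGGT"
codes = huffman_codes(toy)
bits = encode(toy, codes)
print(codes, len(bits), decode(bits, codes) == toy)
```

Because the codes are prefix-free, the bit string decodes back to exactly one sequence; how the paper turns such codes into a non-degenerate curve is not stated in the abstract.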

Source
http://dx.doi.org/10.1002/jcc.21906
