Similarity analysis for DNA sequences based on chaos game representation. Case study: the albumin.

J Theor Biol

Department of Physics I, Faculty of Applied Sciences, Politehnica University of Bucharest, 313 Splaiul Independentei, RO-060042, Bucharest, Romania.

Published: December 2010

Using chaos game representation (CGR), we introduce a novel and straightforward method for identifying similarities and dissimilarities between DNA sequences of the same type from different organisms. A matrix is associated with each CGR pattern, and the similarities result from comparing the matrices of the sequences of interest. Three methods of analyzing the resulting difference matrix are considered: a three-dimensional representation giving both local and global information, a numerical characterization based on an n-letter word similarity measure, and a statistical evaluation. The method is illustrated by applying it to albumin nucleotide sequences from eight mammalian species, taking human albumin as the reference.
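The matrix comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the corner layout (A, C, G, T), the choice of a 2^k x 2^k grid of k-letter word frequencies as the "matrix associated with each CGR pattern", the normalization, and the function names `cgr_matrix` and `difference_matrix` are all assumptions, since the abstract does not specify them.

```python
import numpy as np

# Assumed corner layout (not given in the abstract):
# A=(0,0), C=(0,1), G=(1,1), T=(1,0), stored as (x_bit, y_bit).
CORNER_BITS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_matrix(seq, k=3):
    """Return a 2^k x 2^k matrix of relative k-letter word frequencies.

    Each k-letter word maps to the grid cell that contains its CGR point.
    The most recent base fixes the coarsest quadrant, so it supplies the
    most significant bit of the row and column indices.
    """
    size = 2 ** k
    counts = np.zeros((size, size), dtype=float)
    seq = seq.upper()
    for start in range(len(seq) - k + 1):
        word = seq[start:start + k]
        # Skip words containing ambiguous symbols such as N.
        if any(base not in CORNER_BITS for base in word):
            continue
        col = row = 0
        for j, base in enumerate(word):
            bx, by = CORNER_BITS[base]
            col |= bx << j  # last base in the word ends up as the top bit
            row |= by << j
        counts[row, col] += 1
    total = counts.sum()
    return counts / total if total else counts


def difference_matrix(reference, other, k=3):
    """Element-wise difference between two normalized CGR matrices."""
    return cgr_matrix(reference, k) - cgr_matrix(other, k)


if __name__ == "__main__":
    # Toy sequences for illustration only; these are not albumin data.
    diff = difference_matrix("ACGTACGGTCA", "ACGTTCGGACA", k=2)
    print(diff)
    # One possible scalar summary (an assumption, not the paper's
    # n-letter word similarity measure): total absolute difference.
    print(np.abs(diff).sum())
```

Indexing cells directly from the k-letter word, rather than iterating floating-point coordinates, keeps the cell assignment exact for long sequences. A zero difference matrix means the two sequences have identical k-letter word frequencies at the chosen resolution, which does not imply that the sequences themselves are identical.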


Source
http://dx.doi.org/10.1016/j.jtbi.2010.09.027

