DNA encoding for an efficient 'Omics processing.

Comput Methods Programs Biomed

University of Ljubljana, Faculty of Electrical Engineering, Machine Vision Laboratory, Trzaska cesta 25, 1000 Ljubljana, Slovenia.

Published: November 2010

The exponential growth of available DNA sequences and the increased interoperability of biological information are triggering intergovernmental efforts aimed at improving access to, dissemination of, and analysis of sequence data. Efficient storage and processing of DNA material is an important goal that aligns well with the anticipated coding standardization. This paper proposes novel coding approaches, for both the dissemination and the processing of sequences, in which DNA processing is shown to be faster when more than the normally utilized eight bits are used to encode a single nucleotide. Further gains are achieved by encoding the nucleotides together with their trailing alignment information as a single 64-bit data structure. The paper also proposes a slight modification to the established FASTA scheme to improve its representation of alignment information. Empirical tests yield encouraging results that support these proposals.
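
To make the 64-bit idea concrete, here is a minimal Python sketch of one way a nucleotide and its trailing alignment information could share a single 64-bit word. The abstract does not give the paper's actual bit layout, so everything here is an assumption made for illustration: the low 8 bits hold the nucleotide's ASCII code, the upper 56 bits hold the number of alignment gaps that follow it, and the names `pack`, `unpack`, `encode_aligned`, and `decode_aligned` are hypothetical.

```python
# Hypothetical layout (an assumption, not the paper's published format):
#   bits 0-7   : ASCII code of the nucleotide symbol
#   bits 8-63  : number of alignment gaps that trail the nucleotide
BASE_BITS = 8
BASE_MASK = (1 << BASE_BITS) - 1
MAX_GAPS = (1 << (64 - BASE_BITS)) - 1


def pack(base, trailing_gaps=0):
    """Pack one nucleotide and its trailing gap count into a 64-bit integer."""
    if len(base) != 1 or ord(base) > BASE_MASK:
        raise ValueError("base must be a single 8-bit character")
    if not 0 <= trailing_gaps <= MAX_GAPS:
        raise ValueError("gap count does not fit in 56 bits")
    return (trailing_gaps << BASE_BITS) | ord(base)


def unpack(word):
    """Return (nucleotide, trailing gap count) from a packed word."""
    return chr(word & BASE_MASK), word >> BASE_BITS


def encode_aligned(row, gap="-"):
    """Encode a gapped alignment row as a list of packed words.

    Each run of gaps is folded into the word of the nucleotide before it.
    Leading gaps have no preceding nucleotide and are rejected here.
    """
    words = []
    for symbol in row:
        if symbol == gap:
            if not words:
                raise ValueError("leading gaps are not handled by this sketch")
            base, gaps = unpack(words[-1])
            words[-1] = pack(base, gaps + 1)
        else:
            words.append(pack(symbol))
    return words


def decode_aligned(words, gap="-"):
    """Rebuild the gapped alignment row from packed words."""
    return "".join(base + gap * gaps for base, gaps in map(unpack, words))


if __name__ == "__main__":
    row = "AC--GT-"
    words = encode_aligned(row)
    print([hex(w) for w in words])
    print(decode_aligned(words) == row)
```

In this sketch an aligned row needs one word per nucleotide regardless of how many gaps it contains, which is one plausible reading of why bundling alignment information with the nucleotide could speed up processing; the paper's own structure and measurements may differ.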


Source
http://dx.doi.org/10.1016/j.cmpb.2010.03.014

