An efficient algorithm for the contig ordering problem under algebraic rearrangement distance.

J Comput Biol

Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan .

Published: November 2015

Assembling a genome from short reads currently obtained by next-generation sequencing techniques often results in a collection of contigs, whose relative position and orientation along the genome being sequenced are unknown. Given two sets of contigs, the contig ordering problem is to order and orient the contigs in each set such that the genome rearrangement distance between the resulting sets of ordered and oriented contigs is minimized. In this article, we utilize the permutation groups in algebra to propose a near-linear time algorithm for solving the contig ordering problem under algebraic rearrangement distance, where the algebraic rearrangement distance between two sets of ordered and oriented contigs is the minimum weight of applicable rearrangement operations required to transform one set into the other.

Download full-text PDF

Source
http://dx.doi.org/10.1089/cmb.2015.0073DOI Listing

Publication Analysis

Top Keywords

rearrangement distance
16
contig ordering
12
ordering problem
12
algebraic rearrangement
12
problem algebraic
8
distance sets
8
sets ordered
8
ordered oriented
8
oriented contigs
8
rearrangement
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!