An efficient method for significant motifs discovery from multiple DNA sequences.

Abdulrakeeb M Al-Ssulami Aqil M Azmi Hassan Mathkour

J Bioinform Comput Biol

1 Department of Computer Science, College of Computer & Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia.

Published: August 2017

Identification of transcription factor binding sites or biological motifs is an important step in deciphering the mechanisms of gene regulation. It is a classic problem that has eluded a satisfactory and efficient solution. In this paper, we devise a three-phase algorithm to mine for biologically significant motifs. In the first phase, we generate all the possible string motifs, this phase is followed by a filtering process where we discard all motifs that do not meet the constraints. And in the final phase, motifs are scored and ranked using a combination of stochastic techniques and [Formula: see text]-value. We show that our method outperforms some very well-known motif discovery tools, e.g. MEME and Weeder on well-established benchmark data suites. We also apply the algorithm on the non-coding regions of M. tuberculosis and report significant motifs of size 10 with excellent [Formula: see text]-values in a fraction of the time MEME and MoSDi did. In fact, among the best 10 motifs ([Formula: see text]-value wise) in the non-coding regions of M. tuberculosis reported by the tools MEME, MoSDi and ours, five were discovered by our approach which included the third and the fourth best ones. All this in 1/17 and 1/6 the time which MEME and MoSDi (respectively) took.

Download full-text PDF	Source
http://dx.doi.org/10.1142/S0219720017500147	DOI Listing

Publication Analysis

Top Keywords

meme mosdi

motifs

motifs phase

[formula text]-value

tools meme

non-coding regions

regions m tuberculosis

time meme

efficient method

method motifs

Similar Publications

An efficient method for significant motifs discovery from multiple DNA sequences.

J Bioinform Comput Biol

August 2017

1 Department of Computer Science, College of Computer & Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia.

Abdulrakeeb M Al-Ssulami Aqil M Azmi Hassan Mathkour

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!