AI Article Synopsis

  • Interest in planted (l, d) motif search (PMS) is growing, particularly for exploring significant segments in biological sequences, though large-alphabet applications have been under-discussed.
  • The paper introduces motif stem search (MSS), aimed at finding l-length string "stems" with wildcards to represent a minimal superset of all (l, d) motifs in large-alphabet sequences.
  • Key contributions include precise motif stem representation with regular expressions, a method to generate non-redundant stems, and the StemFinder algorithm that performs faster and with fewer stems than previous MSS methods, available at a specified link.

Article Abstract

In recent years, there has been an increasing interest in planted (l, d) motif search (PMS) with applications to discovering significant segments in biological sequences. However, there has been little discussion about PMS over large alphabets. This paper focuses on motif stem search (MSS), which is recently introduced to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l , d) motifs present in the input sequences, and the superset is expected to be as small as possible. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a method for generating all possible motif stems without redundant wildcards. (3) We propose an efficient exact algorithm, called StemFinder, for solving the MSS problem. Compared with the previous MSS algorithms, StemFinder runs much faster and reports fewer stems which represent a smaller superset of all (l, d) motifs. StemFinder is freely available at http://sites.google.com/site/feqond/stemfinder.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2014.2361668DOI Listing

Publication Analysis

Top Keywords

motif stem
16
efficient exact
8
exact algorithm
8
stem search
8
large alphabets
8
mss problem
8
superset motifs
8
motif
6
algorithm motif
4
stem
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!