SPACER: identification of cis-regulatory elements with non-contiguous critical residues.

Arijit Chakravarty Jonathan M Carlson Radhika S Khetani Charles E DeZiel Robert H Gross

Bioinformatics

Department of Cancer Pharmacology, Millennium Pharmaceuticals Inc., Cambridge, MA, USA.

Published: April 2007

Motivation: Many transcription factors bind to sites that are long and loosely related to each other. De novo identification of such motifs is computationally challenging. In this article, we propose a novel semi-greedy algorithm over the space of all IUPAC degenerate strings to identify the most over-represented highly degenerate motifs.

Results: We present an implementation of this algorithm, named SPACER (Separated Pattern-based Algorithm for cis-Element Recognition) and demonstrate its effectiveness in identifying 'gapped' and highly degenerate motifs. We compare SPACER's performance against ten motif finders on 42 experimentally defined regulons from Bacillus subtilis, Escherichia coli and Saccharomyces cerevisiae. These motif finders cover a wide range of both enumerative and statistical approaches, including programs specifically designed for prokaryotic and 'gapped' motifs.

Availability: A Java 1.4 implementation is freely available on the Web at http://genie.Dartmouth.edu/SPACER/

Download full-text PDF	Source
http://dx.doi.org/10.1093/bioinformatics/btm041	DOI Listing

Publication Analysis

Top Keywords

highly degenerate

motif finders

spacer identification

identification cis-regulatory

cis-regulatory elements

elements non-contiguous

non-contiguous critical

critical residues

residues motivation

motivation transcription

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!