Sequence motif finder using memetic algorithm.

BMC Bioinformatics

Department of Computer Science, Bioinformatics Graduate Program, Federal University of Technology - Paraná, Cornélio Procópio, PR, Brazil.

Published: January 2018

Background: De novo prediction of Transcription Factor Binding Sites (TFBS) using computational methods is a difficult task and it is an important problem in Bioinformatics. The correct recognition of TFBS plays an important role in understanding the mechanisms of gene regulation and helps to develop new drugs.

Results: We here present Memetic Framework for Motif Discovery (MFMD), an algorithm that uses semi-greedy constructive heuristics as a local optimizer. In addition, we used a hybridization of the classic genetic algorithm as a global optimizer to refine the solutions initially found. MFMD can find and classify overrepresented patterns in DNA sequences and predict their respective initial positions. MFMD performance was assessed using ChIP-seq data retrieved from the JASPAR site, promoter sequences extracted from the ABS site, and artificially generated synthetic data. The MFMD was evaluated and compared with well-known approaches in the literature, called MEME and Gibbs Motif Sampler, achieving a higher f-score in the most datasets used in this work.

Conclusions: We have developed an approach for detecting motifs in biopolymers sequences. MFMD is a freely available software that can be promising as an alternative to the development of new tools for de novo motif discovery. Its open-source software can be downloaded at https://github.com/jadermcg/mfmd .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751424PMC
http://dx.doi.org/10.1186/s12859-017-2005-1DOI Listing

Publication Analysis

Top Keywords

motif discovery
8
mfmd
5
sequence motif
4
motif finder
4
finder memetic
4
memetic algorithm
4
algorithm background
4
background novo
4
novo prediction
4
prediction transcription
4

Similar Publications

Bacteria and archaea acquire resistance to genetic parasites by preferentially integrating short fragments of foreign DNA at one end of a Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR). "Leader" DNA upstream of CRISPR loci regulates transcription and foreign DNA integration into the CRISPR. Here, we analyze 37,477 CRISPRs from 39,277 bacterial and 556 archaeal genomes to identify conserved sequence motifs in CRISPR leaders.

View Article and Find Full Text PDF

DNA Aptamer-Polymer Conjugates for Selective Targeting of Integrin α4β1 T-Lineage Cancers.

ACS Appl Mater Interfaces

January 2025

Department of Bioengineering, University of Washington, Seattle, Washington 98195, United States.

Selective therapeutic targeting of T-cell malignancies is difficult due to the shared lineage between healthy and malignant T cells. Current front-line chemotherapy for these cancers is largely nonspecific, resulting in frequent cases of relapsed/refractory disease. The development of targeting approaches for effectively treating T-cell leukemia and lymphoma thus remains a critical goal for the oncology field.

View Article and Find Full Text PDF

Insights on post-translational modifications in fatty liver and fibrosis progression.

Biochim Biophys Acta Mol Basis Dis

January 2025

Ion Channel Biology Laboratory, AU-KBC Research Centre, Madras Institute of Technology Campus, Anna University, Chrompet, Chennai 600 044, Tamil Nadu, India. Electronic address:

Metabolic dysfunction-associated steatotic liver disease [MASLD] is a pervasive multifactorial health burden. Post-translational modifications [PTMs] of amino acid residues in protein domains demonstrate pivotal roles for imparting dynamic alterations in the cellular micro milieu. The crux of identifying novel druggable targets relies on comprehensively studying the etiology of metabolic disorders.

View Article and Find Full Text PDF

MPS1 kinase is a dual specificity kinase that plays an important role in the spindle assembly checkpoint mechanism during cell division. Overexpression of MPS1 kinase is reported in several cancers. However, drug discovery and development efforts targeting MPS1 kinase did not result in any clinically successful candidates.

View Article and Find Full Text PDF

Indole, a ubiquitous structural motif in bioactive compounds, has played a pivotal role in drug discovery. Among indole derivatives, indole-3-carboxaldehyde (I3A) has emerged as a particularly promising scaffold for the development of therapeutic agents. This review delves into the recent advancements in the chemical modification of I3A and its derivatives, highlighting their potential applications in various therapeutic areas.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!