Set cover-based methods for motif selection.

Bioinformatics

Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA.

Published: February 2020

Motivation: De novo motif discovery algorithms find statistically over-represented sequence motifs that may function as transcription factor binding sites. Current methods often report large numbers of motifs, making it difficult to perform further analyses and experimental validation. The motif selection problem seeks to identify a minimal set of putative regulatory motifs that characterize sequences of interest (e.g. ChIP-Seq binding regions).
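As an illustrative sketch (not necessarily the exact formulation used in the study), the selection problem can be viewed as a classic set cover instance. The symbols below are assumptions introduced here: $P$ is the set of sequences of interest (e.g. peaks), $M$ is the set of candidate motifs, $S_j \subseteq P$ is the set of sequences containing motif $j$, and $x_j$ indicates whether motif $j$ is selected.

```latex
\begin{align}
  \min_{x} \quad & \sum_{j \in M} x_j \\
  \text{s.t.} \quad & \sum_{j \,:\, i \in S_j} x_j \ge 1 \qquad \forall i \in P, \\
  & x_j \in \{0, 1\} \qquad \forall j \in M.
\end{align}
```

Replacing the binary constraint with $0 \le x_j \le 1$ gives the standard linear programming relaxation of set cover. Variants of the problem may additionally penalize coverage of background sequences or require only partial coverage of $P$; those details are not specified here.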

Results: In this study, the motif selection problem is mapped to variants of the set cover problem that are solved via tabu search and by relaxed integer linear programming (RILP). The algorithms are employed to analyze 349 ChIP-Seq experiments from the ENCODE project, yielding a small number of high-quality motifs that represent putative binding sites of primary factors and cofactors. Specifically, when compared with the motifs reported by Kheradpour and Kellis, the set cover-based algorithms produced motif sets covering 35% more peaks for 11 TFs and identified 4 more putative cofactors for 6 TFs. Moreover, a systematic evaluation using nested cross-validation revealed that the RILP algorithm selected fewer motifs and was able to cover 6% more peaks and 3% fewer background regions, which reduced the error rate by 7%.
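To make the set cover mapping concrete, the following minimal Python sketch selects motifs with the textbook greedy heuristic. This is a swapped-in technique for illustration only: it is neither the tabu search nor the RILP algorithm evaluated in the study. The function name `greedy_motif_cover`, the `min_coverage` parameter, and the toy motif and peak identifiers are all hypothetical.

```python
import math
from typing import Dict, List, Set


def greedy_motif_cover(
    motif_hits: Dict[str, Set[str]],
    peaks: Set[str],
    min_coverage: float = 1.0,
) -> List[str]:
    """Greedily pick motifs until at least `min_coverage` of `peaks` is covered.

    `motif_hits` maps a motif name to the set of peak IDs that contain it.
    (Hypothetical interface, chosen only for this illustration.)
    """
    uncovered = set(peaks)
    needed = math.ceil(min_coverage * len(peaks))  # number of peaks to cover
    selected: List[str] = []
    while len(peaks) - len(uncovered) < needed:
        # Pick the motif hitting the most still-uncovered peaks;
        # sorting the names makes tie-breaking deterministic.
        best = max(
            sorted(motif_hits),
            key=lambda m: len(motif_hits[m] & uncovered),
            default=None,
        )
        if best is None or not (motif_hits[best] & uncovered):
            break  # no remaining motif covers any uncovered peak
        selected.append(best)
        uncovered -= motif_hits[best]
    return selected


# Toy data (made up): four candidate motifs over five peaks.
toy_hits = {
    "motif_A": {"p1", "p2", "p3"},
    "motif_B": {"p3", "p4"},
    "motif_C": {"p4", "p5"},
    "motif_D": {"p1"},
}
toy_peaks = {"p1", "p2", "p3", "p4", "p5"}

selected_motifs = greedy_motif_cover(toy_hits, toy_peaks)
print(selected_motifs)  # ['motif_A', 'motif_C']
```

The greedy heuristic is shown because it is short and easy to verify by hand. It ignores background regions entirely, whereas the evaluation described above also measures how many background regions a selected motif set covers.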

Availability and Implementation: The source code of the algorithms and all the datasets are available at https://github.com/YichaoOU/Set_cover_tools.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703758
DOI: http://dx.doi.org/10.1093/bioinformatics/btz697