Transcription factors (TFs) bind DNA by recognizing specific sequence motifs, typically of length 6-12bp. A motif can occur many thousands of times in the human genome, but only a subset of those sites are actually bound. Here we present a machine learning framework leveraging existing convolutional neural network architectures and model interpretation techniques to identify and interpret sequence context features most important for predicting whether a particular motif instance will be bound. We apply our framework to predict binding at motifs for 38 TFs in a lymphoblastoid cell line, score the importance of context sequences at base-pair resolution, and characterize context features most predictive of binding. We find that the choice of training data heavily influences classification accuracy and the relative importance of features such as open chromatin. Overall, our framework enables novel insights into features predictive of TF binding and is likely to inform future deep learning applications to interpret non-coding genetic variants.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009085PMC
http://dx.doi.org/10.1038/s42256-020-00282-yDOI Listing

Publication Analysis

Top Keywords

context features
12
features predictive
12
sequence context
8
predictive binding
8
features
5
deep neural
4
neural networks
4
networks identify
4
identify sequence
4
context
4

Similar Publications

Host RNA-Binding Proteins as Regulators of HIV-1 Replication.

Viruses

December 2024

Laboratory of Molecular and Cellular Virology, Institute of Biomedical Sciences, Faculty of Medicine, Universidad de Chile, Santiago 8380453, Chile.

RNA-binding proteins (RBPs) are cellular factors involved in every step of RNA metabolism. During HIV-1 infection, these proteins are key players in the fine-tuning of viral and host cellular and molecular pathways, including (but not limited to) viral entry, transcription, splicing, RNA modification, translation, decay, assembly, and packaging, as well as the modulation of the antiviral response. Targeted studies have been of paramount importance in identifying and understanding the role of RNA-binding proteins that bind to HIV-1 RNAs.

View Article and Find Full Text PDF

Background: Food image recognition, a crucial step in computational gastronomy, has diverse applications across nutritional platforms. Convolutional neural networks (CNNs) are widely used for this task due to their ability to capture hierarchical features. However, they struggle with long-range dependencies and global feature extraction, which are vital in distinguishing visually similar foods or images where the context of the whole dish is crucial, thus necessitating transformer architecture.

View Article and Find Full Text PDF

Bio-Microcapsules of Polybutylene Succinate (PBS) and Isocyanates: Towards Sustainable, Safer, and Efficient Adhesives.

Polymers (Basel)

January 2025

CERENA-Centro de Recursos Naturais e Ambiente, Department of Chemical Engineering (DEQ), Instituto Superior Técnico, Universidade de Lisboa, Avenida Rovisco Pais, 1049-001 Lisboa, Portugal.

This work describes the encapsulation of three different aliphatic isocyanates to reduce the risks associated with isocyanates' direct handling. The use of bio-based polybutylene succinate (bio-PBS) increases the sustainability factor as it allows for the use of microcapsules (MCs) from renewable sources with biodegradable features. The three different MCs (MCs-Monomer, MCs-Trimer, and MCs-Polymer) are spherical, crack-free, and matrix-type, containing an isocyanate payload between 67 wt% and 70 wt%.

View Article and Find Full Text PDF

Depression Recognition Using Daily Wearable-Derived Physiological Data.

Sensors (Basel)

January 2025

Department of Psychological and Cognitive Sciences, Tsinghua University, Beijing 100084, China.

The objective identification of depression using physiological data has emerged as a significant research focus within the field of psychiatry. The advancement of wearable physiological measurement devices has opened new avenues for the identification of individuals with depression in everyday-life contexts. Compared to other objective measurement methods, wearables offer the potential for continuous, unobtrusive monitoring, which can capture subtle physiological changes indicative of depressive states.

View Article and Find Full Text PDF

A Deep Learning Approach for Mental Fatigue State Assessment.

Sensors (Basel)

January 2025

Institute of Artificial Intelligence in Sports, Capital University of Physical Education and Sports, Beijing 100191, China.

This study investigates mental fatigue in sports activities by leveraging deep learning techniques, deviating from the conventional use of heart rate variability (HRV) feature analysis found in previous research. The study utilizes a hybrid deep neural network model, which integrates Residual Networks (ResNet) and Bidirectional Long Short-Term Memory (Bi-LSTM) for feature extraction, and a transformer for feature fusion. The model achieves an impressive accuracy of 95.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!