MLSNet: a deep learning model for predicting transcription factor binding sites.

Yuchuan Zhang Zhikang Wang Fang Ge Xiaoyu Wang Yiwen Zhang Shanshan Li Yuming Guo Jiangning Song Dong-Jun Yu

Brief Bioinform

School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China.

Published: September 2024

Accurate prediction of transcription factor binding sites (TFBSs) is crucial for understanding gene regulation and disease mechanisms, and recent advancements in deep learning have shown promise but still need improvement.
The study introduces MLSNet, a new deep learning framework that uses multisize convolutional fusion alongside LSTM networks to effectively analyze complex DNA sequences and enhance TFBS prediction accuracy.
MLSNet outperforms existing models in experimental tests on 165 ChIP-seq datasets, achieving higher average metrics across the board, and the source code is publicly available for further research and application.

Accurate prediction of transcription factor binding sites (TFBSs) is essential for understanding gene regulation mechanisms and the etiology of diseases. Despite numerous advances in deep learning for predicting TFBSs, their performance can still be enhanced. In this study, we propose MLSNet, a novel deep learning architecture designed specifically to predict TFBSs. MLSNet innovatively integrates multisize convolutional fusion with long short-term memory (LSTM) networks to effectively capture DNA-sparse higher-order sequence features. Further, MLSNet incorporates super token attention and Bi-LSTM to systematically extract and integrate higher-order DNA shape features. Experimental results on 165 ChIP-seq (chromatin immunoprecipitation followed by sequencing) datasets indicate that MLSNet consistently outperforms several state-of-the-art algorithms in the prediction of TFBSs. Specifically, MLSNet reports average metrics: 0.8306 for ACC, 0.8992 for AUROC, and 0.9035 for AUPRC, surpassing the second-best methods by 1.82%, 1.68%, and 1.54%, respectively. This research delineates the effectiveness of combining multi-size convolutional layers with LSTM and DNA shape-based features in enhancing predictive accuracy. Moreover, this study comprehensively assesses the variability in model performance across different cell lines and transcription factors. The source code of MLSNet is available at https://github.com/minghaidea/MLSNet.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11442149	PMC
http://dx.doi.org/10.1093/bib/bbae489	DOI Listing

Publication Analysis

Top Keywords

deep learning

transcription factor

factor binding

binding sites

tfbss mlsnet

mlsnet

mlsnet deep

learning model

model predicting

predicting transcription

Similar Publications

Integrating Interpretability in Machine Learning and Deep Neural Networks: A Novel Approach to Feature Importance and Outlier Detection in COVID-19 Symptomatology and Vaccine Efficacy.

Viruses

November 2024

Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.

Shadi Jacob Khoury Yazeed Zoabi Mickey Scheinowitz Noam Shomron

In this study, we introduce a novel approach that integrates interpretability techniques from both traditional machine learning (ML) and deep neural networks (DNN) to quantify feature importance using global and local interpretation methods. Our method bridges the gap between interpretable ML models and powerful deep learning (DL) architectures, providing comprehensive insights into the key drivers behind model predictions, especially in detecting outliers within medical data. We applied this method to analyze COVID-19 pandemic data from 2020, yielding intriguing insights.

View Article and Find Full Text PDF

Similar Publications

FP-YOLOv8: Surface Defect Detection Algorithm for Brake Pipe Ends Based on Improved YOLOv8n.

Sensors (Basel)

December 2024

School of Mechanical and Power Engineering, Zhengzhou University, Zhengzhou 450000, China.

Ke Rao Fengxia Zhao Tianyu Shi

To address the limitations of existing deep learning-based algorithms in detecting surface defects on brake pipe ends, a novel lightweight detection algorithm, FP-YOLOv8, is proposed. This algorithm is developed based on the YOLOv8n framework with the aim of improving accuracy and model lightweight design. First, the C2f_GhostV2 module has been designed to replace the original C2f module.

View Article and Find Full Text PDF

Similar Publications

Fusion of Visible and Infrared Aerial Images from Uncalibrated Sensors Using Wavelet Decomposition and Deep Learning.

Sensors (Basel)

December 2024

Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO 65211, USA.

Chandrakanth Vipparla Timothy Krock Koundinya Nouduri Joshua Fraser Hadi AliAkbarpour

Multi-modal systems extract information about the environment using specialized sensors that are optimized based on the wavelength of the phenomenology and material interactions. To maximize the entropy, complementary systems operating in regions of non-overlapping wavelengths are optimal. VIS-IR (Visible-Infrared) systems have been at the forefront of multi-modal fusion research and are used extensively to represent information in all-day all-weather applications.

View Article and Find Full Text PDF

Similar Publications

A Scene Knowledge Integrating Network for Transmission Line Multi-Fitting Detection.

Sensors (Basel)

December 2024

Automation Department, North China Electric Power University, Baoding 071003, China.

Xinhang Chen Xinsheng Xu Jing Xu Wenjie Zheng Qianming Wang

Aiming at the severe occlusion problem and the tiny-scale object problem in the multi-fitting detection task, the Scene Knowledge Integrating Network (SKIN), including the scene filter module (SFM) and scene structure information module (SSIM) is proposed. Firstly, the particularity of the scene in the multi-fitting detection task is analyzed. Hence, the aggregation of the fittings is defined as the scene according to the professional knowledge of the power field and the habit of the operators in identifying the fittings.

View Article and Find Full Text PDF

Similar Publications

A Systematic Review on the Advancements in Remote Sensing and Proximity Tools for Grapevine Disease Detection.

Sensors (Basel)

December 2024

Centre for the Research and Technology of Agro-Environmental and Biological Sciences, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal.

Fernando Portela Joaquim J Sousa Cláudio Araújo-Paredes Emanuel Peres Raul Morais

Grapevines ( L.) are one of the most economically relevant crops worldwide, yet they are highly vulnerable to various diseases, causing substantial economic losses for winegrowers. This systematic review evaluates the application of remote sensing and proximal tools for vineyard disease detection, addressing current capabilities, gaps, and future directions in sensor-based field monitoring of grapevine diseases.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!