Deep-WET: a deep learning-based approach for predicting DNA-binding proteins using word embedding techniques with weighted features.

Sci Rep

Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand.

Published: February 2024

DNA-binding proteins (DBPs) play a significant role in all phases of genetic processes, including DNA recombination, repair, and modification. They are often utilized in drug discovery as fundamental elements of steroids, antibiotics, and anticancer drugs. Predicting them poses the most challenging task in proteomics research. Conventional experimental methods for DBP identification are costly and sometimes biased toward prediction. Therefore, developing powerful computational methods that can accurately and rapidly identify DBPs from sequence information is an urgent need. In this study, we propose a novel deep learning-based method called Deep-WET to accurately identify DBPs from primary sequence information. In Deep-WET, we employed three powerful feature encoding schemes containing Global Vectors, Word2Vec, and fastText to encode the protein sequence. Subsequently, these three features were sequentially combined and weighted using the weights obtained from the elements learned through the differential evolution (DE) algorithm. To enhance the predictive performance of Deep-WET, we applied the SHapley Additive exPlanations approach to remove irrelevant features. Finally, the optimal feature subset was input into convolutional neural networks to construct the Deep-WET predictor. Both cross-validation and independent tests indicated that Deep-WET achieved superior predictive performance compared to conventional machine learning classifiers. In addition, in extensive independent test, Deep-WET was effective and outperformed than several state-of-the-art methods for DBP prediction, with accuracy of 78.08%, MCC of 0.559, and AUC of 0.805. This superior performance shows that Deep-WET has a tremendous predictive capacity to predict DBPs. The web server of Deep-WET and curated datasets in this study are available at https://deepwet-dna.monarcatechnical.com/ . The proposed Deep-WET is anticipated to serve the community-wide effort for large-scale identification of potential DBPs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10844231PMC
http://dx.doi.org/10.1038/s41598-024-52653-9DOI Listing

Publication Analysis

Top Keywords

deep-wet
10
deep learning-based
8
dna-binding proteins
8
methods dbp
8
identify dbps
8
predictive performance
8
performance deep-wet
8
dbps
5
deep-wet deep
4
learning-based approach
4

Similar Publications

DNA-binding proteins (DBPs) play a significant role in all phases of genetic processes, including DNA recombination, repair, and modification. They are often utilized in drug discovery as fundamental elements of steroids, antibiotics, and anticancer drugs. Predicting them poses the most challenging task in proteomics research.

View Article and Find Full Text PDF

Bovine respiratory disease (BRD) is a challenge in all housed farming systems that raise calves. Farm to farm variation in BRD prevalence can be partially attributed to variation in host immunity, pathogens and housing environment. Unlike host immunity and BRD pathogens, housing environment has not been well investigated.

View Article and Find Full Text PDF

Uncooled direct modulation DFB laser offers high speed transmission rate over a wide temperature range with high reliability and low cost, making it a cost-effective light source choice for 5G fronthaul and data center applications. However, a significant 3dB bandwidth decrease can be observed in high temperature for conventional DFB lasers. We present an uncooled DFB laser operating up to 85°C with extended direct modulation bandwidth and high reliability based on a novel groove-in-trench ridge waveguide structure, where two narrow grooves penetrating the active layer are etched symmetrically in the two conventional trenches by deep wet etching, respectively.

View Article and Find Full Text PDF

We investigate the interest of deep wet etching with HF/HNO or KOH solutions as a final step after polishing to improve fused silica optics laser damage resistance at the wavelength of 351 nm. This comparison is carried out on scratches engineered on high damage threshold polished fused silica optics. We evidence that both KOH and HF/HNO solutions are efficient to passivate scratches and thus improve their damage threshold up to the level of the polished surface.

View Article and Find Full Text PDF

Hydraulic lift promotes selective root foraging in nutrient-rich soil patches.

Funct Plant Biol

September 2012

Estación Experimental de Zonas Áridas, Consejo Superior de Investigaciones Científicas, Carretera de Sacramento s/n, E-04120 La Cañada de San Urbano, Almería, Spain.

Hydraulic lift (HL) - the passive movement of water through plant roots from deep wet to shallow drier soil layers - can improve root survival in dry soils by providing a source of moisture to shallow roots. It may also enhance plant nutrient capture, though empirical evidence for this is scarce and whether HL promotes the selective placement of roots in nutrient-rich soil enhancing nutrient capture in dry soils remains unknown. We tested this with a split-pot design in which we separated the root system of Retama sphaerocarpa (L.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!