DP-Bind: a web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins.

Bioinformatics

Gen*NY*Sis Center for Excellence in Cancer Genomics, Department of Epidemiology and Biostatistics, One Discovery Drive, University at Albany, Rensselaer, NY 12144, USA.

Published: March 2007

Unlabelled: This article describes DP-Bind, a web server for predicting DNA-binding sites in a DNA-binding protein from its amino acid sequence. The web server implements three machine learning methods: support vector machine, kernel logistic regression and penalized logistic regression. Prediction can be performed using either the input sequence alone or an automatically generated profile of evolutionary conservation of the input sequence in the form of PSI-BLAST position-specific scoring matrix (PSSM). PSSM-based kernel logistic regression achieves the accuracy of 77.2%, sensitivity of 76.4% and specificity of 76.6%. The outputs of all three individual methods are combined into a consensus prediction to help identify positions predicted with high level of confidence.

Availability: Freely available at http://lcg.rit.albany.edu/dp-bind.

Supplementary Information: http://lcg.rit.albany.edu/dp-bind/dpbind_supplement.html.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btl672DOI Listing

Publication Analysis

Top Keywords

web server
12
logistic regression
12
dp-bind web
8
kernel logistic
8
input sequence
8
server sequence-based
4
sequence-based prediction
4
dna-binding
4
prediction dna-binding
4
dna-binding residues
4

Similar Publications

Introduction: Alzheimer's disease (AD) is a leading cause of dementia, characterized by progressive neurodegeneration and cognitive dysfunction. The disease aetiology is closely associated with proteinopathies, mitochondrial abnormalities, and elevated ROS generation, which are some of the primary markers for AD brains.

Objectives: The current research was intended to elucidate the chemical interaction of β-pinene against potential targets and evaluate its neuroprotective potential in ICV-STZ-induced sAD.

View Article and Find Full Text PDF

MAI-TargetFisher: A proteome-wide drug target prediction method synergetically enhanced by artificial intelligence and physical modeling.

Acta Pharmacol Sin

January 2025

Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China.

Computational target identification plays a pivotal role in the drug development process. With the significant advancements of deep learning methods for protein structure prediction, the structural coverage of human proteome has increased substantially. This progress inspired the development of the first genome-wide small molecule targets scanning method.

View Article and Find Full Text PDF

The G2PDeep-v2 server is a web-based platform powered by deep learning, for phenotype prediction and markers discovery from multi-omics data in any organisms including humans, plants, animals, and viruses. The server provides multiple services for researchers to create deep-learning models through an interactive interface and train these models using an automated hyperparameter tuning algorithm on high-performance computing resources. Users can visualize the results of phenotype and markers predictions and perform Gene Set Enrichment Analysis for the significant markers to provide insights into the molecular mechanisms underlying complex diseases, conditions and other biological phenotypes being studied.

View Article and Find Full Text PDF

HemaScope: A Tool for Analyzing Single-cell and Spatial Transcriptomics Data of Hematopoietic Cells.

Genomics Proteomics Bioinformatics

January 2025

Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Research Unit of Hematologic Malignancies Genomics and Translational Research of Chinese Academy of Medical Sciences, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.

Single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) techniques hold great value in evaluating the heterogeneity and spatial characteristics of hematopoietic cells within tissues. These two techniques are highly complementary, with scRNA-seq offering single-cell resolution and ST retaining spatial information. However, there is an urgent demand for well-organized and user-friendly toolkits capable of handling single-cell and spatial information.

View Article and Find Full Text PDF

The ARID1A gene, frequently mutated in cancer, encodes the AT-rich interactive domain-containing protein 1 A, a key component of the chromatin remodeling SWI/SNF complex. The ARID1A protein features a conserved DNA-binding domain (ARID domain) of approximately 100 residues crucial for its function. Despite the frequency of mutations, the impact on ARID1A's stability and contribution to cancer progression remains unclear.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!