A protein fold classifier formed by fusing different modes of pseudo amino acid composition via PSSM.

Comput Biol Chem

Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, University of Tehran, Iran.

Published: February 2011

Protein function is related to its chemical reaction to the surrounding environment including other proteins. On the other hand, this depends on the spatial shape and tertiary structure of protein and folding of its constituent components in space. The correct identification of protein domain fold solely using extracted information from protein sequence is a complicated and controversial task in the current computational biology. In this article a combined classifier based on the information content of extracted features from the primary structure of protein has been introduced to face this challenging problem. In the first stage of our proposed two-tier architecture, there are several classifiers each of which is trained with a different sequence based feature vector. Apart from the application of the predicted secondary structure, hydrophobicity, van der Waals volume, polarity, polarizability, and different dimensions of pseudo-amino acid composition vectors in similar studies, the position specific scoring matrix (PSSM) has also been used to improve the correct classification rate (CCR) in this study. Using K-fold cross validation on training dataset related to 27 famous folds of SCOP, the 28 dimensional probability output vector from each evidence theoretic K-NN classifier is used to determine the information content or expertness of corresponding feature for discrimination in each fold class. In the second stage, the outputs of classifiers for test dataset are fused using Sugeno fuzzy integral operator to make better decision for target fold class. The expertness factor of each classifier in each fold class has been used to calculate the fuzzy integral operator weights. Results make it possible to provide deeper interpretation about the effectiveness of each feature for discrimination in target classes for query proteins.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2010.12.001DOI Listing

Publication Analysis

Top Keywords

fold class
12
acid composition
8
structure protein
8
feature discrimination
8
fuzzy integral
8
integral operator
8
protein
6
protein fold
4
classifier
4
fold classifier
4

Similar Publications

Drug Development.

Alzheimers Dement

December 2024

UCSD, San Diego, CA, USA.

Cerebral beta-amyloid accumulation is the key initiator of Alzheimer's disease (AD) pathology. Most familial early-onset AD mutations in the APP, PSEN1/2 genes increase the ratio of Abeta42:Abeta40, which drives beta-amyloid accumulation in the brain. In 2001, the late Steve Wagner, Maria Kounnas, and I directed an agnostic high-throughput screen for compounds that would reverse the Abeta42:Abeta40, ratio, and discovered the first non-NSAID (second generation) gamma secretase modulators (GSM) at TorreyPines Therapeutics.

View Article and Find Full Text PDF

Biopsy is considered the gold standard for diagnosing brain tumors, but its invasive nature can pose risks to patients. Additionally, tissue analysis can be cumbersome and inconsistent among observers. This research aims to develop a cost-effective, non-invasive, MRI-based computer-aided diagnosis tool that can reliably, accurately and swiftly identify brain tumor grades.

View Article and Find Full Text PDF

Background: One avenue to improve outcomes among brain tumor patients involves the mitigation of healthcare disparities. Investigating clinical differences among brain tumors across socioeconomic and demographic strata, such can aid in healthcare disparity identification and, by extension, outcome improvement.

Methods: Utilizing a racially diverse population from Hawaii, 323 cases of brain tumors (meningiomas, gliomas, schwannomas, pituitary adenomas, and metastases) were matched by age, sex, and race to 651 controls to investigate the associations between tumor type and various demographic, socioeconomic, and medical comorbidities.

View Article and Find Full Text PDF

Introduction: There is a high unmet need for safe and effective non-opioid medicines to treat moderate to severe pain without risk of addiction. Voltage-gated sodium channel 1.8 (Na1.

View Article and Find Full Text PDF

Electrocardiogram (ECG) signals contain complex and diverse features, serving as a crucial basis for arrhythmia diagnosis. The subtle differences in characteristics among various types of arrhythmias, coupled with class imbalance issues in datasets, often hinder existing models from effectively capturing key information within these complex signals, leading to a bias towards normal classes. To address these challenges, this paper proposes a method for arrhythmia classification based on a multi-branch, multi-head attention temporal convolutional network (MB-MHA-TCN).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!