Feature selection and combination criteria for improving accuracy in protein structure prediction.

IEEE Trans Nanobioscience

Department of Electrical and Control Engineering, National Chiao-Tung University, Hsin-chu, Taiwan and Computer Center of Chung Hua University, Hsin-chu, Taiwan.

Published: June 2007

Classifying protein structures is essential for determining protein function in bioinformatics. A reasonably high prediction accuracy has already been achieved in classifying proteins into the four classes of the SCOP database from their primary amino acid sequences. Further classification into fine-grained folding categories remains a challenge, however, especially when the number of possible folding patterns, such as those defined in the SCOP database, is large. In our previous work, we proposed a two-level classification strategy called hierarchical learning architecture (HLA), which uses neural networks and two indirect coding features to differentiate proteins according to their classes and folding patterns, and which achieved an accuracy rate of 65.5%. In this paper, we use a combinatorial fusion technique to facilitate feature selection and combination for improving predictive accuracy in protein structure classification. When various criteria in combinatorial fusion are applied to the protein fold prediction approach using neural networks with HLA and the radial basis function network (RBFN), the resulting classification has an overall prediction accuracy rate of 87% for four classes and 69.6% for 27 folding categories. These rates are significantly higher than the accuracy rate of 56.5% previously obtained by Ding and Dubchak. Our results demonstrate that data fusion is a viable method for feature selection and combination in the prediction and classification of protein structure.
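The abstract does not spell out the fusion criteria, so the following is only a minimal sketch of the general idea of combining two scoring systems (for example, two classifiers trained on different features) by score and by rank. The function names, the min-max normalization, the plain averaging, and the example scores are all assumptions made for illustration; they are not taken from the paper.

```python
from typing import Dict, List

# Hypothetical representation: class label -> score from one scoring system.
Scores = Dict[str, float]


def normalize(scores: Scores) -> Scores:
    """Min-max normalize scores to [0, 1] so systems are comparable (assumed step)."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {label: 0.5 for label in scores}
    return {label: (value - lo) / (hi - lo) for label, value in scores.items()}


def ranks(scores: Scores) -> Dict[str, int]:
    """Rank labels from 1 (highest score) downward; ties broken by label name."""
    ordered = sorted(scores, key=lambda label: (-scores[label], label))
    return {label: position + 1 for position, label in enumerate(ordered)}


def score_combination(systems: List[Scores]) -> Scores:
    """Average the normalized scores of all systems for each label."""
    normed = [normalize(s) for s in systems]
    return {label: sum(s[label] for s in normed) / len(normed) for label in systems[0]}


def rank_combination(systems: List[Scores]) -> Dict[str, float]:
    """Average the ranks of all systems for each label (lower is better)."""
    ranked = [ranks(s) for s in systems]
    return {label: sum(r[label] for r in ranked) / len(ranked) for label in systems[0]}


def predict_by_score(systems: List[Scores]) -> str:
    """Pick the label with the highest combined score."""
    combined = score_combination(systems)
    return min(combined, key=lambda label: (-combined[label], label))


def predict_by_rank(systems: List[Scores]) -> str:
    """Pick the label with the lowest combined rank."""
    combined = rank_combination(systems)
    return min(combined, key=lambda label: (combined[label], label))


if __name__ == "__main__":
    # Made-up scores for the four SCOP classes from two hypothetical systems.
    system_a = {"all-alpha": 0.9, "all-beta": 0.2, "alpha/beta": 0.6, "alpha+beta": 0.1}
    system_b = {"all-alpha": 0.3, "all-beta": 0.1, "alpha/beta": 0.8, "alpha+beta": 0.2}
    print(predict_by_score([system_a, system_b]))
    print(predict_by_rank([system_a, system_b]))
```

Score and rank combination can disagree on the same inputs, which is one reason a fusion approach might compare several criteria before choosing which features or systems to combine.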


Source
http://dx.doi.org/10.1109/tnb.2007.897482


Similar Publications

Frustrated Lewis pair (FLP) chemistry occupies a crucial position in nonmetal-mediated catalysis, especially in the activation of inert gas molecules. Yet one formidable issue with homogeneous FLP catalysts is their instability during storage and recycling. Here we contribute a general solution that combines polyhedral oligomeric silsesquioxane (POSS) with a structurally specific frustrated Lewis acid to fabricate porous polymer networks, which can form water-insensitive heterogeneous FLP catalysts when Lewis base substrates are employed.


Enhancing Activation Energy Predictions under Data Constraints Using Graph Neural Networks.

J Chem Inf Model

January 2025

Department of Chemical Engineering, National Taiwan University, No. 1, Section 4, Roosevelt Road, Taipei 10617, Taiwan.

Accurately predicting activation energies is crucial for understanding chemical reactions and modeling complex reaction systems. However, the high computational cost of quantum chemistry methods often limits the feasibility of large-scale studies, leading to a scarcity of high-quality activation energy data. In this work, we explore and compare three innovative approaches (transfer learning, delta learning, and feature engineering) to enhance the accuracy of activation energy predictions using graph neural networks, specifically focusing on methods that incorporate low-cost, low-level computational data.


Background: The Rex rabbit is famous for its silky, soft fur coat, a characteristic predominantly attributed to its hair follicles. Numerous studies have confirmed the crucial roles of mRNAs and non-coding RNAs (ncRNAs) in regulating key cellular processes such as cell proliferation, differentiation, apoptosis and immunity. However, their involvement in the regulation of the hair cycle in Rex rabbits remains unknown.


Optical techniques such as functional near-infrared spectroscopy (fNIRS) hold high potential for the development of non-invasive wearable systems for evaluating cerebral vascular condition in aging, owing to their portability and ability to monitor real-time changes in cerebral hemodynamics. In this study, thirty-six healthy adults were measured by single-channel fNIRS to explore differences between two age groups using machine learning (ML). The subjects, measured during functional magnetic resonance imaging (fMRI) at Oulu University Hospital, were divided into young (age ≤ 32) and elderly (age ≥ 57) groups.


Mechanical ventilation provides breathing support to patients who have difficulty breathing. During the pandemic, many people suffered from lung disorders, which increased the demand for mechanical ventilators. Mechanical ventilators must be operated with the assistance of trained professionals, and their use requires the selection of ideal parameters.

