Understanding the function of protein is conducive to research in advanced fields such as gene therapy of diseases, the development and design of new drugs, etc. The prerequisite for understanding the function of a protein is to determine its tertiary structure. The realization of protein structure classification is indispensable for this problem and fold recognition is a commonly used method of protein structure classification. Protein sequences of 40% identity in the ASTRAL protein classification database are used for fold recognition research in current work to predict 27 folding types which mostly belong to four protein structural classes: α, β, α/β and α + β. We extract features from primary structure of protein using methods covering DSSP, PSSM and HMM which are based on secondary structure and evolutionary information to convert protein sequences into feature vectors that can be recognized by machine learning algorithm and utilize the combination of LightGBM feature selection algorithm and incremental feature selection method (IFS) to find the optimal classifiers respectively constructed by machine learning algorithms on the basis of tree structure including Random Forest, XGBoost and LightGBM. Bayesian optimization method is used for hyper-parameter adjustment of machine learning algorithms to make the accuracy of fold recognition reach as high as 93.45% at last. The result obtained by the model we propose is outstanding in the study of protein fold recognition.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2021.107456DOI Listing

Publication Analysis

Top Keywords

fold recognition
20
machine learning
16
learning algorithms
12
protein
10
protein fold
8
based secondary
8
secondary structure
8
structure evolutionary
8
understanding function
8
function protein
8

Similar Publications

Programmable DNA Nanoswitch-Regulated Plasmonic CRISPR/Cas12a-Gold Nanostars Reporter Platform for Nucleic Acid and Non-Nucleic Acid Biomarker Analysis Assisted by a Spatial Confinement Effect.

Nano Lett

January 2025

Key Laboratory of Optic-electric Sensing and Analytical Chemistry for Life Science, MOE; College of Chemistry and Molecular Engineering, Qingdao University of Science and Technology, Qingdao 266042, P. R. China.

CRISPR/Cas 12a system based nucleic acid and non-nucleic acid targets detection faces two challenges including (1) multiple crRNAs are needed for multiple biomarkers detection and (2) insufficient sensitivity resulted from photobleaching of fluorescent dyes and the low kinetic cleavage rate for a traditional single-strand (ssDNA) reporter. To address these limitations, we developed a programmable DNA nanoswitch (NS)-regulated plasmonic CRISPR/Cas12a-gold nanostars (Au NSTs) reporter platform for detection of nucleic acid and non-nucleic acid biomarkers with the assistance of the spatial confinement effect. Through simply programming the target recognition sequence in NS, only one crRNA is required to detect both nucleic acid and non-nucleic acid biomarkers.

View Article and Find Full Text PDF

In order to promote the digital dissemination and preservation of Chinese intangible cultural heritage, this work constructs a digital platform for its transmission. The platform integrates a range of advanced technologies, including the Densely Connected Convolutional Networks - Bottleneck and Compression model, a notable convolutional neural network, along with natural language processing algorithms, generative adversarial network algorithms, and neural collaborative filtering algorithms. The platform is validated with 224,055 publicly archived valid data records, ensuring its effectiveness and reliability.

View Article and Find Full Text PDF

Dye-sensitized upconversion nanoprobes with ultra-high signal-to-background ratio for visual and sensitive detection of nerve agent mimics.

Mikrochim Acta

January 2025

Hunan Provincial Key Laboratory of Micro & Nano Materials Interface Science, College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China.

An exciting upconversion nanoprobe conditioning strategy is proposed to improve the signal-to-background ratio (SBR) through a dye-sensitized strategy, in which the dye functions both as a recognition unit of the detection target and as a sensitizer to amplify the visible luminescence of the lanthanide-doped upconversion nanoparticles (UCNPs), instead of a quencher. The application of this dye-sensitized upconversion nanoprobe to the visual detection of nerve agent mimics diethoxy phosphatidylcholine (DCP) showed excellent detection performance, with up to 110-fold enhancement of the luminescence response of the probe in DCP solution and a detection limit as low as 2 nM. Finally, we performed visual detection of DCP solution and vapor by using test strips containing the probe.

View Article and Find Full Text PDF

Visible light-responsive enrofloxacin PEC aptasensor based on CN QDs sensitized BiOBr nanosheets.

Anal Chim Acta

February 2025

School of Chemistry and Chemical Engineering, Jiangsu University, Zhenjiang, 212013, PR China; Key Laboratory of Optic-Electric Sensing and Analytical Chemistry for Life Science, MOE, College of Chemistry and Molecular Engineering, Qingdao University of Science and Technology, Qingdao, 266042, PR China. Electronic address:

Background: The excessive application of enrofloxacin (ENR) results in residues contaminating both food and the environment. Consequently, developing robust analytical methods for the selective detection of ENR is crucial. The photoelectrochemical (PEC) sensor has emerged as a highly sensitive analytical technique that has seen rapid development in recent years.

View Article and Find Full Text PDF

Diabetic polyneuropathy is the common neuropathy of diabetes. However, several inflammatory neuropathies may occur during diabetes. Chronic inflammatory demyelinating polyradiculoneuropathy (CIDP) represents the most treatable example.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!