Conotoxin Prediction: New Features to Increase Prediction Accuracy.

Toxins (Basel)

Bioscience Division, MS M888, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.

Published: November 2023

Conotoxins are toxic, disulfide-bond-rich peptides from cone snail venom that target a wide range of receptors and ion channels with multiple pathophysiological effects. Conotoxins have extraordinary potential for medical therapeutics that include cancer, microbial infections, epilepsy, autoimmune diseases, neurological conditions, and cardiovascular disorders. Despite the potential for these compounds in novel therapeutic treatment development, the process of identifying and characterizing the toxicities of conotoxins is difficult, costly, and time-consuming. This challenge requires a series of diverse, complex, and labor-intensive biological, toxicological, and analytical techniques for effective characterization. While recent attempts, using machine learning based solely on primary amino acid sequences to predict biological toxins (e.g., conotoxins and animal venoms), have improved toxin identification, these methods are limited due to peptide conformational flexibility and the high frequency of cysteines present in toxin sequences. This results in an enumerable set of disulfide-bridged foldamers with different conformations of the same primary amino acid sequence that affect function and toxicity levels. Consequently, a given peptide may be toxic when its cysteine residues form a particular disulfide-bond pattern, while alternative bonding patterns (isoforms) or its reduced form (free cysteines with no disulfide bridges) may have little or no toxicological effects. Similarly, the same disulfide-bond pattern may be possible for other peptide sequences and result in different conformations that all exhibit varying toxicities to the same receptor or to different receptors. We present here new features, when combined with primary sequence features to train machine learning algorithms to predict conotoxins, that significantly increase prediction accuracy.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10675404PMC
http://dx.doi.org/10.3390/toxins15110641DOI Listing

Publication Analysis

Top Keywords

increase prediction
8
prediction accuracy
8
machine learning
8
primary amino
8
amino acid
8
disulfide-bond pattern
8
conotoxins
5
conotoxin prediction
4
prediction features
4
features increase
4

Similar Publications

Introduction: In inflammatory bowel disease (IBD), the number of eosinophils increases in the lamina propria of the intestinal tract, but their specific patho-mechanistic role remains unclear. Elevated blood eosinophil counts in active IBD suggest their potential as biomarkers for predicting response to biologic therapies. This study evaluates blood eosinophil count trends and their predictive value for clinical response and endoscopic improvement in patients with IBD receiving ustekinumab or adalimumab induction therapy.

View Article and Find Full Text PDF

Genome-Wide Association Study and Genomic Predictions for Hydroxycinnamate Concentrations in Maize Stover.

J Agric Food Chem

January 2025

UA MBG-UVIGO, Misión Biológica de Galicia (CSIC), Pazo de Salcedo, Pontevedra 36143, España.

Hydroxycinnamates, like ferulate (FA) and -coumarate (CA), are important components of maize cell walls, which influence pest resistance, ruminal digestibility, and biofuel production. Increasing their concentration has been linked to increased pest resistance, but also may lead to a decrease in nutritional value or bioethanol production efficiency. Therefore, improving forage quality or biofuel production without compromising plant resistance and a thorough understanding of the biosynthesis and deposition of these compounds is necessary, especially in stover, which is the feedstock for second-generation biofuel production and determines animal forage quality.

View Article and Find Full Text PDF

Background: Digital biomarkers are increasingly used in clinical decision support for various health conditions. Speech features as digital biomarkers can offer insights into underlying physiological processes due to the complexity of speech production. This process involves respiration, phonation, articulation, and resonance, all of which rely on specific motor systems for the preparation and execution of speech.

View Article and Find Full Text PDF

PHIStruct: Improving phage-host interaction prediction at low sequence similarity settings using structure-aware protein embeddings.

Bioinformatics

January 2025

Bioinformatics Lab, Advanced Research Institute for Informatics, Computing and Networking, De La Salle University, Manila, 1004, Philippines.

Motivation: Recent computational approaches for predicting phage-host interaction have explored the use of sequence-only protein language models to produce embeddings of phage proteins without manual feature engineering. However, these embeddings do not directly capture protein structure information and structure-informed signals related to host specificity.

Results: We present PHIStruct, a multilayer perceptron that takes in structure-aware embeddings of receptor-binding proteins, generated via the structure-aware protein language model SaProt, and then predicts the host from among the ESKAPEE genera.

View Article and Find Full Text PDF

Background: Computed tomography (CT)-derived low muscle mass is associated with adverse outcomes in critically ill patients. Muscle ultrasound is a promising strategy for quantitating muscle mass. We evaluated the association between baseline ultrasound rectus femoris cross-sectional area (RF-CSA) and intensive care unit (ICU) mortality.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!