A Novel Computational Machine Learning Pipeline to Quantify Similarities in Three-Dimensional Protein Structures.

Toxicol Sci

Janssen Research & Development, LLC, La Jolla, California.

Published: January 2025

Animal models are widely used during drug development. The selection of suitable animal model relies on various factors such as target biology, animal resource availability and legacy species. It is imperative that the selected animal species exhibit the highest resemblance to human, in terms of target biology as well as the similarity in the target protein. The current practice to address cross-species protein similarity relies on pair-wise sequence comparison using protein sequences, instead of the biologically relevant 3-dimensional (3D) structure of proteins. We developed a novel quantitative machine learning pipeline using 3D structure-based feature data from the Protein Data Bank, nominal data from UNIPROT and bioactivity data from ChEMBL, all of which were matched for human and animal data. Using the XGBoost regression model, similarity scores between targets were calculated and based on these scores, the best animal species for a target was identified. For real-world application, targets from an alternative source, ie, AlphaFold, were tested using the model, and the animal species that had the most similar protein to the human counterparts were predicted. These targets were then grouped based on their associated phenotype such that the pipeline could predict an optimal animal species.

Download full-text PDF

Source
http://dx.doi.org/10.1093/toxsci/kfaf007DOI Listing

Publication Analysis

Top Keywords

animal species
16
machine learning
8
learning pipeline
8
animal
8
target biology
8
protein
6
species
5
data
5
novel computational
4
computational machine
4

Similar Publications

Planiliza haematocheilus, a teleostan species noted for its ecological adaptability and economic significance, thrives in both freshwater and marine environments. This study presents a novel chromosome-level genome assembly through Hi-C, PacBio CCS, and Illumina sequencing methods. The assembled genome has a final size of 651.

View Article and Find Full Text PDF

sp. nov., isolated from the faecal sample of a zoo animal, .

Int J Syst Evol Microbiol

January 2025

Laboratory of Molecular Environmental Microbiology, Department of Environmental Science and Ecological Engineering, Korea University, Seoul 02841, Republic of Korea.

Strain NoAH (=KACC 23135=JCM 35999), a novel Gram-negative, motile bacterium with a rod-shaped morphology, was isolated from the zoo animal faecal samples, specifically the long-tailed goral species . The novel bacterial strain grew optimally in a nutrient broth medium under the following conditions: 1-2% (w/v) NaCl, pH 7-8 and 30 °C. The strain NoAH exhibited high tolerance to NaCl, with the ability to tolerate up to 7% (w/v) NaCl.

View Article and Find Full Text PDF

Meiosis is generally a fair process: each chromosome has a 50% chance of being included into each gamete. However, meiosis can become aberrant with some chromosomes having a higher chance of making it into gametes than others. Yet, why and how such systems evolve remains unclear.

View Article and Find Full Text PDF

Hemolytic anemia (HA) is characterized by massive destruction of red blood cells (RBCs) and insufficient oxygen supply, which can lead to shock, organ failure, even death. Recent studies have preliminarily demonstrated the therapeutic effectiveness of whole blood exchange (WBE) in the management of acute hemolytic anemia and exhibited potential for reducing the duration of corticosteroid treatment, while the underlying mechanism of WBE therapy was not investigated in preclinical study. Hence, we investigate the therapeutic mechanisms of WBE in HA through established continued WBE therapy in rats creatively.

View Article and Find Full Text PDF

Adipokines regulate the development and progression of MASLD through organellar oxidative stress.

Hepatol Commun

February 2025

Central laboratory, Endocrine and Metabolic Diseases Hospital of Shandong First Medical University, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, Shandong, China.

The prevalence of metabolic dysfunction-associated steatotic liver disease (MASLD), which is increasingly being recognized as a leading cause of chronic liver pathology globally, is increasing. The pathophysiological underpinnings of its progression, which is currently under active investigation, involve oxidative stress. Human adipose tissue, an integral endocrine organ, secretes an array of adipokines that are modulated by dietary patterns and lifestyle choices.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!