AAontology: An Ontology of Amino Acid Scales for Interpretable Machine Learning.

J Mol Biol

Department of Bioinformatics, School of Life Sciences, Technical University of Munich, Freising, Germany. Electronic address:

Published: October 2024

Amino acid scales are crucial for protein prediction tasks, many of them being curated in the AAindex database. Despite various clustering attempts to organize them and to better understand their relationships, these approaches lack the fine-grained classification necessary for satisfactory interpretability in many protein prediction problems. To address this issue, we developed AAontology-a two-level classification for 586 amino acid scales (mainly from AAindex) together with an in-depth analysis of their relations-using bag-of-word-based classification, clustering, and manual refinement over multiple iterations. AAontology organizes physicochemical scales into 8 categories and 67 subcategories, enhancing the interpretability of scale-based machine learning methods in protein bioinformatics. Thereby it enables researchers to gain a deeper biological insight. We anticipate that AAontology will be a building block to link amino acid properties with protein function and dysfunctions as well as aid informed decision-making in mutation analysis or protein drug design.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jmb.2024.168717DOI Listing

Publication Analysis

Top Keywords

amino acid
16
acid scales
12
machine learning
8
protein prediction
8
protein
5
aaontology ontology
4
amino
4
ontology amino
4
acid
4
scales
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!