As the first step of machine-learning based protein structure and function prediction, the amino acid encoding play a fundamental role in the final success of those methods. Different from the protein sequence encoding, the amino acid encoding can be used in both residue-level and sequence-level prediction of protein properties by combining them with different algorithms. However, it has not attracted enough attention in the past decades, and there are no comprehensive reviews and assessments about encoding methods so far. In this article, we make a systematic classification and propose a comprehensive review and assessment for various amino acid encoding methods. Those methods are grouped into five categories according to their information sources and information extraction methodologies, including binary encoding, physicochemical properties encoding, evolution-based encoding, structure-based encoding, and machine-learning encoding. Then, 16 representative methods from five categories are selected and compared on protein secondary structure prediction and protein fold recognition tasks by using large-scale benchmark datasets. The results show that the evolution-based position-dependent encoding method PSSM achieved the best performance, and the structure-based and machine-learning encoding methods also show some potential for further application, the neural network based distributed representation of amino acids in particular may bring new light to this area. We hope that the review and assessment are useful for future studies in amino acid encoding.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2019.2911677DOI Listing

Publication Analysis

Top Keywords

amino acid
20
acid encoding
20
encoding methods
16
encoding
14
review assessment
12
methods protein
8
comprehensive review
8
prediction protein
8
machine-learning encoding
8
methods
7

Similar Publications

For the first time, a separate Czech guideline focuses exclusively on hepatitis D virus (HDV) infection. Until recently, HDV infection was only mentioned in guidelines concerning hepatitis B virus (HBV) infection, in chapters on HBV/HDV co-infection. The guideline is based on the July 2023 recommendations from the European Association for the Study of the Liver.

View Article and Find Full Text PDF

Background: Post-inflammatory hyperpigmentation (PIH) is a common cosmetic concern, often leading to significant psychological distress for the patients. With the widespread application of lasers including ablative fractional resurfacing (AFR) with a 10,600 nm CO laser, PIH caused by lasers is becoming increasingly common. But due to the absence of an appropriate animal research model, our understanding of pathophysiological mechanisms and preventive strategies for PIH remains limited.

View Article and Find Full Text PDF

Characterization and design of dipeptide media formulation for scalable therapeutic production.

Appl Microbiol Biotechnol

January 2025

School of Chemical Engineering, Sungkyunkwan University, 2066 Seobu-Ro, Jangan-GuGyeonggi-Do 16419, Suwon-Si, South Korea.

Process intensification and simplification in biopharmaceutical manufacturing have driven the exploration of advanced feeding strategies to improve culture performance and process consistency. Conventional media design strategies, however, are often constrained by the stability and solubility challenges of amino acids, particularly in large-scale applications. As a result, dipeptides have emerged as promising alternatives.

View Article and Find Full Text PDF

VirDetect-AI: a residual and convolutional neural network-based metagenomic tool for eukaryotic viral protein identification.

Brief Bioinform

November 2024

Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos 62210, México.

This study addresses the challenging task of identifying viruses within metagenomic data, which encompasses a broad array of biological samples, including animal reservoirs, environmental sources, and the human body. Traditional methods for virus identification often face limitations due to the diversity and rapid evolution of viral genomes. In response, recent efforts have focused on leveraging artificial intelligence (AI) techniques to enhance accuracy and efficiency in virus detection.

View Article and Find Full Text PDF

Norvaline is a nonproteinogenic amino acid and an important food ingredient supplement for healthy food. In this study, dl-norvaline administration reduced body weight by more than 40% and improved glucose metabolism and energy metabolism in obese mice induced by a high-fat diet (HFD). Combination analysis of microbiome and metabolomics showed that dl-norvaline supplementation regulated gut bacteria structure, such as increasing beneficial bacteria (, , , , , , , and ) and decreasing harmful bacteria (, , , , , and ) and modulated the metabolites involved in arachidonic acid metabolism, thus further promoting short-chain fatty acid production and improving gut barrier, thereby inflammatory responses and oxidative stress were ameliorated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!