The sequence of amino acids as the basis for the model of biological activity of peptides.

Theor Chem Acc

Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milan, Italy.

Published: January 2021

An algorithm is suggested for building a model of the biological activity of peptides as a mathematical function of their amino acid sequence. The general scheme is as follows. The total set of available data is distributed into an active training set, a passive training set, a calibration set, and a validation set. The training sets (both active and passive) and the calibration set together serve as the system that generates the model, in which each amino acid obtains a special correlation weight. The numerical values of the correlation weights are calculated by the Monte Carlo method using the CORAL software (http://www.insilico.eu/coral). The target function is aimed at giving the best result for the calibration set (not for the training set). The final check of the model is carried out with data from the validation set (peptides that are not visible during the creation of the model). The described computational experiments confirm that the approach can serve as a tool for designing predictive models of the biological activity of peptides (expressed as pIC50).
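
The scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the CORAL implementation, and several details are assumptions made only for illustration: the descriptor is taken to be the plain sum of per-residue correlation weights, pIC50 is modelled as a linear function of that descriptor, the active and passive training sets are collapsed into a single training set, and the target function is simply the calibration-set RMSE. All function names (dcw, fit_line, optimize, make_synthetic) are hypothetical, and the peptides and activities are synthetic values generated inside the script, not data from the paper.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dcw(seq, cw):
    """Descriptor (assumed form): sum of the correlation weights of the residues."""
    return sum(cw[aa] for aa in seq)


def fit_line(xs, ys):
    """Ordinary least-squares fit y = c0 + c1 * x; returns (c0, c1)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0.0:
        return my, 0.0
    c1 = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - c1 * mx, c1


def rmse(data, cw, c0, c1):
    """Root-mean-square error of the linear model on (sequence, activity) pairs."""
    return (sum((c0 + c1 * dcw(s, cw) - y) ** 2 for s, y in data) / len(data)) ** 0.5


def calibration_rmse(cw, train, calib):
    """Fit the line on the training set, then score it on the calibration set."""
    c0, c1 = fit_line([dcw(s, cw) for s, y in train], [y for s, y in train])
    return rmse(calib, cw, c0, c1), c0, c1


def optimize(train, calib, steps=2000, seed=0):
    """Greedy Monte Carlo search: perturb one weight, keep it only if the
    calibration-set error decreases (assumed target function)."""
    rng = random.Random(seed)
    cw = {aa: rng.uniform(-1.0, 1.0) for aa in AMINO_ACIDS}
    start, _, _ = calibration_rmse(cw, train, calib)
    best = start
    for step in range(steps):
        aa = rng.choice(AMINO_ACIDS)
        old = cw[aa]
        cw[aa] = old + rng.gauss(0.0, 0.1)
        err, _, _ = calibration_rmse(cw, train, calib)
        if err < best:
            best = err
        else:
            cw[aa] = old  # reject the move
    return cw, start, best


def make_synthetic(n, seed=1):
    """Synthetic peptides with activities built from hidden weights (illustration only)."""
    rng = random.Random(seed)
    hidden = {aa: rng.uniform(-0.5, 0.5) for aa in AMINO_ACIDS}
    data = []
    for i in range(n):
        seq = "".join(rng.choice(AMINO_ACIDS) for k in range(rng.randint(4, 10)))
        data.append((seq, 5.0 + dcw(seq, hidden) + rng.gauss(0.0, 0.1)))
    return data


data = make_synthetic(90)
train, calib, valid = data[:50], data[50:70], data[70:]
cw, start, best = optimize(train, calib)
_, c0, c1 = calibration_rmse(cw, train, calib)
print(f"calibration RMSE: {start:.3f} -> {best:.3f}")
# The validation peptides are never seen during optimization.
print(f"validation RMSE:  {rmse(valid, cw, c0, c1):.3f}")
```

Because a perturbation is accepted only when the calibration error decreases, the calibration RMSE cannot get worse during the search. The validation RMSE is computed once at the end, mirroring the role of the validation set in the abstract.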

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820519
DOI: http://dx.doi.org/10.1007/s00214-020-02707-8
