Retroviral infections, such as HIV, are, until now, diseases with no cure. Medicine and pharmaceutical chemistry need and consider it a huge goal to define target proteins of new antiretroviral compounds. ChEMBL manages Big Data features with a complex data set, which is hard to organize. This makes information difficult to analyze due to a big number of characteristics described in order to predict new drug candidates for retroviral infections. For this reason, we propose to develop a new predictive model combining perturbation theory (PT) bases and machine learning (ML) modeling to create a new tool that can take advantage of all the available information. The PTML model proposed in this work for the ChEMBL data set preclinical experimental assays for antiretroviral compounds consists of a linear equation with four variables. The PT operators used are founded on multicondition moving averages, combining different features and simplifying the difficulty to manage all data. More than 140 000 preclinical assays for 56 105 compounds with different characteristics or experimental conditions have been carried out and can be found in ChEMBL database, covering combinations with 359 biological activity parameters (), 55 protein accessions (), 83 cell lines (), 64 organisms of assay (), and 773 subtypes or strains. We have included 150 148 preclinical experimental assays for HIV virus, 1188 for HTLV virus, 84 for simian immunodeficiency virus, 370 for murine leukemia virus, 119 for Rous sarcoma virus, 1581 for MMTV, etc. We also included 5277 assays for hepatitis B virus. The developed PTML model reached considerable values in sensibility (73.05% for training and 73.10% for validation), specificity (86.61% for training and 87.17% for validation), and accuracy (75.84% for training and 75.98% for validation). We also compared alternative PTML models with different PT operators such as covariance, moments, and exponential terms. Finally, we made a comparison between literature ML models with our PTML model and also artificial neural network (ANN) nonlinear models. We conclude that this PTML model is the first one to consider multiple characteristics of preclinical experimental antiretroviral assays combined, generating a simple, useful, and adaptable instrument, which could reduce time and costs in antiretroviral drugs research.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.molpharmaceut.9b00538DOI Listing

Publication Analysis

Top Keywords

ptml model
20
antiretroviral compounds
12
preclinical experimental
12
machine learning
8
chembl data
8
retroviral infections
8
data set
8
experimental assays
8
ptml
6
model
6

Similar Publications

Ligand-Based Approach for Multi-Target Drug Discovery: PTML Modeling of Triple-Target Inhibitors.

Curr Top Med Chem

August 2024

LAQV@REQUIMTE/Department of Chemistry and Biochemistry, Faculty of Sciences, University of Porto, 4169-007, Porto, Portugal.

Background: Cancers are complex multi-genetic diseases that should be tackled in multi-target drug discovery scenarios. Computational methods are of great importance to accelerate the discovery of multi-target anticancer agents. Here, we employed a ligand-based approach by combining a perturbation-theory machine learning model derived from an ensemble of multilayer perceptron networks (PTML-EL-MLP) with the Fragment-Based Topological Design (FBTD) approach to rationally design and predict triple-target inhibitors against the cancerrelated proteins named Tropomyosin Receptor Kinase A (TRKA), poly[ADP-ribose] polymerase 1 (PARP-1), and Insulin-like Growth Factor 1 Receptor (IGF1R).

View Article and Find Full Text PDF

Neurodegenerative diseases involve progressive neuronal death. Traditional treatments often struggle due to solubility, bioavailability, and crossing the Blood-Brain Barrier (BBB). Nanoparticles (NPs) in biomedical field are garnering growing attention as neurodegenerative disease drugs (NDDs) carrier to the central nervous system.

View Article and Find Full Text PDF

Introduction: Drug discovery has provided modern societies with the means to fight against many diseases. In this sense, computational methods have been at the forefront, playing an important role in rationalizing the search for novel drugs. Yet, tackling phenomena such as the multi-genic nature of diseases and drug resistance are limitations of the current computational methods.

View Article and Find Full Text PDF

B cell primary thyroid malignant lymphoma (BC-PTML) accounts for 95% of all cases of PTML. However, development of effective treatment and management strategies for BC-PTML is challenging owing to the rarity of this disease. This study assessed data from 1,152 patients in the Surveillance, Epidemiology, and End Results (SEER) database who were diagnosed with BC-PTML during 2000-2015.

View Article and Find Full Text PDF

Prediction of Antileishmanial Compounds: General Model, Preparation, and Evaluation of 2-Acylpyrrole Derivatives.

J Chem Inf Model

August 2022

Departamento de Química Orgánica e Inorgánica, Facultad de Ciencia y Tecnología, Universidad del País Vasco / Euskal Herriko Unibertsitatea UPV/EHU, Apdo. 644, 48080 Bilbao, Spain.

In this work, the SOFT.PTML tool has been used to pre-process a ChEMBL dataset of pre-clinical assays of antileishmanial compound candidates. A comparative study of different ML algorithms, such as logistic regression (LOGR), support vector machine (SVM), and random forests (RF), has shown that the IFPTML-LOGR model presents excellent values of specificity and sensitivity (81-98%) in training and validation series.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!