Machine learning and drug discovery for neglected tropical diseases.

BMC Bioinformatics

Dartmouth Institute for Health Policy and Clinical Practice, Geisel School of Medicine, Hanover, NH, USA.

Published: April 2023

Neglected tropical diseases affect millions of individuals and cause loss of productivity worldwide. They are common in developing countries that lack the financial resources for research and drug development. With the increased availability of data from high-throughput screening, machine learning has been introduced into the drug discovery process. Models can be trained to predict the biological activities of compounds before any laboratory work is done. In this study, we use three publicly available high-throughput screening datasets to train machine learning models to predict biological activities related to inhibition of the species that cause leishmaniasis, American trypanosomiasis (Chagas disease), and African trypanosomiasis (sleeping sickness). We compare machine learning models (tree-based models, naive Bayes classifiers, and neural networks), featurizing methods (circular fingerprints, MACCS fingerprints, and RDKit descriptors), and techniques for dealing with imbalanced data (oversampling, undersampling, and class weight/sample weight).
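The three imbalance-handling techniques named in the abstract can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: the function names, the toy data, and the "balanced" weighting formula (n_samples / (n_classes * class_count)) are assumptions chosen for illustration, and a real pipeline would apply these steps to featurized compounds rather than to placeholder rows.

```python
import random
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    Assumed formula for illustration; rarer classes get larger weights.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * c) for cls, c in counts.items()}


def _group_by_class(features, labels):
    """Collect feature rows under their class label."""
    groups = {}
    for x, y in zip(features, labels):
        groups.setdefault(y, []).append(x)
    return groups


def random_oversample(features, labels, seed=0):
    """Duplicate randomly chosen minority rows until all classes match the largest."""
    rng = random.Random(seed)
    groups = _group_by_class(features, labels)
    target = max(len(rows) for rows in groups.values())
    out_x, out_y = [], []
    for y, rows in groups.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        out_x.extend(rows + extra)
        out_y.extend([y] * target)
    return out_x, out_y


def random_undersample(features, labels, seed=0):
    """Keep a random subset of each class, sized to match the smallest class."""
    rng = random.Random(seed)
    groups = _group_by_class(features, labels)
    target = min(len(rows) for rows in groups.values())
    out_x, out_y = [], []
    for y, rows in groups.items():
        out_x.extend(rng.sample(rows, target))
        out_y.extend([y] * target)
    return out_x, out_y


# Hypothetical toy screen: 2 active compounds (label 1) and 8 inactive (label 0).
toy_labels = [1] * 2 + [0] * 8
toy_features = [[i] for i in range(10)]  # placeholder one-column feature rows

print(balanced_class_weights(toy_labels))
print(Counter(random_oversample(toy_features, toy_labels)[1]))
print(Counter(random_undersample(toy_features, toy_labels)[1]))
```

Resampling changes the training set itself, whereas class weights leave the data untouched and instead scale each example's contribution to the loss. Either way, the adjustment would normally be applied to the training split only, so that evaluation still reflects the original class ratio.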

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127295
DOI: http://dx.doi.org/10.1186/s12859-022-05076-0
