Evaluation of the performance of various machine learning methods on the discrimination of the active compounds.

Chem Biol Drug Des

Pharmaceutical Research Center, Pharmaceutical Technology Institute, Mashhad University of Medical Sciences, Mashhad, Iran.

Published: April 2021

Machine learning (ML) method performances, including deep learning (DL) on a diverse set with or without feature selection (FS), were evaluated. The superior performance of DL on small sets has not been approved previously. On the other hand, the available sets for the newly identified targets usually are limited in terms of size. It was explored whether the FS, hyperparameters search, and using ensemble model are able to improve the ML and DL performance on the small sets. The QSAR classifier models were developed using K-nearest (KN) neighbors, DL, random forest (RF), naïve Bayesian (NB) classification, support vector machine (SVM), and logistic regression (LR). Generally, the best individual performers were DL and SVM. The LR had a similar performance to the DL and SVM on the small subsets. The nested cross-validation method was able to include different feature vectors in combination with different ML methods to generate an ensemble model for the datasets with similar performance to the best performers. The general performance for the baseline NB model was Matthews correlation coefficient = 0.356, and it was improved to around 0.66 and 0.63 by NB assisted FS with subsequent SVM/DL classification and an ensemble model, respectively.

Download full-text PDF

Source
http://dx.doi.org/10.1111/cbdd.13819DOI Listing

Publication Analysis

Top Keywords

ensemble model
12
machine learning
8
performance small
8
small sets
8
performance
5
evaluation performance
4
performance machine
4
learning methods
4
methods discrimination
4
discrimination active
4

Similar Publications

GradeDiff-IM: An Ensembles Model-based Grade Classification of Breast Cancer.

Biomed Phys Eng Express

January 2025

School of Engineering and Computing, University of the West of Scotland, University of the West of Scotland - Paisley Campus, Paisley PA1 2BE, UK, City, Paisley, PA1 2BE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.

Cancer grade classification is a challenging task identified from the cell structure of healthy and abnormal tissues. The partitioner learns about the malignant cell through the grading and plans the treatment strategy accordingly. A major portion of researchers used DL models for grade classification.

View Article and Find Full Text PDF

Background: Large language models (LLMs) have been proposed as valuable tools in medical education and practice. The Chinese National Nursing Licensing Examination (CNNLE) presents unique challenges for LLMs due to its requirement for both deep domain-specific nursing knowledge and the ability to make complex clinical decisions, which differentiates it from more general medical examinations. However, their potential application in the CNNLE remains unexplored.

View Article and Find Full Text PDF

Music pre-processing methods are currently becoming a recognized area of research with the goal of making music more accessible to listeners with a hearing impairment. Our previous study showed that hearing-impaired listeners preferred spectrally manipulated multi-track mixes. Nevertheless, the acoustical basis of mixing for hearing-impaired listeners remains poorly understood.

View Article and Find Full Text PDF

Harnessing the Power of Machine Learning Guided Discovery of NLRP3 Inhibitors Towards the Effective Treatment of Rheumatoid Arthritis.

Cells

December 2024

Department of Herbal Pharmacology, College of Korean Medicine, Gachon University, 1342 Seongnamdae-ro, Sujeong-gu, Seongnam-si 13120, Republic of Korea.

The NLRP3 inflammasome, plays a critical role in the pathogenesis of rheumatoid arthritis (RA) by activating inflammatory cytokines such as IL1β and IL18. Targeting NLRP3 has emerged as a promising therapeutic strategy for RA. In this study, a multidisciplinary approach combining machine learning, quantitative structure-activity relationship (QSAR) modeling, structure-activity landscape index (SALI), docking, molecular dynamics (MD), and molecular mechanics Poisson-Boltzmann surface area MM/PBSA assays was employed to identify novel NLRP3 inhibitors.

View Article and Find Full Text PDF

ModeHunter is a modular Python software package for the simulation of 3D biophysical motion across spatial resolution scales using modal analysis of elastic networks. It has been curated from our in-house Python scripts over the last 15 years, with a focus on detecting similarities of elastic motion between atomic structures, coarse-grained graphs, and volumetric data obtained from biophysical or biomedical imaging origins, such as electron microscopy or tomography. With ModeHunter, normal modes of biophysical motion can be analyzed with various static visualization techniques or brought to life by dynamics animation in terms of single or multimode trajectories or decoy ensembles.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!