The traditional biological assay is very time-consuming, and thus the ability to quickly screen large numbers of compounds against a specific biological target is appealing. To speed up the biological evaluation of compounds, high-throughput screening is widely used in the fields of biomedical, biological information, and drug discovery. The research presented in this study focuses on the use of support vector machines, a machine learning method, various classes of molecular descriptors, and different sampling techniques to overcome overfitting to classify compounds for cytotoxicity with respect to the Jurkat cell line. The cell cytotoxicity data set is imbalanced (a few active compounds and very many inactive compounds), and the ability of the predictive modeling methods is adversely affected in these situations. Commonly imbalanced data sets are overfit with respect to the dominant classified end point; in this study the models routinely overfit toward inactive (noncytotoxic) compounds when the imbalance was substantial. Support vector machine (SVM) models were used to probe the proficiency of different classes of molecular descriptors and oversampling ratios. The SVM models were constructed from 4D-FPs, MOE (1D, 2D, and 21/2D), noNP+MOE, and CATS2D trial descriptors pools and compared to the predictive abilities of CATS2D-based random forest models. Compared to previous results in the literature, the SVM models built from oversampled data sets exhibited better predictive abilities for the training and external test sets.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/ci4000536 | DOI Listing |
Plants (Basel)
January 2025
Maize Research Institute Zemun Polje, Slobodana Bajića 1, 11185 Belgrade, Serbia.
Driven by the growing demands for plant-based protein in Europe and attempts of soybean breeding programs to improve the productivity of created varieties, this study aimed to enhance genetic resource utilization efficiency by providing information relevant to well-focused breeding targets. A set of 90 accessions was subjected to a comprehensive assessment of genetic diversity in a soybean working collection using three marker types: morphological descriptors, agronomic traits, and SSRs. Genotype grouping patterns varied among the markers, displaying the best congruence with pedigree data and maturity for SSRs and agronomic traits, respectively.
View Article and Find Full Text PDFPharmaceuticals (Basel)
January 2025
Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile.
Acute myeloid leukemia (AML) presents significant therapeutic challenges, particularly in cases driven by mutations in the FLT3 tyrosine kinase. This study aimed to develop a robust and user-friendly machine learning-based quantitative structure-activity relationship (QSAR) model to predict the inhibitory potency (pIC values) of FLT3 inhibitors, addressing the limitations of previous models in dataset size, diversity, and predictive accuracy. Using a dataset which was 14 times larger than those employed in prior studies (1350 compounds with 1269 molecular descriptors), we trained a random forest regressor, chosen due to its superior predictive performance and resistance to overfitting.
View Article and Find Full Text PDFAntibiotics (Basel)
January 2025
Department of Food Chemistry and Nutrition, Faculty of Pharmacy, Jagiellonian University Medical College, 31-008 Cracow, Poland.
Background/objectives: Developing antifungal drugs with lower potential for interactions with food may help to optimize treatment and reduce the risk of antimicrobial resistance. Chemometrics uses statistical and mathematical methods to analyze multivariate chemical data, enabling the identification of key correlations and simplifying data interpretation. We used the partial least squares (PLS) approach to explore the correlations between various characteristics of oral antifungal drugs (including antifungal antibiotics) and dietary interventions, aiming to identify patterns that could inform the optimization of antifungal therapy.
View Article and Find Full Text PDFDiagnostics (Basel)
January 2025
Division of Microbiology, Immunology and Biotechnology, Department of Natural Products and Alternative Medicine, Faculty of Pharmacy, University of Tabuk, Tabuk 71491, Saudi Arabia.
COVID-19 has caused widespread morbidity and mortality, with its effects extending to multiple organ systems. Despite known risk factors for severe disease, including advanced age and underlying comorbidities, patient outcomes can vary significantly. This variability complicates efforts to predict disease progression and tailor treatment strategies.
View Article and Find Full Text PDFNeurogenetics
January 2025
Department of Surgery, Surgical Research Section, Hamad Medical Corporation, Doha, Qatar.
Memory is a dynamic process of encoding, storing, and retrieving information. It includes sensory, short-term, and long-term memory, each with unique characteristics. Nitric oxide (NO) is a biological messenger synthesized on demand by neuronal nitric oxide synthase (nNOS) through a biochemical process initiated by glutamate binding to NMDA receptors, causing membrane depolarization and calcium influx.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!