Cheminformatics datasets used in classification problems, especially those related to biological or physicochemical properties, are often imbalanced. This presents a major challenge in development of in silico prediction models, as the traditional machine learning algorithms are known to work best on balanced datasets. The class imbalance introduces a bias in the performance of these algorithms due to their preference towards the majority class. Here, we present a comparison of the performance of seven different meta-classifiers for their ability to handle imbalanced datasets, whereby Random Forest is used as base-classifier. Four different datasets that are directly (cholestasis) or indirectly (via inhibition of organic anion transporting polypeptide 1B1 and 1B3) related to liver toxicity were chosen for this purpose. The imbalance ratio in these datasets ranges between 4:1 and 20:1 for negative and positive classes, respectively. Three different sets of molecular descriptors for model development were used, and their performance was assessed in 10-fold cross-validation and on an independent validation set. Stratified bagging, MetaCost and CostSensitiveClassifier were found to be the best performing among all the methods. While MetaCost and CostSensitiveClassifier provided better sensitivity values, Stratified Bagging resulted in high balanced accuracies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919997 | PMC |
http://dx.doi.org/10.1007/s10822-018-0116-z | DOI Listing |
Database (Oxford)
January 2025
Department of In Vitro Toxicology and Dermato-Cosmetology (IVTD), Vrije Universiteit Brussel, Laarbeeklaan 103, Brussels 1090, Belgium.
The European Union's ban on animal testing for cosmetic products and their ingredients, combined with the lack of validated animal-free methods, poses challenges in evaluating their potential repeated-dose organ toxicity. To address this, innovative strategies like Next-Generation Risk Assessment (NGRA) are being explored, integrating historical animal data with new mechanistic insights from non-animal New Approach Methodologies (NAMs). This paper introduces the TOXIN knowledge graph (TOXIN KG), a tool designed to retrieve toxicological information on cosmetic ingredients, with a focus on liver-related data.
View Article and Find Full Text PDFPLoS One
January 2025
Heilongjiang University of Traditional Chinese Medicine, Harbin, Heilongjiang, China.
Hepatocellular carcinoma(HCC) has a high mortality and morbidity rate and seriously jeopardizes human life. Chemicals and chemotherapeutic agents have been experiencing problems such as side effects and drug resistance in the treatment of HCC, which cannot meet the needs of clinical treatment. Therefore, finding novel low-toxicity and high-efficiency anti-hepatocellular carcinoma drugs and exploring their mechanisms of action have become the current problems to be solved in the treatment of HCC.
View Article and Find Full Text PDFNaunyn Schmiedebergs Arch Pharmacol
January 2025
Center of Studies and Research Toxic-Pharmacological, School of Pharmacy, Federal University of Goias, Leste Universitario, 240th Street, Corner of 5th Avenue, Goiania, GO, 74605-170, Brazil.
The CCl-induced hepatotoxicity model is a traditional preclinical assay applied to evaluate potential hepatoprotective compounds. However, several studies have used it with inappropriate dose and exposure time, generating both weak response or irreversible liver injury, as well as lack of representative liver and plasma biomarkers. Therefore, this study aims to determine the best dose and exposure time of CCl in Wistar rats, permitting a proper evaluation of potential hepatoprotective effect.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Radiation Oncology, Fujian Medical University Union Hospital, Fuzhou, 350001, Fujian, China.
Ginsenoside Rd (Rd) is a bioactive compound predominantly found in Panax ginseng C.A. Meyer and Panax notoginseng (Burkill) F.
View Article and Find Full Text PDFJ Control Release
January 2025
Bioprocessing Technology Institute (BTI), Agency for Science, Technology and Research (A*STAR), 20 Biopolis Way, #06-01 Centros, Singapore 138668, Republic of Singapore. Electronic address:
mRNA-loaded lipid nanoparticles (mRNA-LNPs) hold great potential for disease treatment and prevention. LNPs are normally made from four lipids including ionizable lipid, helper lipid, cholesterol, and PEGylated lipid (PEG-lipid). Although PEG-lipid has the lowest content, it plays a crucial role in the effective delivery of mRNA-LNPs.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!