Combining Data with Predictions for Modeling Hepatic Steatosis by Using Stratified Bagging and Conformal Prediction.

Chem Res Toxicol

Department of Pharmaceutical Chemistry, Division of Drug Design and Medicinal Chemistry, University of Vienna, 1090 Vienna, Austria.

Published: February 2021

Hepatic steatosis (fatty liver) is a severe liver disease induced by the excessive accumulation of fatty acids in hepatocytes. In this study, we developed reliable models for predicting hepatic steatosis on the basis of an data set of 1041 compounds measured in rodent studies with repeated oral exposure. The imbalanced nature of the data set (1:8, with the "steatotic" compounds belonging to the minority class) required the use of meta-classifiers-bagging with stratified under-sampling and Mondrian conformal prediction-on top of the base classifier random forest. One major goal was the investigation of the influence of different descriptor combinations on model performance (tested by predicting an external validation set): physicochemical descriptors (RDKit), ToxPrint features, as well as predictions from nuclear receptor and transporter models. All models based upon descriptor combinations including physicochemical features led to reasonable balanced accuracies (BAs between 0.65 and 0.69 for the respective models). Combining physicochemical features with transporter predictions and further with ToxPrint features gave the best performing model (BAs up to 0.7 and efficiencies of 0.82). Whereas both meta-classifiers proved useful for this highly imbalanced toxicity data set, the conformal prediction framework also guarantees the error level and thus might be favored for future studies in the field of predictive toxicology.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887803PMC
http://dx.doi.org/10.1021/acs.chemrestox.0c00511DOI Listing

Publication Analysis

Top Keywords

hepatic steatosis
12
data set
12
conformal prediction
8
descriptor combinations
8
toxprint features
8
physicochemical features
8
combining data
4
data predictions
4
predictions modeling
4
modeling hepatic
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!