Machine learning prediction of UV-Vis spectra features of organic compounds related to photoreactive potential.

Sci Rep

LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal.

Published: December 2021

Machine learning (ML) algorithms were explored for the classification of the UV-Vis absorption spectrum of organic molecules based on molecular descriptors and fingerprints generated from 2D chemical structures. Training and test data (~ 75 k molecules and associated UV-Vis data) were assembled from a database with lists of experimental absorption maxima. They were labeled with positive class (related to photoreactive potential) if an absorption maximum is reported in the range between 290 and 700 nm (UV/Vis) with molar extinction coefficient (MEC) above 1000 Lmol cm, and as negative if no such a peak is in the list. Random forests were selected among several algorithms. The models were validated with two external test sets comprising 998 organic molecules, obtaining a global accuracy up to 0.89, sensitivity of 0.90 and specificity of 0.88. The ML output (UV-Vis spectrum class) was explored as a predictor of the 3T3 NRU phototoxicity in vitro assay for a set of 43 molecules. Comparable results were observed with the classification directly based on experimental UV-Vis data in the same format.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660842PMC
http://dx.doi.org/10.1038/s41598-021-03070-9DOI Listing

Publication Analysis

Top Keywords

machine learning
8
photoreactive potential
8
organic molecules
8
uv-vis data
8
uv-vis
5
learning prediction
4
prediction uv-vis
4
uv-vis spectra
4
spectra features
4
features organic
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!