Predictive and computational toxicology, a highly scientific and research-based field, is rapidly progressing with wider acceptance by regulatory agencies around the world. Almost every aspect of the field has seen fundamental changes during the last decade due to the availability of more data, usage, and acceptance of a variety of predictive tools and an increase in the overall awareness. Also, the influence from the recent explosive developments in the field of artificial intelligence has been significant.
View Article and Find Full Text PDFMany traditional quantitative structure-activity relationship (QSAR) models are based on correlation with high-dimensional, highly variable molecular features in their raw form, limiting their generalizing capabilities despite the use of large training sets. They also lack elements of causality and reasoning. With these issues in mind, we developed a method for learning higher-level abstract representations of the effects of the interactions between molecular features and biology.
View Article and Find Full Text PDFFront Artif Intell
September 2019
Current practice of building QSAR models usually involves computing a set of descriptors for the training set compounds, applying a descriptor selection algorithm and finally using a statistical fitting method to build the model. In this study, we explored the prospects of building good quality interpretable QSARs for big and diverse datasets, without using any pre-calculated descriptors. We have used different forms of Long Short-Term Memory (LSTM) neural networks to achieve this, trained directly using either traditional SMILES codes or a new linear molecular notation developed as part of this work.
View Article and Find Full Text PDFThis article describes a method to generate molecular fingerprints from structural environments of mutagenicity alerts and calculate similarity between them. This approach was used to improve classification accuracy of alerts and for searching structurally similar analogues of an alerting chemical. It builds fingerprints using molecular fragments from the vicinity of the alerts and automatically accounts for the activating and deactivating/mitigating features of alerts needed for accurate predictions.
View Article and Find Full Text PDFThis article describes an unsupervised machine learning method for computing distributed vector representation of molecular fragments. These vectors encode fragment features in a continuous high-dimensional space and enable similarity computation between individual fragments, even for small fragments with only two heavy atoms. The method is based on a word embedding algorithm borrowed from natural language processing field, and approximately 6 million unlabeled PubChem chemicals were used for training.
View Article and Find Full Text PDFPurpose: Oral bioavailability (%F) is a key factor that determines the fate of a new drug in clinical trials. Traditionally, %F is measured using costly and time-consuming experimental tests. Developing computational models to evaluate the %F of new drugs before they are synthesized would be beneficial in the drug discovery process.
View Article and Find Full Text PDFFragment based expert system models of toxicological end points are primarily comprised of a set of substructures that are statistically related to the toxic property in question. These special substructures are often referred to as toxicity alerts, toxicophores, or biophores. They are the main building blocks/classifying units of the model, and it is important to define the chemical structural space within which the alerts are expected to produce reliable predictions.
View Article and Find Full Text PDFThe primary functions of cancer chemotherapeutic agents are not only to inhibit the growth or kill the cancer cells, but to do so without eliciting unreasonable cytotoxic effects on the healthy cells and to withstand the ability of the cancer cells to develop resistance against it. This has unfortunately been proven so far to be a very difficult objective. In this perspective, the ability of small molecules (anti-tumor agents) to 'see' different cell types differently can be a key attribute.
View Article and Find Full Text PDFWe describe here the development of a computer program which uses a new method called Expert System Prediction (ESP), to predict toxic end points and pharmacological properties of chemicals based on multiple modules created by the MCASE artificial intelligence system. The modules are generally based on different biological models measuring related end points. The purpose is to improve the decision making process regarding the overall activity or inactivity of the chemicals and also to enable rapid in silico screening.
View Article and Find Full Text PDFA structurally and functionally diverse and cross-validated quantitative structure-activity knowledge database generated by the MultiCASE expert system was used to screen 2526 high production volume chemicals (HPVCs) for their estrogen receptor binding activity. 73 HPVCs were found to contain structural features or biophores that have been documented as having the ability to bind to the estrogen receptor. Potential chemicals were ranked according to their quantitatively predicted ER binding potential and the details of the biophores found in them are discussed.
View Article and Find Full Text PDFThe MultiCASE expert system was used to construct a quantitative structure-activity relationship model to screen chemicals with estrogen receptor (ER) binding potential. Structures and ER binding data of 313 chemicals were used as inputs to train the expert system. The training data set covers inactive, weak as well as very powerful ER binders and represents a variety of chemical compounds.
View Article and Find Full Text PDF