Publications by authors named "Mahendra Awale"

In the present manuscript, we describe how we successfully used ligand-based virtual screening (LBVS) to identify two small-molecule, drug-like hit classes with excellent ADMET profiles against the difficult to address microbial enzyme 1-deoxy-d-xylulose-5-phosphate synthase (DXPS). In the fight against antimicrobial resistance (AMR), it has become increasingly important to address novel targets such as DXPS, the first enzyme of the 2--methyl-d-erythritol-4-phosphate (MEP) pathway, which affords the universal isoprenoid precursors. This pathway is absent in humans but essential for pathogens such as , making it a rich source of drug targets for the development of novel anti-infectives.

View Article and Find Full Text PDF

Solute carrier proteins (SLCs) are membrane proteins controlling fluxes across biological membranes and represent an emerging class of drug targets. Here we searched for inhibitors of divalent metal transporters in a library of 1,676 commercially available 3D-shaped fragment-like molecules from the generated database GDB-17, which lists all possible organic molecules up to 17 atoms of C, N, O, S and halogen following simple criteria for chemical stability and synthetic feasibility. While screening against DMT1 (SLC11A2), an iron transporter associated with hemochromatosis and for which only very few inhibitors are known, only yielded two weak inhibitors, our approach led to the discovery of the first inhibitor of ZIP8 (SLC39A8), a zinc transporter associated with manganese homeostasis and osteoarthritis but with no previously reported pharmacology, demonstrating that this target is druggable.

View Article and Find Full Text PDF

Large databases of biologically relevant molecules, such as ChEMBL, SureChEMBL, or compound collections of pharmaceutical or agrochemical companies, are invaluable sources of medicinal chemistry information, albeit implicit. We developed a modified matched molecular pair approach to systematically and exhaustively extract the transformations in these databases and distill them into snippets of explicit design knowledge that are easily interpretable and directly applicable. The resulting "playbooks of medicinal chemistry design moves" capture the collective pharmaceutical and agrochemical research expertise across multiple chemists, companies, targets, and projects.

View Article and Find Full Text PDF

Generation and prioritization of new molecules are the most central part of the drug design process. Matched molecular series analysis (MMSA) has recently been proposed as a formal approach that captures both of these key elements of design. In order to better understand the power of MMSA and its specific limitations, we here evaluate its performance as an ADME property prediction tool.

View Article and Find Full Text PDF

Chemical space is a concept to organize molecular diversity by postulating that different molecules occupy different regions of a mathematical space where the position of each molecule is defined by its properties. Our aim is to develop methods to explicitly explore chemical space in the area of drug discovery. Here we review our implementations of machine learning in this project, including our use of deep neural networks to enumerate the GDB13 database from a small sample set, to generate analogs of drugs and natural products after training with fragment-size molecules, and to predict the polypharmacology of molecules after training with known bioactive compounds from ChEMBL.

View Article and Find Full Text PDF

The generated database GDB17 enumerates 166.4 billion possible molecules up to 17 atoms of C, N, O, S and halogens following simple chemical stability and synthetic feasibility rules, however medicinal chemistry criteria are not taken into account. Here we applied rules inspired by medicinal chemistry to exclude problematic functional groups and complex molecules from GDB17, and sampled the resulting subset uniformly across molecular size, stereochemistry and polarity to form GDBMedChem as a compact collection of 10 million small molecules.

View Article and Find Full Text PDF

Photopharmacology relies on molecules that change their biological activity upon irradiation. Many of these are derived from known drugs by replacing their core with an isosteric azobenzene photoswitch (azologization). The question is how many of the known bioactive ligands could be addressed in such a way.

View Article and Find Full Text PDF

Several recent reports have shown that long short-term memory generative neural networks (LSTM) of the type used for grammar learning efficiently learn to write Simplified Molecular Input Line Entry System (SMILES) of druglike compounds when trained with SMILES from a database of bioactive compounds such as ChEMBL and can later produce focused sets upon transfer learning with compounds of specific bioactivity profiles. Here we trained an LSTM using molecules taken either from ChEMBL, DrugBank, commercially available fragments, or from FDB-17 (a database of fragments up to 17 atoms) and performed transfer learning to a single known drug to obtain new analogs of this drug. We found that this approach readily generates hundreds of relevant and diverse new drug analogs and works best with training sets of around 40,000 compounds as simple as commercial fragments.

View Article and Find Full Text PDF

Seven million of the currently 94 million entries in the PubChem database break at least one of the four Lipinski constraints for oral bioavailability, 183,185 of which are also found in the ChEMBL database. These non-Lipinski PubChem (NLP) and ChEMBL (NLC) subsets are interesting because they contain new modalities that can display biological properties not accessible to small molecule drugs. Unfortunately, the current search tools in PubChem and ChEMBL are designed for small molecules and are not well suited to explore these subsets, which therefore remain poorly appreciated.

View Article and Find Full Text PDF

We recently reported 4-chloro-2-(2-chlorophenoxy)acetamido)benzoic acid (CBA) as the first potent inhibitor of TRPM4, a cation channel implicated in cardiac diseases and prostate cancer. Herein we report a structure-activity relationship (SAR) study of CBA resulting in two new potent analogs. To design and interpret our SAR we used interactive color-coded 3D-maps representing similarities between compounds calculated with MHFP6 (MinHash fingerprint up to six bonds), a new molecular fingerprint outperforming other fingerprints in benchmarking virtual screening studies.

View Article and Find Full Text PDF

Here we report PPB2 as a target prediction tool assigning targets to a query molecule based on ChEMBL data. PPB2 computes ligand similarities using molecular fingerprints encoding composition (MQN), molecular shape and pharmacophores (Xfp), and substructures (ECfp4) and features an unprecedented combination of nearest neighbor (NN) searches and Naı̈ve Bayes (NB) machine learning, together with simple NN searches, NB and Deep Neural Network (DNN) machine learning models as further options. Although NN(ECfp4) gives the best results in terms of recall in a 10-fold cross-validation study, combining NN searches with NB machine learning provides superior precision statistics, as well as better results in a case study predicting off-targets of a recently reported TRPV6 calcium channel inhibitor, illustrating the value of this combined approach.

View Article and Find Full Text PDF

By screening a focused library of kinase inhibitor analogues in a phenotypic co-culture assay for angiogenesis inhibition, we identified an aminotriazine that acts as a cytostatic nanomolar inhibitor. However, this aminotriazine was found to be completely inactive in a whole-kinome profiling assay. To decipher its mechanism of action, we used the online target prediction tool PPB2 (http://ppb2.

View Article and Find Full Text PDF

Drug promiscuity or polypharmacology is the ability of small molecules to interact with multiple protein targets simultaneously. In drug discovery, understanding the polypharmacology of potential drug molecules is crucial to improve their efficacy and safety, and to discover the new therapeutic potentials of existing drugs. Over the past decade, several computational methods have been developed to study the polypharmacology of small molecules, many of which are available as Web services.

View Article and Find Full Text PDF

Chemical space describes all possible molecules as well as multi-dimensional conceptual spaces representing the structural diversity of these molecules. Part of this chemical space is available in public databases ranging from thousands to billions of compounds. Exploiting these databases for drug discovery represents a typical big data problem limited by computational power, data storage and data access capacity.

View Article and Find Full Text PDF

Here, we explore the chemical space of all virtually possible organic molecules focusing on ring systems, which represent the cyclic cores of organic molecules obtained by removing all acyclic bonds and converting all remaining atoms to carbon. This approach circumvents the combinatorial explosion encountered when enumerating the molecules themselves. We report the chemical universe database GDB4c containing 916 130 ring systems up to four saturated or aromatic rings and maximum ring size of 14 atoms and GDB4c3D containing the corresponding 6 555 929 stereoisomers.

View Article and Find Full Text PDF

Identification of new hits is one of the biggest challenges in drug discovery. Creating a library of well-characterized drug-like compounds is a key step in this process. Our group has developed an in-house chemical library called the Medicinal and Biological Chemistry (MBC) library.

View Article and Find Full Text PDF

To better understand chemical space we recently enumerated the database GDB-17 containing 166.4 billion possible molecules up to 17 atoms of C, N, O, S and halogen following the simple rules of chemical stability and synthetic feasibility. However, due to the combinatorial explosion caused by systematic enumeration GDB-17 is strongly biased toward the largest, functionally and stereochemically most complex molecules and far too large for most virtual screening tools.

View Article and Find Full Text PDF

The concept of chemical space provides a convenient framework to analyze large collections of molecules by placing them in property spaces where distances represent similarities. Here we report webMolCS, a new type of web-based interface visualizing up to 5000 user-defined molecules in six different three-dimensional (3D) chemical spaces obtained by principal component analysis or similarity mapping of multidimensional property spaces describing composition (MQN: 42D molecular quantum numbers, SMIfp: 34D SMILES fingerprint), shapes and pharmacophores (APfp: 20D atom pair fingerprint, Xfp: 55D category extended atom pair fingerprint), and substructures (Sfp: 1024D binary substructure fingerprint, ECfp4:1024D extended connectivity fingerprint). Each molecule is shown as a sphere, and its structure appears on mouse over.

View Article and Find Full Text PDF

Background: Several web-based tools have been reported recently which predict the possible targets of a small molecule by similarity to compounds of known bioactivity using molecular fingerprints (fps), however predictions in each case rely on similarities computed from only one or two fps. Considering that structural similarity and therefore the predicted targets strongly depend on the method used for comparison, it would be highly desirable to predict targets using a broader set of fps simultaneously.

Results: Herein, we present the polypharmacology browser (PPB), a web-based platform which predicts possible targets for small molecules by searching for nearest neighbors using ten different fps describing composition, substructures, molecular shape and pharmacophores.

View Article and Find Full Text PDF

Here we report the discovery of a selective inhibitor of Aurora A, a key regulator of cell division and potential anticancer target. We used the atom category extended ligand overlap score (xLOS), a 3D ligand-based virtual screening method recently developed in our group, to select 437 shape and pharmacophore analogs of reference kinase inhibitors. Biochemical screening uncovered two inhibitor series with scaffolds unprecedented among kinase inhibitors.

View Article and Find Full Text PDF

Background: Similarly to the periodic table for elements, chemical space offers an organizing principle for representing the diversity of organic molecules, usually in the form of multi-dimensional property spaces that are subjected to dimensionality reduction methods to obtain 3D-spaces or 2D-maps suitable for visual inspection. Unfortunately, tools to look at chemical space on the internet are currently very limited.

Results: Herein we present webDrugCS, a web application freely available at www.

View Article and Find Full Text PDF

Background: The RCSB Protein Data Bank (PDB) provides public access to experimentally determined 3D-structures of biological macromolecules (proteins, peptides and nucleic acids). While various tools are available to explore the PDB, options to access the global structural diversity of the entire PDB and to perceive relationships between PDB structures remain very limited.

Methods: A 136-dimensional atom pair 3D-fingerprint for proteins (3DP) counting categorized atom pairs at increasing through-space distances was designed to represent the molecular shape of PDB-entries.

View Article and Find Full Text PDF

Herein, we report the discovery of the first potent and selective inhibitor of TRPV6, a calcium channel overexpressed in breast and prostate cancer, and its use to test the effect of blocking TRPV6-mediated Ca(2+)-influx on cell growth. The inhibitor was discovered through a computational method, xLOS, a 3D-shape and pharmacophore similarity algorithm, a type of ligand-based virtual screening (LBVS) method described briefly here. Starting with a single weakly active seed molecule, two successive rounds of LBVS followed by optimization by chemical synthesis led to a selective molecule with 0.

View Article and Find Full Text PDF

Divalent metal transporter-1 (SLC11A2/DMT1) uses the H(+) electrochemical gradient as the driving force to transport divalent metal ions such as Fe(2+), Mn(2+) and others metals into mammalian cells. DMT1 is ubiquitously expressed, most notably in proximal duodenum, immature erythroid cells, brain and kidney. This transporter mediates H(+)-coupled transport of ferrous iron across the apical membrane of enterocytes.

View Article and Find Full Text PDF