Publications by Alexander Tropsha | LitMetric

Publications by authors named "Alexander Tropsha"

Page 1 of 9

A Novel Machine Learning Model and a Web Portal for Predicting the Human Skin Sensitization Effects of Chemical Agents.

Ricardo Scheufen Tieghi José Teófilo Moreira-Filho Holli-Joi Martin James Wellnitz Miguel Canamary Otoch Alexander Tropsha

Toxics

November 2024

Skin sensitization is a significant concern for chemical safety assessments. Traditional animal assays often fail to predict human responses accurately, and ethical constraints limit the collection of human data, necessitating a need for reliable in silico models of skin sensitization prediction. This study introduces HuSSPred, an in silico tool based on the Human Predictive Patch Test (HPPT).

View Article and Find Full Text PDF

Heli-SMACC: Helicase-targeting SMAll Molecule Compound Collection.

Holli-Joi Martin Mohammad A Hossain James Wellnitz Enes Kelestemur Joshua E Hochuli Alexander Tropsha

bioRxiv

July 2024

Helicases have emerged as promising targets for the development of antiviral drugs; however, the family remains largely undrugged. To support the focused development of viral helicase inhibitors we identified, collected, and integrated all chemogenomics data for all available helicases from the ChEMBL database. After thoroughly curating and enriching the data with relevant annotations we have created a derivative database of helicase inhibitors which we dubbed Heli-SMACC (Helicase-targeting SMAll Molecule Compound Collection).

View Article and Find Full Text PDF

A data science roadmap for open science organizations engaged in early-stage drug discovery.

Kristina Edfeldt Aled M Edwards Ola Engkvist Judith Günther Matthew Hartley Alexander Tropsha

Nat Commun

July 2024

The Structural Genomics Consortium is an international open science research organization with a focus on accelerating early-stage drug discovery, namely hit discovery and optimization. We, as many others, believe that artificial intelligence (AI) is poised to be a main accelerator in the field. The question is then how to best benefit from recent advances in AI and how to generate, format and disseminate data to enable future breakthroughs in AI-guided drug discovery.

View Article and Find Full Text PDF

Modeling interactions between Heparan sulfate and proteins based on the Heparan sulfate microarray analysis.

Cleber C Melo-Filho Guowei Su Kevin Liu Eugene N Muratov Alexander Tropsha

Glycobiology

May 2024

Heparan sulfate (HS), a sulfated polysaccharide abundant in the extracellular matrix, plays pivotal roles in various physiological and pathological processes by interacting with proteins. Investigating the binding selectivity of HS oligosaccharides to target proteins is essential, but the exhaustive inclusion of all possible oligosaccharides in microarray experiments is impractical. To address this challenge, we present a hybrid pipeline that integrates microarray and in silico techniques to design oligosaccharides with desired protein affinity.

View Article and Find Full Text PDF

STOPLIGHT: A Hit Scoring Calculator.

James Wellnitz Holli-Joi Martin Mohammad Anwar Hossain Marielle Rath Colton Fox Alexander Tropsha

J Chem Inf Model

June 2024

We introduce STOPLIGHT, a web portal to assist medicinal chemists in prioritizing hits from screening campaigns and the selection of compounds for optimization. STOPLIGHT incorporates services to assess 6 physiochemical and structural properties, 6 assay liabilities, and 11 pharmacokinetic properties, for any small molecule represented by its SMILES string. We briefly describe each service and illustrate the utility of this portal with a case study.

View Article and Find Full Text PDF

Utilizing Low-Dimensional Molecular Embeddings for Rapid Chemical Similarity Search.

Kathryn E Kirchoff James Wellnitz Joshua E Hochuli Travis Maxfield Konstantin I Popov Alexander Tropsha

Adv Inf Retr

March 2024

Nearest neighbor-based similarity searching is a common task in chemistry, with notable use cases in drug discovery. Yet, some of the most commonly used approaches for this task still leverage a brute-force approach. In practice this can be computationally costly and overly time-consuming, due in part to the sheer size of modern chemical databases.

View Article and Find Full Text PDF

Pharmacokinetics Profiler (PhaKinPro): Model Development, Validation, and Implementation as a Web Tool for Triaging Compounds with Undesired Pharmacokinetics Profiles.

Marielle Rath James Wellnitz Holli-Joi Martin Cleber Melo-Filho Joshua E Hochuli Alexander Tropsha

J Med Chem

April 2024

Computational models that predict pharmacokinetic properties are critical to deprioritize drug candidates that emerge as hits in high-throughput screening campaigns. We collected, curated, and integrated a database of compounds tested in 12 major end points comprising over 10,000 unique molecules. We then employed these data to build and validate binary quantitative structure-activity relationship (QSAR) models.

View Article and Find Full Text PDF

An Improved Metric and Benchmark for Assessing the Performance of Virtual Screening Models.

Michael Brocidiacono Konstantin I Popov Alexander Tropsha

ArXiv

March 2024

Structure-based virtual screening (SBVS) is a key workflow in computational drug discovery. SBVS models are assessed by measuring the enrichment of known active molecules over decoys in retrospective screens. However, the standard formula for enrichment cannot estimate model performance on very large libraries.

View Article and Find Full Text PDF

Cheminformatics-Guided Cell-Free Exploration of Peptide Natural Products.

Jarrett M Pelton Joshua E Hochuli Patric W Sadecki Takayuki Katoh Hiroaki Suga Alexander Tropsha

J Am Chem Soc

March 2024

There have been significant advances in the flexibility and power of cell-free translation systems. The increasing ability to incorporate noncanonical amino acids and complement translation with recombinant enzymes has enabled cell-free production of peptide-based natural products (NPs) and NP-like molecules. We anticipate that many more such compounds and analogs might be accessed in this way.

View Article and Find Full Text PDF

ExEmPLAR (Extracting, Exploring, and Embedding Pathways Leading to Actionable Research): a user-friendly interface for knowledge graph mining.

Jon-Michael T Beasley Daniel R Korn Nyssa N Tucker Erick T M Alves Eugene N Muratov Alexander Tropsha

Bioinformatics

January 2024

Summary: Knowledge graphs are being increasingly used in biomedical research to link large amounts of heterogenous data and facilitate reasoning across diverse knowledge sources. Wider adoption and exploration of knowledge graphs in the biomedical research community is limited by requirements to understand the underlying graph structure in terms of entity types and relationships, represented as nodes and edges, respectively, and learn specialized query languages for graph mining and exploration. We have developed a user-friendly interface dubbed ExEmPLAR (Extracting, Exploring, and Embedding Pathways Leading to Actionable Research) to aid reasoning over biomedical knowledge graphs and assist with data-driven research and hypothesis generation.

View Article and Find Full Text PDF

BigBind: Learning from Nonstructural Data for Structure-Based Virtual Screening.

Michael Brocidiacono Paul Francoeur Rishal Aggarwal Konstantin I Popov David Ryan Koes Alexander Tropsha

J Chem Inf Model

April 2024

Deep learning methods that predict protein-ligand binding have recently been used for structure-based virtual screening. Many such models have been trained using protein-ligand complexes with known crystal structures and activities from the PDBBind data set. However, because PDBbind only includes 20K complexes, models typically fail to generalize to new targets, and model performance is on par with models trained with only ligand information.

View Article and Find Full Text PDF

Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR.

Alexander Tropsha Olexandr Isayev Alexandre Varnek Gisbert Schneider Artem Cherkasov

Nat Rev Drug Discov

February 2024

Quantitative structure-activity relationship (QSAR) modelling, an approach that was introduced 60 years ago, is widely used in computer-aided drug design. In recent years, progress in artificial intelligence techniques, such as deep learning, the rapid growth of databases of molecules for virtual screening and dramatic improvements in computational power have supported the emergence of a new field of QSAR applications that we term 'deep QSAR'. Marking a decade from the pioneering applications of deep QSAR to tasks involved in small-molecule drug discovery, we herein describe key advances in the field, including deep generative and reinforcement learning approaches in molecular design, deep learning models for synthetic planning and the application of deep QSAR models in structure-based virtual screening.

View Article and Find Full Text PDF

Vaccination Against Pneumonia May Provide Genotype-Specific Protection Against Alzheimer's Disease.

Svetlana Ukraintseva Matt Duan Amanda M Simanek Rachel Holmes Olivia Bagley Alexander Tropsha

J Alzheimers Dis

November 2023

Vaccine repurposing that considers individual genotype may aid personalized prevention of Alzheimer's disease (AD). In this retrospective cohort study, we used Cardiovascular Health Study data to estimate associations of pneumococcal polysaccharide vaccine and flu shots received between ages 65-75 with AD onset at age 75 or older, taking into account rs6859 polymorphism in NECTIN2 gene (AD risk factor). Pneumococcal vaccine, and total count of vaccinations against pneumonia and flu, were associated with lower odds of AD in carriers of rs6859 A allele, but not in non-carriers.

View Article and Find Full Text PDF

HIt Discovery using docking ENriched by GEnerative Modeling (HIDDEN GEM): A novel computational workflow for accelerated virtual screening of ultra-large chemical libraries.

Konstantin I Popov James Wellnitz Travis Maxfield Alexander Tropsha

Mol Inform

January 2024

Recent rapid expansion of make-on-demand, purchasable, chemical libraries comprising dozens of billions or even trillions of molecules has challenged the efficient application of traditional structure-based virtual screening methods that rely on molecular docking. We present a novel computational methodology termed HIDDEN GEM (HIt Discovery using Docking ENriched by GEnerative Modeling) that greatly accelerates virtual screening. This workflow uniquely integrates machine learning, generative chemistry, massive chemical similarity searching and molecular docking of small, selected libraries in the beginning and the end of the workflow.

View Article and Find Full Text PDF

School of cheminformatics in Latin America.

Karla Gonzalez-Ponce Carolina Horta Andrade Fiona Hunter Johannes Kirchmair Karina Martinez-Mayorga Alexander Tropsha

J Cheminform

September 2023

We report the major highlights of the School of Cheminformatics in Latin America, Mexico City, November 24-25, 2022. Six lectures, one workshop, and one roundtable with four editors were presented during an online public event with speakers from academia, big pharma, and public research institutions. One thousand one hundred eighty-one students and academics from seventy-nine countries registered for the meeting.

View Article and Find Full Text PDF

Accurate ligand-protein docking in CASP15 using the ClusPro LigTBM server.

Sergei Kotelnikov Ryota Ashizawa Konstantin I Popov Omeir Khan Mikhail Ignatov Alexander Tropsha

Proteins

December 2023

In the ligand prediction category of CASP15, the challenge was to predict the positions and conformations of small molecules binding to proteins that were provided as amino acid sequences or as models generated by the AlphaFold2 program. For most targets, we used our template-based ligand docking program ClusPro ligTBM, also implemented as a public server available at https://ligtbm.cluspro.

View Article and Find Full Text PDF

Lies and Liabilities: Computational Assessment of High-Throughput Screening Hits to Identify Artifact Compounds.

Vinicius M Alves Adam Yasgar James Wellnitz Ganesha Rai Marielle Rath Alexander Tropsha

J Med Chem

September 2023

Hits from high-throughput screening (HTS) of chemical libraries are often false positives due to their interference with assay detection technology. In response, we generated the largest publicly available library of chemical liabilities and developed "Liability Predictor," a free web tool to predict HTS artifacts. More specifically, we generated, curated, and integrated HTS data sets for thiol reactivity, redox activity, and luciferase (firefly and nano) activity and developed and validated quantitative structure-interference relationship (QSIR) models to predict these nuisance behaviors.

View Article and Find Full Text PDF

Identifying a causal link between prolactin signaling pathways and COVID-19 vaccine-induced menstrual changes.

Rima Hajjo Ensaf Momani Dima A Sabbah Nancy Baker Alexander Tropsha

NPJ Vaccines

September 2023

COVID-19 vaccines have been instrumental tools in the fight against SARS-CoV-2 helping to reduce disease severity and mortality. At the same time, just like any other therapeutic, COVID-19 vaccines were associated with adverse events. Women have reported menstrual cycle irregularity after receiving COVID-19 vaccines, and this led to renewed fears concerning COVID-19 vaccines and their effects on fertility.

View Article and Find Full Text PDF

Praemonitus praemunitus: can we forecast and prepare for future viral disease outbreaks?

Zoe Sessions Tesia Bobrowski Holli-Joi Martin Jon-Michael T Beasley Aneri Kothari Alexander Tropsha

FEMS Microbiol Rev

September 2023

Understanding the origins of past and present viral epidemics is critical in preparing for future outbreaks. Many viruses, including SARS-CoV-2, have led to significant consequences not only due to their virulence, but also because we were unprepared for their emergence. We need to learn from large amounts of data accumulated from well-studied, past pandemics and employ modern informatics and therapeutic development technologies to forecast future pandemics and help minimize their potential impacts.

View Article and Find Full Text PDF

PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking.

Michael Brocidiacono Konstantin I Popov David Ryan Koes Alexander Tropsha

ArXiv

July 2023

Molecular docking aims to predict the 3D pose of a small molecule in a protein binding site. Traditional docking methods predict ligand poses by minimizing a physics-inspired scoring function. Recently, a diffusion model has been proposed that iteratively refines a ligand pose.

View Article and Find Full Text PDF

Small molecule antiviral compound collection (SMACC): A comprehensive, highly curated database to support the discovery of broad-spectrum antiviral drug molecules.

Holli-Joi Martin Cleber C Melo-Filho Daniel Korn Richard T Eastman Ganesha Rai Alexander Tropsha

Antiviral Res

September 2023

Diseases caused by new viruses cost thousands if not millions of human lives and trillions of dollars. We have identified, collected, curated, and integrated all chemogenomics data from ChEMBL for 13 emerging viruses that hold the greatest potential threat to global human health. By identifying and solving several challenges related to data annotation accuracy, we developed a highly curated and thoroughly annotated database of compounds tested in both phenotypic and target-based assays for these viruses that we dubbed SMACC (Small Molecule Antiviral Compound Collection).

View Article and Find Full Text PDF

Generative and reinforcement learning approaches for the automated de novo design of bioactive compounds.

Maria Korshunova Niles Huang Stephen Capuzzi Dmytro S Radchenko Olena Savych Alexander Tropsha

Commun Chem

October 2022

Deep generative neural networks have been used increasingly in computational chemistry for de novo design of molecules with desired properties. Many deep learning approaches employ reinforcement learning for optimizing the target properties of the generated molecules. However, the success of this approach is often hampered by the problem of sparse rewards as the majority of the generated molecules are expectedly predicted as inactives.

View Article and Find Full Text PDF

Efficient design of peptide-binding polymers using active learning approaches.

Assima Rakhimbekova Anton Lopukhov Natalia Klyachko Alexander Kabanov Timur I Madzhidov Alexander Tropsha

J Control Release

January 2023

Active learning (AL) has become a subject of active recent research both in industry and academia as an efficient approach for rapid design and discovery of novel chemicals, materials, and polymers. Herein, we have assessed the applicability of AL for the discovery of polymeric micelle formulations for poorly soluble drugs. We were motivated by the key advantages of this approach making it a desirable strategy for rational design of drug delivery systems due toto its ability to (i) employ relatively small datasets for model development, (ii) iterate between model development and model assessment using small external datasets that can be either generated in focused experimental studies or formed from subsets of the initial training data, and (iii) progressively evolve models towards increasingly more reliable predictions and the identification of novel chemicals with the desired properties.

View Article and Find Full Text PDF

Integrated approach to elucidate metal-implant related adverse outcome pathways.

Jon-Michael T Beasley Daniel R Korn Konstantin I Popov Reagan L Dumproff Zoe L Sessions Alexander Tropsha

Regul Toxicol Pharmacol

December 2022

Exogenous metal particles and ions from implant devices are known to cause severe toxic events with symptoms ranging from adverse local tissue reactions to systemic toxicities, potentially leading to the development of cancers, heart conditions, and neurological disorders. Toxicity mechanisms, also known as Adverse Outcome Pathways (AOPs), that explain these metal-induced toxicities are severely understudied. Therefore, we deployed in silico structure- and knowledge-based approaches to identify proteome-level perturbations caused by metals and pathways that link these events to human diseases.

View Article and Find Full Text PDF

Novel computational models offer alternatives to animal testing for assessing eye irritation and corrosion potential of chemicals.

Arthur C Silva Joyce V V B Borba Vinicius M Alves Steven U S Hall Nicholas Furnham Alexander Tropsha

Artif Intell Life Sci

December 2021

Eye irritation and corrosion are fundamental considerations in developing chemicals to be used in or near the eye, from cleaning products to ophthalmic solutions. Unfortunately, animal testing is currently the standard method to identify compounds that cause eye irritation or corrosion. Yet, there is growing pressure on the part of regulatory agencies both in the USA and abroad to develop New Approach Methodologies (NAMs) that help reduce the need for animal testing and address unmet need to modernize safety evaluation of chemical hazards.

View Article and Find Full Text PDF