Beilstein J Org Chem
August 2024
Active learning allows algorithms to steer iterative experimentation to accelerate and de-risk molecular optimizations, but actively trained models might still exhibit poor performance during early project stages where the training data is limited and model exploitation might lead to analog identification with limited scaffold diversity. Here, we present ActiveDelta, an adaptive approach that leverages paired molecular representations to predict improvements from the current best training compound to prioritize further data acquisition. We apply the ActiveDelta concept to both graph-based deep (Chemprop) and tree-based (XGBoost) models during exploitative active learning for 99 K benchmarking datasets.
View Article and Find Full Text PDFMolecular machine learning algorithms are becoming increasingly powerful at predicting the potency of potential drug candidates to guide molecular discovery, lead series prioritization, and structural optimization. However, a substantial amount of inhibition data is bounded and inaccessible to traditional regression algorithms. Here, we develop a novel molecular pairing approach to process this data.
View Article and Find Full Text PDFMulti-factor screenings are commonly used in diverse applications in medicine and bioengineering, including optimizing combination drug treatments and microbiome engineering. Despite the advances in high-throughput technologies, large-scale experiments typically remain prohibitively expensive. Here we introduce a machine learning platform, structure-augmented regression (SAR), that exploits the intrinsic structure of each biological system to learn a high-accuracy model with minimal data requirement.
View Article and Find Full Text PDFOwing to their distinct physical and chemical properties, inorganic nanoparticles (NPs) have shown promising results in preclinical cancer therapy, but designing and engineering them for effective therapeutic purposes remains a challenge. Although a comprehensive database of inorganic NP research is not currently available, it is crucial for developing effective cancer therapies. In this context, machine learning (ML) has emerged as a transformative tool, but its adaptation to nanomedicine is hindered by inexistent or small datasets.
View Article and Find Full Text PDFProdrugs are derivatives with superior properties compared with the parent active pharmaceutical ingredient (API), which undergo biotransformation after administration to generate the API in situ. Although sharing this general characteristic, prodrugs encompass a wide range of different chemical structures, therapeutic indications and properties. Here we provide the first holistic analysis of the current landscape of approved prodrugs using cheminformatics and data science approaches to reveal trends in prodrug development.
View Article and Find Full Text PDFComputation promises to accelerate, de-risk and optimize drug research and development. An increasing number of companies have entered this space, specializing in the design of new algorithms, computing on proprietary data, and/or development of hardware to improve distinct drug pipeline stages. The large number of such companies and their unique strategies and deals have created a highly complex and competitive industry.
View Article and Find Full Text PDFIn vitro systems that accurately model in vivo conditions in the gastrointestinal tract may aid the development of oral drugs with greater bioavailability. Here we show that the interaction profiles between drugs and intestinal drug transporters can be obtained by modulating transporter expression in intact porcine tissue explants via the ultrasound-mediated delivery of small interfering RNAs and that the interaction profiles can be classified via a random forest model trained on the drug-transporter relationships. For 24 drugs with well-characterized drug-transporter interactions, the model achieved 100% concordance.
View Article and Find Full Text PDFEstablished molecular machine learning models process individual molecules as inputs to predict their biological, chemical, or physical properties. However, such algorithms require large datasets and have not been optimized to predict property differences between molecules, limiting their ability to learn from smaller datasets and to directly compare the anticipated properties of two molecules. Many drug and material development tasks would benefit from an algorithm that can directly compare two molecules to guide molecular optimization and prioritization, especially for tasks with limited available data.
View Article and Find Full Text PDFDevelopments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial intelligence approaches such as machine learning have led to exciting developments in the computational drug design field, facilitating biological activity prediction and de novo drug design for molecular targets of interest. Here, we describe current and future synergies between these developments to effectively identify drug candidates from the plethora of molecules produced by nature.
View Article and Find Full Text PDFTriggerable coatings, such as pH-responsive polymethacrylate copolymers, can be used to protect the active pharmaceutical ingredients contained within oral solid dosage forms from the acidic gastric environment and to facilitate drug delivery directly to the intestine. However, gastrointestinal pH can be highly variable, which can reduce delivery efficiency when using pH-responsive drug delivery technologies. We hypothesized that biomaterials susceptible to proteolysis could be used in combination with other triggerable polymers to develop novel enteric coatings.
View Article and Find Full Text PDFBackground: Patients undergoing cardiac surgery are prone to numerous complications. Increased vascular permeability may be associated with morbidity and mortality due to hemodynamic instability, fluid overload, and edema formation. We hypothesized that markers of endothelial injury and inflammation are associated with capillary leak, ultimately increasing the risk of postoperative complications.
View Article and Find Full Text PDFJ Chem Inf Model
August 2023
Marginalized graph kernels have shown competitive performance in molecular machine learning tasks but currently lack measures of interpretability, which are important to improve trust in the models, detect biases, and inform molecular optimization campaigns. We here conceive and implement two interpretability measures for Gaussian process regression using a marginalized graph kernel (GPR-MGK) to quantify (1) the contribution of specific training data to the prediction and (2) the contribution of specific nodes of the graph to the prediction. We demonstrate the applicability of these interpretability measures for molecular property prediction.
View Article and Find Full Text PDFBackground: The concomitant occurrence of the symptoms intravascular hypovolemia, peripheral edema and hemodynamic instability is typically named Capillary Leak Syndrome (CLS) and often occurs in surgical critical ill patients. However, neither a unitary definition nor standardized diagnostic criteria exist so far. We aimed to investigate common characteristics of this phenomenon with a subsequent scoring system, determining whether CLS contributes to mortality.
View Article and Find Full Text PDFContinuous monitoring in the intensive care setting has transformed the capacity to rapidly respond with interventions for patients in extremis. Noninvasive monitoring has generally been limited to transdermal or intravascular systems coupled to transducers including oxygen saturation or pressure. Here it is hypothesized that gastric fluid (GF) and gases, accessible through nasogastric (NG) tubes, commonly found in intensive care settings, can provide continuous access to a broad range of biomarkers.
View Article and Find Full Text PDFObjectives: Menopause is often characterized by detrimental metabolic changes, such as obesity, insulin resistance, and impaired glucose tolerance, often requiring treatment. KeyBioscience Peptides (KBPs) are Dual Amylin and Calcitonin Receptor Agonists which have shown promising metabolic effects in rats. The objective of this study was to investigate the in vivo effect of KBP on the metabolic health in a model driven by unhealthy diet, age, and menopause.
View Article and Find Full Text PDFPurpose: A multitude of different versions of the same medication with different inactive ingredients are currently available. It has not been quantified how this has evolved historically. Furthermore, it is unknown whether healthcare professionals consider the inactive ingredient portion when prescribing medications to patients.
View Article and Find Full Text PDFMonolayers of cancer-derived cell lines are widely used in the modelling of the gastrointestinal (GI) absorption of drugs and in oral drug development. However, they do not generally predict drug absorption in vivo. Here, we report a robotically handled system that uses large porcine GI tissue explants that are functionally maintained for an extended period in culture for the high-throughput interrogation (several thousand samples per day) of whole segments of the GI tract.
View Article and Find Full Text PDFSprifermin, recombinant human fibroblast growth factor 18 (rhFGF18), induces cartilage regeneration in knees of patients with osteoarthritis (OA). We hypothesized that a temporal multiphasic process of extracellular matrix (ECM) degradation and formation underlie this effect. We aimed to characterize the temporal ECM remodeling of human knee OA articular cartilage in response to sprifermin treatment.
View Article and Find Full Text PDFInactive ingredients and generally recognized as safe compounds are regarded by the US Food and Drug Administration (FDA) as benign for human consumption within specified dose ranges, but a growing body of research has revealed that many inactive ingredients might have unknown biological effects at these concentrations and might alter treatment outcomes. To speed up such discoveries, we apply state-of-the-art machine learning to delineate currently unknown biological effects of inactive ingredients-focusing on P-glycoprotein (P-gp) and uridine diphosphate-glucuronosyltransferase-2B7 (UGT2B7), two proteins that impact the pharmacokinetics of approximately 20% of FDA-approved drugs. Our platform identifies vitamin A palmitate and abietic acid as inhibitors of P-gp and UGT2B7, respectively; in silico, in vitro, ex vivo, and in vivo validations support these interactions.
View Article and Find Full Text PDFDrug Discov Today Technol
December 2019
Active machine learning enables the automated selection of the most valuable next experiments to improve predictive modelling and hasten active retrieval in drug discovery. Although a long established theoretical concept and introduced to drug discovery approximately 15 years ago, the deployment of active learning technology in the discovery pipelines across academia and industry remains slow. With the recent re-discovered enthusiasm for artificial intelligence as well as improved flexibility of laboratory automation, active learning is expected to surge and become a key technology for molecular optimizations.
View Article and Find Full Text PDF