DeLA-DrugSelf: Empowering multi-objective de novo design through SELFIES molecular representation.

Domenico Alberga Giuseppe Lamanna Giovanni Graziano Pietro Delre Maria Cristina Lomuscio Nicola Corriero Alessia Ligresti Dritan Siliqi Michele Saviano Marialessandra Contino Angela Stefanachi Giuseppe Felice Mangiatordi

Comput Biol Med

CNR - Institute of Crystallography, Via Amendola 122/o, 70126, Bari, Italy. Electronic address:

Published: June 2024

In this paper, we introduce DeLA-DrugSelf, an upgraded version of DeLA-Drug [J. Chem. Inf. Model. 62 (2022) 1411-1424], which incorporates essential advancements for automated multi-objective de novo design. Unlike its predecessor, which relies on SMILES notation for molecular representation, DeLA-DrugSelf employs a novel and robust molecular representation string named SELFIES (SELF-referencing Embedded String). The generation process in DeLA-DrugSelf not only involves substitutions to the initial string representing the starting query molecule but also incorporates insertions and deletions. This enhancement makes DeLA-DrugSelf significantly more adept at executing data-driven scaffold decoration and lead optimization strategies. Remarkably, DeLA-DrugSelf explicitly addresses the SELFIES-related collapse issue, considering only collapse-free compounds during generation. These compounds undergo a rigorous quality metrics evaluation, highlighting substantial advancements in terms of drug-likeness, uniqueness, and novelty compared to the molecules generated by the previous version of the algorithm. To evaluate the potential of DeLA-DrugSelf as a mutational operator within a genetic algorithm framework for multi-objective optimization, we employed a fitness function based on Pareto dominance. Our objectives focused on target-oriented properties aimed at optimizing known cannabinoid receptor 2 (CB2R) ligands. The results obtained indicate that DeLA-DrugSelf, available as a user-friendly web platform (https://www.ba.ic.cnr.it/softwareic/delaself/), can effectively contribute to the data-driven optimization of starting bioactive molecules based on user-defined parameters.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.compbiomed.2024.108486	DOI Listing

Publication Analysis

Top Keywords

molecular representation

dela-drugself

multi-objective novo

novo design

dela-drugself empowering

empowering multi-objective

design selfies

selfies molecular

representation paper

paper introduce

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Cleveland Clinic, Cleveland, OH, USA.

William Martin Feixiong Cheng

Background: Apolipoprotein E (ApoE) is the primary cholesterol and lipid transporting apolipoprotein in the central nervous system (CNS) and is the greatest genetic risk factor for Alzheimer's Disease (AD). There are three main isoforms differing by single amino acid changes: ε3 is "neutral", ε4 is "risk" (Cys112Arg), and ε2 is "resilience" (Arg158Cys). Rare forms (Christchurch, Jacksonville) have also been proposed as resilience alleles, while an ε4-like allele (with Arg61Thr) is present in non-human primates without AD risk.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Allen Institute for Brain Science, Seattle, WA, USA.

Mariano I Gabitto

Background: Alzheimer's Disease is marked by the gradual aggregation of pathological proteins, Tau and beta-amyloid, throughout various areas of the brain. The progression of these pathologies follows a consistent pattern, impacting various cellular populations as it advances through each brain region. Previously, we used Bayesian algorithms to create a continuous progression score to mathematically capture the collective aggregation of multiple pathological variables within a specific brain region.

View Article and Find Full Text PDF

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Bavaria, Germany, Department of Psychiatry and Behavioral Sciences, Duke University, Durham, NC, USA.

Matthias Arnold

Background: Despite recent breakthroughs, Alzheimer's disease (AD) remains untreatable. In addition, we are still lacking robust biomarkers for early diagnosis and promising novel targets for therapeutic intervention. To enable utilizing the entirety of molecular evidence in the discovery and prioritization of potential novel biomarkers and targets, we have developed the AD Atlas, a network-based multi-omics data integration platform.

View Article and Find Full Text PDF

Similar Publications

Editorial - Melody for a memory: Sleep boosts the brain's representation of sequential events.

Sleep

January 2025

Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI.

Sara J Aton

View Article and Find Full Text PDF

Similar Publications

Mind the Gap: A Neural Network Framework for Imputing Genotypes in Non-Model Species.

Mol Ecol Resour

January 2025

Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark.

Katia Bougiouri

Reduced representation sequencing (RRS) has proven to be a cost-effective solution for sequencing subsets of the genome in non-model species for large-scale studies. However, the targeted nature of RRS approaches commonly introduces large amounts of missing data, leading to reduced statistical power and biased estimates in downstream analyses. Genotype imputation, the statistical inference of missing sites across the genome, is a powerful alternative to overcome the caveats associated with missing sites.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!