In this paper, we introduce DeLA-DrugSelf, an upgraded version of DeLA-Drug [J. Chem. Inf. Model. 62 (2022) 1411-1424], which incorporates essential advancements for automated multi-objective de novo design. Unlike its predecessor, which relies on SMILES notation for molecular representation, DeLA-DrugSelf employs a novel and robust molecular representation string named SELFIES (SELF-referencing Embedded String). The generation process in DeLA-DrugSelf not only involves substitutions to the initial string representing the starting query molecule but also incorporates insertions and deletions. This enhancement makes DeLA-DrugSelf significantly more adept at executing data-driven scaffold decoration and lead optimization strategies. Remarkably, DeLA-DrugSelf explicitly addresses the SELFIES-related collapse issue, considering only collapse-free compounds during generation. These compounds undergo a rigorous quality metrics evaluation, highlighting substantial advancements in terms of drug-likeness, uniqueness, and novelty compared to the molecules generated by the previous version of the algorithm. To evaluate the potential of DeLA-DrugSelf as a mutational operator within a genetic algorithm framework for multi-objective optimization, we employed a fitness function based on Pareto dominance. Our objectives focused on target-oriented properties aimed at optimizing known cannabinoid receptor 2 (CB2R) ligands. The results obtained indicate that DeLA-DrugSelf, available as a user-friendly web platform (https://www.ba.ic.cnr.it/softwareic/delaself/), can effectively contribute to the data-driven optimization of starting bioactive molecules based on user-defined parameters.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2024.108486DOI Listing

Publication Analysis

Top Keywords

molecular representation
12
dela-drugself
8
multi-objective novo
8
novo design
8
dela-drugself empowering
4
empowering multi-objective
4
design selfies
4
selfies molecular
4
representation paper
4
paper introduce
4

Similar Publications

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Cleveland Clinic, Cleveland, OH, USA.

Background: Apolipoprotein E (ApoE) is the primary cholesterol and lipid transporting apolipoprotein in the central nervous system (CNS) and is the greatest genetic risk factor for Alzheimer's Disease (AD). There are three main isoforms differing by single amino acid changes: ε3 is "neutral", ε4 is "risk" (Cys112Arg), and ε2 is "resilience" (Arg158Cys). Rare forms (Christchurch, Jacksonville) have also been proposed as resilience alleles, while an ε4-like allele (with Arg61Thr) is present in non-human primates without AD risk.

View Article and Find Full Text PDF

Background: Alzheimer's Disease is marked by the gradual aggregation of pathological proteins, Tau and beta-amyloid, throughout various areas of the brain. The progression of these pathologies follows a consistent pattern, impacting various cellular populations as it advances through each brain region. Previously, we used Bayesian algorithms to create a continuous progression score to mathematically capture the collective aggregation of multiple pathological variables within a specific brain region.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Bavaria, Germany, Department of Psychiatry and Behavioral Sciences, Duke University, Durham, NC, USA.

Background: Despite recent breakthroughs, Alzheimer's disease (AD) remains untreatable. In addition, we are still lacking robust biomarkers for early diagnosis and promising novel targets for therapeutic intervention. To enable utilizing the entirety of molecular evidence in the discovery and prioritization of potential novel biomarkers and targets, we have developed the AD Atlas, a network-based multi-omics data integration platform.

View Article and Find Full Text PDF

Mind the Gap: A Neural Network Framework for Imputing Genotypes in Non-Model Species.

Mol Ecol Resour

January 2025

Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark.

Reduced representation sequencing (RRS) has proven to be a cost-effective solution for sequencing subsets of the genome in non-model species for large-scale studies. However, the targeted nature of RRS approaches commonly introduces large amounts of missing data, leading to reduced statistical power and biased estimates in downstream analyses. Genotype imputation, the statistical inference of missing sites across the genome, is a powerful alternative to overcome the caveats associated with missing sites.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!