Using self-organizing maps to accelerate similarity search.

Bioorg Med Chem

Laboratoire d'Infochimie UMR 7177, Université de Strasbourg, 1, rue B. Pascal, 67000 Strasbourg, France.

Published: September 2012

While self-organizing maps (SOM) have often been used to map and describe chemical space, this paper focuses on their use to accelerate similarity searches based on vectors of high-dimensional real-value descriptors for which classical, binary fingerprint-based similarity speed-up procedures do not apply. Fuzzy tricentric pharmacophore (FPT) and ISIDA substructure counts are herein explored examples. Similarity search speed-up was achieved by positioning compounds on a SOM, then searching for analogues only in the neurons neighbouring the ones in which the query compounds reside. Smaller neighbourhood means shorter virtual screening (VS) time, but lower analogue retrieval rates. An enhancement criterion, conciliating the opposite trends is defined. It depends on map definition and build-up protocol (training set choice, map size, convergence criteria,…). The main goal is to discover and validate SOMs of optimal quality with respect to this criterion. Increasing the size of the training set beyond a certain limit is shown to be unnecessary and even detrimental, suggesting that one SOM built on a relatively small but diverse training set may be an effective VS enhancer of a much larger database. Also, using an excessively large number of training iterations may lead to over-fitting. Gradual training with en-route checking of VS enhancement propensity is the best strategy to follow. Maps were successfully challenged to accelerate the large-scale VS of 12,000 queries against 160,000 compounds, and shown to provide a meaningful mapping of activity-annotated compounds in chemical space.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.bmc.2012.04.024DOI Listing

Publication Analysis

Top Keywords

training set
12
self-organizing maps
8
accelerate similarity
8
similarity search
8
chemical space
8
training
5
maps accelerate
4
similarity
4
search self-organizing
4
maps som
4

Similar Publications

Managed honeybees and soil nitrogen availability interactively modulate sunflower production in intensive agricultural landscapes of China.

J Econ Entomol

December 2024

Ministry of Education Key Laboratory of Ecology and Resource Use of the Mongolian Plateau & Inner Mongolia Key Laboratory of Grassland Ecology, School of Ecology and Environment, Inner Mongolia University, Hohhot, China.

Insects provide important pollination services for cops. While land use intensification has resulted in steep declines of wild pollinator diversity across agricultural landscapes, releasing managed honeybees has been proposed as a countermeasure. However, it remains uncertain whether managed honeybees can close the pollination gap of sunflower (Helianthus annuus L.

View Article and Find Full Text PDF

laparoscopy has emerged as a pivotal tool for the management of acute abdominal pathologies. It provides diagnostic and therapeutic advantages, enabling surgeons to evaluate and address diverse acute abdominal conditions using minimally invasive techniques. The aim of this consensus was to obtain evidence-based guidance for surgeons regarding the utilization of laparoscopy in emergency medical settings, and has been divided into trauma and non-trauma emergencies.

View Article and Find Full Text PDF

microRNAs (miRNAs) are central post-transcriptional gene expression regulators in healthy and diseased states. Despite decades of effort, deciphering miRNA targets remains challenging, leading to an incomplete miRNA interactome and partially elucidated miRNA functions. Here, we introduce microT-CNN, an avant-garde deep convolutional neural network model that moves the needle by integrating hundreds of tissue-matched (in-)direct experiments from 26 distinct cell types, corresponding to a unique training and evaluation set of >60 000 miRNA binding events and ~30 000 unique miRNA-gene target pairs.

View Article and Find Full Text PDF

Prevalence, Pattern and Factors Associated with Consumption of Sweetened Beverages Among Adolescents in Ogun State, Nigeria.

West Afr J Med

August 2024

Springhead Health Limited, General Practitioner in Primary Care Department, Gravesend, Kent, United Kingdom.

Background: Globally, there has been an increase in the trend of sugar-sweetened beverages (SSB) consumption among adolescents and this has been implicated in the increased prevalence of diet-related NonCommunicable Diseases.

Objectives: This study compared the pattern of sweetened beverage consumption and factors associated with consumption among adolescents in rural and urban areas of Ogun State, Nigeria.

Methods: A comparative cross-sectional study was conducted among in-school adolescents in rural and urban areas of Ogun State.

View Article and Find Full Text PDF

Background: Preoperative determination of muscular infiltration is crucial for appropriate treatment planning in patients with muscle-invasive bladder cancer (MIBC). We aimed to explore early diagnostic biomarkers in serum for MIBC in this study.

Methods: The expression profiles of long noncoding RNA (lncRNA) were initially screened by high-throughput sequencing and evaluation of potential lncRNAs were conducted by two phases of RT-qPCR assays using serum samples from 190 patients with MIBC and 190 non-muscle-invasive BC (NMIBC) patients.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!