Molecular Property Diagnostic Suite Compound Library (MPDS-CL) is an open-source Galaxy-based cheminformatics web portal which presents a structure-based classification of the molecules. A structure-based classification of nearly 150 million unique compounds, obtained from 42 publicly available databases and curated for redundancy removal through 97 hierarchically well-defined atom composition-based portions, has been done. These are further subjected to 56-bit fingerprint-based classification algorithm which led to the formation of 56 structurally well-defined classes. The classes thus obtained were further divided into clusters based on their molecular weight. Thus, the entire set of molecules was put into 56 different classes and 625 clusters. This led to the assignment of a unique ID, named as MPDS-AadharID, for each of these 149,169,443 molecules. MPDS-AadharID is akin to the unique number given to citizens in India (similar to SSN in the US and NINO in the UK). The unique features of MPDS-CL are (a) several search options, such as exact structure search, substructure search, property-based search, fingerprint-based search, using SMILES, InChIKey and key-in; (b) automatic generation of information for the processing for MPDS and other galaxy tools; (c) providing the class and cluster of a molecule which makes it easier and fast to search for similar molecules and (d) information related to the presence of the molecules in multiple databases. The MPDS-CL can be accessed at https://mpds.neist.res.in:8086/ .
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s11030-023-10752-1 | DOI Listing |
Carbohydr Res
January 2025
Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France. Electronic address:
Protein-carbohydrate interactions play a crucial role in numerous fundamental biological processes. Thus, description and comparison of the carbohydrate binding site (CBS) architecture is of great importance for understanding of the underlying biological mechanisms. However, traditional approaches for carbohydrate-binding protein analysis and annotation rely primarily on the sequence-based methods applied to specific protein classes.
View Article and Find Full Text PDFiScience
December 2024
Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, US.
Pangenome indexes are promising tools for many applications, including classification of nanopore sequencing reads. Move structure is a compressed-index data structure based on the Burrows-Wheeler Transform (BWT). It offers simultaneous O(1)-time queries and O(r) space, where r is the number of BWT runs (consecutive sequence of identical characters).
View Article and Find Full Text PDFSensors (Basel)
November 2024
Department of Electrical and Electronic Engineering, Auckland University of Technology, Auckland 1010, New Zealand.
Spoofing attacks (or Presentation Attacks) are easily accessible to facial recognition systems, making the online financial system vulnerable. Thus, it is urgent to develop an anti-spoofing solution with superior generalization ability due to the high demand for spoofing attack detection. Although multi-modality methods such as combining depth images with RGB images and feature fusion methods could currently perform well with certain datasets, the cost of obtaining the depth information and physiological signals, especially that of the biological signal is relatively high.
View Article and Find Full Text PDFbioRxiv
November 2024
Tanz Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Canada.
Viruses
October 2024
Vaccine Bio Research Institute, College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea.
Chikungunya virus (CHIKV), responsible for a mosquito-borne viral illness, has rapidly spread worldwide, posing a significant global health threat. In this study, we explored the immunogenic variability of CHIKV envelope 2 (E2), a pivotal component in the anti-CHIKV immune response, using an in silico approach. After extracting the representative sequence types of the CHIKV E2 antigen, we predicted the structure-based B-cell epitopes and MHC I and II binding T-cell epitopes.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!