FGDB: a comprehensive graph database of ligand fragments from the Protein Data Bank.

Database (Oxford)

Department of Sciences, Roma Tre University, viale Marconi 446, Roma, Lazio 00146, Italy.

Published: June 2022

This work presents Fragment Graph DataBase (FGDB), a graph database of ligand fragments extracted and generated from the protein entries available in the Protein Data Bank (PDB). FGDB is meant to support and elicit campaigns of fragment-based drug design, by enabling users to query it in order to construct ad hoc, target-specific libraries. In this regard, the database features more than 17 000 fragments, typically small, highly soluble and chemically stable molecules expressed via their canonical Simplified Molecular Input Line Entry System (SMILES) representation. For these fragments, the database provides information related to their contact frequencies with the amino acids, the ligands they are contained in and the proteins the latter bind to. The graph database can be queried via standard web forms and textual searches by a number of identifiers (SMILES, ligand and protein PDB ids) as well as via graphical queries that can be performed against the graph itself, providing users with an intuitive and effective view upon the underlying biological entities. Further search mechanisms via advanced conjunctive/disjunctive/negated textual queries are also possible, in order to allow scientists to look for specific relationships and export their results for further studies. This work also presents two sample use cases where maternal embryonic leucine zipper kinase and mesotrypsin are used as a target, being proteins of high biomedical relevance for the development of cancer therapies. Database URL: http://biochimica3.bio.uniroma3.it/fragments-web/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239314PMC
http://dx.doi.org/10.1093/database/baac044DOI Listing

Publication Analysis

Top Keywords

graph database
16
database ligand
8
ligand fragments
8
protein data
8
data bank
8
work presents
8
database
7
graph
5
fgdb comprehensive
4
comprehensive graph
4

Similar Publications

Proteins' flexibility is a feature in communicating changes in cell signaling instigated by binding with secondary messengers, such as calcium ions, associated with the coordination of muscle contraction, neurotransmitter release, and gene expression. When binding with the disordered parts of a protein, calcium ions must balance their charge states with the shape of calcium-binding proteins and their versatile pool of partners depending on the circumstances they transmit. Accurately determining the ionic charges of those ions is essential for understanding their role in such processes.

View Article and Find Full Text PDF

The Acinetobacter baumannii is a member of the "ESKAPE" bacteria responsible for many serious multidrug-resistant (MDR) illnesses. This bacteria swiftly adapts to environmental cues leading to the emergence of multidrug-resistant variants, particularly in hospital/medical settings. In this work, we have demonstrated the outer membrane protein 33-36 (Omp33-36) porin as a potential therapeutic target in A.

View Article and Find Full Text PDF

A multiscale molecular structural neural network for molecular property prediction.

Mol Divers

January 2025

Key Laboratory for Macromolecular Science of Shaanxi Province, School of Chemistry and Chemical Engineering, Shaanxi Normal University, Xi'an, 710119, People's Republic of China.

Molecular Property Prediction (MPP) is a fundamental task in important research fields such as chemistry, materials, biology, and medicine, where traditional computational chemistry methods based on quantum mechanics often consume substantial time and computing power. In recent years, machine learning has been increasingly used in computational chemistry, in which graph neural networks have shown good performance in molecular property prediction tasks, but they have some limitations in terms of generalizability, interpretability, and certainty. In order to address the above challenges, a Multiscale Molecular Structural Neural Network (MMSNet) is proposed in this paper, which obtains rich multiscale molecular representations through the information fusion between bonded and non-bonded "message passing" structures at the atomic scale and spatial feature information "encoder-decoder" structures at the molecular scale; a multi-level attention mechanism is introduced on the basis of theoretical analysis of molecular mechanics in order to enhance the model's interpretability; the prediction results of MMSNet are used as label values and clustered in the molecular library by the K-NN (K-Nearest Neighbors) algorithm to reverse match the spatial structure of the molecules, and the certainty of the model is quantified by comparing virtual screening results across different K-values.

View Article and Find Full Text PDF

Data Checking of Asymmetric Catalysis Literature Using a Graph Neural Network Approach.

Molecules

January 2025

GSK Carbon Neutral Laboratories for Sustainable Chemistry, Jubilee Campus, University of Nottingham, Triumph Road, Nottingham NG7 2TU, UK.

The range of chemical databases available has dramatically increased in recent years, but the reliability and quality of their data are often negatively affected by human-error fidelity. The size of chemical databases can make manual data curation/checking of such sets time consuming; thus, automated tools to help this process are highly desirable. Herein, we propose the use of Graph Neural Networks (GNNs) to identifying potential stereochemical misassignments in the primary asymmetric catalysis literature.

View Article and Find Full Text PDF

Analysis of the Climate Impact on Occupational Health and Safety Using Heat Stress Indexes.

Int J Environ Res Public Health

January 2025

Production Engineering Postgraduate Program, Production Engineering Department, State University of Maringá, Av. Colombo, 5790, Bloco 19/20, Zona 7, Maringá 87020-900, Paraná, Brazil.

Unlabelled: Workers may be exposed to conditions that put their physical and mental integrity at risk, from workplace settings to climate characteristics. Heat stress is a harmful health condition caused by exceeding the human body's tolerance limits, leading to illness and increasing the chance of work accidents. Heat stress indexes, such as the Humidex and the Heat Index (HI), are used to measure these impacts.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!