Automated evaluation of consistency within the PubChem Compound database.

Sci Data

National Magnetic Resonance Facility at Madison and BioMagResBank, Department of Biochemistry, University of Wisconsin Madison, Madison, Wisconsin 53706, USA.

Published: February 2019

Identification of discrepant data in aggregated databases is a key step in data curation and remediation. We have applied the ALATIS approach, which is based on the international chemical shift identifier (InChI) model, to the full PubChem Compound database to generate unique and reproducible compound and atom identifiers for all entries for which three-dimensional structures were available. This exercise also served to identify entries with discrepancies between structures and chemical formulas or InChI strings. The use of unique compound identifiers and atom nomenclature should support more rigorous links between small-molecule databases including those containing atom-specific information of the type available from crystallography and spectroscopy. The comprehensive results from this analysis are publicly available through our webserver [http://alatis.nmrfam.wisc.edu/].

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6380220PMC
http://dx.doi.org/10.1038/sdata.2019.23DOI Listing

Publication Analysis

Top Keywords

pubchem compound
8
compound database
8
automated evaluation
4
evaluation consistency
4
consistency pubchem
4
compound
4
database identification
4
identification discrepant
4
discrepant data
4
data aggregated
4

Similar Publications

Background/aim: Alzheimer's disease is a complex, incurable to date, multifactorial disease, which suggests the need for continued development of pharmacotherapy.

Materials And Methods: A comprehensive literature search was conducted to identify known ligands with anticholinesterase activity, resulting in the discovery of over 100 alkaloids that are also available in the PubChem database. Subsequently, the ligands underwent molecular docking to evaluate their affinity for the target enzyme.

View Article and Find Full Text PDF

Objective: The effectiveness of Sanhuang Shu'ai decoction (SSD), a traditional Chinese medicine used to treat diarrhea and colitis, especially ulcerative colitis (UC), is not well understood regarding how its chemical components work.

Methods: This research used ultra-high-performance liquid chromatography (UHPLC)-tandem mass spectrometry (MS), network pharmacology, and molecular docking to understand the active substances and potential mechanisms of SSD in treating UC.

Results: UHPLC and MS analyses identified 710 active components in SSD extracts (ZYTQY) and 387 in SSD-containing serum (HYXQ), with 35 active compounds found in both ZYTQY and HYXQ and 67 active compounds from SSDD (SSD compound obtained directly from the database), along with 6 metabolites that may be key components in its function.

View Article and Find Full Text PDF

Katanin, a key protein in cellular architecture, plays a crucial role in severing microtubules, which are vital components of the cytoskeleton. Given its central involvement in cell division and proliferation, katanin represents a promising target for therapeutic intervention, particularly in cancer treatment. Inhibiting katanin's function could potentially hinder the uncontrolled growth of cancerous cells, making it an attractive target for novel anti-cancer therapies.

View Article and Find Full Text PDF

Introduction: Flavonoids including quercetin, kaempferol, myricetin, rutin etc. have always been a part of traditional Chinese medicine for the treatment of several ailments. Rutin (RT), also known as rutoside, sophorin is one of the flavanol glycoside having structure resemblance with quercetin.

View Article and Find Full Text PDF

The Zika virus (ZIKV), an arbovirus within the Flavivirus genus, is associated with severe neurological complications, including Guillain-Barré syndrome in affected individuals and microcephaly in infants born to infected mothers. With no approved vaccines or antiviral treatments available, there is an urgent need for effective therapeutic options. This study aimed to identify new natural compounds with inhibitory potential against the NS2B-NS3 protease (PDB ID: 5LC0), an essential enzyme in viral replication.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!