Introduction: We developed Data Base similarity (DBsimilarity), a user-friendly tool designed to organize structure databases into similarity networks, with the goal of facilitating the visualization of information primarily for natural product chemists who may not have coding experience.
Method: DBsimilarity, written in Jupyter Notebooks, converts Structure Data File (SDF) files into Comma-Separated Values (CSV) files, adds chemoinformatics data, constructs an MZMine custom database file and an NMRfilter candidate list of compounds for rapid dereplication of MS and 2D NMR data, calculates similarities between compounds, and constructs CSV files formatted into similarity networks for Cytoscape.
Results: The Lotus database was used as a source for Ginkgo biloba compounds, and DBsimilarity was used to create similarity networks including NPClassifier classification to indicate biosynthesis pathways.
Introduction: Natural products and metabolomics are intrinsically linked through efforts to analyze complex mixtures for compound annotation. Although most studies that aim for compound identification in mixtures use MS as the main analysis technique, NMR has complementary advances that are worth exploring for enhanced structural confidence.
Objective: This review aimed to showcase a portfolio of the main tools available for compound identification using NMR.