Improving the discoverability, accessibility, and citability of omics datasets: a case report.

J Am Med Inform Assoc

Nuclear Receptor Signaling Atlas Informatics Hub, Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, Texas, USA.

Published: March 2017

Although omics datasets represent valuable assets for hypothesis generation, model testing, and data validation, the infrastructure supporting their reuse lacks organization and consistency. Using nuclear receptor signaling transcriptomic datasets as proof of principle, we developed a model to improve the discoverability, accessibility, and citability of published omics datasets. Primary datasets were retrieved from archives, processed to extract data points, then subjected to metadata enrichment and gap filling. The resulting secondary datasets were exposed on responsive web pages to support mining of gene lists, discovery of related datasets, and single-click citation integration with popular reference managers. Automated processes were established to embed digital object identifier-driven links to the secondary datasets in associated journal articles, small molecule and gene-centric databases, and a dataset search engine. Our model creates multiple points of access to reprocessed and reannotated derivative datasets across the digital biomedical research ecosystem, promoting their visibility and usability across disparate research communities.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7651888PMC
http://dx.doi.org/10.1093/jamia/ocw096DOI Listing

Publication Analysis

Top Keywords

omics datasets
12
datasets
9
discoverability accessibility
8
accessibility citability
8
secondary datasets
8
improving discoverability
4
citability omics
4
datasets case
4
case report
4
report omics
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!