The rapid increase in the amount of genomic data provides researchers with an opportunity to integrate diverse datasets and annotations when addressing a wide range of biological questions. However, genomic datasets are deposited on different platforms and are stored in numerous formats from multiple genome builds, which complicates the task of collecting, annotating, transforming, and integrating data as needed. Here, we developed Go Get Data (GGD) as a fast, reproducible approach to installing standardized data recipes. GGD is available on GitHub (https://gogetdata.github.io/), is extendable to other data types, and can streamline the complexities typically associated with data integration, saving researchers time and improving research reproducibility.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8041854
DOI: http://dx.doi.org/10.1038/s41467-021-22381-z
