Datastorr: a workflow and package for delivering successive versions of 'evolving data' directly into R.

Gigascience

Evolution & Ecology Research Centre, and School of Biological, Earth and Environmental Sciences, University of New South Wales, Sydney NSW 2052, Australia.

Published: May 2019

The sharing and re-use of data has become a cornerstone of modern science. Multiple platforms now allow easy publication of datasets. So far, however, platforms for data sharing offer limited functions for distributing and interacting with evolving datasets- those that continue to grow with time as more records are added, errors fixed, and new data structures are created. In this article, we describe a workflow for maintaining and distributing successive versions of an evolving dataset, allowing users to retrieve and load different versions directly into the R platform. Our workflow utilizes tools and platforms used for development and distribution of successive versions of an open source software program, including version control, GitHub, and semantic versioning, and applies these to the analogous process of developing successive versions of an open source dataset. Moreover, we argue that this model allows for individual research groups to achieve a dynamic and versioned model of data delivery at no cost.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6506717PMC
http://dx.doi.org/10.1093/gigascience/giz035DOI Listing

Publication Analysis

Top Keywords

successive versions
16
versions open
8
open source
8
versions
5
datastorr workflow
4
workflow package
4
package delivering
4
successive
4
delivering successive
4
versions 'evolving
4

Similar Publications

Syntheses of Marine Natural Products via Matteson Homologations and Related Processes.

Mar Drugs

January 2025

Organic Chemistry, Saarland University, Campus Building C4.2, D-66123 Saarbruecken, Germany.

Matteson homologation, a successive extension of chiral boronic esters, is perfectly suited for the synthesis of complex molecular structures containing several stereogenic centers. The "classical version" allows the introduction of various functional groups in a 1,2--configuration. The absolute configuration is determined by the choice of the chiral auxiliary, which can be used to introduce several stereogenic centers.

View Article and Find Full Text PDF

The aim of this study is to investigate the effect of cardiometabolic diseases (CMDs) on the development of depressive symptoms and to determine whether socioeconomic status (SES) moderates this effect. A total of 6,455 individual free from depressive symptoms were selected from the China Health and Retirement Longitudinal Study (CHARLS). CMDs and SES were self-reported.

View Article and Find Full Text PDF

Precise modelling of mitochondrial diseases using optimized mitoBEs.

Nature

January 2025

Changping Laboratory, Beijing, The People's Republic of China.

The development of animal models is crucial for studying and treating mitochondrial diseases. Here we optimized adenine and cytosine deaminases to reduce off-target effects on the transcriptome and the mitochondrial genome, improving the accuracy and efficiency of our newly developed mitochondrial base editors (mitoBEs). Using these upgraded mitoBEs (version 2 (v2)), we targeted 70 mouse mitochondrial DNA mutations analogous to human pathogenic variants, establishing a foundation for mitochondrial disease mouse models.

View Article and Find Full Text PDF

Post-traumatic stress disorder (PTSD) symptom clusters associated with an indicator of heart rate variability: The ADVANCE cohort study.

J Affect Disord

January 2025

King's Centre for Military Health Research, King's College London, SE5 9RJ, United Kingdom of Great Britain and Northern Ireland; Academic Department of Military Mental Health, King's College London, SE5 9RJ, United Kingdom of Great Britain and Northern Ireland.

Background: Heart rate variability (HRV) is governed by sympathetic and parasympathetic regulatory systems. Post-Traumatic Stress Disorder (PTSD) may influence these systems and consequently affect cardiovascular functioning.

Methods: The sample consisted of 860 UK male military personnel approximately half of whom had sustained physical combat injuries in Afghanistan.

View Article and Find Full Text PDF

Dwarfism is a major trait for developing lodging-resistant rice cultivars. Gamma irradiation-induced mutagenesis has proven to be an effective method for generating dwarf rice mutants. In this research, we isolated a dwarf mutant from Anna R (4) in the M generation and subsequently stabilized the trait through successive selfing of progeny across the M-M generations.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!