Scikick: A sidekick for workflow clarity and reproducibility during extensive data analysis.

PLoS One

The Krembil Family Epigenetics Laboratory, The Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, Ontario, Canada.

Published: July 2023

Reproducibility is crucial for scientific progress, yet a clear research data analysis workflow is challenging to implement and maintain. As a result, a record of computational steps performed on the data to arrive at the key research findings is often missing. We developed Scikick, a tool that eases the configuration, execution, and presentation of scientific computational analyses. Scikick allows for workflow configurations with notebooks as the units of execution, defines a standard structure for the project, automatically tracks the defined interdependencies between the data analysis steps, and implements methods to compile all research results into a cohesive final report. Utilities provided by Scikick help turn the complicated management of transparent data analysis workflows into a standardized and feasible practice. Scikick version 0.2.1 code and documentation is available as supplementary material. The Scikick software is available on GitHub (https://github.com/matthewcarlucci/scikick) and is distributed with PyPi (https://pypi.org/project/scikick/) under a GPL-3 license.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10374128PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0289171PLOS

Publication Analysis

Top Keywords

data analysis
16
scikick
6
data
5
scikick sidekick
4
sidekick workflow
4
workflow clarity
4
clarity reproducibility
4
reproducibility extensive
4
extensive data
4
analysis
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!