Background: Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration.

Objective: We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015.

Methods: Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists.

Results: Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes-a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon.

Conclusions: This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014984PMC
http://dx.doi.org/10.2196/jmir.6365DOI Listing

Publication Analysis

Top Keywords

data scientists
8
critical care
8
suite tools
8
chatto
5
datathons software
4
software promote
4
promote reproducible
4
reproducible background
4
background datathons
4
datathons facilitate
4

Similar Publications

Postdoc publications and citations link to academic retention and faculty success.

Proc Natl Acad Sci U S A

January 2025

Computer Science Program, Science Division, New York University Abu Dhabi, Abu Dhabi 129188, United Arab Emirates.

Postdoctoral training is a career stage often described as a demanding and anxiety-laden time when many promising PhDs see their academic dreams slip away due to circumstances beyond their control. We use a unique dataset of academic publishing and careers to chart the more or less successful postdoctoral paths. We build a measure of academic success on the citation patterns two to five years into a faculty career.

View Article and Find Full Text PDF

Background: The recent wave of clinical trials of psychedelic substances among patients with life-limiting illness has largely focused on individual healing. This most often translates to a single patient receiving an intervention with researchers guiding them. As social isolation and lack of connection are major drivers of current mental health crises and group work is expected to be an important aspect of psychedelic assisted psychotherapy, it is essential that we understand the role of community in psychedelic healing.

View Article and Find Full Text PDF

Advancing Ethical Considerations for Data Science in Injury and Violence Prevention.

Public Health Rep

January 2025

Division of Injury Prevention, National Center for Injury Prevention and Control, Centers for Disease Control and Prevention, Atlanta, GA, USA.

Data science is an emerging field that provides new analytical methods. It incorporates novel data sources (eg, internet data) and methods (eg, machine learning) that offer valuable and timely insights into public health issues, including injury and violence prevention. The objective of this research was to describe ethical considerations for public health data scientists conducting injury and violence prevention-related data science projects to prevent unintended ethical, legal, and social consequences, such as loss of privacy or loss of public trust.

View Article and Find Full Text PDF

Introduction: Disparities in oral health are related to dental care knowledge, domestic oral hygiene practices and socioeconomic status. This cross-sectional study aimed to compare the oral hygiene and dental care practices of migrant, Arab, and Jewish children residing in Tel Aviv, Israel, and assess the influence of parental dental practices.

Methods: Data were collected from parents of children aged 3 to 6 years.

View Article and Find Full Text PDF

Decoding the chicken gastrointestinal microbiome.

BMC Microbiol

January 2025

School of Biological Sciences, Institute for Global Food Security, Queen's University Belfast, 19 Chlorine Gardens, Belfast, BT9 5DL, UK.

Metataxonomic studies have underpinned a vast understanding of microbial communities residing within livestock gastrointestinal tracts, albeit studies have often not been combined to provide a global census. Consequently, in this study we characterised the overall and common 'core' chicken microbiota associated with the gastrointestinal tract (GIT), whilst assessing the effects of GIT site, bird breed, age and geographical location on the GIT resident microbes using metataxonomic data compiled from studies completed across the world. Specifically, bacterial 16S ribosomal DNA sequences from GIT samples associated with various breeds, differing in age, GIT sites (caecum, faeces, ileum and jejunum) and geographical location were obtained from the Sequence Read Archive and analysed using the MGnify pipeline.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!