Development of a REDCap-based workflow for high-volume relational data analysis on real-time data in a medical department using open source software.

Comput Methods Programs Biomed

Department of Medical Sciences, Nuclear Medicine, AOU Città della Salute e della Scienza di Torino, University of Turin, Corso Bramante 88, Turin 10126, Italy.

Published: November 2022

Background/aim: The current availability of large volumes of clinical data has provided medical departments with the opportunity for large-scale analyses, but it has also brought forth the need for an effective strategy of data-storage and data-analysis that is both technically feasible and economically sustainable in the context of limited resources and manpower. Therefore, the aim of this study was to develop a widely-usable data-collection and data-analysis workflow that could be applied in medical departments to perform high-volume relational data analysis on real-time data.

Methods: A sample project, based on a research database on prostate-specific-membrane-antigen/positron-emission-tomography scans performed in prostate cancer patients at our department, was used to develop a new workflow for data-collection and data-analysis. A checklist of requirements for a successful data-collection/analysis strategy, based on shared clinical research experience, was used as reference standard. Software libraries were selected based on widespread availability, reliability, cost, and technical expertise of the research team (REDCap-v11.0.0 for collaborative data-collection, Python-v3.8.5 for data retrieval and SQLite-v3.31.1 for data storage). The primary objective of this study was to develop and implement a workflow to: a) easily store large volumes of structured data into a relational database, b) perform scripted analyses on relational data retrieved in real-time from the database. The secondary objective was to enhance the strategy cost-effectiveness by using open-source/cost-free software libraries.

Results: A fully working data strategy was developed and successfully applied to a sample research project. The REDCap platform provided a remote and secure method to collaboratively collect large volumes of standardized relational data, with low technical difficulty and role-based access-control. A Python software was coded to retrieve live data through the REDCap-API and persist them to an SQLite database, preserving data-relationships. The SQL-language enabled complex datasets retrieval, while Python allowed for scripted data computation and analysis. Only cost-free software libraries were used and the sample code was made available through a GitHub repository.

Conclusions: A REDCap-based data-collection and data-analysis workflow, suitable for high-volume relational data-analysis on live data, was developed and successfully implemented using open-source software.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2022.107111DOI Listing

Publication Analysis

Top Keywords

relational data
16
data
13
high-volume relational
12
large volumes
12
data-collection data-analysis
12
data analysis
8
analysis real-time
8
medical departments
8
study develop
8
data-analysis workflow
8

Similar Publications

Background: Osteoarthritis (OA) often leads to pain and functional limitations, impacting work and daily life. Physical activity (PA) is an important part of the treatment. Wearable activity trackers (WATs) offer a novel approach to promote PA but could also aid in finding a sustainable PA level over time.

View Article and Find Full Text PDF

Plasma S100β is a predictor for pathology and cognitive decline in Alzheimer's disease.

Fluids Barriers CNS

January 2025

Sanders-Brown Center on Aging, College of Medicine, University of Kentucky, 760 Press Ave, 124 HKRB, Lexington, KY, 40536-0679, USA.

Background: Blood-brain barrier dysfunction is one characteristic of Alzheimer's disease (AD) and is recognized as both a cause and consequence of the pathological cascade leading to cognitive decline. The goal of this study was to assess markers for barrier dysfunction in postmortem tissue samples from research participants who were either cognitively normal individuals (CNI) or diagnosed with AD at the time of autopsy and determine to what extent these markers are associated with AD neuropathologic changes (ADNC) and cognitive impairment.

Methods: We used postmortem brain tissue and plasma samples from 19 participants: 9 CNI and 10 AD dementia patients who had come to autopsy from the University of Kentucky AD Research Center (UK-ADRC) community-based cohort; all cases with dementia had confirmed severe ADNC.

View Article and Find Full Text PDF

Background: Pancreatic cancer is characterized by a complex tumor microenvironment that hinders effective immunotherapy. Identifying key factors that regulate the immunosuppressive landscape is crucial for improving treatment strategies.

Methods: We constructed a prognostic and risk assessment model for pancreatic cancer using 101 machine learning algorithms, identifying OSBPL3 as a key gene associated with disease progression and prognosis.

View Article and Find Full Text PDF

Purpose: The prognosis of relapsed primary central nervous system lymphoma remains a concern. This study aimed to compare the effects of various patient- and disease-related factors on the prognosis of relapsed primary central nervous system lymphoma (PCNSL).

Methods: We retrospectively collected real-world data from eight Finnish hospitals on 198 patients diagnosed with PCNSL between 2003 and 2020.

View Article and Find Full Text PDF

Neuropathological contributions to grey matter atrophy and white matter hyperintensities in amnestic dementia.

Alzheimers Res Ther

January 2025

Laboratory for Clinical Neuroscience, Center for Biomedical Technology, Universidad Politécnica de Madrid, IdISSC, Crta M40, km38, Madrid, 28223, Spain.

Background: Dementia patients commonly present multiple neuropathologies, worsening cognitive function, yet structural neuroimaging signatures of dementia have not been positioned in the context of combined pathology. In this study, we implemented an MRI voxel-based approach to explore combined and independent effects of dementia pathologies on grey and white matter structural changes.

Methods: In 91 amnestic dementia patients with post-mortem brain donation, grey matter density and white matter hyperintensity (WMH) burdens were obtained from pre-mortem MRI and analyzed in relation to Alzheimer's, vascular, Lewy body, TDP-43, and hippocampal sclerosis (HS) pathologies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!