We describe best practices for providing convenient, high-speed, secure access to large data via research data portals. We capture these best practices in a new design pattern, the Modern Research Data Portal, that disaggregates the traditional monolithic web-based data portal to achieve orders-of-magnitude increases in data transfer performance, support new deployment architectures that decouple control logic from data storage, and reduce development and operations costs. We introduce the design pattern; explain how it leverages high-performance data enclaves and cloud-based data management services; review representative examples at research laboratories and universities, including both experimental facilities and supercomputer sites; describe how to leverage Python APIs for authentication, authorization, data transfer, and data sharing; and use coding examples to demonstrate how these APIs can be used to implement a range of research data portal capabilities. Sample code at a companion web site, https://docs.globus.org/mrdp, provides application skeletons that readers can adapt to realize their own research data portals.
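The separation of control logic from data storage described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation and not the Globus API: the names `TransferRequest`, `TransferService`, and `Portal`, and the in-memory catalog and access-control table, are all hypothetical. The point it shows is that the portal only checks authorization and describes what should move; the file bytes never pass through the portal's web server.

```python
import uuid
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class TransferRequest:
    """Describes a transfer by location only; it carries no file contents."""
    source_endpoint: str
    destination_endpoint: str
    paths: Tuple[str, ...]
    user: str


class TransferService:
    """Hypothetical stand-in for an external, cloud-hosted transfer service."""

    def __init__(self) -> None:
        self.tasks: Dict[str, TransferRequest] = {}

    def submit(self, request: TransferRequest) -> str:
        # A real service would move data between storage endpoints here.
        # This sketch only records the request and returns a task ID.
        task_id = str(uuid.uuid4())
        self.tasks[task_id] = request
        return task_id


class Portal:
    """Control logic only: catalog lookup, authorization check, hand-off."""

    def __init__(
        self,
        transfer_service: TransferService,
        data_endpoint: str,
        catalog: Dict[str, List[str]],
        acl: Dict[str, Set[str]],
    ) -> None:
        self.transfer_service = transfer_service
        self.data_endpoint = data_endpoint  # where the data actually lives
        self.catalog = catalog              # dataset name -> file paths
        self.acl = acl                      # dataset name -> permitted users

    def request_download(self, user: str, dataset: str, destination_endpoint: str) -> str:
        # Authorization is decided by the portal...
        if user not in self.acl.get(dataset, set()):
            raise PermissionError(f"{user} may not access {dataset}")
        # ...but the data movement is delegated to the external service.
        request = TransferRequest(
            source_endpoint=self.data_endpoint,
            destination_endpoint=destination_endpoint,
            paths=tuple(self.catalog[dataset]),
            user=user,
        )
        return self.transfer_service.submit(request)
```

In this sketch the portal could run on a small web host while the storage endpoint sits in a separate high-performance network enclave, since the two communicate only through the transfer request.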

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924693
DOI: http://dx.doi.org/10.7717/peerj-cs.144
