AI Article Synopsis

  • Biomedical studies are generating massive amounts of data, but handling this data efficiently is a challenge.
  • Trellis is a cloud-based framework designed to automate the entire process from data collection to presenting results, while also ensuring data tracking and system reliability.
  • The framework uses a graph database and a microservice architecture to efficiently process bioinformatics tasks, successfully enabling the analysis of 100,000 human genomes in one program.

Article Abstract

Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation, while tracking data lineage, facilitating information query, and supporting fault-tolerance and scalability. Using a graph database to coordinate the state of the data processing workflows and a scalable microservice architecture to perform bioinformatics tasks, Trellis has enabled efficient variant calling on 100,000 human genomes collected in the VA Million Veteran Program.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8636485PMC
http://dx.doi.org/10.1038/s41598-021-02569-5DOI Listing

Publication Analysis

Top Keywords

efficient data
8
data task
8
task management
8
veteran program
8
data processing
8
data
7
trellis efficient
4
management veteran
4
program biomedical
4
biomedical studies
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!