Massively-parallel sequencing (MPS) technologies and their diverse applications in genomics and epigenomics research have yielded enormous new insights into the physiology and pathophysiology of the human genome. The biggest hurdle remains the magnitude and diversity of the datasets generated, compromising our ability to manage, organize, process and ultimately analyse data. The Wiki-based Automated Sequence Processor (WASP), developed at the Albert Einstein College of Medicine (hereafter Einstein), uniquely manages to tightly couple the sequencing platform, the sequencing assay, sample metadata and the automated workflows deployed on a heterogeneous high performance computing cluster infrastructure that yield sequenced, quality-controlled and 'mapped' sequence data, all within the one operating environment accessible by a web-based GUI interface. WASP at Einstein processes 4-6 TB of data per week and since its production cycle commenced it has processed ~ 1 PB of data overall and has revolutionized user interactivity with these new genomic technologies, who remain blissfully unaware of the data storage, management and most importantly processing services they request. The abstraction of such computational complexity for the user in effect makes WASP an ideal middleware solution, and an appropriate basis for the development of a grid-enabled resource - the Einstein Genome Gateway - as part of the Extreme Science and Engineering Discovery Environment (XSEDE) program. In this paper we discuss the existing WASP system, its proposed middleware role, and its planned interaction with XSEDE to form the Einstein Genome Gateway.

Download full-text PDF

Source

Publication Analysis

Top Keywords

einstein genome
12
genome gateway
12
einstein
6
wasp
5
data
5
gateway wasp
4
wasp high
4
high throughput
4
throughput multi-layered
4
multi-layered life
4

Similar Publications

The Trail of axonal protein Synthesis: Origins and current functional Landscapes.

Neuroscience

January 2025

Departamento de Genómica, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Av. Italia 3318, Montevideo, CP 11600, Uruguay; Departamento de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, Iguá, Montevideo, 4225, CP 11400, Uruguay. Electronic address:

Local protein synthesis (LPS) in axons is now recognized as a physiological process, participating both in the maintenance of axonal function and diverse plastic phenomena. In the last decades of the 20th century, the existence and function of axonal LPS were topics of significant debate. Very early, axonal LPS was thought not to occur at all and was later accepted to play roles only during development or in response to specific conditions.

View Article and Find Full Text PDF

Agricultural management significantly affects insects, especially pollinators, which are crucial for crop pollination and biodiversity. In agricultural landscapes, various factors spanning different spatial scales are known to affect pollinator health, which, in turn, can influence pollination services. However, the importance of these factors in driving the health and performance of different pollinator groups remains unclear.

View Article and Find Full Text PDF

Objective: Monoallelic variants in the transient receptor potential melastatin-related type 3 gene (TRPM3) have been associated with neurodevelopmental manifestations, but knowledge on the clinical manifestations and treatment options is limited. We characterized the clinical spectrum, highlighting particularly the epilepsy phenotype, and the effect of treatments.

Methods: We analyzed retrospectively the phenotypes and genotypes of 43 individuals with TRPM3 variants, acquired from GeneMatcher and collaborations (n = 21), and through a systematic literature search (n = 22).

View Article and Find Full Text PDF

Unraveling the causal impact of smoking and its DNA methylation signatures on cardiovascular disease: Mendelian randomization and colocalization analysis.

Clin Epigenetics

January 2025

Department of Neurology, Third Xiangya Hospital, Central South University, 138 Tongzipo Road, Yuelu District, Changsha, 410013, Hunan, China.

Background: To explore the mechanisms linking smoking to cardiovascular diseases (CVDs) from an epigenetic perspective.

Methods: Mendelian Randomization (MR) analysis was performed to assess the causal effects of smoking behavior and DNA methylation levels at smoking-related CpG sites on nine CVDs, including aortic aneurysm, atrial fibrillation, coronary atherosclerosis, coronary heart disease, heart failure, intracerebral hemorrhage, ischemic stroke, myocardial infarction, subarachnoid hemorrhage. Colocalization analysis was used to further identify key smoking-related CpG sites from the MR causal estimates.

View Article and Find Full Text PDF

We performed a systems vaccinology analysis to investigate immune responses in humans to an H5N1 influenza vaccine, with and without the AS03 adjuvant, to identify factors influencing antibody response magnitude and durability. Our findings revealed a platelet and adhesion-related blood transcriptional signature on day 7 that predicted the longevity of the antibody response, suggesting a potential role for platelets in modulating antibody response durability. As platelets originate from megakaryocytes, we explored the effect of thrombopoietin (TPO)-mediated megakaryocyte activation on antibody response longevity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!