Dramatic increases in the throughput of nucleotide sequencing machines, and the promise of ever greater performance, have thrust bioinformatics into the era of petabyte-scale data sets. Sequence repositories, which provide the feed for these data sets into the worldwide computational infrastructure, are challenged by the impact of these data volumes. The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/embl), comprising the EMBL Nucleotide Sequence Database and the Ensembl Trace Archive, has identified challenges in the storage, movement, analysis, interpretation and visualization of petabyte-scale data sets. We present here our new repository for next generation sequence data, a brief summary of contents of the ENA and provide details of major developments to submission pipelines, high-throughput rule-based validation infrastructure and data integration approaches.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2686451PMC
http://dx.doi.org/10.1093/nar/gkn765DOI Listing

Publication Analysis

Top Keywords

data sets
12
european nucleotide
8
nucleotide archive
8
petabyte-scale data
8
data
6
petabyte-scale innovations
4
innovations european
4
nucleotide
4
archive dramatic
4
dramatic increases
4

Similar Publications

Objective: To investigate the expression of metabolism-related genes (MRGs) in kidney renal clear cell carcinoma (KIRC) and their association with patient prognosis, and to identify potential targets for intervention.

Methods: Bioinformatics methods were employed to mine the KIRC transcriptome data in The Cancer Genome Atlas Program (TCGA) database in order to identify MRGs that are aberrantly expressed in cancerous tissues. Subsequently, a prognostic risk score model was constructed and its predictive capacity was evaluated.

View Article and Find Full Text PDF

Big Data communication researchers have highlighted the need for qualitative analysis of online science conversations to better understand their meaning. However, a scholarly gap exists in exploring how qualitative methods can be applied to small data regarding micro-bloggers' communications about science articles. While social media attention assists with article dissemination, qualitative research into the associated microblogging practices remains limited.

View Article and Find Full Text PDF

Introduction: Observational studies have revealed a close relationship between reduced bone mineral density (BMD) and Alzheimer's disease (AD) risk. The receptor activator of nuclear factor kappa-B ligand (RANKL) and osteoprotegerin (OPG) system, pivotal in regulating bone metabolism, has been implicated in brain function, but the causal impact on AD risk remains unclear.

Methods: We employed bi-directional Mendelian randomization (MR) and multivariable MR (MVMR) approaches to elucidate the effect of blood soluble RANKL (sRANKL) and OPG levels on AD, assessing whether this influence was independent of BMD and inflammation.

View Article and Find Full Text PDF

Enhancement of the nontumor component in newly diagnosed glioblastoma as a more accurate predictor of local recurrence location: a multicenter study.

Quant Imaging Med Surg

January 2025

Department of Radiology, Medical Imaging Institute of Tianjin, Tianjin First Central Hospital, School of Medicine, Nankai University, Tianjin, China.

Background: Although the spatial heterogeneity of glioblastoma (GBM) can be clearly mapped by the habitats generated by magnetic resonance imaging (MRI), the means to accurately predicting the spatial location of local recurrence (SLLR) remains a significant challenge. The aim of this study was to identify the different degrees enhancement of GBM, including the nontumor component and tumor component, and determine their relationship with SLLR.

Methods: A retrospective analysis was performed from three tertiary medical centers, totaling 728 patients with 109 radiation-induced temporal lobe necrosis (TLN) of nasopharyngeal carcinoma (NPC) and 619 with GBM.

View Article and Find Full Text PDF

Background: Chronic hepatitis B virus (HBV) infection is a major risk for development of hepatocellular carcinoma (HCC), a frequent malignancy with a poor survival rate. HBV infection results in significant endoplasmic reticulum (ER) stress and activation of the unfolded protein response (UPR) signaling, a contributing factor to carcinogenesis. As part of the UPR, the ER-associated degradation (ERAD) pathway is responsible for removing the burden of misfolded secretory proteins, to re-establish cellular homeostasis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!