98 results match your criteria: "European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI)[Affiliation]"

The PRIDE database is the largest public data repository of mass spectrometry-based proteomics data and currently stores more than 40,000 data sets covering a wide range of organisms, experimental techniques, and biological conditions. During the past few years, PRIDE has seen a significant increase in the amount of submitted data-independent acquisition (DIA) proteomics data sets. This provides an excellent opportunity for large-scale data reanalysis and reuse.

View Article and Find Full Text PDF

Using AI to prevent the insect apocalypse: toward new environmental risk assessment procedures.

Curr Opin Insect Sci

December 2024

European Molecular Biology Laboratory, Heidelberg, Trust Genome Campus, Hinxton CB10 1SD, UK. Electronic address:

Insect populations are declining globally, with multiple potential drivers identified. However, experimental data are needed to understand their relative contributions. We highlight the sublethal effects of pesticides at field-relevant concentrations, often overlooked in standard environmental risk assessments (ERA), as significant contributors to these declines.

View Article and Find Full Text PDF

Motivation: Genome-wide association studies (GWAS) have been remarkably successful in identifying associations between genetic variants and imaging-derived phenotypes. To date, the main focus of these analyses has been on established, clinically-used imaging features. We sought to investigate if deep learning approaches can detect more nuanced patterns of image variability.

View Article and Find Full Text PDF

RBPs: an RNA editor's choice.

Front Mol Biosci

August 2024

European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Cambridgeshire, United Kingdom.

RNA-binding proteins (RBPs) play a key role in gene expression and post-transcriptional RNA regulation. As integral components of ribonucleoprotein complexes, RBPs are susceptible to genomic and RNA Editing derived amino acid substitutions, impacting functional interactions. This article explores the prevalent RNA Editing of RBPs, unravelling the complex interplay between RBPs and RNA Editing events.

View Article and Find Full Text PDF

The European Bioinformatics Institute (EMBL-EBI)'s Job Dispatcher framework provides access to a wide range of core databases and analysis tools that are of key importance in bioinformatics. As well as providing web interfaces to these resources, web services are available using REST and SOAP protocols that enable programmatic access and allow their integration into other applications and analytical workflows and pipelines. This article describes the various options available to researchers and bioinformaticians who would like to use our resources via the web interface employing RESTful web services clients provided in Perl, Python, and Java or who would like to use Docker containers to integrate the resources into analysis pipelines and workflows.

View Article and Find Full Text PDF

The availability of an increasingly large amount of public proteomics data sets presents an opportunity for performing combined analyses to generate comprehensive organism-wide protein expression maps across different organisms and biological conditions. , a domestic pig, is a model organism relevant for food production and for human biomedical research. Here, we reanalyzed 14 public proteomics data sets from the PRIDE database coming from pig tissues to assess baseline (without any biological perturbation) protein abundance in 14 organs, encompassing a total of 20 healthy tissues from 128 samples.

View Article and Find Full Text PDF

Adopting Mechanistic Molecular Biology Approaches in Exposome Research for Causal Understanding.

Environ Sci Technol

April 2024

European Commission, DG Research and Innovation, Sq. Frère-Orban 8, 1000 Bruxelles, Belgium.

Through investigating the combined impact of the environmental exposures experienced by an individual throughout their lifetime, exposome research provides opportunities to understand and mitigate negative health outcomes. While current exposome research is driven by epidemiological studies that identify associations between exposures and effects, new frameworks integrating more substantial population-level metadata, including electronic health and administrative records, will shed further light on characterizing environmental exposure risks. Molecular biology offers methods and concepts to study the biological and health impacts of exposomes in experimental and computational systems.

View Article and Find Full Text PDF

Integrated meta-analysis of colorectal cancer public proteomic datasets for biomarker discovery and validation.

PLoS Comput Biol

January 2024

Department of Molecular Biomedicine, Centro de Investigaciones Biológicas Margarita Salas, Consejo Superior de Investigaciones Científicas, Madrid, Spain.

The cancer biomarker field has been an object of thorough investigation in the last decades. Despite this, colorectal cancer (CRC) heterogeneity makes it challenging to identify and validate effective prognostic biomarkers for patient classification according to outcome and treatment response. Although a massive amount of proteomics data has been deposited in public data repositories, this rich source of information is vastly underused.

View Article and Find Full Text PDF

The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison.

View Article and Find Full Text PDF

Melanoma is the deadliest form of skin cancer and develops from the melanocytes that are responsible for the pigmentation of the skin. The skin is also a highly regenerative organ, harboring a pool of undifferentiated melanocyte stem cells that proliferate and differentiate into mature melanocytes during regenerative processes in the adult. Melanoma and melanocyte regeneration share remarkable cellular features, including activation of cell proliferation and migration.

View Article and Find Full Text PDF

CIPSI: An open chemical intellectual property service for medicinal chemists.

Mol Inform

January 2024

Systems Pharmacology, Research Group on Biomedical Informatics (GRIB), IMIM Hospital del Mar Medical Research Institute, Doctor Aiguader 88, 08028, Barcelona, Spain.

The availability of patent chemical data offers public access to a chemical space that is not well covered by other sources collecting small molecules from scholarly literature. However, open applications to facilitate the search and analysis of biologically-relevant molecular structures present in patents are still largely missing. We have developed CIPSI, an open Chemical Intellectual Property Service @ IMIM to assist medicinal chemists in searching and analysing molecules in SureChEMBL patents.

View Article and Find Full Text PDF

The growing number of available single-cell gene expression datasets from different species creates opportunities to explore evolutionary relationships between cell types across species. Cross-species integration of single-cell RNA-sequencing data has been particularly informative in this context. However, in order to do so robustly it is essential to have rigorous benchmarking and appropriate guidelines to ensure that integration results truly reflect biology.

View Article and Find Full Text PDF

The Comparative Pathology Workbench: Interactive visual analytics for biomedical data.

J Pathol Inform

August 2023

Edinburgh Pathology & Centre for Comparative Pathology, Institute of Genetics & Cancer, University of Edinburgh, Crewe Road, Edinburgh EH4 2XR, UK.

Pathologists need to compare histopathological images of normal and diseased tissues between different samples, cases, and species. We have designed an interactive system, termed Comparative Pathology Workbench (CPW), which allows direct and dynamic comparison of images at a variety of magnifications, selected regions of interest, as well as the results of image analysis or other data analyses such as scRNA-seq. This allows pathologists to indicate key diagnostic features, with a mechanism to allow discussion threads amongst expert groups of pathologists and other disciplines.

View Article and Find Full Text PDF

GA4GH Phenopackets: A Practical Introduction.

Adv Genet (Hoboken)

March 2023

The Jackson Laboratory for Genomic Medicine 10 Discovery Drive Farmington CT 06032 USA.

The Global Alliance for Genomics and Health (GA4GH) is developing a suite of coordinated standards for genomics for healthcare. The Phenopacket is a new GA4GH standard for sharing disease and phenotype information that characterizes an individual person, linking that individual to detailed phenotypic descriptions, genetic information, diagnoses, and treatments. A detailed example is presented that illustrates how to use the schema to represent the clinical course of a patient with retinoblastoma, including demographic information, the clinical diagnosis, phenotypic features and clinical measurements, an examination of the extirpated tumor, therapies, and the results of genomic analysis.

View Article and Find Full Text PDF

The availability of proteomics datasets in the public domain, and in the PRIDE database, in particular, has increased dramatically in recent years. This unprecedented large-scale availability of data provides an opportunity for combined analyses of datasets to get organism-wide protein abundance data in a consistent manner. We have reanalyzed 24 public proteomics datasets from healthy human individuals to assess baseline protein abundance in 31 organs.

View Article and Find Full Text PDF

Constrained Coding Regions (CCRs) in the human genome have been derived from DNA sequencing data of large cohorts of healthy control populations, available in the Genome Aggregation Database (gnomAD) [1]. They identify regions depleted of protein-changing variants and thus identify segments of the genome that have been constrained during human evolution. By mapping these DNA-defined regions from genomic coordinates onto the corresponding protein positions and combining this information with protein annotations, we have explored the distribution of CCRs and compared their co-occurrence with different protein functional features, previously annotated at the amino acid level in public databases.

View Article and Find Full Text PDF

The increasingly large amount of proteomics data in the public domain enables, among other applications, the combined analyses of datasets to create comparative protein expression maps covering different organisms and different biological conditions. Here we have reanalysed public proteomics datasets from mouse and rat tissues (14 and 9 datasets, respectively), to assess baseline protein abundance. Overall, the aggregated dataset contained 23 individual datasets, including a total of 211 samples coming from 34 different tissues across 14 organs, comprising 9 mouse and 3 rat strains, respectively.

View Article and Find Full Text PDF

Gliomas are the most frequent type of brain cancers and characterized by continuous proliferation, inflammation, angiogenesis, invasion and dedifferentiation, which are also among the initiator and sustaining factors of brain regeneration during restoration of tissue integrity and function. Thus, brain regeneration and brain cancer should share more molecular mechanisms at early stages of regeneration where cell proliferation dominates. However, the mechanisms could diverge later when the regenerative response terminates, while cancer cells sustain proliferation.

View Article and Find Full Text PDF

Open access to sequence data is a cornerstone of biology and biodiversity research, but has created tension under the United Nations Convention on Biological Diversity (CBD). Policy decisions could compromise research and development, unless a practical multilateral solution is implemented.

View Article and Find Full Text PDF

Rare disease patients are more likely to receive a rapid molecular diagnosis nowadays thanks to the wide adoption of next-generation sequencing. However, many cases remain undiagnosed even after exome or genome analysis, because the methods used missed the molecular cause in a known gene, or a novel causative gene could not be identified and/or confirmed. To address these challenges, the RD-Connect Genome-Phenome Analysis Platform (GPAP) facilitates the collation, discovery, sharing, and analysis of standardized genome-phenome data within a collaborative environment.

View Article and Find Full Text PDF

Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard.

View Article and Find Full Text PDF

The European Genome-phenome Archive (EGA - https://ega-archive.org/) is a resource for long term secure archiving of all types of potentially identifiable genetic, phenotypic, and clinical data resulting from biomedical research projects. Its mission is to foster hosted data reuse, enable reproducibility, and accelerate biomedical and translational research in line with the FAIR principles.

View Article and Find Full Text PDF

The current paradigm for data use oversight of biomedical datasets is onerous, extending the timescale and resources needed to obtain access for secondary analyses, thus hindering scientific discovery. For a researcher to utilize a controlled-access dataset, a data access committee must review her research plans to determine whether they are consistent with the data use limitations (DULs) specified by the informed consent form. The newly created GA4GH data use ontology (DUO) holds the potential to streamline this process by making data use oversight computable.

View Article and Find Full Text PDF

Glycans play a vital role in health, disease, bioenergy, biomaterials and bio-therapeutics. As a result, there is keen interest to identify and increase glycan data in bioinformatics databases like ChEBI and PubChem, and connecting them to resources at the EMBL-EBI and NCBI to facilitate access to important annotations at a global level. GlyTouCan is a comprehensive archival database that contains glycans obtained primarily through batch upload from glycan repositories, glycoprotein databases and individual laboratories.

View Article and Find Full Text PDF