304 results match your criteria: "EMBL - European Bioinformatics Institute[Affiliation]"

Digital transcriptome profiling of normal and glioblastoma-derived neural stem cells identifies genes associated with patient survival.

Genome Med

May 2014

EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK ; Genome Biology and Developmental Biology Units, European Molecular Biology Laboratory, Meyerhofstraße 1, 69117 Heidelberg, Germany ; Wellcome Trust - Medical Research Council Cambridge Stem Cell Institute, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK.

Background: Glioblastoma multiforme, the most common type of primary brain tumor in adults, is driven by cells with neural stem (NS) cell characteristics. Using derivation methods developed for NS cells, it is possible to expand tumorigenic stem cells continuously in vitro. Although these glioblastoma-derived neural stem (GNS) cells are highly similar to normal NS cells, they harbor mutations typical of gliomas and initiate authentic tumors following orthotopic xenotransplantation.

View Article and Find Full Text PDF

NMDA receptor dependent long-term potentiation (LTP) and long-term depression (LTD) are two prominent forms of synaptic plasticity, both of which are triggered by post-synaptic calcium elevation. To understand how calcium selectively stimulates two opposing processes, we developed a detailed computational model and performed simulations with different calcium input frequencies, amplitudes, and durations. We show that with a total amount of calcium ions kept constant, high frequencies of calcium pulses stimulate calmodulin more efficiently.

View Article and Find Full Text PDF

Recent advances in computational biology suggest that any perturbation to the transcriptional programme of the cell can be summarised by a proper 'signature': a set of genes combined with a pattern of expression. Therefore, it should be possible to generate proxies of clinicopathological phenotypes and drug effects through signatures acquired via DNA microarray technology. Gene expression signatures have recently been assembled and compared through genome-wide metrics, unveiling unexpected drug-disease and drug-drug 'connections' by matching corresponding signatures.

View Article and Find Full Text PDF

We developed m:Explorer for identifying process-specific transcription factors (TFs) from multiple genome-wide sources, including transcriptome, DNA-binding and chromatin data. m:Explorer robustly outperforms similar techniques in finding cell cycle TFs in Saccharomyces cerevisiae. We predicted and experimentally tested regulators of quiescence (G0), a model of ageing, over a six-week time-course.

View Article and Find Full Text PDF

Large-scale analysis of microRNA evolution.

BMC Genomics

June 2012

EMBL - European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.

Background: In animals, microRNAs (miRNA) are important genetic regulators. Animal miRNAs appear to have expanded in conjunction with an escalation in complexity during early bilaterian evolution. Their small size and high-degree of similarity makes them challenging for phylogenetic approaches.

View Article and Find Full Text PDF

Motivation: LibSBGN is a software library for reading, writing and manipulating Systems Biology Graphical Notation (SBGN) maps stored using the recently developed SBGN-ML file format. The library (available in C++ and Java) makes it easy for developers to add SBGN support to their tools, whereas the file format facilitates the exchange of maps between compatible software applications. The library also supports validation of maps, which simplifies the task of ensuring compliance with the detailed SBGN specifications.

View Article and Find Full Text PDF

Unlabelled: Iterative similarity searches with PSI-BLAST position-specific score matrices (PSSMs) find many more homologs than single searches, but PSSMs can be contaminated when homologous alignments are extended into unrelated protein domains-homologous over-extension (HOE). PSI-Search combines an optimal Smith-Waterman local alignment sequence search, using SSEARCH, with the PSI-BLAST profile construction strategy. An optional sequence boundary-masking procedure, which prevents alignments from being extended after they are initially included, can reduce HOE errors in the PSSM profile.

View Article and Find Full Text PDF

We here present the jmzReader library: a collection of Java application programming interfaces (APIs) to parse the most commonly used peak list and XML-based mass spectrometry (MS) data formats: DTA, MS2, MGF, PKL, mzXML, mzData, and mzML (based on the already existing API jmzML). The library is optimized to be used in conjunction with mzIdentML, the recently released standard data format for reporting protein and peptide identifications, developed by the HUPO proteomics standards initiative (PSI). mzIdentML files do not contain spectra data but contain references to different kinds of external MS data files.

View Article and Find Full Text PDF

We present a Java application programming interface (API), jmzIdentML, for the Human Proteome Organisation (HUPO) Proteomics Standards Initiative (PSI) mzIdentML standard for peptide and protein identification data. The API combines the power of Java Architecture of XML Binding (JAXB) and an XPath-based random-access indexer to allow a fast and efficient mapping of extensible markup language (XML) elements to Java objects. The internal references in the mzIdentML files are resolved in an on-demand manner, where the whole file is accessed as a random-access swap file, and only the relevant piece of XMLis selected for mapping to its corresponding Java object.

View Article and Find Full Text PDF

Motivation: Accurate alignment of large numbers of sequences is demanding and the computational burden is further increased by downstream analyses depending on these alignments. With the abundance of sequence data, an integrative approach of adding new sequences to existing alignments without their full re-computation and maintaining the relative matching of existing sequences is an attractive option. Another current challenge is the extension of reference alignments with fragmented sequences, as those coming from next-generation metagenomics, that contain relatively little information.

View Article and Find Full Text PDF

A central tenet in evolutionary theory is that mutations occur randomly with respect to their value to an organism; selection then governs whether they are fixed in a population. This principle has been challenged by long-standing theoretical models predicting that selection could modulate the rate of mutation itself. However, our understanding of how the mutation rate varies between different sites within a genome has been hindered by technical difficulties in measuring it.

View Article and Find Full Text PDF

Activation of CaMKII by calmodulin and the subsequent maintenance of constitutive activity through autophosphorylation at threonine residue 286 (Thr286) are thought to play a major role in synaptic plasticity. One of the effects of autophosphorylation at Thr286 is to increase the apparent affinity of CaMKII for calmodulin, a phenomenon known as "calmodulin trapping". It has previously been suggested that two binding sites for calmodulin exist on CaMKII, with high and low affinities, respectively.

View Article and Find Full Text PDF

Unlabelled: The Gene Ontology (GO) resource provides dynamic controlled vocabularies to provide an information-rich resource to aid in the consistent description of the functional attributes and subcellular locations of gene products from all taxonomic groups (www.geneontology.org).

View Article and Find Full Text PDF

Transcriptomic studies routinely measure expression levels across numerous conditions. These datasets allow identification of genes that are specifically expressed in a small number of conditions. However, there are currently no statistically robust methods for identifying such genes.

View Article and Find Full Text PDF

ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.

View Article and Find Full Text PDF

Transcription factors (TFs) play an important role in regulating gene expression. The availability of complete genome sequences and associated functional genomic data offer excellent opportunities to understand the transcriptional regulatory system of an entire organism. To do so, however, it is essential to compile a reliable dataset of regulatory components.

View Article and Find Full Text PDF

The International Protein Index (IPI) database has been one of the most widely used protein databases in MS proteomics approaches. Recently, the closure of IPI in September 2011 was announced. Its recommended replacement is the new UniProt Knowledgebase (UniProtKB) "complete proteome" sets, launched in May 2011.

View Article and Find Full Text PDF

Nucleosomes play an important role in gene regulation. Molecular studies observed that nucleosome binding in promoters tends to be repressive. In contrast, genomic studies have delivered conflicting results: An analysis of yeast grown on diverse carbon sources reported that nucleosome occupancies remain largely unchanged between conditions, whereas a study of the heat-shock response suggested that nucleosomes get evicted at promoters of genes with increased expression.

View Article and Find Full Text PDF

A major mode of signal transduction in bacteria is the two-component system, which involves phosphorylation of an output-generating receiver protein by a signal-sensing histidine kinase. This differs from the more common one-component system--where both signal sensing and output generation are performed by the same protein--in the spatial separation of the two activities and the obligate need for post-translational modification (phosphorylation). Many described two-component systems involve a linear structure where a single kinase phosphorylates a cognate receiver.

View Article and Find Full Text PDF

Bioactive molecules such as drugs, pesticides and food additives are produced in large numbers by many commercial and academic groups around the world. Enormous quantities of data are generated on the biological properties and quality of these molecules. Access to such data - both on licensed and commercially available compounds, and also on those that fail during development - is crucial for understanding how improved molecules could be developed.

View Article and Find Full Text PDF

Background: Systematic measurement of genetic interactions by combinatorial RNAi (co-RNAi) is a powerful tool for mapping functional modules and discovering components. It also provides insights into the role of epistasis on the way from genotype to phenotype. The interpretation of co-RNAi data requires computational and statistical analysis in order to detect interactions reliably and sensitively.

View Article and Find Full Text PDF

The InterPro BioMart provides users with query-optimized access to predictions of family classification, protein domains and functional sites, based on a broad spectrum of integrated computational models ('signatures') that are generated by the InterPro member databases: Gene3D, HAMAP, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. These predictions are provided for all protein sequences from both the UniProt Knowledge Base and the UniParc protein sequence archive. The InterPro BioMart is supplementary to the primary InterPro web interface (http://www.

View Article and Find Full Text PDF

In proteomics, protein identifications are reported and stored using an unstable reference system: protein identifiers. These proprietary identifiers are created individually by every protein database and can change or may even be deleted over time. To estimate the effect of the searched protein sequence database on the long-term storage of proteomics data we analyzed the changes of reported protein identifiers from all public experiments in the Proteomics Identifications (PRIDE) database by November 2010.

View Article and Find Full Text PDF

Transcriptional regulation is one the most basic mechanisms for controlling gene expression. Over the past few years, much research has been devoted to understanding the interplay between transcription factors, histone modifications and associated enzymes required to achieve this control. However, it is becoming increasingly apparent that the three-dimensional conformation of chromatin in the interphase nucleus also plays a critical role in regulating transcription.

View Article and Find Full Text PDF

Transcriptional initiation is arguably the most important control point for gene expression. It is regulated by a combination of factors, including DNA sequence and its three-dimensional topology, proteins and small molecules. In this chapter, we focus on the trans-acting factors of bacterial regulation.

View Article and Find Full Text PDF