Small-molecule inhibitors of PARP1/2, such as olaparib, have been proposed to serve as a synthetic lethal therapy for cancers that harbor BRCA1 or BRCA2 mutations. Indeed, in clinical trials, PARP1/2 inhibitors elicit sustained antitumor responses in patients with germline BRCA gene mutations. In hypothesizing that additional genetic determinants might direct use of these drugs, we conducted a genome-wide synthetic lethal screen for candidate olaparib sensitivity genes.
View Article and Find Full Text PDFBackground: Chromosomal rearrangements in the form of deletions, insertions, inversions and translocations are frequently observed in breast cancer genomes, and a subset of these rearrangements may play a crucial role in tumorigenesis. To identify novel somatic chromosomal rearrangements, we determined the genome structures of 15 hormone-receptor negative breast tumors by long-insert mate pair massively parallel sequencing.
Results: We identified and validated 40 somatic structural alterations, including the recurring fusion between genes DDX10 and SKA3 and translocations involving the EPHA5 gene.
Background: Tourette Syndrome (TS) is a neuropsychiatric disorder in children characterized by motor and verbal tics. Although several genes have been suggested in the etiology of TS, the genetic mechanisms remain poorly understood.
Methods: Using cytogenetics and FISH analysis, we identified an apparently balanced t(6,22)(q16.
Background: Double-hit lymphoma is a complex and highly aggressive sub-type of B-cell lymphoma, which has recently been classified and is an area of active research interest due to the poor prognosis for patients with this disease. It is characterized by the presence of both an activating MYC chromosomal translocation and a simultaneous additional oncogenic translocation, often of the BCL2 gene. Recently, a cell line was established from a patient with this complex lymphoma and analyzed using conventional tools revealing it contains both MYC and BCL2 translocation events.
View Article and Find Full Text PDFAdvances in next-generation RNA-sequencing have revealed the complexity of transcriptomes by allowing both coding and noncoding(nc)RNAs to be analyzed. However, limited data exist regarding the whole transcriptional landscape of chronic lymphocytic leukemia(CLL). In this pilot-study, we evaluated RNA-sequencing in CLL by comparing two subsets which carry almost identical or "stereotyped" B-cell receptors with distinct clinical outcome, that is the poor-prognostic subset #1 (n 5 4) and the more favorable-prognostic subset #4(n 5 4).
View Article and Find Full Text PDFBackground: Biological processes such as metabolic pathways, gene regulation or protein-protein interactions are often represented as graphs in systems biology. The understanding of such networks, their analysis, and their visualization are today important challenges in life sciences. While a great variety of visualization tools that try to address most of these challenges already exists, only few of them succeed to bridge the gap between visualization and network analysis.
View Article and Find Full Text PDFComprehensive identification of the acquired mutations that cause common cancers will require genomic analyses of large sets of tumor samples. Typically, the tissue material available from tumor specimens is limited, which creates a demand for accurate template amplification. We therefore evaluated whether phi29-mediated whole genome amplification introduces false positive structural mutations by massive mate-pair sequencing of a normal human genome before and after such amplification.
View Article and Find Full Text PDFBackground: The extremely halophilic archaea are present worldwide in saline environments and have important biotechnological applications. Ten complete genomes of haloarchaea are now available, providing an opportunity for comparative analysis.
Methodology/principal Findings: We report here the comparative analysis of five newly sequenced haloarchaeal genomes with five previously published ones.
We present 'gene prediction improvement pipeline' (GenePRIMP; http://geneprimp.jgi-psf.org/), a computational process that performs evidence-based evaluation of gene models in prokaryotic genomes and reports anomalies including inconsistent start sites, missed genes and split genes.
View Article and Find Full Text PDFBackground: Cupriavidus necator JMP134 is a Gram-negative beta-proteobacterium able to grow on a variety of aromatic and chloroaromatic compounds as its sole carbon and energy source.
Methodology/principal Findings: Its genome consists of four replicons (two chromosomes and two plasmids) containing a total of 6631 protein coding genes. Comparative analysis identified 1910 core genes common to the four genomes compared (C.
Motivation: Shotgun sequencing generates large numbers of short DNA reads from either an isolated organism or, in the case of metagenomics projects, from the aggregate genome of a microbial community. These reads are then assembled based on overlapping sequences into larger, contiguous sequences (contigs). The feasibility of assembly and the coverage achieved (reads per nucleotide or distinct sequence of nucleotides) depend on several factors: the number of reads sequenced, the read length and the relative abundances of their source genomes in the microbial community.
View Article and Find Full Text PDFComputational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes are often having similar gene context and relies on the identification of such events across phylogenetically diverse collection of genomes.
View Article and Find Full Text PDFBackground: Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens.
Methodology/principal Findings: In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1.
Unlabelled: jClust is a user-friendly application which provides access to a set of widely used clustering and clique finding algorithms. The toolbox allows a range of filtering procedures to be applied and is combined with an advanced implementation of the Medusa interactive visualization module. These implemented algorithms are k-Means, Affinity propagation, Bron-Kerbosch, MULIC, Restricted neighborhood search cluster algorithm, Markov clustering and Spectral clustering, while the supported filtering procedures are haircut, outside-inside, best neighbors and density control operations.
View Article and Find Full Text PDFBackground: Determining the habitat range for various microbes is not a simple, straightforward matter, as habitats interlace, microbes move between habitats, and microbial communities change over time. In this study, we explore an approach using the history of lateral gene transfer recorded in microbial genomes to begin to answer two key questions: where have you been and who have you been with?
Results: All currently sequenced microbial genomes were surveyed to identify pairs of taxa that share a transposase that is likely to have been acquired through lateral gene transfer. A microbial interaction network including almost 800 organisms was then derived from these connections.
Background: Staphylothermus marinus is an anaerobic, sulfur-reducing peptide fermenter of the archaeal phylum Crenarchaeota. It is the third heterotrophic, obligate sulfur reducing crenarchaeote to be sequenced and provides an opportunity for comparative analysis of the three genomes.
Results: The 1.
Unlabelled: OnTheFly is a web-based application that applies biological named entity recognition to enrich Microsoft Office, PDF and plain text documents. The input files are converted into the HTML format and then sent to the Reflect tagging server, which highlights biological entity names like genes, proteins and chemicals, and attaches to them JavaScript code to invoke a summary pop-up window. The window provides an overview of relevant information about the entity, such as a protein description, the domain composition, a link to the 3D structure and links to other relevant online resources.
View Article and Find Full Text PDFIn order to simplify and meaningfully categorize large sets of protein sequence data, it is commonplace to cluster proteins based on the similarity of those sequences. However, it quickly becomes clear that the sequence flexibility allowed a given protein varies significantly among different protein families. The degree to which sequences are conserved not only differs for each protein family, but also is affected by the phylogenetic divergence of the source organisms.
View Article and Find Full Text PDFHalothermothirx orenii is a strictly anaerobic thermohalophilic bacterium isolated from sediment of a Tunisian salt lake. It belongs to the order Halanaerobiales in the phylum Firmicutes. The complete sequence revealed that the genome consists of one circular chromosome of 2578146 bps encoding 2451 predicted genes.
View Article and Find Full Text PDFMotivation: A typical metagenome dataset generated using a 454 pyrosequencing platform consists of short reads sampled from the collective genome of a microbial community. The amount of sequence in such datasets is usually insufficient for assembly, and traditional gene prediction cannot be applied to unassembled short reads. As a result, analysis of such datasets usually involves comparisons in terms of relative abundances of various protein families.
View Article and Find Full Text PDFBackground: Environments and their organic content are generally not static and isolated, but in a constant state of exchange and interaction with each other. Through physical or biological processes, organisms, especially microbes, may be transferred between environments whose characteristics may be quite different. The transferred microbes may not survive in their new environment, but their DNA will be deposited.
View Article and Find Full Text PDFTime-series analysis of whole-genome expression data during Drosophila melanogaster development indicates that up to 86% of its genes change their relative transcript level during embryogenesis. By applying conservative filtering criteria and requiring 'sharp' transcript changes, we identified 1534 maternal genes, 792 transient zygotic genes, and 1053 genes whose transcript levels increase during embryogenesis. Each of these three categories is dominated by groups of genes where all transcript levels increase and/or decrease at similar times, suggesting a common mode of regulation.
View Article and Find Full Text PDFProtein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. Here we report the first genome-wide screen for complexes in an organism, budding yeast, using affinity purification and mass spectrometry. Through systematic tagging of open reading frames (ORFs), the majority of complexes were purified several times, suggesting screen saturation.
View Article and Find Full Text PDF