Bacteriophages have received recent attention for their therapeutic potential to treat antibiotic-resistant bacterial infections. One particular idea in phage therapy is to use phages that not only directly kill their bacterial hosts but also rely on particular bacterial receptors, such as proteins involved in virulence or antibiotic resistance. In such cases, the evolution of phage resistance would correspond to the loss of those receptors, an approach termed evolutionary steering.
View Article and Find Full Text PDFCongenital hydrocephalus (CH), featuring markedly enlarged brain ventricles, is thought to arise from failed cerebrospinal fluid (CSF) homeostasis and is treated with lifelong surgical CSF shunting with substantial morbidity. CH pathogenesis is poorly understood. Exome sequencing of 125 CH trios and 52 additional probands identified three genes with significant burden of rare damaging de novo or transmitted mutations: TRIM71 (p = 2.
View Article and Find Full Text PDFCongenital heart disease (CHD) is the leading cause of mortality from birth defects. Here, exome sequencing of a single cohort of 2,871 CHD probands, including 2,645 parent-offspring trios, implicated rare inherited mutations in 1.8%, including a recessive founder mutation in GDF1 accounting for ∼5% of severe CHD in Ashkenazim, recessive genotypes in MYH6 accounting for ∼11% of Shone complex, and dominant FLT4 mutations accounting for 2.
View Article and Find Full Text PDFDespite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descendants has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual.
View Article and Find Full Text PDFCutaneous T cell lymphoma (CTCL) is a non-Hodgkin lymphoma of skin-homing T lymphocytes. We performed exome and whole-genome DNA sequencing and RNA sequencing on purified CTCL and matched normal cells. The results implicate mutations in 17 genes in CTCL pathogenesis, including genes involved in T cell activation and apoptosis, NF-κB signaling, chromatin remodeling and DNA damage response.
View Article and Find Full Text PDFGenomics Proteomics Bioinformatics
February 2015
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization.
View Article and Find Full Text PDFBackground: Current research suggests that a small set of "driver" mutations are responsible for tumorigenesis while a larger body of "passenger" mutations occur in the tumor but do not progress the disease. Due to recent pharmacological successes in treating cancers caused by driver mutations, a variety of methodologies that attempt to identify such mutations have been developed. Based on the hypothesis that driver mutations tend to cluster in key regions of the protein, the development of cluster identification algorithms has become critical.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
September 2013
Despite considerable efforts to sequence hypermutated cancers such as melanoma, distinguishing cancer-driving genes from thousands of recurrently mutated genes remains a significant challenge. To circumvent the problematic background mutation rates and identify new melanoma driver genes, we carried out a low-copy piggyBac transposon mutagenesis screen in mice. We induced eleven melanomas with mutation burdens that were 100-fold lower relative to human melanomas.
View Article and Find Full Text PDFCongenital heart disease (CHD) is the most frequent birth defect, affecting 0.8% of live births. Many cases occur sporadically and impair reproductive fitness, suggesting a role for de novo mutations.
View Article and Find Full Text PDFMultiple studies have confirmed the contribution of rare de novo copy number variations to the risk for autism spectrum disorders. But whereas de novo single nucleotide variants have been identified in affected individuals, their contribution to risk has yet to be clarified. Specifically, the frequency and distribution of these mutations have not been well characterized in matched unaffected controls, and such data are vital to the interpretation of de novo coding mutations observed in probands.
View Article and Find Full Text PDFUnlabelled: Ancient endosymbionts have been associated with extreme genome structural stability with little differentiation in gene inventory between sister species. Tsetse flies (Diptera: Glossinidae) harbor an obligate endosymbiont, Wigglesworthia, which has coevolved with the Glossina radiation. We report on the ~720-kb Wigglesworthia genome and its associated plasmid from Glossina morsitans morsitans and compare them to those of the symbiont from Glossina brevipalpis.
View Article and Find Full Text PDFVertical transmission of obligate symbionts generates a predictable evolutionary history of symbionts that reflects that of their hosts. In insects, evolutionary associations between symbionts and their hosts have been investigated primarily among species, leaving population-level processes largely unknown. In this study, we investigated the tsetse (Diptera: Glossinidae) bacterial symbiont, Wigglesworthia glossinidia, to determine whether observed codiversification of symbiont and tsetse host species extends to a single host species (Glossina fuscipes fuscipes) in Uganda.
View Article and Find Full Text PDFRecent metagenomics studies have begun to sample the genomic diversity among disparate habitats and relate this variation to features of the environment. Membrane proteins are an intuitive, but thus far overlooked, choice in this type of analysis as they directly interact with the environment, receiving signals from the outside and transporting nutrients. Using global ocean sampling (GOS) data, we found nearly approximately 900,000 membrane proteins in large-scale metagenomic sequence, approximately a fifth of which are completely novel, suggesting a large space of hitherto unexplored protein diversity.
View Article and Find Full Text PDFThe goal of human genome re-sequencing is obtaining an accurate assembly of an individual's genome. Recently, there has been great excitement in the development of many technologies for this (e.g.
View Article and Find Full Text PDFThe widespread use of mass spectrometry for protein identification has created a demand for computationally efficient methods of matching mass spectrometry data to protein databases. A search using X!Tandem, a popular and representative program, can require hours or days to complete, particularly when missed cleavages and post-translational modifications are considered. Existing techniques for accelerating X!Tandem by employing parallelism are unsatisfactory for a variety of reasons.
View Article and Find Full Text PDF