Non-invasive methods of detecting radiation exposure show promise to improve upon current approaches to biological dosimetry in ease, speed, and accuracy. Here we developed a pipeline that employs Fourier transform infrared (FTIR) spectroscopy in the mid-infrared spectrum to identify a signature of low dose ionizing radiation exposure in mouse ear pinnae over time. Mice exposed to 0.
View Article and Find Full Text PDFSynthetic biology is creating genetically engineered organisms at an increasing rate for many potentially valuable applications, but this potential comes with the risk of misuse or accidental release. To begin to address this issue, we have developed a system called GUARDIAN that can automatically detect signatures of engineering in DNA sequencing data, and we have conducted a blinded test of this system using a curated Test and Evaluation (T&E) data set. GUARDIAN uses an ensemble approach based on the guiding principle that no single approach is likely to be able to detect engineering with perfect accuracy.
View Article and Find Full Text PDFShort polypeptides encoded by small open reading frames (smORFs) are ubiquitously found in eukaryotic genomes and are important regulators of physiology, development, and mitochondrial processes. Here, we focus on a subset of 298 smORFs that are evolutionarily conserved between Drosophila melanogaster and humans. Many of these smORFs are conserved broadly in the bilaterian lineage, and ∼182 are conserved in plants.
View Article and Find Full Text PDFsp. BDGP8 is a species of facultative anaerobic gram-positive bacterium of the family Microbacteriaceae. The complete genome consists of a single circular chromosome of 3,293,567 bp with a G + C content of 69.
View Article and Find Full Text PDFDevelopment of effective therapies against SARS-CoV-2 infections relies on mechanistic knowledge of virus-host interface. Abundant physical interactions between viral and host proteins have been identified, but few have been functionally characterized. Harnessing the power of fly genetics, we develop a comprehensive Drosophila COVID-19 resource (DCR) consisting of publicly available strains for conditional tissue-specific expression of all SARS-CoV-2 encoded proteins, UAS-human cDNA transgenic lines encoding established host-viral interacting factors, and GAL4 insertion lines disrupting fly homologs of SARS-CoV-2 human interacting proteins.
View Article and Find Full Text PDFGenerating reference maps of interactome networks illuminates genetic studies by providing a protein-centric approach to finding new components of existing pathways, complexes, and processes. We apply state-of-the-art methods to identify binary protein-protein interactions (PPIs) for Drosophila melanogaster. Four all-by-all yeast two-hybrid (Y2H) screens of > 10,000 Drosophila proteins result in the 'FlyBi' dataset of 8723 PPIs among 2939 proteins.
View Article and Find Full Text PDFSARS-CoV-2 spread rapidly, causing millions of deaths across the globe. As a result, demand for medical supplies and personal protective equipment (PPE) surged and supplies dwindled. Separate entirely, hospital-acquired infections have become commonplace and challenging to treat.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP6 was isolated from the gut of for functional host-microbial interaction studies. The bacterial chromosome is a single circular DNA molecule of 2,785,111 bp with a G+C content of 46%.
View Article and Find Full Text PDFis a species of facultative anaerobic Gram-negative bacteria of the family The complete genome is composed of a single chromosomal circle of 4,957,773 bp with a G+C content of 52%.
View Article and Find Full Text PDFMicrobiol Resour Announc
September 2019
sp. strain 33MFTa1.1 was isolated for functional host-microbe interaction studies from the root-associated microbiome.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2019
Identifying functional enhancer elements in metazoan systems is a major challenge. Large-scale validation of enhancers predicted by ENCODE reveal false-positive rates of at least 70%. We used the pregrastrula-patterning network of to demonstrate that loss in accuracy in held-out data results from heterogeneity of functional signatures in enhancer elements.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP5 was isolated from for functional host-microbe interaction studies. The complete genome is composed of a single chromosomal circle of 2,848,089 bp, with a G+C content of 53% and three plasmids of 131,455 bp, 19,216 bp, and 9,160 bp.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP1 was isolated from for functional host-microbe interaction studies. The complete genome comprises a single chromosomal circle of 3,988,649 bp with a G+C content of 56% and a conjugative plasmid of 151,013 bp.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP2 was isolated from the gut of for functional host microbial interaction studies. The complete genome comprised a single circular genome of 3,407,160 bp, with a G+C content of 44%, and four plasmids.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP4 was isolated from the gut of for functional host microbial interaction studies. The complete genome comprised a single chromosomal circle of 4,557,232 bp with a G+C content of 37% and a single plasmid of 137,143 bp.
View Article and Find Full Text PDFOregon-R-modENCODE strain BDGP3 was isolated from the gut for functional host-microbe interaction studies. The complete genome is composed of a single circular genome of 2,983,334 bp, with a G+C content of 38%, and a single plasmid of 5,594 bp.
View Article and Find Full Text PDFDrosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing.
View Article and Find Full Text PDFThe transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly.
View Article and Find Full Text PDFAnimal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)+ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing.
View Article and Find Full Text PDFmodENCODE was a 5year NHGRI funded project (2007-2012) to map the function of every base in the genomes of worms and flies characterizing positions of modified histones and other chromatin marks, origins of DNA replication, RNA transcripts and the transcription factor binding sites that control gene expression. Here we describe the Drosophila modENCODE datasets and how best to access and use them for genome wide and individual gene studies.
View Article and Find Full Text PDFThe identification of full length transcripts entirely from short-read RNA sequencing data (RNA-seq) remains a challenge in the annotation of genomes. Here we describe an automated pipeline for genome annotation that integrates RNA-seq and gene-boundary data sets, which we call Generalized RNA Integration Tool, or GRIT. Applying GRIT to Drosophila melanogaster short-read RNA-seq, cap analysis of gene expression (CAGE) and poly(A)-site-seq data collected for the modENCODE project, we recovered the vast majority of previously annotated transcripts and doubled the total number of transcripts cataloged.
View Article and Find Full Text PDFDetermining the composition of protein complexes is an essential step toward understanding the cell as an integrated system. Using coaffinity purification coupled to mass spectrometry analysis, we examined protein associations involving nearly 5,000 individual, FLAG-HA epitope-tagged Drosophila proteins. Stringent analysis of these data, based on a statistical framework designed to define individual protein-protein interactions, led to the generation of a Drosophila protein interaction map (DPiM) encompassing 556 protein complexes.
View Article and Find Full Text PDFWe describe a method for high-throughput production of protein expression-ready clones. Open-reading frames (ORFs) are amplified by PCR from sequence-verified cDNA clones and subcloned into an appropriate loxP-containing donor vector. Each ORF is represented by two types of clones, one containing the native stop codon for expression of the native protein or amino-terminal fusion constructs and the other made without the stop codon to allow for carboxy-terminal fusion constructs.
View Article and Find Full Text PDF