With just four building blocks, low sequence information density, few functional groups, poor control over folding, and difficulties in forming compact folds, natural DNA and RNA have been disappointing platforms from which to evolve receptors, ligands, and catalysts. Accordingly, synthetic biology has created "artificially expanded genetic information systems" (AEGIS) to add nucleotides, functionality, and information density. With the expected improvements seen in AegisBodies and AegisZymes, the task for synthetic biologists shifts to developing for expanded DNA the same analytical tools available to natural DNA.
View Article and Find Full Text PDFMany efforts have sought to apply laboratory evolution (LIVE) to natural nucleic acid (NA) scaffolds to directly evolve functional molecules. However, synthetic biology can move beyond natural NA scaffolds to create molecular systems whose libraries are far richer reservoirs of functionality than natural NAs. For example, "artificially expanded genetic information systems" (AEGIS) add up to eight nucleotides to the four found in standard NA.
View Article and Find Full Text PDFA significant number of proteins are annotated as functionally uncharacterized proteins. Within this protocol, we describe how to use protein family multiple sequence alignments and structural bioinformatics resources to design loss-of-function mutations of previously uncharacterized proteins within the glycosyltransferase family. We detail approaches to determine target protein active sites using three-dimensional modeling.
View Article and Find Full Text PDFGlioblastoma (GBM) is the most aggressive primary brain tumor characterized by infiltrative growth of malignant glioma cells into the surrounding brain parenchyma. In this study, our analysis of GBM patient cohorts revealed a significantly higher expression of 8 () compared to normal brain tissue and could be associated with impaired patient survival. Increased expression of significantly enhanced migration of two different sphere-forming GBM cell lines.
View Article and Find Full Text PDFPfs48/45 and Pfs25 are leading candidates for the development of Plasmodium falciparum transmission blocking vaccines (TBV). Expression of Pfs48/45 in the erythrocytic sexual stages and presentation to the immune system during infection in the human host also makes it ideal for natural boosting. However, it has been challenging to produce a fully folded, functionally active Pfs48/45, using various protein expression platforms.
View Article and Find Full Text PDFIn addition to completing the Watson-Crick nucleobase matching "concept" (big pairs with small, hydrogen bond donors pair with hydrogen bond acceptors), artificially expanded genetic information systems (AEGIS) also challenge DNA polymerases with a complete set of mismatches, including wobble mismatches. Here, we explore wobble mismatches with AEGIS with DNA polymerase 1 from Escherichia coli. Remarkably, we find that the polymerase tolerates an AEGIS:standard wobble that has the same geometry as the G:T wobble that polymerases have evolved to exclude but excludes a wobble geometry that polymerases have never encountered in natural history.
View Article and Find Full Text PDFAxiomatically, the density of information stored in DNA, with just four nucleotides (GACT), is higher than in a binary code, but less than it might be if synthetic biologists succeed in adding independently replicating nucleotides to genetic systems. Such addition could also add functional groups not found in natural DNA, but useful for molecular performance. Here, we consider two new nucleotides (Z and P, 6-amino-5-nitro-3-(1'-β-D-2'-deoxyribo-furanosyl)-2(1H)-pyridone and 2-amino-8-(1'-β-D-2'-deoxyribofuranosyl)-imidazo[1,2-a]-1,3,5-triazin-4(8H)-one).
View Article and Find Full Text PDFMotivation: Despite advances in high-throughput sequencing, marine metagenomic samples remain largely opaque. A typical sample contains billions of microbial organisms from thousands of genomes and quadrillions of DNA base pairs. Its derived metagenomic dataset underrepresents this complexity by orders of magnitude because of the sparseness and shortness of sequencing reads.
View Article and Find Full Text PDFUnlabelled: More than 20% of all protein domains are currently annotated as "domains of unknown function" (DUFs). About 2,700 DUFs are found in bacteria compared with just over 1,500 in eukaryotes. Over 800 DUFs are shared between bacteria and eukaryotes, and about 300 of these are also present in archaea.
View Article and Find Full Text PDFSummary: The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive 'exploration-style' visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein-protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search).
View Article and Find Full Text PDFBRCT domains are versatile protein modular domains found as single units or as multiple copies in more than 20 different proteins in the human genome. Interestingly, most BRCT-containing proteins function in the same biological process, the DNA damage response network, but show specificity in their molecular interactions. BRCT domains have been found to bind a wide array of ligands from proteins, phosphorylated linear motifs, and DNA.
View Article and Find Full Text PDFAbstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed.
View Article and Find Full Text PDFBiofilms are dense microbial communities. Although widely distributed and medically important, how biofilm cells interact with one another is poorly understood. Recently, we described a novel process whereby myxobacterial biofilm cells exchange their outer membrane (OM) lipoproteins.
View Article and Find Full Text PDFBinary subcomplexes in proteins database (BISC) is a new protein-protein interaction (PPI) database linking up the two communities most active in their characterization: structural biology and functional genomics researchers. The BISC resource offers users (i) a structural perspective and related information about binary subcomplexes (i.e.
View Article and Find Full Text PDFBackground: HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries.
View Article and Find Full Text PDFPs230 is the largest representative of a 10-member family of proteins found in all Plasmodium species. The family is defined by partially conserved, cysteine-rich double domains that are approximately 350 aa in length and have one to three predicted disulfide bridges in each half. In Plasmodium falciparum, the most dangerous human malaria, Pf12 is the smallest member of the family, comprising just one double domain.
View Article and Find Full Text PDFNumerous mammalian proteins are constructed from a limited repertoire of module-types. Proteins belonging to the regulators of complement activation family--crucial for ensuring a complement-mediated immune response is targeted against infectious agents--are composed solely of complement control protein (CCP) modules. In the current study, CCP module sequences were grouped to allow selection of the most appropriate experimentally determined structures to serve as templates in an automated large-scale structure modelling procedure.
View Article and Find Full Text PDFThe alternative NADH:ubiquinone oxidoreductase (NDH-2) from Escherichia coli is a membrane protein playing a prominent role in respiration by linking the reduction of NADH to the quinone pool. Remote sequence similarity reveals an evolutionary relation between alternative NADH:quinone oxidoreductases and the SCOP-family "FAD/NAD-linked reductases". We have created a structural model for NDH-2 from E.
View Article and Find Full Text PDFThe chromosomal passenger complex of Aurora B kinase, INCENP, and Survivin has essential regulatory roles at centromeres and the central spindle in mitosis. Here, we describe Borealin, a novel member of the complex. Approximately half of Aurora B in mitotic cells is complexed with INCENP, Borealin, and Survivin; and Borealin binds Survivin and INCENP in vitro.
View Article and Find Full Text PDF