Advances in cellular, molecular, and disease biology depend on the comprehensive characterization of gene interactions and pathways. Traditionally, these pathways are curated manually, limiting their efficient annotation and, potentially, reinforcing field-specific bias. Here, in order to test objective and automated identification of functionally cooperative genes, we compared a novel algorithm with three established methods to search for communities within gene interaction networks.
View Article and Find Full Text PDFNegative autoregulation is universally found across organisms. In the bacterium Escherichia coli, transcription factors often repress their own expression to form a negative feedback network motif that enables robustness to changes in biochemical parameters. Here we present a simple phenomenological model of a negative feedback transcription factor repressing both itself and another target gene.
View Article and Find Full Text PDFThe structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions.
View Article and Find Full Text PDFGenome-wide association studies (GWAS) and whole-exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease-causing mutations are exonic non-synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution.
View Article and Find Full Text PDFNatural selection for specific functions places limits upon the amino acid substitutions a protein can accept. Mechanisms that expand the range of tolerable amino acid substitutions include chaperones that can rescue destabilized proteins and additional stability-enhancing substitutions. Here, we present an alternative mechanism that is simple and uses a frequently encountered network motif.
View Article and Find Full Text PDFUnderstanding the molecular basis of protein function remains a central goal of biology, with the hope to elucidate the role of human genes in health and in disease, and to rationally design therapies through targeted molecular perturbations. We review here some of the computational techniques and resources available for characterizing a critical aspect of protein function - those mediated by protein-protein interactions (PPI). We describe several applications and recent successes of the Evolutionary Trace (ET) in identifying molecular events and shapes that underlie protein function and specificity in both eukaryotes and prokaryotes.
View Article and Find Full Text PDFMutations in the genes encoding cartilage associated protein (CRTAP) and prolyl 3-hydroxylase 1 (P3H1 encoded by LEPRE1) were the first identified causes of recessive Osteogenesis Imperfecta (OI). These proteins, together with cyclophilin B (encoded by PPIB), form a complex that 3-hydroxylates a single proline residue on the α1(I) chain (Pro986) and has cis/trans isomerase (PPIase) activity essential for proper collagen folding. Recent data suggest that prolyl 3-hydroxylation of Pro986 is not required for the structural stability of collagen; however, the absence of this post-translational modification may disrupt protein-protein interactions integral for proper collagen folding and lead to collagen over-modification.
View Article and Find Full Text PDFRobustness is a property built into biological systems to ensure stereotypical outcomes despite fluctuating inputs from gene dosage, biochemical noise, and the environment. During development, robustness safeguards embryos against structural and functional defects. Yet, our understanding of how robustness is achieved in embryos is limited.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
November 2013
Structural Genomics aims to elucidate protein structures to identify their functions. Unfortunately, the variation of just a few residues can be enough to alter activity or binding specificity and limit the functional resolution of annotations based on sequence and structure; in enzymes, substrates are especially difficult to predict. Here, large-scale controls and direct experiments show that the local similarity of five or six residues selected because they are evolutionarily important and on the protein surface can suffice to identify an enzyme activity and substrate.
View Article and Find Full Text PDFMotivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs.
View Article and Find Full Text PDFThe auxiliary factor DksA is a global transcription regulator and, with the help of ppGpp, controls the nutritional stress response in Escherichia coli. Although the consequences of its modulation of RNA polymerase (RNAP) are becoming better explained, it is still not fully understood how the two proteins interact. We employed a series of genetic suppressor selections to find residues in RNAP that alter its sensitivity to DksA.
View Article and Find Full Text PDFUnlabelled: Most proteins lack experimentally validated functions. To address this problem, we implemented the Evolutionary Trace Annotation (ETA) method in the Cytoscape network visualization environment. The result is the ETAscape plugin, which builds a structural genomics network based on local structural and evolutionary similarities among proteins and then globally diffuses known annotations across the resulting network.
View Article and Find Full Text PDFBioinformatics
August 2012
Summary: Understanding the differences between knotted and unknotted protein structures may offer insights into how proteins fold. To characterize the type of knot in a protein, we have developed PyKnot, a plugin that works seamlessly within the PyMOL molecular viewer and gives quick results including the knot's invariants, crossing numbers and simplified knot projections and backbones. PyKnot may be useful to researchers interested in classifying knots in macromolecules and provides tools for students of biology and chemistry with which to learn topology and macromolecular visualization.
View Article and Find Full Text PDFThe evolutionary trace (ET) is the single most validated approach to identify protein functional determinants and to target mutational analysis, protein engineering and drug design to the most relevant sites of a protein. It applies to the entire proteome; its predictions come with a reliability score; and its results typically reach significance in most protein families with 20 or more sequence homologs. In order to identify functional hot spots, ET scans a multiple sequence alignment for residue variations that correlate with major evolutionary divergences.
View Article and Find Full Text PDFRecA plays a key role in homologous recombination, the induction of the DNA damage response through LexA cleavage and the activity of error-prone polymerase in Escherichia coli. RecA interacts with multiple partners to achieve this pleiotropic role, but the structural location and sequence determinants involved in these multiple interactions remain mostly unknown. Here, in a first application to prokaryotes, Evolutionary Trace (ET) analysis identifies clusters of evolutionarily important surface amino acids involved in RecA functions.
View Article and Find Full Text PDFSummary: PyETV is a PyMOL plugin for viewing, analyzing and manipulating predictions of evolutionarily important residues and sites in protein structures and their complexes. It seamlessly captures the output of the Evolutionary Trace server, namely ranked importance of residues, for multiple chains of a complex. It then yields a high resolution graphical interface showing their distribution and clustering throughout a quaternary structure, including at interfaces.
View Article and Find Full Text PDFWe suggest and discuss a simple model of an ideal gas under the piston to gain an insight into the workings of the Jarzynski identity connecting the average exponential of the work over the nonequilibrium trajectories with the equilibrium free energy. We show that the identity is valid for our system, due to the very rapid molecules belonging to the tail of the Maxwell distribution. For the most interesting extreme, when the system volume is large, while the piston is moving with great speed (compared to thermal velocity) for a very short time, the necessary number of independent experimental runs to obtain a reasonable approximation for the free energy from averaging the nonequilibrium work grows exponentially with the system size.
View Article and Find Full Text PDFPLoS Comput Biol
May 2006
Like shoelaces, the backbones of proteins may get entangled and form knots. However, only a few knots in native proteins have been identified so far. To more quantitatively assess the rarity of knots in proteins, we make an explicit comparison between the knotting probabilities in native proteins and in random compact loops.
View Article and Find Full Text PDFPhys Rev E Stat Nonlin Soft Matter Phys
December 2005
Motivated by experiments in which single-stranded DNA with a short hairpin loop at one end undergoes unforced diffusion through a narrow pore, we study the first passage times for a particle, executing one-dimensional Brownian motion in an asymmetric sawtooth potential, to exit one of the boundaries. We consider the first passage times for the case of classical diffusion, characterized by a mean-square displacement of the form <(Delta(x))2> approximately t, and for the case of anomalous diffusion or subdiffusion, characterized by a mean-square displacement of the form <(Delta(x))2> approximately t(gamma) with 0
Numerical studies of the average size of trivially knotted polymer loops with no excluded volume were undertaken. Topology was identified by Alexander and Vassiliev degree 2 invariants. Probability of a trivial knot, average gyration radius, and probability density distributions as functions of gyration radius were generated for loops of up to N = 3,000 segments.
View Article and Find Full Text PDF