A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2024
Understanding natural protein evolution and designing novel proteins are motivating interest in development of high-throughput methods to explore large sequence spaces. In this work, we demonstrate the application of multisite λ dynamics (MSλD), a rigorous free energy simulation method, and chemical denaturation experiments to quantify evolutionary selection pressure from sequence-stability relationships and to address questions of design. This study examines a mesophilic phylogenetic clade of ribonuclease H (RNase H), furthering its extensive characterization in earlier studies, focusing on RNase H (ecRNH) and a more stable consensus sequence (AncCcons) differing at 15 positions.
View Article and Find Full Text PDFA protein sequence encodes its energy landscape - all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs - questioning the differences and suggesting that both approaches serve as general methods to engineer thermostability.
View Article and Find Full Text PDFIn addition to encoding the tertiary fold and stability, the primary sequence of a protein encodes the folding trajectory and kinetic barriers that determine the speed of folding. How these kinetic barriers are encoded is not well understood. Here, we use evolutionary sequence variation in the α-lytic protease (αLP) protein family to probe the relationship between sequence and energy landscape.
View Article and Find Full Text PDFMany bacteria employ a protein organelle, the carboxysome, to catalyze carbon dioxide fixation in the Calvin Cycle. Only 10 genes from Halothiobacillus neapolitanus are sufficient for heterologous expression of carboxysomes in Escherichia coli, opening the door to detailed mechanistic analysis of the assembly process of this complex (more than 200MDa). One of these genes, csoS2, has been implicated in assembly but ascribing a molecular function is confounded by the observation that the single csoS2 gene yields expression of two gene products and both display an apparent molecular weight incongruent with the predicted amino acid sequence.
View Article and Find Full Text PDF