In a protein, interactions exist between amino acid residues that influence the protein's structural integrity or stability and thus affect its catalytic function. The loss of this interaction due to mutations in these amino acids usually leads to a non-functional protein. Probing the sequence space of a protein through mutations or recombinations, as performed in directed evolution to search for an improved variant, frequently results in such inactive sequences.
View Article and Find Full Text PDFIt is known that in the three-dimensional structure of a protein, certain amino acids can interact with each other in order to provide structural integrity or aid in its catalytic function. If these positions are mutated the loss of this interaction usually leads to a non-functional protein. Directed evolution experiments, which probe the sequence space of a protein through mutations in search for an improved variant, frequently result in such inactive sequences.
View Article and Find Full Text PDFPooling in directed-evolution experiments will greatly increase the throughput of screening systems, but important parameters such as the number of good mutants created and the activity level increase of the good mutants will depend highly on the protein being engineered. The authors developed and validated a Monte Carlo simulation model of pooling that allows the testing of various scenarios in silico before starting experimentation. Using a simplified test system of 2 enzymes, betagalactosidase (supermutant, or greatly improved enzyme) and beta-glucuronidase (dud, or enzyme with ancestral level of activity), the model accurately predicted the number of supermutants detected in experiments within a factor of 2.
View Article and Find Full Text PDFA method for identifying the positions in the amino acid sequence, which are critical for the catalytic activity of a protein using support vector machines (SVMs) is introduced and analysed. SVMs are supported by an efficient learning algorithm and can utilize some prior knowledge about the structure of the problem. The amino acid sequences of the variants of a protein, created by inducing mutations, along with their fitness are required as input data by the method to predict its critical positions.
View Article and Find Full Text PDF