Empirical studies of genotype-phenotype-fitness maps of proteins are fundamental to understanding the evolutionary process because they elucidate the space of genotypes accessible through mutation within a landscape of phenotypes and fitness effects. Yet comprehensively mapping molecular fitness landscapes remains challenging, since all possible combinations of amino acid substitutions at even a few protein sites are encoded by an enormous genotype space. High-throughput mapping of genotype space can be achieved using large-scale screening experiments known as multiplexed assays of variant effect (MAVEs). However, to accommodate such multi-mutational studies, MAVEs have grown to the point where sampling requirements need to be determined a priori. To address this problem, we propose calculations and simulation methods to approximate minimum sampling requirements for multi-mutational MAVEs, which we combine with a new library construction protocol to experimentally validate our approximation approaches. Analysis of our simulated data reveals how sampling trajectories differ between simulations of nucleotide versus amino acid variants and among mutagenesis schemes. Specifically, we show quantitatively that marginal gains in sampling efficiency demand increasingly greater sampling effort when sampling nucleotide sequences rather than their encoded amino acid equivalents. We present a new library construction protocol that efficiently maximizes sequence variation, and demonstrate using ultradeep sequencing that the library encodes virtually all possible combinations of mutations within the experimental design. Insights from our analyses, together with the methodological advances reported herein, are immediately applicable to pooled experimental screens of arbitrary design, enabling further assay upscaling and expanded testing of genotype space.
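To give a rough sense of the scale involved, the sketch below estimates how many reads might be needed to observe a given fraction of a combinatorial library. It is not the calculation or simulation method of the paper, which the abstract does not specify. It is a textbook coupon-collector style approximation that assumes every variant is sampled uniformly and independently with replacement. The function names, the choice of three sites, the 95% target, and the use of 32 codons per site (as in an NNK degenerate-codon scheme) are all illustrative assumptions.

```python
import math


def expected_coverage(n_variants: int, n_samples: int) -> float:
    """Expected fraction of distinct variants seen at least once.

    Assumes uniform sampling with replacement (an idealization; real
    libraries are usually skewed, which raises the required depth).
    """
    # Probability that one given variant is missed by all n_samples draws
    # is (1 - 1/n_variants) ** n_samples; log1p keeps this stable.
    return 1.0 - math.exp(n_samples * math.log1p(-1.0 / n_variants))


def samples_for_coverage(n_variants: int, target: float) -> int:
    """Smallest sample count whose expected coverage reaches `target`."""
    return math.ceil(math.log(1.0 - target) / math.log1p(-1.0 / n_variants))


# Hypothetical design: 3 fully randomized sites.
sites = 3
aa_space = 20 ** sites    # amino acid combinations
codon_space = 32 ** sites  # nucleotide combinations, assuming 32 codons per site

for label, size in [("amino acid", aa_space), ("nucleotide", codon_space)]:
    needed = samples_for_coverage(size, 0.95)
    print(f"{label}: {size} variants, ~{needed} samples for 95% expected coverage")
```

Under these assumptions the required depth grows in proportion to the size of the variant space, and each additional increment of coverage costs more samples than the last. Treating amino acid variants as uniformly sampled is a simplification, since codon degeneracy makes some amino acids more likely than others in a real library.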

Source
http://dx.doi.org/10.1007/s00239-024-10179-8
