Publications by authors named "Claudia C Weber"

The recent acceleration in genome sequencing targeting previously unexplored parts of the tree of life presents computational challenges. Samples collected from the wild often contain sequences from several organisms, including the target, its cobionts, and contaminants. Effective methods are therefore needed to separate sequences.

View Article and Find Full Text PDF

How can we best learn the history of a protein's evolution? Ideally, a model of sequence evolution should capture both the process that generates genetic variation and the functional constraints determining which changes are fixed. However, in practical terms the most suitable approach may simply be the one that combines the convenience of easily available input data with the ability to return useful parameter estimates. For example, we might be interested in a measure of the strength of selection (typically obtained using a codon model) or an ancestral structure (obtained using structural modeling based on inferred amino acid sequence and side chain configuration).

View Article and Find Full Text PDF

Substitutions between chemically distant amino acids are known to occur less frequently than those between more similar amino acids. This knowledge, however, is not reflected in most codon substitution models, which treat all nonsynonymous changes as if they were equivalent in terms of impact on the protein. A variety of methods for integrating chemical distances into models have been proposed, with a common approach being to divide substitutions into radical or conservative categories.

View Article and Find Full Text PDF

That population size affects the fate of new mutations arising in genomes, modulating both how frequently they arise and how efficiently natural selection is able to filter them, is well established. It is therefore clear that these distinct roles for population size that characterize different processes should affect the evolution of proteins and need to be carefully defined. Empirical evidence is consistent with a role for demography in influencing protein evolution, supporting the idea that functional constraints alone do not determine the composition of coding sequences.

View Article and Find Full Text PDF

Improvements in the description of amino acid substitution are required to develop better pseudo-energy-based protein structure-aware models for use in phylogenetic studies. These models are used to characterize the probabilities of amino acid substitution and enable better simulation of protein sequences over a phylogeny. A better characterization of amino acid substitution probabilities in turn enables numerous downstream applications, like detecting positive selection, ancestral sequence reconstruction, and evolutionarily-motivated protein engineering.

View Article and Find Full Text PDF

The computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models that assume that all sites evolve at the same rate and under the same process. However, this assumption is frequently violated because protein evolution is highly heterogeneous due to different selective constraints among sites.

View Article and Find Full Text PDF

The origin and evolutionary dynamics of the spatial heterogeneity in genomic base composition have been debated since its discovery in the 1970s. With the recent availability of numerous genome sequences from a wide range of species it has been possible to address this question from a comparative perspective, and similarities and differences in base composition between groups of organisms are becoming evident. Ample evidence suggests that the contrasting dynamics of base composition are driven by GC-biased gene conversion (gBGC), a process that is associated with meiotic recombination.

View Article and Find Full Text PDF

Background: Determining the evolutionary relationships among the major lineages of extant birds has been one of the biggest challenges in systematic biology. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders. We used these genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomic analyses.

View Article and Find Full Text PDF

Background: The ratio of the rates of non-synonymous and synonymous substitution (dN/dS) is commonly used to estimate selection in coding sequences. It is often suggested that, all else being equal, dN/dS should be lower in populations with large effective size (Ne) due to increased efficacy of purifying selection. As Ne is difficult to measure directly, life history traits such as body mass, which is typically negatively associated with population size, have commonly been used as proxies in empirical tests of this hypothesis.

View Article and Find Full Text PDF

To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species.

View Article and Find Full Text PDF

Hepadnaviridae are double-stranded DNA viruses that infect some species of birds and mammals. This includes humans, where hepatitis B viruses (HBVs) are prevalent pathogens in considerable parts of the global population. Recently, endogenized sequences of HBVs (eHBVs) have been discovered in bird genomes where they constitute direct evidence for the coexistence of these viruses and their hosts from the late Mesozoic until present.

View Article and Find Full Text PDF

Background: While effective population size (Ne) and life history traits such as generation time are known to impact substitution rates, their potential effects on base composition evolution are less well understood. GC content increases with decreasing body mass in mammals, consistent with recombination-associated GC biased gene conversion (gBGC) more strongly impacting these lineages. However, shifts in chromosomal architecture and recombination landscapes between species may complicate the interpretation of these results.

View Article and Find Full Text PDF

Several reports from mammals indicate that an increase in the mutation rate in late-replicating regions may, in part, be responsible for the observed genomic heterogeneity in neutral substitution rates and levels of diversity, although the mechanisms for this remain poorly understood. Recent evidence also suggests that late replication is associated with high mutability in yeast. This then raises the question as to whether a similar effect is operating across all eukaryotes.

View Article and Find Full Text PDF

Background: Gene order in eukaryotic genomes is not random, with genes with similar expression profiles tending to cluster. In yeasts, the model taxon for gene order analysis, such syntenic clusters of non-homologous genes tend to be conserved over evolutionary time. Whether similar clusters show gene order conservation in other lineages is, however, undecided.

View Article and Find Full Text PDF

Recent evidence suggests that germline transcription may affect both protein evolutionary rates, possibly mediated by repair processes, and recombination rates, possibly mediated by chromatin and epigenetic modification. Here, we test these propositions in Drosophila melanogaster. The challenge for such analyses is to provide defendable measures of germline gene expression.

View Article and Find Full Text PDF

There is considerable variation in the rate at which different proteins evolve. Why is this? Classically, it has been considered that the density of functionally important sites must predict rates of protein evolution. Likewise, amino acid choice is usually assumed to reflect optimal protein function.

View Article and Find Full Text PDF

Theory predicts that, owing to reduced Hill-Robertson interference, genomic regions with high crossing-over rates should experience more efficient selection. In Saccharomyces cerevisiae a negative correlation between the local recombination rate, assayed as meiotic double-strand breaks (DSBs), and the local rate of protein evolution has been considered consistent with such a model. Although DSBs are a prerequisite for crossing-over, they need not result in crossing-over.

View Article and Find Full Text PDF

Mutations in the presenilins (PS) account for the majority of familial Alzheimer disease (FAD) cases. To test the hypothesis that oxidative stress can underlie the deleterious effects of presenilin mutations, we analyzed lipid peroxidation products (4-hydroxynonenal (HNE) and malondialdehyde) and antioxidant defenses in brain tissue and levels of reactive oxygen species (ROS) in splenic lymphocytes from transgenic mice bearing human PS1 with the M146L mutation (PS1M146L) compared to those from mice transgenic for wild-type human PS1 (PS1wt) and nontransgenic littermate control mice. In brain tissue, HNE levels were increased only in aged (19-22 months) PS1M146L transgenic animals compared to PS1wt mice and not in young (3-4 months) or middle-aged mice (13-15 months).

View Article and Find Full Text PDF