The Clusters of Orthologous Genes (COG) database, originally created in 1997, has been updated to reflect the constantly growing collection of completely sequenced prokaryotic genomes. This update increased the genome coverage from 1309 to 2296 species, including 2103 bacteria and 193 archaea, in most cases, with a single representative genome per genus. This set covers all genera of bacteria and archaea that included organisms with 'complete genomes' as per NCBI databases in November 2023.
View Article and Find Full Text PDFBacterial and archaeal genomes encompass numerous operons that typically consist of two to five genes. On larger scales, however, gene order is poorly conserved through the evolution of prokaryotes. Nevertheless, non-random localization of different classes of genes on prokaryotic chromosomes could reflect important functional and evolutionary constraints.
View Article and Find Full Text PDFBackground: Viruses with double-stranded (ds) DNA genomes in the realm Duplodnaviria share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.
View Article and Find Full Text PDFBackground: Microbiomes are generally characterized by high diversity of coexisting microbial species and strains, and microbiome composition typically remains stable across a broad range of conditions. However, under fixed conditions, microbial ecology conforms with the exclusion principle under which two populations competing for the same resource within the same niche cannot coexist because the less fit population inevitably goes extinct. Therefore, the long-term persistence of microbiome diversity calls for an explanation.
View Article and Find Full Text PDFBacterial and archaeal genomes encompass numerous operons that typically consist of two to five genes. On larger scales, however, gene order is poorly conserved through the evolution of prokaryotes. Nevertheless, non-random localization of different classes of genes on prokaryotic chromosomes could reflect important functional and evolutionary constraints.
View Article and Find Full Text PDFViruses with double-stranded (ds) DNA genomes in the realm share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.
View Article and Find Full Text PDFEndosomal Sorting Complexes Required for Transport (ESCRT) play key roles in protein sorting between membrane-bounded compartments of eukaryotic cells. Homologs of many ESCRT components are identifiable in various groups of archaea, especially in Asgardarchaeota, the archaeal phylum that is currently considered to include the closest relatives of eukaryotes, but not in bacteria. We performed a comprehensive search for ESCRT protein homologs in archaea and reconstructed ESCRT evolution using the phylogenetic tree of Vps4 ATPase (ESCRT IV) as a scaffold, using sensitive protein sequence analysis and comparison of structural models to identify previously unknown ESCRT proteins.
View Article and Find Full Text PDFOver the course of multiple divisions, cells accumulate diverse nongenetic, somatic damage including misfolded and aggregated proteins and cell wall defects. If the rate of damage accumulation exceeds the rate of dilution through cell growth, a dedicated mitigation strategy is required to prevent eventual population collapse. Strategies for somatic damage control can be divided into two categories, asymmetric allocation and repair, which are not, in principle, mutually exclusive.
View Article and Find Full Text PDFTrypanosomatids (Euglenozoa) are a diverse group of unicellular flagellates predominately infecting insects (monoxenous species) or circulating between insects and vertebrates or plants (dixenous species). Monoxenous trypanosomatids harbor a wide range of RNA viruses belonging to the families , , and a putative group of tombus-like viruses. Here, we focus on the subfamily Blastocrithidiinae, a previously unexplored divergent group of monoxenous trypanosomatids comprising two related genera: and .
View Article and Find Full Text PDFA comprehensive census of McrBC systems, among the most common forms of prokaryotic Type IV restriction systems, followed by phylogenetic analysis, reveals their enormous abundance in diverse prokaryotes and a plethora of genomic associations. We focus on a previously uncharacterized branch, which we denote iled-il clease andems (CoCoNuTs) for their salient features: the presence of extensive coiled-coil structures and tandem nucleases. The CoCoNuTs alone show extraordinary variety, with three distinct types and multiple subtypes.
View Article and Find Full Text PDFHorizontal gene transfer (HGT) is a fundamental process in prokaryotic evolution, contributing significantly to diversification and adaptation. HGT is typically facilitated by mobile genetic elements (MGEs), such as conjugative plasmids and phages, which often impose fitness costs on their hosts. However, a considerable number of bacterial genes are involved in defence mechanisms that limit the propagation of MGEs, suggesting they may actively restrict HGT.
View Article and Find Full Text PDFHorizontal gene transfer (HGT) is a fundamental process in the evolution of prokaryotes, making major contributions to diversification and adaptation. Typically, HGT is facilitated by mobile genetic elements (MGEs), such as conjugative plasmids and phages that generally impose fitness costs on their hosts. However, a substantial fraction of bacterial genes is involved in defense mechanisms that limit the propagation of MGEs, raising the possibility that they can actively restrict HGT.
View Article and Find Full Text PDFEndosomal sorting complexes required for transport (ESCRT) play key roles in protein sorting between membrane-bounded compartments of eukaryotic cells. Homologs of many ESCRT components are identifiable in various groups of archaea, especially in Asgardarchaeota, the archaeal phylum that is currently considered to include the closest relatives of eukaryotes, but not in bacteria. We performed a comprehensive search for ESCRT protein homologs in archaea and reconstructed ESCRT evolution using the phylogenetic tree of Vps4 ATPase (ESCRT IV) as a scaffold and using sensitive protein sequence analysis and comparison of structural models to identify previously unknown ESCRT proteins.
View Article and Find Full Text PDFMicrobiomes are generally characterized by high diversity of coexisting microbial species and strains that remains stable within a broad range of conditions. However, under fixed conditions, microbial ecology conforms with the exclusion principle under which two populations competing for the same resource within the same niche cannot coexist because the less fit population inevitably goes extinct. To explore the conditions for stabilization of microbial diversity, we developed a simple mathematical model consisting of two competing populations that could exchange a single gene allele via horizontal gene transfer (HGT).
View Article and Find Full Text PDFThe archaeal ancestor of eukaryotes apparently belonged to the phylum Asgardarchaeota, but the ecology and evolution of Asgard archaea are poorly understood. The optimal GDP-binding temperature of a translation elongation factor (EF-1A or EF-Tu) has been previously shown to correlate with the optimal growth temperature of diverse prokaryotes. Here, we reconstruct ancestral EF-1A sequences and experimentally measure the optimal GDP-binding temperature of EF-1A from ancient and extant Asgard archaea, to infer the evolution of optimal growth temperatures in Asgardarchaeota.
View Article and Find Full Text PDFThe identification of microbial genes essential for survival as those with lethal knockout phenotype (LKP) is a common strategy for functional interrogation of genomes. However, interpretation of the LKP is complicated because a substantial fraction of the genes with this phenotype remains poorly functionally characterized. Furthermore, many genes can exhibit LKP not because their products perform essential cellular functions but because their knockout activates the toxicity of other genes (conditionally essential genes).
View Article and Find Full Text PDFOver the course of multiple divisions, cells accumulate diverse non-genetic, somatic damage including misfolded and aggregated proteins and cell wall defects. If the rate of damage accumulation exceeds the rate of dilution through cell growth, a dedicated mitigation strategy is required to prevent eventual population collapse. Strategies for somatic damage control can be divided into two categories, asymmetric allocation and repair, which are not, in principle, mutually exclusive.
View Article and Find Full Text PDFGenomes of bacteria and archaea contain a much larger fraction of unidirectional (serial) gene pairs than convergent or divergent gene pairs. Many of the unidirectional gene pairs have short overlaps of -4 nt and -1 nt. As shown previously, translation of the genes in overlapping unidirectional gene pairs is tightly coupled.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
November 2023
The TnpB proteins are transposon-associated RNA-guided nucleases that are among the most abundant proteins encoded in bacterial and archaeal genomes, but whose functions in the transposon life cycle remain unknown. TnpB appears to be the evolutionary ancestor of Cas12, the effector nuclease of type V CRISPR-Cas systems. We performed a comprehensive census of TnpBs in archaeal and bacterial genomes and constructed a phylogenetic tree on which we mapped various features of these proteins.
View Article and Find Full Text PDFis a family of negative-sense RNA viruses with genomes totalling about 10.3 kb. These viruses have been found in fish.
View Article and Find Full Text PDFis a family of negative-sense RNA viruses with genomes totaling about 12.3 kb that have been found in turtles. The tosovirid genome consists of two segments, each with two open reading frames (ORFs) in ambisense orientation.
View Article and Find Full Text PDFis a family of negative-sense RNA viruses with genomes of about 17.2 kb that have been found in snakes. The sunvirid genome comprises nonsegmented RNA with six open reading frames (ORFs) >1 kb that are predicted to encode six proteins.
View Article and Find Full Text PDFis a family of negative-sense RNA viruses with genomes of 7.3-8.2 kb that have been associated with crustaceans, insects, gastropods, and nematodes.
View Article and Find Full Text PDF