Publications by authors named "Eugene Koonin"

Viroids, the agents of several plant diseases, are the smallest and simplest known replicators that consist of covalently closed circular (ccc) RNA molecules between 200 and 400 nucleotides in size. Viroids encode no proteins and rely on host RNA polymerases for replication, but some contain ribozymes involved in replication intermediate processing. Although other viroid-like agents with cccRNAs genomes, such as satellite RNAs, ribozyviruses and retrozymes, have been discovered, until recently, the spread of these agents in the biosphere appeared narrow, and their actual diversity and evolution remained poorly understood.

View Article and Find Full Text PDF

Unlabelled: Metatranscriptomics is uncovering more and more diverse families of viruses with RNA genomes comprising the viral kingdom Orthornavirae in the realm Riboviria. Thorough protein annotation and comparison are essential to get insights into the functions of viral proteins and virus evolution. In addition to sequence- and hmm profile‑based methods, protein structure comparison adds a powerful tool to uncover protein functions and relationships.

View Article and Find Full Text PDF

CRISPR are adaptive immunity systems that protect bacteria and archaea from viruses and other mobile genetic elements (MGE) via an RNA-guided interference mechanism. However, in the course of the host-parasite co-evolution, CRISPR systems have been recruited by MGE themselves for counter-defense or other functions. Some bacteriophages encode fully functional CRISPR systems that target host defense systems, and many others recruited individual components of CRISPR systems, such as single repeat units that inhibit host CRISPR systems and CRISPR mini-arrays that target related viruses contributing to inter-virus competition.

View Article and Find Full Text PDF

Unlabelled: Anellovirus infections are ubiquitous in mammals but lack any clear disease association, suggesting a commensal virus-host relationship. Although anelloviruses have been identified in numerous mammalian hosts, their presence in members of the family Delphinidae has yet to be reported. Here, using a metagenomic approach, we characterize complete anellovirus genomes ( = 69) from four Delphinidae host species: short-finned pilot whale (, = 19), killer whale (, = 9), false killer whale (, = 6), and pantropical spotted dolphin (, = 1).

View Article and Find Full Text PDF

The Clusters of Orthologous Genes (COG) database, originally created in 1997, has been updated to reflect the constantly growing collection of completely sequenced prokaryotic genomes. This update increased the genome coverage from 1309 to 2296 species, including 2103 bacteria and 193 archaea, in most cases, with a single representative genome per genus. This set covers all genera of bacteria and archaea that included organisms with 'complete genomes' as per NCBI databases in November 2023.

View Article and Find Full Text PDF

The most abundant clustered regularly interspaced short palindromic repeats (CRISPR) type I systems employ a multisubunit RNA-protein effector complex (Cascade), with varying protein composition and activity. The Escherichia coli Cascade complex consists of 11 protein subunits and functions as an effector through CRISPR RNA (crRNA) binding, protospacer adjacent motif (PAM)-specific double-stranded DNA targeting, R-loop formation, and Cas3 helicase-nuclease recruitment for target DNA cleavage. Here, we present a biochemical reconstruction of the E.

View Article and Find Full Text PDF

Bacterial and archaeal genomes encompass numerous operons that typically consist of two to five genes. On larger scales, however, gene order is poorly conserved through the evolution of prokaryotes. Nevertheless, non-random localization of different classes of genes on prokaryotic chromosomes could reflect important functional and evolutionary constraints.

View Article and Find Full Text PDF

Background: Viruses with double-stranded (ds) DNA genomes in the realm Duplodnaviria share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.

View Article and Find Full Text PDF

SUMMARYPolintons are 15-20 kb-long self-synthesizing transposons that are widespread in eukaryotic, and in particular protist, genomes. Apart from a transposase and a protein-primed DNA polymerase, polintons encode homologs of major and minor jelly-roll capsid proteins, DNA-packaging ATPases, and proteases involved in capsid maturation of diverse eukaryotic viruses of kingdom . Given the conservation of these structural and morphogenetic proteins among polintons, these elements are predicted to alternate between transposon and viral lifestyles and, although virions have thus far not been detected, are classified as viruses (class ) in the phylum .

View Article and Find Full Text PDF

Background: Microbiomes are generally characterized by high diversity of coexisting microbial species and strains, and microbiome composition typically remains stable across a broad range of conditions. However, under fixed conditions, microbial ecology conforms with the exclusion principle under which two populations competing for the same resource within the same niche cannot coexist because the less fit population inevitably goes extinct. Therefore, the long-term persistence of microbiome diversity calls for an explanation.

View Article and Find Full Text PDF

Bacterial and archaeal genomes encompass numerous operons that typically consist of two to five genes. On larger scales, however, gene order is poorly conserved through the evolution of prokaryotes. Nevertheless, non-random localization of different classes of genes on prokaryotic chromosomes could reflect important functional and evolutionary constraints.

View Article and Find Full Text PDF

Microproteins encoded by small open reading frames (smORFs) comprise the "dark matter" of proteomes. Although functional microproteins were identified in diverse organisms from all three domains of life, bacterial smORFs remain poorly characterized. In this comprehensive study of intergenic smORFs (ismORFs, 15-70 codons) in 5,668 bacterial genomes of the family Enterobacteriaceae, we identified 67,297 clusters of ismORFs subject to purifying selection.

View Article and Find Full Text PDF

Viruses with double-stranded (ds) DNA genomes in the realm share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.

View Article and Find Full Text PDF

Endosomal Sorting Complexes Required for Transport (ESCRT) play key roles in protein sorting between membrane-bounded compartments of eukaryotic cells. Homologs of many ESCRT components are identifiable in various groups of archaea, especially in Asgardarchaeota, the archaeal phylum that is currently considered to include the closest relatives of eukaryotes, but not in bacteria. We performed a comprehensive search for ESCRT protein homologs in archaea and reconstructed ESCRT evolution using the phylogenetic tree of Vps4 ATPase (ESCRT IV) as a scaffold, using sensitive protein sequence analysis and comparison of structural models to identify previously unknown ESCRT proteins.

View Article and Find Full Text PDF

Unlabelled: The phylum consists of large and giant viruses that range in genome size from about 100 kilobases (kb) to more than 2.5 megabases. Here, using metagenome mining followed by extensive phylogenomic analysis and protein structure comparison, we delineate a distinct group of viruses with double-stranded (ds) DNA genomes in the range of 35-45 kb that appear to be related to the .

View Article and Find Full Text PDF

The phylum (kingdom , realm ) is a broad assemblage of diverse viruses with comparatively short double-stranded DNA genomes (<50 kbp) that produce icosahedral capsids built from double jelly-roll major capsid proteins. Preplasmiviricots infect hosts from all cellular domains, testifying to their ancient origin, and, in particular, are associated with six of the seven supergroups of eukaryotes. Preplasmiviricots comprise four major groups of viruses, namely, polintons, polinton-like viruses (PLVs), virophages, and adenovirids.

View Article and Find Full Text PDF

Over the course of multiple divisions, cells accumulate diverse nongenetic, somatic damage including misfolded and aggregated proteins and cell wall defects. If the rate of damage accumulation exceeds the rate of dilution through cell growth, a dedicated mitigation strategy is required to prevent eventual population collapse. Strategies for somatic damage control can be divided into two categories, asymmetric allocation and repair, which are not, in principle, mutually exclusive.

View Article and Find Full Text PDF

Trypanosomatids (Euglenozoa) are a diverse group of unicellular flagellates predominately infecting insects (monoxenous species) or circulating between insects and vertebrates or plants (dixenous species). Monoxenous trypanosomatids harbor a wide range of RNA viruses belonging to the families , , and a putative group of tombus-like viruses. Here, we focus on the subfamily Blastocrithidiinae, a previously unexplored divergent group of monoxenous trypanosomatids comprising two related genera: and .

View Article and Find Full Text PDF

A comprehensive census of McrBC systems, among the most common forms of prokaryotic Type IV restriction systems, followed by phylogenetic analysis, reveals their enormous abundance in diverse prokaryotes and a plethora of genomic associations. We focus on a previously uncharacterized branch, which we denote iled-il clease andems (CoCoNuTs) for their salient features: the presence of extensive coiled-coil structures and tandem nucleases. The CoCoNuTs alone show extraordinary variety, with three distinct types and multiple subtypes.

View Article and Find Full Text PDF

The phylum (kingdom , realm ) is a broad assemblage of diverse viruses with comparatively short double-stranded DNA genomes (<50 kbp) that produce icosahedral capsids built from double jelly-roll major capsid proteins. Preplasmiviricots infect hosts from all cellular domains, testifying to their ancient origin and, in particular, are associated with six of the seven supergroups of eukaryotes. Preplasmiviricots comprise four major groups of viruses, namely, polintons, polinton-like viruses (PLVs), virophages, and adenovirids.

View Article and Find Full Text PDF

In silico identification of viral anti-CRISPR proteins (Acrs) has relied largely on the guilt-by-association method using known Acrs or anti-CRISPR associated proteins (Acas) as the bait. However, the low number and limited spread of the characterized archaeal Acrs and Aca hinders our ability to identify Acrs using guilt-by-association. Here, based on the observation that the few characterized archaeal Acrs and Aca are transcribed immediately post viral infection, we hypothesize that these genes, and many other unidentified anti-defense genes (ADG), are under the control of conserved regulatory sequences including a strong promoter, which can be used to predict anti-defense genes in archaeal viruses.

View Article and Find Full Text PDF

Type VI CRISPR-Cas systems are among the few CRISPR varieties that target exclusively RNA. The CRISPR RNA-guided, sequence-specific binding of target RNAs, such as phage transcripts, activates the type VI effector, Cas13. Once activated, Cas13 causes collateral RNA cleavage, which induces bacterial cell dormancy, thus protecting the host population from the phage spread.

View Article and Find Full Text PDF

Horizontal gene transfer (HGT) is a fundamental process in prokaryotic evolution, contributing significantly to diversification and adaptation. HGT is typically facilitated by mobile genetic elements (MGEs), such as conjugative plasmids and phages, which often impose fitness costs on their hosts. However, a considerable number of bacterial genes are involved in defence mechanisms that limit the propagation of MGEs, suggesting they may actively restrict HGT.

View Article and Find Full Text PDF

The phylum consists of large and giant viruses that range in genome size from about 100 kilobases (kb) to more than 2.5 megabases. Here, using metagenome mining followed by extensive phylogenomic analysis and protein structure comparison, we delineate a distinct group of viruses with double-stranded (ds) DNA genomes in the range of 35-45 kb that appear to be related to the .

View Article and Find Full Text PDF

Cell division is fundamental to all cellular life. Most archaea depend on either the prokaryotic tubulin homologue FtsZ or the endosomal sorting complex required for transport for division but neither system has been robustly characterized. Here, we show that three of the four photosynthesis reaction centre barrel domain proteins of Haloferax volcanii (renamed cell division proteins B1/2/3 (CdpB1/2/3)) play important roles in cell division.

View Article and Find Full Text PDF