Publications by authors named "Igor Tolstoy"

Article Synopsis
  • Human microbiomes play a crucial role in health by impacting metabolism, immune functions, and neurological processes, but their complete complexity is still not fully understood.
  • The definition of a "healthy" microbiome is controversial due to variations in microbial communities and the difficulty in establishing a standard definition for health across different individuals and conditions.
  • The article highlights progress in microbiome research and identifies gaps in knowledge, proposing a roadmap that utilizes epidemiological methods to better understand the relationship between microbiomes and health.
View Article and Find Full Text PDF

Background: Viruses with double-stranded (ds) DNA genomes in the realm Duplodnaviria share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.

View Article and Find Full Text PDF

Viruses with double-stranded (ds) DNA genomes in the realm share a conserved structural gene module but show a broad range of variation in their repertoires of DNA replication proteins. Some of the duplodnaviruses encode (nearly) complete replication systems whereas others lack (almost) all genes required for replication, relying on the host replication machinery. DNA polymerases (DNAPs) comprise the centerpiece of the DNA replication apparatus.

View Article and Find Full Text PDF

Hydrocephalus, the leading indication for childhood neurosurgery worldwide, is particularly prevalent in low- and middle-income countries. Hydrocephalus preceded by an infection, or postinfectious hydrocephalus, accounts for up to 60% of hydrocephalus in these areas. Since many children with hydrocephalus suffer poor long-term outcomes despite surgical intervention, prevention of hydrocephalus remains paramount.

View Article and Find Full Text PDF

All sequencing projects of bacteriophages (phages) should seek to report an accurate and comprehensive annotation of their genomes. This article defines 14 questions for those new to phage genomics that should be addressed before submitting a genome sequence to the International Nucleotide Sequence Database Collaboration or writing a publication.

View Article and Find Full Text PDF

Type IV CRISPR-Cas are a distinct variety of highly derived CRISPR-Cas systems that appear to have evolved from type III systems through the loss of the target-cleaving nuclease and partial deterioration of the large subunit of the effector complex. All known type IV CRISPR-Cas systems are encoded on plasmids, integrative and conjugative elements (ICEs), or prophages, and are thought to contribute to competition between these elements, although the mechanistic details of their function remain unknown. There is a clear parallel between the compositions and likely origin of type IV and type I systems recruited by Tn7-like transposons and mediating RNA-guided transposition.

View Article and Find Full Text PDF

CrAssphage is the most abundant human-associated virus and the founding member of a large group of bacteriophages, discovered in animal-associated and environmental metagenomes, that infect bacteria of the phylum Bacteroidetes. We analyze 4907 Circular Metagenome Assembled Genomes (cMAGs) of putative viruses from human gut microbiomes and identify nearly 600 genomes of crAss-like phages that account for nearly 87% of the DNA reads mapped to these cMAGs. Phylogenetic analysis of conserved genes demonstrates the monophyly of crAss-like phages, a putative virus order, and of 5 branches, potential families within that order, two of which have not been identified previously.

View Article and Find Full Text PDF

Antimicrobial resistance (AMR) is a major public health problem that requires publicly available tools for rapid analysis. To identify AMR genes in whole-genome sequences, the National Center for Biotechnology Information (NCBI) has produced AMRFinder, a tool that identifies AMR genes using a high-quality curated AMR gene reference database. The Bacterial Antimicrobial Resistance Reference Gene Database consists of up-to-date gene nomenclature, a set of hidden Markov models (HMMs), and a curated protein family hierarchy.

View Article and Find Full Text PDF

Tailed bacteriophages are the most abundant and diverse viruses in the world, with genome sizes ranging from 10 kbp to over 500 kbp. Yet, due to historical reasons, all this diversity is confined to a single virus order-Caudovirales, composed of just four families: Myoviridae, Siphoviridae, Podoviridae, and the newly created Ackermannviridae family. In recent years, this morphology-based classification scheme has started to crumble under the constant flood of phage sequences, revealing that tailed phages are even more genetically diverse than once thought.

View Article and Find Full Text PDF

While taxonomy is an often-unappreciated branch of science it serves very important roles. Bacteriophage taxonomy has evolved from a mainly morphology-based discipline, characterized by the work of David Bradley and Hans-Wolfgang Ackermann, to the holistic approach that is taken today. The Bacterial and Archaeal Viruses Subcommittee of the International Committee on Taxonomy of Viruses (ICTV) takes a comprehensive approach to classifying prokaryote viruses measuring overall DNA and protein identity and phylogeny before making decisions about the taxonomic position of a new virus.

View Article and Find Full Text PDF

In 1994, analyses of clostridial 16S rRNA gene sequences led to the assignment of 18 species to Clostridium cluster XI, separating them from Clostridium sensu stricto (Clostridium cluster I). Subsequently, most cluster XI species have been assigned to the family Peptostreptococcaceae with some species being reassigned to new genera. However, several misclassified Clostridium species remained, creating a taxonomic conundrum and confusion regarding their status.

View Article and Find Full Text PDF

The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.

View Article and Find Full Text PDF

The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.

View Article and Find Full Text PDF

The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.

View Article and Find Full Text PDF

Rapid increases in DNA sequencing capabilities have led to a vast increase in the data generated from prokaryotic genomic studies, which has been a boon to scientists studying micro-organism evolution and to those who wish to understand the biological underpinnings of microbial systems. The NCBI Protein Clusters Database (ProtClustDB) has been created to efficiently maintain and keep the deluge of data up to date. ProtClustDB contains both curated and uncurated clusters of proteins grouped by sequence similarity.

View Article and Find Full Text PDF