Publications by authors named "Zimin A"

Different substrates pose varied biomechanical challenges that select specific morphologies, such as long limbs for faster running and short limbs for balanced posture while climbing narrow substrates. We tested how gecko locomotion is affected by the microhabitat they occupy and by a key adaptation-adhesive toepads-through analyzing how those are related to limb morphology. We collected microhabitat and toepads data for over 90% of limbed gecko species, and limb measurements for 403 species from 83 of the 121 limbed gecko genera, which we then used in phylogenetic comparative analyses.

View Article and Find Full Text PDF

Unlabelled: Evaluating the accuracy of protein-coding sequences in genome annotations is a challenging problem for which there is no broadly applicable solution. In this manuscript we introduce PSAURON (Protein Sequence Assessment Using a Reference ORF Network), a novel software tool developed to assess the quality of protein-coding gene annotations. Utilizing a machine learning model trained on a diverse dataset from over 1000 plant and animal genomes, PSAURON assigns a score to coding DNA or protein sequence that reflects the likelihood that the sequence is a genuine protein coding region.

View Article and Find Full Text PDF

Whitebark pine (WBP, Pinus albicaulis) is a white pine of subalpine regions in the Western contiguous United States and Canada. WBP has become critically threatened throughout a significant part of its natural range due to mortality from the introduced fungal pathogen white pine blister rust (WPBR, Cronartium ribicola) and additional threats from mountain pine beetle (Dendroctonus ponderosae), wildfire, and maladaptation due to changing climate. Vast acreages of WBP have suffered nearly complete mortality.

View Article and Find Full Text PDF

Genome editing tools based on SpCas9 and FnCpf1 have facilitated strain improvements for natural product production and novel drug discovery in . However, due to high toxicity, their editing requires high DNA transformation efficiency, which is unavailable in most streptomycetes. The transformation efficiency of an all-in-one editing tool based on miniature Cas nuclease AsCas12f1 was significantly higher than those of SpCas9 and FnCpf1 in tested streptomycetes, which is due to its small size and weak DNA cleavage activity.

View Article and Find Full Text PDF
Article Synopsis
  • Whitebark pine (WBP) is under threat from disease like white pine blister rust, pests, wildfires, and climate change, leading to severe mortality across its range in the Western US and Canada.
  • Genomic technologies have been utilized to effectively identify disease-resistant and climate-adapted seed sources for restoring WBP, including advanced sequencing techniques that produced a detailed genome assembly.
  • The study identified a significant number of candidate genes for disease resistance, particularly focusing on nucleotide-binding leucine-rich-repeat receptors (NLRs), enhancing the ability to understand and improve WBP’s resilience compared to earlier methods.
View Article and Find Full Text PDF
Article Synopsis
  • * Only a small fraction of insect species have comprehensive genome references, and difficulties arise in sequencing rare or poorly understood symbionts.
  • * Long-read sequencing technology, especially Oxford Nanopore's adaptive sampling, is effective for capturing genomes of both hosts and their symbionts, but it struggles with symbionts that are present in low amounts.
View Article and Find Full Text PDF

Diseases due to infections by pathogenic Escherichia coli strains are on the rise and with the growing antimicrobial resistance among bacterial pathogens, including this group. Thus, alternative therapeutic options are actively investigated. Among these alternatives is phage therapy.

View Article and Find Full Text PDF

Streptomycetes have a strong ability to produce a vast array of bioactive natural products (NPs) widely used in agriculture and veterinary/human medicine. The recently developed CRISPR/Cas9-based genome editing tools have greatly facilitated strain improvement for target NP overproduction as well as novel NP discovery in . However, CRISPR/Cas9 shows high toxicity to the host, limiting its application in many strains with a low DNA transformation efficiency.

View Article and Find Full Text PDF

Stalk-eyed flies in the genus Teleopsis carry selfish genetic elements that induce sex ratio (SR) meiotic drive and impact the fitness of male and female carriers. Here, we assemble and describe a chromosome-level genome assembly of the stalk-eyed fly, Teleopsis dalmanni, to elucidate patterns of divergence associated with SR. The genome contains tens of thousands of transposable element (TE) insertions and hundreds of transcriptionally and insertionally active TE families.

View Article and Find Full Text PDF

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels.

View Article and Find Full Text PDF

Advances in long-read sequencing technologies have dramatically improved the contiguity and completeness of genome assemblies. Using the latest nanopore-based sequencers, we can generate enough data for the assembly of a human genome from a single flow cell. With the long-read data from these sequences, we can now routinely produce de novo genome assemblies in which half or more of a genome is contained in megabase-scale contigs.

View Article and Find Full Text PDF
Article Synopsis
  • The study presents a detailed genome assembly for the orb-weaving spider Uloborus diversus, filling a gap in the genetic research of orb-weaving families that has existed for over 200 million years.
  • This research provides evidence of an ancient genome duplication in arachnids and highlights complete spidroin gene sequences, which are essential for spider silk structure.
  • The findings also identify the sex chromosomes and potential sex-determining genes, making this genome a key resource for studying the evolution of orb-weaving and related genetic traits in spiders.
View Article and Find Full Text PDF

We used long-read DNA sequencing to assemble the genome of a Southern Han Chinese male. We organized the sequence into chromosomes and filled in gaps using the recently completed T2T-CHM13 genome as a guide, yielding a gap-free genome, Han1, containing 3,099,707,698 bases. Using the T2T-CHM13 annotation as a reference, we mapped all genes onto the Han1 genome and identified additional gene copies, generating a total of 60,708 putative genes, of which 20,003 are protein-coding.

View Article and Find Full Text PDF

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome.

View Article and Find Full Text PDF

Unlabelled: DNA glycosylases are DNA repair enzymes capable of removing damaged nitrogenous bases, including those formed as a result of UV irradiation with sunlight (approximately 300-400 нм). DNA glycosylases are common not only among bacteria, archaea, and eukaryotes, but some groups of viruses can also encode them. The best-known viral glycosylase is endonuclease V (DenV, Pdg-T4) of Escherichia virus T4, the main substrate of which is cyclobutane pyrimidine dimers.

View Article and Find Full Text PDF

The gastropod mollusk is an important model for cellular and molecular neurobiological studies, particularly for investigations of molecular mechanisms of learning and memory. We developed an optimized assembly pipeline to generate an improved nervous system transcriptome. This improved transcriptome enabled us to explore the evolution of cognitive capacity at the molecular level.

View Article and Find Full Text PDF
Article Synopsis
  • The genus Quercus began diversifying about 55 million years ago, resulting in around 450 species, including the California oak Quercus lobata, which has a high-quality genome assembly that showcases its evolutionary advantages.
  • Analysis of the oak's genome revealed a large effective population size despite a historical decline, with extensive gene duplications contributing to its genetic and phenotypic diversity.
  • Unique patterns of DNA methylation connected to transposable elements indicate a presence of heterochromatin similar to grasses, supporting the idea that these genetic features enhance adaptability to environmental changes.
View Article and Find Full Text PDF

Rapamycin is an important macrocyclic antibiotic produced by Streptomyces rapamycinicus. In the rapamycin biosynthetic gene cluster (BGC), there are up to five regulatory genes, which have been shown to play important roles in the regulation of rapamycin biosynthesis. Here, we demonstrated that the rapamycin BGC-situated LAL family regulator RapH co-ordinately regulated the biosynthesis of both rapamycin and elaiophylin.

View Article and Find Full Text PDF

Third-generation sequencing technologies can generate very long reads with relatively high error rates. The lengths of the reads, which sometimes exceed one million bases, make them invaluable for resolving complex repeats that cannot be assembled using shorter reads. Many high-quality genome assemblies have already been produced, curated, and annotated using the previous generation of sequencing data, and full re-assembly of these genomes with long reads is not always practical or cost-effective.

View Article and Find Full Text PDF

Sequencing, assembly, and annotation of the 26.5 Gbp hexaploid genome of coast redwood (Sequoia sempervirens) was completed leading toward discovery of genes related to climate adaptation and investigation of the origin of the hexaploid genome. Deep-coverage short-read Illumina sequencing data from haploid tissue from a single seed were combined with long-read Oxford Nanopore Technologies sequencing data from diploid needle tissue to create an initial assembly, which was then scaffolded using proximity ligation data to produce a highly contiguous final assembly, SESE 2.

View Article and Find Full Text PDF

Unlabelled: Kraken and KrakenUniq are widely-used tools for classifying metagenomics sequences. A key requirement for these systems is a database containing all from all genomes that the users want to be able to detect, where = 31 by default. This database can be very large, easily exceeding 100 gigabytes (GB) and sometimes 400 GB.

View Article and Find Full Text PDF

Until 2019, the human genome was available in only one fully annotated version, GRCh38, which was the result of 18 years of continuous improvement and revision. Despite dramatic improvements in sequencing technology, no other genome was available as an annotated reference until 2019, when the genome of an Ashkenazi individual, Ash1, was released. In this study, we describe the assembly and annotation of a second individual genome, from a Puerto Rican individual whose DNA was collected as part of the Human Pangenome project.

View Article and Find Full Text PDF

Objective: To perform linguistic and cultural adaptation of the Fugl-Meyer Assessment of Physical Performance.

Materials And Methods: The study for assessment of psychometric properties included 53 post-stroke patients with neurological deficit presented by hemiparesis of different severity. The patients were assessed twice: at admission and after 14 days of treatment and rehabilitation procedures.

View Article and Find Full Text PDF

The American lobster, , is integral to marine ecosystems and supports an important commercial fishery. This iconic species also serves as a valuable model for deciphering neural networks controlling rhythmic motor patterns and olfaction. Here, we report a high-quality draft assembly of the genome with 25,284 predicted gene models.

View Article and Find Full Text PDF