Publications by Dmitry Meleshko | LitMetric

Publications by authors named "Dmitry Meleshko"

Page 1 of 1

Structure and Biosynthetic Gene Cluster of Sulfated Capsular Polysaccharide from the Marine Bacterium sp. KMM 8419.

Maxim S Kokoulin Yulia V Savicheva Nadezhda Y Otstavnykh Valeria V Kurilenko Dmitry A Meleshko

Int J Mol Sci

December 2024

sp. KMM 8419 (=CB1-14) is a Gram-negative bacterium isolated from a food-net mucus sample of marine polychaete collected in the Sea of Japan. Here, we report the structure and biosynthetic gene cluster of the capsular polysaccharide (CPS) from strain KMM 8419.

View Article and Find Full Text PDF

Blackbird: structural variant detection using synthetic and low-coverage long-reads.

Dmitry Meleshko Rui Yang Salil Maharjan David C Danko Anton Korobeynikov

bioRxiv

November 2024

Motivation: Recent benchmarks of structural variant (SV) detection tools revealed that the majority of human genome structural variations (SVs), especially the medium-range (50-10,000 bp) SVs cannot be resolved with short-read sequencing, but long-read SV callers achieve great results on the same datasets. While improvements have been made, high-coverage long-read sequencing is associated with higher costs and input DNA requirements. To decrease the cost one can lower the sequence coverage, but the current long-read SV callers perform poorly with coverage below 10×.

View Article and Find Full Text PDF

cloudrnaSPAdes: isoform assembly using bulk barcoded RNA sequencing data.

Dmitry Meleshko Andrey D Prjbelski Mikhail Raiko Alexandru I Tomescu Hagen Tilgner

Bioinformatics

February 2024

Motivation: Recent advancements in long-read RNA sequencing have enabled the examination of full-length isoforms, previously uncaptured by short-read sequencing methods. An alternative powerful method for studying isoforms is through the use of barcoded short-read RNA reads, for which a barcode indicates whether two short-reads arise from the same molecule or not. Such techniques included the 10x Genomics linked-read based SParse Isoform Sequencing (SPIso-seq), as well as Loop-Seq, or Tell-Seq.

View Article and Find Full Text PDF

Ariadne: synthetic long read deconvolution using assembly graphs.

Lauren Mak Dmitry Meleshko David C Danko Waris N Barakzai Salil Maharjan

Genome Biol

August 2023

Synthetic long read sequencing techniques such as UST's TELL-Seq and Loop Genomics' LoopSeq combine 3[Formula: see text] barcoding with standard short-read sequencing to expand the range of linkage resolution from hundreds to tens of thousands of base-pairs. However, the lack of a 1:1 correspondence between a long fragment and a 3[Formula: see text] unique molecular identifier confounds the assignment of linkage between short reads. We introduce Ariadne, a novel assembly graph-based synthetic long read deconvolution algorithm, that can be used to extract single-species read-clouds from synthetic long read datasets to improve the taxonomic classification and de novo assembly of complex populations, such as metagenomes.

View Article and Find Full Text PDF

Benchmarking State-of-the-Art Approaches for Norovirus Genome Assembly in Metagenome Sample.

Dmitry Meleshko Anton Korobeynikov

Biology (Basel)

July 2023

A recently published article in BMCGenomics by Fuentes-Trillo et al. contains a comparison of assembly approaches of several noroviral samples via different tools and preprocessing strategies. It turned out that the study used outdated versions of tools as well as tools that were not designed for the viral assembly task.

View Article and Find Full Text PDF

cloudrnaSPAdes: Isoform assembly using bulk barcoded RNA sequencing data.

Dmitry Meleshko Andrey D Prjbelski Mikhail Raiko Alexandru I Tomescu Hagen Tilgner

bioRxiv

July 2023

Motivation: Recent advancements in long-read RNA sequencing have enabled the examination of full-length isoforms, previously uncaptured by short-read sequencing methods. An alternative powerful method for studying isoforms is through the use of barcoded short-read RNA reads, for which a barcode indicates whether two short-reads arise from the same molecule or not. Such techniques included the 10x Genomics linked-read based SParse Isoform Sequencing (SPIso-seq), as well as Loop-Seq, or Tell-Seq.

View Article and Find Full Text PDF

CAMP: A modular metagenomics analysis system for integrated multi-step data exploration.

Lauren Mak Braden Tierney Cynthia Ronkowski Rodolfo Brizola Toscan Berk Turhan Dmitry Meleshko

bioRxiv

September 2024

Motivation: Computational analysis of large-scale metagenomics sequencing datasets have proven to be both incredibly valuable for extracting isolate-level taxonomic, and functional insights from complex microbial communities. However, due to an ever-expanding ecosystem of metagenomics-specific methods and file-formats, designing studies which implement seamless and scalable end-to-end workflows, and exploring the massive amounts of output data have become studies unto themselves. One-click bioinformatics pipelines have helped to organize these tools into targeted workflows, but they suffer from general compatibility and maintainability issues.

View Article and Find Full Text PDF

Cue: a deep-learning framework for structural variant discovery and genotyping.

Victoria Popic Chris Rohlicek Fabio Cunial Iman Hajirasouliha Dmitry Meleshko

Nat Methods

April 2023

Article Synopsis

Structural variants (SVs) significantly contribute to genetic diversity and disease, highlighting the need for better detection methods in precision medicine.
Existing methods for detecting SVs are limited because they rely on manual features and rules, which don't scale well to the wide diversity of SVs in genomic data.
The Cue framework uses deep learning to analyze sequencing data by converting alignments into images and employing a convolutional neural network to accurately predict SV types, achieving superior performance compared to current methods.

View Article and Find Full Text PDF

Efficient detection and assembly of non-reference DNA sequences with synthetic long reads.

Dmitry Meleshko Rui Yang Patrick Marks Stephen Williams Iman Hajirasouliha

Nucleic Acids Res

October 2022

Recent pan-genome studies have revealed an abundance of DNA sequences in human genomes that are not present in the reference genome. A lion's share of these non-reference sequences (NRSs) cannot be reliably assembled or placed on the reference genome. Improvements in long-read and synthetic long-read (aka linked-read) technologies have great potential for the characterization of NRSs.

View Article and Find Full Text PDF

Critical Assessment of Metagenome Interpretation: the second round of challenges.

Fernando Meyer Adrian Fritz Zhi-Luo Deng David Koslicki Till Robin Lesker Dmitry Meleshko

Nat Methods

April 2022

Article Synopsis

Evaluating metagenomic software is crucial for enhancing the interpretation of metagenomes, and the CAMI II challenge focused on this by using complex datasets from numerous genomes and plasmids.
The analysis of 5,002 results from 76 software versions showed significant advancements in assembly, especially with long-read data, although challenges remained with related strains and genome recovery.
Findings indicated that while taxon profilers improved, they struggled with viruses and Archaea, highlighting the need for better reproducibility in clinical pathogen detection and guiding researchers in method selection based on efficiency and performance metrics.

View Article and Find Full Text PDF

Petabase-scale sequence alignment catalyses viral discovery.

Robert C Edgar Brie Taylor Victor Lin Tomer Altman Pierre Barbera Dmitry Meleshko

Nature

February 2022

Article Synopsis

Public databases hold vast collections of nucleic acid sequences, exceeding 20 petabases, but efficient searching methods have been lacking.
The authors developed Serratus, a cloud computing system that allows for ultra-high-throughput sequence alignment, enabling them to search over 10 petabases of data and discover more than 10 new RNA viruses.
By characterizing these novel viruses and creating a free database, the study aims to enhance viral discovery and aid in understanding the origins of emerging pathogens, ultimately improving pandemic preparedness.

View Article and Find Full Text PDF

coronaSPAdes: from biosynthetic gene clusters to RNA viral assemblies.

Dmitry Meleshko Iman Hajirasouliha Anton Korobeynikov

Bioinformatics

December 2021

Motivation: The COVID-19 pandemic has ignited a broad scientific interest in viral research in general and coronavirus research in particular. The identification and characterization of viral species in natural reservoirs typically involves de novo assembly. However, existing genome, metagenome and transcriptome assemblers often are not able to assemble many viruses (including coronaviruses) into a single contig.

View Article and Find Full Text PDF

A global metagenomic map of urban microbiomes and antimicrobial resistance.

David Danko Daniela Bezdan Evan E Afshin Sofia Ahsanuddin Chandrima Bhattacharya Dmitry Meleshko

Cell

June 2021

We present a global atlas of 4,728 metagenomic samples from mass-transit systems in 60 cities over 3 years, representing the first systematic, worldwide catalog of the urban microbial ecosystem. This atlas provides an annotated, geospatial profile of microbial strains, functional characteristics, antimicrobial resistance (AMR) markers, and genetic elements, including 10,928 viruses, 1,302 bacteria, 2 archaea, and 838,532 CRISPR arrays not found in reference databases. We identified 4,246 known species of urban microorganisms and a consistent set of 31 species found in 97% of samples that were distinct from human commensal organisms.

View Article and Find Full Text PDF

A comprehensive metagenomics framework to characterize organisms relevant for planetary protection.

David C Danko Maria A Sierra James N Benardini Lisa Guan Jason M Wood Dmitry Meleshko

Microbiome

April 2021

Background: Clean rooms of the Space Assembly Facility (SAF) at the Jet Propulsion Laboratory (JPL) at NASA are the final step of spacecraft cleaning and assembly before launching into space. Clean rooms have stringent methods of air-filtration and cleaning to minimize microbial contamination for exoplanetary research and minimize the risk of human pathogens, but they are not sterile. Clean rooms make a selective environment for microorganisms that tolerate such cleaning methods.

View Article and Find Full Text PDF

A Deep Learning Approach to Diagnostic Classification of Prostate Cancer Using Pathology-Radiology Fusion.

Pegah Khosravi Maria Lysandrou Mahmoud Eljalby Qianzi Li Ehsan Kazemi Dmitry Meleshko

J Magn Reson Imaging

August 2021

Article Synopsis

A biopsy is needed to definitively diagnose prostate cancer, but it's invasive and comes with risks.
Researchers developed an AI-based model called AI-biopsy that uses MRI images to help diagnose prostate cancer early, reducing the need for invasive procedures.
The AI models showed good accuracy in distinguishing between benign and cancerous tumors, as well as assessing their risk levels, indicating a potential for personalized cancer care.

View Article and Find Full Text PDF

Shotgun transcriptome, spatial omics, and isothermal profiling of SARS-CoV-2 infection reveals unique host responses, viral diversification, and drug interactions.

Daniel Butler Christopher Mozsary Cem Meydan Jonathan Foox Joel Rosiene Dmitry Meleshko

Nat Commun

March 2021

In less than nine months, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) killed over a million people, including >25,000 in New York City (NYC) alone. The COVID-19 pandemic caused by SARS-CoV-2 highlights clinical needs to detect infection, track strain evolution, and identify biomarkers of disease course. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs and a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, viral, and microbial profiling.

View Article and Find Full Text PDF

Using SPAdes De Novo Assembler.

Andrey Prjibelski Dmitry Antipov Dmitry Meleshko Alla Lapidus Anton Korobeynikov

Curr Protoc Bioinformatics

June 2020

SPAdes-St. Petersburg genome Assembler-was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore).

View Article and Find Full Text PDF

Shotgun Transcriptome and Isothermal Profiling of SARS-CoV-2 Infection Reveals Unique Host Responses, Viral Diversification, and Drug Interactions.

Daniel J Butler Christopher Mozsary Cem Meydan David Danko Jonathan Foox Dmitry Meleshko

bioRxiv

May 2020

The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has caused thousands of deaths worldwide, including >18,000 in New York City (NYC) alone. The sudden emergence of this pandemic has highlighted a pressing clinical need for rapid, scalable diagnostics that can detect infection, interrogate strain evolution, and identify novel patient biomarkers. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs, plus a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, bacterial, and viral profiling.

View Article and Find Full Text PDF

BiosyntheticSPAdes: reconstructing biosynthetic gene clusters from assembly graphs.

Dmitry Meleshko Hosein Mohimani Vittorio Tracanna Iman Hajirasouliha Marnix H Medema

Genome Res

August 2019

Predicting biosynthetic gene clusters (BGCs) is critically important for discovery of antibiotics and other natural products. While BGC prediction from complete genomes is a well-studied problem, predicting BGCs in fragmented genomic assemblies remains challenging. The existing BGC prediction tools often assume that each BGC is encoded within a single contig in the genome assembly, a condition that is violated for most sequenced microbial genomes where BGCs are often scattered through several contigs, making it difficult to reconstruct them.

View Article and Find Full Text PDF

Minerva: an alignment- and reference-free approach to deconvolve Linked-Reads for metagenomics.

David C Danko Dmitry Meleshko Daniela Bezdan Christopher Mason Iman Hajirasouliha

Genome Res

January 2019

Emerging Linked-Read technologies (aka read cloud or barcoded short-reads) have revived interest in short-read technology as a viable approach to understand large-scale structures in genomes and metagenomes. Linked-Read technologies, such as the 10x Chromium system, use a microfluidic system and a specialized set of 3' barcodes (aka UIDs) to tag short DNA reads sourced from the same long fragment of DNA; subsequently, the tagged reads are sequenced on standard short-read platforms. This approach results in interesting compromises.

View Article and Find Full Text PDF

American Gut: an Open Platform for Citizen Science Microbiome Research.

Daniel McDonald Embriette Hyde Justine W Debelius James T Morton Antonio Gonzalez Dmitry Meleshko

mSystems

May 2018

Although much work has linked the human microbiome to specific phenotypes and lifestyle variables, data from different projects have been challenging to integrate and the extent of microbial and molecular diversity in human stool remains unknown. Using standardized protocols from the Earth Microbiome Project and sample contributions from over 10,000 citizen-scientists, together with an open research network, we compare human microbiome specimens primarily from the United States, United Kingdom, and Australia to one another and to environmental samples. Our results show an unexpected range of beta-diversity in human stool microbiomes compared to environmental samples; demonstrate the utility of procedures for removing the effects of overgrowth during room-temperature shipping for revealing phenotype correlations; uncover new molecules and kinds of molecular communities in the human stool metabolome; and examine emergent associations among the microbiome, metabolome, and the diversity of plants that are consumed (rather than relying on reductive categorical variables such as veganism, which have little or no explanatory power).

View Article and Find Full Text PDF

metaSPAdes: a new versatile metagenomic assembler.

Sergey Nurk Dmitry Meleshko Anton Korobeynikov Pavel A Pevzner

Genome Res

May 2017

While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amplifying the challenge of metagenomic assembly. metaSPAdes addresses various challenges of metagenomic assembly by capitalizing on computational ideas that proved to be useful in assemblies of single cells and highly polymorphic diploid genomes.

View Article and Find Full Text PDF