Publications by Andrey Kislyuk

Publications by authors named "Andrey Kislyuk"

Page 1 of 1

IDseq-An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring.

Katrina L Kalantar Tiago Carvalho Charles F A de Bourcy Boris Dimitrov Greg Dingle Andrey Kislyuk

Gigascience

October 2020

Background: Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive processing steps to accurately determine the microbial composition of a sample. Existing mNGS data analysis tools typically require bioinformatics expertise and access to local server-class hardware resources.

View Article and Find Full Text PDF

Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone.

Daniel J Park Gytis Dudas Shirlee Wohl Augustine Goba Shannon L M Whitmer Andrey Kislyuk

Cell

June 2015

The 2013-2015 Ebola virus disease (EVD) epidemic is caused by the Makona variant of Ebola virus (EBOV). Early in the epidemic, genome sequencing provided insights into virus evolution and transmission and offered important information for outbreak response. Here, we analyze sequences from 232 patients sampled over 7 months in Sierra Leone, along with 86 previously released genomes from earlier in the epidemic.

View Article and Find Full Text PDF

Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

Eric E Schadt Onureena Banerjee Gang Fang Zhixing Feng Wing H Wong Andrey Kislyuk

Genome Res

January 2013

Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing.

View Article and Find Full Text PDF

Characterization of DNA methyltransferase specificities using single-molecule, real-time DNA sequencing.

Tyson A Clark Iain A Murray Richard D Morgan Andrey O Kislyuk Kristi E Spittle

Nucleic Acids Res

February 2012

DNA methylation is the most common form of DNA modification in prokaryotic and eukaryotic genomes. We have applied the method of single-molecule, real-time (SMRT®) DNA sequencing that is capable of direct detection of modified bases at single-nucleotide resolution to characterize the specificity of several bacterial DNA methyltransferases (MTases). In addition to previously described SMRT sequencing of N6-methyladenine and 5-methylcytosine, we show that N4-methylcytosine also has a specific kinetic signature and is therefore identifiable using this approach.

View Article and Find Full Text PDF

Sensitive and specific single-molecule sequencing of 5-hydroxymethylcytosine.

Chun-Xiao Song Tyson A Clark Xing-Yu Lu Andrey Kislyuk Qing Dai

Nat Methods

November 2011

We describe strand-specific, base-resolution detection of 5-hydroxymethylcytosine (5-hmC) in genomic DNA with single-molecule sensitivity, combining a bioorthogonal, selective chemical labeling method of 5-hmC with single-molecule, real-time (SMRT) DNA sequencing. The chemical labeling not only allows affinity enrichment of 5-hmC-containing DNA fragments but also enhances the kinetic signal of 5-hmC during SMRT sequencing. We applied the approach to sequence 5-hmC in a genomic DNA sample with high confidence.

View Article and Find Full Text PDF

Neisseria Base: a comparative genomics database for Neisseria meningitidis.

Lee S Katz Jay C Humphrey Andrew B Conley Viswateja Nelakuditi Andrey O Kislyuk

Database (Oxford)

January 2012

Neisseria meningitidis is an important pathogen, causing life-threatening diseases including meningitis, septicemia and in some cases pneumonia. Genomic studies hold great promise for N. meningitidis research, but substantial database resources are needed to deal with the wealth of information that comes with completely sequenced and annotated genomes.

View Article and Find Full Text PDF

Origins of the E. coli strain causing an outbreak of hemolytic-uremic syndrome in Germany.

David A Rasko Dale R Webster Jason W Sahl Ali Bashir Nadia Boisen Andrey O Kislyuk

N Engl J Med

August 2011

Background: A large outbreak of diarrhea and the hemolytic-uremic syndrome caused by an unusual serotype of Shiga-toxin-producing Escherichia coli (O104:H4) began in Germany in May 2011. As of July 22, a large number of cases of diarrhea caused by Shiga-toxin-producing E. coli have been reported--3167 without the hemolytic-uremic syndrome (16 deaths) and 908 with the hemolytic-uremic syndrome (34 deaths)--indicating that this strain is notably more virulent than most of the Shiga-toxin-producing E.

View Article and Find Full Text PDF

Genomic fluidity: an integrative view of gene diversity within microbial populations.

Andrey O Kislyuk Bart Haegeman Nicholas H Bergman Joshua S Weitz

BMC Genomics

January 2011

Background: The dual concepts of pan and core genomes have been widely adopted as means to assess the distribution of gene families within microbial species and genera. The core genome is the set of genes shared by a group of organisms; the pan genome is the set of all genes seen in any of these organisms. A variety of methods have provided drastically different estimates of the sizes of pan and core genomes from sequenced representatives of the same groups of bacteria.

View Article and Find Full Text PDF

A computational genomics pipeline for prokaryotic sequencing projects.

Andrey O Kislyuk Lee S Katz Sonia Agrawal Matthew S Hagen Andrew B Conley

Bioinformatics

August 2010

Motivation: New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data.

View Article and Find Full Text PDF

Unsupervised statistical clustering of environmental shotgun sequences.

Andrey Kislyuk Srijak Bhatnagar Jonathan Dushoff Joshua S Weitz

BMC Bioinformatics

October 2009

Background: The development of effective environmental shotgun sequence binning methods remains an ongoing challenge in algorithmic analysis of metagenomic data. While previous methods have focused primarily on supervised learning involving extrinsic data, a first-principles statistical model combined with a self-training fitting method has not yet been developed.

Results: We derive an unsupervised, maximum-likelihood formalism for clustering short sequences by their taxonomic origin on the basis of their k-mer distributions.

View Article and Find Full Text PDF

Frameshift detection in prokaryotic genomic sequences.

Andrey Kislyuk Alexandre Lomsadze Alla L Lapidus Mark Borodovsky

Int J Bioinform Res Appl

December 2009

We have developed a new method for frameshift detection, a combination of ab initio and alignment-based algorithms, that can serve as a useful tool for sequencing quality control in the next generation sequencing. We evaluated the method's accuracy on test sets of annotated genomic sequences with artificial frameshifts in protein coding regions. These tests have shown that the new method performs comparably to the earlier developed FrameD.

View Article and Find Full Text PDF

Meningococcus genome informatics platform: a system for analyzing multilocus sequence typing data.

Lee S Katz Chris R Bolen Brian H Harcourt Susanna Schmink Xin Wang Andrey Kislyuk

Nucleic Acids Res

July 2009

The Meningococcus Genome Informatics Platform (MGIP) is a suite of computational tools for the analysis of multilocus sequence typing (MLST) data, at http://mgip.biology.gatech.

View Article and Find Full Text PDF

Multiple whole-genome alignments without a reference organism.

Inna Dubchak Alexander Poliakov Andrey Kislyuk Michael Brudno

Genome Res

April 2009

Multiple sequence alignments have become one of the most commonly used resources in genomics research. Most algorithms for multiple alignment of whole genomes rely either on a reference genome, against which all of the other sequences are laid out, or require a one-to-one mapping between the nucleotides of the genomes, preventing the alignment of recently duplicated regions. Both approaches have drawbacks for whole-genome comparisons.

View Article and Find Full Text PDF

Conservation patterns in different functional sequence categories of divergent Drosophila species.

Dmitri Papatsenko Andrey Kislyuk Michael Levine Inna Dubchak

Genomics

October 2006

We have explored the distributions of fully conserved ungapped blocks in genome-wide pair-wise alignments of recently completed species of Drosophila: D. melanogaster, D. yakuba, D.

View Article and Find Full Text PDF

Publications by authors named "Andrey Kislyuk"

IDseq-An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring.

Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone.

Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

Characterization of DNA methyltransferase specificities using single-molecule, real-time DNA sequencing.

Sensitive and specific single-molecule sequencing of 5-hydroxymethylcytosine.

Neisseria Base: a comparative genomics database for Neisseria meningitidis.

Origins of the E. coli strain causing an outbreak of hemolytic-uremic syndrome in Germany.

Genomic fluidity: an integrative view of gene diversity within microbial populations.

A computational genomics pipeline for prokaryotic sequencing projects.

Unsupervised statistical clustering of environmental shotgun sequences.

Frameshift detection in prokaryotic genomic sequences.

Meningococcus genome informatics platform: a system for analyzing multilocus sequence typing data.

Multiple whole-genome alignments without a reference organism.

Conservation patterns in different functional sequence categories of divergent Drosophila species.

A PHP Error was encountered

A PHP Error was encountered