Publications by Aleksandr Morgulis

Publications by authors named "Aleksandr Morgulis"

Page 1 of 1

Finding Candida auris in public metagenomic repositories.

Jorge E Mario-Vasquez Ujwal R Bagal Elijah Lowe Aleksandr Morgulis John Phan

PLoS One

January 2024

Candida auris is a newly emerged multidrug-resistant fungus capable of causing invasive infections with high mortality. Despite intense efforts to understand how this pathogen rapidly emerged and spread worldwide, its environmental reservoirs are poorly understood. Here, we present a collaborative effort between the U.

View Article and Find Full Text PDF

SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees.

Aleksandr Morgulis Richa Agarwala

Gigascience

April 2020

Background: Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output.

View Article and Find Full Text PDF

Single haplotype assembly of the human genome from a hydatidiform mole.

Karyn Meltz Steinberg Valerie A Schneider Tina A Graves-Lindsay Robert S Fulton Richa Agarwala Aleksandr Morgulis

Genome Res

December 2014

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error.

View Article and Find Full Text PDF

Database indexing for production MegaBLAST searches.

Aleksandr Morgulis George Coulouris Yan Raytselis Thomas L Madden Richa Agarwala

Bioinformatics

August 2008

Motivation: The BLAST software package for sequence comparison speeds up homology search by preprocessing a query sequence into a lookup table. Numerous research studies have suggested that preprocessing the database instead would give better performance. However, production usage of sequence comparison methods that preprocess the database has been limited to programs such as BLAT and SSAHA that are designed to find matches when query and database subsequences are highly similar.

View Article and Find Full Text PDF

A fast and symmetric DUST implementation to mask low-complexity DNA sequences.

Aleksandr Morgulis E Michael Gertz Alejandro A Schäffer Richa Agarwala

J Comput Biol

June 2006

The DUST module has been used within BLAST for many years to mask low-complexity sequences. In this paper, we present a new implementation of the DUST module that uses the same function to assign a complexity score to a sequence, but uses a different rule by which high-scoring sequences are masked. The new rule masks every nucleotide masked by the old rule and occasionally masks more.

View Article and Find Full Text PDF

WindowMasker: window-based masker for sequenced genomes.

Aleksandr Morgulis E Michael Gertz Alejandro A Schäffer Richa Agarwala

Bioinformatics

January 2006

Motivation: Matches to repetitive sequences are usually undesirable in the output of DNA database searches. Repetitive sequences need not be matched to a query, if they can be masked in the database. RepeatMasker/Maskeraid (RM), currently the most widely used software for DNA sequence masking, is slow and requires a library of repetitive template sequences, such as a manually curated RepBase library, that may not exist for newly sequenced genomes.

View Article and Find Full Text PDF

Protein database searches using compositionally adjusted substitution matrices.

Stephen F Altschul John C Wootton E Michael Gertz Richa Agarwala Aleksandr Morgulis

FEBS J

October 2005

Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long-standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal.

View Article and Find Full Text PDF