Motivation: In the past few years, researchers have proposed numerous indexing schemes for searching large datasets of raw sequencing experiments. Most of these proposed indexes are approximate (i.e. with one-sided errors) in order to save space. Recently, researchers have published exact indexes (Mantis, VariMerge and Bifrost) that can serve as colored de Bruijn graph representations in addition to serving as k-mer indexes. This new type of index is promising because it has the potential to support more complex analyses than simple searches. However, to be useful for large and growing repositories of raw sequencing data, such indexes must scale to thousands of experiments and support efficient insertion of new data.

Results: In this paper, we show how to build a scalable and updatable exact raw sequence-search index. Specifically, we extend Mantis with the Bentley-Saxe transformation to support efficient updates; we call the resulting index Dynamic Mantis. We demonstrate Dynamic Mantis's scalability by constructing an index of ≈40K samples from SRA by adding samples one at a time to an initial index of 10K samples. Compared to VariMerge and Bifrost, Dynamic Mantis is more efficient in terms of index-construction time and memory, query time and memory, and index size. In our benchmarks, VariMerge and Bifrost scaled to only 5K and 80 samples, respectively, while Dynamic Mantis scaled to more than 39K samples. Queries were over 24× faster in Mantis than in Bifrost (VariMerge does not immediately support the general search queries we require). Dynamic Mantis indexes were about 2.5× smaller than Bifrost's indexes and about half as big as VariMerge's indexes.
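
To illustrate the general idea behind the Bentley-Saxe transformation mentioned above, the following is a minimal sketch in Python. It is not the Dynamic Mantis implementation: the static index is stood in for by a plain dictionary from k-mers to sets of sample IDs, and the names `build_static`, `merge`, `BentleySaxeIndex` and the toy value `K = 4` are all hypothetical choices made for this example. The sketch keeps a series of immutable indexes whose sizes double from level to level, merges them like carries in a binary counter on insertion, and answers a query by combining the answers from every occupied level.

```python
from typing import Dict, FrozenSet, List, Optional

K = 4  # toy k-mer length (assumption; real indexes use a much larger k)

# Placeholder for a static index: k-mer -> set of sample IDs ("colors").
Index = Dict[str, FrozenSet[int]]


def build_static(sample_id: int, seq: str) -> Index:
    """Build a toy static index for a single sample."""
    kmers = {seq[i:i + K] for i in range(len(seq) - K + 1)}
    return {kmer: frozenset([sample_id]) for kmer in kmers}


def merge(a: Index, b: Index) -> Index:
    """Merge two static indexes into a new one by uniting color sets."""
    out = dict(a)
    for kmer, colors in b.items():
        out[kmer] = out.get(kmer, frozenset()) | colors
    return out


class BentleySaxeIndex:
    """Updatable index made of static indexes of doubling size."""

    def __init__(self) -> None:
        # levels[i] is either None or a static index over 2**i samples.
        self.levels: List[Optional[Index]] = []

    def insert(self, sample_id: int, seq: str) -> None:
        # Start with a size-1 index and carry upward, as in binary addition.
        carry = build_static(sample_id, seq)
        i = 0
        while i < len(self.levels) and self.levels[i] is not None:
            carry = merge(self.levels[i], carry)
            self.levels[i] = None
            i += 1
        if i == len(self.levels):
            self.levels.append(None)
        self.levels[i] = carry

    def query(self, kmer: str) -> FrozenSet[int]:
        # The search is decomposable: unite the answers of all levels.
        result: FrozenSet[int] = frozenset()
        for level in self.levels:
            if level is not None:
                result |= level.get(kmer, frozenset())
        return result
```

With this layout, n inserted samples occupy at most about log2(n) + 1 levels, and each sample takes part in at most that many merges, which is what makes one-at-a-time insertion affordable in this kind of scheme.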

Availability and Implementation: The Dynamic Mantis implementation is available at https://github.com/splatlab/mantis/tree/mergeMSTs.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9191210
DOI: http://dx.doi.org/10.1093/bioinformatics/btac142

