Motivation: In the past few years, researchers have proposed numerous indexing schemes for searching large datasets of raw sequencing experiments. Most of these proposed indexes are approximate (i.e. with one-sided errors) in order to save space. Recently, researchers have published exact indexes (Mantis, VariMerge and Bifrost) that can serve as colored de Bruijn graph representations in addition to serving as k-mer indexes. This new type of index is promising because it has the potential to support more complex analyses than simple searches. However, in order to be useful as indexes for large and growing repositories of raw sequencing data, they must scale to thousands of experiments and support efficient insertion of new data.
Results: In this paper, we show how to build a scalable and updatable exact raw sequence-search index. Specifically, we extend Mantis using the Bentley-Saxe transformation to support efficient updates; we call the resulting index Dynamic Mantis. We demonstrate Dynamic Mantis's scalability by constructing an index of ≈40K samples from SRA by adding samples one at a time to an initial index of 10K samples. Compared to VariMerge and Bifrost, Dynamic Mantis is more efficient in terms of index-construction time and memory, query time and memory, and index size. In our benchmarks, VariMerge and Bifrost scaled to only 5K and 80 samples, respectively, while Dynamic Mantis scaled to more than 39K samples. Queries were over 24× faster in Mantis than in Bifrost (VariMerge does not immediately support the general search queries we require). Dynamic Mantis indexes were about 2.5× smaller than Bifrost's indexes and about half as big as VariMerge's indexes.
Availability and Implementation: The Dynamic Mantis implementation is available at https://github.com/splatlab/mantis/tree/mergeMSTs.
Supplementary Information: Supplementary data are available at Bioinformatics online.
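The Bentley-Saxe transformation named in the Results can be illustrated with a minimal sketch. It is not the Dynamic Mantis implementation: `StaticIndex` and `BentleySaxeIndex` are hypothetical names, and a `frozenset` stands in for a real static index such as Mantis. The sketch only shows the general idea of turning a build-once structure into an insertable one by keeping a logarithmic number of static levels and merging them like carries in a binary counter.

```python
from typing import Hashable, Iterable, List, Optional


class StaticIndex:
    """Hypothetical stand-in for an immutable index: build once, query, merge."""

    def __init__(self, items: Iterable[Hashable]) -> None:
        self._items = frozenset(items)

    def __contains__(self, item: Hashable) -> bool:
        return item in self._items

    @staticmethod
    def merge(a: "StaticIndex", b: "StaticIndex") -> "StaticIndex":
        # A real static index would rebuild itself from both inputs here.
        return StaticIndex(a._items | b._items)


class BentleySaxeIndex:
    """Insert-only wrapper built from static indexes (logarithmic method)."""

    def __init__(self) -> None:
        # levels[i] is None or a static index built from 2**i insertions.
        self.levels: List[Optional[StaticIndex]] = []

    def insert(self, item: Hashable) -> None:
        carry = StaticIndex([item])
        i = 0
        while True:
            if i == len(self.levels):
                self.levels.append(None)
            if self.levels[i] is None:
                # Free slot: store the carried index and stop.
                self.levels[i] = carry
                return
            # Occupied slot: merge, empty the slot, carry to the next level.
            carry = StaticIndex.merge(self.levels[i], carry)
            self.levels[i] = None
            i += 1

    def query(self, item: Hashable) -> bool:
        # A membership query must probe every occupied level.
        return any(lvl is not None and item in lvl for lvl in self.levels)

    def occupied_levels(self) -> List[int]:
        return [i for i, lvl in enumerate(self.levels) if lvl is not None]
```

After n insertions the occupied levels mirror the binary representation of n, so each item takes part in at most about log2(n) merges and a query probes at most about log2(n) static indexes. How Dynamic Mantis sizes its levels and merges its indexes may differ from this sketch.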
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9191210 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btac142 | DOI Listing |
PNAS Nexus
September 2024
Chan Zuckerberg Biohub San Francisco, San Francisco, CA 94158, USA.
High-throughput dynamic imaging of cells and organelles is essential for understanding complex cellular responses. We report Mantis, a high-throughput 4D microscope that integrates two complementary, gentle, live-cell imaging technologies: remote-refocus label-free microscopy and oblique light-sheet fluorescence microscopy. Additionally, we report shrimPy (Smart High-throughput Robust Imaging and Measurement in Python), an open-source software for high-throughput imaging, deconvolution, and single-cell phenotyping of 4D data.
Animals (Basel)
August 2024
Laboratory of Developmental and Reproductive Biology, DiSVA, Università Politecnica delle Marche, 60131 Ancona, Italy.
J Hazard Mater
October 2024
Research Center for cultural Landscape Protection and Ecological Restoration, China-Portugal Belt and Road Cooperation Laboratory of Cultural Heritage Conservation Science, Gold Mantis School of Architecture, Soochow University, Suzhou 215006, China; Key Laboratory of Plant Nutrition and the Agri-environment in Northwest China, Ministry of Agriculture, College of Natural Resources and Environment, Northwest A&F University, Yangling 712100, China. Electronic address:
Microplastics (MPs) in agricultural plastic film mulching systems change microbial functions and nutrient dynamics in soils. However, how biodegradable MPs affect soil gross nitrogen (N) transformations and crop N uptake remains largely unknown. In this study, we conducted a paired labeling N tracer experiment and microbial N-cycling gene analysis to investigate the dynamics and mechanisms of soil gross N transformation processes in soils amended with conventional (polyethylene, PE) and biodegradable (polybutylene adipate co-terephthalate, PBAT) MPs at concentrations of 0 %, 0.
Beilstein J Nanotechnol
July 2024
CICFIM Facultad de Ciencias Físico Matemáticas, Universidad Autónoma de Nuevo León, San Nicolás de los Garza, Nuevo León, 66455, Mexico.
Janus-type nanoparticles are important because of their ability to combine distinct properties and functionalities in a single particle, making them extremely versatile and valuable in various scientific, technological, and industrial applications. In this work, bimetallic silver-palladium Janus nanoparticles were obtained for the first time using the inert gas condensation technique. In order to achieve this, an original synthesis equipment built by Mantis Ltd.
Oecologia
May 2024
Department of Animal Biology, Universidade Estadual de Campinas (Unicamp), Campinas, SP, 13083-865, Brazil.
Among-individual variation in predator traits is ubiquitous in nature. However, variation among populations in this trait variation has seldom been considered in trophic dynamics. This has left unexplored (a) to what degree among-individual variation in predator traits regulates prey populations and (b) to what degree these effects vary spatially.