FaStore: a space-saving solution for raw sequencing data.

Bioinformatics

Institute of Informatics, Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Gliwice, Poland.

Published: August 2018

Motivation: The affordability of DNA sequencing has led to the generation of unprecedented volumes of raw sequencing data. These data must be stored, processed and transmitted, which poses significant challenges. To facilitate this effort, we introduce FaStore, a specialized compressor for FASTQ files. FaStore does not use any reference sequences for compression and permits the user to choose from several lossy modes to improve the overall compression ratio, depending on the specific needs.

Results: FaStore in the lossless mode achieves a significant improvement in compression ratio with respect to previously proposed algorithms. We perform an analysis on the effect that the different lossy modes have on variant calling, the most widely used application for clinical decision making, especially important in the era of precision medicine. We show that lossy compression can offer significant compression gains, while preserving the essential genomic information and without affecting the variant calling performance.

Availability And Implementation: FaStore can be downloaded from https://github.com/refresh-bio/FaStore.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty205DOI Listing

Publication Analysis

Top Keywords

raw sequencing
8
sequencing data
8
lossy modes
8
compression ratio
8
variant calling
8
fastore
5
compression
5
fastore space-saving
4
space-saving solution
4
solution raw
4

Similar Publications

India harbours a substantial population of 9.43 million dogs, showcasing diverse phenotypes and utility. Initiatives focusing on awareness, conservation and informed breeding can greatly enhance the recognition and welfare of the unique Indian canine heritage.

View Article and Find Full Text PDF

Nexus: A versatile console for advanced low-field MRI.

Magn Reson Med

January 2025

Department 8.1 - Biomedical Magnetic Resonance, Physikalisch-Technische Bundesanstalt (PTB), Braunschweig and Berlin, Germany.

Purpose: To develop a low-cost, high-performance, versatile, open-source console for low-field MRI applications that can integrate a multitude of different auxiliary sensors.

Methods: A new MR console was realized with four transmission and eight reception channels. The interface cards for signal transmission and reception are installed in PCI Express slots, allowing console integration in a commercial PC rack.

View Article and Find Full Text PDF

Evaluation of nationwide analysis surveillance for methicillin-resistant within Genomic Medicine Sweden.

Microb Genom

January 2025

Department of Laboratory Medicine, Clinical Microbiology, Faculty of Medicine and Health, rebro University, rebro, Sweden.

National epidemiological investigations of microbial infections greatly benefit from the increased information gained by whole-genome sequencing (WGS) in combination with standardized approaches for data sharing and analysis. To evaluate the quality and accuracy of WGS data generated by different laboratories but analysed by joint pipelines to reach a national surveillance approach. A national methicillin-resistant (MRSA) collection of 20 strains was distributed to nine participating laboratories that performed in-house procedures for WGS.

View Article and Find Full Text PDF

Rapid technological advancements have made it possible to generate single-cell data at a large scale. Several laboratories around the world can now generate single-cell transcriptomic data from different tissues. Unsupervised clustering, followed by annotation of the cell type of the identified clusters, is a crucial step in single-cell analyses.

View Article and Find Full Text PDF

Active Ingredients and Potential Mechanism of Additive Sishen Decoction in Treating Rheumatoid Arthritis with Network Pharmacology and Molecular Dynamics Simulation and Experimental Verification.

Drug Des Devel Ther

January 2025

Shanxi Key Laboratory of Innovative Drug for the Treatment of Serious Diseases Basing on the Chronic Inflammation, College of Traditional Chinese Medicine and Food Engineering, Shanxi University of Chinese Medicine, Jinzhong, People's Republic of China.

Background: Rheumatoid arthritis (RA) is a chronic inflammatory autoimmune disease in which macrophages produce cytokines that enhance inflammation and contribute to the destruction of cartilage and bone. Additive Sishen decoction (ASSD) is a widely used traditional Chinese medicine for the treatment of RA; however, its active ingredients and the mechanism of its therapeutic effects remain unclear.

Methods: To predict the ingredients and key targets of ASSD, we constructed "drug-ingredient-target-disease" and protein-protein interaction networks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!