CROMqs: An infinitesimal successive refinement lossy compressor for the quality scores.

J Bioinform Comput Biol

Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 1308 W Main Street, Urbana, IL 61801, USA.

Published: December 2020

The amount of sequencing data is growing at a fast pace due to rapid advances in sequencing technologies. Quality scores, which indicate the reliability of each called nucleotide, account for a significant portion of the sequencing data. In addition, quality scores are more challenging to compress than nucleotides, and they are often noisy. Hence, a natural way to further decrease the size of the sequencing data is to apply lossy compression to the quality scores. Lossy compression may result in a loss of precision; however, it has been shown that, when operating at some specific rates, lossy compression can achieve performance on variant calling similar to that achieved with the losslessly compressed data (i.e. the original data). We propose Coding with Random Orthogonal Matrices for quality scores (CROMqs), the first lossy compressor designed for quality scores that has the "infinitesimal successive refinability" property. With this property, the encoder needs to compress the data only once, at a high rate, while the decoder can decompress it iteratively, reconstructing the set of quality scores at each step with lower distortion than at the previous one. This characteristic is particularly useful in sequencing data compression, since the encoder does not generally know the most appropriate compression rate, e.g. the lowest rate that does not degrade variant calling accuracy. CROMqs avoids the need to compress the data at multiple rates, which saves time. In addition to this property, we show that CROMqs obtains rate-distortion performance comparable to that of state-of-the-art lossy compressors. Moreover, we show that it achieves performance on variant calling comparable to that of the losslessly compressed data while reducing the size by more than 50%.
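The successive-refinement idea described above can be illustrated with a small toy sketch. This is not the CROMqs algorithm, whose details the abstract does not give; it is a matching-pursuit-style stand-in that assumes the encoder and decoder share a seed for generating random orthogonal matrices. The function names (`random_orthogonal`, `encode`, `decode`, `mse`), the seed, and the example quality values are all hypothetical. Each step stores one index and one unquantized coefficient, so no realistic bit-rate accounting is attempted.

```python
import math
import random


def random_orthogonal(n, rng):
    """Return n orthonormal rows, built by Gram-Schmidt on Gaussian vectors."""
    rows = []
    while len(rows) < n:
        v = [rng.gauss(0.0, 1.0) for _ in range(n)]
        for u in rows:
            dot = sum(a * b for a, b in zip(v, u))
            v = [a - dot * b for a, b in zip(v, u)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-9:  # discard a numerically degenerate draw
            rows.append([a / norm for a in v])
    return rows


def encode(x, steps, seed):
    """Encode once: each step records the best row index and its coefficient."""
    rng = random.Random(seed)
    residual = list(x)
    stream = []
    for _ in range(steps):
        q = random_orthogonal(len(x), rng)
        coeffs = [sum(a * b for a, b in zip(row, residual)) for row in q]
        m = max(range(len(coeffs)), key=lambda i: abs(coeffs[i]))
        c = coeffs[m]
        # Removing the chosen component shrinks the squared residual by c**2.
        residual = [r - c * b for r, b in zip(residual, q[m])]
        stream.append((m, c))
    return stream


def decode(stream, n, seed, steps):
    """Decode only the first `steps` entries; more entries give less distortion."""
    rng = random.Random(seed)  # same seed, so the same matrices are regenerated
    estimate = [0.0] * n
    for m, c in stream[:steps]:
        q = random_orthogonal(n, rng)
        estimate = [e + c * b for e, b in zip(estimate, q[m])]
    return estimate


def mse(x, y):
    """Mean squared error between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


if __name__ == "__main__":
    scores = [38.0, 40.0, 37.0, 12.0, 30.0, 35.0, 2.0, 39.0]  # made-up Phred-like values
    stream = encode(scores, steps=40, seed=7)  # compressed once, at the highest rate
    for k in (0, 10, 20, 40):  # the decoder stops wherever it likes
        print(k, mse(scores, decode(stream, len(scores), 7, k)))
```

In this sketch the stream is produced once, and the decoder chooses how many entries to consume. Because each step removes the largest component of the residual in an orthonormal basis, the distortion cannot increase as more of the stream is read.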

Source
http://dx.doi.org/10.1142/S0219720020500316
