COSAP: Comparative Sequencing Analysis Platform.

BMC Bioinformatics

Department of Computer Engineering, Istanbul Technical University, 34469, Istanbul, Turkey.

Published: March 2024

Background: Recent improvements in sequencing technologies have enabled detailed profiling of genomic features. These technologies mostly rely on short reads, which are merged and compared to a reference genome for variant identification. Because of the size and complexity of the data, these operations must be performed computationally. The need for analysis software has resulted in many programs for the mapping, variant calling and annotation steps. Currently, most programs are either expensive enterprise software with proprietary code, which makes access and verification very difficult, or open-access programs that are mostly command-line based and lack user interfaces and extensive documentation. Moreover, multiple studies have observed a high level of disagreement among popular mapping and variant calling algorithms, which makes relying on a single algorithm risky. Given the growth of sequencing technologies, user-friendly open-source tools that offer comparative analysis are an important need.

Results: Here, we propose the Comparative Sequencing Analysis Platform (COSAP), an open-source platform that provides popular sequencing algorithms for SNV, indel and structural variant calling, copy number variation, microsatellite instability and fusion analysis, together with their annotations. COSAP comes with a fully functional, user-friendly web interface and a backend server, which allows fully independent deployment at both individual and institutional scales. COSAP is developed as a workflow management system and is designed to enhance cooperation among scientists with different backgrounds. It is publicly available at https://cosap.bio and https://github.com/MBaysanLab/cosap/. The source code of the backend and frontend services can be found at https://github.com/MBaysanLab/cosap-webapi/ and https://github.com/MBaysanLab/cosap_frontend/, respectively. All services are also packaged as Docker containers. Thanks to the modular structure, pipelines that combine algorithms can be customized and new algorithms can be added with minimal coding.

Conclusions: COSAP simplifies and speeds up DNA sequencing analyses by providing commonly used algorithms for SNV, indel and structural variant calling, copy number variation, microsatellite instability and fusion analysis, as well as their annotations. COSAP comes with a fully functional, user-friendly web interface and a backend server, which allows fully independent deployment at both individual and institutional scales. Standardized implementations of popular algorithms in a modular platform make it much easier to compare alternative pipelines and assess their impact, which is crucial for establishing the reproducibility of sequencing analyses.
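
As a rough illustration of what comparing alternative pipelines can mean in practice, the sketch below computes pairwise agreement (Jaccard index) between variant call sets and a simple majority-support consensus. It is not COSAP code and does not use the COSAP API: the function names, the pipeline labels and the toy variants are all hypothetical, and a variant is assumed to be identified by chromosome, position, reference allele and alternate allele.

```python
from itertools import combinations

# Assumption: a variant is keyed by (chromosome, 1-based position, ref, alt).
# All names and data below are hypothetical and only illustrate the idea.


def jaccard(a: set, b: set) -> float:
    """Shared calls divided by all calls; defined as 1.0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def pairwise_concordance(calls: dict) -> dict:
    """Jaccard index for every unordered pair of pipelines (names sorted)."""
    return {
        (p, q): jaccard(calls[p], calls[q])
        for p, q in combinations(sorted(calls), 2)
    }


def consensus(calls: dict, min_support: int = 2) -> set:
    """Variants reported by at least `min_support` pipelines."""
    counts = {}
    for variants in calls.values():
        for variant in variants:
            counts[variant] = counts.get(variant, 0) + 1
    return {variant for variant, n in counts.items() if n >= min_support}


# Toy, made-up call sets from three hypothetical pipelines.
calls = {
    "pipeline_a": {("chr1", 100, "A", "G"), ("chr1", 250, "C", "T"), ("chr2", 40, "G", "GA")},
    "pipeline_b": {("chr1", 100, "A", "G"), ("chr2", 40, "G", "GA"), ("chr3", 7, "T", "C")},
    "pipeline_c": {("chr1", 100, "A", "G"), ("chr1", 250, "C", "T")},
}

if __name__ == "__main__":
    for pair, score in pairwise_concordance(calls).items():
        print(pair, round(score, 3))
    print(sorted(consensus(calls, min_support=2)))
```

Exact matching on the four-field key is a simplification; real comparisons usually normalize variant representation first, because the same indel can be written in more than one way.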

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10967217
DOI: http://dx.doi.org/10.1186/s12859-024-05756-z

Similar Publications

Background: Due to its previously illicit nature, Cannabis sativa had not fully reaped the benefits of recent innovations in genomics and plant sciences. However, Canada's legalization of C. sativa and products derived from its flower in 2018 triggered significant new demand for robust genotyping tools to assist breeders in meeting consumer demands.

Comparative evaluation of four exome enrichment solutions in 2024: Agilent, Roche, Vazyme and Nanodigmbio.

BMC Genomics

January 2025

Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Pirogov Russian National Research Medical University, Ostrovityanova str. 1, Moscow, 117997, Russia.

Whole exome sequencing (WES) is essential for identifying genetic variants linked to diseases. This study compares four exome enrichment kits available to date: Agilent SureSelect Human All Exon v8, Roche KAPA HyperExome, Vazyme VAHTS Target Capture Core Exome Panel, and Nanodigmbio NEXome Plus Panel v1. We evaluated target design, coverage statistics, and variant calling accuracy across these four exome capture products.

This study demonstrates the use of GPT-4 and variants, advanced language models readily accessible to many social scientists, in extracting political networks from text. This approach showcases the novel integration of GPT-4's capabilities in entity recognition, relation extraction, entity linking, and sentiment analysis into a single cohesive process. Based on a corpus of 1009 Chilean political news articles, the study validates the graph extraction method using 'legislative agreement', i.

Myocardial Inflammation in Cardiac Transthyretin Amyloidosis: Prevalence and Potential Prognostic Implications.

Circ Heart Fail

January 2025

Department of Cardiology, Angiology and Intensive Care Medicine, Deutsches Herzzentrum der Charité, Berlin, Germany (M.L.M., U.L., B.H., D.M., A.B., I.M., S.S.).

Background: Despite previous histopathologic evidence for its presence, the role of myocardial inflammation in the development and progression of cardiac transthyretin amyloidosis (ATTR-CA) remains insufficiently understood. Thus, this study sought to characterize the prevalence and potential prognostic implications of myocardial inflammation in ATTR-CA.

Methods: A retrospective observational study including patients with ATTR-CA diagnosed by endomyocardial biopsy was conducted.

More than 50% of families with suspected rare monogenic diseases remain unsolved after whole-genome analysis by short-read sequencing (SRS). Long-read sequencing (LRS) could help bridge this diagnostic gap by capturing variants inaccessible to SRS, facilitating long-range mapping and phasing and providing haplotype-resolved methylation profiling. To evaluate LRS's additional diagnostic yield, we sequenced a rare-disease cohort of 98 samples from 41 families, using nanopore sequencing, achieving per sample ∼36× average coverage and 32-kb read N50 from a single flow cell.
