Background: Due to the computational complexity of sequence alignment algorithms, various accelerated solutions have been proposed to speed up this analysis. NVBIO is the only available GPU library that accelerates sequence alignment of high-throughput NGS data, but it has limited performance. In this article we present GASAL2, a GPU library for aligning DNA and RNA sequences that outperforms existing CPU and GPU libraries.
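To make the computational cost mentioned above concrete, the following is a minimal CPU sketch of a local (Smith-Waterman style) alignment score. It is not GASAL2 code: the function name, the scoring values and the linear gap penalty are illustrative assumptions, chosen only to show that every query base is compared against every target base, so the work grows with the product of the two sequence lengths.

```python
def local_alignment_score(query, target, match=1, mismatch=-4, gap=-6):
    """Best local alignment score between two sequences.

    Illustrative only: scoring values and the linear gap penalty are
    assumptions, not parameters taken from GASAL2.
    """
    # Only the previous row of the dynamic-programming matrix is kept.
    prev = [0] * (len(target) + 1)
    best = 0
    for q in query:
        curr = [0]  # column 0 is always 0 for local alignment
        for j, t in enumerate(target, start=1):
            diag = prev[j - 1] + (match if q == t else mismatch)
            up = prev[j] + gap        # gap in the target
            left = curr[j - 1] + gap  # gap in the query
            cell = max(0, diag, up, left)  # 0 restarts the local alignment
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


if __name__ == "__main__":
    print(local_alignment_score("TTACGTTT", "GGACGTGG"))
```

The nested loops fill one matrix cell per (query base, target base) pair, which is why aligning millions of read pairs is a natural candidate for GPU acceleration.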

Results: The GASAL2 library provides specialized, accelerated kernels for local, global and all types of semi-global alignment. Pairwise sequence alignment can be performed with and without traceback. GASAL2 outperforms the fastest CPU-optimized SIMD implementations such as SeqAn and Parasail, as well as NVIDIA's own GPU-based library, NVBIO. GASAL2 is unique in performing sequence packing on the GPU, which is up to 750x faster than NVBIO. Overall, on a GeForce GTX 1080 Ti GPU, GASAL2 is up to 21x faster than Parasail running on a dual-socket hyper-threaded Intel Xeon system with 28 cores, and up to 13x faster than NVBIO, for query lengths of up to 300 bases and 100 bases, respectively. GASAL2 alignment functions are asynchronous (non-blocking) and allow full overlap of CPU and GPU execution. The paper shows how to use GASAL2 to accelerate BWA-MEM, speeding up the local alignment by 20x, which gives an overall application speedup of 1.3x vs. a CPU with up to 12 threads.
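The abstract does not describe how sequences are packed, so the sketch below only illustrates the general idea of sequence packing: storing several bases in one machine word instead of one byte per base. The 4-bit codes, the 32-bit word size and the function names are assumptions made for this example, and the code runs on the CPU rather than as a GPU kernel.

```python
# Hypothetical 4-bit codes; the real encoding is not given in the abstract.
BASE_CODES = {"A": 0, "C": 1, "G": 2, "T": 3, "N": 4}
BITS_PER_BASE = 4
BASES_PER_WORD = 32 // BITS_PER_BASE  # 8 bases per 32-bit word (assumed)


def pack_sequence(seq):
    """Pack a base string into a list of 32-bit integers."""
    words = []
    for start in range(0, len(seq), BASES_PER_WORD):
        word = 0
        for i, base in enumerate(seq[start:start + BASES_PER_WORD]):
            # Base i of the chunk occupies bits [4*i, 4*i + 3] of the word.
            word |= BASE_CODES[base] << (BITS_PER_BASE * i)
        words.append(word)
    return words


def unpack_sequence(words, length):
    """Recover the first `length` bases from packed words."""
    decode = {code: base for base, code in BASE_CODES.items()}
    bases = []
    for idx in range(length):
        word = words[idx // BASES_PER_WORD]
        shift = BITS_PER_BASE * (idx % BASES_PER_WORD)
        bases.append(decode[(word >> shift) & 0xF])
    return "".join(bases)


if __name__ == "__main__":
    packed = pack_sequence("ACGTNACGTA")
    print([hex(w) for w in packed])
    print(unpack_sequence(packed, 10))
```

Each base is independent of the others during packing, which suggests why this step parallelizes well; on a GPU one thread could plausibly produce one packed word.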

Conclusions: The library provides high-performance APIs for local, global and semi-global alignment that can be easily integrated into various bioinformatics tools.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6815017
DOI: http://dx.doi.org/10.1186/s12859-019-3086-9

Publication Analysis

Top Keywords (keyword: count)

sequence alignment: 16
gasal2: 8
gasal2 gpu: 8
alignment: 8
high-throughput ngs: 8
ngs data: 8
gpu library: 8
cpu gpu: 8
local global: 8
semi-global alignment: 8

Similar Publications

The physiological, functional, and structural properties of proteins and their pathogenic variants can be summarized using many tools. The information relating to a single protein is often spread among different sources requiring different programs for access. It is not always easy to select, simultaneously visualize, and compare specific properties of different proteins.

Introduction Dermatophytes are the most common cause of superficial fungal infection. They are mostly diagnosed using phenotypic methods, but recently the molecular methods seem to be gaining ground. The objective of the present study was to compare the phenotypic and genotypic methods of identification of dermatophytes and understand the feasibility of using molecular methods for routine diagnosis of dermatophytosis.

Background: In poultry, despite intense vaccination programs for the prevention of Newcastle disease (ND), ND infection still occurs, causing high mortality in most vaccinated flocks.

Aim: This study aimed to determine whether the genetic material of the ND virus has changed and has become incompatible with the vaccines used in Iraq.

Methods: Real-time PCR was used to analyze genetic variation in the fusion (F) and haemagglutinin-neuraminidase (HN) genes, as well as mRNA expression changes in inflammatory biomarkers, including C-reactive protein (CRP), interleukin-6 (IL-6), interleukin-1 beta (IL-1β), and gamma interferon (IFN-γ).

Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.

BMC Genomics

December 2024

School of Computer Science and Technology, Qingdao University, Ningxia Road, Qingdao, Shandong Province, 266071, China.

Background: Discontinuous transcription allows coronaviruses to efficiently replicate and transmit within host cells, enhancing their adaptability and survival. Assembling viral transcripts is crucial for virology research and the development of antiviral strategies. However, traditional transcript assembly methods primarily designed for variable alternative splicing events in eukaryotes are not suitable for the viral transcript assembly problem.

Background: Enhanced biological phosphorus removal (EBPR) systems utilize phosphorus-accumulating organisms (PAOs) to remove phosphorus from wastewater since excessive phosphorus in water bodies can lead to eutrophication. This study aimed to characterize a newly isolated PAO strain for its potential application in EBPR systems and to screen for additional biotechnological potential. Here, sequencing allowed for genomic analysis, identifying the genes and molecules involved, and exploring other potentials.
