Background: Our previously published CUDA-only application PaSWAS, which performs Smith-Waterman (SW) alignment of any type of sequence on NVIDIA GPUs, is platform-specific and has therefore been adopted less widely than it could be. OpenCL is supported more widely and can be used on a variety of hardware platforms. Moreover, there is a need to promote the adoption of parallel computing in bioinformatics by making it simpler to use and extend, through more and better application of high-level languages commonly used in bioinformatics, such as Python.

Results: The novel application pyPaSWAS presents the parallel SW sequence alignment code fully packaged in Python. It is a generic SW implementation that runs on several hardware platforms with multi-core systems and/or GPUs and provides accurate sequence alignments that can also be inspected for alignment details. Additionally, pyPaSWAS supports the affine gap penalty. Python libraries are used for automated system configuration, I/O and logging. In this way, the Python environment will stimulate further extension and use of pyPaSWAS.
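
To make the affine gap penalty concrete, here is a minimal, sequential pure-Python sketch of SW local alignment scoring with affine gaps (the standard Gotoh recurrences). It is not the pyPaSWAS code, which runs in parallel compute kernels; the function name `sw_affine_score` and the scoring values (match 5, mismatch -3, gap open -5, gap extend -2) are assumptions chosen for illustration only.

```python
def sw_affine_score(a, b, match=5, mismatch=-3, gap_open=-5, gap_extend=-2):
    """Return the best local alignment score of a and b with affine gaps.

    A gap of length k costs gap_open + (k - 1) * gap_extend.
    All scoring values are illustrative assumptions, not pyPaSWAS defaults.
    """
    neg_inf = float("-inf")
    cols = len(b) + 1
    h_prev = [0] * cols        # H: best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * cols  # F: best score ending in a vertical gap
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf            # E: best score ending in a horizontal gap
        for j in range(1, cols):
            # Either open a new gap from H or extend the existing one.
            e = max(h_cur[j - 1] + gap_open, e + gap_extend)
            f_cur[j] = max(h_prev[j] + gap_open, f_prev[j] + gap_extend)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # The 0 floor is what makes the alignment local.
            h_cur[j] = max(0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


if __name__ == "__main__":
    print(sw_affine_score("ACGTACGT", "ACGTTACGT"))
```

Under these assumed scores, opening a gap is more expensive than extending one, so a single long gap is preferred over several short ones. The sketch keeps only two rows of the matrix and returns a score; recovering the alignment itself would additionally require storing traceback information.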

Conclusions: pyPaSWAS presents an easy Python-based environment for accurate and retrievable parallel SW sequence alignments on GPUs and multi-core systems. The strategy of integrating Python with high-performance parallel compute languages to create a developer- and user-friendly environment should be considered for other computationally intensive bioinformatics algorithms.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749749
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0190279

Similar Publications

Background: The evolution and development of flowers are biologically essential and of broad interest. Maize and sorghum have similar morphologies and phylogeny while harboring different inflorescence architecture. The difference in flower architecture between these two species is likely due to spatiotemporal gene expression regulation, and they are a good model for researching the evolution of flower development.

Genome-wide development of simple sequence repeat (SSR) markers at 2-Mb intervals in lotus (Nelumbo Adans.).

BMC Genomics

January 2025

Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, No. 3888 Chenhua Road, Songjiang District, Shanghai, 201602, China.

Background: Despite the rapid advancement of high-throughput sequencing, simple sequence repeats (SSRs) remain indispensable molecular markers for various applied and research tasks owing to their cost-effectiveness and ease of use. However, existing SSR markers cannot meet the growing demand for research on lotus (Nelumbo Adans.) given their scarcity and weak connections to the lotus genome.

Genome-wide identification and expression analysis of the WRKY gene family in Mikania micrantha.

BMC Genomics

January 2025

College of Biological Science and Food Engineering, Southwest Forestry University, Kunming, Yunnan Province, 650224, China.

Background: WRKY transcription factors (TFs) regulate plant responses to environmental stimuli and development, including flowering. Despite extensive research on different species, their role in the invasive plant Mikania micrantha remains to be explored. The aim of this study was to identify and analyze WRKY genes in M.

Fast exact gap-affine partial order alignment with POASTA.

Bioinformatics

January 2025

Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, United States.

Motivation: Partial order alignment is a widely used method for computing multiple sequence alignments, with applications in genome assembly and pangenomics, among many others. Current algorithms to compute the optimal, gap-affine partial order alignment do not scale well to larger graphs and sequences. While heuristic approaches exist, they do not guarantee optimal alignment and sacrifice alignment accuracy.

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Amsterdam UMC location VUmc, Amsterdam, Noord-holland, Netherlands.

Background: SORL1 encodes the retromer-associated receptor SORLA that functions in endosomal recycling. Rare variants in SORL1 have been associated with Alzheimer's disease (AD) and rare pathogenic variants are estimated to occur in up to 2.75% of early onset AD patients and in 1.
