High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data.

Nat Comput Sci

Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA.

Published: August 2021

Database peptide search algorithms deduce peptides from mass spectrometry (MS) data. There has been substantial effort in improving their computational efficiency to achieve larger and more complex systems biology studies. However, modern serial and high-performance computing (HPC) algorithms exhibit sub-optimal performance mainly due to their ineffective parallel designs (low resource utilization), and high overhead costs. We present an HPC framework, called HiCOPS, for efficient acceleration of the database peptide search algorithms on distributed-memory supercomputers. HiCOPS provides, on average, more than 10-fold improvement in speed, and superior parallel performance over several existing HPC database search software. We also formulate a mathematical model for performance analysis and optimization, and report near-optimal results for several key metrics including strong-scale efficiency, hardware utilization, load-balance, inter-process communication and I/O overheads. The core parallel design, techniques, and optimizations presented in HiCOPS are search-algorithm independent and can be extended to efficiently accelerate the existing and future algorithms and software.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8554525PMC
http://dx.doi.org/10.1038/s43588-021-00113-zDOI Listing

Publication Analysis

Top Keywords

database search
8
mass spectrometry
8
spectrometry data
8
database peptide
8
peptide search
8
search algorithms
8
high performance
4
performance computing
4
computing framework
4
framework tera-scale
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!