k-mismatch shortest unique substring (SUS) queries have been proposed and studied very recently because of their useful applications in computational biology. A k-mismatch SUS query over a given position of a string asks for a shortest substring that covers that position and has no duplicate (within a Hamming distance of k) elsewhere in the string. The challenge is to find the SUS for every position of a massively long string in a manner that is both time- and space-efficient. All known results have focused on improving the time and space efficiency of SUS computation in the sequential CPU model. In this work, we propose the first parallel approach for k-mismatch SUS queries, leveraging the massively multi-threaded architecture of the graphics processing unit (GPU). An experimental study performed on a mid-range GPU using real-world biological data shows that our proposal is consistently faster than the fastest CPU solution, by a factor of at least 6 for exact SUS queries (k = 0) and at least 23 for approximate SUS queries over DNA sequences, while maintaining nearly the same peak memory usage as the most memory-efficient sequential CPU proposal. Our work gives practitioners a faster tool for SUS finding on massively long strings, and indeed provides the first practical tool for approximate SUS computation, because the state-of-the-art sequential CPU method for approximate SUS queries takes quadratic time in every case and therefore does not scale well even to modestly long strings.
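To make the query definition concrete, the following is a minimal brute-force sketch in Python. It is not the GPU method proposed in the paper, nor any of the sequential CPU methods it compares against; it only restates the definition as code and is far too slow for long strings. The function names, the 0-based position convention, and the choice to return the leftmost candidate among equally short ones are assumptions made for illustration.

```python
def hamming_within(a, b, k):
    """Return True if equal-length strings a and b differ in at most k positions."""
    mismatches = 0
    for x, y in zip(a, b):
        if x != y:
            mismatches += 1
            if mismatches > k:
                return False
    return True


def is_k_unique(s, start, length, k):
    """Return True if s[start:start+length] has no k-mismatch duplicate at another start."""
    target = s[start:start + length]
    for other in range(len(s) - length + 1):
        if other != start and hamming_within(target, s[other:other + length], k):
            return False
    return True


def k_mismatch_sus(s, pos, k=0):
    """Brute-force k-mismatch SUS covering 0-based position pos.

    Returns (start, length) of a shortest substring that covers pos and has no
    duplicate within Hamming distance k elsewhere in s. Ties are broken by
    taking the leftmost start (an illustrative assumption).
    """
    n = len(s)
    if not 0 <= pos < n:
        raise IndexError("pos is outside the string")
    for length in range(1, n + 1):
        first = max(0, pos - length + 1)   # leftmost start that still covers pos
        last = min(pos, n - length)        # rightmost start that fits in s
        for start in range(first, last + 1):
            if is_k_unique(s, start, length, k):
                return start, length
    # The whole string is always unique, so the loop above always returns.
    raise AssertionError("unreachable")


if __name__ == "__main__":
    text = "banana"
    for k in (0, 1):
        start, length = k_mismatch_sus(text, 3, k)
        print(k, text[start:start + length])
```

Because the whole string has no other substring of the same length, an answer always exists. Allowing mismatches can only make the answer longer, which is one way to see why approximate queries are the harder case.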

Source
http://dx.doi.org/10.1109/TCBB.2019.2935061
