Sequence alignment is an essential step in computational genomics. More accurate and efficient sequence pre-alignment methods that run before conducting expensive computation for final verification are still urgently needed. In this article, we propose a more accurate and efficient pre-alignment algorithm for sequence alignment, called DiagAF. Firstly, DiagAF uses a new lower bound of edit distance based on shift hamming masks. The new lower bound makes use of fewer shift hamming masks comparing with state-of-the-art algorithms such as SHD and MAGNET. Moreover, it takes account the information of edit distance path exchanging on shift hamming masks. Secondly, DiagAF can deal with alignments of sequence pairs with not equal length, rather than state-of-the-art methods just for equal length. Thirdly, DiagAF can align sequences with early termination for true alignments. In the experiment, we compared DiagAF with state-of-the-art methods. DiagAF can achieve a much smaller error rate than them, meanwhile use less time than them. We believe that DiagAF algorithm can further improve the performance of state-of-the-art sequence alignment softwares. The source codes of DiagAF can be downloaded from web site https://github.com/BioLab-cz/DiagAF.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2021.3127879DOI Listing

Publication Analysis

Top Keywords

sequence alignment
16
accurate efficient
12
shift hamming
12
hamming masks
12
diagaf
9
efficient pre-alignment
8
lower bound
8
edit distance
8
equal length
8
state-of-the-art methods
8

Similar Publications

Identification and characterization of multiple novel viruses in fecal samples of cormorants.

Front Vet Sci

January 2025

Department of Microbiology, School of Medicine, Jiangsu University, Zhenjiang, Jiangsu, China.

Introduction: Cormorants, as protected wild animals by the State Forestry Administration of China, have a broad distribution across China. Previous studies have shown that they can be infected with multiple viruses in the , , , and families. There is limited knowledge about the other viruses that cormorants may carry and infect.

View Article and Find Full Text PDF

The global dissemination of SARS-CoV-2 led to a worldwide pandemic in March 2020. Even after the official downgrading of the COVID-19 pandemic, infection with SARS-CoV-2 variants continues. The rapid development and deployment of SARS-CoV-2 vaccines helped to mitigate the pandemic to a great extent.

View Article and Find Full Text PDF

Background: Currently, synthetic genomics is a rapidly developing field. Its main tasks, such as the design of synthetic sequences and the assembly of DNA sequences from synthetic oligonucleotides, require specialized software. In this article, we present a program with a graphical interface that allows non-bioinformatics to perform the tasks needed in synthetic genomics.

View Article and Find Full Text PDF

Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations.

View Article and Find Full Text PDF

[Prevalence and genetic characteristics of infections among HIV-positive individuals in Jiangxi Province].

Zhongguo Xue Xi Chong Bing Fang Zhi Za Zhi

July 2024

Nanchang Municipal Center for Disease Control and Prevention, Base of National Key Laboratory for the Prevention and Control of Infectious Diseases, Nanchang, Jiangxi 330038, China.

Objective: To investigate the prevalence of infection and the distribution of parasite species and genotypes among HIV-positive individuals in Jiangxi Province.

Methods: HIV-positive individuals' sociodemographic and clinical data were collected from three AIDS designated hospitals in Jiangxi Province from January 2022 to March 2023. Subjects' stool samples were collected, and genomic DNA was extracted from stool samples.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!