Sequence alignment is an essential step in computational genomics. More accurate and efficient sequence pre-alignment methods that run before conducting expensive computation for final verification are still urgently needed. In this article, we propose a more accurate and efficient pre-alignment algorithm for sequence alignment, called DiagAF. Firstly, DiagAF uses a new lower bound of edit distance based on shift hamming masks. The new lower bound makes use of fewer shift hamming masks comparing with state-of-the-art algorithms such as SHD and MAGNET. Moreover, it takes account the information of edit distance path exchanging on shift hamming masks. Secondly, DiagAF can deal with alignments of sequence pairs with not equal length, rather than state-of-the-art methods just for equal length. Thirdly, DiagAF can align sequences with early termination for true alignments. In the experiment, we compared DiagAF with state-of-the-art methods. DiagAF can achieve a much smaller error rate than them, meanwhile use less time than them. We believe that DiagAF algorithm can further improve the performance of state-of-the-art sequence alignment softwares. The source codes of DiagAF can be downloaded from web site https://github.com/BioLab-cz/DiagAF.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCBB.2021.3127879 | DOI Listing |
Front Vet Sci
January 2025
Department of Microbiology, School of Medicine, Jiangsu University, Zhenjiang, Jiangsu, China.
Introduction: Cormorants, as protected wild animals by the State Forestry Administration of China, have a broad distribution across China. Previous studies have shown that they can be infected with multiple viruses in the , , , and families. There is limited knowledge about the other viruses that cormorants may carry and infect.
View Article and Find Full Text PDFImmunohorizons
January 2025
Department of Surgery, Faculty of Medicine and Dentistry, College of Health Sciences, University of Alberta, Edmonton, AB, Canada.
The global dissemination of SARS-CoV-2 led to a worldwide pandemic in March 2020. Even after the official downgrading of the COVID-19 pandemic, infection with SARS-CoV-2 variants continues. The rapid development and deployment of SARS-CoV-2 vaccines helped to mitigate the pandemic to a great extent.
View Article and Find Full Text PDFBMC Bioinformatics
January 2025
Research Institute for Systems Biology and Medicine, Moscow, Russian Federation.
Background: Currently, synthetic genomics is a rapidly developing field. Its main tasks, such as the design of synthetic sequences and the assembly of DNA sequences from synthetic oligonucleotides, require specialized software. In this article, we present a program with a graphical interface that allows non-bioinformatics to perform the tasks needed in synthetic genomics.
View Article and Find Full Text PDFGenome Biol
January 2025
Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations.
View Article and Find Full Text PDFZhongguo Xue Xi Chong Bing Fang Zhi Za Zhi
July 2024
Nanchang Municipal Center for Disease Control and Prevention, Base of National Key Laboratory for the Prevention and Control of Infectious Diseases, Nanchang, Jiangxi 330038, China.
Objective: To investigate the prevalence of infection and the distribution of parasite species and genotypes among HIV-positive individuals in Jiangxi Province.
Methods: HIV-positive individuals' sociodemographic and clinical data were collected from three AIDS designated hospitals in Jiangxi Province from January 2022 to March 2023. Subjects' stool samples were collected, and genomic DNA was extracted from stool samples.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!