Motivation: To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. The two main classes of pairwise alignments are global alignment, where one string is transformed into the other, and local alignment, where all locations of similarity between the two strings are returned. Global alignments are less prone to demonstrating false homology as each letter of one sequence is constrained to being aligned to only one letter of the other. Local alignments, on the other hand, can cope with rearrangements between non-syntenic, orthologous sequences by identifying similar regions in sequences; this, however, comes at the expense of a higher false positive rate due to the inability of local aligners to take into account overall conservation maps.

Results: In this paper we introduce the notion of glocal alignment, a combination of global and local methods, where one creates a map that transforms one sequence into the other while allowing for rearrangement events. We present Shuffle-LAGAN, a glocal alignment algorithm that is based on the CHAOS local alignment algorithm and the LAGAN global aligner, and is able to align long genomic sequences. To test Shuffle-LAGAN we split the mouse genome into BAC-sized pieces, and aligned these pieces to the human genome. We demonstrate that Shuffle-LAGAN compares favorably in terms of sensitivity and specificity with standard local and global aligners. From the alignments we conclude that about 9% of human/mouse homology may be attributed to small rearrangements, 63% of which are duplications.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btg1005DOI Listing

Publication Analysis

Top Keywords

glocal alignment
12
local alignment
8
alignment algorithm
8
alignment
7
local
6
global
5
alignment finding
4
finding rearrangements
4
rearrangements alignment
4
alignment motivation
4

Similar Publications

Identification, characterization, and anti-inflammatory activity of a lipocalin-like protein cloned from .

Food Sci Biotechnol

February 2025

Department of Marine Life Science, College of Life Science, Gangneung-Wonju National University, Gangneung, 25457 Gangwon Republic of Korea.

This study investigated the anti-inflammatory effects of OJlipo1, a lipocalin-like protein derived from . Through cloning and expressing OJlipo1 in , and subsequent rigorous characterization including amino acid analysis and mass spectrometry, its potential against inflammation was evaluated. Studies on lipopolysaccharide-stimulated RAW 264.

View Article and Find Full Text PDF

Herein, we present an innovative and atom-efficient synthesis of trimethine cyanines (Cy3) using formaldehyde (FA) as a single-carbon reagent. The widespread application of Cy3 dyes in bioimaging and genomics/proteomics is often limited by synthetic routes plagued by low atom economy and substantial side-product formation. Through systematic investigation, we have developed a practical and efficient synthetic pathway for both symmetrical and unsymmetrical Cy3 derivatives, significantly minimizing resource utilization.

View Article and Find Full Text PDF

We used Nutraceutical Industrial Coriander Seed Spent (NICSS), a readily available, cheap, eco-friendly, and ready-to-use material, as an innovative adsorbent for the bioremediation of a bisazo Acid Red 119 (AR 119) dye, which is likely a mutagen from textile industrial effluents (TIE). A laboratory-scale experiment was tailored to demonstrate the framework of the circular economy (CE) in the remediation of textile dyes using Nutraceutical Industrial Spent to align with the principles of sustainability and valorization. An experimental value of 97.

View Article and Find Full Text PDF

Deep neural networks have become increasingly popular for analyzing ECG data because of their ability to accurately identify cardiac conditions and hidden clinical factors. However, the lack of transparency due to the black box nature of these models is a common concern. To address this issue, explainable AI (XAI) methods can be employed.

View Article and Find Full Text PDF

The E3 ubiquitin ligase COP1 regulates salt tolerance via GIGANTEA degradation in roots.

Plant Cell Environ

August 2024

Division of Applied Life Science (BK21 Four), Plant Biological Rhythm Research Center, Plant Molecular Biology and Biotechnology Research Center, Gyeongsang National University, Jinju, Republic of Korea.

Excess soil salinity significantly impairs plant growth and development. Our previous reports demonstrated that the core circadian clock oscillator GIGANTEA (GI) negatively regulates salt stress tolerance by sequestering the SALT OVERLY SENSITIVE (SOS) 2 kinase, an essential component of the SOS pathway. Salt stress induces calcium-dependent cytoplasmic GI degradation, resulting in activation of the SOS pathway; however, the precise molecular mechanism governing GI degradation during salt stress remains enigmatic.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!