Motivation: The rank distance model represents genome rearrangements in multi-chromosomal genomes as matrix operations, which allows the reconstruction of parsimonious histories of evolution by rearrangements. We seek to generalize this model by allowing for genomes with different gene content, to accommodate a broader range of biological contexts. We approach this generalization by using a matrix representation of genomes. This leads to simple distance formulas and sorting algorithms for genomes with different gene contents, but without duplications.

Results: We generalize the rank distance to genomes with different gene content in two different ways. The first approach adds insertions, deletions and the substitution of a single extremity to the basic operations. We show how to efficiently compute this distance. To avoid genomes with incomplete markers, our alternative distance, the rank-indel distance, only uses insertions and deletions of entire chromosomes. We construct phylogenetic trees with our distances and the DCJ-Indel distance for simulated data and real prokaryotic genomes, and compare them against reference trees. For simulated data, our distances outperform the DCJ-Indel distance using the Quartet metric as baseline. This suggests that rank distances are more robust for comparing distantly related species. For real prokaryotic genomes, all rearrangement-based distances yield phylogenetic trees that are topologically distant from the reference (65% similarity with Quartet metric), but are able to cluster related species within their respective clades and distinguish the Shigella strains as the farthest relative of the Escherichia coli strains, a feature not seen in the reference tree.

Availability And Implementation: Code and instructions are available at https://github.com/meidanis-lab/rank-indel.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9985151PMC
http://dx.doi.org/10.1093/bioinformatics/btad087DOI Listing

Publication Analysis

Top Keywords

rank distance
12
genomes gene
12
distance
9
genomes
8
gene content
8
insertions deletions
8
phylogenetic trees
8
dcj-indel distance
8
simulated data
8
real prokaryotic
8

Similar Publications

This study aimed to evaluate the effects of an 8-week physiotherapy program on muscle strength, functional capacity, respiratory function, and quality of life in women recovering from COVID-19. A prospective cohort study was conducted with 42 women aged 18-65 who experienced muscle strength loss and functional impairments post-COVID-19. Participants underwent personalized physiotherapy interventions, including resistance training, respiratory therapy, and functional mobility exercises, for 8 weeks.

View Article and Find Full Text PDF

Construction of Promoter Elements for Strong, Moderate, and Weak Gene Expression in .

Genes (Basel)

December 2024

Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, 34/5 Vavilova Str., Moscow 119334, Russia.

Background/objectives: Transcriptional promoters play an essential role in regulating protein expression. Promoters with weak activity generally lead to low levels of expression, resulting in fewer proteins being produced. At the same time, strong promoters are commonly used in studies using transgenic organisms as model systems.

View Article and Find Full Text PDF

A Novel Hybrid Improved RIME Algorithm for Global Optimization Problems.

Biomimetics (Basel)

December 2024

Department of Computer Science, Durham University, Durham DH1 3LE, UK.

The RIME algorithm is a novel physical-based meta-heuristic algorithm with a strong ability to solve global optimization problems and address challenges in engineering applications. It implements exploration and exploitation behaviors by constructing a rime-ice growth process. However, RIME comes with a couple of disadvantages: a limited exploratory capability, slow convergence, and inherent asymmetry between exploration and exploitation.

View Article and Find Full Text PDF

Conventional techniques for evaluating hydration status include the analysis of blood, urine, and body weight. Recently, advancements in dentistry have introduced capacitance sensor-based oral epithelial moisture meters as promising avenues for assessment. This study aimed to examine the correlation between oral mucosal moisture content, as determined using a capacitance sensor, and exercise-induced dehydration.

View Article and Find Full Text PDF

Traditional methods for evaluating tennis technique, such as visual observation and video analysis, are often subjective and time consuming. On the other hand, a quick and accurate assessment can provide immediate feedback to players and contribute to technical development, particularly in less experienced athletes. This study aims to validate the use of a single inertial measurement system to assess some relevant technical parameters of amateur players.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!