A mathematical consideration of the word-composition vector method in comparison of biological sequences.

Biosystems

Graduate School of Science and Engineering, Saitama University, 255 Shimo-okubo, Saitama 338-8570, Japan.

Published: November 2011

To measure the similarity or dissimilarity between two given biological sequences, several papers proposed metrics based on the "word-composition vector". The essence of these metrics is as follows. First, we count the appearance frequencies of all the K-tuple words throughout each of two given sequences. Then, the two given sequences are transformed into their respective word-composition vectors. Next, the distance metrics, for example the angle between the two vectors, are calculated. A significant issue is to determine the optimal word size K. With a mathematical model of mutational events (including substitutions, insertions, deletions and duplications) that occur in sequences, we analyzed how the angle between the composition vectors depends on the mutational events. We also considered the optimal word size (=resolution) from our original approach. Our results were verified by computational experiments using artificially generated sequences, amino acid sequences of hemoglobin and nucleotide sequences of 16S ribosomal RNA.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.biosystems.2011.06.009DOI Listing

Publication Analysis

Top Keywords

sequences
8
biological sequences
8
optimal word
8
word size
8
mutational events
8
mathematical consideration
4
consideration word-composition
4
word-composition vector
4
vector method
4
method comparison
4

Similar Publications

Draft Genome of Naganishia uzbekistanensis from a Clinical Pulmonary Infection.

Mycopathologia

January 2025

Department of Laboratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China.

This study presents the first high-quality assembled genome of Naganishia uzbekistanensis, derived from a clinical isolate CY11558 obtained from a patient with a postoperative pulmonary infection. This work provides an improved reference assembly for downstream research and diagnosis of infections caused by this species.

View Article and Find Full Text PDF

A gene within a single subclade of NCED genes is triggered in response to both, short- and long-term dehydration treatments, in three model dicot species. During dehydration, some plants can rapidly synthesise the stress hormone abscisic acid (ABA) in leaves within 20 min, triggering the closure of stomata and limiting further water loss. This response is associated with significant transcriptional upregulation of Nine-cis-Epoxycarotenoid Dioxygenase (NCED) genes, which encode the enzyme considered to be rate-limiting in ABA biosynthesis.

View Article and Find Full Text PDF

Sleeve Gastrectomy and Gastric Bypass Impact in Patient's Metabolic, Gut Microbiome, and Immuno-inflammatory Profiles-A Comparative Study.

Obes Surg

January 2025

Coimbra Institute for Clinical and Biomedical Research (iCBR) Area of Environment, Genetics and Oncobiology (CIMAGO), Faculty of Medicine, University of Coimbra, Coimbra, Portugal.

Background: Bariatric surgery is the most long-term effective treatment option for severe obesity. The role of gut microbiome (GM) in either the development of obesity or in response to obesity management strategies has been a matter of debate. This study aims to compare the impact of two of the most popular procedures, sleeve gastrectomy (SG) and Roux-en-Y gastric bypass (GB), on metabolic syndrome parameters and gut bacterial microbiome and in systemic immuno-inflammatory response.

View Article and Find Full Text PDF

The marine microbiome arouses an increasing interest, aimed at better understanding coral reef biodiversity, coral resilience, and identifying bioindicators of ecosystem health. The present study is a microbiome mining of three environmentally contrasted sites along the Hermitage fringing reef of La Réunion Island (Western Indian Ocean). This mining aims to identify bioindicators of reef health to assist managers in preserving the fringing reefs of La Réunion.

View Article and Find Full Text PDF

Purpose: This work described a new species of Ceratomyxa, based on morphological and phylogenetic analyzes of myxospores collected from the gallbladder of the fish Astyanax mexicanus.

Methods: Sixty-two specimens were captured, between December 2022 and February 2024, in the Flexal River, in the community of Tessalônica, state of Amapá. The specimens were transported alive to the Laboratory of Morphophysiology and Animal Health, at the State University of Amapá, where the studies were carried out.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!