Motivation: Protein sequences are often composed of regions that have distinct evolutionary histories as a consequence of domain shuffling, recombination or gene conversion. New approaches are required to discover, visualize and analyze these sequence regions and thus enable a better understanding of protein evolution.

Results: Here, we have developed an alignment-free and visual approach to analyze sequence relationships. We use the number of shared n-grams between sequences as a measure of sequence similarity and reorder the resulting affinity matrix using a spectral technique. Heat maps of the affinity matrix are employed to identify and visualize clusters of related sequences or outliers, while n-gram-based dot plots and conservation profiles allow detailed analysis of similarities among selected sequences. Using this approach, we have identified signatures of domain shuffling in an otherwise poorly characterized family, and homology clusters in another. We conclude that this approach may be generally useful as a framework to analyze related but highly divergent protein sequences. It is particularly useful as a fast method to study sequence relationships prior to much more time-consuming multiple sequence alignment and phylogenetic analysis.
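The core idea (shared n-gram counts as an affinity, followed by a spectral reordering) can be illustrated with a minimal sketch. This is not the MOSAIC implementation: the function names, the choice of n = 3, counting shared n-grams with multiplicity, and the use of Fiedler-vector ordering as the "spectral technique" are all assumptions made for illustration, and the toy sequences are invented.

```python
from collections import Counter

import numpy as np


def ngram_counts(seq, n=3):
    """Count every overlapping n-gram in a sequence."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def shared_ngrams(a, b, n=3):
    """Number of n-grams two sequences share (assumed here: with multiplicity)."""
    return sum((ngram_counts(a, n) & ngram_counts(b, n)).values())


def affinity_matrix(seqs, n=3):
    """Symmetric matrix of pairwise shared n-gram counts."""
    k = len(seqs)
    A = np.zeros((k, k))
    for i in range(k):
        for j in range(i, k):
            A[i, j] = A[j, i] = shared_ngrams(seqs[i], seqs[j], n)
    return A


def spectral_order(A):
    """Order sequences by the Fiedler vector of the graph Laplacian.

    Assumption: one common spectral seriation choice, used as a stand-in
    for whatever spectral technique the paper applies.
    """
    W = A.copy()
    np.fill_diagonal(W, 0.0)           # ignore self-similarity
    L = np.diag(W.sum(axis=1)) - W     # unnormalized Laplacian
    _, vecs = np.linalg.eigh(L)        # eigenvalues in ascending order
    return np.argsort(vecs[:, 1])      # second-smallest eigenvector


# Toy input: two invented families, deliberately interleaved.
seqs = [
    "MKTAYIAKQR", "GSHWLPPEDV", "MKTAYIAKQL",
    "GSHWLPPEDVMKT", "MKTAYIAKQRG", "GSHWLPPEDA",
]
A = affinity_matrix(seqs)
order = spectral_order(A)
A_reordered = A[np.ix_(order, order)]  # matrix one would draw as a heat map
```

After reordering, members of each toy family sit next to each other, so a heat map of `A_reordered` would show two blocks along the diagonal. The fourth sequence carries one n-gram from the other family, which keeps the similarity graph connected and the ordering well defined.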

Availability: A software implementation (MOSAIC) of the framework described here can be downloaded from http://bioinformatics.org.au/mosaic/

Contact: m.ragan@uq.edu.au

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source: http://dx.doi.org/10.1093/bioinformatics/btq042
