Motivation: There is a need for rapid and easy to use, alignment free methods to cluster large groups of protein sequence data. Commonly used phylogenetic trees based on alignments can be used to visualize only a limited number of protein sequences. DGraph, introduced here, is a dynamic programming application developed to generate 2D-maps based on similarity scores for sequences. The program automatically calculates and graphically displays property distance (PD) scores based on physico-chemical property (PCP) similarities from an unaligned list of FASTA files. Such "PD-graphs" show the interrelatedness of the sequences, whereby clusters can reveal deeper connectivities.

Results: PD-Graphs generated for flavivirus (FV), enterovirus (EV), and coronavirus (CoV) sequences from complete polyproteins or individual proteins are consistent with biological data on vector types, hosts, cellular receptors and disease phenotypes. PD-graphs separate the tick- from the mosquito-borne FV, clusters viruses that infect bats, camels, seabirds and humans separately and the clusters correlate with disease phenotype. The PD method segregates the β-CoV spike proteins of SARS, SARS-CoV-2, and MERS sequences from other human pathogenic CoV, with clustering consistent with cellular receptor usage. The graphs also suggest evolutionary relationships that may be difficult to determine with conventional bootstrapping methods that require postulating an ancestral sequence.

Availability And Implementation: DGraph is written in Java, compatible with the Java 5 runtime or newer. Source code and executable is available from the GitHub website ( https://github.com/bjmnbraun/DGraph/releases ). Documentation for installation and use of the software is available from the Readme.md file at ( https://github.com/bjmnbraun/DGraph ).

Contact: bjmnbraun@gmail.com or webraun@utmb.edu.

Supplementary Information: Supplementary information Table S1 and Fig. S1 are online available.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7430575PMC
http://dx.doi.org/10.1101/2020.08.13.249649DOI Listing

Publication Analysis

Top Keywords

sequences
5
d-graph clusters
4
clusters flaviviruses
4
flaviviruses β-coronaviruses
4
β-coronaviruses hosts
4
hosts disease
4
disease type
4
type human
4
human cell
4
cell receptors
4

Similar Publications

is widely used as a starter culture in the production of cheese, yoghurt and various cultured dairy products, which holds considerable significance in both research and practical applications within the food industry. Throughout history, the taxonomy of has undergone several adjustments and revisions. In 1984, based on the result of DNA-DNA hybridization, was reclassified as subsp.

View Article and Find Full Text PDF

sp. nov., isolated from a patient with ruptured appendicitis.

Int J Syst Evol Microbiol

January 2025

Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region, Hong Kong, PR China.

A clinical isolate, R131, was isolated from the peritoneal swab of a patient who suffered from ruptured appendicitis with abscess and gangrene in Hong Kong in 2018. Cells are facultatively anaerobic, non-motile, Gram-positive coccobacilli. Colonies were small, grey, semi-translucent, low convex and alpha-haemolytic.

View Article and Find Full Text PDF

Background: Myotonic dystrophy type 1 (DM1) is a multisystemic, CTG repeat expansion disorder characterized by a slow, progressive decline in skeletal muscle function. A biomarker correlating RNA mis-splicing, the core pathogenic disease mechanism, and muscle performance is crucial for assessing response to disease-modifying interventions. We evaluated the Myotonic Dystrophy Splice Index (SI), a composite RNA splicing biomarker incorporating 22 disease-specific events, as a potential biomarker of DM1 muscle weakness.

View Article and Find Full Text PDF

Novel Meningoencephalomyelitis Associated With Vimentin IgG Autoantibodies.

JAMA Neurol

January 2025

Department of Neurology, Xuanwu Hospital Capital Medical University, National Center for Neurological Disorders, Beijing, China.

Importance: Autoantibodies targeting astrocytes, such as those against glial fibrillary acidic protein (GFAP) or aquaporin protein 4, are crucial diagnostic markers for autoimmune astrocytopathy among central nervous system (CNS) autoimmune disorders. However, diagnosis remains challenging for patients lacking specific autoantibodies.

Objective: To characterize a syndrome of unknown meningoencephalomyelitis associated with an astrocytic autoantibody.

View Article and Find Full Text PDF

Purpose: Renal medullary carcinoma (RMC) is a highly aggressive malignancy defined by the loss of the SMARCB1 tumor suppressor. It mainly affects young individuals of African descent with sickle cell trait, and it is resistant to conventional therapies used for other renal cell carcinomas. This study aimed to identify potential biomarkers for early detection and disease monitoring of RMC.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!