From the literature, existing methods use pairwise percent identity to identify the percentage of similarity between two protein sequences, in order to create a dendrogram. As this is a parametric method of measuring the similarities between proteins, and different parameter may yield different results, this method does not guarantee that the global optimal similarity values will be found. As protein dendrogram construction is used in other areas, such as multiple protein sequence alignments, it is very important that the most related protein sequences to be identified and align first. Furthermore, by using the pairwise percent identity of the protein sequences to construct the dendrograms, the physical characteristics of protein sequences and amino acids are not considered. In this paper, a new method was proposed for constructing protein sequence dendrograms. For this method, Discrete Fourier Transform, was used to construct the distance matrix in combination with the multiple amino acid indices that were used to encode protein sequences into numerical sequences. In order to show the applicability and robustness of the proposed method, a case study was presented by using nine Cluster of Differentiation 4 protein sequences extracted from the UniProt online database.

Download full-text PDF

Source
http://dx.doi.org/10.1109/EMBC.2014.6943716DOI Listing

Publication Analysis

Top Keywords

protein sequences
24
protein
9
amino acid
8
acid indices
8
discrete fourier
8
fourier transform
8
pairwise percent
8
percent identity
8
sequences order
8
protein sequence
8

Similar Publications

The HAK/KUP/KT (High-affinity K transporters/K uptake permeases/K transporters) is the largest and most dominant potassium transporter family in plants, playing a crucial role in various biological processes. However, our understanding of HAK/KUP/KT gene family in potato ( L.) remains limited and unclear.

View Article and Find Full Text PDF

Introduction: Chimeric antigen receptor (CAR) expressing T-cells have shown great promise for the future of cancer immunotherapy with the recent clinical successes achieved in treating different hematologic cancers. Despite these early successes, several challenges remain in the field that require to be solved for the therapy to be more efficacious. One such challenge is the lack of long-term persistence of CD28 based CAR T-cells in patients.

View Article and Find Full Text PDF

Background: Therapeutic antibodies for the treatment of neurological disease show great potential, but their applications are rather limited due to limited brain exposure. The most well-studied approach to enhance brain influx of protein therapeutics, is receptor-mediated transcytosis (RMT) by targeting nutrient receptors to shuttle protein therapeutics over the blood-brain barrier (BBB) along with their endogenous cargos. While higher brain exposure is achieved with RMT, the timeframe is short due to rather fast brain clearance.

View Article and Find Full Text PDF

Background: Pseudogalium is a new monotypic genus with two subspecies in China and one in Japan, which holds a distinctive phylogenetic position and ecological significance within the tribe Rubieae. Chloroplast genomes contain abundant information for resolving phylogenetic relationships. To investigate the phylogenetics of P.

View Article and Find Full Text PDF

Background: Circular (circ)RNAs have emerged as crucial contributors to cancer progression. Nonetheless, the expression regulation, biological functions, and underlying mechanisms of circRNAs in mediating hepatocellular carcinoma (HCC) progression remain insufficiently elucidated.

Methods: We identified circUCK2(2,3) through circRNA sequencing, RT-PCR, and Sanger sequencing.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!