Background: Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem.

Results: It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation.

Conclusions: The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180342PMC
http://dx.doi.org/10.1186/1756-0500-7-654DOI Listing

Publication Analysis

Top Keywords

high sequence
12
sequence identity
12
amino acid
12
acid sequences
8
artificial proteins
8
discriminating structures
8
structures based
8
analysis based
8
based inter-residue
8
inter-residue average
8

Similar Publications

Cuproptosis, a newly identified form of cell death, has drawn increasing attention for its association with various cancers, though its specific role in colorectal cancer (CRC) remains unclear. In this study, transcriptomic and clinical data from CRC patients available in the TCGA database were analyzed to investigate the impact of cuproptosis. Differentially expressed genes linked to cuproptosis were identified using Weighted Gene Co-Expression Network Analysis (WGCNA).

View Article and Find Full Text PDF

Warfarin is the most widely used oral anticoagulant in clinical practice. The cytochrome P450 2C9 (CYP2C9), vitamin K epoxide reductase complex 1 (VKORC1), and cytochrome P450 4F2 (CYP4F2) genotypes are associated with warfarin dose requirements in China. Accurate genotyping is vital for obtaining reliable genotype-guided warfarin dosing information.

View Article and Find Full Text PDF

Pathogenic activating mutations in the fibroblast growth factor receptor 3 (FGFR3) drive disease maintenance and progression in urothelial cancer. 10-15% of muscle-invasive and metastatic urothelial cancer (MIBC/mUC) are FGFR3-mutant. Selective targeting of FGFR3 hotspot mutations with tyrosine kinase inhibitors (e.

View Article and Find Full Text PDF

Although CCL17 has been reported to exert a vital role in many cancers, the related studies in the thyroid carcinoma have never reported. As a chemokine, CCL17 plays a positive role by promoting the infiltration of immune cells into the tumor microenviroment (TME) to influence tumor invasion and metastasis. Therefore, this study is aimed to investigate the association of CCL17 level with potential prognostic value on tumor immunity in the thyroid carcinoma (THCA) based on the bioinformatics analysis.

View Article and Find Full Text PDF

Limited whole genome sequencing (WGS) studies in Asian populations result in a lack of representative reference panels, thus hindering the discovery of ancestry-specific variants. Here, we present the South and East Asian reference Database (SEAD) panel ( https://imputationserver.westlake.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!