Background/objective: Information of protein subcellular localization is crucially important for both basic research and drug development. With the explosive growth of protein sequences discovered in the post-genomic age, it is highly demanded to develop powerful bioinformatics tools for timely and effectively identifying their subcellular localization purely based on the sequence information alone. Recently, a predictor called "pLoc-mEuk" was developed for identifying the subcellular localization of eukaryotic proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems where many proteins, called "multiplex proteins", may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mEuk was trained by an extremely skewed dataset where some subset was about 200 times the size of the other subsets. Accordingly, it cannot avoid the biased consequence caused by such an uneven training dataset.

Methods: To alleviate such bias, we have developed a new predictor called pLoc_bal-mEuk by quasi-balancing the training dataset. Cross-validation tests on exactly the same experimentconfirmed dataset have indicated that the proposed new predictor is remarkably superior to pLocmEuk, the existing state-of-the-art predictor in identifying the subcellular localization of eukaryotic proteins. It has not escaped our notice that the quasi-balancing treatment can also be used to deal with many other biological systems.

Results: To maximize the convenience for most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc_bal-mEuk/.

Conclusion: It is anticipated that the pLoc_bal-Euk predictor holds very high potential to become a useful high throughput tool in identifying the subcellular localization of eukaryotic proteins, particularly for finding multi-target drugs that is currently a very hot trend trend in drug development.

Download full-text PDF

Source
http://dx.doi.org/10.2174/1573406415666181218102517DOI Listing

Publication Analysis

Top Keywords

subcellular localization
24
localization eukaryotic
16
eukaryotic proteins
16
identifying subcellular
16
quasi-balancing training
8
training dataset
8
drug development
8
predictor called
8
subcellular
7
predictor
7

Similar Publications

hsa_circ_0008305 facilitates the malignant progression of hepatocellular carcinoma by regulating AKR1C3 expression and sponging miR-379-5p.

Sci Rep

January 2025

Department of Oncology, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, No. 1, Minde Road, Nanchang, 330000, Jiangxi Province, P.R. China.

Circular RNAs (circRNAs) are widely involved in diverse biological processes of cancers. Nonetheless, the potential function of hsa_circ_0008305 in hepatocellular carcinoma (HCC) remains largely unknown. This study aims to elucidate the role and underlying mechanism of hsa_circ_0008305 in HCC.

View Article and Find Full Text PDF

SLC10A7 regulates O-GalNAc glycosylation and Ca homeostasis in the secretory pathway: insights into SLC10A7-CDG.

Cell Mol Life Sci

January 2025

Univ. Lille, CNRS, UMR 8576 - UGSF - Unité de Glycobiologie Structurale Et Fonctionnelle, 59000, Lille, France.

Glycans are known to be fundamental for many cellular and physiological functions. Congenital disorders of glycosylation (CDG) currently encompassing over 160 subtypes, are characterized by glycan synthesis and/or processing defects. Despite the increasing number of CDG patients, therapeutic options remain very limited as our knowledge on glycan synthesis is fragmented.

View Article and Find Full Text PDF

Huntington's disease (HD), a neurodegenerative disease, affects approximately 30,000 people in the United States, with 200,000 more at risk. Mitochondrial dysfunction caused by mutant huntingtin (mHTT) drives early HD pathophysiology. mHTT binds the translocase of mitochondrial inner membrane (TIM23) complex, inhibiting mitochondrial protein import and altering the mitochondrial proteome.

View Article and Find Full Text PDF

Research note: The critical role of the interaction between Eimeria tenella invasion protein RON2 and host receptor annexin A2 in mediating parasite invasion.

Poult Sci

December 2024

Guangdong Province Key Laboratory of Livestock Disease Prevention, Key Laboratory of Avian Infuenza and Other Major Poultry Diseases Prevention and Control, Ministry of Agriculture and Rural Afairs, Institute of Animal Health, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China. Electronic address:

Avian coccidiosis, caused by protozoan parasites of the genus Eimeria, is a globally prevalent and highly pathogenic disease that poses a serious threat to the poultry industry, resulting in significant economic losses. However, the mechanism by which Eimeria species invade host cells remains unclear. Previous studies have identified rhoptry neck protein 2 (RON2) from Eimeria tenella as a critical factor in host cell invasion, but a comprehensive understanding of the role of EtRON2 in host cell invasion and its relationship with E.

View Article and Find Full Text PDF

Chondrocytes are commonly applied in regenerative medicine and tissue engineering. Thus, the discovery of optimal culture conditions to obtain cells with good properties and behavior for transplantation is important. In addition to biochemical cues, physical and biomechanical changes can affect the proliferation and protein expression of chondrocytes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!