Revealing the subcellular location of newly discovered protein sequences can provide insight into their function and guide research at the cellular level. The rapidly increasing number of sequences entering the genome databanks calls for automated analysis methods. Most existing methods for predicting protein subcellular location cover only one or a very limited number of species. It is therefore necessary to develop reliable and effective computational approaches that improve prediction performance and, at the same time, cover more species. The current study reports the development of a novel predictor, MSLoc-DT, that predicts the protein subcellular locations of human, animal, plant, bacteria, virus, fungi, and archaea. It introduces a novel feature extraction approach termed Amino Acid Index Distribution (AAID) and then fuses gene ontology information, sequential evolutionary information, and sequence statistical information through four different modes of pseudo amino acid composition (PseAAC) with a decision template rule. In the jackknife test, MSLoc-DT achieves overall accuracies of 86.5, 98.3, 90.3, 98.5, 95.9, 98.1, and 99.3% for human, animal, plant, bacteria, virus, fungi, and archaea, respectively, on seven stringent benchmark datasets. Compared with other predictors (e.g., Gpos-PLoc, Gneg-PLoc, Virus-PLoc, Plant-PLoc, Plant-mPLoc, ProLoc-Go, Hum-PLoc, GOASVM) on the gram-positive, gram-negative, virus, plant, eukaryotic, and human datasets, the new MSLoc-DT predictor is much more effective and robust. Although MSLoc-DT is designed to predict a single location per protein, the method can be extended to proteins with multiple locations by introducing multilabel machine learning approaches, such as the support vector machine and deep learning, as substitutes for the K-nearest neighbor (KNN) method.
As a user-friendly web server, MSLoc-DT is freely accessible at http://bioinfo.ibp.ac.cn/MSLOC_DT/index.html.
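To make the jackknife test mentioned in the abstract concrete, the following is a minimal sketch, not the MSLoc-DT implementation. It assumes plain 20-dimensional amino acid composition as the only feature and a single KNN classifier. The actual predictor uses AAID, gene ontology, and evolutionary features fused by a decision template rule, none of which is reproduced here. The function names, toy sequences, and location labels are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import dist

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(sequence):
    """Return the 20-dimensional residue frequency vector of a sequence."""
    counts = Counter(sequence.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[a] / total for a in AMINO_ACIDS]


def knn_predict(train_features, train_labels, query, k=1):
    """Majority vote among the k training samples closest to the query."""
    ranked = sorted(
        range(len(train_features)),
        key=lambda i: dist(train_features[i], query),  # Euclidean distance
    )
    votes = Counter(train_labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


def jackknife_accuracy(sequences, labels, k=1):
    """Leave-one-out test: each protein is predicted from all the others."""
    features = [amino_acid_composition(s) for s in sequences]
    correct = 0
    for i, query in enumerate(features):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if knn_predict(train_x, train_y, query, k) == labels[i]:
            correct += 1
    return correct / len(features)


# Hypothetical toy data: two groups with clearly different compositions.
toy_sequences = ["AAAAAG", "AAAAGG", "AAAGAA", "KKKKKR", "KKKRKK", "KKRRKK"]
toy_labels = ["membrane"] * 3 + ["nucleus"] * 3
print(jackknife_accuracy(toy_sequences, toy_labels, k=1))
```

Because every sample is held out exactly once, the jackknife result for a given dataset does not depend on a random split, which is one reason it is commonly reported in this line of work.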
DOI: http://dx.doi.org/10.1016/j.ab.2013.12.013