Background: Predicting protein subnuclear localization is a challenging problem. Previous approaches based on non-sequence information, including Gene Ontology annotations and kernel fusion, each have limitations. The aim of this work is twofold: first, to propose a novel individual feature extraction method; second, to develop an ensemble method that improves prediction performance by using comprehensive information represented as a high-dimensional feature vector obtained from 11 feature extraction methods.
Methodology/Principal Findings: A novel two-stage multiclass support vector machine is proposed to predict protein subnuclear localizations. It considers only feature extraction methods based on amino acid classifications and physicochemical properties. To speed up the system, an automatic search method for the kernel parameter is used. The prediction performance of the method is evaluated on four datasets: the Lei dataset, a multi-localization dataset, the SNL9 dataset and a new independent dataset. In leave-one-out cross-validation, the overall accuracy is 75.2% for 6 localizations on the Lei dataset and 72.1% for 9 localizations on the SNL9 dataset; it is 71.7% on the multi-localization dataset and 69.8% on the new independent dataset. Comparisons with existing methods show that our method performs better for both single-localization and multi-localization proteins and achieves more balanced sensitivities and specificities on large-size and small-size subcellular localizations. The overall accuracy improvements are 4.0% and 4.7% for single-localization proteins and 6.5% for multi-localization proteins. The reliability and stability of the classification model are further confirmed by permutation analysis.
Conclusions: Our method is effective and valuable for predicting protein subnuclear localizations. A web server implementing the proposed method is freely available at http://bioinformatics.awowshop.com/snlpred_page.php.
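As a rough illustration of two ideas named in the abstract, features derived from amino acid classifications and leave-one-out cross-validation, the following Python sketch may help. It is not the authors' pipeline: the three-group amino acid classification, the toy sequences and labels, and all function names are assumptions made for this example, and a simple 1-nearest-neighbour rule stands in for the two-stage multiclass support vector machine.

```python
from collections import Counter
from math import dist

# Hypothetical three-class grouping of the 20 amino acids; the
# classifications actually used by the authors are not given in the abstract.
GROUPS = {
    "hydrophobic": "AVLIMFWPG",
    "polar": "STCYNQ",
    "charged": "DEKRH",
}
RESIDUE_TO_GROUP = {aa: g for g, members in GROUPS.items() for aa in members}
GROUP_ORDER = list(GROUPS)


def group_composition(sequence):
    """Return the fraction of residues that fall in each amino acid group."""
    counts = Counter(
        RESIDUE_TO_GROUP[aa] for aa in sequence if aa in RESIDUE_TO_GROUP
    )
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(GROUP_ORDER)
    return [counts[g] / total for g in GROUP_ORDER]


def nearest_neighbour_predict(train_x, train_y, query):
    """Stand-in classifier (1-nearest neighbour), NOT the two-stage SVM."""
    best = min(range(len(train_x)), key=lambda i: dist(train_x[i], query))
    return train_y[best]


def leave_one_out_accuracy(features, labels):
    """Hold out each sample once, train on the rest, return overall accuracy."""
    correct = 0
    for i in range(len(features)):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if nearest_neighbour_predict(train_x, train_y, features[i]) == labels[i]:
            correct += 1
    return correct / len(features)


# Toy sequences and labels, invented for illustration only.
toy = [
    ("KRKRDEKRAA", "nucleolus"),
    ("KKRRDDEEKL", "nucleolus"),
    ("AVLIMFWAVL", "nuclear_lamina"),
    ("LLIVAVMFWG", "nuclear_lamina"),
]
X = [group_composition(seq) for seq, _ in toy]
y = [label for _, label in toy]
print(leave_one_out_accuracy(X, y))
```

In a real setting the grouped composition would be only one of several feature vectors, and the stand-in classifier would be replaced by a properly tuned multiclass model.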
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584121 | PMC |
| http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0057225 | PLOS |
J Virol
December 2024
Department of Microbiology, University of Washington School of Medicine, Seattle, Washington, USA.
Unlabelled: Due to the importance of post-translational modification (PTM) in cellular function, viruses have evolved to both take advantage of and be susceptible to such modification. Adenovirus encodes a multifunctional protein called protein VII, which is packaged with the viral genome in the core of virions and disrupts host chromatin during infection. Protein VII has several PTMs whose addition contributes to the subnuclear localization of protein VII.
J Am Chem Soc
January 2025
Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States.
Cellular activity is spatially organized across different organelles. While several structures are well-characterized, many organelles have unknown roles. Profiling biomolecular composition is key to understanding function but is difficult to achieve in the context of small, dynamic structures.
Adv Biol Regul
November 2024
Department of Biology of the Cell Nucleus, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic.
mBio
January 2025
Centro de Investigación en Dinámica Celular, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico.
Human adenoviruses are double-stranded DNA viruses that replicate in the cell nucleus and induce the formation of replication compartments (RCs) that are critical in viral replication and control of virus-host interactions. RCs are specialized virus-induced subnuclear microenvironments where not only viral genome replication and expression are orchestrated but also host proteins that restrict viral replication are co-opted and subverted. The protein composition of these RCs remains largely unexplored.
Int J Mol Sci
November 2024
Department Genes and Environment, Max Planck Institute of Psychiatry, Kraepelinstr. 2-10, 80804 Munich, Germany.
The expression of , and its resulting protein FKBP51, is strongly induced by glucocorticoids. Numerous studies have explored their involvement in a plethora of cellular processes and diseases. There is, however, a lack of knowledge on the role of the different RNA splicing variants and the two protein isoforms, one missing functional C-terminal motifs.