Detailed prediction of protein sub-nuclear localization.

BMC Bioinformatics

Department of Informatics, Bioinformatics & Computational Biology - i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching/Munich, Germany.

Published: April 2019

AI Article Synopsis

  • Research addresses the limitations of current methods in predicting sub-nuclear structures, introducing a new tool called LocNuclei that relies on sequence data to classify proteins into 13 distinct nuclear substructures.
  • Utilizing a combination of Profiling Kernels and Support Vector Machines, LocNuclei achieves notable prediction accuracy, with AUC values of 0.70-0.74 and Q13 scores of 59-65%, while also successfully identifying traveling proteins.
  • The tool's predictions enhance our understanding of the functionality and protein-protein interactions within these compartments, indicating that LocNuclei provides valuable insights into nuclear mechanisms.

Article Abstract

Background: Sub-nuclear structures or locations are associated with various nuclear processes. Proteins localized in these substructures are important to understand the interior nuclear mechanisms. Despite advances in high-throughput methods, experimental protein annotations remain limited. Predictions of cellular compartments have become very accurate, largely at the expense of leaving out substructures inside the nucleus making a fine-grained analysis impossible.

Results: Here, we present a new method (LocNuclei) that predicts nuclear substructures from sequence alone. LocNuclei used a string-based Profile Kernel with Support Vector Machines (SVMs). It distinguishes sub-nuclear localization in 13 distinct substructures and distinguishes between nuclear proteins confined to the nucleus and those that are also native to other compartments (traveler proteins). High performance was achieved by implicitly leveraging a large biological knowledge-base in creating predictions by homology-based inference through BLAST. Using this approach, the performance reached AUC = 0.70-0.74 and Q13 = 59-65%. Travelling proteins (nucleus and other) were identified at Q2 = 70-74%. A Gene Ontology (GO) analysis of the enrichment of biological processes revealed that the predicted sub-nuclear compartments matched the expected functionality. Analysis of protein-protein interactions (PPI) show that formation of compartments and functionality of proteins in these compartments highly rely on interactions between proteins. This suggested that the LocNuclei predictions carry important information about function. The source code and data sets are available through GitHub: https://github.com/Rostlab/LocNuclei .

Conclusions: LocNuclei predicts subnuclear compartments and traveler proteins accurately. These predictions carry important information about functionality and PPIs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480651PMC
http://dx.doi.org/10.1186/s12859-019-2790-9DOI Listing

Publication Analysis

Top Keywords

sub-nuclear localization
8
locnuclei predicts
8
compartments traveler
8
traveler proteins
8
predictions carry
8
proteins
7
compartments
6
detailed prediction
4
prediction protein
4
sub-nuclear
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!