WegoLoc: accurate prediction of protein subcellular localization using weighted Gene Ontology terms.

Bioinformatics

School of Computer Science and Engineering, Kyungsung University, Nam-gu, Suyoung-ro 309, Pusan, South Korea.

Published: April 2012

Summary: We present an accurate and fast web server, WegoLoc for predicting subcellular localization of proteins based on sequence similarity and weighted Gene Ontology (GO) information. A term weighting method in the text categorization process is applied to GO terms for a support vector machine classifier. As a result, WegoLoc surpasses the state-of-the-art methods for previously used test datasets. WegoLoc supports three eukaryotic kingdoms (animals, fungi and plants) and provides human-specific analysis, and covers several sets of cellular locations. In addition, WegoLoc provides (i) multiple possible localizations of input protein(s) as well as their corresponding probability scores, (ii) weights of GO terms representing the contribution of each GO term in the prediction, and (iii) a BLAST E-value for the best hit with GO terms. If the similarity score does not meet a given threshold, an amino acid composition-based prediction is applied as a backup method.

Availability: WegoLoc and User's guide are freely available at the website http://www.btool.org/WegoLoc

Contact: smchiks@ks.ac.kr; dougnam@unist.ac.kr

Supplementary Information: Supplementary data is available at http://www.btool.org/WegoLoc.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bts062DOI Listing

Publication Analysis

Top Keywords

subcellular localization
8
weighted gene
8
gene ontology
8
wegoloc
6
wegoloc accurate
4
accurate prediction
4
prediction protein
4
protein subcellular
4
localization weighted
4
terms
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!