NetGO 2.0: improving large-scale protein function prediction with massive sequence, text, domain, family and network information.

Nucleic Acids Res

Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China.

Published: July 2021

With the explosive growth of protein sequences, large-scale automated protein function prediction (AFP) is becoming challenging. A protein is usually associated with dozens of gene ontology (GO) terms. Therefore, AFP is regarded as a problem of large-scale multi-label classification. Under the learning to rank (LTR) framework, our previous NetGO tool integrated massive networks and multi-type information about protein sequences to achieve good performance by dealing with all possible GO terms (>44 000). In this work, we propose the updated version as NetGO 2.0, which further improves the performance of large-scale AFP. NetGO 2.0 also incorporates literature information by logistic regression and deep sequence information by recurrent neural network (RNN) into the framework. We generate datasets following the critical assessment of functional annotation (CAFA) protocol. Experiment results show that NetGO 2.0 outperformed NetGO significantly in biological process ontology (BPO) and cellular component ontology (CCO). In particular, NetGO 2.0 achieved a 12.6% improvement over NetGO in terms of area under precision-recall curve (AUPR) in BPO and around 2.6% in terms of $\mathbf {F_{max}}$ in CCO. These results demonstrate the benefits of incorporating text and deep sequence information for the functional annotation of BPO and CCO. The NetGO 2.0 web server is freely available at http://issubmission.sjtu.edu.cn/ng2/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8262706PMC
http://dx.doi.org/10.1093/nar/gkab398DOI Listing

Publication Analysis

Top Keywords

netgo
9
protein function
8
function prediction
8
protein sequences
8
deep sequence
8
functional annotation
8
cco netgo
8
protein
5
netgo improving
4
large-scale
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!