Essential Protein Prediction Based on node2vec and XGBoost.

J Comput Biol

School of Computer Science and Engineering, Central South University, Changsha, P.R. China.

Published: July 2021

Essential proteins are a vital part of the survival of organisms and cells. Identification of essential proteins lays a solid foundation for understanding protein functions and discovering drug targets. The traditional biological experiments are expensive and time-consuming. Recently, many computational methods have been proposed. However, some noises in the protein-protein interaction (PPI) networks affect the efficiency of essential protein prediction. It is necessary to construct a credible PPI network by using other useful biological information to reduce the effects of these noises. In this article, we proposed a model, Ess-NEXG, to identify essential proteins, which integrates biological information, including orthologous information, subcellular localization information, RNA-Seq information, and PPI network. In our model, first, we constructed a credible weighted PPI network by using different types of biological information. Second, we extracted the topological features of proteins in the constructed weighted PPI network by using the node2vec technique. Last, we used eXtreme Gradient Boosting (XGBoost) to predict essential proteins by using the topological features of proteins. The extensive results show that our model has better performance than other computational methods.

Download full-text PDF

Source
http://dx.doi.org/10.1089/cmb.2020.0543DOI Listing

Publication Analysis

Top Keywords

essential proteins
16
ppi network
16
essential protein
8
protein prediction
8
computational methods
8
weighted ppi
8
topological features
8
features proteins
8
essential
6
proteins
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!