Predicting protein function via downward random walks on a gene ontology.

BMC Bioinformatics

Department of Computer Science, Hong Kong Baptist University, Hong Kong, Hong Kong.

Published: August 2015

Background: High-throughput biotechnologies generate an ever-increasing amount of genomic and proteomic data. These data are far from being functionally characterized, despite advances in the functional annotation of genes (or their protein products). Owing to the limitations of experimental techniques and to research bias in biology, regularly updated functional annotation databases, such as the Gene Ontology (GO), are far from complete. Given the importance of protein functions for biological studies and drug design, proteins should be annotated more comprehensively and precisely.

Results: We propose downward Random Walks (dRW) to predict missing (or new) functions of partially annotated proteins. Specifically, we apply downward random walks with restart on the GO directed acyclic graph, along with the available functions of a protein, to estimate the probability of missing functions. To further boost prediction accuracy, we extend dRW to dRW-kNN. dRW-kNN computes the semantic similarity between proteins from their functional annotations; it then predicts functions based on the functions estimated by dRW, together with the functions associated with the k nearest proteins. The proposed models can predict two kinds of missing functions: (i) those that are missing for a protein but associated with other proteins of interest; (ii) those that are not available for any protein of interest but exist in the GO hierarchy. Experimental results on yeast and human proteins show that dRW and dRW-kNN can replenish functions more accurately than other related approaches, especially for sparse functions associated with no more than 10 proteins.
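The abstract does not give the dRW update rule, so the following is only a minimal sketch of a generic random walk with restart that moves downward (from a term to its children) on a small ontology. The uniform transition to children, the self-loop at leaf terms, the restart probability of 0.5, the function name `downward_rwr`, and the toy term names are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np

def downward_rwr(children, terms, annotated, restart=0.5, tol=1e-10, max_iter=1000):
    """Random walk with restart that only steps from a term to its children.

    Hypothetical helper: the transition and restart choices are assumptions,
    not the published dRW formulation.
    """
    index = {t: i for i, t in enumerate(terms)}
    n = len(terms)

    # W[i, j] = probability of stepping from term i down to its child j
    # (assumed uniform over the children of i).
    W = np.zeros((n, n))
    for parent, kids in children.items():
        for kid in kids:
            W[index[parent], index[kid]] = 1.0 / len(kids)

    # Leaf terms have no children; assume the walker stays where it is.
    for i in range(n):
        if W[i].sum() == 0:
            W[i, i] = 1.0

    # Restart distribution: uniform over the protein's known annotations.
    p0 = np.zeros(n)
    for t in annotated:
        p0[index[t]] = 1.0 / len(annotated)

    # Iterate p <- (1 - restart) * W^T p + restart * p0 until it stabilizes.
    p = p0.copy()
    for _ in range(max_iter):
        p_next = (1 - restart) * W.T @ p + restart * p0
        converged = np.abs(p_next - p).sum() < tol
        p = p_next
        if converged:
            break
    return dict(zip(terms, p))

# Toy ontology (made-up term names, not real GO identifiers).
terms = ["root", "A", "B", "A1", "A2"]
children = {"root": ["A", "B"], "A": ["A1", "A2"]}

# A protein annotated only with term "A".
scores = downward_rwr(children, terms, annotated=["A"])
for term in sorted(scores, key=scores.get, reverse=True):
    print(f"{term}: {scores[term]:.3f}")
```

In this toy case the descendants of the annotated term receive non-zero scores, while the sibling branch `B` and the ancestor `root` receive none, which is the behavior one would expect from a walk restricted to downward edges.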

Conclusion: The empirical study shows that the semantic similarity between GO terms and the ontology hierarchy play important roles in predicting protein function. The proposed dRW and dRW-kNN can serve as tools for replenishing functions of partially annotated proteins.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551531
DOI: http://dx.doi.org/10.1186/s12859-015-0713-y


