The body of human genomic and proteomic evidence continues to grow at an ever-increasing rate, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and made quantitative associations of each to 4,333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, by prospective validation, and by manual evaluation against the biological literature. Functional-linkage networks (FLNs) were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented, alongside existing validated annotations, in a publicly accessible and searchable web interface.
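
To make the two approaches named in the abstract concrete, here is a minimal toy sketch in Python of how a guilt-by-profiling score and a guilt-by-association score for one Gene Ontology term might be blended. It is not the authors' method: the function names, the use of Jaccard similarity over feature sets, the weighted-neighbor fraction, and the linear mixing weight `alpha` are all illustrative assumptions.

```python
from typing import Dict, Set


def association_score(gene: str,
                      network: Dict[str, Dict[str, float]],
                      annotated: Set[str]) -> float:
    """Guilt-by-association (toy): weighted fraction of a gene's
    network neighbors that already carry the term."""
    neighbors = network.get(gene, {})
    total = sum(neighbors.values())
    if total == 0:
        return 0.0
    hits = sum(w for n, w in neighbors.items() if n in annotated)
    return hits / total


def profiling_score(gene: str,
                    features: Dict[str, Set[str]],
                    annotated: Set[str]) -> float:
    """Guilt-by-profiling (toy): mean Jaccard similarity between the
    gene's own feature set and those of genes annotated with the term."""
    own = features.get(gene, set())
    others = [g for g in annotated if g != gene and g in features]
    if not own or not others:
        return 0.0
    sims = []
    for g in others:
        union = own | features[g]
        sims.append(len(own & features[g]) / len(union) if union else 0.0)
    return sum(sims) / len(sims)


def combined_score(gene: str,
                   network: Dict[str, Dict[str, float]],
                   features: Dict[str, Set[str]],
                   annotated: Set[str],
                   alpha: float = 0.5) -> float:
    """Linear blend of the two scores; alpha is a hypothetical weight."""
    return (alpha * profiling_score(gene, features, annotated)
            + (1 - alpha) * association_score(gene, network, annotated))


if __name__ == "__main__":
    # Hypothetical inputs: G1 is known to carry the term, G3 is unannotated.
    network = {"G3": {"G1": 0.8, "G2": 0.2}}
    features = {"G1": {"kinase", "SH2"}, "G2": {"zinc_finger"}, "G3": {"kinase"}}
    print(combined_score("G3", network, features, annotated={"G1"}))
```

In this sketch the profiling score depends only on a gene's own features, while the association score depends only on its linked neighbors, which is the distinction the two terms draw.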


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3284330
DOI: http://dx.doi.org/10.1534/g3.111.000828
