A visualization of 3D proteome universe: mapping of a proteome ensemble into 3D space based on the protein-structure composition.

Mol Phylogenet Evol

Graduate School of Science and Engineering, Department of Functional Materials Science, Faculty of Engineering, Saitama University, 255 Shimo-okubo, Saitama 338-8570, Japan.

Published: November 2011

AI Article Synopsis

  • Developed a method to visualize proteomes of various species in a 3D vector space, defining a "proteome" as the collective proteins encoded in a genome.
  • Represented each species' proteome as a 1053-dimensional vector, capturing overall amounts and composition of SCOP Folds, and mapped these to 3D vectors while preserving critical properties.
  • Applied this mapping to 456 species from the GTOP database, achieving a high correlation (0.95-0.96) between angles of vectors before and after mapping, enhancing intuitive interpretation of proteomic data.

Article Abstract

To visualize a bird's-eye view of an ensemble of proteomes for various species, we recently developed a novel method of mapping a proteome ensemble into Three-Dimensional (3D) vector space. In this study, the "proteome" is defined as the entire set of all proteins encoded in a genome sequence, and these proteins were dealt with at the level of the SCOP Fold. First, we represented the proteome of a species s by a 1053-dimensional vector x(s), where its length ∣x(s)∣ represents the overall amount of all the SCOP Folds in the proteome, and its unit vector x(s)/∣x(s)∣ represents the relative composition of the SCOP Folds in the proteome and the size of the dimension, 1053, is the number of all possible Folds in the proteome ensemble given. Second, we mapped the vector x(s) to the 3D vector y(s), based on the two simple principles: (1) ∣y(s)∣=∣x(s)∣, and (2) the angle between y(s) and y(t) maximally correlates with the angle between x(s) and x(t). We applied to the mapping of a proteome ensemble for 456 species, which were retrieved from the Genomes TO Protein structures and functions (GTOP) database. As a result, we succeeded in the mapping in that the properties of the 1053-dimensional vectors were quantitatively conserved in the 3D vectors. Particularly, the angles between vectors before and after the mapping highly correlated with each other (correlation coefficients were 0.95-0.96). This new mapping method will allow researchers to intuitively interpret the visual information presented in the maps in a highly effective manner.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ympev.2011.06.020DOI Listing

Publication Analysis

Top Keywords

proteome ensemble
16
mapping proteome
12
folds proteome
12
scop folds
8
proteome
7
mapping
6
ensemble
5
vector
5
visualization proteome
4
proteome universe
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!