Many proteins work together with others in groups called complexes in order to achieve a specific function. Discovering protein complexes is important for understanding biological processes and predict protein functions in living organisms. Large-scale and throughput techniques have made possible to compile protein-protein interaction networks (PPI networks), which have been used in several computational approaches for detecting protein complexes. Those predictions might guide future biologic experimental research. Some approaches are topology-based, where highly connected proteins are predicted to be complexes; some propose different clustering algorithms using partitioning, overlaps among clusters for networks modeled with unweighted or weighted graphs; and others use density of clusters and information based on protein functionality. However, some schemes still require much processing time or the quality of their results can be improved. Furthermore, most of the results obtained with computational tools are not accompanied by an analysis of false positives. We propose an effective and efficient mining algorithm for discovering highly connected subgraphs, which is our base for defining protein complexes. Our representation is based on transforming the PPI network into a directed acyclic graph that reduces the number of represented edges and the search space for discovering subgraphs. Our approach considers weighted and unweighted PPI networks. We compare our best alternative using PPI networks from Saccharomyces cerevisiae (yeast) and Homo sapiens (human) with state-of-the-art approaches in terms of clustering, biological metrics and execution times, as well as three gold standards for yeast and two for human. Furthermore, we analyze false positive predicted complexes searching the PDBe (Protein Data Bank in Europe) database in order to identify matching protein complexes that have been purified and structurally characterized. Our analysis shows that more than 50 yeast protein complexes and more than 300 human protein complexes found to be false positives according to our prediction method, i.e., not described in the gold standard complex databases, in fact contain protein complexes that have been characterized structurally and documented in PDBe. We also found that some of these protein complexes have recently been classified as part of a Periodic Table of Protein Complexes. The latest version of our software is publicly available at http://doi.org/10.6084/m9.figshare.5297314.v1.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5609739PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0183460PLOS

Publication Analysis

Top Keywords

protein complexes
36
protein
13
complexes
12
ppi networks
12
false positive
8
highly connected
8
predicted complexes
8
false positives
8
pdbe protein
8
networks
5

Similar Publications

Blood clots are complex structures composed of blood cells and proteins held together by the structural framework provided by an insoluble fibrin network. Factor (F)XIII is a protransglutaminase essential for stabilizing the fibrin network. Activated FXIII(a) introduces novel covalent crosslinks within and between fibrin and other plasma and cellular proteins, and thereby promotes fibrin biochemical and mechanical integrity.

View Article and Find Full Text PDF

Phototherapy - which includes photothermal therapy (PTT) and photodynamic therapy (PDT) - has evoked interest as a promising cancer treatment modality on account of its noninvasiveness, spatiotemporal precision, and minimal side effects. C. Wang et al.

View Article and Find Full Text PDF

The preference for simple explanations, known as the parsimony principle, has long guided the development of scientific theories, hypotheses, and models. Yet recent years have seen a number of successes in employing highly complex models for scientific inquiry (e.g.

View Article and Find Full Text PDF

Functional gold nanoparticles have emerged as a cornerstone in targeted drug delivery, imaging, and biosensing. Their stability, distribution, and overall performance in biological systems are largely determined by their interactions with molecules in biological fluids as well as the biomolecular layers they acquire in complex environments. However, real-time tracking of how biomolecules attach to colloidal nanoparticles, a critical aspect for optimizing nanoparticle function, has proven to be experimentally challenging.

View Article and Find Full Text PDF

Dysregulated eIF4E-dependent translation is a central driver of tumorigenesis and therapy resistance. eIF4E binding proteins (4E-BP1/2/3) are major negative regulators of eIF4E-dependent translation that are inactivated in tumors through inhibitory phosphorylation or downregulation. Previous studies have linked PP2A phosphatase(s) to activation of 4E-BP1.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!