Background: Many biological processes are carried out by proteins interacting with each other in the form of protein complexes. However, large-scale detection of protein complexes remains constrained by experimental limitations. Computational detection of protein complexes, by applying clustering algorithms to the abundantly available protein-protein interaction (PPI) networks, is therefore an important alternative. Many current algorithms, however, overlook the importance of seed selection: seeds should expand into clusters that neither exclude important proteins nor include many noisy ones, while maintaining a high degree of functional homogeneity among the proteins in each detected complex.

Results: We designed a novel method called Probabilistic Local Walks (PLW), which clusters regions of a PPI network that have high functional similarity in order to find protein complex cores with high precision, and which runs efficiently in O(|V| log |V| + |E|) time. We devised a seed selection strategy that prioritises seeds with dense neighbourhoods. We also defined a topological measure, called common neighbour similarity, to estimate the functional similarity of two proteins from the number of their common neighbours.
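The two ideas above, scoring protein pairs by shared neighbours and preferring seeds with dense neighbourhoods, can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so everything here is an assumption made for illustration: the similarity is a Jaccard index over closed neighbourhoods, the density is the fraction of edges present among a protein's neighbours, and the function names (`build_graph`, `common_neighbour_similarity`, `neighbourhood_density`, `rank_seeds`) are hypothetical. This is not the PLW implementation.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (u, v) protein pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-interactions
            adj[u].add(v)
            adj[v].add(u)
    return dict(adj)


def common_neighbour_similarity(adj, u, v):
    """ASSUMED stand-in: Jaccard index over closed neighbourhoods.

    The paper's own definition may differ; this only captures the idea
    that more shared neighbours implies higher similarity.
    """
    nu = adj.get(u, set()) | {u}
    nv = adj.get(v, set()) | {v}
    return len(nu & nv) / len(nu | nv)


def neighbourhood_density(adj, u):
    """Fraction of possible edges that exist among u's neighbours."""
    nbrs = adj.get(u, set())
    k = len(nbrs)
    if k < 2:
        return 0.0
    # Each edge between two neighbours is counted from both ends.
    links = sum(len(adj[n] & nbrs) for n in nbrs) // 2
    return links / (k * (k - 1) / 2)


def rank_seeds(adj):
    """Order candidate seeds by density, then degree, then name."""
    return sorted(
        adj,
        key=lambda p: (-neighbourhood_density(adj, p), -len(adj[p]), p),
    )


# Toy network: a triangle A-B-C with a pendant protein D attached to C.
toy_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
toy_adj = build_graph(toy_edges)
```

On this toy network, proteins inside the triangle rank ahead of the pendant protein D, which has a single neighbour and therefore zero neighbourhood density.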

Conclusions: Our proposed PLW algorithm achieved the highest F-measure (recall and precision) when compared to 11 state-of-the-art methods on yeast protein interaction data, with an improvement of 16.7% over the next highest score. Our experiments also demonstrated that our seed selection strategy is able to increase algorithm precision when applied to three previous protein complex mining techniques.
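For reference, the F-measure combines precision and recall into a single score. The abstract does not state which variant was used; the formula below assumes the standard balanced form (the harmonic mean), with P denoting precision and R denoting recall.

```latex
F = \frac{2 \, P \, R}{P + R}
```

Under this assumed form, a method scores highly only when both precision and recall are high, since a low value in either one pulls the harmonic mean down.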

Availability: The software, datasets and predicted complexes are available at http://wonglkd.github.io/PLW.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852146
DOI: http://dx.doi.org/10.1186/1471-2164-14-S5-S15
