Molecular complex detection in protein interaction networks through reinforcement learning.

BMC Bioinformatics

Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas, Austin, TX, 78712, USA.

Published: August 2023

Background: Proteins often assemble into higher-order complexes to perform their biological functions. Protein-protein interactions (PPIs) are typically measured experimentally for pairs of proteins and summarized in a weighted PPI network, to which community detection algorithms can be applied to define higher-order protein complexes. Current methods include unsupervised and supervised approaches, which often assume that protein complexes manifest only as dense subgraphs. Supervised approaches focus on learning which subgraphs correspond to complexes rather than on how to find those subgraphs in the network, a search problem currently solved using heuristics. However, learning to walk trajectories on a network to identify protein complexes leads naturally to a reinforcement learning (RL) approach, a strategy not extensively explored for community detection. Here, we develop and evaluate an RL pipeline for community detection on weighted PPI networks to detect new protein complexes. The algorithm is trained to calculate the value of different subgraphs encountered while walking on the network to reconstruct known complexes. A distributed prediction algorithm then scales the RL pipeline to search for novel protein complexes on large PPI networks.
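
As a rough illustration of the idea of walking on a weighted network while scoring the subgraph built so far, the sketch below grows a candidate complex from a seed protein. It is not the authors' pipeline: the toy network, the `grow_complex` and `weighted_density` names, and the 0.5 stopping threshold are all assumptions made for this example, and a fixed weighted-density score stands in for the value function that the paper learns from known complexes.

```python
from itertools import combinations

# Hypothetical toy weighted PPI network: (protein, protein) -> interaction weight.
EDGES = {
    ("A", "B"): 0.9, ("A", "C"): 0.8, ("B", "C"): 0.85,
    ("C", "D"): 0.2,
    ("D", "E"): 0.9, ("D", "F"): 0.8, ("E", "F"): 0.95,
}

def build_adjacency(edges):
    """Turn an undirected weighted edge list into a dict-of-dicts adjacency map."""
    adj = {}
    for (u, v), w in edges.items():
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w
    return adj

def weighted_density(nodes, adj):
    """Stand-in value: sum of internal edge weights over the number of node pairs."""
    if len(nodes) < 2:
        return 0.0
    total = sum(adj[u].get(v, 0.0) for u, v in combinations(sorted(nodes), 2))
    pairs = len(nodes) * (len(nodes) - 1) / 2
    return total / pairs

def grow_complex(seed, adj, value=weighted_density, threshold=0.5):
    """Greedily add the neighbor giving the highest-valued subgraph.

    The walk terminates when no neighbor is left or when the best
    expansion would score below `threshold` (an assumed cutoff).
    Returns the node set and the order in which nodes were visited.
    """
    nodes = {seed}
    trajectory = [seed]
    while True:
        frontier = {n for u in nodes for n in adj[u]} - nodes
        if not frontier:
            break
        # sorted() makes tie-breaking deterministic.
        best = max(sorted(frontier), key=lambda n: value(nodes | {n}, adj))
        if value(nodes | {best}, adj) < threshold:
            break
        nodes.add(best)
        trajectory.append(best)
    return nodes, trajectory

adj = build_adjacency(EDGES)
complex_a, walk_a = grow_complex("A", adj)
complex_e, walk_e = grow_complex("E", adj)
print(sorted(complex_a), walk_a)
print(sorted(complex_e), walk_e)
```

On this toy network the weak C-D edge separates two dense triangles, so walks seeded on either side stop before crossing it. Keeping the trajectory alongside the node set mirrors the abstract's point that the walk itself is part of the output.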

Results: The reinforcement learning pipeline is applied to a human PPI network consisting of 8k proteins and 60k PPIs, yielding 1,157 protein complexes. The method demonstrated competitive accuracy with improved speed compared to previous algorithms. We highlight protein complexes involving C4orf19, C18orf21, and KIAA1522, which are currently minimally characterized. Additionally, the results suggest that TMC04 is a putative additional subunit of the KICSTOR complex and, together with 3D structural modeling, confirm the involvement of C15orf41 in a higher-order complex with HIRA, CDAN1, and ASF1A.

Conclusions: Reinforcement learning offers several distinct advantages for community detection, including scalability and knowledge of the walk trajectories defining those communities. Applied to currently available human protein interaction networks, this method achieved accuracy comparable to other algorithms with notable savings in computational time, which in turn led to clear predictions of protein function and interactions for several uncharacterized human proteins.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394916
DOI: http://dx.doi.org/10.1186/s12859-023-05425-7
