Correlations from structure and phylogeny combine constructively in the inference of protein partners from sequences.

PLoS Comput Biol

Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.

Published: May 2022

Inferring protein-protein interactions from sequences is an important task in computational biology. Recent methods based on Direct Coupling Analysis (DCA) or Mutual Information (MI) make it possible to find interaction partners among paralogs of two protein families. Does successful inference mainly rely on correlations from structural contacts, from phylogeny, or from both? Do these two types of signal combine constructively or hinder each other? To address these questions, we generate and analyze synthetic data produced using a minimal model that allows us to control the amounts of structural constraints and phylogeny. We show that correlations from these two sources combine constructively to increase the performance of partner inference by DCA or MI. Furthermore, signal from phylogeny can rescue partner inference when signal from contacts becomes less informative, including in the realistic case where inter-protein contacts are restricted to a small subset of sites. We also demonstrate that DCA-inferred couplings between non-contact pairs of sites improve partner inference in the presence of strong phylogeny, while deteriorating it otherwise. Moreover, restricting to non-contact pairs of sites preserves inference performance in the presence of strong phylogeny. In a natural data set, as well as in realistic synthetic data based on it, we find that non-contact pairs of sites contribute positively to partner inference performance, and that restricting to them preserves performance, which points to an important role of phylogeny.
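As a rough illustration of the kind of statistic MI-based approaches build on, the sketch below computes a plug-in estimate of the mutual information between one site in each of two paired sequence sets. It is not the authors' pipeline: the function name `mutual_information` and the toy columns are hypothetical, and the pseudocounts or reweighting that practical methods typically use are left out.

```python
import math
from collections import Counter


def mutual_information(col_a, col_b):
    """Plug-in mutual information (in bits) between two equal-length columns.

    col_a[i] and col_b[i] are the symbols observed at one site of protein A
    and one site of protein B in the i-th paired sequence (hypothetical input).
    """
    if len(col_a) != len(col_b):
        raise ValueError("columns must have the same length")
    n = len(col_a)
    if n == 0:
        raise ValueError("columns must not be empty")

    count_a = Counter(col_a)             # single-site counts for A
    count_b = Counter(col_b)             # single-site counts for B
    count_ab = Counter(zip(col_a, col_b))  # joint counts over pairs

    mi = 0.0
    for (a, b), n_ab in count_ab.items():
        p_ab = n_ab / n
        # p_ab / (p_a * p_b) simplifies to n_ab * n / (n_a * n_b)
        mi += p_ab * math.log2(n_ab * n / (count_a[a] * count_b[b]))
    return mi


# Toy columns (invented for illustration only).
correlated = mutual_information(["A", "A", "L", "L"], ["K", "K", "E", "E"])
independent = mutual_information(["A", "A", "L", "L"], ["K", "E", "K", "E"])
print(correlated, independent)
```

In this toy case the perfectly co-varying columns carry one bit of mutual information and the independent ones carry none. A high value says nothing by itself about whether the correlation comes from a structural contact or from shared phylogeny, which is the distinction the abstract addresses.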

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135348
DOI: http://dx.doi.org/10.1371/journal.pcbi.1010147
