Comparative study of the effectiveness and limitations of current methods for detecting sequence coevolution.

Bioinformatics

Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA, Department of Pharmacology, School of Medicine, Tsinghua University, Beijing 100084, China and Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel.

Published: June 2015

Motivation: With rapid accumulation of sequence data on several species, extracting rational and systematic information from multiple sequence alignments (MSAs) is becoming increasingly important. Currently, there is a plethora of computational methods for investigating coupled evolutionary changes in pairs of positions along the amino acid sequence, and making inferences on structure and function. Yet, the significance of coevolution signals remains to be established. Also, a large number of false positives (FPs) arise from insufficient MSA size, phylogenetic background and indirect couplings.

Results: Here, a set of 16 pairs of non-interacting proteins is thoroughly examined to assess the effectiveness and limitations of different methods. The analysis shows that recent computationally expensive methods designed to remove biases from indirect couplings outperform others in detecting tertiary structural contacts as well as eliminating intermolecular FPs; whereas traditional methods such as mutual information benefit from refinements such as shuffling, while being highly efficient. Computations repeated with 2,330 pairs of protein families from the Negatome database corroborated these results. Finally, using a training dataset of 162 families of proteins, we propose a combined method that outperforms existing individual methods. Overall, the study provides simple guidelines towards the choice of suitable methods and strategies based on available MSA size and computing resources.

Availability And Implementation: Software is freely available through the Evol component of ProDy API.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481699PMC
http://dx.doi.org/10.1093/bioinformatics/btv103DOI Listing

Publication Analysis

Top Keywords

effectiveness limitations
8
msa size
8
methods
7
comparative study
4
study effectiveness
4
limitations current
4
current methods
4
methods detecting
4
sequence
4
detecting sequence
4

Similar Publications

Introduction: Non-Celiac Gluten Sensitivity (NCGS) is a common disorder characterized by symptoms resembling those of irritable bowel syndrome. In recent years there has been progress in the understanding of the pathogenic pathways and data suggest that NCGS has a distinct immunological profile that differs from celiac disease (CeD). This has fostered the search for a specific biomarker of NCGS.

View Article and Find Full Text PDF

Purpose: Prior sperm DNA fragmentation index (DFI) thresholds for diagnosing male infertility and predicting assisted reproduction technology (ART) outcomes fluctuated between 15 and 30%, with no agreed standard. This study aimed to evaluate the impact of the sperm DFI on early embryonic development during ART treatments and establish appropriate DFI cut-off values.

Methods: Retrospectively analyzed 913 couple's ART cycles from 2021 to 2022, encompassing 1,476 IVF and 295 ICSI cycles, following strict criteria.

View Article and Find Full Text PDF

Background: Innovation in crop establishment is crucial for wheat productivity in drought-prone climates. Seedling establishment, the first stage of crop productivity, relies heavily on root and coleoptile system architecture for effective soil water and nutrient acquisition, particularly in regions practicing deep planting. Root phenotyping methods that quickly determine coleoptile lengths are vital for breeding studies.

View Article and Find Full Text PDF

Background: Classic congenital adrenal hyperplasia, primarily due to 21-hydroxylase deficiency, leads to impaired cortisol and aldosterone production and excess adrenal androgens. Lifelong glucocorticoid therapy is required, often necessitating supraphysiological doses in youth to manage androgen excess and growth acceleration. These patients experience higher obesity rates, hypertension, and glucose metabolism issues, complicating long-term health management.

View Article and Find Full Text PDF

The incidence of inflow and drain pain and associated risk factors for patients on peritoneal dialysis.

BMC Nephrol

January 2025

Department of Nephrology, The First Affiliated Hospital, Sun Yat-sen University, 58th, Zhongshan Road II, Guangzhou, 510080, Guangdong, PR China.

Background: The high prevalence and prolonged duration of inflow pain and drain pain experienced by peritoneal dialysis (PD) patients following PD catheter implantation impact their quality of life. However, there is limited data on the frequency and predisposing factors of these pains in the Chinese population undergoing peritoneal dialysis.

Methods: This study encompassed individuals who underwent peritoneal dialysis catheter implantation at our institution from September 1, 2023, to March 31, 2024.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!