HMMER Cut-off Threshold Tool (HMMERCTTER): Supervised classification of superfamily protein sequences with a reliable cut-off threshold.

PLoS One

Instituto de Investigaciones Biológicas (IIB-CONICET-UNMdP), Facultad de Ciencias Exactas y Naturales, Universidad Nacional de Mar del Plata, Mar del Plata, Argentina.

Published: June 2018

Background: Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific.

Results: HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained.

Conclusions: HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: https://github.com/BBCMdP/HMMERCTTER.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5868777PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0193757PLOS

Publication Analysis

Top Keywords

cut-off threshold
16
function assignation
12
hmmer cut-off
8
threshold tool
8
tool hmmerctter
8
reliable cut-off
8
protein superfamilies
8
sequence function
8
high quality
8
hmmer profiles
8

Similar Publications

Purpose: Prior sperm DNA fragmentation index (DFI) thresholds for diagnosing male infertility and predicting assisted reproduction technology (ART) outcomes fluctuated between 15 and 30%, with no agreed standard. This study aimed to evaluate the impact of the sperm DFI on early embryonic development during ART treatments and establish appropriate DFI cut-off values.

Methods: Retrospectively analyzed 913 couple's ART cycles from 2021 to 2022, encompassing 1,476 IVF and 295 ICSI cycles, following strict criteria.

View Article and Find Full Text PDF

Background & Aims: The triglyceride-glucose index (TyG) and triglyceride-glucose body mass index (TyG-BMI) have been identified as potential predictive factors for metabolic dysfunction-associated steatotic liver disease (MASLD). However, they do not include high density lipoprotein (HDL-C), which is closely related to lipid metabolism. Furthermore, there is a lack of comprehensive and longitudinal data to determine the cut-off points for different degrees of hepatic steatosis and liver fibrosis in MASLD.

View Article and Find Full Text PDF

Background: Psoriasis is a chronic disease with a prevalence of 3% in the general population. The high prevalence of psoriasis has prompted the study of its comorbidities in recent decades. However, no studies have ever analyzed comorbidity patterns including all chronic diseases in psoriatic patients.

View Article and Find Full Text PDF

Aim: To investigate whether the risk of hypoglycemia is associated with residual β-cell function in adults with type 1 diabetes (T1D).

Methods: This cross-sectional study included 61 subjects with T1D of <15 years' duration using continuous glucose monitoring (CGM). Random C-peptide levels were compared between participants with time below range (TBR) ≥3 % (n = 15) and TBR <3 % (n = 45).

View Article and Find Full Text PDF

Aims: We aimed to establish one-minute sit-to-stand test (1-min STST) cut-off values that align with the guideline-recommended six-minute walk test (6MWT) thresholds (165m and 440m) for one-year mortality risk stratification in pulmonary hypertension (PH) patients. Furthermore, we aimed to compare clinical characteristics and long-term mortality among patients stratified by these proposed 1-min STST cut-offs.

Methods: All patients performed the 1-min STST and 6MWT.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!