Motivation: Phylogenetic profiling methods can achieve good accuracy in predicting protein-protein interactions, especially in prokaryotes. Recent studies have shown that the choice of reference taxa (RT) is critical for accurate prediction, but with more than 2500 fully sequenced taxa publicly available, identifying the most-informative RT is becoming increasingly difficult. Previous studies on the selection of RT have provided guidelines for manual taxon selection, and for eliminating closely related taxa. However, no general strategy for automatic selection of RT is currently available.

Results: We present three novel methods for automating the selection of RT, using machine learning based on known protein-protein interaction networks. One of these methods in particular, Tree-Based Search, yields greatly improved prediction accuracies. We further show that different methods for constituting phylogenetic profiles often require very different RT sets to support high prediction accuracy.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btr720DOI Listing

Publication Analysis

Top Keywords

automatic selection
8
reference taxa
8
protein-protein interaction
8
phylogenetic profiling
8
selection reference
4
taxa
4
taxa protein-protein
4
prediction
4
interaction prediction
4
prediction phylogenetic
4

Similar Publications

We aimed to build a robust classifier for the MGMT methylation status of glioblastoma in multiparametric MRI. We focused on multi-habitat deep image descriptors as our basic focus. A subset of the BRATS 2021 MGMT methylation dataset containing both MGMT class labels and segmentation masks was used.

View Article and Find Full Text PDF

The range of sensor technologies for structural health monitoring (SHM) systems is expanding as the need for ongoing structural monitoring increases. In such a case, damage to the monitored structure elements is detected using an integrated network of sensors operating in real-time or periodically in frequent time stamps. This paper briefly introduces a new type of sensor, called a Customized Crack Propagation Sensor (CCPS), which is an alternative for crack gauges, but with enhanced functional features and customizability.

View Article and Find Full Text PDF

Sleep posture is a key factor in assessing sleep quality, especially for individuals with Obstructive Sleep Apnea (OSA), where the sleeping position directly affects breathing patterns: the side position alleviates symptoms, while the supine position exacerbates them. Accurate detection of sleep posture is essential in assessing and improving sleep quality. Automatic sleep posture detection systems, both wearable and non-wearable, have been developed to assess sleep quality.

View Article and Find Full Text PDF

Assessing vines' vigour is essential for vineyard management and automatization of viticulture machines, including shaking adjustments of berry harvesters during grape harvest or leaf pruning applications. To address these problems, based on a standardized growth class assessment, labeled ground truth data of precisely located grapevines were predicted with specifically selected Machine Learning (ML) classifiers (Random Forest Classifier (RFC), Support Vector Machines (SVM)), utilizing multispectral UAV (Unmanned Aerial Vehicle) sensor data. The input features for ML model training comprise spectral, structural, and texture feature types generated from multispectral orthomosaics (spectral features), Digital Terrain and Surface Models (DTM/DSM- structural features), and Gray-Level Co-occurrence Matrix (GLCM) calculations (texture features).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!