TUnA: an uncertainty-aware transformer model for sequence-based protein-protein interaction prediction.

Brief Bioinform

Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0359, United States.

Published: July 2024

Protein-protein interactions (PPIs) are important for many biological processes, but predicting them from sequence data remains challenging. Existing deep learning models often cannot generalize to proteins not present in the training set and do not provide uncertainty estimates for their predictions. To address these limitations, we present TUnA, a Transformer-based uncertainty-aware model for PPI prediction. TUnA uses ESM-2 embeddings with Transformer encoders and incorporates a Spectral-normalized Neural Gaussian Process. TUnA achieves state-of-the-art performance and, importantly, evaluates uncertainty for unseen sequences. We demonstrate that TUnA's uncertainty estimates can effectively identify the most reliable predictions, significantly reducing false positives. This capability is crucial in bridging the gap between computational predictions and experimental validation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11269822PMC
http://dx.doi.org/10.1093/bib/bbae359DOI Listing

Publication Analysis

Top Keywords

uncertainty estimates
8
tuna
4
tuna uncertainty-aware
4
uncertainty-aware transformer
4
transformer model
4
model sequence-based
4
sequence-based protein-protein
4
protein-protein interaction
4
interaction prediction
4
prediction protein-protein
4

Similar Publications

Ascertaining the Environmental Advantages of Pavement Designs Incorporating Recycled Content through a Parametric and Probabilistic Approach.

Environ Sci Technol

January 2025

College of Environmental Science and Engineering, Nankai University, 38 Tongyan Road, Jinnan District, 300350 Tianjin, China.

Reclaimed asphalt pavement (RAP) is a widely used end-of-life (EoL) material in asphalt pavements to increase the material circularity. However, the performance loss due to using RAP in the asphalt binder layer often requires a thicker layer, leading to additional material usage, energy consumption, and transportation effort. In this study, we developed a parametric and probabilistic life cycle assessment (LCA) framework to robustly compare various pavement designs incorporating recycled materials.

View Article and Find Full Text PDF

Background: Direct oral anticoagulants (DOACs) can interfere with coagulation analyses, causing erroneous results such as false-positive lupus anticoagulant and false-normal antithrombin, threatening patient safety when overlooked. A test using a prothrombin time quotient method to detect DOAC presence in plasma samples is now commercially available, the MRX PT DOAC, with the result expressed as Clot Time Ratio (CTR).

Objectives: Evaluate the ability of MRX PT DOAC to identify interfering apixaban or rivaroxaban concentrations, identify non-interfering or interfering patient samples, and detect whether a patient is on DOAC treatment.

View Article and Find Full Text PDF

The partitioning of photosynthate among various forest carbon pools is a key process regulating long-term carbon sequestration, with allocation to aboveground woody biomass carbon (AGBC) in particular playing an outsized role in the global carbon cycle due to its slow residence time. However, directly estimating the fraction of gross primary productivity (GPP) that goes to AGBC has historically been difficult and time-consuming, leaving us with persistent uncertainties. We used an extensive dataset of tree-ring chronologies co-located at flux towers to assess the coupling between AGBC and GPP, calculate the fraction of fixed carbon that is allocated to AGBC, and understand the drivers of variability in this fraction.

View Article and Find Full Text PDF

Cost-effectiveness of data driven personalised antibiotic dosing in critically ill patients with sepsis or septic shock.

J Clin Monit Comput

January 2025

Department of Health Sciences, Faculty of Science, Vrije Universiteit Amsterdam, Amsterdam Public Health research institute, Van der Boechorststraat 7, Amsterdam, 1081 BT, the Netherlands.

Purpose: This study provides an economic evaluation of bedside, data-driven, and model-informed precision dosing of antibiotics in comparison with usual care among critically ill patients with sepsis or septic shock.

Methods: This economic evaluation was conducted alongside an AutoKinetics randomized controlled trial. Effect measures included quality-adjusted life years (QALYs), mortality and pharmacokinetic target attainment.

View Article and Find Full Text PDF

In gas-to-methanol processes, optimizing multi-energy systems is a critical challenge toward efficient energy allocation. This paper proposes an entropy-based stochastic optimization method for a multi-energy system in a gas-to-methanol process, aiming to achieve optimal allocation of gas, steam, and electricity to ensure executability under modeling uncertainties. First, mechanistic models are developed for major chemical equipments, including the desulfurization, steam boilers, air separation, and syngas compressors.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!