Teaching AI to speak protein.

Curr Opin Struct Biol

TUM (Technical University of Munich), School of Computation, Information and Technology (CIT), Faculty of Informatics, Chair of Bioinformatics & Computational Biology - i12, Boltzmannstr. 3, 85748 Garching, Munich, Germany; Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, 85748 Garching, Munich, Germany; TUM School of Life Sciences Weihenstephan (WZW), Alte Akademie 8, Freising, Germany.

Published: April 2025

Large Language Models for proteins, namely protein Language Models (pLMs), have begun to provide an important alternative to capturing the information encoded in a protein sequence in computers. Arguably, pLMs have advanced importantly to understanding aspects of the language of life as written in proteins, and through this understanding, they are becoming an increasingly powerful means of advancing protein prediction, e.g., in the prediction of molecular function as expressed by identifying binding residues or variant effects. While benefitting from the same technology, protein structure prediction remains one of the few applications for which only using pLM embeddings from single sequences appears not to improve over or match the state-of-the-art. Fine-tuning foundation pLMs enhances efficiency and accuracy of solutions, in particular in cases with few experimental annotations. pLMs facilitate the integration of computational and experimental biology, of AI and wet-lab, in particular toward a new era of protein design.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.sbi.2025.102986DOI Listing

Publication Analysis

Top Keywords

language models
8
protein
6
teaching speak
4
speak protein
4
protein large
4
large language
4
models proteins
4
proteins protein
4
protein language
4
plms
4

Similar Publications

ObjectiveWe summarize the key steps to develop and assess an innovative online, evidence-based tool that supports shared decision-making in routine care to personalize antidepressant treatment in adults with depression. This PETRUSHKA tool is part of the PETRUSHKA trial (Personalize antidEpressant Treatment foR Unipolar depreSsion combining individual cHoices, risKs, and big datA).MethodsThe PETRUSHKA tool: (a) is based on prediction models, which use a combination of advanced analytics, i.

View Article and Find Full Text PDF

Background: Surgical resection is the primary treatment modality for colorectal cancer. Prehabilitation is about enhancing the patient's physiological capacity preoperatively to reduce the risk of treatment-related complications. Clear definitions of the modality, content, and duration of prehabilitation, including its components such as nutrition, exercise, and psychological support, are lacking.

View Article and Find Full Text PDF

Introduction: Large language models (LLMs) are becoming ubiquitous and widely implemented. LLMs could also be used for diagnosis and treatment. National antibiotic prescribing guidelines are customized and informed by local laboratory data on antimicrobial resistance.

View Article and Find Full Text PDF

Predicting differentially methylated cytosines in TET and DNMT3 knockout mutants via a large language model.

Brief Bioinform

March 2025

Department of Computer Science and Engineering, University of California, Riverside, 900 University Ave, Riverside, CA 92521, United States.

DNA methylation is an epigenetic marker that directly or indirectly regulates several critical cellular processes. While cytosines in mammalian genomes generally maintain stable methylation patterns over time, other cytosines that belong to specific regulatory regions, such as promoters and enhancers, can exhibit dynamic changes. These changes in methylation are driven by a complex cellular machinery, in which the enzymes DNMT3 and TET play key roles.

View Article and Find Full Text PDF

Background And Aims: Better understanding the challenges faced by patients on medications for opioid use disorder (MOUD), including methadone and buprenorphine, is critical to increasing their use/retention. Social media platforms such as Reddit offer a space for patients to share their experiences with medications. We aimed to identify and characterize challenges faced by patients taking MOUD through analysis of discussions from the r/Methadone and r/suboxone subreddits.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!