Publications by Maria Littmann

Publications by authors named "Maria Littmann"

Page 1 of 1

AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms.

Nicola Bordin Ian Sillitoe Vamsi Nallapareddy Clemens Rauer Su Datt Lam Maria Littmann

Commun Biol

February 2023

Deep-learning (DL) methods like DeepMind's AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification.

View Article and Find Full Text PDF

CATHe: detection of remote homologues for CATH superfamilies using embeddings from protein language models.

Vamsi Nallapareddy Nicola Bordin Ian Sillitoe Michael Heinzinger Maria Littmann

Bioinformatics

January 2023

Motivation: CATH is a protein domain classification resource that exploits an automated workflow of structure and sequence comparison alongside expert manual curation to construct a hierarchical classification of evolutionary and structural relationships. The aim of this study was to develop algorithms for detecting remote homologues missed by state-of-the-art hidden Markov model (HMM)-based approaches. The method developed (CATHe) combines a neural network with sequence representations obtained from protein language models.

View Article and Find Full Text PDF

Novel machine learning approaches revolutionize protein knowledge.

Nicola Bordin Christian Dallago Michael Heinzinger Stephanie Kim Maria Littmann

Trends Biochem Sci

April 2023

Breakthrough methods in machine learning (ML), protein structure prediction, and novel ultrafast structural aligners are revolutionizing structural biology. Obtaining accurate models of proteins and annotating their functions on a large scale is no longer limited by time and resources. The most recent method to be top ranked by the Critical Assessment of Structure Prediction (CASP) assessment, AlphaFold 2 (AF2), is capable of building structural models with an accuracy comparable to that of experimental structures.

View Article and Find Full Text PDF

LambdaPP: Fast and accessible protein-specific phenotype predictions.

Tobias Olenyi Céline Marquet Michael Heinzinger Benjamin Kröger Tiha Nikolova Maria Littmann

Protein Sci

January 2023

The availability of accurate and fast artificial intelligence (AI) solutions predicting aspects of proteins are revolutionizing experimental and computational molecular biology. The webserver LambdaPP aspires to supersede PredictProtein, the first internet server making AI protein predictions available in 1992. Given a protein sequence as input, LambdaPP provides easily accessible visualizations of protein 3D structure, along with predictions at the protein level (GeneOntology, subcellular location), and the residue level (binding to metal ions, small molecules, and nucleotides; conservation; intrinsic disorder; secondary structure; alpha-helical and beta-barrel transmembrane segments; signal-peptides; variant effect) in seconds.

View Article and Find Full Text PDF

Contrastive learning on protein embeddings enlightens midnight zone.

Michael Heinzinger Maria Littmann Ian Sillitoe Nicola Bordin Christine Orengo

NAR Genom Bioinform

June 2022

Experimental structures are leveraged through multiple sequence alignments, or more generally through homology-based inference (HBI), facilitating the transfer of information from a protein with known annotation to a query without any annotation. A recent alternative expands the concept of HBI from sequence-distance lookup to embedding-based annotation transfer (EAT). These embeddings are derived from protein Language Models (pLMs).

View Article and Find Full Text PDF

Protein embeddings and deep learning predict binding residues for various ligand classes.

Maria Littmann Michael Heinzinger Christian Dallago Konstantin Weissenow Burkhard Rost

Sci Rep

December 2021

One important aspect of protein function is the binding of proteins to ligands, including small molecules, metal ions, and macromolecules such as DNA or RNA. Despite decades of experimental progress many binding sites remain obscure. Here, we proposed bindEmbed21, a method predicting whether a protein residue binds to metal ions, nucleic acids, or small molecules.

View Article and Find Full Text PDF

PredictProtein - Predicting Protein Structure and Function for 29 Years.

Michael Bernhofer Christian Dallago Tim Karl Venkata Satagopam Michael Heinzinger Maria Littmann

Nucleic Acids Res

July 2021

Since 1992 PredictProtein (https://predictprotein.org) is a one-stop online resource for protein sequence analysis with its main site hosted at the Luxembourg Centre for Systems Biomedicine (LCSB) and queried monthly by over 3,000 users in 2020. PredictProtein was the first Internet server for protein predictions.

View Article and Find Full Text PDF

Clustering FunFams using sequence embeddings improves EC purity.

Maria Littmann Nicola Bordin Michael Heinzinger Konstantin Schütze Christian Dallago

Bioinformatics

October 2021

Motivation: Classifying proteins into functional families can improve our understanding of protein function and can allow transferring annotations within one family. For this, functional families need to be 'pure', i.e.

View Article and Find Full Text PDF

Learned Embeddings from Deep Learning to Visualize and Predict Protein Sets.

Christian Dallago Konstantin Schütze Michael Heinzinger Tobias Olenyi Maria Littmann

Curr Protoc

May 2021

Models from machine learning (ML) or artificial intelligence (AI) increasingly assist in guiding experimental design and decision making in molecular biology and medicine. Recently, Language Models (LMs) have been adapted from Natural Language Processing (NLP) to encode the implicit language written in protein sequences. Protein LMs show enormous potential in generating descriptive representations (embeddings) for proteins from just their sequences, in a fraction of the time with respect to previous approaches, yet with comparable or improved predictive ability.

View Article and Find Full Text PDF

Embeddings from deep learning transfer GO annotations beyond homology.

Maria Littmann Michael Heinzinger Christian Dallago Tobias Olenyi Burkhard Rost

Sci Rep

January 2021

Knowing protein function is crucial to advance molecular and medical biology, yet experimental function annotations through the Gene Ontology (GO) exist for fewer than 0.5% of all known proteins. Computational methods bridge this sequence-annotation gap typically through homology-based annotation transfer by identifying sequence-similar proteins with known function or through prediction methods using evolutionary information.

View Article and Find Full Text PDF

Correction to: Detailed prediction of protein sub-nuclear localization.

Maria Littmann Tatyana Goldberg Sebastian Seitz Mikael Bodén Burkhard Rost

BMC Bioinformatics

December 2019

Following publication of the original article [1], the author reported that an incorrect figure has been published as Figure 2. The correct Figure 2 is shown below.

View Article and Find Full Text PDF

FunFam protein families improve residue level molecular function prediction.

Linus Scheibenreif Maria Littmann Christine Orengo Burkhard Rost

BMC Bioinformatics

July 2019

Background: The CATH database provides a hierarchical classification of protein domain structures including a sub-classification of superfamilies into functional families (FunFams). We analyzed the similarity of binding site annotations in these FunFams and incorporated FunFams into the prediction of protein binding residues.

Results: FunFam members agreed, on average, in 36.

View Article and Find Full Text PDF

Detailed prediction of protein sub-nuclear localization.

Maria Littmann Tatyana Goldberg Sebastian Seitz Mikael Bodén Burkhard Rost

BMC Bioinformatics

April 2019

Background: Sub-nuclear structures or locations are associated with various nuclear processes. Proteins localized in these substructures are important to understand the interior nuclear mechanisms. Despite advances in high-throughput methods, experimental protein annotations remain limited.

View Article and Find Full Text PDF