Publications by Weigt M

Publications by authors named "Weigt M"

Page 1 of 4

Author Correction: Understanding epistatic networks in the B1 β-lactamases through coevolutionary statistical modeling and deep mutational scanning.

J Z Chen M Bisardi D Lee S Cotogno F Zamponi

Nat Commun

November 2024

View Article and Find Full Text PDF

Detailed Images of Deep Brain Stimulation Leads Using Micro-CT.

Thomas Billoud Peter Christoph Reinacher Moritz Weigt Dominik von Elverfeldt Theo Demerath

Stereotact Funct Neurosurg

November 2024

Introduction: One of the challenges in directional deep brain stimulation (DBS) is to determine the orientation of implanted electrodes relative to targeted regions. Post-operative images must be aligned with a model of the implanted lead, usually a computer-based model provided by the manufacturer. This paper shows that models can alternatively be obtained by capturing images of individual leads using micro-CT, a high-resolution CT technique.

View Article and Find Full Text PDF

Understanding epistatic networks in the B1 β-lactamases through coevolutionary statistical modeling and deep mutational scanning.

J Z Chen M Bisardi D Lee S Cotogno F Zamponi

Nat Commun

September 2024

Throughout evolution, protein families undergo substantial sequence divergence while preserving structure and function. Although most mutations are deleterious, evolution can explore sequence space via epistatic networks of intramolecular interactions that alleviate the harmful mutations. However, comprehensive analysis of such epistatic networks across protein families remains limited.

View Article and Find Full Text PDF

Emergent time scales of epistasis in protein evolution.

Leonardo Di Bari Matteo Bisardi Sabrina Cotogno Martin Weigt Francesco Zamponi

Proc Natl Acad Sci U S A

October 2024

We introduce a data-driven epistatic model of protein evolution, capable of generating evolutionary trajectories spanning very different time scales reaching from individual mutations to diverged homologs. Our in silico evolution encompasses random nucleotide mutations, insertions and deletions, and models selection using a fitness landscape, which is inferred via a generative probabilistic model for protein families. We show that the proposed framework accurately reproduces the sequence statistics of both short-time (experimental) and long-time (natural) protein evolution, suggesting applicability also to relatively data-poor intermediate evolutionary time scales, which are currently inaccessible to evolution experiments.

View Article and Find Full Text PDF

Generating Artificial Ribozymes Using Sparse Coevolutionary Models.

Francesco Calvanese Martin Weigt Philippe Nghe

Methods Mol Biol

September 2024

RNA ribozyme (Walter Engelke, Biologist (London, England) 49:199-203, 2002) datasets typically contain from a few hundred to a few thousand naturally occurring sequences. However, the potential sequence space of RNA is huge. For example, the number of possible RNA sequences of length 150 nucleotides is approximately , a figure that far surpasses the estimated number of atoms in the known universe, which is around .

View Article and Find Full Text PDF

TULIP: A transformer-based unsupervised language model for interacting peptides and T cell receptors that generalizes to unseen epitopes.

Barthelemy Meynard-Piganeau Christoph Feinauer Martin Weigt Aleksandra M Walczak Thierry Mora

Proc Natl Acad Sci U S A

June 2024

Article Synopsis

Scientists want to predict how T cell receptors (TCRs) connect with their targets to help create better medicines for fighting diseases.
Current methods struggle because they don’t have enough good data and can be biased based on how training data is chosen.
The new model called TULIP uses incomplete data and a special kind of learning to understand these connections better, showing it can perform well even with new information.

View Article and Find Full Text PDF

Towards parsimonious generative modeling of RNA families.

Francesco Calvanese Camille N Lambert Philippe Nghe Francesco Zamponi Martin Weigt

Nucleic Acids Res

June 2024

Generative probabilistic models emerge as a new paradigm in data-driven, evolution-informed design of biomolecular sequences. This paper introduces a novel approach, called Edge Activation Direct Coupling Analysis (eaDCA), tailored to the characteristics of RNA sequences, with a strong emphasis on simplicity, efficiency, and interpretability. eaDCA explicitly constructs sparse coevolutionary models for RNA families, achieving performance levels comparable to more complex methods while utilizing a significantly lower number of parameters.

View Article and Find Full Text PDF

Pilot study on high-resolution radiological methods for the analysis of cerebrospinal fluid (CSF) shunt valves.

Martin P Pichotka Moritz Weigt Mukesch J Shah Maximilian F Russe Thomas Stein

Z Med Phys

December 2023

Article Synopsis

The study addresses high failure rates in cerebrospinal fluid (CSF) shunts, particularly focusing on malfunctioning regulating valves, and aims to improve understanding and analysis of valve failures to minimize unnecessary surgeries.
It introduces innovative radiological techniques, such as low-dose contrast-enhanced radiography and machine learning, to diagnose valve obstructions more accurately and efficiently.
The results indicate that these advanced imaging methods and machine learning can effectively analyze fluid transport and identify obstruction mechanisms, paving the way for improved clinical applications and potential repair methods for malfunctioning valves.

View Article and Find Full Text PDF

In Vivo Metabolic Imaging of [1- C]Pyruvate-d Hyperpolarized By Reversible Exchange With Parahydrogen.

Henri de Maissin Philipp R Groß Obaid Mohiuddin Moritz Weigt Luca Nagel

Angew Chem Int Ed Engl

September 2023

Metabolic magnetic resonance imaging (MRI) using hyperpolarized (HP) pyruvate is becoming a non-invasive technique for diagnosing, staging, and monitoring response to treatment in cancer and other diseases. The clinically established method for producing HP pyruvate, dissolution dynamic nuclear polarization, however, is rather complex and slow. Signal Amplification By Reversible Exchange (SABRE) is an ultra-fast and low-cost method based on fast chemical exchange.

View Article and Find Full Text PDF

Generating interacting protein sequences using domain-to-domain translation.

Barthelemy Meynard-Piganeau Caterina Fabbri Martin Weigt Andrea Pagnani Christoph Feinauer

Bioinformatics

July 2023

Motivation: Being able to artificially design novel proteins of desired function is pivotal in many biological and biomedical applications. Generative statistical modeling has recently emerged as a new paradigm for designing amino acid sequences, including in particular models and embedding methods borrowed from natural language processing (NLP). However, most approaches target single proteins or protein domains, and do not take into account any functional specificity or interaction with the context.

View Article and Find Full Text PDF

Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins.

Carlos A Gandarilla-Pérez Sergio Pinilla Anne-Florence Bitbol Martin Weigt

PLoS Comput Biol

March 2023

Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species.

View Article and Find Full Text PDF

Structure and Function of a Dehydrating Condensation Domain in Nonribosomal Peptide Biosynthesis.

Jon B Patteson Camille Marie Fortinez Andrew T Putz Juan Rodriguez-Rivas L Henry Bryant

J Am Chem Soc

August 2022

Dehydroamino acids are important structural motifs and biosynthetic intermediates for natural products. Many bioactive natural products of nonribosomal origin contain dehydroamino acids; however, the biosynthesis of dehydroamino acids in most nonribosomal peptides is not well understood. Here, we provide biochemical and bioinformatic evidence in support of the role of a unique class of condensation domains in dehydration (C).

View Article and Find Full Text PDF

Deciphering polymorphism in 61,157 Escherichia coli genomes via epistatic sequence landscapes.

Lucile Vigué Giancarlo Croce Marie Petitjean Etienne Ruppé Olivier Tenaillon

Nat Commun

July 2022

Characterizing the effect of mutations is key to understand the evolution of protein sequences and to separate neutral amino-acid changes from deleterious ones. Epistatic interactions between residues can lead to a context dependence of mutation effects. Context dependence constrains the amino-acid changes that can contribute to polymorphism in the short term, and the ones that can accumulate between species in the long term.

View Article and Find Full Text PDF

Author Correction: Efficient generative modeling of protein sequences using simple autoregressive models.

Jeanne Trinquier Guido Uguzzoni Andrea Pagnani Francesco Zamponi Martin Weigt

Nat Commun

April 2022

View Article and Find Full Text PDF

Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes.

Juan Rodriguez-Rivas Giancarlo Croce Maureen Muscat Martin Weigt

Proc Natl Acad Sci U S A

January 2022

The emergence of new variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major concern given their potential impact on the transmissibility and pathogenicity of the virus as well as the efficacy of therapeutic interventions. Here, we predict the mutability of all positions in SARS-CoV-2 protein domains to forecast the appearance of unseen variants. Using sequence data from other coronaviruses, preexisting to SARS-CoV-2, we build statistical models that not only capture amino acid conservation but also more complex patterns resulting from epistasis.

View Article and Find Full Text PDF

Modeling Sequence-Space Exploration and Emergence of Epistatic Signals in Protein Evolution.

Matteo Bisardi Juan Rodriguez-Rivas Francesco Zamponi Martin Weigt

Mol Biol Evol

January 2022

During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here, we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein evolution. These models predict quantitatively important features of experimentally evolved sequence libraries, like fitness distributions and position-specific mutational spectra.

View Article and Find Full Text PDF

adabmDCA: adaptive Boltzmann machine learning for biological sequences.

Anna Paola Muntoni Andrea Pagnani Martin Weigt Francesco Zamponi

BMC Bioinformatics

October 2021

Background: Boltzmann machines are energy-based models that have been shown to provide an accurate statistical description of domains of evolutionary-related protein and RNA families. They are parametrized in terms of local biases accounting for residue conservation, and pairwise terms to model epistatic coevolution between residues. From the model parameters, it is possible to extract an accurate prediction of the three-dimensional contact map of the target domain.

View Article and Find Full Text PDF

Efficient generative modeling of protein sequences using simple autoregressive models.

Jeanne Trinquier Guido Uguzzoni Andrea Pagnani Francesco Zamponi Martin Weigt

Nat Commun

October 2021

Generative models emerge as promising candidates for novel sequence-data driven approaches to protein design, and for the extraction of structural and functional information about proteins deeply hidden in rapidly growing sequence databases. Here we propose simple autoregressive models as highly accurate but computationally efficient generative sequence models. We show that they perform similarly to existing approaches based on Boltzmann machines or deep generative models, but at a substantially lower computational cost (by a factor between 10 and 10).

View Article and Find Full Text PDF

Sparse generative modeling via parameter reduction of Boltzmann machines: Application to protein-sequence families.

Pierre Barrat-Charlaix Anna Paola Muntoni Kai Shimagaki Martin Weigt Francesco Zamponi

Phys Rev E

August 2021

Boltzmann machines (BMs) are widely used as generative models. For example, pairwise Potts models (PMs), which are instances of the BM class, provide accurate statistical models of families of evolutionarily related protein sequences. Their parameters are the local fields, which describe site-specific patterns of amino acid conservation, and the two-site couplings, which mirror the coevolution between pairs of sites.

View Article and Find Full Text PDF

On the effect of phylogenetic correlations in coevolution-based contact prediction in proteins.

Edwin Rodriguez Horta Martin Weigt

PLoS Comput Biol

May 2021

Coevolution-based contact prediction, either directly by coevolutionary couplings resulting from global statistical sequence models or using structural supervision and deep learning, has found widespread application in protein-structure prediction from sequence. However, one of the basic assumptions in global statistical modeling is that sequences form an at least approximately independent sample of an unknown probability distribution, which is to be learned from data. In the case of protein families, this assumption is obviously violated by phylogenetic relations between protein sequences.

View Article and Find Full Text PDF

Food sources for camptandriid crabs in an arid mangrove ecosystem of the Persian Gulf: a stable isotope approach.

Mohammad Reza Hemmati Mehdi Ghodrati Shojaei Ali Taheri Mirghaed Melika Mashhadi Farahani Maryam Weigt

Isotopes Environ Health Stud

October 2021

Crabs of the family Camptandriidae are the most dominant burrowing crabs inhabiting arid mangrove forests of the Persian Gulf. They play important roles in the structuring and functioning of mangrove ecosystems by modulating biogeochemical processes and cycling of nutrients, serving as important ecosystem engineers. We analysed stable carbon (C) and nitrogen (N) isotope values of three camptandriid crabs (, and ) and their potential food sources in the Hara Biosphere Reserve, northern Persian Gulf.

View Article and Find Full Text PDF

Rbm10 facilitates heterochromatin assembly via the Clr6 HDAC complex.

Martina Weigt Qingsong Gao Hyoju Ban Haijin He Guido Mastrobuoni

Epigenetics Chromatin

January 2021

Splicing factors have recently been shown to be involved in heterochromatin formation, but their role in controlling heterochromatin structure and function remains poorly understood. In this study, we identified a fission yeast homologue of human splicing factor RBM10, which has been linked to TARP syndrome. Overexpression of Rbm10 in fission yeast leads to strong global intron retention.

View Article and Find Full Text PDF

Aligning biological sequences by exploiting residue conservation and coevolution.

Anna Paola Muntoni Andrea Pagnani Martin Weigt Francesco Zamponi

Phys Rev E

December 2020

Sequences of nucleotides (for DNA and RNA) or amino acids (for proteins) are central objects in biology. Among the most important computational problems is that of sequence alignment, i.e.

View Article and Find Full Text PDF

FilterDCA: Interpretable supervised contact prediction using inter-domain coevolution.

Maureen Muscat Giancarlo Croce Edoardo Sarti Martin Weigt

PLoS Comput Biol

October 2020

Predicting three-dimensional protein structure and assembling protein complexes using sequence information belongs to the most prominent tasks in computational biology. Recently substantial progress has been obtained in the case of single proteins using a combination of unsupervised coevolutionary sequence analysis with structurally supervised deep learning. While reaching impressive accuracies in predicting residue-residue contacts, deep learning has a number of disadvantages.

View Article and Find Full Text PDF

An evolution-based model for designing chorismate mutase enzymes.

William P Russ Matteo Figliuzzi Christian Stocker Pierre Barrat-Charlaix Michael Socolich

Science

July 2020

The rational design of enzymes is an important goal for both fundamental and practical reasons. Here, we describe a process to learn the constraints for specifying proteins purely from evolutionary sequence data, design and build libraries of synthetic genes, and test them for activity in vivo using a quantitative complementation assay. For chorismate mutase, a key enzyme in the biosynthesis of aromatic amino acids, we demonstrate the design of natural-like catalytic function with substantial sequence diversity.

View Article and Find Full Text PDF