Publications by Alessandro Laio | LitMetric

Publications by authors named "Alessandro Laio"

Coarse-Grained Molecular Dynamics with Normalizing Flows.

Samuel Tamagnone Alessandro Laio Marylou Gabrié

J Chem Theory Comput

September 2024

We propose a sampling algorithm relying on a collective variable (CV) of midsize dimension modeled by a normalizing flow and using nonequilibrium dynamics to propose full configurational moves from the proposition of a refreshed value of the CV made by the flow. The algorithm takes the form of a Markov chain with nonlocal updates, allowing jumps through energy barriers across metastable states. The flow is trained throughout the algorithm to reproduce the free energy landscape of the CV.

Intrinsic dimension as a multi-scale summary statistics in network modeling.

Iuri Macocco Antonietta Mira Alessandro Laio

Sci Rep

August 2024

Complex networks are powerful mathematical tools for modelling and understanding the behaviour of highly interconnected systems. However, existing methods for analyzing these networks focus on local properties (e.g.

Maximally informative feature selection using Information Imbalance: Application to COVID-19 severity prediction.

Romina Wild Emanuela Sozio Riccardo G Margiotta Fabiana Dellai Angela Acquasanta Alessandro Laio

Sci Rep

May 2024

Clinical databases typically include, for each patient, many heterogeneous features, for example blood exams, the clinical history before the onset of the disease, the evolution of the symptoms, the results of imaging exams, and many others. We here propose to exploit a recently developed statistical approach, the Information Imbalance, to compare different subsets of patient features and automatically select the set of features that is maximally informative for a given clinical purpose, especially in minority classes. We adapt the Information Imbalance approach to work in a clinical framework, where patient features are often categorical and are generally available only for a fraction of the patients.

Robust inference of causality in high-dimensional dynamical processes from the Information Imbalance of distance ranks.

Vittorio Del Tatto Gianfranco Fortunato Domenica Bueti Alessandro Laio

Proc Natl Acad Sci U S A

May 2024

We introduce an approach which allows detecting causal relationships between variables for which the time evolution is available. Causality is assessed by a variational scheme based on the Information Imbalance of distance ranks, a statistical test capable of inferring the relative information content of different distance measures. We test whether the predictability of a putative driven system Y can be improved by incorporating information from a potential driver system X, without explicitly modeling the underlying dynamics and without the need to compute probability densities of the dynamic variables.

Improving acute stroke assessment in non-enhanced computed tomography: automated tool for early ischemic lesion volume detection.

Mara Sabina Bernardi Alex Rodriguez Paola Caruso Giovanni Furlanis Mariana Ridolfi Alessandro Laio

Neurol Sci

July 2024

Background and Objectives: ASPECTS is a widely used marker to identify early stroke signs on non-enhanced computed tomography (NECT), yet it presents interindividual variability and may be hard for non-experts to use. We introduce an algorithm capable of automatically estimating the NECT volumetric extension of early acute ischemic changes in 3D space. We compared the power of this marker with that of ASPECTS evaluated by experienced practitioners in predicting the clinical outcome.

Solvation thermodynamics from cavity shapes of amino acids.

Khatereh Azizi Alessandro Laio Ali Hassanali

PNAS Nexus

August 2023

According to common physical chemistry wisdom, the solvent cavities hosting a solute are tightly sewn around it, practically coinciding with its van der Waals surface. Solvation entropy is primarily determined by the surface and the volume of the cavity while enthalpy is determined by the solute-solvent interaction. In this work, we challenge this picture, demonstrating by molecular dynamics simulations that the cavities surrounding the 20 amino acids deviate significantly from the molecular surface.

Do Machine-Learning Atomic Descriptors and Order Parameters Tell the Same Story? The Case of Liquid Water.

Edward Danquah Donkor Alessandro Laio Ali Hassanali

J Chem Theory Comput

July 2023

Machine-learning (ML) has become a key workhorse in molecular simulations. Building an ML model in this context involves encoding the information on chemical environments using local atomic descriptors. In this work, we focus on the Smooth Overlap of Atomic Positions (SOAP) and their application in studying the properties of liquid water both in the bulk and at the hydrophobic air-water interface.

Intrinsic Dimension Estimation for Discrete Metrics.

Iuri Macocco Aldo Glielmo Jacopo Grilli Alessandro Laio

Phys Rev Lett

February 2023

Real-world datasets characterized by discrete features are ubiquitous: from categorical surveys to clinical questionnaires, from unweighted networks to DNA sequences. Nevertheless, the most common unsupervised dimensional reduction methods are designed for continuous spaces, and their use for discrete spaces can lead to errors and biases. In this Letter we introduce an algorithm to infer the intrinsic dimension (ID) of datasets embedded in discrete spaces.

Ranking the information content of distance measures.

Aldo Glielmo Claudio Zeni Bingqing Cheng Gábor Csányi Alessandro Laio

PNAS Nexus

May 2022

Real-world data typically contain a large number of features that are often heterogeneous in nature, relevance, and also units of measure. When assessing the similarity between data points, one can build various distance measures using subsets of these features. Finding a small set of features that still retains sufficient information about the dataset is important for the successful application of many statistical learning approaches.

The generalized ratios intrinsic dimension estimator.

Francesco Denti Diego Doimo Alessandro Laio Antonietta Mira

Sci Rep

November 2022

Modern datasets are characterized by numerous features related by complex dependency structures. To deal with these data, dimensionality reduction techniques are essential. Many of these techniques rely on the concept of intrinsic dimension (id), a measure of the complexity of the dataset.

DADApy: Distance-based analysis of data-manifolds in Python.

Aldo Glielmo Iuri Macocco Diego Doimo Matteo Carli Claudio Zeni Alessandro Laio

Patterns (N Y)

October 2022

DADApy is a Python software package for analyzing and characterizing high-dimensional data manifolds. It provides methods for estimating the intrinsic dimension and the probability density, for performing density-based clustering, and for comparing different distance metrics. We review the main functionalities of the package and exemplify its usage in a synthetic dataset and in a real-world application.

DPCfam: Unsupervised protein family classification by Density Peak Clustering of large sequence datasets.

Elena Tea Russo Federico Barone Alex Bateman Stefano Cozzini Marco Punta Alessandro Laio

PLoS Comput Biol

October 2022

Proteins that are known only at a sequence level outnumber those with an experimental characterization by orders of magnitude. Classifying protein regions (domains) into homologous families can generate testable functional hypotheses for yet unannotated sequences. Existing domain family resources typically use at least some degree of manual curation: they grow slowly over time and leave a large fraction of the protein sequence space unclassified.

Unfolding and identification of membrane proteins in situ.

Nicola Galvanetto Zhongjie Ye Arin Marchesi Simone Mortal Sourav Maity Alessandro Laio

Elife

September 2022

Single-molecule force spectroscopy (SMFS) uses the cantilever tip of an atomic force microscope (AFM) to apply a force able to unfold a single protein. The resulting force-distance curve encodes the unfolding pathway, and its analysis makes it possible to characterize the folded domains. SMFS has mostly been used to study the unfolding of purified proteins, in solution or reconstituted in a lipid bilayer.

Multiple-Allele MHC Class II Epitope Engineering by a Molecular Dynamics-Based Evolution Protocol.

Rodrigo Ochoa Victoria Alves Santos Lunardelli Daniela Santoro Rosa Alessandro Laio Pilar Cossio

Front Immunol

May 2022

Epitopes that bind simultaneously to all human alleles of Major Histocompatibility Complex class II (MHC II) are considered one of the key factors for the development of improved vaccines and cancer immunotherapies. To engineer MHC II multiple-allele binders, we developed a protocol called PanMHC-PARCE, based on the unsupervised optimization of the epitope sequence by single-point mutations, parallel explicit-solvent molecular dynamics simulations and scoring of the MHC II-epitope complexes. The key idea is accepting mutations that not only improve the affinity but also reduce the affinity gap between the alleles.

Computational Evolution Protocol for Peptide Design.

Rodrigo Ochoa Miguel A Soler Ivan Gladich Anna Battisti Nikola Minovski Alessandro Laio

Methods Mol Biol

March 2022

Computational peptide design is useful for therapeutics, diagnostics, and vaccine development. To select the most promising peptide candidates, the key is describing accurately the peptide-target interactions at the molecular level. We here review a computational peptide design protocol whose key feature is the use of all-atom explicit solvent molecular dynamics for describing the different peptide-target complexes explored during the optimization.

Dynamical landscape and multistability of a climate model.

Georgios Margazoglou Tobias Grafke Alessandro Laio Valerio Lucarini

Proc Math Phys Eng Sci

June 2021

We apply two independent data analysis methodologies to locate stable climate states in an intermediate complexity climate model and analyse their interplay. First, drawing from the theory of quasi-potentials, and viewing the state space as an energy landscape with valleys and mountain ridges, we infer the relative likelihood of the identified multistable climate states and investigate the most likely transition trajectories as well as the expected transition times between them. Second, harnessing techniques from data science, and specifically manifold learning, we characterize the data landscape of the simulation output to find climate states and basin boundaries within a fully agnostic and unsupervised framework.

When kinetics plays strange tricks.

Alessandro Laio

Proc Natl Acad Sci U S A

January 2022

Model Folded Hydrophobic Polymers Reside in Highly Branched Voids.

Khatereh Azizi Alessandro Laio Ali Hassanali

J Phys Chem Lett

January 2022

By using advanced data analysis techniques, we characterize the shape of the voids surrounding model polymers of different sizes in water, observed in molecular dynamics simulations. We find that even when the model polymer is folded, the voids are extremely rough, with branches that can extend to over 1 nm away from the polymer. Water molecules in contact with the void retain close-to-bulk properties in terms of local structure.

Unsupervised Learning Methods for Molecular Simulation Data.

Aldo Glielmo Brooke E Husic Alex Rodriguez Cecilia Clementi Frank Noé Alessandro Laio

Chem Rev

August 2021

Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models.

Density Peak clustering of protein sequences associated to a Pfam clan reveals clear similarities and interesting differences with respect to manual family annotation.

Elena Tea Russo Alessandro Laio Marco Punta

BMC Bioinformatics

March 2021

Background: The identification of protein families is of outstanding practical importance for in silico protein annotation and is at the basis of several bioinformatic resources. Pfam is possibly the best-known protein family database, built over many years of work by domain experts with extensive use of manual curation. This approach is generally very accurate, but it is quite time-consuming and may suffer from a bias generated by the hand-curation itself, which is often guided by the available experimental evidence.

A Rosetta-based protein design protocol converging to natural sequences.

Giulia Sormani Zander Harteveld Stéphane Rosset Bruno Correia Alessandro Laio

J Chem Phys

February 2021

Computational protein design has emerged as a powerful tool capable of identifying sequences compatible with pre-defined protein structures. The sequence design protocols, implemented in the Rosetta suite, have become widely used in the protein engineering community. To understand the strengths and limitations of the Rosetta design framework, we tested several design protocols on two distinct folds (SH3-1 and Ubiquitin).

Candidate Binding Sites for Allosteric Inhibition of the SARS-CoV-2 Main Protease from the Analysis of Large-Scale Molecular Dynamics Simulations.

Matteo Carli Giulia Sormani Alex Rodriguez Alessandro Laio

J Phys Chem Lett

January 2021

We analyzed a 100 μs MD trajectory of the SARS-CoV-2 main protease by a non-parametric data analysis approach which allows characterizing a free energy landscape as a simultaneous function of hundreds of variables. We identified several conformations that, when visited by the dynamics, are stable for several hundred nanoseconds. We explicitly characterize and describe these metastable states.

Data segmentation based on the local intrinsic dimension.

Michele Allegra Elena Facco Francesco Denti Alessandro Laio Antonietta Mira

Sci Rep

October 2020

One of the founding paradigms of machine learning is that a small number of variables is often sufficient to describe high-dimensional data. The minimum number of variables required is called the intrinsic dimension (ID) of the data. Contrary to common intuition, there are cases where the ID varies within the same data set.

Automatic classification of single-molecule force spectroscopy traces from heterogeneous samples.

Nina I Ilieva Nicola Galvanetto Michele Allegra Marco Brucale Alessandro Laio

Bioinformatics

December 2020

Motivation: Single-molecule force spectroscopy (SMFS) experiments pose the challenge of analysing protein unfolding data (traces) coming from preparations with heterogeneous composition (e.g. where different proteins are present in the sample).

Brain network dynamics during spontaneous strategy shifts and incremental task optimization.

Michele Allegra Shima Seyed-Allaei Nicolas W Schuck Daniele Amati Alessandro Laio

Neuroimage

August 2020

Article Synopsis

Scientists studied how our brains help us get better at tasks by using either a familiar strategy or finding a new one.
They found that certain brain areas help improve the current strategy we’re using, while other areas are important for discovering new ones.
Their research shows that the brain is always active, helping us keep track of different ways to solve problems instead of just shutting down during tasks.
