Publications by Emidio Capriotti

Publications by authors named "Emidio Capriotti"

Page 1 of 3

DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology.

Omar Abdelghani Attafi Damiano Clementel Konstantinos Kyritsis Emidio Capriotti Gavin Farrell

Gigascience

January 2024

Article Synopsis

View Article and Find Full Text PDF

Assessing predictions on fitness effects of missense variants in HMBS in CAGI6.

Jing Zhang Lisa Kinch Panagiotis Katsonis Olivier Lichtarge Milind Jagota Emidio Capriotti

Hum Genet

August 2024

Article Synopsis

- This paper evaluates predictions for the "HMBS" challenge from the 2021 Critical Assessment of Genome Interpretation, focusing on how well participants predicted the effects of missense variants in the HMBS gene on yeast growth.
- Despite using various algorithms, most predictors showed similar performance with correlation coefficients around 0.3, though some top predictors had a slightly better median correlation of ≥ 0.34 with experimental results.
- Predictors were moderately effective in distinguishing between harmful and harmless variants, but overall accuracy remained low compared to experimental controls, highlighting a need for significant improvements in prediction methods, especially for variants in specific regions like the insertion loop.

View Article and Find Full Text PDF

The complex impact of cancer-related missense mutations on the stability and on the biophysical and biochemical properties of MAPK1 and MAPK3 somatic variants.

Maria Petrosino Leonore Novak Alessandra Pasquo Paola Turina Emidio Capriotti

Hum Genomics

October 2023

Mitogen-activated protein kinases 1 and 3 (MAPK1 and MAPK3), also called extracellular regulated kinases (ERK2 and ERK1), are serine/threonine kinase activated downstream by the Ras/Raf/MEK/ERK signal transduction cascade that regulates a variety of cellular processes. A dysregulation of MAPK cascade is frequently associated to missense mutations on its protein components and may be related to many pathologies, including cancer. In this study we selected from COSMIC database a set of MAPK1 and MAPK3 somatic variants found in cancer tissues carrying missense mutations distributed all over the MAPK1 and MAPK3 sequences.

View Article and Find Full Text PDF

K-Pro: Kinetics Data on Proteins and Mutants.

Paola Turina Piero Fariselli Emidio Capriotti

J Mol Biol

October 2023

The study of protein folding plays a crucial role in improving our understanding of protein function and of the relationship between genetics and phenotypes. In particular, understanding the thermodynamics and kinetics of the folding process is important for uncovering the mechanisms behind human disorders caused by protein misfolding. To address this issue, it is essential to collect and curate experimental kinetic and thermodynamic data on protein folding.

View Article and Find Full Text PDF

Identification of Driver Epistatic Gene Pairs Combining Germline and Somatic Mutations in Cancer.

Jairo Rocha Jaume Sastre Emilia Amengual-Cladera Jessica Hernandez-Rodriguez Victor Asensio-Landa Emidio Capriotti

Int J Mol Sci

May 2023

Cancer arises from the complex interplay of various factors. Traditionally, the identification of driver genes focuses primarily on the analysis of somatic mutations. We describe a new method for the detection of driver gene pairs based on an epistasis analysis that considers both germline and somatic variations.

View Article and Find Full Text PDF

PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants.

Emidio Capriotti Piero Fariselli

Nucleic Acids Res

July 2023

One of the primary challenges in human genetics is determining the functional impact of single nucleotide variants (SNVs) and insertion and deletions (InDels), whether coding or noncoding. In the past, methods have been created to detect disease-related single amino acid changes, but only some can assess the influence of noncoding variations. CADD is the most commonly used and advanced algorithm for predicting the diverse effects of genome variations.

View Article and Find Full Text PDF

Resources and tools for rare disease variant interpretation.

Luana Licata Allegra Via Paola Turina Giulia Babbi Silvia Benevenuta Emidio Capriotti

Front Mol Biosci

May 2023

Collectively, rare genetic disorders affect a substantial portion of the world's population. In most cases, those affected face difficulties in receiving a clinical diagnosis and genetic characterization. The understanding of the molecular mechanisms of these diseases and the development of therapeutic treatments for patients are also challenging.

View Article and Find Full Text PDF

Challenges in predicting stabilizing variations: An exploration.

Silvia Benevenuta Giovanni Birolo Tiziana Sanavia Emidio Capriotti Piero Fariselli

Front Mol Biosci

January 2023

An open challenge of computational and experimental biology is understanding the impact of non-synonymous DNA variations on protein function and, subsequently, human health. The effects of these variants on protein stability can be measured as the difference in the free energy of unfolding (ΔΔ) between the mutated structure of the protein and its wild-type form. Throughout the years, bioinformaticians have developed a wide variety of tools and approaches to predict the ΔΔ.

View Article and Find Full Text PDF

DDGun: an untrained predictor of protein stability changes upon amino acid variants.

Ludovica Montanucci Emidio Capriotti Giovanni Birolo Silvia Benevenuta Corrado Pancotti

Nucleic Acids Res

July 2022

Estimating the functional effect of single amino acid variants in proteins is fundamental for predicting the change in the thermodynamic stability, measured as the difference in the Gibbs free energy of unfolding, between the wild-type and the variant protein (ΔΔG). Here, we present the web-server of the DDGun method, which was previously developed for the ΔΔG prediction upon amino acid variants. DDGun is an untrained method based on basic features derived from evolutionary information.

View Article and Find Full Text PDF

Turning Failures into Applications: The Problem of Protein ΔΔG Prediction.

Rita Casadio Castrense Savojardo Piero Fariselli Emidio Capriotti Pier Luigi Martelli

Methods Mol Biol

May 2022

After nearly two decades of research in the field of computational methods based on machine learning and knowledge-based potentials for ΔG and ΔΔG prediction upon variations, we now realize that all the approaches are poorly performing when tested on specific cases and that there is large space for improvement. Why this is so? Is it wrong the underlying assumption that experimental protein thermodynamics in solution reflects the thermodynamics of a single protein? Both machine learning and knowledge-based computational methods are rigorous and we know the solid theory behind. We are now in a critical situation, which suggests that predictions of protein instability upon variation should be considered with care.

View Article and Find Full Text PDF

Evaluating the relevance of sequence conservation in the prediction of pathogenic missense variants.

Emidio Capriotti Piero Fariselli

Hum Genet

October 2022

Evolutionary information is the primary tool for detecting functional conservation in nucleic acid and protein. This information has been extensively used to predict structure, interactions and functions in macromolecules. Pathogenicity prediction models rely on multiple sequence alignment information at different levels.

View Article and Find Full Text PDF

Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset.

Corrado Pancotti Silvia Benevenuta Giovanni Birolo Virginia Alberini Valeria Repetto Emidio Capriotti

Brief Bioinform

March 2022

Predicting the difference in thermodynamic stability between protein variants is crucial for protein design and understanding the genotype-phenotype relationships. So far, several computational tools have been created to address this task. Nevertheless, most of them have been trained or optimized on the same and 'all' available data, making a fair comparison unfeasible.

View Article and Find Full Text PDF

Network-based strategies for protein characterization.

Alessandra Merlotti Giulia Menichetti Piero Fariselli Emidio Capriotti Daniel Remondini

Adv Protein Chem Struct Biol

September 2021

Protein structure characterization is fundamental to understand protein properties, such as folding process and protein resistance to thermal stress, up to unveiling organism pathologies (e.g., prion disease).

View Article and Find Full Text PDF

A Deep-Learning Sequence-Based Method to Predict Protein Stability Changes Upon Genetic Variations.

Corrado Pancotti Silvia Benevenuta Valeria Repetto Giovanni Birolo Emidio Capriotti

Genes (Basel)

June 2021

Several studies have linked disruptions of protein stability and its normal functions to disease. Therefore, during the last few decades, many tools have been developed to predict the free energy changes upon protein residue variations. Most of these methods require both sequence and structure information to obtain reliable predictions.

View Article and Find Full Text PDF

Analysis and Interpretation of the Impact of Missense Variants in Cancer.

Maria Petrosino Leonore Novak Alessandra Pasquo Roberta Chiaraluce Paola Turina Emidio Capriotti

Int J Mol Sci

May 2021

Large scale genome sequencing allowed the identification of a massive number of genetic variations, whose impact on human health is still unknown. In this review we analyze, by an in silico-based strategy, the impact of missense variants on cancer-related genes, whose effect on protein stability and function was experimentally determined. We collected a set of 164 variants from 11 proteins to analyze the impact of missense mutations at structural and functional levels, and to assess the performance of state-of-the-art methods (FoldX and Meta-SNP) for predicting protein stability change and pathogenicity.

View Article and Find Full Text PDF

ThermoScan: Semi-automatic Identification of Protein Stability Data From PubMed.

Paola Turina Piero Fariselli Emidio Capriotti

Front Mol Biosci

March 2021

During the last years, the increasing number of DNA sequencing and protein mutagenesis studies has generated a large amount of variation data published in the biomedical literature. The collection of such data has been essential for the development and assessment of tools predicting the impact of protein variants at functional and structural levels. Nevertheless, the collection of manually curated data from literature is a highly time consuming and costly process that requires domain experts.

View Article and Find Full Text PDF

Protein Stability Perturbation Contributes to the Loss of Function in Haploinsufficient Genes.

Giovanni Birolo Silvia Benevenuta Piero Fariselli Emidio Capriotti Elisa Giorgio

Front Mol Biosci

February 2021

Missense variants are among the most studied genome modifications as disease biomarkers. It has been shown that the "perturbation" of the protein stability upon a missense variant (in terms of absolute ΔΔG value, i.e.

View Article and Find Full Text PDF

Calibrating variant-scoring methods for clinical decision making.

Silvia Benevenuta Emidio Capriotti Piero Fariselli

Bioinformatics

April 2021

Summary: Identifying pathogenic variants and annotating them is a major challenge in human genetics, especially for the non-coding ones. Several tools have been developed and used to predict the functional effect of genetic variants. However, the calibration assessment of the predictions has received little attention.

View Article and Find Full Text PDF

Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine.

Tiziana Sanavia Giovanni Birolo Ludovica Montanucci Paola Turina Emidio Capriotti

Comput Struct Biotechnol J

July 2020

Protein stability predictions are becoming essential in medicine to develop novel immunotherapeutic agents and for drug discovery. Despite the large number of computational approaches for predicting the protein stability upon mutation, there are still critical unsolved problems: 1) the limited number of thermodynamic measurements for proteins provided by current databases; 2) the large intrinsic variability of ΔΔG values due to different experimental conditions; 3) biases in the development of predictive methods caused by ignoring the anti-symmetry of ΔΔG values between mutant and native protein forms; 4) over-optimistic prediction performance, due to sequence similarity between proteins used in training and test datasets. Here, we review these issues, highlighting new challenges required to improve current tools and to achieve more reliable predictions.

View Article and Find Full Text PDF

VarI-COSI 2018: a forum for research advances in variant interpretation and diagnostics.

Yana Bromberg Emidio Capriotti Hannah Carter

BMC Genomics

July 2019

View Article and Find Full Text PDF