Publications by Carlo Baldassi | LitMetric

Publications by authors named "Carlo Baldassi"

Page 1 of 1

Typical and atypical solutions in nonconvex neural networks with discrete and continuous weights.

Carlo Baldassi Enrico M Malatesta Gabriele Perugini Riccardo Zecchina

Phys Rev E

August 2023

We study the binary and continuous negative-margin perceptrons as simple nonconvex neural network models learning random rules and associations. We analyze the geometry of the landscape of solutions in both models and find important similarities and differences. Both models exhibit subdominant minimizers which are extremely flat and wide.

View Article and Find Full Text PDF

Learning through atypical phase transitions in overparameterized neural networks.

Carlo Baldassi Clarissa Lauditi Enrico M Malatesta Rosalba Pacelli Gabriele Perugini

Phys Rev E

July 2022

Current deep neural networks are highly overparameterized (up to billions of connection weights) and nonlinear. Yet they can fit data almost perfectly through variants of gradient descent algorithms and achieve unexpected levels of prediction accuracy without overfitting. These are formidable results that defy predictions of statistical learning and pose conceptual challenges for nonconvex optimization.

View Article and Find Full Text PDF

Unveiling the Structure of Wide Flat Minima in Neural Networks.

Carlo Baldassi Clarissa Lauditi Enrico M Malatesta Gabriele Perugini Riccardo Zecchina

Phys Rev Lett

December 2021

The success of deep learning has revealed the application potential of neural networks across the sciences and opened up fundamental theoretical problems. In particular, the fact that learning algorithms based on simple variants of gradient methods are able to find near-optimal minima of highly nonconvex loss functions is an unexpected feature of neural networks. Moreover, such algorithms are able to fit the data even in the presence of noise, and yet they have excellent predictive capabilities.

View Article and Find Full Text PDF

Shaping the learning landscape in neural networks around wide flat minima.

Carlo Baldassi Fabrizio Pittorino Riccardo Zecchina

Proc Natl Acad Sci U S A

January 2020

Learning in deep neural networks takes place by minimizing a nonconvex high-dimensional loss function, typically by a stochastic gradient descent (SGD) strategy. The learning process is observed to be able to find good minimizers without getting stuck in local critical points and such minimizers are often satisfactory at avoiding overfitting. How these 2 features can be kept under control in nonlinear devices composed of millions of tunable connections is a profound and far-reaching open question.

View Article and Find Full Text PDF

Properties of the Geometry of Solutions and Capacity of Multilayer Neural Networks with Rectified Linear Unit Activations.

Carlo Baldassi Enrico M Malatesta Riccardo Zecchina

Phys Rev Lett

October 2019

Rectified linear units (ReLUs) have become the main model for the neural units in current deep learning systems. This choice was originally suggested as a way to compensate for the so-called vanishing gradient problem which can undercut stochastic gradient descent learning in networks composed of multiple layers. Here we provide analytical results on the effects of ReLUs on the capacity and on the geometrical landscape of the solution space in two-layer neural networks with either binary or real-valued weights.

View Article and Find Full Text PDF

Predicting Interacting Protein Pairs by Coevolutionary Paralog Matching.

Thomas Gueudré Carlo Baldassi Andrea Pagnani Martin Weigt

Methods Mol Biol

January 2021

Even if we know that two families of homologous proteins interact, we do not necessarily know, which specific proteins interact inside each species. The reason is that most families contain paralogs, i.e.

View Article and Find Full Text PDF

From statistical inference to a differential learning rule for stochastic neural networks.

Luca Saglietti Federica Gerace Alessandro Ingrosso Carlo Baldassi Riccardo Zecchina

Interface Focus

December 2018

Stochastic neural networks are a prototypical computational device able to build a probabilistic representation of an ensemble of external stimuli. Building on the relationship between inference and learning, we derive a synaptic plasticity rule that relies only on delayed activity correlations, and that shows a number of remarkable features. Our (DCM) rule satisfies some basic requirements for biological feasibility: finite and noisy afferent signals, Dale's principle and asymmetry of synaptic connections, locality of the weight update computations.

View Article and Find Full Text PDF

Role of Synaptic Stochasticity in Training Low-Precision Neural Networks.

Carlo Baldassi Federica Gerace Hilbert J Kappen Carlo Lucibello Luca Saglietti

Phys Rev Lett

June 2018

Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performance, while typical solutions are isolated and hard to find. Binary solutions of the standard perceptron problem are obtained from a simple gradient descent procedure on a set of real values parametrizing a probability distribution over the binary synapses.

View Article and Find Full Text PDF

Efficiency of quantum vs. classical annealing in nonconvex learning problems.

Carlo Baldassi Riccardo Zecchina

Proc Natl Acad Sci U S A

February 2018

Quantum annealers aim at solving nonconvex optimization problems by exploiting cooperative tunneling effects to escape local minima. The underlying idea consists of designing a classical energy function whose ground states are the sought optimal solutions of the original optimization problem and add a controllable quantum transverse field to generate tunneling processes. A key challenge is to identify classes of nonconvex optimization problems for which quantum annealing remains efficient while thermal annealing fails.

View Article and Find Full Text PDF

RNAs competing for microRNAs mutually influence their fluctuations in a highly non-linear microRNA-dependent manner in single cells.

Carla Bosia Francesco Sgrò Laura Conti Carlo Baldassi Davide Brusa

Genome Biol

February 2017

Background: Distinct RNA species may compete for binding to microRNAs (miRNAs). This competition creates an indirect interaction between miRNA targets, which behave as miRNA sponges and eventually influence each other's expression levels. Theoretical predictions suggest that not only the mean expression levels of targets but also the fluctuations around the means are coupled through miRNAs.

View Article and Find Full Text PDF

Unreasonable effectiveness of learning neural networks: From accessible states and robust ensembles to basic algorithmic schemes.

Carlo Baldassi Christian Borgs Jennifer T Chayes Alessandro Ingrosso Carlo Lucibello

Proc Natl Acad Sci U S A

November 2016

In artificial neural networks, learning from data is a computationally demanding task in which a large number of connection weights are iteratively tuned through stochastic-gradient-based heuristic processes over a cost function. It is not well understood how learning occurs in these systems, in particular how they avoid getting trapped in configurations with poor computational performance. Here, we study the difficult case of networks with discrete weights, where the optimization landscape is very rough even for simple architectures, and provide theoretical and numerical evidence of the existence of rare-but extremely dense and accessible-regions of configurations in the network weight space.

View Article and Find Full Text PDF

Simultaneous identification of specifically interacting paralogs and interprotein contacts by direct coupling analysis.

Thomas Gueudré Carlo Baldassi Marco Zamparo Martin Weigt Andrea Pagnani

Proc Natl Acad Sci U S A

October 2016

Understanding protein-protein interactions is central to our understanding of almost all complex biological processes. Computational tools exploiting rapidly growing genomic databases to characterize protein-protein interactions are urgently needed. Such methods should connect multiple scales from evolutionary conserved interactions between families of homologous proteins, over the identification of specifically interacting proteins in the case of multiple paralogs inside a species, down to the prediction of residues being in physical contact across interaction interfaces.

View Article and Find Full Text PDF

Learning may need only a few bits of synaptic precision.

Carlo Baldassi Federica Gerace Carlo Lucibello Luca Saglietti Riccardo Zecchina

Phys Rev E

May 2016

Learning in neural networks poses peculiar challenges when using discretized rather then continuous synaptic states. The choice of discrete synapses is motivated by biological reasoning and experiments, and possibly by hardware implementation considerations as well. In this paper we extend a previous large deviations analysis which unveiled the existence of peculiar dense regions in the space of synaptic states which accounts for the possibility of learning efficiently in networks with binary synapses.

View Article and Find Full Text PDF

Subdominant Dense Clusters Allow for Simple Learning and High Computational Performance in Neural Networks with Discrete Synapses.

Carlo Baldassi Alessandro Ingrosso Carlo Lucibello Luca Saglietti Riccardo Zecchina

Phys Rev Lett

September 2015

We show that discrete synaptic weights can be efficiently used for learning in large scale neural systems, and lead to unanticipated computational performance. We focus on the representative case of learning random patterns with binary synapses in single layer networks. The standard statistical analysis shows that this problem is exponentially dominated by isolated solutions that are extremely hard to find algorithmically.

View Article and Find Full Text PDF

A Three-Threshold Learning Rule Approaches the Maximal Capacity of Recurrent Neural Networks.

Alireza Alemi Carlo Baldassi Nicolas Brunel Riccardo Zecchina

PLoS Comput Biol

August 2015

Understanding the theoretical foundations of how memories are encoded and retrieved in neural populations is a central challenge in neuroscience. A popular theoretical scenario for modeling memory function is the attractor neural network scenario, whose prototype is the Hopfield model. The model simplicity and the locality of the synaptic update rules come at the cost of a poor storage capacity, compared with the capacity achieved with perceptron learning algorithms.

View Article and Find Full Text PDF

Fast and accurate multivariate Gaussian modeling of protein families: predicting residue contacts and protein-interaction partners.

Carlo Baldassi Marco Zamparo Christoph Feinauer Andrea Procaccini Riccardo Zecchina

PLoS One

January 2015

In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.

View Article and Find Full Text PDF

Sharing information to reconstruct patient-specific pathways in heterogeneous diseases.

Anthony Gitter Alfredo Braunstein Andrea Pagnani Carlo Baldassi Christian Borgs

Pac Symp Biocomput

August 2014

Advances in experimental techniques resulted in abundant genomic, transcriptomic, epigenomic, and proteomic data that have the potential to reveal critical drivers of human diseases. Complementary algorithmic developments enable researchers to map these data onto protein-protein interaction networks and infer which signaling pathways are perturbed by a disease. Despite this progress, integrating data across different biological samples or patients remains a substantial challenge because samples from the same disease can be extremely heterogeneous.

View Article and Find Full Text PDF

Shape similarity, better than semantic membership, accounts for the structure of visual object representations in a population of monkey inferotemporal neurons.

Carlo Baldassi Alireza Alemi-Neissi Marino Pagan James J Dicarlo Riccardo Zecchina

PLoS Comput Biol

February 2014

The anterior inferotemporal cortex (IT) is the highest stage along the hierarchy of visual areas that, in primates, processes visual objects. Although several lines of evidence suggest that IT primarily represents visual shape information, some recent studies have argued that neuronal ensembles in IT code the semantic membership of visual objects (i.e.

View Article and Find Full Text PDF

Efficient supervised learning in networks with binary synapses.

Carlo Baldassi Alfredo Braunstein Nicolas Brunel Riccardo Zecchina

Proc Natl Acad Sci U S A

June 2007

Recent experimental studies indicate that synaptic changes induced by neuronal activity are discrete jumps between a small number of stable states. Learning in systems with discrete synapses is known to be a computationally hard problem. Here, we study a neurobiologically plausible on-line learning algorithm that derives from belief propagation algorithms.

View Article and Find Full Text PDF