Unreasonable effectiveness of learning neural networks: From accessible states and robust ensembles to basic algorithmic schemes.

Carlo Baldassi Christian Borgs Jennifer T Chayes Alessandro Ingrosso Carlo Lucibello Luca Saglietti Riccardo Zecchina

Proc Natl Acad Sci U S A

Department of Applied Science and Technology, Politecnico di Torino, I-10129 Torino, Italy.

Published: November 2016

In artificial neural networks, learning from data is a computationally demanding task in which a large number of connection weights are iteratively tuned through stochastic-gradient-based heuristic processes over a cost function. It is not well understood how learning occurs in these systems, in particular how they avoid getting trapped in configurations with poor computational performance. Here, we study the difficult case of networks with discrete weights, where the optimization landscape is very rough even for simple architectures, and provide theoretical and numerical evidence of the existence of rare-but extremely dense and accessible-regions of configurations in the network weight space. We define a measure, the robust ensemble (RE), which suppresses trapping by isolated configurations and amplifies the role of these dense regions. We analytically compute the RE in some exactly solvable models and also provide a general algorithmic scheme that is straightforward to implement: define a cost function given by a sum of a finite number of replicas of the original cost function, with a constraint centering the replicas around a driving assignment. To illustrate this, we derive several powerful algorithms, ranging from Markov Chains to message passing to gradient descent processes, where the algorithms target the robust dense states, resulting in substantial improvements in performance. The weak dependence on the number of precision bits of the weights leads us to conjecture that very similar reasoning applies to more conventional neural networks. Analogous algorithmic schemes can also be applied to other optimization problems.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5137727	PMC
http://dx.doi.org/10.1073/pnas.1608103113	DOI Listing

Publication Analysis

Top Keywords

neural networks

cost function

algorithmic schemes

unreasonable effectiveness

effectiveness learning

learning neural

networks

networks accessible

accessible states

states robust

Similar Publications

Lateralization of Neural Speech Discrimination at Birth Is a Predictor for Later Language Development.

Dev Sci

March 2025

Department of Pediatrics and Adolescent Medicine, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria.

Lisa Bartha-Doering Vito Giordano Sophie Mandl Silvia Benavides-Varela Anna Weiskopf

Newborns are able to neurally discriminate between speech and nonspeech right after birth. To date it remains unknown whether this early speech discrimination and the underlying neural language network is associated with later language development. Preterm-born children are an interesting cohort to investigate this relationship, as previous studies have shown that preterm-born neonates exhibit alterations of speech processing and have a greater risk of later language deficits.

View Article and Find Full Text PDF

Similar Publications

Reversing Cochlear Nucleus Maladaptive Plasticity via Customized Extracochlear Stimulation: A New Approach for Tinnitus Treatment.

Adv Sci (Weinh)

January 2025

ENT Institute and Department of Otolaryngology, Eye & ENT Hospital of Fudan University, Shanghai, 200031, China.

Min Chen Shuwen Fan Jiabao Mao Linhan Huang Nafisa Tursun

Tinnitus, a widespread condition affecting numerous individuals worldwide, remains a significant challenge due to limited effective therapeutic interventions. Intriguingly, patients using cochlear implants (CIs) have reported significant relief from tinnitus symptoms, although the underlying mechanisms remain unclear and intracochlear implantation risks cochlear damage and hearing loss. This study demonstrates that targeted intracochlear electrical stimulation (ES) in guinea pigs with noise-induced hearing loss reversed tinnitus-related maladaptive plasticity in the cochlear nucleus (CN), characterized by reduced auditory innervation, increased somatosensory innervation, and diminished inhibitory neural networks.

View Article and Find Full Text PDF

Similar Publications

A semi-supervised deep neuro-fuzzy iterative learning system for automatic segmentation of hippocampus brain MRI.

Math Biosci Eng

December 2024

Department of Electronics and Communication Engineering, Akshaya College of Engineering and Technology, Coimbatore, Tamil Nadu, India.

M Nisha T Kannan K Sivasankari

The hippocampus is a small, yet intricate seahorse-shaped tiny structure located deep within the brain's medial temporal lobe. It is a crucial component of the limbic system, which is responsible for regulating emotions, memory, and spatial navigation. This research focuses on automatic hippocampus segmentation from Magnetic Resonance (MR) images of a human head with high accuracy and fewer false positive and false negative rates.

View Article and Find Full Text PDF

Similar Publications

Research on bearing fault diagnosis based on a multimodal method.

Math Biosci Eng

December 2024

School of Information Engineering, Nantong Institute of Technology, Nantong 226002, Jiangsu, China.

Hao Chen Shengjie Li Xi Lu Qiong Zhang Jixining Zhu

As an essential component of mechanical systems, bearing fault diagnosis is crucial to ensure the safe operation of the equipment. However, vibration data from bearings often exhibit non-stationary and nonlinear features, which complicates fault diagnosis. To address this challenge, this paper introduces a novel multi-scale time-frequency and statistical features fusion model (MTSF-FM).

View Article and Find Full Text PDF

Similar Publications

Enhanced Pneumonia Detection in Chest X-Rays Using Hybrid Convolutional and Vision Transformer Networks.

Curr Med Imaging

January 2025

School of Life Sciences, Tiangong University, Tianjin 300387, China.

Benzorgat Mustapha Yatong Zhou Chunyan Shan Zhitao Xiao

Objective: The objective of this research is to enhance pneumonia detection in chest X-rays by leveraging a novel hybrid deep learning model that combines Convolutional Neural Networks (CNNs) with modified Swin Transformer blocks. This study aims to significantly improve diagnostic accuracy, reduce misclassifications, and provide a robust, deployable solution for underdeveloped regions where access to conventional diagnostics and treatment is limited.

Methods: The study developed a hybrid model architecture integrating CNNs with modified Swin Transformer blocks to work seamlessly within the same model.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!