On Neural Networks Fitting, Compression, and Generalization Behavior via Information-Bottleneck-like Approaches.

Entropy (Basel)

Department of Electronic and Electrical Engineering, University College London, Gower St., London WC1E 6BT, UK.

Published: July 2023

AI Article Synopsis

  • The paper explores neural network learning dynamics, aiming to better understand how networks fit, compress, and generalize data.
  • It replaces mutual information measures, which are hard to estimate in high-dimensional spaces, with two more tractable quantities: the minimum mean-squared error of reconstructing the network input from an intermediate representation, and the cross-entropy of the class label given a representation.
  • Empirical results indicate that the approach captures network behavior during training and testing more reliably than classical information bottleneck methods, that fitting and compression phases appear regardless of the activation function, and that these phases tend to be more pronounced in settings that generalize better.

Article Abstract

It is well known that a neural network's learning process, along with its connections to fitting, compression, and generalization, is not yet well understood. In this paper, we propose a novel approach to capturing such neural network dynamics using information-bottleneck-type techniques, involving the replacement of mutual information measures (which are notoriously difficult to estimate in high-dimensional spaces) by other more tractable ones, including (1) the minimum mean-squared error associated with the reconstruction of the network input data from some intermediate network representation and (2) the cross-entropy associated with a certain class label given some network representation. We then conduct an empirical study to ascertain how different network models, network learning algorithms, and datasets may affect the learning dynamics. Our experiments show that our proposed approach appears to be more reliable than classical information bottleneck approaches in capturing network dynamics during both the training and testing phases. Our experiments also reveal that the fitting and compression phases exist regardless of the choice of activation function. Additionally, our findings suggest that model architectures, training algorithms, and datasets that lead to better generalization tend to exhibit more pronounced fitting and compression phases.
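
The two quantities named in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not specify which estimators the authors use, so the choices here are assumptions made only for illustration: an affine least-squares reconstruction stands in for the minimum mean-squared error (it upper-bounds the true MMSE, since the optimal estimator is generally nonlinear), and a softmax probe trained by gradient descent stands in for the cross-entropy of the label given the representation. The function names, the synthetic data, and the two toy "representations" are all hypothetical.

```python
import numpy as np


def linear_mmse_proxy(x, z):
    """MSE of the best affine reconstruction of x from z (upper bound on the MMSE)."""
    z1 = np.hstack([z, np.ones((z.shape[0], 1))])  # append a bias column
    w, *_ = np.linalg.lstsq(z1, x, rcond=None)     # least-squares affine map z -> x
    residual = x - z1 @ w
    return float(np.mean(np.sum(residual ** 2, axis=1)))


def _log_softmax(logits):
    """Numerically stable row-wise log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))


def cross_entropy_proxy(y, z, num_classes, steps=500, lr=0.1):
    """Cross-entropy (nats) of a softmax probe trained on z to predict y."""
    z1 = np.hstack([z, np.ones((z.shape[0], 1))])
    w = np.zeros((z1.shape[1], num_classes))
    onehot = np.eye(num_classes)[y]
    for _ in range(steps):
        p = np.exp(_log_softmax(z1 @ w))
        w -= lr * z1.T @ (p - onehot) / len(y)     # gradient of the mean cross-entropy
    log_p = _log_softmax(z1 @ w)
    return float(-np.mean(log_p[np.arange(len(y)), y]))


# Synthetic two-class data (hypothetical): only the first coordinate carries the label.
rng = np.random.default_rng(0)
n = 2000
y = rng.integers(0, 2, size=n)
x = rng.normal(size=(n, 5))
x[:, 0] += 1.5 * (2 * y - 1)

z_full = x          # toy representation that keeps everything about the input
z_comp = x[:, :1]   # toy "compressed" representation: label-relevant coordinate only

mse_full = linear_mmse_proxy(x, z_full)
mse_comp = linear_mmse_proxy(x, z_comp)
ce_full = cross_entropy_proxy(y, z_full, num_classes=2)
ce_comp = cross_entropy_proxy(y, z_comp, num_classes=2)

print(f"reconstruction MSE: full={mse_full:.3g}, compressed={mse_comp:.3g}")
print(f"label cross-entropy: full={ce_full:.3f}, compressed={ce_comp:.3f}")
```

In this toy setting the compressed representation reconstructs the input worse (higher MSE) while still predicting the label well (cross-entropy below log 2). Tracking the two quantities for each layer over training epochs would give curves analogous to an information-plane plot, under the assumptions stated above.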


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10377965
DOI: http://dx.doi.org/10.3390/e25071063

Publication Analysis

Top Keywords

fitting compression: 16
network: 8
neural network: 8
network learning: 8
network dynamics: 8
network representation: 8
algorithms datasets: 8
compression phases: 8
neural networks: 4
fitting: 4

Similar Publications

The plane running between two adjacent pulmonary segments consists of a very thin layer of connective tissue through which the pulmonary vein also runs. To perform an anatomically correct segmentectomy, this segmental plane needs to be divided. Before the operation, the locations of vessels and bronchi are confirmed by three-dimensional computed tomography.


To address collapse, deformation, and displacement in the concrete paving of roadway floors, this paper adds alkali-free accelerators to the concrete on both sides and, through mechanical analysis, single-factor experiments, orthogonal experiments, and polynomial fitting, determines the relevant concrete and accelerator parameters for sliding-form paving of roadway floors in terms of both paving material and size. The results show that the FSA-AF alkali-free liquid accelerator is more suitable for roadway floor paving than the J85 powder accelerator. When the FSA-AF accelerator dosage reaches 8%, the downward trend of the initial setting time curve flattens.


Introduction: Congenital vertebral malformations are common developmental abnormalities in screw-tailed brachycephalic dog breeds. Subsequent vertebral instability and/or vertebral canal stenosis caused by these malformations can lead to spinal cord compression manifesting in pain, paraparesis, ataxia and/or paralysis. Various methods for spinal stabilization are in common use.


Unusual Causes of Death Due to Constipation.

Am J Forensic Med Pathol

December 2024

Forensic Pathology Unit, Royal Darwin Hospital, Darwin, Northern Territory and College of Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia.

Constipation is found in individuals with intellectual disabilities, autism, and cerebral palsy. Although generally a benign condition, it may lead to life-threatening intestinal obstruction, with or without volvulus, or to stercoral ulceration with enteritis and/or perforation. Two unusual cases of lethal chronic constipation are reported to demonstrate other very rare fatal mechanisms that may occur.


The article is devoted to the rehabilitation stage of cochlear implantation in patients with inner ear abnormalities. It provides a detailed analysis of the audiological characteristics of such patients and draws conclusions about approaches to interpreting diagnostic data and fitting speech processors.

Material and Methods: The records of 80 patients with abnormalities of inner ear development were retrospectively studied, 10 of whom had an abnormal auditory nerve structure.
