Recently, the underlying mechanism of successful deep learning (DL) was presented using a quantitative method that measures the quality of a single filter in each layer of a DL model, in particular VGG-16 trained on CIFAR-10. This method demonstrates that each filter identifies a small cluster of possible output labels, with the remaining labels outside the cluster contributing noise. This feature is progressively sharpened with each layer, yielding an enhanced signal-to-noise ratio (SNR) and, in turn, higher accuracy of the DL network.
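As a rough illustration of how such a per-filter SNR could be quantified, the sketch below ranks the class-averaged activations of one filter, treats the top-responding labels as the identified cluster, and measures signal against the spread of the remaining labels. This is a hedged sketch under assumed definitions (cluster size, activation averaging), not the paper's exact estimator.

```python
# Hedged sketch (not the paper's exact procedure): quantify how sharply a single
# filter singles out a small cluster of output labels from its class-averaged
# activations. `class_means` is a length-10 vector (one entry per CIFAR-10 class);
# `cluster_size` is a free, illustrative parameter.
import numpy as np

def filter_snr(class_means, cluster_size=2):
    """Signal-to-noise ratio of one filter over the output labels."""
    class_means = np.asarray(class_means, dtype=float)
    order = np.argsort(class_means)[::-1]          # labels sorted by response
    cluster = order[:cluster_size]                 # labels the filter "identifies"
    noise = order[cluster_size:]                   # all remaining labels
    signal = class_means[cluster].mean() - class_means[noise].mean()
    return signal / (class_means[noise].std() + 1e-12)

# Toy usage: a filter that responds strongly to two labels and weakly to the rest.
rng = np.random.default_rng(0)
acts = np.concatenate([[5.0, 4.5], rng.normal(1.0, 0.3, 8)])
print(filter_snr(acts))   # large value -> the filter cleanly isolates a label cluster
```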
The dreaming Hopfield model constitutes a generalization of the Hebbian paradigm for neural networks that is able to perform on-line learning when "awake" and also to account for off-line "sleeping" mechanisms. The latter have been shown to enhance storage in such a way that, in the long sleep-time limit, this model can reach the maximal storage capacity achievable by networks equipped with symmetric pairwise interactions. In this paper, we inspect the minimal amount of information that must be supplied to such a network to guarantee successful generalization, and we test it both on random synthetic and on standard structured datasets (i.
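For orientation, the sketch below builds a coupling matrix with a sleep-time parameter following the commonly cited dreaming kernel J(t) = (1/N) xi^T (1+t)(I + tC)^{-1} xi, which reduces to the Hebbian rule at t = 0 and approaches the projector (maximal-capacity) rule as t grows, and then runs simple zero-temperature retrieval. The exact protocol of the paper may differ; all parameter values here are illustrative.

```python
# Hedged sketch of "dreaming" couplings and retrieval; not the paper's code.
import numpy as np

def dreaming_couplings(patterns, t):
    """patterns: (P, N) matrix of +/-1 patterns; t: sleep time."""
    P, N = patterns.shape
    C = patterns @ patterns.T / N                  # (P, P) pattern correlation matrix
    kernel = (1.0 + t) * np.linalg.inv(np.eye(P) + t * C)
    J = patterns.T @ kernel @ patterns / N         # (N, N) symmetric couplings
    np.fill_diagonal(J, 0.0)                       # no self-interaction
    return J

def retrieve(J, state, steps=20):
    """Zero-temperature synchronous dynamics."""
    for _ in range(steps):
        state = np.sign(J @ state)
    return state

# Toy usage: store random patterns, corrupt one, and let the network clean it up.
rng = np.random.default_rng(1)
xi = rng.choice([-1.0, 1.0], size=(20, 200))       # P=20 patterns, N=200 neurons
J = dreaming_couplings(xi, t=10.0)
noisy = xi[0] * rng.choice([1, -1], p=[0.9, 0.1], size=200)
print(np.mean(retrieve(J, noisy) == xi[0]))        # overlap with the stored pattern
```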
Learning classification tasks of [Formula: see text] inputs typically consist of [Formula: see text]) max-pooling (MP) operators along the entire feedforward deep architecture. Here we show, using the CIFAR-10 database, that pooling decisions adjacent to the last convolutional layer significantly enhance accuracies. In particular, average accuracies of the advanced-VGG with [Formula: see text] layers (A-VGGm) architectures are 0.
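The sketch below only illustrates the architectural idea of concentrating the pooling decision adjacent to the last convolutional layer, rather than pooling after every block; it is not the paper's A-VGGm model, and the layer widths and pooling size are arbitrary placeholders.

```python
# Hedged sketch (PyTorch): keep spatial resolution through the convolutional
# stack and apply max-pooling only next to the last convolutional layer.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    return nn.Sequential(nn.Conv2d(c_in, c_out, 3, padding=1),
                         nn.BatchNorm2d(c_out), nn.ReLU())

class PoolNearOutput(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        # No intermediate pooling: 32x32 resolution is kept until the last conv.
        self.features = nn.Sequential(conv_block(3, 64), conv_block(64, 128),
                                      conv_block(128, 256), conv_block(256, 256))
        # Pooling decision placed adjacent to the last convolutional layer.
        self.pool = nn.MaxPool2d(kernel_size=8)          # 32x32 -> 4x4 for CIFAR-10
        self.classifier = nn.Linear(256 * 4 * 4, num_classes)

    def forward(self, x):
        x = self.pool(self.features(x))
        return self.classifier(x.flatten(1))

# Toy usage on a CIFAR-10-shaped batch.
print(PoolNearOutput()(torch.randn(2, 3, 32, 32)).shape)   # torch.Size([2, 10])
```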
The realization of complex classification tasks requires training of deep learning (DL) architectures consisting of tens or even hundreds of convolutional and fully connected hidden layers, which is far from the reality of the human brain. According to the DL rationale, the first convolutional layer reveals localized patterns in the input, and the following layers reveal increasingly large-scale patterns, until a class of inputs is reliably characterized. Here, we demonstrate that with a fixed ratio between the depths of the first and second convolutional layers, the error rates of the generalized shallow LeNet architecture, consisting of only five layers, decay as a power law with the number of filters in the first convolutional layer.
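A power-law decay of this kind, error(d) ≈ A·d^(-ρ) with d the number of filters in the first convolutional layer, is typically extracted by a linear fit in log-log coordinates. The data points in the sketch below are made up purely to illustrate the fit and are not the paper's results.

```python
# Hedged sketch: fit error(d) ~ A * d**(-rho) on hypothetical (d, error) pairs.
import numpy as np

filters = np.array([4, 8, 16, 32, 64, 128])
errors = np.array([0.42, 0.33, 0.26, 0.21, 0.17, 0.13])   # illustrative error rates

# Linear fit in log-log space: log(error) = log(A) - rho * log(d)
slope, intercept = np.polyfit(np.log(filters), np.log(errors), 1)
rho, A = -slope, np.exp(intercept)
print(f"error(d) ~ {A:.2f} * d^(-{rho:.2f})")
```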
Advanced deep learning architectures, consisting of tens of fully connected and convolutional hidden layers and currently extended to hundreds, are far from their biological realization. Their biologically implausible dynamics rely on changing weights in a non-local manner via the backpropagation technique, as the number of routes between an output unit and a weight is typically large. Here, a 3-layer tree architecture inspired by experimentally based dendritic tree adaptations is developed and applied to the offline and online learning of the CIFAR-10 database.
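The sketch below captures only the tree idea behind such an architecture: the input is split into non-overlapping branches, each branch feeds its own dendritic unit, and every weight reaches the output through exactly one route, unlike the many routes of a fully connected network. It is not the paper's architecture or training protocol, and the branch sizes are arbitrary.

```python
# Hedged sketch of a tree-structured forward pass (illustrative sizes and weights).
import numpy as np

rng = np.random.default_rng(2)
n_branches, branch_size, n_classes = 8, 48, 10       # 8 * 48 = 384 inputs (arbitrary)

W_branch = rng.normal(0, 0.1, (n_branches, branch_size))   # one weight vector per branch
W_out = rng.normal(0, 0.1, (n_classes, n_branches))         # branch -> output weights

def tree_forward(x):
    """x: flat input of length n_branches * branch_size."""
    branches = x.reshape(n_branches, branch_size)            # non-overlapping receptive fields
    dendrites = np.tanh(np.sum(W_branch * branches, axis=1))  # one scalar per branch
    return W_out @ dendrites                                   # class scores

print(tree_forward(rng.normal(size=n_branches * branch_size)).shape)   # (10,)
```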